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, (57) Abstract 

The present invention provides polynucleotide sequences of the genome of Streptococcus pneumoniae, polypeptide sequences encoded 
; by the polynucleotide sequences, corresponding polynucleotides and polypeptides, vectors and hosts comprising the polynucleotides, and 
assays and other uses thereof. The present invention further provides polynucleotide and polypeptide sequence information stored on 
computer readable media, and computer-based systems and methods which facilitate its use. 
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Streptococcus pneumoniae Polynucleotides and Sequences 
FIELD OF THE INVENTION 

5 The present invention relates to the field of molecular biology. In 

particular, it relates to, among other things, nucleotide sequences of Streptococcus 
pneumoniae, contigs, ORFs; fragments, probes. - " primers "and related 
polynucleotides thereof, peptides and polypeptides encoded by the sequences, and 
uses of the polynucleotides and sequences thereof, such as in fermentation. 
10 polypeptide production, assays and pharmaceutical development, among others. 

BACKGROUND OF THE INVENTION 

Streptococcus pneumoniae has been one of the most extensively studied 

15 microorganisms since its first isolation in 1881. It was the object of many 
investigations that led to important scientific discoveries. In 1928, Griffith 
observed that when heat-killed encapsulated pneumococci and live strains 
constitutively lacking any capsule were concomitantly injected into mice, the 
nonencapsulated could be converted into encapsulated pneumococci with the same 

20 capsular type as the heat-killed strain. Years later, the nature of this "transforming 
principle," or carrier of genetic information, was shown to be DNA. (Avery, O.T., 
etaLJ. Exp. Med., 79:137-157 (1944)). 

In spite of the vast number of publications on S. pneumoniae many 
questions about its virulence are still unanswered, and this pathogen remains a 

25 major causative agent of serious human disease, especially community-acquired 
pneumonia. (Johnston, R.B., et a/., Rev, Infect. Dis. 7i(Suppl. 6):S509-517 
(1991)). In addition, in developing countries, the pneumococcus is responsible for 
the death of a large number of children under the age of 5 years from pneumococcal 
pneumonia. The incidence of pneumococcal disease is highest in infants under 2 

30 years of age and in people over 60 years of age. Pneumococci are the second most 
frequent cause (after Haemophilus influenzae type b) of bacterial meningitis and 
otitis media in children. With the recent introduction of conjugate vaccines for H . 
influenzae type b, pneumococcal meningitis is likely to become increasingly 
prominent. S. pneumoniae is the most important etiologic agent of community- 
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acquired pneumonia in adults and is the second most common cause of bacterial 
meningitis behind Neisseria meningitidis. 

The antibiotic generally prescribed to treat S. pneumoniae is 
benzylpeniciilin, although resistance to this and to other antibiotics is found 

5 occasionally. Pneumococcal resistance to penicillin results from mutations in its 
penicillin-binding proteins. In uncomplicated pneumococcal pneumonia caused by 
a sensitive strain, treatment with penicillin is usually successful unless started too 
late. Erythromycin or clindamycin can be used to treat pneumonia in patients 
hypersensitive to penicillin, but resistant strains to these drugs exist. Broad 

10 spectrum antibiotics (e.g., the tetracyclines) may also be effective, although 
tetracycline-resistant strains are not rare. In spite of the availability of antibiotics, 
the mortality of pneumococcal bacteremia in the last four decades has remained 
stable between 25 and 29%. (Gillespie, S.H., et at., J. Med. Microbiol. 25:237- 
248 (1989). 

15 S. pneumoniae is carried in the upper respiratory tract by many healthy 

individuals. It has been suggested that attachment of pneumococci is mediated by a 
disaccharide receptor on fibronectin. present on human pharyngeal epithelial cells. 
(Anderson, B.J., etai.. J. Immunol. 742:2464-2468 (1989). The mechanisms by 
which pneumococci translocate from the nasopharynx to the lung, thereby causing 

20 pneumonia, or migrate to the blood, giving rise to bacteremia or septicemia, are 
poorly understood. (Johnston, R.B., et aL Rev. Infect. Dis. 73(Suppl. 6):S509- 
517 (1991). 

Various proteins have been suggested to be involved in the pathogenicity of 
S. pneumoniae, however, only a few of them have actually been confirmed as 

25 virulence factors. Pneumococci produce an IgAl protease that might interfere with 
host defense at mucosal surfaces. (Kornfield, S.J., et al.. Rev. Inf. Dis. 3:521- 
534 (1981). S. pneumoniae also produces neuraminidase, an enzyme that may 
facilitate attachment to epithelial ceils by cleaving sialic acid from the host 
glycolipids and gangliosides. Partially purified neuraminidase was observed to 

30 induce meningitis-like symptoms in mice; however, the reliability of this finding 
has been questioned because the neuraminidase preparations used were probably 
contaminated with cell wall products. Other pneumococcal proteins besides 
neuraminidase are involved in the adhesion of pneumococci to epithelial and 
endothelial cells. These pneumococcal proteins have as yet not been identified. 

35 Recently, Cundell et. aL, reported that peptide permeases can modulate 
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pneumococcal adherence to epithelial and endothelial cells. It was. however, 
unclear whether these permeases function directly as adhesions or whether they 
enhance adherence by modulating the expression of pneumococcal adhesions. 
(DeVelasco. E.A., et a/.. Micro. Rev. 59:591-603 (1995). A better understanding 
5 of the virulence factors determining its pathogenicity will need to be developed to 
cope with the devastating effects of pneumococcal disease in humans. 

" Ironically .'despite the prominent role of 5. pneumoniae in the discovery of 
DNA, little is known about the molecular genetics of the organism. The 5. 
pneumoniae genome consists of one circular, covalently closed, double-stranded 

10 DNA and a collection of so-called variable accessory elements, such as prophages, 
plasmids, transposons and the like. Most physical characteristics and almost all of 
the genes of 5. pneumoniae are unknown. Among the few that have been 
identified, most have not been physically mapped or characterized in detail. Only a 
few genes of this organism have been sequenced. (See, for instance current 

1 5 versions of GENBANK and other nucleic acid databases, and references that relate 
to the genome of S. pneumoniae such as those set out elsewhere herein.) 

It is clear that the etiology of diseases mediated or exacerbated by S. 
pneumoniae, infection involves the programmed expression of S. pneumoniae 
genes, and that characterizing the genes and their patterns of expression would add 

20 dramatically to our understanding of the organism and its host interactions. 
Knowledge of S. pneumoniae genes and genomic organization would improve our 
understanding of disease etiology and lead to improved and new ways of 
preventing, ameliorating, arresting and reversing diseases. Moreover, 
characterized genes and genomic fragments of 5. pneumoniae would provide 

25 reagents for. among other things, detecting, characterizing and controlling 5. 
pneumoniae infections. There is a need to characterize the genome of S. 
pneumoniae and for polynucleotides of this organism. 
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SUMMARY OF T HE INVENTION 

The present invention is based on the sequencing of fragments of the 
5 Streptococcus pneumoniae genome. The primary nucleotide sequences which were 
generated are provided in SEQ ID NOS: 1 -39 1 . 

The present invention provides the nucleotide sequence of several hundred 
contigsof the Streptococcus pneumoniae genome, which are listed in tables below 
and set out in the Sequence Listing submitted herewith, and representative 

i() fragments thereof, in a form which can be readily used, analyzed, and interpreted 
by a skilled artisan. In one embodiment, the present invention is provided as 
contiguous strings of primary sequence information corresponding to the 
nucleotide sequences depicted in SEQ ID NOS: 1-391. 

The present invention further provides nucleotide sequences which are at 

1 5 least 95% identical to the nucleotide sequences of SEQ ID NOS: I -39 1 . 

The nucleotide sequence of SEQ ED NOS: 1-39 l f a representative fragment 
thereof, or a nucleotide sequence which is at least 95% identical to the nucleotide 
sequence of SEQ ID NOS: I -39 1 may be provided in a variety of mediums to 
facilitate its use. In one application of this embodiment, the sequences of the 

20 present invention are recorded on computer readable media. Such media includes, 
but is not limited to: magnetic storage media, such as floppy discs, hard disc 
storage medium, and magnetic tape; optical storage media such as CD-ROM; 
electrical storage media such as RAM and ROM; and hybrids of these categories 
such as magnetic/optical storage media. 

-5 The present invention further provides systems, particularly computer- 

based systems which contain the sequence information herein described stored in a 
data storage means. Such systems are designed to identify commercially important 
fragments of the Streptococcus pneumoniae genome. 

Another embodiment of the present invention is directed to fragments of the 

30 Streptococcus pneumoniae genome having particular structural or functional 
attributes. Such fragments of the Streptococcus pneumoniae genome of the present 
invention include, but are not limited to, fragments which encode peptides, 
hereinafter referred to as open reading frames or ORFs, fragments which modulate 
the expression of an operably linked ORF, hereinafter referred to as expression 

35 modulating fragments or EMFs. and fragments which can be used to diagnose the 
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presence of Streptococcus pneumoniae in a sample, hereinafter referred to as 
diagnostic fragments or DFs. 

Each of the ORFs in fragments of the Streptococcus pneumoniae genome 
disclosed in Tables 1-3, and the EMFs found 5* to the ORFs, can be used in 
5 numerous ways as polynucleotide reagents. For instance, the sequences can be 
used as diagnostic probes or amplification primers for detecting or determining the 
presence of a specific microbe in a sample, to selectively control gene expression in 
a host and in the production of polypeptides, such as polypeptides encoded by 
ORFs of the present invention, particular those polypeptides that have a 

to pharmacological activity. 

The present invention further includes recombinant constructs comprising 
one or more fragments of the Streptococcus pneumoniae genome of the present 
invention. The recombinant constructs of the present invention comprise vectors, 
such as a plasmid or viral vector, into which a fragment of the Streptococcus 

1 5 pneumoniae has been inserted. 

The present invention further provides host cells containing any of the 
isolated fragments of the Streptococcus pneumoniae genome of the present 
invention. The host cells can be a higher eukaryotic host cell, such as a mammalian 
cell, a lower eukaryotic cell, such as a yeast cell, or a procaryotic cell such as a 

20 bacterial cell. 

The present invention is further directed to isolated polypeptides and 
proteins encoded by ORFs of the present invention. A variety of methods, well 
known to those of skill in the art. routinely may be utilized to obtain any of the 
polypeptides and proteins of the present invention. For instance, polypeptides and 

25 proteins of the present invention having relatively short, simple amino acid 
sequences readily can be synthesized using commercially available automated 
peptide synthesizers. Polypeptides and proteins of the present invention also may 
be purified from bacterial cells which naturally produce the protein. Yet another 
alternative is to purify polypeptide and proteins of the present invention from cells 

30 which have been altered to express them. 

The invention further provides methods of obtaining homologs of the 
fragments of the Streptococcus pneumoniae genome of the present invention and 
homoiogs of the proteins encoded by the ORFs of the present invention. 
Specifically, by using the nucleotide and amino acid sequences disclosed herein as 
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a probe or as primers, and techniques such ns PCR cloning and colony/plaque 
hybridization, one skilled in the an can obtain homologs. 

The invention further provides antibodies which selectively bind 
polypeptides and proteins of the present invention. Such antibodies include both 

5 monoclonal and polyclonal antibodies. 

The invention further provides hybridomas which produce the above- 
described antibodies. A hybridoma is an immortalized cell line which is capable of 
secreting a specific monoclonal antibody. 

The present invention further provides methods of identifying test samples 

10 derived from cells which express one of the ORFs of the present invention, or a 
homolog thereof. Such methods comprise incubating a test sample with one or 
more of the antibodies of the present invention, or one or more of the DFs of the 
present invention, under conditions which allow a skilled artisan to determine if the 
sample contains the ORF or product produced therefrom. 

15 In another embodiment of the present invention, kits are provided which 

contain the necessary reagents to carry out the above-described assays. 

Specifically, the invention provides a compartmentalized kit to receive, in 
close confinement, one or more containers which comprises: (a) a first container 
composing one of the antibodies, or one of the DFs of the present invention; and 

20 (b) one or more other containers comprising one or more of the following: wash 
reagents, reagents capable of detecting presence of bound antibodies or hybridized 
DFs. 

Using the isolated proteins of the present invention, the present invention 
further provides methods of obtaining and identifying agents capable of binding to 

25 a polypeptide or protein encoded by one of the ORFs of the present invention. 
Specifically, such agents include, as further described below, antibodies, peptides, 
carbohydrates, pharmaceutical agents and the like. Such methods comprise steps 
of: (a) contacting an agent with an isolated protein encoded by one of the ORFs of 
the present invention: and (b) determining whether the agent binds to said protein. 

30 The present genomic sequences of Streptococcus pneumoniae will be of 

great value to all laboratories working with this organism and for a variety of 
commercial purposes. Many fragments of the Streptococcus pneumoniae genome 
will be immediately identified by similarity searches against GenBank or protein 
databases and will be of immediate value to Streptococcus pneumoniae researchers 
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and for immediate commercial value for the production of proteins or to control 
gene expression. 

The methodology and technology for elucidating extensive genomic 
sequences of bacterial and other genomes has and will greatly enhance the ability to 
5 analyze and understand chromosomal organization. In particular, sequenced 
contigs and genomes will provide the models for developing tools for the analysis 
of chromosome structure and~functionT including the ability to identify genes within 
large segments of genomic DNA, the structure, position, and spacing of regulatory 
elements, the identification of genes with potential industrial applications, and the 
10 ability to do comparative genomic and molecular phylogeny. 

DESCRIPTION OF THE FIOIIRFS 

FIGURE 1 is a block diagram of a computer system (102) that can be 
15 used to implement computer-based systems of present invention. 

FIGURE 2 is a schematic diagram depicting the data flow and computer 
programs used to collect, assemble, edit and annotate the contigs of the 
Streptococcus pneumoniae genome of the present invention. Both Macintosh and 

20 Unix platforms are used to handle the AB 373 and 377 sequence data files, largely 
<x± described in Kerlavage et a/.. Proceedings of the Twenty-Sixth Annual Hawaii 
International Conference on System Sciences, 585, IEEE Computer Society Press. 
Washington D.C. (1993). Factura (AB) is a Macintosh program designed for 
automatic vector sequence removal and end-trimming of sequence files. The 

25 program Loadis runs on a Macintosh platform and parses the feature data extracted 
from the sequence files by Factura to the Unix based Streptococcus pneumoniae 
relational database. Assembly of contigs (and whole genome sequences) is 
accomplished by retrieving a specific set of sequence files and their associated 
features using Extrseq, a Unix utility for retrieving sequences from an SQL 

30 database. The resulting sequence file is processed by seq_filter to trim portions of 
the sequences with more than 2% ambiguous nucleotides. The sequence files were 
assembled using TIGR Assembler, an assembly engine designed at The Institute 
for Genomic Research ( TIGR ) for rapid and accurate assembly of thousands of 
sequence fragments. The collection of contigs generated by the assembly step is 

35 loaded into the database with the lassie program. Identification of open reading 
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frames (ORFs) is accomplished by processing contigs with zorf or GenMark. The 
ORFs are searched against S. pneumoniae sequences from GenBank and against all 
protein sequences using the BLASTN and BLASTP programs, described in 
Altschul et ai 9 J. MoL Biol. 215:. 403-410 (1990)). Results of the ORF 
5 determination and similarity searching steps were loaded into the database. As 
described below, some results of the determination and the searches are set out in 
Tables 1-3. 

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

.10 

The present invention is based on the sequencing of fragments of the 
Streptococcus pneumoniae genome and analysis of the sequences. The primary 
nucleotide sequences generated by sequencing the fragments are provided in SEQ 
ED NOS: 1-391. (As used herein, the "primary sequence" refers to the nucleotide 

15 sequence represented by the IUPAC nomenclature system.) 

In addition to the aforementioned Streptococcus pneumoniae polynucleotide 
and polynucleotide sequences, the present invention provides the nucleotide 
sequences of SEQ ID NOS: 1-391, or representative fragments thereof, in a form 
which can be readily used, analyzed, and interpreted by a skilled artisan. 

20 As used herein, a " representative fragment of the nucleotide sequence 

depicted in SEQ ID NGS:i-39i" refers to any portion of the SEQ ED NOS:l-391 
which is not presently represented within a publicly available database. Preferred 
representative fragments of the present invention are Streptococcus pneumoniae 
open reading frames ( ORFs ), expression modulating fragment ( EMFs ) and 

25 fragments which can be used to diagnose the presence of Streptococcus 
pneumoniae in sample ( DFs ). A non-limiting identification of preferred 
representative fragments is provided in Tables 1-3. As ^discussed in detail below, 
the information provided in SEQ ID NOS: 1-391 and in Tables 1-3 together with 
routine cloning, synthesis, sequencing and assay methods will enable those skilled 

30 in the art to clone and sequence all "representative fragments" of interest, including 
open reading frames encoding a large variety of Streptococcus pneumoniae 
proteins. 

While the presently disclosed sequences of SEQ ID NOS: 1-39 1 are highly 
accurate, sequencing techniques are not perfect and, in relatively rare instances, 
35 further investigation of. a fragment or sequence of the invention may reveal a 
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nucleotide sequence error present in a nucleotide sequence disclosed in SEQ ID 
NOS:l-39L However, once the present invention is made available {i.e., once the 
information in SEQ ED NOS: 1-391 and Tables 1-3 has been made available), 
resolving a rare sequencing error in SEQ ID NOS: 1-391 will be well within the 
5 skill of the art. The present disclosure makes available sufficient sequence 
information to allow any of the described contigs or portions thereof to be obtained 
readily by "straightforward^ 

such polynucleotide may proceed in like manner using manual and automated 
sequencing methods which are employed ubiquitous in the art. Nucleotide 

10 sequence editing software is publicly available. For example. Applied Biosy stem's 
(AB) AutoAssembler can be used as an aid during visual inspection of nucleotide 
sequences. By employing such routine techniques potential errors readily may be 
identified and the correct sequence then may be ascertained by targeting further 
sequencing effort, also of a routine nature, to the region containing the potential 

15 error. 

Even if all of the very rare sequencing errors in SEQ ID NOS: 1-391 were 
corrected, the resulting nucleotide sequences would still be at least 95% identical, 
nearly all would be at least 99% identical, and the great majority would be at least 
99.9% identical to the nucleotide sequences of SEQ ID NOS: 1 -39 1 . 

20 As discussed elsewhere herein, polynucleotides of the present invention 

readily may be obtained by routine application of well known and standard 
procedures for cloning and sequencing DNA. Detailed methods for obtaining 
libraries and for sequencing are provided below, for instance. A wide variety of 
Streptococcus pneumoniae strains that can be used to prepare S. pneumoniae 

25 genomic DNA for cloning and for obtaining polynucleotides of the present 
invention are available to the public from recognized depository institutions, such 
as the American Type Culture Collection ( ATCC ). While the present invention is 
enabled by the sequences and other information herein disclosed, the S. 
pneumoniae strain that provided the DNA of the present Sequence Listing, Strain 

30 7/87 14.8.91, has been deposited in the ATCC, as a convenience to those of skill 
in the art. As a further convenience, a library of 5. pneumoniae genomic DNA. 
derived from the same strain, also has been deposited in the ATCC. The S. 
pneumoniae strain was deposited on October 10, 1996, and was given Deposit No. 
55840, and the cDNA library was deposited on October 1 1, 1996 and was given 

35 Deposit No. 97755. The genomic fragments in the library are 15 to 20 kb 



WO 98/18931 PCT/US97/19588 

10 



fragments generated by partial Sau3Al digestion and they are inserted into the 
BamHI site in the well-known lambda-derived vector lambda DASH II (Stratagene, 
La Jolla, CA). The provision of the deposits is not a waiver of any rights of the 
inventors or their assignees in the present subject matter. 

5 The nucleotide sequences of the genomes from different strains of 

Streptococcus pneumoniae differ somewhat. However, the nucleotide sequences 
of the genomes of all Streptococcus pneumoniae strains will be at least 95% 
identical, in corresponding part, to the nucleotide sequences provided in SEQ ED 
NOS: 1-391. Nearly all will be at least 99% identical and the great majority will be 

10 99.9% identical. 

Thus, the present invention further provides nucleotide sequences which 
are at least 95%, preferably 99% and most preferably 99.9% identical to the 
nucleotide sequences of SEQ ID NOS: 1-391, in a form which can be readily used, 
analyzed and interpreted by the skilled artisan. 

15 Methods for determining whether a nucleotide sequence is at least 95%, at 

least 99% or at least 99.9% identical to the nucleotide sequences of SEQ ID 
NOS: 1-391 are routine and readily available to the skilled artisan. For example, the 
well known fasta algorithm described in Pearson and Lipman, Proc. Natl. Acad. 
ScL USA 85: 2444 (1988) can be used to generate the percent identity of nucleotide 

:o sequences. The BLASTN program also can be used to generate an identity score 
or poiynuaeoudes compared to one another. 



25 
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COMPUTER RELATED EMBODIMENTS 

The nucleotide sequences provided in SEQ ID NOS: 1-391, a representative 
fragment thereof, or a nucleotide sequence at least 95%, preferably at least 99% 
and most preferably at least 99.9% identical to a polynucleotide sequence of SEQ 
ID NOS: 1-391 may be "provided" in a variety of mediums to facilitate use thereof. 
As used herein, provided refers to a manufacture, other than an isolated nucleic 
acid molecule, which contains a nucleotide sequence of the present invention; i.e.. 
a nucleotide sequence provided in SEQ ID NOS: 1-391, a representative fragment 
thereof, or a nucleotide sequence at least 95%. preferably at least 99% and most 
preferably at least 99.9% identical to a polynucleotide of SEQ ID NOS: 1-391. 
Such a manufacture provides a large portion of the Streptococcus pneumoniae 
genome and parts thereof a Streptococcus pneumoniae open reading frame 
(ORF)) in a form which allows a skilled artisan to examine the manufacture using 
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means not directly applicable to examining the Streptococcus pneumoniae genome 
or a subset thereof as it exists in nature or in purified form. 

In one application of this embodiment, a nucleotide sequence of the present 
invention can be recorded on computer readable media. As used herein, "computer 

5 readable media" refers to any medium which can-be read and accessed directly by a 
computer. Such media include, but are not limited to: magnetic storage media, 
such as floppy discs' hard disc storage mediumTand magnetic 7 taperopticai Ttorage 
media such as CD- ROM; electrical storage media such as RAM and ROM; and 
hybrids of these categories, such as magnetic/optical storage media. A skilled 

in artisan can readily appreciate how any of the presently known computer readable 
mediums can be used to create a manufacture comprising computer readable 
medium having recorded thereon a nucleotide sequence of the present invention. 
Likewise, it will be clear to those of skill how additional computer readable media 
that may be developed also can be used to create analogous manufactures having 

1 5 recorded thereon a nucleotide sequence of the present invention. 

As used herein, "recorded" refers to a process for storing information on 
computer readable medium. A skilled artisan can readily adopt any of the presently 
know methods for recording information on computer readable medium to generate 
manufactures comprising the nucleotide sequence information of the present 

20 invention. A variety of data storage structures are available to a skilled artisan 
for creating a computer readable medium having recorded thereon a nucleotide 
sequence of the present invention. The choice of the data storage structure will 
generally be based on the means chosen to access the stored information. In 
addition, a variety of data processor programs and formats can be used to store the 

25 nucleotide sequence information of the present invention on computer readable 
medium. The sequence information can be represented in a word processing text 
file, formatted in commercially- available software such as WordPerfect and 
Microsoft Word, or represented in the form of an ASCII file, stored in a database 
application, such as DB2, Sybase. Oracle, or the like. A skilled artisan can readily 

30 adapt any number of data-processor structuring formats {e.g.. text file or database) 
in order to obtain computer readable medium having recorded thereon the 
nucleotide sequence information of the present invention. 

Computer software is publicly available which allows a skilled artisan to 
access sequence information provided in a computer readable medium. Thus, by 

35 providing in computer readable form the nucleotide sequences of SEQ ID NOS:l- 
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391, a representative fragment thereof, or a nucleotide sequence at least 95%, 
preferably at least 99% and most preferably at least 99.9% identical to a sequence 
of SEQ ID NOS: 1-391 the present invention enables the skilled artisan routinely to 
access the provided sequence information for a wide variety of purposes. 
5 The examples which follow demonstrate how software which implements 

the BLAST (Altschul et a/., 7. Mol Biol. 2/5:403-410 (1990)) and BLAZE 
(Bnitlag etal. t Comp, Chem. 77:203-207 (1993)) search algorithms on a Sybase 
system was used to identify open reading frames (ORFs) within the Streptococcus 
pneumoniae genome which contain homology to ORFs or proteins from both 

10 Streptococcus pneumoniae and from other organisms. Among the ORFs discussed 
herein are protein encoding fragments of the Streptococcus pneumoniae genome 
useful in producing commercially important proteins, such as enzymes used in 
fermentation reactions and in the production of commercially useful metabolites. 

The present invention further provides systems, particularly computer- 

15 based systems, which contain the sequence information described herein. Such 
systems are designed to identify, among other things, commercially important 
fragments of the Streptococcus pneumoniae genome. 

As used herein, "a computer-based system" refers to the hardware means, 
software means, and data storage means used to analyze the nucleotide sequence 

20 information of the present invention. The minimum hardware means of the 
computer-based systems of the present invention comprises a central processing 
unit (CPU), input means, output means, and data storage means. A skilled artisan 
can readily appreciate that any one of the currently available computer-based 
systems are suitable for use in the present invention. 

25 As stated above, the computer-based systems of the present invention 

comprise a data storage means having stored therein a nucleotide sequence of the 
present invention and the necessary hardware means and software means for 
supporting and implementing a search means. 

As used herein, 'data storage means" refers to memory which can store 

30 nucleotide sequence information of the present invention, or a memory access 
means which can access manufactures having recorded thereon the nucleotide 
sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which arc 
implemented on the computer-based system to compare a target sequence or target 

35 structural motif with the sequence information stored within the data storage 
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means. Search means are used to identify fragments or regions of the present 
genomic sequences which match a particular target sequence or target motif. A 
variety of known algorithms are disclosed publicly and a variety of commercially 
available software for conducting search means are and can be used in the 
5 computer-based systems of the present invention. Examples of such software 
includes, but is not limited to, MacPattem (EMBL), BLASTN and BLASTX 
(NCBIA). A skilled artisan can readily recognize that "any" one of the available 
algorithms or implementing software packages for conducting homology searches 
can be adapted for use in the present computer-based systems. 

10 As used herein, a "target sequence" can be any DNA or amino acid 

sequence of six or more nucleotides or two or more amino acids. A skilled artisan 
can readily recognize that the longer a target sequence is, the less likely a target 
sequence will be present as a random occurrence in the database. The most 
preferred sequence length of a target sequence is from about 10 to 100 amino acids 

15 or from about 30 to 300 nucleotide residues. However, it is well recognized that 
searches for commercially important fragments, such as sequence fragments 
involved in gene expression and protein processing, may be of shorter length. 

As used herein, "a target structural motif,'* or "target motif," refers to any 
rationally selected sequence or combination of sequences in which the sequence(s) 

20 are chosen based on a three-dimensional configuration which is formed upon the 
folding of the target motif. There are a variety of target motifs known in the art. 
Protein target motifs include, but are not limited to. enzymic active sites and signal 
sequences. Nucleic acid target motifs include, but are not limited to, promoter 
sequences, hairpin structures and inducible expression elements (protein binding 

25 sequences). 

A variety of structural formats for the input and output means can be used 
to input and output the information in the computer-based systems of the present 
invention. A preferred format for an output means ranks fragments of the 
Streptococcus pneumoniae genomic sequences possessing varying degrees of 

30 homology to the target sequence or target motif. Such presentation provides a 
skilled artisan with a ranking of sequences which contain various amounts of the 
target sequence or target motif and identifies the degree of homology contained in 
the identified fragment. 

A variety of comparing means can be used to compare a target sequence or 

35 target motif with the data storage means to identify sequence fragments of the 
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Streptococcus pneumoniae genome. In the present examples, implementing 
software which implement the BLAST and BLAZE algorithms, described in 
Altschul et a/., 7. Moi Biol. 215: 403-410 (1990), is used to identify open reading 
frames within the Streptococcus pneumoniae genome. A skilled artisan can readily 
5 recognize that any one of the publicly available homology search programs can be 
used as the search means for the computer-based systems of the present invention. 
Of course, suitable proprietary systems that may be known to those of skill also 
may be employed in this regard. 

Figure 1 provides a block diagram of a computer system illustrative of 

10 embodiments of this aspect of present invention. The computer system 102 
includes a processor 106 connected to a bus 104. Also connected to the bus 104 
are a main memory 108 (preferably implemented as random access memory, RAM) 
and a variety of secondary storage devices 1 10, such as a hard drive 112 and a 
removable medium storage device 1 14. The removable medium storage device 1 14 

!5 may represent, for example, a floppy disk drive, a CD-ROM drive, a magnetic tape 
drive, etc. A removable storage medium 1 16 (such as a floppy disk, a compact 
disk, a magnetic tape, etc.) containing control logic and/or data recorded therein 
may be inserted into the removable medium storage device 1 14. The computer 
system 102 includes appropriate software for reading the control logic and/or the 

20 data from the removable medium storage device 1 14, once it is inserted into the 
removable medium storage device 1 14. 

A nucleotide sequence of the present invention may be stored in a well 
known manner in the main memory 108, any of the secondary storage devices 1 10. 
and/or a removable storage medium 116. During execution, software for accessing 

25 and processing the genomic sequence (such as search tools, comparing tools, etc.) 
reside in main memory 108, in accordance with the requirements and operating 
parameters of the operating system, the hardware system and the software program 
or programs. 
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BIOCHEMICAL EMBODIMENTS 

Other embodiments of the present invention are directed to isolated 
fragments of the Streptococcus pneumoniae genome. The fragments of the 

5 Streptococcus pneumoniae genome of the present invention include, but are not 
limited to fragments which encode peptides and polypeptides, hereinafter open 
reading frames (ORFsj, fragments which modulate the"expressioh"of'an operably" 
linked ORF, hereinafter expression modulating fragments (EMFs) and fragments 
which can be used to diagnose the presence of Streptococcus pneumoniae in a 

to sample, hereinafter diagnostic fragments (DFs). 

As used herein, an 'isolated nucleic acid molecule" or an "isolated fragment 
of the Streptococcus pneumoniae genome" refers to a nucleic acid molecule 
possessing a specific nucleotide sequence which has been subjected to purification 
means to reduce, from the composition, the number of compounds which arc 

15 normally associated with the composition. Particularly, the term refers to the 
nucleic acid molecules having the sequences set out in SEQ ID NOS: I -391, to 
representative fragments thereof as described above, to polynucleotides at least 
95%. preferably at least 99% and especially preferably at least 99.9% identical in 
sequence thereto, also as set out above. 

20 A variety of purification means can be used to generate the isolated 

fragments of the present invention. These include, but are not limited to methods 
which separate constituents of a solution based on charge, solubility, or size. 

In one embodiment. Streptococcus pneumoniae DNA can be enzymaticaliy 
sheared to produce fragments of 15-20 kb in length. These fragments can then be 

25 used to generate a Streptococcus pneumoniae library by inserting them into lambda 
clones as described in the Examples below. Primers flanking, for example, an 
ORF, such as those enumerated in Tables 1-3 can then be generated using 
nucleotide sequence information provided in SEQ rD NOS: 1-391. Well known 
and routine techniques of PCR cloning then can be used to isolate the ORF from 

30 the lambda DNA library or Streptococcus pneumoniae genomic DNA. Thus, given 
the availability of SEQ ID NOS: 1-391, the information in Tables 1, 2 and 3, and 
the information that may be obtained readily by analysis of the sequences of SEQ 
ID NOS: 1-39 1 using methods set out above, those of skill will be enabled by the 
present disclosure to isolate any ORF-containing or other nucleic acid fragment of 

35 the present invention. 
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The isolated nucleic acid molecules of the present invention include, but arc 
not limited to single stranded and double stranded DNA, and single stranded RNA. 

As used herein, an "open reading frame," ORF, means a series of triplets 
coding for amino acids without any termination codons and is a sequence 
5 translatable into protein. 

Tables 1, 2, and 3 list ORFs in the Streptococcus pneumoniae genomic 
contigs of the present invention that were identified as putative coding regions by 
the GeneMark software using organism-specific second-order Markov probability 
transition matrices. It will be appreciated that other criteria can be used, in 
10 accordance with well known analytical methods, such as those discussed herein, to 
generate more inclusive, more restrictive, or more selective lists. 

Table 1 sets out ORFs in the Streptococcus pneumoniae contigs of the 
present invention that over a continuous region of at least 50 bases are 95% or 
more identical (by BLAST analysis) to a nucleotide sequence available through 
15 GenBank in October. 1997. 

Table 2 sets out ORFs in the Streptococcus pneumoniae contigs of the 
present invention that are not in Table 1 and match, with a BLASTP probability 
score of 0.01 or less, a polypeptide sequence available through GenBank in 
October, 1997. 

20 Table 3 sets out ORFs in the Streptococcus pneumoniae contigs of the 

present invention that do not match significantly, by BLASTP analysis, a 
polypeptide sequence available through GenBank in October, 1997. 

In each table, the first and second columns identify the ORF by, 
respectively, contig number and ORF number within the contig; the third column 

25 indicates the first nucleotide of the ORF (actually the first nucleotide of the stop 
codon immediately preceeding the ORF), counting from the 5* end of the contig 
strand; and the fourth column, "stop (nt)" indicates the last nucleotide of the stop 
codon defining the 3'end of the ORF. 

In Tables 1 and 2, column five, lists the Reference for the closest 

30 matching sequence available through GenBank. These reference numbers are the 
databases entry numbers commonly used by those of skill in the art, who will be 
familiar with their denominators. Descriptions of the nomenclature are available 
from the National Center for Biotechnology Information. Column six in Tables 1 
and 2 provides the gene name of the matching sequence; column seven provides 

35 the BLAST identity score and column eight the BLAST similarity score from the 
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comparison of the ORF and the homologous gene : and column nine indicates the 
length in nucleotides of the highest scoring segment pair identified by the BLAST 
identity analysis. 

Each ORF described in the tables is defined by "start (nt) M (5') and "stop 
5 (nt)" (3') nucleotide position numbers. These position numbers refer to the 
boundaries of each ORF and provide orientation with respect to whether the 
forward or reverse strand is the coding strand and which reading frame the coding 
sequence is contained. The "start" position is the first nucleotide of the triplet 
encoding a stop codon just 5' to the ORF and the "stop" position is the last 

10 nucleotide of the triplet encoding the next in-frame stop codon (i.e., the stop codon 
at the 3' end of the ORF). Those of ordinary skill in the art appreciate that 
preferred fragments within each ORF described in the table include fragments of 
each ORF which include the entire sequence from the delineated "start'* and "stop" 
positions excepting the first and last three nucleotides since these encode stop 

15 codons. Thus, polynucleotides set out as ORFs in the tables but lacking the three 
(3) 5* nucleotides and the three (3) 3* nucleotides are encompassed by the present 
invention. Those of skill also appreciate that particularly preferred are fragments 
within each ORF that are polynucleotide fragments comprising polypeptide coding 
sequence. As defined herein, "coding sequence" includes the fragment within an 

20 ORF beginning at the first in-framc ATG (triplet encoding methionine) and ending 
with the last nucleotide prior to the triplet encoding the 3' stop codon. Preferred 
are fragments comprising the entire coding sequence and fragments comprising the 
entire coding sequence, excepting the coding sequence for the N-terminal 
methionine. Those of skill appreciate that the N-terminal methionine is often 

25 removed during post-translational processing and that polynucleotides lacking the 
ATG can be used to facilitate production of N-termainal fusion proteins which may 
be benefical in the production or use of genetically engineered proteins. Of course, 
due to the degeneracy of the genetic code many polynucleotides can encode a given 
polypeptide. Thus, the invention further includes polynucleotides comprising a 

30 nucleotide sequence encoding a polypeptide sequence itself encoded by the coding 
sequence within an ORF described in Tables 1-3 herein. Further, polynucleotides 
at least 95%, preferably at least 99% and especially preferably at least 99.9% 
identical in sequence to the foregoing polynucleotides, are contemplated by the 
present invention. 
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Polypeptides encoded by polynucleotides described above and elsewhere 
herein are also provided by the present invention as are polypeptide comprising a 
an amino acid sequence at least about 95%, preferably at least 97% and even more 
preferably 99% identical to the amino acid sequence of a polypeptide encoded by an 
5 ORF shown in Tables 1-3. These polypeptides may or may not comprise an N- 
terminal methionine. 

The concepts of percent identity and percent similarity of two polypeptide 
sequences is well understood in the art. For example, two polypeptides 10 amino 
acids in length which differ at three amino acid positions (e.g., at positions 1, 3 

10 and 5) are said to have a percent identity of 70%. However, the same two 
polypeptides would be deemed to have a percent similarity of 80% if, for example 
at position 5, the amino acids moieties, although not identical, were "similar" (i.e.. 
possessed similar biochemical characteristics). Many programs for analysis of 
nucleotide or amino acid sequence similarity, such as fasta and BLAST specifically 

15 list percent identity of a matching region as an output parameter. Thus, for 
instance, Tables 1 and 2 herein enumerate the percent identity of the highest 
scoring segment pair in each ORF and its listed relative. Further details 
concerning the algorithms and criteria used for homology searches are provided 
below and are described in the pertinent literature highlighted by the citations 

20 provided below. 

It will be appreciated that other criteria can be used to generate more 
inclusive and more exclusive listings of the types set out in the tables. As those of 
skill will appreciate, narrow and broad searches both are useful. Thus, a skilled 
artisan can readily identify ORFs in contigs of the Streptococcus pneumoniae 

25 genome other than those listed in Tables 1-3, such as ORFs which are overlapping 
or encoded by the opposite strand of an identified ORF in addition to those 
ascertainable using the computer-based systems of the present invention. 

As used herein, an "expression modulating fragment," EMF, means a 
series of nucleotide molecules which modulates the expression of an operably 

30 linked ORF or EMF. 
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As used herein, a sequence is said to "modulate the expression of an 
operably linked sequence" when the- expression of the sequence is altered by the 
presence of the EMF. EMFs include, but are not limited to, promoters, and 
promoter modulating sequences (inducible elements). One class of EMFs are 
5 fragments which induce the expression or an operably linked ORF in response to a 
specific regulatory factor or physiological event. 

EMF sequences-can be identified within the contigs of the Streptococcus 
pneumoniae genome by their proximity to the ORFs provided in Tables 1-3. An 
intergenic segment, or a fragment of the intergenic segment, from about 10 to 200 

10 nucleotides in length, taken from any one of the ORFs of Tables 1-3 will modulate 
the expression of an operably linked ORF in a fashion similar to that found with the 
naturally I'nked ORF sequence. As used herein, an "intergenic segment" refers to 
fragments of the Streptococcus pneumoniae genome which are between two 
ORF(s) herein described. EMFs also can be identified using known EMFs as a 

15 target sequence or target motif in the computer-based systems of the present 
invention. Further, the two methods can be combined and used together. 

The presence and activity of an EMF can be confirmed using an EMF trap 
vector. An EMF trap vector contains a cloning site linked to a marker sequence. A 
marker sequence encodes an identifiable phenotype, such as antibiotic resistance or 

20 a complementing nutrition auxotrophic factor, which can be identified or assayed 
when the EMF trap vector is placed within an appropriate host under appropriate 
conditions. As described above, a EMF will modulate the expression of an 
operably linked marker sequence. A more detailed discussion of various marker 
sequences is provided below. A sequence which is suspected as being an EMF is 

25 cloned in all three reading frames in one or more restriction sites upstream from the 
marker sequence in the EMF trap vector. The vector is then transformed into an 
appropriate host using known procedures and the phenotype of the transformed 
host in examined under appropriate conditions. As described above, an EMF will 
modulate the expression of an operably linked marker sequence. 

30 As used herein, a "diagnostic fragment," DF. means a series of nucleotide 

molecules which selectively hybridize to Streptococcus pneumoniae sequences. 
DFs can be readily identified by identifying unique sequences within contigs of the 
Streptococcus pneumoniae genome, such as by using well-known computer 
analysis software, and by generating and testing probes or amplification primers 
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consisting of the DF sequence in an appropriate diagnostic format which 
determines amplification or hybridization selectivity. 

The sequences falling within the scope of the present invention are not 
limited to the specific sequences herein described, but also include allelic and 
5 species variations thereof. Allelic and species variations can be routinely 
determined by comparing the sequences provided in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95%, preferrably 
at least 99% and most at least preferably 99.9% identical to SEQ ID NOS: 1-391, 
with a sequence from another isolate of the same species. Furthermore, to 

io accommodate codon variability, the invention includes nucleic acid molecules 
coding for the same amino acid sequences as do the specific ORFs disclosed 
herein. In other words, in the coding region of an ORF. substitution of one codon 
for another which encodes the same amino acid is expressly contemplated. Any 
specific sequence disclosed herein can be readily screened for errors by 

15 resequencing a particular fragment, such as an ORF, in both directions {i.e., 
sequence both strands). Alternatively, error screening can be performed by 
sequencing corresponding polynucleotides of Streptococcus pneumoniae origin 
isolated by using part or all of the fragments in question as a probe or primer. 

Preferred DFs of the present invention comprise at least about 17, 

20 preferrably at least about 20, and more preferrably at least about 50 contiguous 
nucleotides within an ORF set out in Tables 1-3. Most highly preferred DFs 
specifically hybridize to a polynucleotide containing the sequence of the ORF from 
which they are derived. Specific hybridization occurs even under stringent 
conditions defined elsewhere herein. 

25 Each of the ORFs of the Streptococcus pneumoniae genome disclosed in 

Tables 1, 2 and 3, and the EMFs found 5' to the ORFs, can be used as 
polynucleotide reagents in numerous ways. For example, the sequences can be 
used as diagnostic probes or diagnostic amplification primers to detect the presence 
of a specific microbe in a sample, particularly Streptococcus pneumoniae. 

30 Especially preferred in this regard are ORFs such as those of Table 3, which do not 
match previously characterized sequences from other organisms and thus are most 
likely to be highly selective for Streptococcus pneumoniae. Also particularly 
preferred are ORFs that can be used to distinguish between strains of Streptococcus 
pneumoniae, particularly those that distinguish medically important strain, such as 

35 drug-resistant strains. 
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In addition, the fragments of the present invention, as broadly described, 
can be used to control gene expression through triple helix formation or antisense 
DNA or RNA, both of which methods are based on the binding of a polynucleotide 
sequence to DNA or RNA. Triple helix-formation optimally results in a shut-off of 
5 RNA transcription from DNA, while antisense RNA hybridization blocks 
translation of an mRNA molecule into polypeptide. Information from the 
sequences of the present invent^ 

forming oligonucleotides. Polynucleotides suitable for use in these methods are 
usually 20 to 40 bases in length and are designed to be complementary to a region 

10 of the gene involved in transcription, for triple-helix formation, or to the mRNA 
itself, for antisense inhibition. Both techniques have been demonstrated to be 
effective in model systems, and the requisite techniques are well known and 
involve routine procedures. Triple helix techniques are discussed in, for example, 
Lee et al. % Nucl. Acids Res. 6:3073 (1979); Cooney et a/.. Science 241:456 

15 (1988); and Dervan et a/.. Science 25/;1360 (1991). Antisense techniques in 
general are discussed in, for instance, Okano, J. Neurochem, 56:560 (1991) and 
Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression. CRC Press. 
Boca Raton, FL (1988)). 

The present invention further provides recombinant constructs comprising 

20 one or more fragments of the Streptococcus pneumoniae genomic fragments and 
contigs of the present invention. Certain preferred recombinant constructs of the 
present invention comprise a vector, such as a plasmid or viral vector, into which a 
fragment of the Streptococcus pneumoniae genome has been inserted, in a forward 
or reverse orientation. In the case of a vector comprising one of the ORFs of the 

25 present invention, the vector may further comprise regulatory sequences, including 
for example, a promoter, operably linked to the ORF. For vectors comprising the 
EMFs of the present invention, the vector may further comprise a marker sequence 
or heterologous ORF operably linked to the EMF. 

Large numbers of suitable vectors and promoters are known to those of 

30 skill in the art and are commercially available for generating the recombinant 
constructs of the present invention. The following vectors are provided by way of 
example. Useful bacterial vectors include phagescript, PsiX174, pBluescnpt SK. 
pBS KS. pNHSa, pNHI&a, pNHISa, pNH46a (available from Straiagenc); 
pTrc99A, pKK223-3, pKK233-3, pDR540. pRIT5 (available from Pharmacia). 

35 Useful eukaryotic vectors include pWLneo. pSV2cat, pOG44. pXTl. pSG 
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(available from Stratagene) pSVK3, pBPV, pMSG. pSVL (available from 
Pharmacia). 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. 
5 Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial 
promoters include lad, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic 
promoters include CMV immediate early, HSV thymidine kinase, early and late 
SV40, LTRs from retrovirus, and mouse metallothionein- I. Selection of the 
appropriate vector and promoter is well within the level of ordinary skill in the an. 

10 The present invention further provides host cells containing any one of the 

isolated fragments of the Streptococcus pneumoniae genomic fragments and 
contigs of the present invention, wherein the fragment has been introduced into the 
host cell using known methods. The host cell can be a higher eukaryotic host 
cell, such as a mammalian cell, a lower eukaryotic host cell, such as a yeast ceil, or 

1 5 a procaryotic cell, such as a bacterial cell. 

A polynucleotide of the present invention, such as a recombinant construct 
comprising an ORF of the present invention, may be introduced into the host by a 
variety of well established techniques that are standard in the art, such as calcium 
phosphate transfection, DEAE, dextran mediated transfection and electroporation, 

:o which are described in, for instance. Davis, L. et ai % BASIC METHODS IN 
MOLECULAR BIOLOGY (1986). 

A host cell containing one of the fragments of the Streptococcus 
pneumoniae genomic fragments and contigs of the present invention, can be used 
in conventional manners to produce the gene product encoded by the isolated 

25 fragment (in the case of an ORF) or can be used to produce a heterologous protein 
under the control of the EMF. The present invention further provides 

isolated polypeptides encoded by the nucleic acid fragments of the present 
invention or by degenerate variants of the nucleic acid fragments of the present 
invention. By "degenerate variant" is intended nucleotide fragments which differ 

30 from a nucleic acid fragment of the present invention {e.g.. an ORF) by nucleotide 
sequence but, due to the degeneracy of the Genetic Code, encode an identical 
polypeptide sequence. 

Preferred nucleic acid fragments of the present invention are the ORFs and 
subfragments thereof depicted in Tables 2 and 3 which encode proteins. 
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A variety of methodologies known in the an can be utilized to obtain any 
one of the isolated polypeptides or proteins of the present invention. At the 
simplest level, the amino acid sequence can be synthesized using commerciaJly 
available peptide synthesizers. This is particularly useful in producing small 

5 peptides and fragments of larger polypeptides. -Such short fragments as may be 
obtained most readily by synthesis are useful, for example, in generating antibodies 

apinst thelfative polyp'eptide/i^"discussed further below: - — 

In an alternative method, the polypeptide or protein is purified from 
bacterial cells which naturally produce the polypeptide or protein. One skilled in 

10 the art can readily employ well-known methods for isolating polypeptides and 
proioins to isolate and purify polypeptides or proteins of the present invention 
produced naturally by a bacterial strain, or by other methods. Methods for 
isolation and purification that can be employed in this regard include, but are not 
limited to. immunochromatography, HPLC size-exclusion chromatography, ion- 

15 exchange chromatography, and immuno-affinity chromatography. 

The polypeptides and proteins of the present invention also can be purified 
from cells which have been altered to express the desired polypeptide or protein. 
As used herein, a cell is said to be altered to express a desired polypeptide or 
protein when the cell, through geneuc manipulation, is made to produce, a 

20 polypeptide or protein which it normally docs not produce or which the cell 
normally produces at a lower level. Those skilled in the art can readily adapt 
procedures for introducing and expressing either recombinant or synthetic 
sequences into eukaryotic or prokaryotic cells in order to generate a cell which 
produces one of the polypeptides or proteins of the present invention. 

25 Any host/vector system can be used to express one or more of the ORFs of 

the present invention. These include, but are not limited to. eukaryotic hosts such 
as HeLa cells, CV-1 cell, COS cells, and Sf9 cells, as well as prokaryotic host 
such as £. coli and B. subtilis. The most preferred cells are those which do not 
normally express the particular polypeptide or protein or which expresses the 

30 polypeptide or protein at low natural level. 
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"Recombinant, " as used herein, means that a polypeptide or protein is 
derived from recombinant {e.g., microbial or mammalian) expression systems. 
"Microbial" refers to recombinant polypeptides or proteins made in bacterial or 
fungal {e.g., yeast) expression systems. As a product, "recombinant 
5 microbiaTdefines a polypeptide or protein essentially free of native endogenous 
substances and unaccompanied by associated native glycosylation. Polypeptides or 
proteins expressed in most bacterial cultures, e.g., E. colL will be free of 
glycosylation modifications; polypeptides or proteins expressed in yeast will have a 
glycosylation pattern different from that expressed in mammalian cells. 

10 "Nucleotide sequence" refers to a heteropolymer of deoxyribonucleotides. 

Generally, DNA segments encoding the polypeptides and proteins provided by this 
invention are assembled from fragments of the Streptococcus pneumoniae genome 
and short oligonucleotide linkers, or from a series of oligonucleotides, to provide a 
synthetic gene which is capable of being expressed in a recombinant transcriptional 

1 5 unit comprising regulatory elements derived from a microbial or viral operon. 

Recombinant expression vehicle or vector" refers to a plasmid or phage or 
virus or vector, for expressing a polypeptide from a DNA (RNA) sequence. The 
expression vehicle can comprise a transcriptional unit comprising an assembly of 
(1) a genetic regulatory elements necessary for gene expression in the host, 

20 including elements required to initiate and maintain transcription at a level sufficient 
for suitable expression of the desired polypeptide, including, for example, 
promoters and, where necessary, an enhancer and a polyadenylation signal; (2) a 
structural or coding sequence which is transcribed into mRNA and translated into 
protein, and (3) appropriate signals to initiate translation at the beginning of the 

25 desired coding region and terminate translation at its end. Structural units intended 
for use in yeast or eukaryotic expression systems preferably include a leader 
sequence enabling extracellular secretion of translated protein by a host cell. 
Alternatively, where recombinant protein is expressed without a leader or transport 
sequence, it may include an N-terminal methionine residue. This residue may or 

30 may not be subsequently cleaved from the expressed recombinant protein to 
provide a final product. 

"Recombinant expression system" means host cells which have stably 
integrated a recombinant transcriptional unit into chromosomal DNA or carry the 
recombinant transcriptional unit extra chromosomally. The cells can be prokaryotic 

35 or eukaryotic. Recombinant expression systems as defined herein will express 
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heterologous polypeptides or proteins upon induction of the regulatory elements 
linked to the DNA segment or synthetic gene to be expressed. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or 
other cells under the control of appropriate promoters. Cell-free translation 
5 systems can also be employed to produce such proteins using RNAs derived from 
the DNA constructs of the present invention. Appropriate cloning and expression 
vectors for use with prokaryotic and eukaryotic hosts are described in Sambrook et 
a/.. Molecular Cloning: A Laboratory Manual, 2 nc * Edition, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1989), the disclosure of which 

10 is hereby incorporated by reference in its entirety. 

Generally, recombinant expression vectors will include origins of 
replication and selectable markers permitting transformation of the host cell, e.g., 
the ampicillin resistance gene of E. coli and 5. cerevisiae TRP1 gene, and a 
promoter derived from a highly expressed gene to direct transcription of a 

15 downstream structural sequence. Such promoters can be derived from operons 
encoding glycolytic enzymes such as 3- phosphoglycerate kinase (PGK), alpha- 
factor, acid phosphatase, or heat shock proteins, among others. The heterologous 
structural sequence is assembled in appropriate phase with translation initiation and 
termination sequences, and preferably, a leader sequence capable of directing 

20 secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an N- 
terminal identification peptide imparting desired characteristics, e.g., stabilization 
or simplified purification of expressed recombinant product. 

Useful expression vectors for bacterial use are constructed by inserting a 

25 structural DNA sequence encoding a desired protein together with suitable 
translation initiation and termination signals in operable reading phase with a 
functional promoter. The vector will comprise one or more phenotypic selectable 
markers and an origin of replication to ensure maintenance of the vector and. when 
desirable, provide amplification within the host. 

30 Suitable prokaryotic hosts for transformation include strains of E. coli, B. 

subtilis. Salmonella typhimurium and various species within the genera 
Pseudomonas and Streptomyces. Others may, also be employed as a matter of 
choice. 

As a representative but non-limiting example, useful expression vectors for 
35 bacterial use can comprise a selectable marker and bacterial origin of replication 
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derived from commercially available plasmids comprising genetic elements of the 
well known cloning vector pBR322 (ATCC 37017). Such commercial vectors 
include, for example, pKK223-3 (available form Pharmacia Fine Chemicals, 
Uppsala, Sweden) and GEM 1 (available from Promega Biotec, Madison, WI. 

5 USA). These pBR322 "backbone" sections -are combined with an appropriate 
promoter and the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host 
strain to an appropriate cell density, the selected promoter, where it is inducible, is 
derepressed or induced by appropriate means (e.g., temperature shift or chemical 

10 induction) and cells are cultured for an additional period to provide for expression 
of the induced gene product. Thereafter cells are typically harvested, generally by 
centrifugation, disrupted to release expressed protein, generally by physical or 
chemical means, and the resulting crude extract is retained for further purification. 
Various mammalian cell culture systems can also be employed to express 

15 recombinant protein. Examples of mammalian expression systems include the 
COS-7 lines of monkey kidney fibroblasts, described in Gluzman, Cell 23: 175 
( 198 1 ), and other cell lines capable of expressing a compatible vector, for example, 
the C127, 3T3, CHO, HeLa and BHK cell lines. 

Mammalian expression vectors will comprise an origin of replication, a 

20 suitable promoter and enhancer, and also any necessary ribosome binding sites, 
polyadenylation site, splice donor and acceptor sites, transcriptional termination 
sequences, and 5' flanking nontranscribed sequences. DNA sequences derived 
from the S V40 viral genome, for example, S V40 origin, early promoter, enhancer, 
splice, and polyadenylation sites may be used to provide the required 

25 nontranscribed genetic elements. 

Recombinant polypeptides and proteins produced in bacterial culture is 
usually isolated by initial extraction from cell pellets, followed by one or more 
salting-out, aqueous ion exchange or size exclusion chromatography steps. 
Microbial cells employed in expression of proteins can be disrupted by any 

30 convenient method, including freeze-thaw cycling, sonication. mechanical 
disruption, or use of cell lysing agents. Protein refolding steps can be used, as 
necessary, in completing configuration of the mature protein. Finally, high 
performance liquid chromatography (HPLC)can be employed for final purification 
steps. 
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The present invention further includes isolated polypeptides, proteins and 
nucleic acid molecules which are substantially equivalent to those herein described. 
As used herein, substantially equivalent can refer both to nucleic acid and amino 
acid sequences, for example a mutant sequence, that varies from a reference 
5 sequence by one or more substitutions, deletions, or additions, the net effect of 
which does not result in an adverse functional dissimilarity between reference and 
subject sequences. For purposes of the present invention, sequences having 
equivalent biological activity, and equivalent expression characteristics are 
considered substantially equivalent. For purposes of determining equivalence, 

10 truncation of the mature sequence should be disregarded. 

The invenuon further provides methods of obtaining homoiogs from other 
strains of Streptococcus pneumoniae, of the fragments of the Streptococcus 
pneumoniae genome of the present invention and homoiogs of the proteins encoded 
by the ORFs of the present invention. As used herein, a sequence or protein of 

15 Streptococcus pneumoniae is defined as a homolog of a fragment of the 
Streptococcus pneumoniae fragments or contigs or a protein encoded by one of the 
ORFs of the present invention, if it shares significant homology to one of the 
fragments of the Streptococcus pneumoniae genome of the present invention or a 
protein encoded by one of the ORFs of the present invention. Specifically, by 

20 using the sequence disclosed herein as a probe or as primers, and techniques such 
as PCR cloning and colony/plaque hybridization, one skilled in the art can obtain 
homoiogs. 

As used herein, two nucleic acid molecules or proteins are said to "share 
significant homology" if the two contain regions which possess greater than 85% 

25 sequence (amino acid or nucleic acid) homology. Preferred homoiogs in this 
regard are those with more than 90% homology. Especially preferred are those 
with 93% or more homology. Among especially preferred homoiogs those with 
95% or more homology are particularly preferred. Very particularly preferred 
among these are those with 97% and even more particularly preferred among those 

30 are homoiogs with 99% or more homology. The most preferred homoiogs among 
these are those with 99.9% homology or more. It will be understood that, among 
measures of homology, identity is particularly preferred in this regard. 

Region specific primers or probes derived from the nucleotide sequence 
provided in SEQ ID NOS:l-39l or from a nucleotide sequence at least 95%, 

35 particularly at least 99%, especially at least 99.5% identical to a sequence of SEQ 
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ID NOS: 1-391 can be used to prime DNA synthesis and PCR amplification, as 
well as to identify colonies containing cloned DNA encoding a homolog. Methods 
suitable to this aspect of the present invention are well known and have been 
described in great detail in many publications such as, for example, Innis et ai, 
5 PCR Protocols. Academic Press, San Diego, CA ( 1 990)). 

When using primers derived from SEQ ID NOS; 1-391 or from a nucleotide 
sequence having an aforementioned identity to a sequence of SEQ ID NOS: 1-391, 
one skilled in the art will recognize that by employing high stringency conditions 
(e.g., annealing at 50-60°C in 6X SSPC and 50% formamide, and washing at 50- 

10 65°C in 0.5X SSPC) only sequences which are greater than 75% homologous to 
the primer will be amplified. By employing lower stringency conditions (e.g.* 
hybridizing at 35-37°C in 5X SSPC and 40-45% formamide, and washing at 42°C 
in 0.5X SSPC), sequences which are greater than 40-50% homologous to the 
primer will also be amplified. 

15 When using DNA probes derived from SEQ ID NOS: 1-391, or from a 

nucleotide sequence having an aforementioned identity to a sequence of SEQ ID 
NOS: 1-391, for colony/plaque hybridization, one skilled in the art will recognize 
that by employing high stringency conditions (e.g., hybridizing at 50- 65°C in 5X 
SSPC and 50% formamide, and washing at 50- 65°C in 0.5X SSPC), sequences 

20 having regions which are greater than 90% homologous to the probe can be 
obtained, and that by employing lower stringency conditions (e.g., hybridizing at 
35-37°C in 5X SSPC and 40-45% formamide, and washing at 42°C in 0.5X 
SSPC), sequences having regions which are greater than 35-45% homologous to 
the probe will be obtained. 

25 Any organism can be used as the source for homologs of the present 

invention so long as the organism naturally expresses such a protein or contains 
genes encoding the same. The most preferred organism for isolating homologs are 
bacteria which are closely related to Streptococcus pneumoniae. 

30 ILLUSTRATIVE USES OF COMPOSITIONS OF THE 

INVENTION 

Each ORF provided in Tables I and 2 is identified with a function by 
homology to a known gene or polypeptide. As a result, one skilled in the art can 
use the polypeptides of the present invention for commercial, therapeutic and 
35 industrial purposes consistent with the type of putative identification of the 
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polypeptide. Such identifications permit one— skilled in the art to use the 
Streptococcus pneumoniae ORFs in a manner similar to the known type of 
sequences for which the identification is made; for example, to ferment a particular 
sugar source or to produce a particular metabolite. A variety of reviews illustrative 

5 of this aspect of the invention are available, including the following reviews on the 
industrial use of enzymes, for example, BIOCHEMICAL ENGINEERING AND 
BIOTECHNOLOGY HANDBOOK, 2nd Ed., MacMillan Publications: Ltd. NY 
(1991) and BIOCATALYSTS IN ORGANIC SYNTHESES. Tramper*r al. 9 Eds.. 
Elsevier Science Publishers, Amsterdam, The Netherlands (1985). A variety of 

10 exemplary uses that illustrate this and similar aspects of the present invention are 
discussed below. 

1. Biosynthetic Enzymes 

Open reading frames encoding proteins involved in mediating the catalytic 

15 reactions involved in intermediary and macromolecular metabolism, the 
biosynthesis of small molecules, cellular processes and other functions includes 
enzymes involved in the degradation of the intermediary products of metabolism, 
enzymes involved in central intermediary metabolism, enzymes involved in 
respiration, both aerobic and anaerobic, enzymes involved in fermentation, 

20 enzymes involved in ATP proton motor force conversion, enzymes involved in 
broad regulatory function, enzymes involved in amino acid synthesis, enzymes 
involved in nucleotide synthesis, enzymes involved in cofactor and vitamin 
synthesis, can be used for industrial biosynthesis. 

The various metabolic pathways present in Streptococcus pneumoniae can 

25 be identified based on absolute nutritional requirements as well as by examining the 
various enzymes identified in Table 1-3 and SEQ ID NOS: 1-391. 

Of particular interest are polypeptides involved in the degradation of 
intermediary metabolites as well as non-macromolecular metabolism. Such 
enzymes include amylases, glucose oxidases, and catalase. 

30 Proteolytic enzymes are another class of commercially important enzymes. 

Proteolytic enzymes find use in a number of industrial processes including the 
processing of flax and other vegetable fibers, in the extraction, clarification and 
depectinization of fruit juices, in the extraction of vegetables' oil and in the 
maceration of fruits and vegetables to give unicellular fruits. A detailed review of 

35 the proteolytic enzymes used in the food industry is provided in Rombouts et aL, 
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Symbiosis 21:79 (1986) and Voragen et at in Biocataiysts In Agricultural 
Biotechnology, Whitaker et aL* Eds., American Chemical Society Symposium 
Series 389:93 (1989) . 

The metabolism of sugars is an important aspect of the primary metabolism 
5 of Streptococcus pneumoniae. Enzymes involved in the degradation of sugars, 
such as, particularly, glucose, galactose, fructose and xylose, can be used in 
industrial fermentation. Some of the important sugar transforming enzymes, from 
a commercial viewpoint, include sugar isomerases such as glucose isomerase. 
Other metabolic enzymes have found commercial use such as glucose oxidases 

10 which produces ketogulonic acid (KGA). KGA is an intermediate in the 
commercial production of ascorbic acid using the Reichsteins procedure, as 
described in Knieger et a/., Biotechnology 6(A) . Rhine et a/., Eds., Veriag Press, 
Weinhcim, Germany (1984). 

Glucose oxidase (GOD) is commercially available and has been used in 

15 purified form as well as in an immobilized form for the deoxygenation of beer. 
See, for instance, Hartmeir et a/., Biotechnology Letters 1:21 (1979). The most 
important application of GOD is the industrial scale fermentation of gluconic acid. 
Market for gluconic acids which are used in the detergent, textile, leather, 
photographic, pharmaceutical, food, feed and concrete industry, as described, for 

20 example, in Bigelis etai, beginning on page 357 in GENE MANIPULATIONS 
AND FUNGI; Benett et aL Eds., Academic Press, New York ( 1985). In addition 
to industrial applications, GOD has found applications in medicine for quantitative 
determination of glucose in body fluids recently in biotechnology for analyzing 
syrups from starch and cellulose hydrosylates. This application is described in 

25 Owusu et aL, Biochem. et Biophysica. Acta. 572:83 (1986), for instance. 

The main sweetener used in the world today is sugar which comes from 
sugar beets and sugar cane. In the field of industrial enzymes, the glucose 
isomerase process shows the largest expansion in the market today. Initially, 
soluble enzymes were used and later immobilized enzymes were developed 

30 (Krueger etai. Biotechnology, The Textbook of Industrial Microbiology , Sinauer 
Associated Incorporated, Sunderland, Massachusetts (1990)). Today, the use of 
glucose- produced high fructose syrups is by far the largest industrial business 
using immobilized enzymes. A review of the industrial use of these enzymes is 
provided by Jorgensen, Starch 40:307 (1988). 
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Proteinases, such as alkaline serine proteinases, are used as detergent 
additives and thus represent one of the largest volumes of microbial enzymes used 
in the industrial sector. Because of their industrial importance, there is a large body 
of published and unpublished information regarding the use of these enzymes in 
5 industrial processes. (See Faultman etat. y Acid Proteases Structure Function and 
Biology, Tang, J., ed., Plenum Press, New York (1977) and Godfrey et ai t 
Industrial Enzymes7MacMillan PublishersrSurrey, UK (1983) and Hepner *?r aL,~ 
Report Industrial Enzymes by 1990, Hel Hepner & Associates, London ( 1986)). 

Another class of commercially usable proteins of the present invention are 

10 the microbial lipases, described by, for instance, Macrae et al. % Philosophical 
Transactions of the Chiral Society of London 310:221 (1985) and Poserke, Journal 
of the American Oil Chemist Society 61: 1758 (1984). A major use of lipases is in 
the fat and oil industry for the production of neutral glycerides using lipase 
catalyzed inter-esterification of readily available triglycerides. Application of 

15 lipases include the use as a detergent additive to facilitate the removal of fats from 
fabrics in the course of the washing procedures. 

The use of enzymes, and in particular microbial enzymes, as catalyst for 
key steps in the synthesis of complex organic molecules is gaining popularity at a 
great rate. One area of great interest is the preparation of chiral intermediates. 

20 Preparation of chiral intermediates is of interest to a wide range of synthetic 
chemists particularly those scientists involved with the preparation of new 
pharmaceuticals, agrochemicals, fragrances and flavors. (See Davies et qL % Recent 
Advances in the Generation of Chiral Intermediates Using Enzymes. CRC Press, 
Boca Raton, Florida (1990)). The following reactions catalyzed by enzymes are of 

25 interest to organic chemists: hydrolysis of carboxylic acid esters, phosphate esters, 
amides and nitriles, esterification reactions, trans-esterification reactions, synthesis 
of amides, reduction of alkanones and oxoalkanates, oxidation of alcohols to 
carbonyl compounds, oxidation of sulfides to sulfoxides, and carbon bond forming 
reactions such as the aldol reaction. 

30 When considering the use of an enzyme encoded by one of the ORFs of the 

present invention for biotransformation and organic synthesis it is sometimes 
necessary to consider the respective advantages and disadvantages of using a 
microorganism as opposed to an isolated enzyme. Pros and cons of using a whole 
cell system on the one hand or an isolated partially purified enzyme on the other 
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hand, has been described in detail by Bud et a/.. Chemistry in Britain (1987), p. 
127. 

Amino transferases, enzymes involved in the biosynthesis and metabolism 
of amino acids, are useful in the catalytic production of amino acids. The 

5 advantages of using microbial based enzyme systems is that the amino transferase 
enzymes catalyze the stereo- selective synthesis of only L-amino acids and 
generally possess uniformly high catalytic rates. A description of the use of amino 
transferases for amino acid production is provided by Roselle-David, Methods of 
Enzymology 136:419 (1987). 

10 Another category of useful proteins encoded by the ORFs of the present 

invention include enzymes involved in nucleic acid synthesis, repair, and 
recombination. 

2. Generation of Antibodies 

15 As described here, the proteins of the present invention, as well as 

homologs thereof, can be used in a variety of procedures and methods known in 
the art which are currently applied to other proteins. The proteins of the present 
invention can further be used to generate an antibody which selectively binds the 
protein. Such antibodies can be either monoclonal or polyclonal antibodies, as well 

20 fragments of these antibodies, and humanized forms. 

The invention further provides antibodies which selectively bind to one of 
the proteins of the present invention and hybridomas which produce these 
antibodies. A hybridoma is an immortalized cell line which is capable of secreting 
a specific monoclonal antibody. 

25 In general, techniques for preparing polyclonal and monoclonal antibodies 

as well as hybridomas capable of producing the desired antibody are well known in 
the art (Campbell, A. M., Monoclonal Antibody Technology: Laboratory 
Techniques In Biochemistry And Molecular Biology, Elsevier Science Publishers. 
Amsterdam, The Netherlands (1984); St. Groth et aL, J. Immunol Methods 35: 1- 

30 21 (1980), Kohler and Milstein, Nature 256:495-497 (1975)). the trioma 
technique, the human B-cell hybridoma technique (Kozbor et ai, Immunology 
Today 4:12 (1983), pgs. 77-96 of Cole et a/., in Monoclonal Antibodies And 
Cancer Therapy, Alan R. Liss. Inc. (1985)). Any animal (mouse, rabbit, 

etc.) which is known to produce antibodies can be immunized with the pseudogene 

35 polypeptide. Methods for immunization are well known in the art. Such methods 
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include subcutaneous or interperitoneal injection of the polypeptide. One skilled in 
the an will recognize that the amount of the protein encoded by the ORF of the 
present invention used for immunization will vary based on the animal which is 
immunized, the antigenicity of the peptide and the site of injection. 

5 The protein which is used as an immunogen may be modified or 

administered in an adjuvant in order to increase the protein's antigenicity. Methods 
of increasing the antigenicity of a protein are well known in the art and include, but 
are not limited to coupling the antigen with a heterologous protein (such as globulin 
or galactosidase) or through the inclusion of an adjuvant during immunization. 

10 For monoclonal antibodies, spleen cells from the immunized animals are 

removed, fused with myeloma cells, such as SP2/0-Agl4 myeloma cells, and 
allowed to become monoclonal antibody producing hybridoma cells. 

Any one of a number of methods well known in the art can be used to 
identify the hybridoma cell which produces an antibody with the desired 

15 characteristics. These include screening the hybridomas with an ELISA assay, 
western blot analysis, or radioimmunoassay (Lutz et a/. t Exp. Cell Res. 175; 109- 
124 (1988)). 

Hybridomas secreting the desired antibodies are cloned and the class and 
subclass is determined using procedures known in the art (Campbell, A. M, 
20 Monoclonal Antibody Technology: Laboratory Techniques in Biochemistry and 
Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands 
(1984)). 

Techniques described for the production of single chain antibodies (U. S. 
Patent 4,946,778) can be adapted to produce single chain antibodies to proteins of 
25 the present invention. 

For polyclonal antibodies, antibody containing antisera is isolated from the 
immunized animal and is screened for the presence of antibodies with the desired 
specificity using one of the above-described procedures. 

The present invention further provides the above- described antibodies in 
30 detectably labelled form. Antibodies can be detectably labelled through the use of 
radioisotopes, affinity labels (such as biotin, avidin, etc.). enzymatic labels (such 
as horseradish peroxidase, alkaline phosphatase, etc.) fluorescent labels (such as 
FITC or rhodamme, etch paramagnetic atoms, etc. Procedures for accomplishing 
such labeling are well-known in the art. for example see Stemberger et a/., J. 
35 Histochem. Cytochem. 18:315 (1970); Bayer. E. A. et a/.. Meth. Enzym. 62:308 
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(1979); Engval, E. etal.. Immunol. 709:129 (1972); Goding, J. W., /. Immunol, 
Meth. 75. 215 (1976)). 

The labeled antibodies of the present invention can be used for in vitro, in 
vivo, and in situ assays to identify cells or tissues in which a fragment of the 

5 Streptococcus pneumoniae genome is expressed. 

The present invention further provides the above-described antibodies 
immobilized on a solid support. Examples of such solid supports include plastics 
such as polycarbonate, complex carbohydrates such as agarose and sepharose, 
acrylic resins and such as polyacrylamide and latex beads. Techniques for 

10 coupling antibodies to such solid supports are well known in the art (Weir, D. M. 
et ai, "Handbook of Experimental Immunology" 4th Ed.. Blackwell Scientific 
Publications, Oxford, England, Chapter 10 (1986); Jacoby, W. D. et a/., Meth. 
Enzym. 34 Academic Press, N. Y. (1974)). The immobilized antibodies of the 
present invention can be used for in vitro, in vivo, and in situ assays as well as for 

15 immunoaffinity purification of the proteins of the present invention. 

3. Diagnostic Assays and Kits 

The present invention further provides methods to identify the expression 
of one of the ORFs of the present invention, or homolog thereof, in a test sample, 

20 using one of the DFs or antibodies of the present invention. 

In detail, such methods comprise incubating a test sample with one or more 
of the antibodies or one or more of the DFs of the present invention and assaying 
for binding of the DFs or antibodies to components within the test sample. 

Conditions for incubating a DF or antibody with a test sample vary. 

25 Incubation conditions depend on the format employed in the assay, the detection 
methods employed, and the type and nature of the DF or antibody used in the 
assay. One skilled in the art will recognize that any one of the commonly available 
hybridization, amplification or immunological assay formats can readily be adapted 
to employ the DFs or antibodies of the present invention. Examples of such assays 

30 can be found in Chard, T., An Introduction to Radioimmunoassay and Related 
Techniques, Elsevier Science Publishers, Amsterdam, The Netherlands (1986); 
Bullock, G. R. et aL, Techniques in Immunocytochemistry t Academic Press, 
Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice and 
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and 
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Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands 
(1985). 

The test samples of the present invention include cells, protein or membrane 
extracts of cells, or biological fluids such as sputum, blood, serum, plasma, or 
5 urine. The test sample used in the above-described method will vary based on the 
assay format, nature of the detection method and the tissues, cells or extracts used 
as the sample to be assayed. Methods for pTepanng^fotein extracts~6T membrane" 
extracts of cells are well known in the an and can be readily be adapted in order to 
obtain a sample which is compatible with the system utilized. 

10 In another embodiment of the present invention, kits are provided which 

contain the necessary reagents to carry out the assays of the present invention. 

Specifically, the invention provides a compartmentalized kit to receive, in 
close confinement, one or more containers which comprises: (a) a first container 
comprising one of the DFs or antibodies of the present invention; and (b) one or 

15 more other containers comprising one or more of the following: wash reagents, 
reagents capable of detecting presence of a bound DF or antibody. 

In detail, a compartmentalized kit includes any kit in which reagents are 
contained in separate containers. Such containers include small glass containers, 
plastic containers or strips of plastic or paper. Such containers allows one to 

20 efficiently transfer reagents from one compartment to another compartment such 
that the samples and reagents are not cross-contaminated, and the agents, or 
solutions of each container can be added in a quantitative fashion from one 
compartment to another. Such containers will include a container which will accept 
the test sample, a container which contains the antibodies used in the assay. 

25 containers which contain wash reagents (such as phosphate buffered saline, Tris- 
buffers, etc.). and containers which contain the reagents used to detect the bound 
antibody or DF. 

Types of detection reagents include labelled nucleic acid probes, labelled 
secondary antibodies, or in the alternative, if the primary antibody is labelled, the 
30 enzymatic, or antibody binding reagents which are capable of reacting with the 
labelled antibody. One skilled in the art will readily recognize that the disclosed 
DFs and antibodies of the present invention can be readily incorporated into one of 
the established kit formats which are well known in the art. 

35 4. Screening Assay for Binding Agents 
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Using the isolated proteins of the present invention, the present invention 
further provides methods of obtaining and identifying agents which bind to a 
protein encoded by one of the ORFs of the present invention or to one of the 
fragments and the Streptococcus pneumoniae fragment and contigs herein 
5 described. 

In general, such methods comprise steps of: 

(a) contacting an agent with an isolated protein encoded by one of the 
ORFs of the present invention, or an isolated fragment of the Streptococcus 
pneumoniae genome; and 

10 (b) determining whether the agent binds to said protein or said fragment. 

The agents screened in the above assay can be, but are not limited to, 
peptides, carbohydrates, vitamin derivatives, or other pharmaceutical agents. The 
agents can be selected and screened at random or rationally selected or designed 
using protein modeling techniques. 

15 For random screening, agents such as peptides, carbohydrates, 

pharmaceutical agents and the like are selected at random and are assayed for their 
ability to bind to the protein encoded by the ORF of the present invention. 

Alternatively, agents may be rationally selected or designed. As used 
herein, an agent is said to be "rationally selected or designed" when the agent is 

20 chosen based on the configuration of the particular protein. For example, one 
skilled in the art can readily adapt currently available procedures to generate 
peptides, pharmaceutical agents and the like capable of binding to a specific peptide 
sequence in order to generate rationally designed antipeptide peptides, for example 
see Hurby et a/.. "Application of Synthetic Peptides: Antisense Peptides," in 

25 Synthetic Peptides, A Users Guide. W. H. Freeman, NY (1992), pp. 289-307, 
and Kaspczak et a/.. Biochemistry 25.9230-8 (1989), or pharmaceutical agents, or 
the like. 

In addition to the foregoing, one class of agents of the present invention, as 
broadly described, can be used to control gene expression through binding to one 
30 of the ORFs or EMFs of the present invention. As described above, such agents 
can be randomly screened or rationally designed/selected. Targeting the ORF or 
EMF allows a skilled artisan to design sequence specific or element specific agents, 
modulating the expression of either a single ORF or multiple ORFs which rely on 
the same EMF for expression control. 
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One class of DNA binding agents are agents which contain base residues 
which hybridize or form a triple helix by binding to DNA or RNA. Such agents 
can be based on the classic phosphodiester, ribonucleic acid backbone, or can be a 
variety of sulfhydryl or polymeric derivatives which have base attachment capacity. 

5 Agents suitable for use in these methods usually contain 20 to 40 bases and 

are designed to be complementary to a region of the gene involved in transcription 
(triple helix - see Lee et atrNuct Acids Res: 6:3073 (1979); Cooney et at. 
Science 24IA56 (1988); and Dervan et at. Science 257:1360 (1991)) or to the 
mRNA itself (antisense - Okano, 7. Neurochem. 56:560 (1991); 

10 Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression. CRC Press, 
Boca Raton, FL (1988)). Triple helix- formation optimally results in a shut-off of 
RNA transcription from DNA, while antisense RNA hybridization blocks 
translation of an mRNA molecule into polypeptide. Both techniques have been 
demonstrated to be effective in model systems. Information contained in the 

15 sequences of the present invention can be used to design antisense and triple helix- 
forming oligonucleotides, and other DNA binding agents. 

5. Pharmaceutical Compositions and Vaccines 

The present invention further provides pharmaceutical agents which can be 

20 used to modulate the growth or pathogenicity of Streptococcus pneumoniae, or 
another related organism, in vivo or in vitro. As used herein, a "pharmaceutical 
agent" is defined as a composition of matter which can be formulated using known 
techniques to provide a pharmaceutical compositions. As used herein, the 
"pharmaceutical agents of the present invention" refers the pharmaceutical agents 

25 which axe derived from the proteins encoded by the ORFs of the present invention 
or are agents which are identified using the herein described assays. 

As used herein, a pharmaceutical agent is said to "modulate the growth 
pathogenicity of Streptococcus pneumoniae or a related organism, in vivo or in 
vitro" when the agent reduces the rate of growth, rate of division, or viability of 

30 the organism in question. The pharmaceutical agents of the present invention can 
modulate the growth or pathogenicity of an organism in many fashions, although 
an understanding of the underlying mechanism of action is not needed to practice 
the use of the pharmaceutical agents of the present invention. Some agents will 
modulate the growth by binding to an important protein thus blocking the biological 

35 activity of the protein, while other agents may bind to a component of the outer 
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surface of the organism blocking attachment or rendering the organism more prone 
to act the bodies nature immune system. Alternatively, the agent may comprise a 
protein encoded by one of the ORFs of the present invention and serve as a 
vaccine. The development and use of a vaccine based on outer membrane 

5 components are well known in the art. — 

As used herein, a "related organism" is a broad term which refers to any 
organism whose growth can be modulated by one of the pharmaceutical agents of 
the present invention. In general, such an organism will contain a homolog of the 
protein which is the target of the pharmaceutical agent or the protein used as a 

10 vaccine. As such, related organisms do not need to be bacterial but may be fungal 
or viral pathogens. 

The pharmaceutical agents and compositions of the present invention may 
be administered in a convenient manner, such as by the oral t topical, intravenous, 
intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal routes. The 

15 pharmaceutical compositions are administered in an amount which is effective for 
treating and/or prophylaxis of the specific indication. In general, they are 
administered in an amount of at least about I mg/kg body weight and in most cases 
they will be administered in an amount not in excess of about 1 g/kg body weight 
per day. In most cases, the dosage is from about 0. 1 mg/kg to about 10 g/kg body 

20 weight daily, taking into account the routes of administration, symptoms, etc. 

The agents of the present invention can be used in native form or can be 
modified to form a chemical derivative. As used herein, a molecule is said to be a 
"chemical derivative" of another molecule when it contains additional chemical 
moieties not normally a pan of the molecule. Such moieties may improve the 

25 molecule s solubility, absorption, biological half life, etc. The moieties may 
alternatively decrease the toxicity of the molecule, eliminate or attenuate any 
undesirable side effect of the molecule, etc. Moieties capable of mediating such 
effects are disclosed in, among other sources, REMINGTON'S 
PHARMACEUTICAL SCIENCES (1980) cited elsewhere herein. 

30 For example, such moieties may change an immunological character of the 

functional derivative, such as affinity for a given antibody. Such changes in 
immunomodulation activity are measured by the appropriate assay, such as a 
competitive type immunoassay. Modifications of such protein properties as redox 
or thermal stability, biological half-life, hydrophobicity, susceptibility to proteolytic 

35 degradation or the tendency to aggregate with carriers or into multimers also may 
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be effected in this way and can be assayed by methods well known to the skilled 
artisan. 

The therapeutic effects of the agents of the present invention may be 
obtained by providing the agent to a patient by any suitable means (e.g., inhalation, 
5 intravenously, intramuscularly, subcutaneously, enterally. or parentcrally). It is 
preferred to administer the agent of the present invention so as to achieve an 
effective concentration within the~blood or tissue in which the~growth of the 
organism is to be controlled. To achieve an effective blood concentration, the 
preferred method is to administer the agent by injection. The administration may be 

10 by continuous infusion, or by single or multiple injections. 

In providing a patient with one of the agents of the present invention, the 
dosage of the administered agent will vary depending upon such factors as the 
patient's age, weight, height, sex, general medical condition, previous medical 
history, etc. In general, it is desirable to provide the recipient with a dosage of 

15 agent which is in the range of from about 1 pg/kg to 10 mg/kg (body weight of 
patient), although a lower or higher dosage may be administered. The 
therapeutically effective dose can be lowered by using combinations of the agents 
of the present invention or another agent. 

As used herein, two or more compounds or agents are said to be 

20 administered "in combination* with each other when either (1) the physiological 
effects of each compound, or (2) the serum concentrations of each compound can 
be measured at the same time. The composition of the present invention can be 
administered concurrently with, prior to, or following the administration of the 
other agent. 

25 The agents of the present invention are intended to be provided to recipient 

subjects in an amount sufficient to decrease the rate of growth (as defined above) of 
the target organism. 

The administration of the agent(s) of the invention may be for either a 
"prophylactic" or "therapeutic" purpose. When provided prophylactically, the 

30 agent(s) are provided in advance of any symptoms indicative of the organisms 
growth. The prophylactic administration of the agent(s) serves to prevent, 
attenuate, or decrease the rate of onset of any subsequent infection. When 
provided therapeutically, the agent(s) are provided at (or shortly after) the onset of 
an indication of infection. The therapeutic administration of the compound(s) 
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serves to attenuate the pathological symptoms of the infection and to increase the 
rate of recovery. 

The agents of the present invention are administered to a subject, such as a 
mammal, or a patient, in a pharmaceutical^ acceptable form and in a therapeutically 
5 effective concentration. A composition is said to be "pharmacologically acceptable" 
if its administration can be tolerated by a recipient patient. Such an agent is said to 
be administered in a "therapeutically effective amount" if the amount administered 
is physiologically significant. An agent is physiologically significant if its presence 
results in a detectable change in the physiology of a recipient patient. 

10 The agents of the present invention can be formulated according to known 

methods to prepare pharmaceutical^ useful compositions, whereby these materials, 
or their functional derivatives, are combined in a mixture with a pharmaceutical^ 
acceptable carrier vehicle. Suitable vehicles and their formulation, inclusive of 
other human proteins, e.g., human serum albumin, are described, for example, in 

15 REMINGTONS PHARMACEUTICAL SCIENCES, 16 th Ed., Osol, A., Ed., 
Mack Publishing, Easton PA (1980). In order to form a pharmaceutical ly 
acceptable composition suitable for effective administration, such compositions will 
contain an effective amount of one or more of the agents of the present invention, 
together with a suitable amount of carrier vehicle. 

-0 Additional pharmaceutical methods may be employed to control the duration 

of action. Control release preparations may be achieved through the use of 
polymers to complex or absorb one or more of the agents of the present invention. 
The controlled delivery may be effectuated by a variety of well known techniques, 
including formulation with macromolecules such as. for example, polyesters, 

25 polyamino acids, polyvinyl, pyrrolidone, ethylenevinylacetate, methylcellulose, 
carboxymethylcellulose, or protamine, sulfate, adjusting the concentration of the 
macromolecules and the agent in the formulation, and by appropriate use of 
methods of incorporation, which can be manipulated to effectuate a desired time 
course of release. Another possible method to control the duration of action by 

30 controlled release preparations is to incorporate agents of the present invention into 
particles of a polymeric material such as polyesters, polyamino acids, hydrogels, 
poly(lactic acid) or ethylene vinylacetate copolymers. Alternatively, instead of 
incorporating these agents into polymeric particles, it is possible to entrap these 
materials in microcapsules prepared, for example, by coacervation techniques or by 

35 interfacial polymerization with, for example, hydroxymethylcellulose or gelatine- 
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microcapsules and poly(methylmethacylate) microcapsules, respectively, or in 
colloidal drug delivery systems, for example, liposomes, albumin microspheres, 
microemulsions, nanoparticles, and nanocapsules or in macroemulsions. Such 
techniques are disclosed in REMINGTON'S PHARMACEUTICAL SCIENCES 
5 (1980). 

The invention further provides a pharmaceutical pack or kit comprising one 
or more containers filled wftli one ormore of the ingredients of the pharmaceutical 
compositions of the invention. Associated with such containers) can be a notice in 
the form prescribed by a governmental agency regulating the manufacture, use or 
to sale of pharmaceuticals or biological products, which notice reflects approval by 
the agency of manufacture, use or sale for human administration. 

In addition, the agents of the present invention may be employed in 
conjunction with other therapeutic compounds. 

15 6. Shot-Gun Approach to Megabase DNA Sequencing 

The present invention further demonstrates that a large sequence can be 
sequenced using a random shotgun approach. This procedure, described in detail 
in the examples that follow, has eliminated the up front cost of isolating and 
ordering overlapping or contiguous subclones prior to the start of the sequencing 
20 protocols. 

Certain aspects of the present invention are described in greater detail in the 
examples that follow. The examples are provided by way of illustration. Other 
aspects and embodiments of the present invention are contemplated by the 
inventors, as will be clear to those of skill in the art from reading the present 
25 disclosure. 

ILLUSTRATIVE EXAMPLES 

LIBRARIES AND SEQUENCING 
30 1. Shotgun Sequencing Probability Analysis 

The overall strategy for a shotgun approach to whole genome sequencing 

follows from the Lander and Waterman (Landerman and Waterman, Genomics 

2:231 (1988)) application of the equation for the Poisson distribution. According 

to this treatment, the probability, P , that any given base in a sequence of size L, in 

35 nucleotides, is not sequenced after a certain amount, n, in nucleotides, of random 

0 
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sequence has been determined can be calculated by the equation P = e' m , where m 
is L/n, the fold coverage. For instance, for a genome of 2.8 Mb, m=l when 2.8 
Mb of sequence has been randomly generated (IX coverage). APthat point, P = 
e" 1 = 0.37. The probability that any given base has not been sequenced is the same 
5 as the probability that any region of the whole sequence L has not been determined 
and, therefore, is equivalent to the fraction of the whole sequence that has yet to be 
determined. Thus, at one-fold coverage, approximately 37% of a polynucleotide of 
size L, in nucleotides has not been sequenced. When 14 Mb of sequence has been 
generated, coverage is 5X for a 2.8 Mb and the unsequenced fraction drops to 

10 .0067 or 0.67%. 5X coverage of a 2.8 Mb sequence can be attained by sequencing 
approximately 17,000 random clones from both insert ends with an average 
sequence read length of 4 10 bp. 

Similarly, the total gap length, G, is determined by the equation G = Le" m . 
and the average gap size, g, follows the equation, g = L/n. Thus, 5X coverage 

(5 leaves about 240 gaps averaging about 82 bp in size in a sequence of a 
polynucleotide 2.8 Mb long. 

The treatment above is essentially that of Lander and Waterman, Genomics 
2: 231 (1988). 

20 2. Random Library Construction 

In order to approximate the random model described above during actual 
sequencing, a nearly ideal library of cloned genomic fragments is required. The 
following library construction procedure was developed to achieve this end. 

Streptococcus pneumoniae DNA is prepared by phenol extraction. A 

25 mixture containing 200 |ig DNA in 1.0 ml of 300 mM sodium acetate, 10 mM Tris- 
HC1, I mM Na-EDTA, 50% glycerol is processed through a nebulizer (IPI Medical 
Products) with a stream of nitrogen adjusted to 35 Kpa for 2 minutes. The 
sonicated DNA is ethanol precipitated and redissolved in 500 ^1 TE buffer. 

To create blunt-ends, a 100 |il aliquot of the resuspended DNA is digested 

30 with 5 units of BAL31 nuclease (New England BioLabs) for 10 min at 30°C in 200 
(il BAL31 buffer. The digested DNA is phenol-extracted, ethanol-prectpitated, 
redissolved in 100 \i\ TE buffer, and then size-fractionated by electrophoresis 
through a 1.0% low melting temperature agarose gel. The section containing DNA 
fragments 1.6-2.0 kb in size is excised from the gel, and the LGT agarose is melted 

35 and the resulting solution is extracted with phenol to separate the agarose from the 



WO 98/18931 



43 



PCT/US97/19588 



DNA. DNA is ethanol precipitated and redissolved in 20 \xl of TE buffer for 
ligation to vector. 

A two-step ligation procedure is used to produce a plasmid library with 
97% inserts, of which >99% were single inserts. The first ligation mixture (50 ul) 
5 contains 2 jug of DNA fragments, 2 ng pUCi8 DNA (Pharmacia) cut with Smal 
and dephosphorylated with bacterial alkaline phosphatase, and 10 units of T4 ligase 
IGB'CO/BRLno^"^ 

phenol extracted and ethanoi precipitated, and the precipitated DNA is dissolved in 
20 n.1 TE buffer and electrophoresed on a 1.0% low melting agarose gel. Discrete 

10 bands in a ladder are visualized by ethidium bromide-staining and UV illumination 
and identified by size as insert (I) ( vector (v), v+l, v+2i, v+3i, etc. The portion of 
the gel containing v+I DNA is excised and the v+I DNA is recovered and 
resuspended into 20 |il TE. The v+I DNA then is blunt-ended by T4 polymerase 
treatment for 5 min. at 37°C in a reaction mixture (50 ul) containing the v+I linears, 

15 500 \iM each of the 4 dNTPs, and 9 units of T4 polymerase (New England 
BioLabs). under recommended buffer conditions. After phenol extraction and 
ethanol precipitation the repaired v+I linears are dissolved in 20 pi TE. The final 
ligation to produce circles is carried out in a 50 (il reaction containing 5 jil of v+I 
linears and 5 units of T4 ligase at 14°C overnight. After 10 min. at 70°C the 

20 following day, the reaction mixture is stored at -20°C. 

This two-stage procedure results in a molecularly random collection of 
single-insert plasmid recombinants with minimal contamination from double-insert 
chimeras (<1%) or free vector (<3%). 

Since deviation from randomness can arise from propagation the DNA in 

25 the host, E, coli host cells deficient in all recombination and restriction functions 
(A. Greener, Strategies 3 (1):5 (1990)) are used to prevent rearrangements, 
deletions, and loss of clones by restriction. Furthermore, transformed cells are 
plated directly on antibiotic diffusion plates to avoid the usual broth recovery phase 
which allows multiplication and selection of the most rapidly growing cells. 

30 Plating is carried out as follows. A 100 \il aliquot of Epicurian Coli SURE 

II Supercompetent Cells (Stratagene 200152) is thawed on ice and transferred to a 
chilled Falcon 2059 tube on ice. A 1.7 \il aliquot of 1.42 M beta-mercaptoethanol 
is added to the aliquot of cells to a final concentration of 25 rnM. Cells are 
incubated on ice for 10 min. A I jil aliquot of the final ligation is added to the cells 

35 and incubated on ice for 30 min. The cells are heat pulsed for 30 sec. at 42°C and 
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placed back on ice for 2 min. The outgrowth p eriod in liquid culture is eliminated 
from this protocol in order to minimize the preferential growth of any given 
transformed cell. Instead the transformation mixture is plated directly on a nutrient 
rich SOB plate containing a 5 ml bottom layer of SOB agar (5% SOB agar: 20 g 
5 tryptone, 5 g yeast extract, 0.5 g NaCl, 1.5% Difco Agar per liter of media). The 5 
ml bottom layer is supplemented with 0.4 ml of 50 mg/ml ampicillin per 100 ml 
SOB agar. The 15 ml top layer of SOB agar is supplemented with 1 ml X-Gal 
(2%), I ml MgCi ( I M), and 1 ml MgSO /1 00 ml SOB agar. The 15 ml top layer 
is poured just prior to plating. Our titer is approximately 100 colonies/10 |il aliquot 
10 of transformation? 

All colonies are picked for template preparation regardless of size. Thus, 
only clones lost due to "poison" DNA or deleterious gene products are deleted from 
the library, resulting in a slight increase in gap number over that expected. 

t5 3. Random DNA Sequencing 

High quality double stranded DNA plasmid templates are prepared using a 
"boiling bead" method developed in collaboration with Advanced Genetic 
Technology Corp. (Gaithersburg, MD) (Adams et a/,. Science 252:1651 (1991); 
Adams etai. Nature 355:632 (1992)). Plasmid preparation is performed in a 96- 

20 well format for all stages of DNA preparation from bacterial growth through final 
DNA purification. Template concentration is determined using Hoechst Dye and a 
Millipore Cytofluor. DNA concentrations are not adjusted, but low-yielding 
templates are identified where possible and not sequenced. 

Templates are also prepared from two Streptococcus pneumoniae lambda 

25 genomic libraries. An amplified library is constructed in the vector Lambda GEM- 
12 (Promega) and an unampiified library is constructed in Lambda DASH II 
(Stratagene). In particular, for the unampiified lambda library, Streptococcus 
pneumoniae DNA (> 100 kb) is partially digested in a reaction mixture (200 ul) 
containing 50 ng DNA, IX Sau3AI buffer, 20 units Sau3AI for 6 min. at 23°C 

30 The digested DNA was phenol-extracted and electrophoresed on a 0.5% low 
melting agarose gel at 2V/cm for 7 hours. Fragments from 15 to 25 kb are excised 
and recovered in a final volume of 6 ul. One ul of fragments is used with 1 (J.1 of 
DASHII vector (Stratagene) in the recommended ligation reaction. One ^1 of the 
ligation mixture is used per packaging reaction following the recommended 

35 protocol with the Gigapack II XL Packaging Extract (Stratagene. #22771 1). Phage 
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are plated directly without amplification from the packaging mixture (after dilution 
with 500 \il of recommended SM buffer and chloroform treatment). Yield is about 
2.5xI(P pfu/ul. The amplified library is prepared essentially as above except the 
lambda GEM- 12 vector is used. After packaging, about 3.5xl0 4 pfu are plated on 
5 the restrictive NM539 host. The lysate is harvested in 2 ml of SM buffer and 
stored frozen in 7% dimethylsulfoxide. The phage titer is approximately IxIO 9 
pfu/ml. ----- - 

Liquid lysates (100 ^1) are prepared from randomly selected plaques (from 
the unamplified library) and template is prepared by long-range PCR using T7 and 

10 T3 vector-specific primers. 

Sequencing reactions are carried out on plasmid and/or PCR templates 
using the AB Catalyst LabStation with Applied Biosystems PRISM Ready 
Reaction Dye Primer Cycle Sequencing Kits for the M13 forward (Ml 3-21) and 
the M13 reverse (M13RP1) primers (Adams et ai. Nature 368:414 (1994)). Dye 

15 terminator sequencing reactions are carried out on the lambda templates on a 
Perkin-Elmer 9600 Thermocycler using the Applied Biosystems Ready Reaction 
Dye Terminator Cycle Sequencing kits. T7 and SP6 primers are used to sequence 
the ends of the inserts from the Lambda GEM- 12 library and T7 and T3 primers are 
used to sequence the ends of the inserts from the Lambda DASH II library. 

20 Sequencing reactions are performed by eight individuals using an average of 
fourteen AB 373 DNA Sequencers per day. All sequencing reactions are analyzed 
using the Stretch modification of the AB 373. primarily using a 34 cm well-to-read 
distance. The overall sequencing success rate very approximately is about 85% for 
Ml 3-21 and M13RP1 sequences and 65% for dye-terminator reactions. The 

25 average usable read length is 485 bp for Ml 3-21 sequences. 445bp for M13RP1 
sequences, and 375 bp for dye-terminator reactions. 

Richards et aL Chapter 28 in AUTOMATED DNA SEQUENCING AND 
ANALYSIS. M. D. Adams, C. Fields, J. C. Venter. Eds., Academic Press. 
London. (1994) described the value of using sequence from both ends of 

30 sequencing templates to facilitate ordering of contigs in shotgun assembly projects 
of lambda and cosmid clones. We balance the desirability of both-end sequencing 
(including the reduced cost of lower total number of templates) against shorter 
read-iengths for sequencing reactions performed with the M13RP1 (reverse) primer 
compared to the M13-21 (forward) primer. Approximately one-half of the 

35 templates are sequenced from both ends. Random reverse sequencing reactions are 
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done based on successful forward sequencing reactions. Some M13RP1 
sequences are obtained in a semi-directed fashion: Ml 3-21: sequences pointing 
outward at the ends of contigs are chosen for M13RP1 sequencing in an effort to 
specifically order contigs. 

5 

4. Protocol for Automated Cycle Sequencing 

The sequencing is carried out using ABI Catalyst robots and AB 373 
Automated DNA Sequencers. The Catalyst robot is a publicly available 
sophisticated pipetting and temperature control robot which has been developed 

10 specifically for DNA sequencing reactions. The Catalyst combines pre-aliquoted 
templates and reaction mixes consisting of deoxy- and dideoxynucleotides, the 
thermostable Taq DNA polymerase, fluorescentiy-labelled sequencing primers, and 
reaction buffer. Reaction mixes and templates are combined in the wells of an 
aluminum 96-well thermocycling plate. Thirty consecutive cycles of linear 

15 amplification (i.e.., one primer synthesis) steps are performed including 
denaturation, annealing of primer and template, and extension; i.e., DNA 
synthesis. A heated lid with rubber gaskets on the thermocycling plate prevents 
evaporation without the need for an oil overlay. 

Two sequencing protocols are used: one for dye-labelled primers and a 

20 second for dye-labelled dideoxy chain terminators. The shotgun sequencing 
involves use of four dye-labelled sequencing primers, one for each of the four 
terminator nucleotide. Each dye-primer is labelled with a different fluorescent dye. 
permitting the four individual reactions to be combined into one lane of the 373 
DNA Sequencer for electrophoresis, detection, and base -calling. ABI currently 

25 supplies pre-mixed reaction mixes in bulk packages containing all the necessary 
non-template reagents for sequencing. Sequencing can be done with both plasmid 
and PCR- generated templates with both dye-primers and dye- terminators with 
approximately equal fidelity, although plasmid templates generally give longer 
usable sequences. 

30 Thirty-two reactions are loaded per AB373 Sequencer each day, for a total 

of 960 samples. Electrophoresis is run overnight following the manufacturer's 
protocols, and the data is collected for twelve hours. Following electrophoresis 
and fluorescence detection, the ABI 373 performs automatic lane tracking and base- 
calling. The lane-tracking is confirmed visually. Each sequence electropherogram 

35 (or fluorescence lane trace) is inspected visually and assessed for quality. Trailing 
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sequences of low quality are removed and the sequence itself is loaded via software 
to a Sybase database (archived daily to 8mm tape). Leading vector polylinker 
sequence is removed automatically by a software program. Average edited lengths 
of sequences from the standard ABI 373 are around 400 bp and depend mostly on 
5 the quality of the template used for the sequencing reaction. ABI 373 Sequencers 
converted to Stretch Liners provide a longer electrophoresis path prior to 
— fluorescence detection and increase the average-number ofusable bases to-50O=6OO- — 
bp. 

io INFORiMATICS 

1. Data Management 

A number of information management systems for a large-scale sequencing 
lab have been developed. (For review see, for instance, Kerlavage et aL. 
Proceedings of the Twenty-Sixth Annual Hawaii International Conference on 

15 System Sciences. IEEE Computer Society Press, Washington D. C, 585 (1993)) 
The system used to collect and assemble the sequence data was developed using the 
Sybase relational database management system and was designed to automate data 
flow wherever possible and to reduce user error. The database stores and 
correlates all information collected during the entire operation from template 

:o preparation to final analysis of the genome. Because the raw output of the ABI 373 
Sequencers was based on a Macintosh platform and the data management system 
chosen was based on a Unix platform, it was necessary to design and implement a 
variety of multi- user, client-server applications which allow the raw data as well as 
analysis results to flow seamlessly into the database with a minimum of user effort. 

25 

2. Assembly 

An assembly engine (TIGR Assembler) developed for the rapid and 
accurate assembly of thousands of sequence fragments was employed to generate 
contigs. The TIGR assembler simultaneously clusters and assembles fragments of 

30 the genome. In order to obtain the speed necessary to assemble more than I0 4 
fragments, the algorithm builds a hash table of 12 bp oligonucleotide subsequences 
to generate a list of potential sequence fragment overlaps. The number of potential 
overlaps for each fragment determines which fragments are likely to fail into 
repetitive elements. Beginning with a single seed sequence fragment, TIGR 

35 Assembler extends the current contig by attempting to add the best matching 
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fragment based on oligonucleotide content. The contig and candidate fragment are 
aligned using a modified version of the Smith-Waterman algorithm which provides 
for optimal gapped alignments (Waterman, M. S., Methods in Enzymology 
164:165 (1988)). The contig is extended by the fragment only if strict criteria for 
5 the quality of the match are met. The match criteria-include the minimum length of 
overlap, the maximum length of an unmatched end. and the minimum percentage 
match. These criteria are automatically lowered by the algorithm in regions of 
minimal coverage and raised in regions with a possible repetitive element. The 
number of potential overlaps for each fragment determines which fragments are 

io likely to fall into repetitive elements. Fragments representing the boundaries of 
repetitive elements and potentially chimeric fragments are often rejected based on 
partial mismatches at the ends of alignments and excluded from the current contig. 
TIGR Assembler is designed to take advantage of clone size information coupled 
with sequencing from both ends of each template. It enforces the constraint that 

15 sequence fragments from two ends of the same template point toward one another 
in the contig and are located within a certain range of base pairs (definable for each 
clone based on the known clone size range for a given library). 

The process resulted in 391 contigs as represented by SEQ ID NOs: 1-391. 

-0 3. Identifying Genes 

The predicted coding regions of the Streptococcus pneumoniae genome 
were initially defined with the program GeneMark, which finds ORFs using a 
probabilistic classification technique. The predicted coding region .sequences were 
used in searches against a database of all nucleotide sequences from GenBank 

25 (October, 1997). using the BLASTN search method to identify overlaps of 50 or 
more nucleotides with at least a 95% identity. Those ORFs with nucleotide 
sequence matches are shown in Table 1. The ORFs without such matches were 
translated to protein sequences and compared to a non-redundant database of 
known proteins generated by combining the Swiss-prot, PIR and GenPept 

30 databases. ORFs that matched a database protein with BLASTP probability less 
than or equal to 0.01 are shown in Table 2. The table also lists assigned functions 
based on the closest match in the databases. ORFs that did not match protein or 
nucleotide sequences in the databases at these levels are shown in Table 3. 
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ILLUSTRATIVE APPLICATIONS 

1. Production of an Antibody to a Streptococcus pneumoniae 
Protein 

Substantially pure protein or polypeptide is isolated from the transfected or 
5 transformed cells using any one of the methods known in the art. The protein can 
also be produced in a recombinant prokaryotic expression system, such as E. coli, 
or can be chemically"synthesized. Concentration of~protein~in the final preparation 
is adjusted, for example, by concentration on an Amicon filter device, to the level 
of a few micrograms/ml. Monoclonal or polyclonal antibody to the protein can 
10 then be prepared as follows. 

2. Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of any of the peptides identified and 
isolated as described can be prepared from murine hybridomas according to the 

15 classical method of Kohler, G. and Milstein, C, Nature 256:495 (1975) or 
modifications of the methods thereof. Briefly, a mouse is repetitively inoculated 
with a few micrograms of the selected protein over a period of a few weeks. The 
mouse is then sacrificed, and the antibody producing cells of the spleen isolated. 
The spleen cells are fused by means of polyethylene glycol with mouse myeloma 

20 cells, and the excess unfused cells destroyed by growth of the system on selective 
media comprising aminopterin (HAT media). The successfully fused cells are 
diluted and aliquots of the dilution placed in wells of a microliter plate where 
growth of the culture is continued. Antibody-producing clones are identified by 
detection of antibody in the supernatant fluid of the wells by immunoassay 

25 procedures, such as ELISA, as originally described by Engvall, E., Meth. 
Enzymoi 70:419 (1980), and modified methods thereof. Selected positive clones 
can be expanded and their monoclonal antibody product harvested for use. Detailed 
procedures for monoclonal antibody production are described in Davis, L. et ai t 
Basic Methods in Molecular Biology, Elsevier, New York. Section 21-2 (1989). 

30 
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3. Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogenous epitopes of a 
single protein can be prepared by immunizing suitable animals with the expressed 
protein described above, which can be unmodified or modified to enhance 
5 immunogenicity. Effective polyclonal antibody production is affected by many 
factors related both to the antigen and the host species. For example, small 
molecules tend to be less immunogenic than others and may require the use of 
carriers and adjuvant. Also, host animals vary in response to site of inoculations 
and dose t with both inadequate or excessive doses of antigen resulting in low titer 
i() antisera. Small doses (ng level) of antigen administered at multiple intradermal 
sites appears to be most reliable. An effective immunization protocol for rabbits 
can be found in Vaitukaitis, J. et «/., J, Clin. Endocrinol. Metab. 33:988-991 
(1971). 

Booster injections can be given at regular intervals, and antiserum harvested 

15 when antibody titer thereof, as determined semi-quantitatively, for example, by 
double immunodiffusion in agar against known concentrations of the antigen, 
begins to fall. See. for example. Ouchterlony, O. et ai. Chap. 19 in: Handbook of 
Experimental Immunology, Wier, D.. ed, Blackwell (1973). Plateau concentration 
of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum (about 12M). 

20 Affinity of the antisera for the antigen is determined by preparing competitive 
binding curves, as described, for example, by Fisher, D., Chap. 42 in: Manual of 
Clinical Immunology, second edition. Rose and Friedman, eds., Amer. Soc. For 
Microbiology, Washington. D. C. (1980) 

Antibody preparations prepared according to either protocol are useful in 

25 quantitative immunoassays which determine concentrations of antigen-bearing 
substances in biological samples; they are also used semi- quantitatively or 
qualitatively to identify the presence of antigen in a biological sample. In addition, 
antibodies are useful in various animal models of pneumococcal disease as a means 
of evaluating the protein used to make the antibody as a potential vaccine target or 

30 as a means of evaluating the antibody as a potential immunotherapeutic or 
immunoprophylactic reagent. 
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4. Preparation of PCR Primers and Amplification of DNA 

Various fragments of the Streptococcus pneumoniae genome, such as those 
of Tables 1-3 andSEQ [DNOS: 1-391 can be used, in accordance with the present 
invention, to prepare PCR primers for a variety of uses. The PCR primers arc 
5 preferably at least 15 bases, and more preferably at least 18 bases in length. When 
selecting a primer sequence, it is preferred that the primer pairs have approximately 
the same G/C ratio, so that melting temperatures are approximately the same. The 
PCR primers and amplified DNA of this Example find use in the Examples that 
follow. 

10 

5, Gene expression from DNA Sequences Corresponding to 

ORFs 

A fragment of the Streptococcus pneumoniae genome provided in Tables I - 
3 is introduced into an expression vector using conventional technology. 

15 Techniques to transfer cloned sequences into expression vectors that direct protein 
translation in mammalian, yeast, insect or bacterial expression systems are well 
known in the art. Commercially available vectors and expression systems are 
available from a variety of suppliers including Stratagene (La Jolla, California), 
Promega (Madison, Wisconsin), and Invitrogen (San Diego, California). If 

20 desired, to enhance expression and facilitate proper protein folding, the codon 
context and codon pairing of the sequence may be optimized for the particular 
expression organism, as explained by Hatfield et «/., U. S. Patent No. 5,082.767. 
incorporated herein by this reference. 
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The following is provided as one exemplary method to generate 
polypeptide(s) from cloned ORFs of the Streptococcus pneumoniae genome 
fragment. Bacterial ORFs generally lack a poly A addition signal. The addition 
signal sequence can be added to the construct by. for example, splicing out the poly 
5 A addition sequence from pSG5 (Stratagene) -using Bgll and Sail restriction 
endonuclease enzymes and incorporating it into the mammalian expression vector 
pXTl (Stratagene) for use in eukaryotic expression systems. pXTl contains the 
LTRs and a portion of the gag gene of Moloney Murine Leukemia Virus. The 
positions of the LTRs in the construct allow efficient stable transfection. The 

io vector includes the Herpes Simplex thymidine kinase promoter and the selectable 
neomycin gene. The Streptococcus pneumoniae DNA is obtained by PCR from the 
bacterial vector using oligonucleotide primers complementary to the Streptococcus 
pneumoniae DNA and containing restriction endonuclease sequences for Pstl 
incorporated into the 5' primer and BgUI at the 5* end of the corresponding 

15 Streptococcus pneumoniae DNA 3' primer, taking care to ensure that the 
Streptococcus pneumoniae DNA is positioned such that its followed with the poly 
A addition sequence. The purified fragment obtained from the resulting PCR 
reaction is digested with PstL blunt ended with an exonuclcasc, digested with 
BgllL purified and ligated to pXTl, now containing a poly A addition sequence 

20 and digested BgUI. 

The ligated product is transfected into mouse NIH 3T3 cells using 
Lipofectin (Life Technologies. Inc., Grand Island. New York) under conditions 
outlined in the product specification. Positive transfectants are selected after 
growing the transfected cells in 600 ug/ml G4I8 (Sigma, St. Louis, Missouri). 

25 The protein is preferably released into the supernatant. However if the protein has 
membrane binding domains, the protein may additionally be retained within the cell 
or expression may be restricted to the ceil surface. Since it may be necessary to 
purify and locate the transfected product, synthetic 15-mer peptides synthesized 
from the predicted Streptococcus pneumoniae DNA sequence are injected into mice 

30 to generate antibody to the polypeptide encoded by the Streptococcus pneumoniae 
DNA. 
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Alternatively and if antibody production 4s-Rot possible, the Streptococcus 
pneumoniae DNA sequence is additionally incorporated into eukaryotic expression 
vectors and expressed as, for example, a globin fusion. Antibody to the globin 
moiety then is used to purify the chimeric protein. Corresponding protease 
5 cleavage sites are engineered between the globin moiety and the polypeptide 
encoded by the Streptococcus pneumoniae DNA so that the latter may be freed 
from the formed by simple protease digestion. One useful expression vector for 
generating globin chimerics is pSG5 (Stratagene). This vector encodes a rabbit 
globin. Intron II of the rabbit globin gene facilitates splicing of the expressed 

10 transcript, and the polyadenylation signal incorporated into the construct increases 
the level of expression. These techniques are well known to those skilled in the art 
of molecular biology. Standard methods are published in methods texts such as 
Davis et a/., cited elsewhere herein, and many of the methods are available from the 
technical assistance representatives from Stratagene, Life Technologies, Inc., or 

15 Promega. Polypeptides of the invention also may be produced using in vitro 
translation systems such as in vitro ExpressTM Translation Kit (Stratagene). 

While the present invention has been described in some detail for purposes 
of clarity and understanding, one skilled in the art will appreciate that various 
changes in form and detail can be made without departing from the true scope of 

20 the invention. 

All patents, patent applications and publications referred to above are 
hereby incorporated by reference. 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: Charles Kunsch 

Gil H. Choi 
Pacrick S. Dillon 
Craig A. Rosen 
Sceven C. Barash 
Michael R - Fanxion 
Brian A. Dougherty 

Ui) TITLE OF INVENTION : Streptococcus pneumoniae Polynucleocides and Sequences 
(I ii) NUMBER OF SEQUENCES : 391 

(iv) CORRESPONDENCE ADDRESS : 

< A> ADDRESSEE: Human Genome Sciences. Inc. 

(B) STREET: 9410 Key West Avenue 
:C) CITY: Ro.clcville 

(D) STATE: Maryland 
(£! COUNTRY: USA 
(?) 21?: 20850 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Oisitette. 3.50 inch. 1.4Mb storage 

(3) COMPUTER: HP Vectra 436/33 

(C) OPERATING SYSTEM: MSDOS version 6.2 

(D) SOFTWARE: ASCII Text 
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(A) APPLICATION NUMBER: 

(B) FILING DATE: 
(C> CLASSIFICATION: 

(viii PRIOR APPLICATION DATA: 
(A) APPLICATION NUMBER : 
(3) FILING DATE? 

(viii) ATTORNEY/ AGENT INFORMATION: 
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(Bl REGISTRATION NUMBER: 36.371 

(C) REFERENCE/ DOCKET NUMBER : PB340PI 



(vi) TELECOMMUNICATION INFORMATION 
(A) TELEPHONE: (301) 309-8504 
{ B) TELEFAX: (301) 309-8512 
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(2) INFORMATION FOR SEQ ID NO: 1: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5625 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xii SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CCAAGCAAAA CC AGCTAC AG CTAAAGGAAC TTACGTAACA AACTTGACTA TCACAACTAC 60 

TCAAGGTGTT GGTATCAAAG TTGACGTAAA CTCACTTTAA TCAGTAGTTA AAGTAATGTA 120 

AAAAACTTGA AGACGCTATG TCTCAACTTT TTTTGATGTA CGACGGGCAT CTTGTATAGT 180 

AGATGTGTAC TATTCTAGTT TCAATCTACT ATAGTAGCTC AGAAGTCGGT ACTTAAACGT 240 

GCTATATCAA AACCAGTCCT TGAAAAACGT GGACTGGTTT CGTCTTTGGA TTATTACCTT 300 

GAACGACATG CGTTAAAAGT TAGTTGAACC GCCGTATGCC GAACGGACGT ACGGTGGTGT 360 

GAGAGGGGCT AGAGATTATC CCCTACTCGA TTTCGAAATC TAGTGGAATG AATCTGGAAT 420 

AGTCCATCGA GCTTTCTAAT ACTCTTCGAA AATCTCTTCA AACCACGTCA ACGTCGCCTT 4 80 

GCCGTGCGTA TGGTTACTGA CTTCGTCAGT TCTATCCACA ACCTCAAAAC AGTGTTTTGA 540 

GCTGACTACG TCACTTCCAT CTACAACCTC AAAACAGTGT TTTGAGCAAC CTGCGGCTAG 600 

TTTCCTAGTT TGCTCTTTGG TTTTCATTGA GTATAACACA TTGTTAGAAG TTG G TTTAAA 660 

TTTCCTAATC AGTTTGTTCA CATTTACCTT CGATATATTA TATCCCATAG TTAAGGTTGG 720 

TCATACAGAT GATTATAGTC ATGGAGCCGT AAAACTTAGT GTTTCTTTAG TTGACAAAGA 780 

TGCCATGAAA AAAATATTTG TAACTGTAAT AGCATATTTT GAAATAAATA TAGATGAAAA 84 0 

TATCACCGAT ATTCTATACG TAAATGGTAC TGCTATTCTT TATCTTTATT TACGTTCAAT 900 

TGTTTCAATA GTTTCGGCAA TTGATAGCAG TGAAGCAATG TTGCTACCTA TCATTAATGT 960 

TTTAGAGTTA CTAGATAAAT CTCAACCTTT TGAAGAAGAA TAATTTATTA GCTCACTAAA 1020 

TTGAGGGTAA GGAAAAGTAA AAGCAGTAAG AAAAATGTCT TGCATTATAC AGCAACCTTT 1080 

TGGGAATGAG TGGATGGATT GAATAAAATT TGATTAACAG TGGATGATTT ATCTGTAGAT 1140 

TATTATTGGA CACTTACTCT TGAAGTAGTC TAAGAATTAG GTTATAATCA GTAGAAGCCT 1200 

TGCTAATAAT GAGGAGGTTA GTTTATGTAT AGTAGACTCA ATCTAAAATA GTACGAAACA 1260 

ATTGCTAAAA CATTTATAGA AATTAATTTT ACTTTCCCAA TCGATTTGTT CTCATCTTAT 1320 

TTCAATCCGC TATATATTAT GGTATCGAAT CTTCATCAGA ATGATAAAAT TAATCAATTG 1380 

ATATCTGATT ACAAACAGAA TATGAAAGCT TTTTATATCA CTATTGAAAA ATTTATACGA 1440 
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GATGATGAAA 


GCCTTAAGTG 


TTATTTTATA 


AAGGTT ATTT 
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ATTTCCCTTA 


AACTTTGGGA 


TAAGGGCTTA 


4620 


GATTCCTACG 


AAAGTGCCGG 


TTCGTAAAAG 


CTAAAGTAAG 


ACTAAAATCA 


AGAAAGAAAC 


4680 


TTATGATGTT 


TGTAATTGAA 


GAAGTCAAGG 


ATGAAAATCA 


AAAAAAGGCA 


GTTGTCGCTG 


4740 


AGGTTTTGAA 


GGATTTGCCA 


GAATCGTTTG 


GAATCCCAGA 


AAGCACACAA 


GCCTATATAG 


4800 


AAGGAACCAC 


GACACTGCAA 


GTTTGGACCG 


CCTATCAGGA 


GAGTGATTTG 


ACTAGATTTG 


4860 


TAAGCTTATC 


CTATTCGAGT 


GAAGATTGTG 


CAGAGATTGA 


TTGTCTCGGC 


GTAAAAAAGC 


4920 


TTATCAAGGT 


AGAAAAATTG 


GGAGCCAATT 


GCTTGCTACT 


TTAGAGAGTG 


AAGCTCGTAA 


4980 
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AAAAGTTGCT TATCTGCAGG TCAAAACAGT GGCAGAAGCT TCTAATAAAG ATTATGATCG 5040 

AACAAATGAC TTTTATCGAG GTCTTGGCTT TAAAAAGTTA CACATTTTTC CTCAACTATG 5100 

GAATCCGCAA AATCCTTGTC AGATTTTGAT TAAAAAGCTT GAATAATATT ACTTGACATC 5160 

TATTCTCACA GTGCTATACT GTAAGTGTAA TCGCCGATTT AGCTTAGTTG GTAGAGCAAC 5220 

GCACTCGTAA AGCCTAGGTT ATAGGTAGAT AAACGACTGA GCATTTGAAA AAATAGATAG 52 80 

GTAGAAGATA ACCCTTAAGC CTTACTCTTA GCGCTTATTT ATATTCTTTA ATACCGCTAA" 5340 

TATTTTATCA ATTATGCCTG TTTTCGTGTT TCTGGTAGTT GTTCAAGTT? ATTGCTACTA 54 00 

TTTTTGATGG TATCAATGTG CTTATAATGT ATCCCGGTTA ACGAAAGTTT TGGACTTATA 54 60 

CTCTTCGAAA ATCTCTTCAA ACCACGTCAA CGTCCCCTTG CCGTGCGTAT GGTTATGACT 5520 

TCGTCACTTC TATCCACAAC CTCAAAACAG TCTTTTGACT GACTACCTCA GTTCCATCTA 5580 

CAACCTCAAA ACACTGTTTT GCCCAATCTG CGGCTAGTTT CCTAG 5625 
(2) INFORMATION FOR SEQ ID NO; 2: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7571 base pairs 

(B) TYPE: nucleic acid 
IC> STRANDEDNESS : double 
(O) TOPOLOGY: linear 



fxi} SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CTCTCCAGCT TTCCTTGCGA GTTGGCCATG TTGTGTCTTT AAGAACTCTA AAAATATCTC 60 

CAATAAAACG CATCGCTCTC TCCTATCTCG TTTCTCTGTG TGTAGTGTAC TTGCCACAAT 120 

GCTTACAAAA TTTATTTACT TCTAGTCGTG TAGGCTTGAG GTTTCCGCTG ATCTTGATTG 180 

AATAGTTTCT CGAACCACAA ACCGCACAAG CTAGCCTTGC TTTTTTTACT GCCATAACGC 2 40 

CTCCATCTTA TCCATTATAA CAAGAAACCT AGGCTTTGAC AAGCATCTTA GCGAAATAGA 300 

TTGACTATCG AATCCCATAT TGTTTGAGCC TTTTCCTTAA TCTTCGCATC TGAGATAGCC 360 

CGGCTAGCCT CATCTACTAG ACTTTGCGCA CGCCCTCGAA TATCACACAA ATTATCATCT 4 20 

GTCTGGCTAT TATCATTGGT TTGTACTTGT CTTTTTGTAT TGGCTGGTGC AATTCCATTT 4 80 

TGCTTATAAG CATTTTCAAC CGTAAAGGTA CTTCCTGGCG TATAAGGTAA AATGGTATTG 54 0 

GCAATGTTTC TAAAGACATG AGCTGCACCG TTTGAAGTAG AGCCAGCTAG ATAGTGGTTT 600 

TCATCAGTGG TCGGAAAGCC AAGCCAGTGC CTAATCACTA CATCCGGAGT ATAACCAATT 660 

ACCCACTGGT CACTTGTGTA CTCCGGATTG AAAACTGCTT CAGTTGTTCC AGTTTTCCCT 7 20 
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GCCATGACAT 


AGTCTGCAGG 


CGATGAACTA 


ATACCGGTAC 


CGTTGGTGAA 


AGTCCCCAAC 


730 


ATCATACTGG 


TCATCTTGTC 


AGCTACAGAC 


TTATCAATCA 


CCCGTTTTTG 


TGAATTTTTA 


340 


TGACTCGCAA 


TAACTTGTCC 


ACTAGCATTT 


TCAATTCTAC 


TAATAAAATG 


AGCTTCAGCC 


900 


ATTAAACCTT 


CATTTGCAAA 


GGCGGCGTAT 


GCTTGAGCCA 


TTTGAAGAGG 


GTTGGTTTCA 


960 


ACACCGCTTC 


CCAAGGCGAC 


ACCAAGAACA 


CGGTCGACCT 


TTTCCATGTT 


GAGTCCGAAT 


1020 


TTTTCGCCTG 


CCTCAAAAGC 


CTTGTCGACA CCCAAATCAT TAACAGTGCC AACAGCAGGT 


1080 


AGATTAAGCG 


ATTCTGCCAA 


GGCTTGATAC 


ATAGGAACTT 


CTCGACTCGT 


TTTGATCCCT 


1140 


GCATAGTTAT 


CAACCTTATA 


GCTGTCATAC 


TGCATGCTAT 


GGTTATCCAA 


CTGCTTATTC 


1200 


AAAGCCCAGC 


TTGCTTCAAC 


TCCTCGCGTA 


TAAACAACTA 


AAGGCTTAAT 


TGTAGAACCA 


1250 


GGACTACGCT 


TTGATTGGGT 


TGCATAGTTG 


AAATTCCGGA 


ATCCAGTTTT 


ATCATTGTCA 


1320 


GCAACTTGAC 


CGACAACTCC 


ACGAACTCCC 


CCTGTTTTCG 


GTTCCAGGGC 


TACACTTCCT 


1380 


GATTGAGCAA 


ACGTTCCATC 


CTCTGCCCTC 


GGAAATAGCG 


ATGTGTTTTC 


ATAAACAATC 


1440 


TGCATATTTG 


CTTGGTAGTT 


TTGGTCCAGC 


TCTGTGTAAA 


TGCGGTAGCC 


ATTATTGACA 


1500 


ATCTCTTCCT 


CTGTTAGATT 


ATACTTCGAA 


ACACCTTCAT 


TAACCACCGC 


ATCAAAATAA 


1560 


GAGGGGTAAC 


GGTAATCTGA 


GATTTTTCCT 


TCATACTTA? 


CGTGCAATTG 


CGAAGTCATA 


1620 


TCAACTTCAG 


CAGCTTTGGT 


TTCTTGGTTT 


TTATCAATAT 


ATCCTGCTGC 


AACCATATTC 


1680 


TGCAAGACAG 


TATCGCCCCG 


ATTAGTAGAA 


TCTTCTACGG 


AATTCAAGCG 


ATTATACAGT 


1740 


TCCGGCCCC7 


TGACCATCCC 


TGCCAGAGTC 


GCAGCTTGAT 


CCAGACTCAC 


TTCTCATGCA 


1800 


GAAACTCCAA 


AGTATTTCTT 


ACTCGCATCT 


TCTACACCCC 


ACACACCATT 


TCCAAAATAA 


1860 


GCGTTGTTAA 


GGTACATCC? 


TAGAATTTCC 


TCCTTACTAT 


ATTTTTTCCT 


TAATTCTAAG 


192C 


GCAAGGAAAA 


ATTCTTTCGC 


TTTTCTCTCA 


ACAGTTTGAT 


CCTGCGATAA 


ATAGGCGTTT 


1930 


TTAGCCAGCT 


GTTGGGTAAT 


GGTAGAGCCA 


CCACCTGAAC 


GTCCAGCAGT 


GACAATACCC 


2040 


AAGAAAAAAC 


CGCCATAGTT 


AATCCCGTCA 


TTTTTATAGA 


AACAACGGTC 


TTC7GTCGCA 


2100 


ATAACAGCAT 


TCTGCAAGTT 


TTTACTGATG 


TCAGTCAGCT 


CAACATAGGT 


TCCCTTTTGA 


2160 


CCAGACAAGG 


CACCAGCCTC 


TTTTTCTTCA 


CCCTCAAAAA 


TAAGAGTCCG 


AGTTTTCAAG 


2220 


GCATTTTGCA 


AATCATTGAC 


ATTGGTCGAC 


TTGCCTACAG 


CAAACAAATA 


GATTCCAACT 


2280 


AGCAAGCCTG 


CACTCAAACC 


TAGTATAACG 


ATAATCTTTG 


TTAGATGATA 


ACGACGCCAG 


2340 


AATTTTCCAA 


TCGGACCTAC 


TTGGGCTAAT 


TTTTTTCGA? 


CACTACCAGA 


GCGACGTAAG 


2400 


ATAGTAGAAT 


CAGAGTCCTC 


TAGTTCACTT 


GTTTCTTTT7 


TAAAAAGAGA 


AAGAAATTTC 


2460 


TCAAATAATT 


TATCTAATTT 


CATGCGTTTA 


TTTTATCATC 


TTCATCATAG 


GAAGACAAGA 


2520 
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ATTTAGCTAT TTCCTATCCA AATAGGGCTT TTTTTGTTAC AATATCTGTA TGCAATTCAC 2580 

ATTTACATTA CCCGCCTCTC TACCTCAAAT GACAGTAAAG CAATTACTTG AGGAACAACT 264 0 

CCTCATCCCT AGAAAAATCC GTCATTTTTT GAGAATCAAG AAACATAT77 TGATAAATCA 2700 

AGAAGAAG7C CACTGCAAGC AAATCGTAAA TCCTGGAGAT GTTTGCCAGT TGACTTTTGA 27 60 

CGAGGAAGAT TATTCCCAAA AGACGATCCC TTGGGCCAAC CCAGACTTAG TGCAGCAAGT 2820 

TTATCAAGAT_ CAACACTTCA TTATTGTAAA, CAAACCAGAG-CGCATGAAAA CGCATGGTAA- - 2880 

TCAACCAAAC GAAATTGCCC TTCTTAACCA TCTCAGTACC TATGTTGGCC AAACCTGCTA 2940 

7GTCGTTCAT CGTCTGGACA TGGAAACCAG TCCCTTAGTT CTCTTTGCCA AAAATCCTTT 3000 

TATCCTGCCC ATTC7CAATC GCTTATTGGA GAAAAAAGAG ATTTCTAGAC AATATTGGGC 3060 

TCTAGTTGAT GGAAATATCA ACAGAAAAGA ACTTGTTTTC AGAGACAAAA TTGGACGTGA 3120 

TCGCCATGAT CGTAGAAAAA GAATAGTTGA TGCAAAAAAT GGGCAATATG CTGAAACGCA 3190 

TGTAAGCAGA TTAAAGCAAT TCTCAAACAA GACTTCCTTC GCTCATTGCA AGCTAAAGAC 3240 

AGGGCGAACC CATCAGATTC GTGTGCACCT TTCGCATCAT AATCTTCCTA TCCTGGGAGA 3 300 

CCCTCTCTAT AATAGTAAAT CAAAGACAAG CCGGCTTATG CTTCATGCCT TCCGACTTTC 3 3 60 

CTTTACCCAG CCACTTACTT TAGAGAAGCT AACTTTCACT ACCCTTTCAA ATACATTTGA 3 4 20 

AAAAGAATTA AAAAAGAATG GATGATCGTG TCATCCATTT TTCCATATAA AAAAGCAAGA 3480 

CCACAAAGCC TTGCT7TCTA TCAACTCAAG AA7TA7TTAC CAATTTTTGC GAAGTATTCA 3 540 

ACAGTACGAA CAAGTTCTGC AC7C7ATCAC ATTTCGTTGT CGTACCATGA TACAACTTTA 3 50C 

ACCAAT7C7T 7ACCGTCAAC GTCAAGAACT TTAGTTTCAC T7GCC7CAAA CAATGAACCG 3660 

7AAGACATAC CTACGATATC TCAACA7ACC A7TGGATCTT CTC7G7AACC GTATGATTCG 3720 

7TTGAAGCTG CTTTCATACC TGCGT7CACT 7CATCAACAG 7AACGTTC7T 77CAAGAACT 3730 

GC7ACCAA7T CAGTAACTGA TCCAGTTCGA GT7GGAACGC G7TCTGCAGA TCCCTCAAGT 384 0 

T7ACCATTCA ATTCTGGCAT 7ACAAGACCG ATAGCTT7TG CAGCACCAGT 7CAGTTACGA 3 900 

ACGATG7TTG CACCACCAGC GCGAGCACGG CGAAGGTCAC CACCACGGTG 7GG7CCGTCA 3 9 60 

AGGATCATTT GGTCACCAG7 GTAAGCGTGG ATAG7AG7CA TCAATCC7TC AACAACACCA 402 C 

AAGTTGTCTT GAAGAGCTTT AGCCATTGGA GCCAAGCAG7 TTGTAG7ACA TGAAGCACC7 4080 

GAGATAACTG TTTCAGTACC GTCAAGAACG TCGTGGT7AG TGT7GAATAC AAC7GTTTTA 414 0 

ACGTCG777C CACCAGGAGC AGTGATAACA ACT7TT7TAG CTCCACC7T7 AACGTGTTTT 4 200 

TCAGC7GCTT CTTTCTTACC AAAGAAACCA GTAGCTTCAA GAACGA7T7C 7ACACCGTCA 4260 
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GTAGCCCAGT CGATTTGTTC TGGATCACGT TCAGCAGAAA CTW^CnAA TTTACCGTTA 4 320 

ACTTCAAATC CACCTTCTTT AACTTCAACA GTACCGTCGA AACGACCTTG AGTTGTGTCG 4 380 

TATTTCAACA AGTGTGCAAG CATAACTGGA TCTGTAAGGT CGTTGATGCG TGTAACTTCA 4440 

ACACCTTCTA CGTTTTGGAT ACGACGGAAA GCAAGACGAC CGATACCTCC GAAACCGTTA 4 500 

ATACCAACTT TAACTACCAT TAGTGATTTC CTCCTTATGA AAATCATGAA ATTTTTATTG 4 560 

TGAAAAGAGT AACTTGAATC ACTACAAATC ACCTTTCAAC AAACCTATTA TACAACTATT 4 620 

TGAGTTGAAT TGCAAGTATG GCCATTGTTT TTCTATGTTA GTTTCTTTTT AAGACTGTAA 4 630 

ACCAAGGAAT CCCTTACTAT TCATAGCATA ACGATTCTAT AGCATCCATT TTACTAATCT 474 0 

TACGCCCCCG GAAG7AGGCT GAGACATAAC CAAGTAATAG AGCGAAAACT AGAGTTCCTA 4 800 

AAACAGATAA AAGATTTAAT TTAAAAACCT TAGTGATGGA TGGGTAAAAG TCACTTACAA 4860 

TCGCATTCGC CAAACTTCCC ACCCCTTGTG CAACCAAAAA TGCCAGCACC AAGGCGATGC 4 920 

CTACAATCCA GATAGCCTCG TAAATAAAAA TTCCTTTGAC ATCACGATTC TGATAACCAA 4 980 

CTGCTTTCAT GACACCTATT TCCTTGGAAC CTTGCATGAT ATTGATGTAA ATAATGATAC 3040 

CAATCATAAC CGCTGCTACC ACAATAGCTT GTGATGAAAG CACAATCAAT AATCCCTGAA 5100 

TAACACGAAT AAAGGTAATC ACAATATCAA GAACTCTCTG TTGAGAAAGC ACAGTATACT 5160 

TCTTATTTTT CTGTAATTCT TCTGTTACTA CTTTTGTCTG TGATGGATCT TTGAGTTCCA 5220 
AGATAAAATA AGATACAGCT TTCGTAAATC CAGCCTCTTT CAAAATCGTT TCCATTTGAT 5280 
GAGACAGCAT GAAACTGTTG CTGTCCTCCA TGTCATCTTC ATCATTGATT ACACGTACAA 5340 
TCTTCGTTTG AAATTGAGCA ATCTTACTAG TTTCGGCAGC ACTTTCTACA ATGCTGGCTG 5400 
AGACTGATTT GCCAATAAGA TCATTAGCTG TCAAATTTTT TCCTGTCTGT TCATTCCAAT 54 60 

TTTTTAGTAA ACTGCTTGGA ATCGTTAATC CCTGTTCATT TGTATCAGTA TAGAGGGATC 5520 
CAGCCAACAC TTTGTCCCTC TC ATT ATT AC TAACAGAGAT ACTTGTATCA TCATAAAGAC 5580 
TCACTACTTG AGCATAAGAA CCCATCCTTT GACTCAGATC CATTTCTTCC CCATCTATAG 5640 
TAATATTTGA CATGTTCATC CCAAAAGGAC TCTCCAAATA TTTAATAGCT TCTTTCCCAA 5700 
CTGTATCCGT GATATATAGT CAATTGAAAC AAGACCAGGA TAAAAAAGCC TCGTAAAAGG 57 60 

TATTGCAACT TGGTAATACC TTTTTGAGGT GCTTTTTGAT ATGAGCCCAT GTTTTCTCAA 5320 
TAGGATTGTA CTCAGGCGAG TAGCGAGGAA GAGGTAAAAG TTTATGCCCA AACTCTTCGC 5980 
ATAAAAGTTC TAGCTTCCCC ATTCTATGGA ATCTTACATT ATCCATAATA ATAACCGATG 594 0 

GTGTGTTTAA TCTTGCTAAG AGAAAATTCT GAAACCAAGC TTCAAAAAAG TCGCTCGTCA 
TCCTCTCTTC GTAAGTCATT GGAGCGATTA ATTCACCATT TGTTAGACCT GCAACCAAAG 



5000 
6060 
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AAATCCTCTG 


ATATCTTCTT 


CCAGATACTT 


TGCCTCTTAT 


TAATTGACCT 


TTTAATGAGC 


6120 


GACCATATTC 


TCGATAAAAA 


TAAGTATCGA 


ATCCTGTTTC 


GTCAATCTAA 


ACAGCTGCTA 


6180 


GGTGCTTTAA 


ACTATTAAAA 


TTCTTAAGAA 


ATAAGCCTAC 


TTTTTCTGGG 


TCTTGTTCAT 


6240 


AGTAGGTGTG 


GTTCTTTTTT 


CGAGTGTAGC 


CCATAGCTTT 


GAGCGTATAG 


TGGATGGTAG 


6300 


TTGGATGACA 


GCCAAATTCA 


GAAGCTATTT 


CAGTCAAATA 


AGCGTCTGGA 


TTCTCAGTAA 


6360 


GAT AGTTTTT 


AAGTCTATCT-- 


.CTATCAACCT _ 


.TTCTT.GGTTT _ 


TATTCCTiTP_ 


ACTTGGTGGT 


6420 


TTAGCTCTCC 


TGTTTTCTCT 


TTTAGCTTTA 


ACCAGCCATA 


AATGGTATTA 


CGTGAGATTT 


6430 


GGAAAACGTG 


TGATGCTTCT 


GTTATACTAC 


CTGTTCGCTC 


ACAATAAGAG 


AGAACTTTTT 


6540 


TACGAAAATC 


TATTGAATAT 


GCCATAAAAA 


GATTATACCA 


CATTGTGTAC 


TATTTTTGGT 


6600 


TCATTTTACT 


ATATTTGAAG 


AGGCGTTTAA 


ACTATCTGAC 


ATAAAACTCG 


TTCTAGAGGA 


6660 


AAGACATCCT 


TTAAAAAGTT 


AGTTTATTTT 


ACAACTTAGA 


CATCAAGGTA 


GGTTAACCCC 


6720 


TTCATGGAAA 


AATCAAGACT 


CTTAGCACTA 


TCGCTTAAAC 


TACCACTGGA 


CACGTAATCA 


6780 


ATCCCTAAAC 


CACGAAAACG 


GCTAATAGTG 


GTCATATCAA 


TATTTCCAGA 


ACATTCAATC 


6840 


CGAGAACGTC 


CTGCAATTAG 


GGTAATGGCC 


TGTTCAATCT 


GTTCCAATGA 


CATATTATCC 


6900 


AACATGATAA 


TATCAGCACC 


CGCCGCCGCA 


GCTTCTTCGG 


CAGCAGCAAG 


GCTTTCCACT 


6960 


TCCACCTCGA 


CCATTTTCAC 


AAAAGGGGCA 


TAGGCACGCG 


CTTGAGCAAT 


TGCCTTTTGA 


7020 


ACACTACCTA 


CTGCCGCAAT 


GTGATTGTCT 


TTTAGCACCA 


TAGCATCTGA 


TAAATTAAAG 


7080 


CCATGATTAT 


AGCCACCGCC 


AACTCTCACG 


GCATATTTCT 


CAAAAAGACG 


TAAATTAGGA 


7140 


UTACj i . i TT r . 


'"■ a r.T \TC AAA 


^ACCTTAATG 


CAATCATCGC 


CTAAGGCTTC 


TACATAAGCA 


7200 


GCTGTCATCG 


AAGCAATCCC 


TCATAAATCT 


TCTAAAAAAT 


TCAAGGCAAC 


GCGTTCACAT 


7260 


GTTAAGAGAC 


TTCTCACCGA 


GCCTATGATT 


TCTAAAACCA 


AATCGCCACT 


AGTCAAACGA 


7320 


TCCCCATCCT 


TAAATTGATG 


AGGATTCTGG 


AAGGTCACCT 


CGGCATCAAA 


TAGGGTAAAA 


7380 


ACCCTTTGAA 


AAACGGTTAG 


CCCCGCTAAA 


ACACCAGCTT 


CCTTGGCAAA 


AAGCGACACC 


7440 


TTGGCTTGGC 


CATGATGATC 


AAAAATGGCA 


TTGGTACTCT 


AATCTTCGGA 


ATCAACATCT 


7500 


TCTCGCAAGG 


CTGCTTTCAA 


TGTATCATCT 


ATTTGAAAAG 


GGGTTAAATC 


AGTTGAAATG 


7560 


AT7GACATCA 


C 










7571 



(2J INFORMATION FOR SEQ ID NO; 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26385 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TTTCCTACTG GCTTAAATTC TTCAGGAAAA TCAGGCGTAT CTAAAAGTCG TGTCGTTTTT 60 

GTTTCATCTA TATAAAGACT TCCTGCTCCC CCTACAACTA GAAAACGTGT CTGTGTTCCA 120 

GCAAGAAGCT GATTAAATAG TTCGATTGAT TTGCTGTGGA GCGGTAGCGT ATCTGGTGTA 180 

TAAGCACCAA ACGCTGAAAT AACAGCATCA AATCCAGTAA GATCATCTTT TGTCAACTCA 240 

AATAAATCTT TTTTAATAAT AGACTCAGCT TGACTTTTGT TTTCACAACG AACAATAGCC 300 

GTTACTTCAT GTCCTCGTTT GACTGCTTCT TCAACAATTG CTTTCCCCGC TTGTCCATTT 3 60 

GCTGCAATAA CTGCTAGTTT CATTTTTTAT ACCTCTCTTG TTGTAATTAT TTTAGTTACA 420 

GAAATTGTGA CACTCTTAAT AATCAATGTC AATAGTCTTG CTTAATTATT ATCAAAATAT 4 80 

TTCTACCAAG AAAACTAACC ATGATTCTAG TGAAAAAAAA TCTTCTTTGT CAACAAATTT 540 

ACTTTCTTGT TTTAAACATG CTATAATAAT CATAGCAAGA CATCTAACTT GTCTGTTTTT 600 

TTAAAACGAG GTGATTATCA TGCGTAGATT CTATTCCCAT CTCCCCTACT ATCTGGTCAT 660 

ATTATTCTTT TATTGGCCAC TTTATGAGTT GTTCTTACTA GTTGTTTCTG ACCCCCTTAC 7 20 

ACTCAAGGGA CTCTATATAA ACAATCTTCT CTTCTTTACA CCTCTGCTAA TCTTGATTGT 780 

ATCGTTACTC TATAGCTACC GTTTCCGTTT CTCACTTTGA TGGTTAGTTG GTAACGGACT 840 

GCTCTTTTAC TTTACTATCA TAACCTTTGG TGAGTTTATA CTAATTTACT TGCTAATCTA 900 

TGAAACAGTT GCTCTGGTCG GCATGGATTC TGGTATTACC ATCAAGCATA TTCTACAAAA 960 

AATGAAAAAC AAAAAACTTT CACAAAATCC TTGAAAAATC TCACAATCAT GCTATAATAA 1020 

TCCATAGAGA CAAGTCACTT AGTCCCTTTC TACTAGACAG TGCGTGGTTG CTGGAAACGC 1080 

ATAGGAAGTC TAAACTGATA CTACTCTTGA GTTTTTTATG AAAACATAAA ACGGTGGCCA 1140 

CGTTAGACCC GATCAGAGGT GTCCCTCTCT TTTGAGGTAC ATAAATGAAG GTCGAACCAC 1200 

GTTGCGACGT CCTTTCGAGG ATGTCGCATT TTTTTATTAG GATACTAATT ATGGAGTTGC 1260 

AAGAATTAGT GGAGCGCAGT TGGGCAATCC GACAAGCTTA TCACGAACTG GAACTTAAGC 1320 

ATCATGATTC CAAGTGGACG GTAGAAGAAG ACCTCTTGGC TTTATCTAAT GATATTGGAA 1380 

ATTTCCAACG ACTGGTGATG ACAAAGCAAG GACGCTACTA TGATGAAACA CCCTACACAC 1440 

TGGAACAAAA ACTTTCAGAA AATATCTGGT GGCTATTAGA ACTTTCTCAA CGTTTGGATA 1500 

TAGACATTCT GACGGAAATG GAAAACTTCC TCTCTGATAA AGAAAAGCAA TTGAACGTTA 1560 

GCACTTGGAA GTAGTCTGCT GATAAAAAAT CAATGCTTAG AAACTATGAA ATAATAAAAA 1620 
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AGGAGAACAT CATGATTAAC ATTACTTTCC CAGATGGCGC TGTTCGTGAA TTCGAATCTG 1680 

GCGTAACAAC TTTTCAAATT CCCCAATCTA TCAGCAATTC CCTAGCTAAA AAAGCCTTGG 1740 

CTCGTAAATT CAACGGCAAA CTCATCGACA CTACTCCCCC TATCACTGAA GATGGAAGCA 1800 

TCGAAATTGT GACACCTGAT CACGAAGATG CCCTTCCAAT CTTGCGTCAC TCAGCAGCTC 1860 

ACTTCTTCGC CCAAGCAGCT CGTCGTCTTT TCCCAGACAT TCACTTGGGA GTTGGTCCAG 1920 

CCATCGAAGA TGGTTTC7AC TACGATACTG ACAACACAGC TCGTCAAATC TCTAACCAAC __1980 

ACCTTCCTCG TATCGAAGAA GAAATGCAAA AAATCGTCAA AGAAAACTTC CC ATCTATTC 2040 

CTGAAGAAGT GACTAAAGAC GAGGCACGTG AAATCTTCAA AAATGACCCT TACAAGTTCG 2100 

AATTCATTGA AGAACACTCA GAAGACGAAG GCGGTTTGAC TATCTATCG? CAGGGTGAA? 2160 

ATGTAGACCT CTCCCGTGGA CCTCACGTTC CATCAACAGG TCGTATCCAA ATCTTCCACC 2220 
TTCTCCATCT AGCTGGTGCG TACTGGCGTG GAAACAGCGA CAACGCTATG ATCCAACGTA - 2280 

TCTACGGTAC AGCTTGGTTT GACAAGAAAG ACTTGAAAAA CTACCTTCAA ATGCGTGAAG 2 340 

AAGCTAAGGA ACCTGACCAC CGTAAACTTG GTAAACAGCT TGACCTCTTT ATGATTTCAC 2400 

AAGAAGTGGG ACAAGGTTTG CCATTCTGGT TGCCAAATGG TGCGACTATC CGTCGTGAAT 24 60 

TGGAACCC7A CATCGTAAAC AAAGAGTTGG TTTCTGGCTA CCAACACCTC TACACTCCAC 2520 

CACTTGCTTC TGTTGAGCTT TACAAGACTT CTGGTCACTG GGATCATTAC CAAGAAGACA 2580 

TGTTCCCAAC CATGGACATG GGTGACGGGG AAGAATTTGT CCTTCGTCCA ATGAACTCTC 2 64 0 

CCCACCACAT CCAAGTTTTC AAACACCATC TTCACTCTTA CCG7GAATTG CCAATCCGTA 2 700 

TCGCTGAAAT CCGTATGATG CACCGTTACG AAAAATCTCG TGCCCTCACT CCCCTTCAAC 27 60 

G7GTACGTGA AATGTCACTC AACCACGGTC ACCTATTCGT TACTCCACAA CAAATCCAAG 2820 

AAGAATTCCA ACGTGCCCTT CAGTTGATTA TCGATGTTTA TGAAGACTTC AACTTGACTG 2 880 

ACTACCGCTT CCGCCTCTCT CTTCGTGACC CTCAAGATAC TCATAAGTAC TTTGATAACG 2 940 

ATGAGATGTG GGAAAATCCC CAAACCATGC TTCGTGCAGC TCTTGATGAA ATGGGCGTGG 3000 

ACTACTTTGA AGCCGAAGGT GAAGCAGCCT TCTACGGACC AAAATTGGAT ATCCAGATTA 3060 

AAACTGCCCT 7GGAAAAGAA GAAACCCTTT CTACTATCCA ACTTGATTTC TTGTTCCCAC 3120 

AACGCTTCGA CCTCAAATAC ATCGGAGCTG ATGGCGAAGA TCACCGTCCA GTCATGATCC 3183 

ACCGTGGGGT TATCTCAACT ATGGAACGCT TCACAGCTAT CTTGATTGAG AACTACAAGG 3240 

GGGCCTTCCC AACATGGCTG GCACCACACC AAGTAACCCT CATCCCAGTA TCTAACGAAA 3300 

AACACGTGGA CTACGCTTGG GAAGTGCCCA AGAAACTCCC TGACCGCGGT GTCCGTGCAC 3 3 60 
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ACGTACATGA GCGCAATGAA AAAATGCAGT TCAAGATCCG TGCTTCACAA ACCAGCAACA 34 20 

TTCCTTACCA ATTAATTGTT GGAGACAAAG AAATGGAAGA CGAAACAGTC AACGTTCGTC 34 90 

GCTACGGCCA AAAAGAAACA CAAACTGTCT CAGTTGATAA TTTTGTTCAA GCTATCCTAG 3540 

CTGATATCGC CAACAAATCA CGCGTTGAGA AATAAGAGTC TAGCATAAAA GCCTCCAATC 3 600 

TGGAGGCTTT TTCTCATCTA TTTTTACTCA AGGACTAACT TCACTTGAGC AAACTGAATC 3660 

CGCACTGTCG TTCCTTTTCC GACCTCAGAC TCGATACGAA TCTGGTGCCC CAGTTCTTCA 3720 

GAAATTTTCT TAGATAGATA AAGGCCAAGT CCAGAGGACT CCTCGCTCAA ACGGCCATTG 3780 

TATCCTCAAA AGCCACGTTC AAATACTCGG AGGACATCAC TGTTTTTTAT CCCGATTCCC 384 0 

GTATCTTTGA TACAAAGCTC TTGGTCATCC ATATAAATCT CCACACCACC TTCCTTCCTG 3900 

TACTTGAGAC TGTTTGAGAT GATTTGCTCA ATAACCACTA CCACCCACTT TTTATCCGTC 3960 

ACGATTTCTT TATCAAGGTC ATGTAGATTG ACATTTAAGC CTTTTTGAAT AAAGAAAAGA 4020 

GCATATTTAC GAATTATTTC CTTGACCAAG TCCTCAATTT GAACCTGCTT TAAGACCAAA 4080 

* TCATCATGGA AACTTTCTAA ACGCAGGTAC TGTAAAACTA GGTTGGTATA GGACTCGATT 4140 

TTCAAAATTT CCTGTTCTAG CTGCTGCTTC AGTTGGCGGT CGACCACTTC TGCAACTAAG 4200 

AGTTGACTGG CTGCAATGGG GGTCTTTATC TGATGGACCC ACAAGGTATA GTAATCCAGC 4260 

AAATCCGTCA GTTTTCTTTC TGCTTTTGAC CTCTGCTGAT AGAGTTCCAT CTCACGCGCT 4 320 

TCTAATTTTT CTGCTAAAGC TATTTCCAAA GGAGACTTGG CTTCCCTCTC TCCATAGAGA 4 380 

AGTTCCTGGC GATAGACCTG CGTTTCCACC AATATG7CCC AAGTGAAAAA TAATATGGTT 4440 

ACAAAGCAAC ACAAGAAGAA AAAGTAGAGG AAGTAAATTC CTAGACTGGC AAATAAAAAC 4 500 

TGAAAGAGTA AGACAAGAAA TGCCAAAGAA AGCAGATAGA TAAAAAGACG ACTACGGGAG 4560 

CGCAGATAGG CTAGAAAAAA TTGTTTCCAA TCAAGCATGC TTCAATCCGT ACCCTATTCC 4620 

TTTCTTGGTC TCGATAAATC CTACCAATCC CTGCTCCTCC AACTTTTTAC GCAAACGAGC 4 680 

CACATTGACA CAGAGCGTAT TATCATCAAT GAAAAAGTCA CTGTTCCAAA GTTCCCGCAT 47 4 0 

CAGGTCGTCA CCTGCTACGA TGTTGCCTGC ATGCTCAAAT AACACGCGTA AAATCTGGAA 4800 

TTCATTCTTG GTCAAATTCA AGACTTGCCC TTGATAATGT AAATCCATGG ATTTGGTATT 4 8 60 

GAGGATAACA CCAGCATATT CCAGCAAACT CTCATCACGC CCAAACTCAT AGCAACGACG 4 920 

CAACAAGCCC TGAACCTTAG CTAAAAGAAC CTGCTCGTCA AAACGCTTGG TCACAAAGTC 4 930 

ATCCGCCCCC ATATTGATTG CCATGACAAT ATCCATAGCC TGGTCTCTCG AAGAAAGAAA 5040 

CATGATAGGT ACCTTGGAAA TCTTGCGGAT TTCCTGACAC CAGTGATAAC CATTAAACAA 5100 

GGGCAAACCA ATATCCATGA GGACCAGATG ACCTTCCCAC TGAACAAATA GACTCAAAAC 5160 
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TTCCATAAAG TCTTCTACCA GGACCACTTC AAATCCCCAT TCAGAGAGCA TTTTCCCAAT 52 20 

CTGTTGACGA ATGACCTGAT CA7CTTCTAT TAATAAAATC TTG7CCATGC GCTTCTCCTT 5280 

TTCCATTATT ATAACAGATT TTTCCATGCT AGATGGTCTC AAACTGAATT TGAAATAGCC 5340 

TGTTTTTAGC CAGTACAAAC AGGCTATGCT ACTAGCTAAT TTGAGGGAAA TTTGCTAAGA 5400 

TAAATAAAAA GAAAGGACCT CTTATGGCCA ATATTTTTGA CTATCTCAAA GATGTCGCAT 54 60 

ATGATTCTTA TTACGACCTT CCCTTGAATG AGTTAGACAT T.CTAACCTTA_ATAGAAATCA 5 52.0. 

CCTACCTCTC CTTTGATAAT CTGGTCTCCA CACTTCCTCA ACGTCTT7TA GATCTAGCAC 5580 

CTCAGGTTCC AAGAGATCCC ACCATGCTTA CTAGCAAAAA TCGCCTTCAA TTATTAGATG 5640 

AATTGGCTCA ACACAAGCGC TTCAAAAATT GCAAACTCTC CCATTTTATC AACGACATCG 57 00 

ACCCTCAACT GCAAAAGCAA TTTGCGGCTA TGACTTATCC TGTCAGCCTC GATACCTATC 57 60 

TGATTGTCTT TCGTGGGACA GATGACAGTA TCATTCCCTG GAAGGAAGAT TTCCACCTGA 5820 

CCTATATGAA GGAAATTCCT GCTCAAAAGC ACGCCCTTCG CTATTTAAAG AACTTTTTTG 5880 

CCCATCATCC TAAGCAAAAC GTTATTCTAG CTGGGCATTC CAAGGGACGA AATCTCGCTA 5940 

TCTATGCTGC TACCCAAATT GAGCAAAGTT TGCAAAATCA GATCACAGCA CTTTATACAT 6000 

TTGATGCACC TGGTCTCCAT CAAGAATTGA CACAGAC7CC CGGTTATCAA AGGATAATGG 6060 

ATAGAAGCAA GATATTCATT CCACAAGGTT CCATTATCGG TATGATCCTC GAAATTCCTG 6120 

CTCACCAAAT CATCCTTCAG AGTACTCCCC TGGCTCCCAT CCCCCAGCAC GA7ACCTTTA 6180 

GTTCCCACA7 TGAGCACAAG CACTTCGTCC AACTGGATAA GACCAACAGT GATACCCAGC 6240 

AAGTAGACAC AACCTTTAAA GAATGGGTGG CCACAGTCCC TGACGAAGAA CTTCAGCTCT 6 300 

ACTTCGACCT CTTCTTTGGC ACTATTCTTG ATGCTGGTAT TAGCTCTATC AATGACTTGG 6 3 60 

CTTCCTTAAA GGCGCTTGAA TACATTCATC ATCTCTTTGT CCAAGCTCAA TCCCTCACTC 64 20 

CAGAAGAAAG AGAAACCTTG GGTCGCCTTA CCCAGTTATT GATTGATACT CGT7ACCACC 64 80 

CATGGAAAAA TACATAATAC TCTTGAAAAT TAAATGTATA CAAAACAAAA GACCTAGAAT 654 0 

ACATACTTTC ATGTGCATTC TAAGTCTTTT TAAATAGAAT CTAATAGTCA AT AAAAATC A 6600 

AAGAGCATTG AGAGATAATG GGGCTTGGAA CGTCCCTCTC GCTTCAACAA AATGACCCCA 6660 

TTATAGATTA AAAAGATGCC ACTTAGAAAA AGCAAAAAAG GAAGTAACAC AAAGGCAAAT 67 20 

ATATAAAAAG CTAACTGAAC ATTCTCGTAT CCATTTTTAT AAAAAAGGTA GGATAGATAA 6780 

AAATAACTTG AAATGAGGGA TAATAAAAAT AATACTGGAT TCCACAAAC7 7CTATTATCC 6 840 

7TCCAAAATG ACACTATAAA GGCTAATACA ATTCCTATAA CGAGATACAT T7CTTACTCC 6900 
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TTTAATAGCT ACATTTTATC ATAATTATCC AAAGAAAAAA GAGCCCATTT ATCCCTCTTA 6960 

ATCCTTCATC TGACTCTCTG CATCGGCCAC GACTTTTTCT AGACTGGTTT GACCAAGTTC 7C20 

TGCCTCCATA GTCAACTGAA TTCTCTCCAA TTTTTGATCC AAAACATCAT GAATATGAGC 7080 

TCCTACAGGG CAATTTGGAT TCGGATTGTC ATGGAAACTG AAGAGTTGAC CTGTCTTACC 714 0 

AAGACATTCG ACCGCCTGAT AAACATCTAA AAGACTAATA 7CCTTAACCT CCTTGACAAT 72 CO 

CTCTGTTCCG CCCGTTCCAC GCGCTACTGA AATCAGCTCT CCCTTCTTCA ACTGGGACAA 7 260 

GATCTTTCTG ATAATGACAG GATTGACCCC GACACTAGCA CCCAGAAAAT CACTGGTCAC 7 3 20 

CTTGCTTTCC TTCCCCTCGA GGGCAATGAT TATCAGCATA TGAGTCGCAA TCGTAAATCT 73 9C 

ACTTGGAAT7 TGCATCCTCT TCTCCTTTTT ACGAGCCTAC CCTGCCTCTA CTCTTCTTTT 7440 

TCTATTATTA TACCCTTTTT AGTTGTAATG TCAATCGTTA CCACTTTTCA ACCAGTCGTC 7500 

TAACTCCCGA TCGCAGCCCT CTTTCTGAGC CAATTCTCTC AAAAATTCCT GATGATGAGT 7560 

ATGGTGGATC CCATTGACCA GACTTTCATA GTAAACCTCA AAATAGGGAA GTCTCAGGTC 7 620 

TTTAGCCAGC TGCAATTCAG CTGCTACATC GTAGTCTACC CCTCGGAAG7 CCATATCTAC 7 680 

CAGGCCTTTC TCATCAAACT CCAAAATCAT ATACTGGGCC CGCAAGTCCT TCCGTAGCTG 77 40 

AGCGTCCAAA AACAAACGTT GGCCAATCGA ACCCCCATTG ACAATCAATT GCCCACCAGT 7 800 

CCCCTAACGA AGCAACTGCT GGTCAATATG TCCATAAACA GCAATATCAC AGGGAGGATG 7860 

ACTCACCAAC CGGTCAAACT CCTCTTGTTT GCCAGTATGA ATCAACTCTC GCCCCCAGTT 7920 

CTTATCAGGC AGATGATGGC TAATTCCCAC CCTCAAATCC CCAAACTGAC GATGAATTTC 7 930 

AAGAGGTTGA 7TGTCGACCA CTTCAATTTC TTCTAGGGAA ATTTCCTCTA AAACATACTG 804 0 

GCACTGGCCC AAGAGATAGC GTTGACTGGG GCGAGTACTG TCCAATTCCT TACGGACACC 9100 

ATGCCAAAGA GTCTCTTCCC AGTTTCCCAA AACTCTAGCC GTAATCGGTA GTTGATCCAA 8160 

CAAGTCCAAA ATCCTTCTAC GCCCTGTCCC TGGCATGACA ATATCTCCCA AAAGCCAGTA 8220 

TTCATCCACT CCTATCTGCC GACCATCTGC CAAAACAGGC TCCAAGGCGG TGGTATTTCC 8280 

ATGAATATCT GAAAGAAGAG CTATTTTCGT CATATCCATC TCCTCGTTTT TTCTCTTGCA 3 340 

ATAAGTATAA CATAAAAAGT CACAGCTAGA GAAATCTAGC TTTTTTTG A T ATACTAGATA 94 00 

AAGATATTAG ACAAGAGGAA ACGAATGACC CCAAACAAAG AAGACTATCT AAAATGTATT 84 60 

TATGAAATTG GCATAGACCT GCATAACATT ACCAACAAGG AAATTCCCGC TCGCATGCAA 8520 

GTCTCTCCCC CTGCCGTAAC TGAAATGATC AAACGAATGA AAAGTGAAAA TCTCATCCTA 9580 

AAGGACAAGG AATCTCGCTA TCTACTGACT GACCTCGGTC TCAAACTGGT CTCTGAGCTC 3640 

TATCGTAAGC ACCGCTTCAT TGAAGTTTTT CTAGTTCATC ATT TAG ACTA TACAAGTGAC 8700 
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CAGATTCACG AGGAACCTGA GGTCTTGGAA CACACTGTCT C7GACCTCTT CGTGGAAAGA 876C 

CTAGATAAAC TCCTACGTTT CCCTAAAACC TGCCCCCACG GGGGAACTAT TCC7GCCAAG 8820 

GGAGAACTAC TCGTTGAAAT CAATAACCTC CCACTAGCTG ATATCAAGGA AGCTGGCGCC 8880 

TACCGCCTGA CTCGCCTGCA CGATAGTTTT GACATTCTCC A77ATCTCGA CAAGCACTCA 894 0 

CTTCACATCG GTGACCAGCT CCAAGTCAAG CAGTTTGATG GCTTCACCAA TACCTTCACT 9000 

- A7CC7GAC7A ACGACCAGGA 777ACAAGTG AATATGGACA TTGCAAAACA ACTCTATGTC 9060 

GAGAAAATCA ACTAATTTCT CAAG7CCCCT ACCAACCCTG AAACT77TAT TTTGGCTCTTT 9120 

TG7CAAC7GT AGTGGGTTCA AGTCAGCTAA GCTCGAGAAA GGACAAATTT 7C7CC77TC7 9180 

TTTTTGATAT TCAGAGCGA7 AAAAA7CCG7 TTTTTGAAGT TTTCAAAGTT CCGAAAACCA 92 4 0 

AAGGCATTGC GCTTGATAAG TTTGATGAGA T7ATTCCTCG C77CCACT77 CGCAT7AGAA 9300 

TAGTGTAGTT GAACGGCCTT GACAATCTTT TCTTTATCTT TGAGGAAGGT TTTAAAGACA 9 3 60 

GTCTGAAAAA TAGGATGAAC CTGCTTTAGA TTGTCCTCAA TGAGTCCGAA AAATTTCTCC 94 20 

CG7T7C77AT TCTGAAAGTG AAACAGCAAG AGTTCATAGA GCTGATAGTG GTGTTTCAAG 94 80 

TC7TGTGAAT AGCTCAAAAG CTTGTCTAAA ATCTCT77AT TCGTTAAGTG CATACGAAAA 9540 

GTAGGACGAT AAAATCGCTT ATCACTCAGT TTACGGCTAT CCTGTTGTAT GACC7TCCAG 9 600 

TAGCGCTTGA 7AGCCTTGTA TTCATGGGAT TTTCGATCCA ATTGGTTCAT AATTTGAACA 9660 

CGCACACCAC TCATAGCACG GCTAAGATGT TCTACAATGT GAAAGCGATC CAACACCATT 9720 

T7ACCA77CC GGAGTGAAAC AGTCTGGGAG AC7CTTTCAG CCTGAGCCTA GAAATTTGAA 9 7 30 

AGCGAAGCTG TTTAGCCAAG TCATAGTAAG GACTAAACAT ATCCATCGTA ATGATTTTCA 984 0 

CTTGACAACG AACGGCTCTA TCGTAGCGAA GAAAGTGATT TCGGATGACA CC7TC7CT7C 9900 

TGCCTTCAAG AACAGTGATA ATATTAAGAT TATCAAAATC TTGCGCAATG AAACTCATC7 9960 

7TCCC7TAG7 CAAGGCA7AC TCATCCCAAG ACA7AA7C77 TGGAAGCCGA GAAAAA7CAT 10020 

GCTCAAAGTG AAAGTCATTG AGCTTGCGAA TGACAGTTGA ACTTGAAATG GCCACCTGA7 100 80 

GGGCAATATC AGTCA7AGAA A7TTT77CAA T7AACTTT7G AGCAATyTTT 7GGTTGA7GA 10140 

7ACGAGGGAT TTGGTGA777 T7C7T7ACCA GGGGAG7CTC AGCAACCA7C A7TTTTGAAC 1G200 

AC7GA7AGCA CT7GAAACGA CGC7T7C7AA GCAGAA77C7 AGAAGGCA7A CCAG7CG777 L0260 

CAAGATAAGG AAT777AGAA GG77TT7GAA AG7CA7A777 CT7CAAT7GG T7TCCGCAC7 10320 

CAGGGCAAGA TGGGCCCTCG TAG7CCAGT7 TGGCGATGAT TTCC77C7GT GTATCCT7A7 103 30 

7GA7GA7G7C TAAAATCTGG ATAT7AGGGT C777AATG7C TAG7AA7777 G7GATAAAA7 10440 
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GTAATTGTTC CATATGATTC TTTCTAATGA GTTCTTTTGT CGCTTTTCAT TATAGGTCAT 10500 

ATGCCACTTT TTTTCTACAA TAAAATAGGC TCCATAATAT CTATAGTGGA TTTACCCACT 10560 

ACAAATATTA TAGAACCGTA AAAATAGAAG GAGATAGCAG C7TTTCAAGC CTGCTATCTT 10620 

TTTTTGATGA CATTCAGGCT GATACGAAAT CATAAGAGGT CTGAAACTAC TTTCAGAGTA 10680 

GTCTCTTCTA TAAAATATAG TAGATTGAAA TAAGATGTGA ACAACTCTAT CAGGAAAGTC 107 40 

AAATTAATTT ATAGAATTAT TTTAGCAGTC AAGGTGTACT GTTATAGATT CAATATATTA 10800 

TATGACTATT AACCTTCTCT TCTCCTAAAA TTGACTTTCT TGTTTTCTTA TCTTGTCCAC 10860 

TCGAAACAAG TATTGTAACA ATTTGATTAT TTTTGAAAGT ACTTTTAATA TACTTGATAT 10920 

AGTTAAAAAA GATTTGAAAC TAAATTCCAA ATTACAAAAA GACTTCAAAT ACTAAAAAAA 10980 

AAAAAGTATA CTCTAATTCA AAACGGTAAC AAAACTAATT TAGACAATGA AATATAGAGT 11040 

ATTTCTCTCT TAAAAGTTTT TGGTGAAACG AGATGTAGAA AGGAGATTTA GCCAAAGACT 11100 

CTATTAGTCC TAGAATAATA GATTAGAATT ATTTTAGAAA AACGAAGTGA GCAGCTTATA 11160 

AATTCAAGTC CCCAAATAGA TTCATACTAG TATCTTTTCC AAAAAATAAA GGGCGACTTC 11220 

CTTCATGAAT ATCAATTTCA TCTATAAGCA AGGTAGCTAA TTGAACTAAC TTATTTATTC 11280 

TCTTTGTCGC TAGAAAAATC AGACCTCCTT GTGAAGATTG AGGAGATACT TAATGAAAAT 11340 

CAAAGAAGAA ACTAGCAAGC TAGTAGCAGA TTGCCCAAAA CACCGCTTTG AGGTTGTAGA 114O0 

TAAGACTGAC CTATATAATC CAAGGTGAAG CGACTGTGGT TTGAAGAGAT TTTCAAAGAC 114 60 

TATAGGCTAG AGAGTAGTGT TTTTATGTCC TTCTAGTAGA AAATGCTAGA CAGAAGAATG 11520 

GCCAACTTGC A7AGGAAAAA TAGATTGAGA AAGGAGGTTA GAAGAGATGA TTATTACAAA 11580 

AATTAGCCGT TTAGGAACTT ATGTGGGAGT AAATCCACAT 7TTGCAACAT TAATAGATTT 11640 

TCTAGAAAAA ACAGGACTAG AAAATTTAAC AGAAGGTTCG ATTGCTATCG ATGGTAATCG 11700 

ATTGTTTGGG AATTCCTTTA CTTATCTAGC AGATGGTCAA GCAGGGGCTT TCTTTCAAAC 117 60 

CCACCAAAAA TATTTGGATA TTCATTTAGT TTTGGAAAAC GAAGAAGCCA TGGCTGTTAC 11820 

ATCGCCGGAA AATGTAAGCG TTACCCAAGA ATATGATGAA CACAAAGATA TTGAATTATA 11880 

CACAGGGAAA GTGGAACAGT TGGTTCATTT GAGAGCTGGC GAATGCCTCA TCACTTTTCC 11940 

AGAAGATTTA CATCAACCCA AGGTTCGTAT AAATGATGAA CCTGTGAAAA AAGTTCTCTT 12000 

TAAAGTTGCG ATTTCTTAAT GTAGAAAGAG AAGAACGATG AAAAAAATGA GAAAGTTTTT 12060 

ATGTCTAGCT GGAATTGCGC TAGCCGCTCT TGCCTTGCTA CCTTGTTCAG GAAAAAAAGA 12120 

ACCTACAACT AGTACTGAAC CACCAACAGA ATTATCTCGT GAGATTACAA TCTGCCACTC 12180 

CTTTACTCAA GGACCCCGTT TAGAAAGTAT TCAAAAATCA GCAGATGCTT 7CATGCAAAA 12 240 
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GCATCCAAAA ACGAAAATCA AGATTGAAAC ATTTTCTTCG AATGACTTCT ATACTAAATG 12 300 

GACTACAGGT TTAGCAAATG GAAATGTGCC ACATATCAGT ACAGCTCTTC CTAACCAAGT 12 360 

AATGGAAATG GTCAACTCAG ATGCTTTGGT TCCGCTAAAT GATTCTATCA AGCGTATTGG 12420 

ACAAGATAAA TTTAACGAAA CTGCCTTAAA TGAAGCAAAA ATCGGAGATG ATT ACT ACT C 12480 

TGTTCCTCTT TATTCACATG CACAAGTCAT GTGGGTTAGA ACAGATTTCT TAAAAGAACA 12 54 0 

TAATATTGAG GTTCCTAAAA CTTGGGATCA ACTCTATCAA CCTTCTAAAA AATTG AAAG A ^ 12 600 

AGCTGGAGTT TATGGCTTGT CTGTTCCGTT TGGAACAAA? GACTTAATGG CAACACGTTT 12 560 

CTTGAACTTC TACGTACGTA rTGGTGGAGG AAGCCTCTTA ACAAAAGATC TTAAAGCACA 12720 

CTTGACAAGC CAACTTGCTC AAGATCGTAT TAAATACTGG GTTAAATTGT ATAAAGAAAT 12 780 

CTCACCTCAA GATTCTTTGA ACTTTAATGT CCTTCAACAA CCTACCTTCT TCTATCAAGG 12840 

AAAAACAGCA TTTGACTTTA ACTCTCCCTT CCATATCGGA GGAATTAATG CCAACAGTCC 12900 

TCAATTGATT GATTCGATTG ATGCTTATCC TATTCCAAAA ATCAAAGAGT CTGATAAAGA 129 60 

CCAAGGAATT GAAACCTCAA ACATTCCAAT GCTTGTTTGG AAAAATTCAA AACATCCAGA 13C20 

AGTTGCTAAA GCATTCTTAG AAGCACTTTA TAATGAAGAA GACTACGTTA AATTCCTTGA 13080 

TTCAACTCCA GTAGGTATGT TGCCAACTAT TAACCGGATT AGCGATTCTG CAGCCTATAA 13 140 

AGAAAATGAA ACTCGTAAGA AATTTAAACA TGCTCAACAA GTAATTACTG AAGCTGTTAA 13200 

AAAAGCTACT GCTATTGGTT ATGAAAATGG GCCAAGTGTA CAAGCTGGTA TGTTGACTAA 132 60 

CCAACACATT ATTCAACAAA TGTTCCAAGA T AT C ATT AC A AATCCAACAC ATCCTATGAA 13 320 

AGCAGCAAAA GAAGCAGAAA AACAATTAAA TGATTTATTT GAGGCTGTTC AGTAGATGTA 13 380 

AAAGACTAGA AAATAGGTGG GATAGTGAGC TGAAAAGCTC TAGCCCAATC TTGTAAAAGA 13440 

AGGGAGAAGG AGAATGGTTA AAGAACGTAA TTTAACTCGC TGGATATTTG TTTTGCCACC 13 500 

TATGATTATC GTAGGATTAC TCTTTCTTTA TCCGTTTTTC TCGAGTATTT TTTATACCTT 13560 

TACCAATAAG CATTTGATTA TGCCTAATTA TAAATTTGTT GGTTTGGCTA ACTATAAACC 13620 

TGTGCTATCA GATCCCAACT TCTTTAATCC GTTCTTTAAT TCAATTAAGT GGACCGTTTT 13 680 

CTCATTAGTT GGTCAAGTTT TAGTAGGGTT TGTATTGGCT TTAGCTCTTC ACAGAGTACG 13740 

CCACTTCAAG AAATTATATA GG AC ATT ATT GATTCTTCCT TGGGCATTTC CTACCATCGT 13 800 

TATTGCCTTC TCTTGGCAGT GGATTCTAAA CGGGGTTTAT GGCTACTTAC CTAATCTAAT 13860 

CGTAAAATTA CGTTTAATGG AACATACACC TGCATTTTTG ACAGATAGTA CATGGGCATT 13920 

CCTATGTTTG GTGTTTATCA ACATTTGGTT TGGAGCACCA ATGATTATGG TTAATGTGCT 13 980 
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TTCACCTTTG CAAACAGTAC CAGAAGAACA ATTTGAGCCT GCTAAGATAG ATGGTGCTTC 1404 0 

AAGTTGGCAG GTGTTCAAGT TTATCGTCTT TCCACATATT AAAGTCCTTC TAGGACTTCT 14 100 

AGTTGTTTTG AGAACTGTAT GGATCTTTAA TAACTTTGAC ATTATCTACC TCATTACTGG 14 160 

TGGTGGACCA GCCAATGCTA CAACGACGCT TCCAATTTTT GCTTACAACC TGCGCTGGCC 14 220 

AACTAAATTG TTGGGTCG7G CTTCAGCAGT TACAGTACTG CTCTTTATCT TCTTGGTGCC 14280 

GATTTGCTTT ATCTACTTTG CTATCATCAG TAAGTGGGAA AAGGAGGGTA GAAAATAATG 14 340 

AAGAAGAAAT CCACTATTTA TTTAGATATT CTCTCACATG TACTTTTAGT TGGTGCGACC 14400 

ATCGTTGCAG 7TTTCCCATT GGTA7GCATT ATCATATCTT CTGTCAAAGG GAAAGGGGAA 14 4 6C 

TTAACTCAGT ATCCAACACG ATTTTGGCCT GAACAGTTTA CATTAGATTA TTTCACTCAT 14 520 

GTTATCAACG ATTTGCACTT CATTGATAAC ATTCCAAACA GTTTAATCAT TGCCTTGGCT 14580 

ACAACCCTTA TTGCGATTAT TATTTCTGCT ATGGCAGCCT ATCCTATTGT TCGATTCTTT 14 640 

CCTAAATTGG GAGCAATCAT GTCGAGACTA CTCGTCATTA CCTACATTTT CCCACCAATT 14 700 

TTGTTAGCAA TTCCCTATTC AATTGCCATT GCTAAAGTTG GGTTAACAAA TAGTTTATTT 14760 

GGCTTGATGA TGGTTTATCT ATCTTTTAGT GTTCCATATG CAGTTTGCCT CTTAGTTGGA 14 820 

TTTTTCCAAA CAGTTCCAAT TGCAATTGAA CAACCGGCTA GAATTGATGG TGCAAATAAA 14 880 

TTTGTTACGT TTTATAAAGT TGTGCTACCG ATTGTAGCAC CAGGTATTGT AGCAACAGC7 U94 0 

ATTTATACAT TTATCAATGC TTGGAATGAA TTCCTGTATG CCTTGATTTT GATTAACAAT 15000 

ACAGGAAAGA TGACAGTAGC AGTACCCCTT CGTTCACTTA ATGGTTCAGA AATACTAGAC 15060 

TGGCGAGATA TGATGGCACC GTCTGTTATT CTAGTTCTTC CATCAATTAT TTTCTTCTCT 15120 

ATCATCCAAA ATAAGATTCC AAGTGGATTA TCAGAAGGAT CTGTCAACTA GACGAAAGAA 15190 

GGAAAAAAAT GAATAAAAGA CCTCTTTATT CAAAACTAGG AATTTCCGTT GTAGGCATTA 15240 

CTC7TTTAAT GGGAGTCCCC ACTTTGATTC ATCCGAATGA ATTAAACTAT GGTCAACTGT 15 300 

CCATATCTCC TATTTTTCAA CGAGGTTCAT ATCAACTGAA CAATAAGAGT ATAGATATCA 15 360 

GCTCTTT G TT ATTAGATAAA TTGTCTGGAC AGAGTCAGAC AGTAGTAATG AAATTTAAAG 154 20 

CAGATAAACC AAACTCTCT? CAAGCTTTGT TTGGCCTATC TAATAGTAAA GCAGGCTTTA 154 80 

AAAATAATTA CTTTTCAATT TTCATGAGAG ATTCTGGTGA GATAGGTGTA GAAATAAGAG 1554 0 

ACGCCCAAAA GGGAATAAAT TATTTATTTT CCAGACCAGC TTCATTATGG GGAAAACATA 15600 

AAGGACAGGC AGTTGAAAAT ACACTAGTAT TTGTATCTGA TTCTAAACAT AAAACATACA 15660 

CAATCTATCT TAA7GGAATA GAAGTGTTCT CTGAAACAGT TGATACATTT 7TGCCAATTT 15720 

CAAATATAAA TGGTATAGAT AAGGCAACAC TAGCACCTC? TAATCGTGAA GGTAAGGAAC 15780 
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GCCGCAAGGG 


AGCCCAATTA 


GTTGTCGGTT 


TAGTCAATAA 


AGAAGATGAT 


AGTATTGATT 


17 100 




GGAAATACCA 


CTATGATATT 


GATTTGCCTT 


CGTATGCTTA 


TGCCTATTCT 


GCGATTACAG 


17160 




AATTGCCAAA 


TCATCACATA 


GGTGTACTGT 


TTGAAAAATA 


TGATTCGTGG 


TCGAGAAATG 


17220 




AATTGCATTT 


AAGCAATGTA 


GTTCAGTATA 


TAGATTTGGA 


AATTAATCAT 


TTAACAAAAT 


17290 




AAAGGAGAAA 


AACATGGTTA 


AATACGGTGT TGTTCGAACA GGGTATTTTG 


GAGCTCAATT 


17340 




GGCTCGCTAC 


ATGCAAAAGA 


ATGATGGAGC 


AGAGATTACT 


CTTCTCTATG 


ATCCAGATAA 


17400 




TGCAGAGGCG 


ATTGCAGAAG 


AATTGGGAGC 


AAAACTAGCA 


AGTTCCTTAG 


ATGACTTCGT 


17460 




TTCTAGCGAT 


GAAGTAGATT 


GTGTTATCGT 


CGCAACTCCA 


AATAATCTTC 


ATAAGGAACC 


17520 
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GGTTATTAAG 


GCTGCACAGC 


ATGGTAAAAA 


TGTTTTCTCT 


GftftftAACCAA 


TTGCGCTTTC 


17580 


TTATCAAGAT 


TGTCGCGAGA 


TGGTAGATGC 


GTGTAAAGAA 


AACAATGTAA 


CCTTTATGGC 


17640 


AGGACATATT 


ATGAATTTCT 


TTAATGGTGT 


TCATCATCCA 


AAAGAACTCA 


TTAATCAAGG 


17700 


AGTTATCGGA GACGTTCTAT 


ATTGTCATAC 


AGCTCGTAAT 


GGTTGGGAAG 


AACAACAACC 


17760 


GTCAGTATCA 


TGGAAAAAAA 


TTCGTGAAAA 


ATCAGGTGGT 


CACTTGTATC 


ACCACATCCA 


17820 


TGAATTGCAT 


TGCGTTCAAT 


TCCTTATGGG 


GGGCATGCCT 


GAAACTGTAA 


CCATGACAGG 


17880 


TGGAAATGTG 


GCCCATGAAG 


GTGAACATTT 


CCGTGATGAA 


GATGATATCA 


TTTTTGTCAA 


17940 


7ATGGAATTT 


TCTAATAAGC 


GTTTTGCCTT 


GTTAGAATGG 


GGTTCACCTT 


ATCGTTGGGG 


13000 


TGAACATTAT 


GTCTTAATCC 


AAGGAAGCAA 


AGGTGCCATC 


CGCTTAGACT 


TATTCAACTG 


13060 


TAAAGGAACT 


CTTAAGCTAG 


ATGCGCAAGA 


AAGCTATTTC 


TTGATTCACG 


AATCGCAAGA 


13120 


AGAAGATGAT 


GATCGGACTC 


GTATCTATCA 


TAGTACAGAG 


ATGGATGGAG 


CAATTGCTTA 


18180 


TGGTAAACCA 


GGTAAACGTA 


CTCCATTATG 


GCTATCATCT 


GTCATTGATA 


AAGAAATGCG 


13240 


CTATCTGCAT GAGATTATGG 


AAGGAGCTCC 


AGTATCAGAA 


GAATTTGCAA 


AACTTTTGAC 


18300 


ACGTGAAGCT 


GCCCTAGAAG 


CAATTGCTAC 


TGCAGATGCT 


TGTACCCACT 


CTATGTTTGA 


18360 


AGATCGCAAA 


GTAAAATTGT 


CAGAAATTGT 


AAAATAAATT 


TTGGTATTC'T 


CCTATTTATA 


18420 


GGTCGACTTG 


CTCCTCTGAA 


AGTACTTTTA 


GAGGAGCTGT 


TTGACTTTGC 


TAGTTTTTGA 


13480 


AACTGAAATC 


7ATTATACTA 


CAAACTATTG 


AAAGCGTTTT 


AATTTTAAGC 


TATAATAATC 


18540 


TCATACAAAT 


AAAGAAAAGG 


AGGAAAGAGG 


ATGCCACAGA 


TTAGCAAAGA 


AGCCTTCATT 


19600 


GAGCAAATCA 


AAGATGGAAT 


CATCCTT7CT 


TGTCAGCCTC 


TTCCTCATGA 


ACCGCTTTAT 


18660 


ACAGAAGCGG 


GAGGGGTGAT 


TCCCTTGCTG 


GTCAAAGCGG 


CTGAGCAAGG 


TGGAGCAGTC 


13720 


GGTATCCGAC 


CAAACAGTGT 


TCGCGATATC 


AAGGAAATTA 


AGGAAGTCAC 


TAAACTTCCA 


13730 


ATCATTGGGA 


TTATCAAACG 


TGATTATCCA 


CCTCAGGAAC 


CCTTCATCAC 


GGCTACTATG 


18840 


AAACAAGTTG 


ATGAATTGGC 


AGAACTGGAC 


ATCGAGGTGA 


TTGCTCTGGA 


TTGTACCAAG 


18900 


CGTGAACGCT 


ACGATGCTTT 


GGAAATTCAA 


GAGTTCATTC 


GTCAGGTTAA 


CGAGAAATAT 


18960 


CCTAATCAGC 


TTTTGATGGC 


TGATACTAGT 


ATCTTCGAAG 


AAGGGCTAGC 


AGCTGTAGAA 


19020 


GCAGGAATTG 


ACTTTGTCGG 


AACAACCTTA 


TCAGGCTACA 


CATCCTACAG 


TCCAAAAGTA 


19080 


GACGGTCCAG 


ATTTTGAATT 


GATTAAGAAA CTCTGTGATG 


CTCGTGTAGA 


TGTCATTGCA 


19140 


GAAGGAAAAA 


TTCATACACC 


AGAACAAGCC 


AAACAAATCC 


TTGAATATGG 


AGTGCGAGGC 


19200 


ATCGTTGTTG 


GTGGCGCCAT 


TACTAGACCA 


AAAGAGATTA 


CAGAACGCTT 


CGTTGCTAGT 


19260 


CTTAAATAAC 


ATGTGAGGCG 


GAGTTTTATG 


TTTAAAGTTT 


TACAAAAAGT 


TGGAAAAGCT 


19320 
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7T7ATGTTAC 


CTATAGCTAT 


ACTTCCTGCA 


GCAGGTCTAC 


TTTTCGCCAT 


TGGTGGTGCA 


19380 


CTTTCAAACC 


CAACCACGAT 


AGCAACTTAT 


CCAATACTAG 


ACAATAGTA7 


77TTCAATCA 


194-10 


A7ATTCCAAG 


TAATGAGCTC 


TGCAGGAGAG 


GT7G7A77CA 


G7AATTTG7C 


AC7ACT7CTC 


I95CC 


TGTGTGGGAT 


TATGTATTGG 


CTTAGCGAAA 


CGAGATAAAG 


GAACCGCTGC 


GTTAGCAGGA 


19560 


GTAACTGGTT 


ACTTAGTTAT 


GACTGCAACG 


ATCAAAGCTT 


TGGTAAAACT 


777TATGGCA 


19620 


GAAGGATCTG 


CAATTGATAC 


TGGAGTTATT 


GGAGCATTAG 


TTGTCGGAAT 


AGTTGCCCTA - 


19630 


TATTTGCACA 


ACCGATA7AA 


CAATATTCAA 


TTACCTTCCC 


CTTTAGGA77 


C7TTGGAGG7 


19740 


7CACGCT7CG 


TTCCTATTGT 


TACATCGTTC 


TCTTCTATCT 


TCATTGGCT7 


TGTCTTCTTT 


199C0 


GT7ATTTGGC 


CACCTTTCCA 


ACAACTTCT7 


GTTTCTACAG 


GTCGATA7AT 


TTCTCAGGCC 


19860 


GGTCCAATTG 


GAACTTTTCT 


ATATGGATTT 


TTAATGAGAC 


TTTCTGGACC 


AGTAGGCTTA 


19920 


CATCATATAA 


TTTACCCTAT 


CTTTTGGTAT 


ACTGAACTTG 


GTGGTGTTCA 


AACTGTTGCA 


19980 


GGACAAACAG 


7CGTTGCAGC 


TCAAAAAATA 


TTTTTTGCTC 


AATTAGCCGA 


TTTCGCCCA? . 


20040 


TCTCCATTAT 


TTACACAAGG 


AACAAGGTTT 


TTTGCACG7C 


GTTTCTCAAC 


AATGATGTTC 


20100 


GGTTTACCGG 


CTGCCTGTTT 


AGCGATGTAC 


CATAGTGTTC 


CTAAAAATCG 


7CGTAAAAAA 


20160 


TACGCGGGTT 


TGTTTTTTGG 


AGTTGCTTTA 


ACATCTTTTA 


TTACCGGTAT 


TACAGAACCA 


20220 


ATTGAATTTA 


TGTTTCTATT 


CGTCAGTCCG 


CTTCTATATG 


TTGTTCACGC 


ATTCCTTGAT 


20290 


GGTGTTACCT 


TCTTTATTGC 


AGACGTCTTA 


AA7A7TTCAA 


TAGGAAACAC 


ATTTTCAGGA 


20340 


GGTGTAATCG 


ATTTCACTTT 


ATTTGGAATT 


TTGCAGGGGA 


ACCC7AAGAC 


GAATTGGGTT 


20400 


CT7CAGATTC 


CATTTGGACT 


TATTTGCAGT 


GTTTTGTATT 


ATATTATTTT 


7AGATGGTTC 


20460 


AT7ACTCAA7 


TCAACGTTCT 


AACGCCAGGG 


CGAGGAGAAG 


AAGTAGATTC 


7AAAGAAATT 


20520 


TCTGAATCCG 


CAGAT7CAAC 


TTCAAATACT 


GCAGATTATT 


TAAAACAGGA 


TAGCC7ACAA 


20580 


ATTATCAGAG 


CCTTGGGTGG 


ATCAAATAAT 


ATAGAAGATG 


TACATCCTTC 


7GTGACACG7 


20640 


TTACGTGTAG 


CTGTAAAAGA 


AGTTAATCAA 


GTTGATAAAG 


CACTTTTAAA 


ACAAAT7GGT 


20700 


GCAGTTGATG 


TCTTAGAAGT 


GAAGGGTGGC 


ATTCAAGCAA 


TCTATGGAGC 


AAAAGCAA7C 


20760 


7TATATAAAA 


ATACTA77AA 


TGAAATTTTA 


GGTGTAGATG 


ATTAAGTACT 


7ACTGAC77A 


20820 


ATAAAAAACA 


GAGGAGAGTG 


ATGGATGAGT 


AGGATGAAAT 


GAAATCGCAT 


ACAAGAAA7A 


20880 


AAGAACTCAT 


TATCCAAGTT 


GGATACGCTT 


ATTACATAGG 


ACAA7ACAAA 


TGAAA77TAG 


20940 


AAAATTAGCT 


TGTACAGTAC 


TTGCGGCTGC 


7GCGGT7C7T 


GGTCTTGCTG 


CTTGTCGCAA 


21000 


TTCTGGCGGA 


AGTAAAGATG 


CTGCCAAATC 


AGGTGGTGAC 


GGTGCCAAAA 


CAGAAATCAC 


21060 
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TTGGTGGGCA TTCCCAGTAT TTACCCAAGA AAAAACTGGT GACGGTGTTG CAACTTATGA 21120 

AAAATCAATC ATCGAACCCT TTGAAAAAGC AAACCCAGAT ATAAAAGTGA AATTGGAAAC 21180 

CATCGACTTC AACTCAGGTC CTCAAAAAAT CACAACAGCC ATCGAAGCAG GAACAGCTCC 21240 

AGACCTACTC TTTGATGCAC CAGGACGTAT CATCCAATAC GGTAAAAACG GTAAATTGGC 21300 

7GAGTTGAAT GACCTCTTCA CAGATGAATT TCTTAAAGAT GTCAACAATG AAAACATCGT 2L360 

ACAAGCAAGT AAAGCTGGAG ACAAGGCTTA TATGTATCCC ATTAGTTCTG CCCCATTCTA 21420 

CATGGCAATG AACAAGAAAA TGTTAGAACA TGCTGGAGTA GCAAACCTTG TAAAAGAAGG 214 80 

TTGGACAACT GATCATTTTG AAAAAGTATT GAAAGCACTT AAAGACAAGG GTTACACACC 21540 

ACGTTCATTG TTCAGTTCTG GTCAAGGGGG AGACCAAGGA ACACGTGCCT TTATCTCTAA 21600 

CCTTTATAGC GGTTCTGTAA CACATGAAAA ACTTAGCAAA TATACAACTG ATGATCCTAA 21660 

ATTCGTCAAA CGTCTTGAAA AAGCAACTAG CTGGATTAAA GACAATTTGA TCAATAATGG 21720 

TTCACAATTT GACGGTGGGG CAGATATCCA AAACTTTGCC AACGGTCAAA CATCTTACAC 21780 

AATCCTTTGG GCACCACCTC AAAATGGTAT CCAAGCTAAA CTTTTACAAG CAAGTAAGCT 21840 

AGAAGTGGTA GAAGTACCAT TCCCATCAGA CGAAGCTAAC CCAGCTCTTC AGTACCTTGT 21900 

AAACCGGTTT GCAGTATTCA ACAATAAAGA CGACAAGAAA GTCGCTGCAT CTAAGAAATT 21960 

CATCCAGTTT ATCCCAGATG ACAAGGAGTG GGGACCTAAA GACG7AG7TC GTACACGTGC 22020 

TTTCCCAGTC CGTACTTCAT TTGGAAAACT TTATGAAGAC AAACGCATGG AAACAATCAG 22080 

CGGCTGGACT CAATACTACT CACCATACTA CAACACTATT GATGGATTTG CTGAAATGAC 22140 

AACACTTTGG TTCCCAA7GT 7GCAATCTCT ATCAAATGGT GACGAAAAAC CAGCAGATGC 222C0 

TTTGAAAGCC TTCACTGAAA AAGCGAACGA AACAATCAAA AAAGCTATGA AACAATAGTC 22260 

CTTAGTTATT CTATAAAAAG TAGTTTTTTA AAGAACCTAA GACTGTATAC CCCCTTTTCC 2232 0 

CTCTACACAG ATAGTGTAAG AAAAGGGCGC TTTTGTTTAA AATGTAAGAA ACTGTCACGA 22 380 

AATTAAAATG AAGTTCT7AC ATAAGCCAAT CATAAAAAAT TTCATTTTGA TTTTAAAACA 22 440 

GTTCAAGAAA CTCAAAAAAT TATTCTATTT GAAAGAGAGG TGCCGACTCT GAAAGTCAAT i250O 

AAAATCCGTA TGCGGGAAAC AGTGATTTCC TACGCTTTCC TAGCACCAGT ATTATTCTTC 22560 

TTTGTCATCT TTGTGTTGGC TCCGATGGTG ATGGGCTTCA TTACAAGTTT CTTTAACTAC 22620 

TCAATCACTA AATTTGACTT TGTAGGCTTG GATAACTATA TCCCTATGTT TAAAGATCCT 22 680 

GTCTTTACAA AATCTC7GAT TAACACAGTT ATTTTGGTTA TTGGA7CTGT ACCAGTTGTT 22 740 

CTTCTATTCT CACTCTTTGT AGCATCTCAG ACCTATCATC AAAATGTCAT TGCCAGATCC 22800 

TTCTACCGTT TCGTCTTCTT CCTTCCTGTT GTAACGGGTA GTCTTCCCC7 GACAGTTGTT 2 2860 
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TGGAAATGGA 


TTTATGACCC 


ACTATCAGGG ATTCTAAACT 


TTGTCCTTAA 


GTCCAGCCAC 


22920 


ATCATCAGCC 


AAAACATTTC 


TTGGTTGGGA GATAAAAACT 


GGGCATTCAT 


GGCGATTATG 


22980 


ATTATTCTCT 


TGACCACTTC 


AGTTGGTCAG CCCATCATCC 


TTTATATCGC 


TGCCATGGGG 


23040 


AATATTGACA 


ATTCACTGGT 


TGAAGCGGCG CGTGTTGATG 


GTGCAACTGA 


GTTTCAAGTT 


23100 


TTTTGGAAGA 


TTAAATGGCC 


AAGCCTTCTT CCAACAACTC 


TTTATATTGC 


AATCATCACA 


23160 


ACAATTAACT 


CATTCCAGTG 


TTTCGCCTTG ATTCAGCTTT 


TGACATCTGG 


TGGTCCAAAC 


23220 


TACTCAACAA 


GTACCTTGAT 


GTACTACCTT TACGAAAAAG 


CCTTCCAATT 


GACAGAATAC 


23280 


GGCTATGCCA 


ACACAATTGG 


TCTCTTCTTC GCACTCATGA 


TTGCTATCGT 


AAGCTTTGTT 


23340 


CAATTTAAAG 


TACT TGG AAA 


CGACGTAGAA TACTAAAGAA 


AGGAGACAGC 


TATGCAATCT 


23400 


ACAGAAAAAA 


AACCATTAAC 


AGCCTTTACT GTTATTTCAA 


CAATCATTTT 


GCTCTTGTTG 


23460 


ACTGTGCTGT 


TCATCTTTCC 


ATTCTACTGG ATTTTGACAG 


CGCCATTCAA 


ATCACAACCT 


23520 


CATACAATTG 


TTATTCCTCC 


TCAGTGGTTC CCTAAAATGC 


CAACCATGGA 


AAACTTCCAA 


23580 


CAACTCATGG 


TGCAGAACCC 


TGCCTTGCAA TGGATGTGGA 


ACTCAGTATT 


TATCTCATTG 


23640 


GTAACCATGT 


TCTTAGTTTG 


TGCAACCTCA TCTCTACCAG 


CTTATGTATT 


GGCTAAAAAA 


23700 


CGTTTCTATG 


GTCAACGCAT 


TCTATTTGCT ATCTTTATCG 


CTGCTATGGC 


GCTTCCAAAA 


23760 


CAAGTTGTCC 


TTGTACCATT 


GGTACCTATC GTCAACTTCA 


TGGGAATCCA 


TGATACTCTC 


23820 


TGGGCAGTTA 


TCTTGCCTTT 


GATTGGATGG CCATTCCGTC 


TCTTCCTCAT 


GAAACAGTTC 


23880 


AGTGAAAATA 


TCCCTACAGA 


GTTCCTTCAA TCAGCTAAAA 


TCGACGGTTG 


7CGTGAGATT 


23940 


CGTACCTTCT 


GGAGTGTAGC 


CTTCCCGATT GTGAAACCAG 


CGTTTGCAGC 


CCTTGCAATC 


24000 


TTTACCTTCA 


TCAATACTTG 


GAATGACTAC TTCATGCAAT 


TGGTAATGTT 


GACTTCACGT 


24060 


AACAATTTGA 


CCATCTCACT 


TGCGGTTGCG ACCATGCAGG 


CTGAAATGGC 


AACCAACTAT 


24120 


ill un I »n 




TGCCCTTGCT GCTGTTCCAA 


TCGTCACAGT 


CTTCCTAGTC 


24180 


TTCCAAAAAT 


CCTTCACACA 


GGGTATTACT ATGGGAGCGG 


TCAAAGGATA 


ATACTCTGCG 


24240 


AAAATCTCTT 


CAAACTACGT 


CAGCTTCACC TTGCCATACT 


TAAGTATTGC 


CTGCGGTTAG 


24300 


CTTCCTAGTT 


TGTTCTTCAA 


TTTTCATTGA GTA7AGGAAA 


ATCAATCTAT 


CAAGATACAG 


24360 


AAGTATATTT 


TATAGATTTA 


GAGAATATAG ACCTTATAAC 


TGTCTACAAA 


ATGGAGGGTA 


24420 


TGCAGTTACT 


TTATCAACTT 


TTGTCAGACA CTTATAAACT 


TAAGAATGCT 


TTTAGTTAAC 


24480 


TATCAGAAAC 


GAAGGAAAGA 


GTATGATTTT TGACGATTTG 


AAAAACATCA 


CCTTTTACAA 


24540 


AGCCATTCAT 


CCTAATTTAG 


ACAAGGCTAT CGACTATCTC 


TACCAACATC 


GTAACGATTC 


24600 
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TT7CGAATTA GCAAACTATG ATATTGATGG AGATAAAGTC TTTCTAGTTG TTCAGGAAAA 24 660 

TGTCCTCAAT CAAGCTGAAA ATGATCAATT TGAGTATCAT AAGAACTATG CAGATTTGCA 2 4720 

TTTGCTGGTA GAAGGACATG AATATTCGAG CTACGGTTCA CGTATCAAAG ACGAGGCAGT 24 780 

AGCATTCGAC GAAGCGAGTG ACATTGGCTT TGTTCATTG? CATGAACACT ACCCACTCTT 24 840 

GTTGGGTTAT CACAATTTTG CGATTTTCTT CCCAGGTGAG CCACATCAGC CAAATGCTTA 2 4 900 

TGCAGGCATG GAAGAAAAGG TTCGAAAATA TCTCTTTAAA ATTTTGATTG ATTAAAAATA 24960 

GG ATGAATTG TTTTTTTGTA AAGCTTTGAT AATACTCTAC CATGAAATTC ATCTTTGTGA 25020 

GGTAGAGAAA TGAGAATAAA ATATTTAAAA ATTGGTATCT TCTAAGTATG CTGCAAGAGC 25080 

TAGTTTCTTA GATGGACAGG GGATTACAGT TGATGAGATG GCTTGGATAA TTACGCCCAT 25140 

TGTGAATGCA TTCATTGCTA GATACATAAA ATTAGGTACT TATGCCGCTA AGTATGGTAT 25200 

TACTATGGCA CGCTCGATCT TAAGTAGGGT AGCTGCAACT GCAGCAGCAA GAGTAGGATT 25260 

ACTGACCAAG ATTTCTGGAT GGATTTTACG AGTAGCTGTG AATGTAGCTG ATGTATATGG 25320 

TAATTTTCCC AACAATATTG CTGCAGCTTG GCATGCATAT GATAAAATTC CTAACAATGG 253 80 

TCCTATAAAC TTTTAAAATG CGAGAATGAA AGCACTTTGT ATTTTTTTAT TGAATATCTT 254 40 

AGCTTGGACA GTGCTTGCAA TGATAATTCG TCGAGGCCTA GATGGATTTG ATAGGCATAC 2S5O0 

TTGGACTACT ATTTTAATTG CGTCGCTGTT CGGGGTATAT GATTATAAGC CCATAGATAA 25560 

AAATAGAAAA AAGTCCAAAA GAAAAAATAG ATTTGTTCAT GGTAGGGACT TATGAAACCT 25620 

TTACTGACAA AAAAGAAAAC AGTTTACAAA GAAAAATGAT GGAGGAGCAA ACATGGCACA 25680 

AAAAGGAGTA AGCCTTATCA AGGCAGCATT TCATACAGAT AACTTTCTCA TCCCTTTTAG 25740 

TGAGAAGGTC 7TGGACATCG TGACAGCCAA TCTTCTTTTT GTCGTCTCTT GTTTACCCAT 25800 

CGTGACGATT GCAGTGGCTA AAATCAGCCT CTACGAGACC ATCTTCGAAG TTAAGAAGAG 2 5860 

CAGACGGGTG CCTGTTTTTA AAATCTATCT AAGATCTTTC AAGCAAAATC TCAAACTAGG 25 920 

TCTTCAGCTG GGTTTAATGG AGTTAGGAAT TGTGTTTCTT ACCCTTTCAG ATCTCTATCT 2 5980 

TTTCTGGGGT CAAACACCTC TGCCCTTCCA ATTGCTGAAA GCCATTTGTT TAGGTATTCT 26040 

GATTTTTCTT ACTATCGTGA TGCTGGCTAG TTACCCTATC GCGGCACGTT ATGACCTATC 2 6100 

TTGGAAAGAA ATTCTTCAAA AAGGATTGAT GTTGGCTAGT TTTAACTTTC CTTGGTTCTT 26160 

CCTCATGTTA GCCATTCTTG TCCTCATTGT GATGGTTCTT TATCTGTCCG CCTTCAGTCT 2 6220 

ACTCTTAGGT GGCTCAGTCT TCCTACTTTT TGGGTTTGGA CTATTGGTCT TTATCCAGAC 2 6230 

TGGATTGATG GAGAAAATTT TCGCAAAATA CCAATAGGAG CTTTATTTCT GAAACTACTT 263 40 

TCAAAGGCTC CAAACGCTAT TCTATAAGCG AGAAACTAAA ATCGG 2 6385 
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(2) INFORMATION FOR SEQ ID NO: 4: 

{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2716 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D| TOPOLOGY: linear 



(xil SEQUENCE DESCRIPTION: S£Q_IDNO:_4: 

CCTGCCCGCA TTGCCCTAGG CATTAAGTAA ACATATAAAA GCATGTGAGA GACTGTTGGA 60 

AAAGCGAGGA AATTTCCCCT CTTTTCCTCT ACTCTCTCCT TTCTTTTGCT GATTTTATTC 120 

AAACAAAATG ATATAATAGT AGTTATCGAG AAAAAGAAAT TACCCATCAA TATCTTGAGT 180 

TCAAGTGAGA AAGTAGCACG ACAGGGAGTT TCAGGTGCTT ACCGTGAATT AGTTCCTCTT 24 C 

CTTCACCGTG CTGCCAAGGA CCAATTGATT GTTACAGAAA ATCTTCCAA? CCAGGCAGAT 300 

GTGACTCACT TTCATACGAT TGATTTTCCC TATTATTTAT CAACCTTCCA AAAGAAACGC 3 60 

TCAGGGAGAA AGATTGGCTA TGTGCATTTC TTGCCAGCTA CACTTGAGGG AAGTTTGAAA 420 

ATTCCATTTT TCTTAAAGGG AATTCTGAAA CCCTATGTAT TTTCTTTTTA CAACCGGATG 4 80 

CAGCACTTGC TTGTGGTCAA TCCTATGTTT ATTGAGGATT TGGTAGCAGC TCGTATTCCA 540 

CGTGAAAAAG TGACCTATAT TCCTAACTTT GTCAACAACG AAAAATGGCA TCCTCTACCA 600 

CAAGAAGAGG TAGTCAGACT GCGCACAGAT CTTCGTCTTA GTGACAATCA GTTTATCGTA 6 60 

GTAGGTGCTC GGCAAGTTCA GAAACGTAAA GGGATTCATC ACTTTATCCG TCTCGCTGAG 7 20 

GAATTGCCTC AGATTACCTT TATCTGGGCT GGTGGCTTCT CTTTTGGTGG TATGACAGAT 7 90 

GGTTATGAAC ACTATAAGAA AATTATGGAA AATCCCCCTA AAAATTTGAT TTTTCCAGGC 840 

ATTGTATCGC CAGAGCGGAT GCGCGAATTG TATCCTCTAG CGGATCTTTT CTTGTTGCCT 900 

AGTTACAATG AGCTCTTTCC TATGACTATT TTAGAAGCTG CGAGTTCTGA GGCTCCTATT 960 

ATGTTGCGTG ATTTAGATCT CTATAAGGTG ATTTTGGACG GAAATTATCG GGCCACAGCG 1020 

GGTAGAGAAG AGATGAAAGA GGCTATTTTG GAATATCAAG CAAATCCTGC TCTCTTAAAA 103 0 

GATCTCAAAG AAAAGCCTAA GAATATTTCC AG AG ACT ATT CTCAAGAGCA TCTGTTACAA 1140 

ATCTGGTTGG ACTTTTATGA GAAACAAGCC GCTTTAGGGA GAAAGTAAAA AGTGAGGTAA 1200 

TCTATGCGAA TTGGTTTATT TACAGATACC TATTTTCCTC AGGTTTCTGG TGTTGCGACC 1260 

AGTATTCGAA CCTTGAAAAC AGAACTTGAA AAGCAGGGAC ATGCTGTTTT TATCTTTACG 13 20 

ACGACAGATA AGGATGTCAA TCGCTACGAA GATTGGCAAA TTATCCGCAT TCCAAGTGTT 1390 
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CCTTTCTTTC 


CTTTTAAGGA 


TCGTCGCTTT 


GCCTACCGAG 


GTTTTAGCAA 


GGCACTTGAA 


1440 


ATTGCTAAAC 


AGTATCAGCT 


AGATATTATC 


CATACTCAGA 


CAGAATTTTC 


TCTTGGCCTG 


150C 


TTGGGGATTT 


GGATTGCGCG 


TGAATTGAAA 


ATTCCACTCA 


TCCATACCTA 


TCACACCCAG 


1560 


TATGAAGACT 


ATGTCCATTA 


TATTGCTAAG 


GGGATGTTGA 


TCCGGCCGAG 


TATGGTCAAG 


1620 


TATCTGGTTA 


GAGGTTTCCT 


GCATGATGTG 


GATGGGGTTA 


TTTGCCCTAG 


TGAGATTGTC 


1680 


CGTGACTTGC 


TATCTGATTA 


TAAGGTCAAG 


CTTGAAAAAC 


GGGTCATTCC 


TACTGGGATT 


1740 


GAATTAGCCA 


AGTTTGAGCG 


TCCGGAAATC 


AAGCAGGAAA 


ATTTGAAAGA 


ACTGCGTAGT 


1900 


AAACTAGGGA 


TTCAAGATGG 


TGAAAAGACG 


TTGCTTAGTC 


TTTCGAGAAT 


CTCCTATGAA 


1860 


AAAAATATTC 


AAGCAGTTTT 


AGCAGCCTTT 


CCTCATGTTC 


TGAAAGAGGA 


AGACAAGGTT 


1920 


AAACTGGTAG 


TACCTGGCGA 


TGGCCCTTAT 


CTGAATGACC 


TCAAAGAGCA 


AGCCCAGAAC 


1980 


CTAGAGATTC 


AAGACTCAGT 


CATCTTTACA GGGATGATTG 


CTCCTAGTGA 


GACGGCTCTT 


2040 


TACTATAAAG 


CGGCGGATTT 


CTTCATTTCG 


GCATCGACAA 


GCGAAACGCA 


AGGTTTGACC 


2100 


TACTTGCAAA 


CCTTAGCCAC 


TGGAACACCT 


GTCATTGCTC 


ACGGAAATCC 


TTATTTGAAC 


2160 


AACCTCATCA 


GTGATAAAAT 


GTTTGGAACC 


TTCTACTATG 


GAGAACATGA 


TTTGGCTGGT 


2220 


GCTATTTTGG 


AAGCCCTGAT 


TGCAACACCA 


GACATGAACG 


AGCATACCTT 


ATCAGAGAAA 


2280 


TTGTATGAGA 


TTTCAGCTGA 


GAACTTTGGG 


AAACGAGTGC 


A7CAGTTTTA 


TCTGGATCCC 


2340 


ATTATTTCAA 


ATAACTTCCA 


GAAAGATTTG 


GCTAAAGATG 


ATACGGTCAG 


TCAGCGTATC 


2400 


TTTAAGACAG 


TTTTCTATCT 


TCAGCAACAG 


GTGCTTCC7C 


TACCTCTAAA 


AGGATCTAGA 


2460 


CGCATGTTGA 


AGGCTTCAAA 


AACACAGTTG 


ATCACTA7GA 


GAG ACT ATT C 


GAAAGACCAT 


2520 


GAAGAATAGA 


AAGAGGAACA 


GCTATGAAAA 


AAACAATTAA 


TGAGAAGCGG 


TCGTGATAAA 


2590 


AAGATTGCGG 


GTGTTTGTCC 


TGGGGTGGCC 


CATTATCTGG 


ATATGCATCC 


GACTATCGTT 


2640 


CAAGTCATTT 


GGGGTGTTCT 


TACTTGCTGT 


TACGGAGCTG 


GAATTGTAGC 


TTACATTATT 


2700 


TTATGGATTA 


TCGCGA 










2716 


(2) INFORMATION FOR SEQ ID NO: 5 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NC 5: 
CTTTGGTTTT CCCTTATTCA ACACATGAGG CCCATCAGCA ATCA7CTCAA ACTGCGAATC 



60 
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TGTTAACAGT 


CTATGGAGAG 


CTTTCATAGA 


ACTAAGATTC 


GGTT7ATC7T 


TGCTGCCACA 


120 


AATTAGTAAG 


GTTGGATAAG 


GGTAAGTTCC 


TGCTATATCC 


GTTAAATCAA 


GTGTCTTCAA 


180 


CTCCTCAGAA 


ACTCCGACCA 


TAAGAGTCTT 


GTCTGCTCCC 


TGTTTTTCAA 


ATACTCTTTT 


240 


CCJ^ A AfTT XC,T 


TTAAAAATCA 


GCAATTGAAG 


ATAAAATAGG 


ATATTCCCTG 


CTAATTTAAG 


300 






AAGCTCGAAG 


ATTTGGTAAA 


TCGTAACTGG 


AAAGTTCTAG 


360 






ACAATCCAAT 


_C AAAAC AAAA 


GGTTCTGTCT 


CTTGAGCTAG 


420 


Vj 1 \jC 4 Vjrt i Art 


AC^CGCTCTT 


TAGCTTGTTG 


ATAGTTACTA 


ACTCCAGAAG 


GAAATAACTC 


480 




GAAGCATAAT 


CTGTCAGTAG 


ATTCCGAACT 


TCTTTCCAAG 


ACTCTGCTGA 


540 


CTCCCCTXAC 


C C ATGC AAAA 


ATATTAATTT 


CATCTAGTTC 


TCCTCAAGGC 


TTAAT7CATA 


600 






AGCCGTAAAT 


AGCTTCTGCT 


TGGGTTAAAT 


CTGCCAAGGT 


660 


r* A Af^ APTTTr" 


TCTTCTACCT 


GTCCTGTTTC 


TAGCAAATGC 


TGACGGTAAA 


TTCCTGGCAA 


720 


f* ATTCP AAG* 1 * 


CGCATAGGCG 


GTGTGTAGAG 


TTTTCCAGCG 


ATTTTCACAA 


CCAAATTTCC 


780 


i rt i AuAuu I * 




CTCCTCACTT 


ATTGTGGTAA 


ATCTTCTCTT 


GTTCTCCTAG 


840 




GGTCCCTCAG 


TGGTTTTAAA 


GTAGGTAAAG 


GATTGATTCA 


AAGCAGCTTC 


900 


rTf-AAGAf aci 


ACTTGGGCCT 


GACAAAAGCT 


TGTACTGAGA 


GGGGTTAATA 


CTTGACGATT 


960 






TGCTAAGGCT 


GATTCGCAAG 


CGGTAATCTC 


GATTAGCTTC 


1020 




CACTCTTCCT 


CAATCTTGTG 


TCCCAAGTCT 


TCTCCATCAA 


AACGAAAACC 


1080 


/WW 4 AAL oM 


CTAGCTTTTC 


TCAGCCTTTC 


CAGATGTTGT 


TCTTCAAACA 


TCAGTTGTTT 


1140 


TTrrfTT ITT 


TTTrC AGTTG 


TAATTAATTG 


GAAGCGAGCT 


TCTTTACCAT 


ACAGAACTGC 


1200 


I vjL <_ « i i i un 


TGAACCTCTC 


GGTATTCAGA 


TTCCCATGTG 


CTATCCCAAC 


TAATCCCTCC 


1260 




TAAATGGCTT 


GACCTTTGTG 


AAGTTGAATG 


GTACGAATGG 


CC AC ATT AAA 


1323 


AATCCGTCGT 


CCATTTGGAA 


GCAAGAGACC 


AATCGTTCCA 






1380 


AGGCTCCAAG 


TCCTTGATAA 


TCTCCATTGT 


CGCAATTTTC 


GGTGCACCCG 


TTATGGAACC 


1440 


ACAAGGAAAG 


AGTGAGCGGA 


AGATTTCAAC 


AACGTCCACA 


TCCTCTCGCA 


ACTGACTCTT 


1500 


GATGGTCGAA 


GTCATCTGCC 


AAACAGTTGA 


ATACTGCTCT 


ACCTGACACA 


GACGCTCCAC 


1560 


CTCCTCCCTC 


CCAACTTCAG 


AAATACGGTT 


CATATCATTG 


CGCAAGAGGT 


CCACAATCAT 


1620 


CATATTTTCA 


GAGCGATTTT 


TGCGATCCTC 


7TCCAACCAA 


CTGGCC7CTT 


CAAGATCTTC 


1680 


TTGGTCACTT 


ACCCCACGCT 


GAGTCGTCCC 


CTTCATTGGT 


CGTGTTGTCA 


ACTCGCGATC 


1740 


ATTTTGCTCA 


AAAAAGAGCT 


CTGGGCTCAT 


GGAAATCACT 


GTCATCTCGT 


CATGTTCCAC 


1300 
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ATAGGCATTG 7AGCCCGCCT CCTGCTCTAC CACCATACGA TTGTAGATGG CAAAAGGATT 1960 

GGCATT7AAC TTTTGCTTAA GTTGGACGGT GTAGTTGACC TGATAGGTAT CTCCCTGCCG 1920 

TAAATGATGG TGAATTTGGG CAATGGCCTT TTCATAGTCT GCTGCAGACG TTACTTCCTG 1980 

CCAAT7TGAG GGCAAATCAA TATCCTCATA AGTCAGAGGA ATAGGGGAAG TTTCTACGAT 2040 

ATCATGAACA CTAAAGTAAA CCAGGTAC7C TCCCAGTAGG GGATCCTTCT GAACTGC7AA 2100 

TTTTTCCTCA AAAGCAGGTG CAGCCTCGTA CCTGACATAC CCCACCACAT AATAACCTTG 2160 

CTCTTGGTAG CTTTCCACTT GTGCCAGCAA ATCTGCCACT TCTTCTACAT TTCTCGTTTT 2220 

CAACTCTTTA ATAGGCTGGG TAAAGGTATA TCTCTCCCCC AAAGTCCTAA AA7CAATCAC 2230 

TGTTTTTCTA TGCATACCTT AAGTATAGCA TAAAATAAGA AAACCCTCAT CCGCAAAGCA 2 340 

GATGAGAGAT TTCAATTATT TAAAGATTGA AGTTTTAAAG CTATTTGTTT GTTCAAGAAG 2400 

TTTCTTATAA ACACCT7CTT TTAATTTAAC TGTATTATTC ATAGATACTG TTTTATTACC 24 60 

GTTTGCTTCT TGTTTAAGAG TTTCGGCATC TTTTTTAACA GCTTCTTTAA ACAATGTCAC 2520 

TAAATCATCG TATGATGAAA CGGAAGAACC ATTTACTTCG AATGTTGTTA ATCCTTTCGT 2530 

7CC7TTATCT TTAACTTCTT TGAAGTAAGC TTTTTTAAAT 7CTTCAATAC TATTAAATGT 2640 

A7TCTTAGAT ATTTTCTTGA TAATATATTC ATCACTTAGA ACAGACTCAC CATCTGTTTT 2700 

AGATTGTTGT TTATATTTAT TTGAAGCATA ACCTAAGAAC CCATTTTCGT ATCCCTAGTA 27 60 

ACCCCATAAT CTAAAAGCAT TATGTTTGAA TGAAACAGCT CCAGGAGCAC CTTTACTAGT 2820 

ATTACCTCCC TAGATACCGG TCATCATTCT AACACCTACA TAAGGTGATT GATCGTTATA 2880 

GCTAATTGCT TCCGGTTTAT AGA7ACCATT ACC7GGAT7G CGATTAGTCA TTAATTGTTG 2940 

ATCAACTAAA TCATTAACAG ATTGAATATT TAATTCATTT TTCTCTTCTT GACTTAGATT 3000 

TCGAATTTTA TCCCATTGAT TTAATTTATT GTTATCACGG TATTCTCTAT CTATTTTTTT 3060 

GAACCATGCA CTATTTAAAT CTTTATTTTG TTGAGAAATC ACAGATTCAG CCTCAATTTC 3120 

ATCAAGAAGA CT7AAAG7GT CATTATAACC CTTCATATAT CTATTAATAT CTTCTCGTGT 3180 

TTTTAGAGTT TTTGGATCTG TAATATACCA CTGATTCCCA TCATTTTTGC GTT7AAATAC 3210 

CATATTAATA CCTAAAGAAC CAAACTCATC AAATCCACTA CCAGTAACAG GAGTTTGTAG 3 300 

CATACCCTGA GCATATGCTT CAGCATCAGT ACCTTCACGG TGTCCAAAGC CACCTAAGTA 3 3 60 

AATCGCACGG TCGTTGACGT GTGTTGTTTC ATGTGTGTAA ACTGAAATAC CGTATTCACC 34 20 

AACCATTTCT AAATGAACAT ATTTTACATC AGTTCTAATA TCATCAGAGT TAGGATATAT 34 80 

AGCAGCA7AA GCTCCTGTTC CATTATAATT ATAATACTTA TCCATAGGAC CAAAGAATTC 3540 

TCTAAGAGGA GTATATACTT TGTCGG7ATT A7AGCGGCCA TATT77TCAA CCCATCCACC 3600 



WO 98/18931 PCT/US97/19588 



177 





AAATAGGAAT 


AACAGCATCT 


CTTACTACTC 


GTTGTTTAAC 


3660 


PTTATfAGAC nr^AGACGAT 


ACCAGAAATC 


ATAATACTTT 


CTATAACCAT 


CTGCAGCTTT 


3720 




CTTCTAATGA 


TTTTTTACCT 


AATCGCTCTG 


CACTACCAAA 


3780 


GGCAA 1 A 1 1 A 1 Art 11>« 


AAATTAAATA 


AAGATGTGCT 


TTATCAATAT 


TCAGTAGTGG 


3840 


GAGTATAGTA rrTV- I AAUU 1 


n & PTT ("I TTT 
\jn\. l, i \.u i t i 


TAAATTATCG 


AATGCACGAT 


GTTTAGAATT 


3900 


TTT AA TT I L 1 rLOAUULUAij 




.TGCGATGTAG. 


ACATGGTCTT 


CTGTAGCATC . 


. 3960 


AATAAACCAA TCGTTGA I A * 


Tf.TPT AT A'T'T 


TGTGAACAAT 


TGTCTATTAT 


AATTTAAAAA 


4020 


TGCATL ; I AAA riAH-lviA* I 


TAfiTATATTT 


AGCCAATACT 


TGACCGAATG 


CGTCGAATGT 


4030 


ACGTGAACGT u AA 1 L» 1 1\j * 


TITPTT 1 ' AG A 


ACCGATTTCA 


ATTAATCTGT 


CTAATACGCT 


4140 


AAC1 1 I I 1CA Lt- A 1 Aw AAA i 


fT*n f^T' PTG AA 
v_ l vjo ill unn 


TAGCATTAAT 


TCTTTAATAT 


TAACATCACC 


4200 


AAA! I IAAL.* LUrti AulnAL 


HATTTAGGTA 


AGTTAAACCT 


AGTAATAAAG 


CTCCTTTGTT 


4260 


TTTCTCGACT TTATLAI.UAA 


TCATTTGACG 


AGCACCTGCA 


GAATCATTTA 


GTTGATGTTC 


4320 


TTCGTTTTGA AH AA I I I 


TGATTAGGTT 


TGTTAAGTTT 


TCTTTAACAT 


CTGTGAAGCT 


4330 


TTCTTCtAAA TAT AAA re L I 


TGATTGCATT 


AACTCTATAG 


TCACCTAATC 


GATTTAGATG 


4440 


CTGATACATC OlilwAwnt i 


GAAGCTCTAC 


TGATTCTAAA 


ATAGATTTTA 


TATCATTAAC 


4500 


AAGAGT AGTG TT A TV- i I L 1 * 


GAACGATATT 


AGCTGTATAT 


TTAATTCCTA 


AGTCAGTTAT 


4560 


AGTATATTCT 1 1 1 ALA i I A\- 


TTAAACCTTC 


ACTGCTAGAA 


GACAAGTtAA 


AGTAATCTTT 


4620 


TGTACCG l C- Gl_ A TAO I u AA 


CAATAATTTT 


ATTAGCTTCA 


TCTAGGTTTG 


TGATAAACTC 


4680 


AT * GTTG RL A I v_wt_wu 1 AA 


CAGAAAGAAC TTCTTTAGTA TTTAGATGGT 


GTTCTTTATT 


4740 


TAATTTATTA ccttgatata 


CAATATAATC 


TTTATTGTAG 


AATGGTATTA 


ATTTTTCAAG 


4800 


ATttttATAG GCTTGGTTAT 


ATTCAGCGTT 


ATAATCTTGA 


ATACTAGAAT 


AGGCTTTTTC 


4860 


TTCATTAAGT TTTGCAAGAG 


GAGATAGATC 


ACTTTCTAAT 


TTATCAGCAG 


T AATATTGAA 

1 nn 1 l l wnn 


4920 


AGTAGTAACT TTAGCATCAG 


CTTGTTCTTT 


AGTTAATTTA 


GTAAATGTTT 


TAGATTTCCT 


4980 


AAATCATCTA TtACCTCACG 


AATATCCCTC 


TACCGCATAT 


AAATCTTTTA 


TATGAGCACT 


5040 


AGCATAATCA GAATCATCAA 


CGTCGTTAGA 


GCCGAATAAC 


7CCTCTCCAC 


GGATAATCTT 


5100 


AGCATAGCTG ACAGAATTAC 


TTACCGTACC 


TACAGGCCAA 


CTCtTACTtG 


CTATTGCTCC 


5160 


AACTTgTACT CGATTTGAAA catctatttt 


ACCTTTTACA 


ACCGACTCAG 


TTAGGAGAGC 


5220 


ttttgtacca ATAAGATGGT 


CTAGAGTTAA 


TCCATAATCT 


ACTTTAGGAA 


C7AACAAGCT 


5280 


GGCGCGTGTT TTGTTTCCTG 


TAATAGTAGC 


ATCAACATAT 


GCTTTTCTAA 


CAATTCCTCT 


5340 
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ATAGTTTGTA CCTGCAATTC CCCCTGTATG AGAGCCATTT CCACTTGTAC AGTGTAGTTT 540C 

GCCAAAGAAA GCAACATTTT CAATACGAGT TCCATCATTC ATATTATTTA CAAATCCAGC 5460 

AACATTATTA CGACCTGAAA GTGTGCCTGT AATTTTGACA TTTGTAATAA CTGAAGAACC 5520 

TTTCATAGTA TTGGCTAATG ATGCAATATT ATCTTGACCA GAACGTTCTA TCTCTACATT 5580 

TTCAAAATTC ACATTATTTA TCGTTGCGTT TGTTATCACA TTAAATAATG GATGTTCCAA 5640 

TTCAGTAATA GCAAATTGTT TTCCTTCAGA ACTTAAAAGT TTTCCTGTCA ATTCTTTAGT 5700 

GATATATGAT TTTCCATTAG GAACAACATT 7CTACCGCTC A7TGATTCTC CC AG AC GAT A 57 60 

TTCTTTTGAA GGATCGTTTT GAATAGCTTC CACTAATTCT TTGAAATTAT AATATACATT 5820 

ATCTTCGTGG ACTTTAGGTT TTTCAATATA GTGAACGTAT TCTTCTTCAA ATTTATTATC 5880 

AGCAGTTCTA GAG ACT AAA? TGTCTGCGAT TGCTGTAACT TTATATACAC GTGTTCCGTT 5940 

AACCGTAGTT TCTTCTATAT TTTTAACAGC TAGTAATGTA CTTTTCTCAT TATTTGAAGT 6000 

TATTTTTAAA TAATAATTGC TCTTATCATC AGGAATAGTT GTTATCAGTG ATTCATTAGT 6060 

TTCTTTTCCA TTTTCGTATT TGATTAAATC TGTACGTTTA ATATTTTTAA GCTCAACTTT 6120 

TTTAAGATCT AATTGAATAT TTTGATTTTC TAGAGTTTCA GTTTCTTCAC CGTTACCTCT 6180 

CTCCTAAATC ATAGTTCTAG ATACGGTGTA TTCTTTGTAG TACTCTAGGT TCTTAAATGC 62 4 0 

AGCGCTTATA GTTTCTGTTG TTACCTTGTC ATCTGTAACG ACTACAGTAT TAATAACTTC 6300 

TTCTCCTTTT TTCAATTCAG CTGTGATTGA TTTGATTTTT GTTTTGTTTT GATTTTCTAG 63 60 

AGTATACTTA GCAACAGCTT CACGTTCCAA TATTTTCTTA TCGCTACTAG TCAATGTTAA 64 20 

TATTCCCTTT TCAGATAATT CAACCAATTT TTCAATACTT GCACTTAATT TTTCAACAGC 64 80 

TTCGTTAACT TCACTTTCTT TAGCATCTGT ATTAGCTGCA ACTTTTTCAG CCTTTGTAAC 6340 

TTCACTTTGC AGGTTTTGCC AACTTCTATC ACTGTAATCT TCTTTTACCT TTGTTTTTGC 6600 

ATCTGCAATC GTATTGTTTA ATTCAGTTTT ATCAACCTTT AGAGCGTCAA TAGCCCTTTT 6660 

AAGTTTATTT GTCTCCCTAT TTACCTCAGG CTCTTTTACA GGCTCTCAAG CATAGACACC 67 20 

TTTTGCAGTT TCTAAAACAG GTCCAAGAGC ATTGTAACTT GCTGTAGAAT AATCAGTAGG -57 80 

AGAAACTGAA CTAGCTTTAT CAATTTGATT ATTTAACTCA CTTTTATCAA CTGGTTCTTT 684 0 

AGTACCAATA CCCTTTATTT TATCTTCTGG TTTCGGTGTT TCCTCTACAG CCTTCTCTTC 6900 

TTCAGGAACT TCTGGTTGCT TTTCTCGCTC AACTGGTGCC GTTCGTGCCT GTTCGTCTTC 6 9 60 

TCTTGGCGCG ACTGGTTCAC CTCCTTGTTC AACTTTTGGT TCCTCTGTTG GTTCTGTTTC 7020 

TTTTTCTACA GCAGGCGTTT CAACTTTTGG TTGTTCAATA GATTGATTAA CAGTCTCCTC 7 080 

TTTTGGTTCT ACAGTTTCTT CAGCCTTGGT ATCTGCAGTT GACTCTTCTT GTTTCGGTGT 7140 
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TTCCTCTACA CCCTTC7CTT CTTCAGGAGC TTCTGGTTGC TTTTCTGCCT CGACTGGTCC 7 20C 

CTTTTCGTCT TCTCTTGGCG CCACTGGTTC ACCTGCTTGT TCAACTTTTG ATTCCTCAGC 72 60 

7GCTTTC7CT GATCGTTCAC TTTCTGGCTT AACTGCTACT TTTTCCTCTG GTTTTGACTC 7 320 

AACTTCTCCA CCTAC7TC7T CAACTGGAGC TCGTTCTCCT GAATCTTCTT TCCCCTCTTC 7 380 

TACTTTAGGA AGGGTGTCGT CAGTAGCTTT TACCTCCGAT TTTCCTTCTT CCTTTGGACT 74 40 

TTCTTCTGTT TTAGGTGCTT CTTCTTTTGG AGCTTCCTCT GTCTCTACTA CTTGGTTTTC 7 500 

TGTCCTAGCT TGCTCCTGAT TTGTTATTGA TTGAGGAGTC TCAACTTCGA CCACAGTCAC 7 56C 

CTCTCCAGGT T77GC7GAGG TTTCTTCTAA AACAGTGTCC AAGCCAAGCG 7T7TGAGGAT 7 620 

G7CACCTGAT AGATAACCAA CATAGCGATA GCCCTCCATT TCAACAACAC CCTCTCCACT 7 680 

AGCCAGCGC7 AGGGTCGCAA CTGGGTCTAC AGCCCC7GCA CTAGGAAGAA CTACCAATCC 7740 

CATAGCTCCA AC7AGAAAGA CGCTAGCAAT TTTCTTTCTC TTGTAGATTA AAAGCAAGCT 7 800 

« CCCAACAGTC AGCAAACCAA AAGCTGTCAA AACAGATGCT TCTGTCCCTG TTTGAGGCAA 7 8 60 

CTGATCTTTT TGATACACCA AACCATATAC AACTTCATTC CTGTCACCCT 7TCCTGTC7G 7920 

AATTAAA7C7 T7ACC7TC77 GTGAAATAAT CTCTTTAT7T ACATAGTGAT AGGTGGCTGC 7 980 

G7CCACTACA GAAGGAGCCA TCAAAAGGCT TCCAAGAAAT ACAGAGCCTA CAACTCCC7T 8040 

AATCTTACGA ATTGAAAAAC GGTCTTTTTT AAACACTTTT ATCTCCTTTA TTCATTCTCA 8100 

AAACTTCCTA ATAGCATCTT GCGGATAGTG CGCACGCCCA CCTCCGATTA ATTTTGCACG 3160 

ACTAGCCAGT GCCGT7ACAT GGGCATGACC AATCTCTCTC AAAATAGGGC GAATCGGAAC 8220 

CTGAACATGC TTGACATCCA TGCCAATTGC AGTCTCTCCG ATATCCAA7C CAGCATGAGC 82 80 

CTTGATAAAT TCAACCTCAA CTGGATCCTG CATAAAC7TA AAGGCTGCCA ACTGCCCCGA 8340 

ACCTCC7GCA TGAAGAGTAG GATGCACACT GACAATTTCC AGACCAAAC7 GCTCTGCCAC 84 00 

CTGACG7TCA ACAACGAGAG CCCGATTGAC ATGCTCACAA CCTTGAAC7G CTAAATGGA7 84 60 

ACC7CTACTA CC7AGAATAT CCAAGATAGT C7CCACTATC AGCTCACCAA TC7CTTGACT 8520 

GCATTC7T7C CC.\A7A7GAC CACCTAGCAC C7CACTAGAA GATAGACC7A AAACAAAAAG 8 580 

GGCCCCCTGC TTCAAA7TGG TCTTTTCTAA AACATC7TCC ACTACCTGAC GTGTTTC7CT 864 C 

TTGAATCTGT G7C7CG7TCA TCTCTGTTAC C7CTGT7G7C ACTC7TCTAT CATACCGTTT 8700 

7TTC7TG7TT TTAGCAAGA7 AGACAACC7A GAAAGTTTGC CCAA77ACGC A7AAAAC7CC 87 60 

CAGAAT7GAC TGGGAC7TAG CTAGTTTC7A 77CTAT77A7 A7ATA7TTCA AC7TTCCTCC 882C 

CTTTTTGGCG TCTAGAATCA ATCTTCATAT GG7AATTGGC TCCAAAA7GA AG7TTGAGCC 8830 
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CTTGATCGAC 
CAGCATCTTG 
TCTGGACAGA 
CATTTTCTAC 
TTTCATTAAT 
GGCGGACATG 
AGCGGAAATA 
CAGCCATCCA 
AAAGGCCTTG 
ACTGATCAAT 
CACCAACTTC 
TCAAATCC7T 
CAAGAAGAGA 
C7AACTTTTC 
GACTGACGTA 
TAGCCTCCAT 
GGTTTTCATT 
CrrCCAGAG? 
CAACAAGTTC 
AAACAGGCAT 
AGGAAGTTTT 
GCACAACAGT 
CCTCGACCTT 
AACCAGTCGA 
TGATGGTCGT 
AGTAGAAAGT 
TTCTAACTAA 
CACACCTGCA 
CTCTGCGATC 
ACGTTCTCTC 



ATTTTGAAGA 
GAAGCCAACG 
AAGTTTAATA 
AAGGGGTTGT 
TTCGTATTCC 
ATTGATTTCG 
GGTTGCCAAG 
GATGATGGTG 
AAGTTGGTAC 
CTGATCCAAC 
CTTGGCACGA 
CAAAGGAGCA 
TGTGACACTG 
CAATGATGAC 
GGATTTGTGA 
TTTGCTAGAC 
GATAATGAAG 
TTCATAAGAA 
TTGAGTGACA 
AGCTCCCTGA 
CATCTGCACA 
TTTCAAGTCC 
GTCTTGACTG 
GGTGGTTTCT 
TTGGCTGTTG 
TCCAACCAGA 
AAGAGAAGAA 
ATCTGCTTAA 
TCATAAATCT 
ACCAGATAAT 



CCAACTCCCC 
CCATCATCCT 
TGGCCCTGAC 
AGGACCAGCT 
AGCTTATCTC 
TCAGAGAGAC 
GACTTGGTCA 
TCCAAAGTGT 
TGACGGGTCG 
ATACCATTAA 
AGATTTTGAG 
ATCCAGCGTT 
GCCCCAAGCA 
ACGCCAAGCA 
CCAGGAGTAT 
GAACTATAAA 
GCAAAGCCCT 
ATATCCAAAC 
GAAATGACCC 
TGAATGGCCT 
CTCTCATCTG 
TTATCTGACT 
GGATTCTCAG 
AGTTTTTTGA 
CCCTCAATGG 
GCTAGGAGAA 
CGCTTCATCG 
AACGTTGGGT 
TCAGATCTGT 
CCTGAAAAGG 



130 
CACGTTTGAG 

CAATACGGAT 

CTTCCTTTTC 

TGGGTAAGAC 

CATAGCGTTG 

AAATCAAGTC 

CCTCCACCAC 

TATAGAGGAA 

TTTCTTCCTG 

ATTGGCGAGT 

CACCAGAAGC 

TAAGACTGAA 

AGGTCCACAA 

CCGTCCAATC 

AACCCTGACC 

CTCTGTGTTG 

GCTGCCCCAA 

GAAGCACACC 

ACTGACTATC 

TTTGGTACCA 

TAGAAATGAC 

TCAAGATGGT 

CATAGGCCAG 

TATAAGACTG 

TGGCCTCAAT 

TGAGAAAGAC 

GTCTTCTCCC 

AAAATAGTTC 

AGTTAAAAGC 

CAAGCCCAAC 



TTGfte^TTGA 
GACCAATCCC 
CTTAATGCCA 
TAAATTATCA 
TTTCTGGATA 
CTTGCCTTGA 
TCGCTCACTA 
ATGTGGATTA 
GCTACGAATA 
TACTTCTCTC 
AATTTCCAAC 
CCACACTAAG 
GAGCTGACTC 
AGTTCCTGCA 
TCTATCGATC 
AGGATGGTAG 
CTGGAGTTGA 
AAGATTGGCT 
TGATTTACGA 
ATCCTCAGCC 
CTGACCAGAT 
CAAAAACAAA 
AACATCCCTC 
AATAAAGTGG 
GGCTGAAGAA 
CAGAAAGATG 
TTCTTAAACT 
ATATCTTCAA 
AAGAGCTTCG 
TCTTTCTTAA 



CTACTATCAC 
GAATCCTGTT 
TGGTAAAGAG 
AACGCAACAT 
AAGAGATACT 
TTGAGCGCCA 
TCATGAAATT 
ATCTGGCTCG 
GCTACCATCA 
AGTTCATAGG 
ATGG7TTCTC 
CAGAGACAGA 
CGAACCTGGT 
ATCTTCTCTT 
TACCGTTTCA 
ACAAATTCAT 
TTGAGATAGG 
CCCTTTGCAT 
GCTGGAGTCA 
ATCATATCAG 
TTGGTCACCA 
TCTCGGATTC 
TGCTGGG7CA 
CTAGTCTGGC 
CTTGATTGAT 
GAAATAACCA 
GACGAGGTGT 
AACCAACCTT 
CTTGTTTAAC 
TCAAGGAACT 



8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9730 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
1033C 
10440 
10500 
1056C 
10620 
10680 



WO 98/18931 



PCT/US97/ 19588 



181 



CAGATAGGTC 


GGACTAAAAC 


CTAAGTCACT 


GGCTAAAGAC 


TTTAAACTAA 


ATTGCCTATC 


10740 


AGCCAGATGA 


CACTGGATTT 


TCTGGGCCAT 


GTTTCCTTCA 


AACCTATTAG 


TCAATAAATC 


10800 


TTGTAACTGC 




CTTCCTTGTC 


TAGTTTTTGT 


TTGATTTTCC 


CCAACATTTC 


10860 


CT C AAT AT CC 


TGACGAGAAA 


AGGGTTTGAG 


CAGGTAGTCG 


TCCACACCTA 


GTTTGACAGC 


10920 


AGAPAACGCA 


TAA m CAAAAT 


CATCGTAACC 


TGTTAAAAAG 


ACCAAATGAA 


CCTGAGGATA 


10980 




ACCAGACTGG 


CCAACTGGAT 


GCCATTTAGA 


TGAGGCATGT 


TGATATCGGT 


UC40 


T AA AATP.A"" A 


TCTGGCACCT 


GCTTTTGGAT 


CAATTCCCAA 


GCCTGCCTTC 


CATTTTCAGC 


11100 


rrjirfn atp 


AT' T *'" , PPA' T% AT 


CGTAGGCTGC 


TACATTGACC 


ACTTTAGTCA 


AACCTTGTCT 


11160 


TAPPAGATAT 


TCATCTTCTA 


CGATTAAGAT 


TGTGTAGGTC 


ATGCTCTCCT 


CCTTTACCAC 


11220 


TTAPTAC5TAT 


CAGTATAGCA 


AAATTCTCCT 


CTAACTGCTT 


AGGAAAGACC 


TCTTATACTC 


11280 


AATAAAAATC 


AAAAAGTAAA 


CTAGGAAGAT 


AGCCACAGGT 


TTCTCAAAGT 


ACCGCTTTCA 


11340 




AAAACTGACG 


AAGTCGACTC 


AAAGTATAGC 


TTTGAGGTTG 


TAGATAAAAC 


11400 


m PAPGAAP.TC 


GATAACCCTA 


CATACGGTAA 


GGCGACGCTG 


ACGTGGTTTG 


AAGAGATTTT 


11460 


PP.AAPAP.TAT 


TAATCAACAT 


AATCTAGTAA 


ATAAGCGTAc 


CTTTTTCTTC 


CATTTGGTCT 


11520 


Twprrr A A HP A A 
I 1 tjuunn I >\rt 


APPPP.ATAGA 


GAGGCTATTG 


ATACAGTAAC 


GTAAGCCGCC 


CTTGTCCTGT 


11580 


P.P. APP ATPPG 

U\Jrtl»V_rt.L ^. V— w 


TAAAGACATG 


CCCAAGGTGA 


GAATCTCCTA 


CTCGGCTCCG 


CACTTCCATA 


11640 


pp PP T P A A"* 
1 \-rt * r\ . 


TP.TAGCACTT 


ATCTTCCTTG 


TAGGTCACAA 


CATCTGGACT 


GATGGGTTGG 


11700 


P.TAAAAPTAG 


GCCAGCCACA 


ACCAGACTCA 


AATTTGTCTT 


TTGATGAAAA 


GAGAGCTTCC 


11760 


PPAPT^PPTA 


TATCCACATA 


GATACCGGA7 


TCAAATTTAT 


CCCAGTAACG 


GTTT G AG AAA 


11320 


C PTOTT PTG 


TTTGATTTTC 


CTCCGTAACT 


GCATACTCCT 


CAGGTGACAG 


G GTCTTTT TC 


11880 




PACTTGGTTT 


TGGATATTTG 


CTGGCATCAA 


TGACAGGATA 


GGCCGCCTCA 


11940 


TTAACATTGA 


TATGGCAGTA 


GCCATTTGGA 


I 1 4 1 Lv-i 1 wrt 


PATAGTC' T " TI G 


ATGGTAATCC 


12000 


TCAGCCACCA 


CAAAATTCTT 


CAAGTTTTCC 


TTTTCAACTG 


CTAGAGGTTG 


ATCGTATTTC 


12060 


TTAGCCACCT 


CATCAAAGAC 


TTCGTTAATC 


ACTTCCAAAT 


CCTTGTCATC 


TGTGTAATAA 


12120 


ACACCAGTAC 


GGTACTGGGT 


CCCCACA7CA 


TTTCCTTGTT 


TATTTTTGCT 


GCTTCGATTC 


12130 


ATAATGCGGA 


AATAGTGAAG 


CAGGATTTCC 


7TGACACAAA 


TTTGCTTGGC 


ATCATAGCTG 


12240 


ACATGGACGG 


TTTCTGCATG 


ACCTGTTTGG 


TTAATCAATT 


CGTACTTGGT 


TGTTTCTCCT 


12300 


CTACCATTTG 


CATAGCCTGA 


AACCGCATCC 


G7CACCCCGG 


GAACACGTGA 


GAAATATTCG 


12360 


TCCACTCCCC 


AGAAACAACC 


TCCACC7AGA 


TAAATTTCGT 


GCAAGTCTGC 


GTCTTTACTA 


12420 
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ATTTCTGTTT 


TTTTCACTCC 


TTTTCCTCCT 


TGCCTAACTG 


CCGCCTTTTC 


AATTTGCGAG 


12480 


CCATCTCTCT 


GCCCTGCATT 


TCGTATCAAT 


AGAACATAGA 


AACCGGTTAT 


GGCTAGAAAA 


12540 


AATACTCCTA 


GCAACAAGAA 


GATTTTTAAC 


TTATCATTCA 


TAAGACGCCT 


CCTAGGCTAA 


12600 


TTCCTTCAAA 


GTTTGCAAAA 


TTGCATCTTT 


TTCCATGAAT 


CCTCGATCTG 


TTTTGACCAG 


12660 


CT7GCCTTCT 


TTGTCTATAA 


AGGCTTGGGT 


TGGGTAAGAA 


CGGACACCA7 


AAGTTTCCAA 


12720 


AAGTTTGCCT 


GATGGGTCAA 


CTAGGACTGG 


GAGATTTTTA 


TAATCCAATC 


CCTTATACCA 


12780 


ATTCTTAAAG 


TCCGCTTCAG 


ATTGCTCTCC 


CTTATGTCCT 


CGTGACACTA 


CTG7CAACAC 


12340 


CACATACTCA 


TCACCAGCTT 


CTTTAGCAAT 


CTCATCCGTA 


TCTGGAAGAC 


TAGCCAGACA 


12900 


GATGGAACAC 


CAAGAAGCCC 


AGAATTTGAG 


ATAGACTTTC 


TTGCCCTTGT 


AATCAGATAA 


12960 


ACCCTAGGTC 


TTGCCATCTA 


CTCCCATCAA 


TTCAAAATCA 


GCCACCTCTT 


TCCCTTTAGC 


13020 


TGCGCTTCTT 


TTACTAGCTG 


TC7GCTCCCT 


CTTCATTTCA 


TCTTTCGTTT 


GGTGTTCACT 


13080 


AGTCACGGAC 


TTCCCTCAAC 


AAGCCGTCAA 


ACAAAGGAGC 


GAACCTGCTC 


CAAGAACACA 


13140 


7CTTTCCCAT 


TTTTTCATAT 


TGATATTCC? 


TTCCATTTTA 


TTCAAATAAT 


TCACTTAAAA 


13200 


TTGAAGCATT 


TCCAAACAGA 


ACCAAGAAGC 


CCATCACAAT 


AATGAGAAAA 


CCACCCACT7 


13260 


TTTTGACGAT 


TCCGAGATAG 


GGATGAAGTT 


TTCGGAAATG 


TTTCAAAACA 


TAACTAGAGG 


13320 


TCAGAGCTAG 


AAGCAAGAAT 


GGTAGCGCCA 


AGCCCAGCGT 


ATACACCAAC 


ATGAGACCAG 


13330 


CTCCCTGCCA 


AGCTCCTGAA 


CCACCTGAAG 


CCGCCAAGGC 


CAAAACAGAC 


CCCAGAACCG 


13440 


GCCCCACGCA 


AGGCGTCCAA 


GCAAAACTAA 


ACCTCAACCC 


CAATAAAAAT 


GCCTGACTAT 


13500 


AGCCCTTACC 


ATTTTGCCCC 


TGTCCTTGCA 


GTTGTAGCCT 


CTTTTCCTTA 


7AAAGCCCCT 


13560 


TAAAGTGTAG 


AATCTCCATT 


TGGTGCAAAC 


CAAGAAGGAT 


AATAATTGCC 


CCAGTAAGAT 


13620 


ATTGGAACCA 


AGAAGCATAA 


AGCAAATCGC 


CTAAAAAACC 


AGCTCCA7AC 


CCCAACAAAA 


13680 


TAAATATAAA 


GGAAATTCCT 


GCTATAAAGG 


CCAGAGTTCG 


TAATAAACTA 


GTAACTGAGA 


13740 


TTCAAAATTT 


GCCGCTAGAA 


GCCTGAGCAC 


CATCCTTATC 


ATCTAGTAAC 


ACTCCTGTAT 


13300 


AGACCCG7AA 


CAAAGGTAAG 


ATACAAGGAG 


AAAAGAAGGA 


TAGAA7CCCT 


GCCAAAAAGA 


L3860 


CACTTAGAAA 


AAAGAAAATA 


TGACCCATAA 


ACTTCCTCCT 


ATCAT7TTAT 


TGATAGA7T7 


13920 


ATTATA 












13926 



(2) INFORMATION FOR SEQ ID NO : 6: 

til SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20199 base pairs 
(8) TYPE: r.ucleic acid 
( C ) STRANDEDNESS : double 
(O) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CCCAGCAGAA AAATGCCAT7 TGGAGATAAT GG AAATCCTA AAAAAACTAT GTTTGAGAAA 60 

ATAACCTTGT TTATCGTGAT TATCATGCTA GTAGCAAGTT TATTGGCAAT TTTTGCAACT 120 

GCAATTGGTG CCCTCAGTAA TCTATAAAAT AGATTCAAGA AAATTTAGTG ACTGGGATTT L80 

CCCAGCCCTT TTTTAAAGTG AGAAGAAATA ATGACTATG? TTTTAGATAC ACCTAAGATT 24C 

AAGGTCAAGG CTGGTAATGG TGGCGATGGT ATGCTTCCCT TTCGTCGTGA AAAATATGTC 300 

CCTAATGGAG GCCCTTGGGG TCGTGATGGT CGTCGTGGAG GCAATGTGGT CTTCGTTGTA 3 60 

GACGAAGGAC TACGTACCTT GATGGATTTC CGCTACAATC GTCATTTCAA GGCTGATTCT 420 

GGTGAAAAAG GGATGACCAA AGGGATGCAT GGTCGTGGTG CTGAGGACCT TAGAGTTCGA 480 

GTACCACAAG GTACGACTGT TCGTGATGCG GAGACTGGCA ACCTTTTAAC AGATTTGATT 540 

GAACATGGGC AAGAATTTAT CGTTGCCCAC CGTGCTCGTC GTGGACGTGG AAATATTCCT 600 

TTCCCCACAC CAAAAAATCC TGCACCGGAA ATCTCTCAAA ATGGAGAACC AGGTCAGGAA 660 

CGTGAGTTAC AAT7GGAACT AAAAATCTTG GCAGATGTCG GTTTAGTAGG ATTCCCATCT 720 

GTAGGGAAGT CAACACTTTT AAGTGTTATT ACCTCAGCTA AGCCTAAAAT TGGTGCCTAC 7 80 

CACTTTACCA CTATTGTACC AAATTTAGGT ATCGTTCGCA CCCAATCAGG TGAATCCTTT 840 

GCAGTAGCCG ACTTGCCAGG TTTGATTGAA GCGCCTACTC AAGGTCTTGG TTTGGGAACT 900 

CAGTTCCTCC GTCACATCGA GCGTACACGT GTTATCCTTC ACATCATTGA TATGTCAGCT 9 60 

AGCGAGGGCC GTGATCCATA TGAGGACTAC CTAGCTATCA ATAAAGAGCT GGAGTCTTAC 1020 

AATCTTC GCC TCATGGAGCG TCCACAGATT ATTGTAGCTA ATAAGATGGA CATGCCTGAC 1080 

AG7CAGGAAA ATCTTGAAGA CTTTAAGAAA AAA7TCGCTG AAAATTATGA TGAATTTGAA 114 0 

GAGTTACCAG CTATCTTCCC AATTTCTGGA TTGACCAAGC AAGGTCTGGC AACACTTTTA 1200 

GATGCTACAG CTGAATTGTT AGACAAGACA CCACAATTTT TCCTCTACGA CGAGTCCGAT 12 60 

ATGGAAGAAG AAGCTTACTA TGGATTTGAC CAAGAACAAA AAGCCTTTGA AATTAGTCGT 13 20 

GATGACGATG CGACATGGGT ACTTTCTGGT GAAAAACTCA TGAAACTCTT TAATATGACC 13 80 

AACTT7CATC GTGATGAATC TGTCATGAAA 7TTGCCCGTC AGCTTCGTGG TATGGGGGTT 144 0 

GATGAAGCCC TTCG7GCGCG TCGAGC7AAA GATCGCCA77 TGGTCCGCAT TGGTAAA77T 150C 

GAGTTTGAAT TTGTAGACTA GGAGACTGGT ATCCGACATA AACCGATATC TTTCCGAGAT 1560 

GCGGATGGTA ATTTTC77TC CGCCGCAGAC C77TCCAATC AAAAGAAATT GGAAGAACTA 1620 
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TTTAATCGTC 


TCAATCCAAA 


TCGTGCCTTG 


AGATTCGCAC 


GAACTAAAAA 


GGAAAATCCA 


1580 


TCTCAGTAAA 


GAAGCTAAAA 


AATCCCGTGC 


CTCATCAGAC 


ACGGGATTTT 


GTGGTACGAC 


1740 


AGCCATGTAT 


AGCAAACTGA 


ATCTGGAATA 


GCACAGCATA 


TCTTCTAAAA 


TATAGTAAAA 


1800 


TGAAATGAGA 


ACAGGACAAA 


TCGATCAGGA 


CAGTAAAATC 


GATTTCTAAC 


AATGTTTTAT 


1860 


AAGCAGAGAT 


GTACTATTCT 


ACTTTCAATC 


AACTATATTG 


TTATAAATTG 


ATTTGAATTT 


1920 


CAAAATTAAA 


TTGTTTCATT 


CTTATTTCAA 


TTTGTTATAG 


TATATCTGAT 


GTCAAACTTC 


1980 


TCGGCGAGTC 


AAATAGCGAT 


TCCCAAGCCT 


GACTATCGTG 


AGGTAGCGGA 


TTAAAATGGT 


2040 


CTGGGGATAG 


ACCGTTTTAA 


GTCTGACGCT 


GGAAATAAGA 


ATTGTCAGAA 


GAAGGGATAG 


2100 


CGAAATCGTG 


GCTCTACGAA 


CAGGAACGTG 


ATAATAAGGC 


GTATATAGCG 


GATAAGAGGG 


2160 


CATCAAACTC 


TAAAGTCCAA 


AAAGGTAGTC 


GTAACCTATA 


TGCGTAAATC 


ACGAGAGTAA 


2220 


TTGAATTCGT 


ACTAAGATTT 


TCTATTTTCA 


CTGTAACCTT 


TTAACGCCCT 


TATATCTTGT 


2280 


ATACACGAGG 


AAACA7G7AC 


GACTTATCCC 


GTGAGGTCTA 


TCACTATAAA 


GAGAAAACGA 


2340 


CAGATAGAAG 


TGATCCTGAG 


TCACGGTTAT 


CTGTCTGATA 


GGACGGTATG 


TATAAAACGC 


2400 


TTCTCTGAAC 


TGAGAGAAGG 


GCGACAACTT 


CTTGCTAAAA 


TTTAGTTGAA 


CAGCCGTATT 


2460 


CCGATACTTA 


CA7AAGAGA7 


CTAGTCTTAG 


CTCCTACTCA 


GTTTTAGGGG 


ATAAAAAAGG 


2520 


GGCAATAGCG 


ATTCGAGAAA 


GATTATACTC 


TTCGAAAATC 


TCTTCAAATC 


ACCTCAATAT 


2580 


CGCCTTCTCG 


TATGTGTAGC 


ATACTGACTA 


CG7CAC7TCC 


ATCTACAACC 


TCAAAACAGT 


2640 


GTTTTGAGCA 


ACcTGCGGCT 


AGTTTCCTAG 


777CA7C7T7 


GATTTTCATT 


GAGTATTAG7 


2700 


AATTCAGTTA 


C7AAC7CG7C 


AACTCTGATT 


TATCCAATAA 


AATTG AAAAG 


GATGGAAAAA 


2760 


AGGATAAATT 


TATGATA7AC 


TTTATTTTGA 


AGACCTTATT 


AGAAATCTTG 


AAAGAGTATT 


2820 


GAAAACTTAG 


AATGAGAAAA 


ATTGTTATCA 


ATGGTGGATT 


ACCACTGCAA 


GGTGAAATCA 


2880 


CTATTAGTGG 


TGCTAAAAAT 


AGTGTCGTTG 


CCTTAATTCC 


AGCTATTATC 


TTGGCTGATG 


2940 


ATGTCGTGAC 


TTTGGATTGC 


GTTCCAGATA 


TTTCGGATGT 


AGCCAGTCTT 


GTCGAAATCA 


3000 


TGGAATTGAT 


GGGAGCTACT 


GTTAAGCGTT 


ATGACGATGT 


ATTGGAGATT 


GACCCAAGAG 


3060 


GTGTTCAAAA 


TATTCCAATC 


CCTTATGGTA 


AAATTAACAG 


TCT7CGTGCA 


TCTTACTATT 


3120 


TTTATGGGAG 


CCTCTTAGGC 


CGTTTTGCTG 


AAGCGACAGT 


TGGTCTACCG 


GGAGGATGTC 


3180 


ATCTTGCTCC 


TCGTCCGATT 


GACTTACACC 


TTAAGGCGTT 


TGAAGCTATG 


GGTGCCACTG 


3240 


CTAGCTACGA 


GGGACATAAC 


ATCAAGTTAT 


CTCC7AAACA 


TACAGGACTT 


CATGCTGCAA 


33CG 


GTATTTACAT 


GGATACGGTT 


AGTGTGGGAG 


CAACGATTAA 


TACGATGATT 


GCTGCGGTTA 


336C 


AAGCAAATGG 


TCCTACTATT 


ATTCAAAATG 


CAGCCCC7GA 


ACCTGAGATT 


ATTCATGTAG 


3420 
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CTACTCTCTT GAA7AATA7C GGTGCCCATA TCCGTGGCGC AGGAACTAA? ATCA7CATTA 3480 

TTGATGGTGT TGAAAGATTA CATGGGACAC GTCATCAGG7 GATTCCAGAC CGCATTGAAG 3 540 

CTGGAACATA TATATCTTTA GCTGCTGCAG TTCG7AAACG AATTCGTATA AATAATGTTC 3 600 

777ACGAACA CCTGGAAGGG TTTATTGCTA AGTTGGAAGA AATGGGAGTG AGAATGACTG 3660 

TATCTGAAGA CAGCATTTTT GTCGAGGAAC AGTCTAATTT GAAAGCAATC AATATTAAGA 37 20 

IrAGCtCCTTA" CCCAGGCTTT GCAACTGATT TGCAACAACC GCT7ACCCC7 CTTT7ACTAA 3 7 30 

CAGCGAA7GG TCGTGGTACA ATTGTCGATA CGATTTACGA AAAACGTGTA AATCATGTTT 384 3 

TTGAACTACC AAAGATGGAT GCGGATATTT CGACAACAAA . 7GG7CA7A77 77G7ACACGG 39C0 

GTGGACGTGA TTTACGTGGG GCCAGTGTTA AAGCGACCGA CTTAAGAGCT GGGGC7CCAC 3 960 

TAGTCATTGC TGGGCTTATG CCTGAAGGTA AAACTGAAA7 TACCAATATC GAGTTTATCT 4020 

TACGTGGTTA TTCTGATATT ATCGAAAAAT TACGTAATTT AGGAGCGGAT ATTAGACTTG 4080 

TTGAGGATTA AACCGTAGAG GTGTTTATGA A7A7TTGGAC CAAA77ACCA ATGTTTTCTT 4140 

TT77TGAAAC GGATCGCTTG 7ATT7CCG7C C7TTCTTTT7 TAGTGATAG7 CAGGACTTCC 4 200 

GCGAGATAGC 7TCAAATCCA GAAAATC77C AATTTATTTT CCCAACGCAG GCAAGTCTGG 4 260 

AAGAAACTCA ATA7GCACTG GCCAATTACT 77A7CAAC7C CCCTTTGGGA GTGTGGGCAA 4 32 0 

TTTGTGACCA CAAAAATCAA CAAATGATTG G77C7A77AA ATTTGAGAAG TTACATGAAA 4 3 80 

TCAAAAAAGA ACCTCACCTT CGC7A777TT TGACAAAAGA TCCTTGCTCC CAACCA777A 44 4 0 

7GACAGAGC7 TGTTAGAAAA ATTTGTCACC TTTCTTTTCA CCAA777CGC TTAAAACAAT 4 500 

TATTTATCAT TACCCACCTT GAAAATAAAG CTACCCAAAG AGTTGCTCTT AAGTCTGGAT 4 560 

7TAGTTTG77 CCG7CAG77T AAGGGAAGTG ATCG77ACAC AAGAAAAA7C CGGGA7TA7C 4 6 20 

77CAAT7TCG GTATCTAAAA GGAGAC7TCA ATGAGTAAGC ATCAGGAAAT TCTAAGCTAT 4 680 

TTCGAGGAAT TACCAGTAGG 7AAAAGCG7C AGTCTTCCTA GCATTTCGAA TCATCTAGCA 4 7 40 

GTTAGTGATG GAACAGCC7A TCGGGCTATT AAAGAACCTG AAAACCGTGG AA7TG7GGAG 4800 

ACCCGTCCTA GAAGTGGAAC AAT7CG7GTT AAA7CCCAGA AAGTTGCTAT AGAGAGATTA 4 960 

ACG7TTGC7C AAATTGCAGA AGTGACTTCT TCTCACGTTC TGGCTGGGCA AGAAGGTTTA 4 920 

GAGAGAGAA7 77AGTAAG7T 77CAATTGG7 GCCATGAC7G AACAAAA7A7 C7TG7CTTAC 4 9 80 

C7TCATGA7G GGGGGCTCTT GATTGTCGGA GACCGAACCC G7A7TCAGTT GCTAGCCTTG 5040 

GAAAATGAAA ATGCAGT7C7 GGTTACAGGG GGAT77CAGG 7TCATGATGA TGTGCTTAAA 5100 

CTGGCCAA7C AAAAAGGGA7 7CCTG77C7A AGAAG7AAGC A7GA7ACCT7 7ACCG7CGCG 5160. 
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ACCATGATCA ATAAAGCC7T GTCAAATGTC CAAATCAAGA CTGATATTCT CACAGTTGAG 522 0 

AAACTTTATC GCCCTAGTCA TGAGTATGGT TTTCTGACAG AGACAGATAC AGTTAAAGAT 52 80 

TATTTGGACT TGCTTCGTAA GAATCGTAGC AGCCGTT7CC CTGTTATCAA TCAACATCAG 5340 

GTCGTTGTTG GTGTTGTAAC CATGAGAGAC GCTGGTGATA AATCACCAAG CACCACAATT 5400 

GATAAGGTTA TGTCTCGTAG TCTATTTTTG GTTGGATTAT CGACAAATAT TGCCAATGTG 5460 

AGTCAACGGA TGATCGCAGA AGACTTTGAA ATGGTACCAG TTGTTCGAAG CAATCAAACT 5520 

7TGCTTGGCG TTGTGACGCG ACGAGATGTC ATGGAGAAGA TGAGCCGTTC CCAAGTTTCG 55 80 

GCTCTACCAA C TTTT T CTGA GCAGATTGGA CAAAAGCTCT CTTATCACCA TGATGAAGTA 5640 

GTCATTACAG TGGAACCCTT TATGCTAGAA AAAAATGGAG TTTTCGCTAA TGCTGTATTG 5700 

GCAGAAATTC TGACCCACAT GACCCGATTT AGTTGTTAAT AGTGGTCGCA ATCTCATTAT 57 60 

CGAGCAGATG CTGATCTACT TTTTGCAGGC TGTTCAGATA GATGATATAT TGCGCATTCA 5820 

GGCACGGATT ATTCATCATA CGAGACGGTC AGCTATAATT GATTACGATA TTTATCATGG 5880 

TCACCAGATT GTTTCAAAAG CAAATGTGAC 7CTTAAAATT AATTAGAAAC TAGGAGAAAA 5940 

GATCATAACA TTAAAATCAC CTCGTGAAAT CGAAGCTATG GACAAGCCTC GTGATTTTCT 6000 

AGCAAGTATT CATATAGGCT TACCTCATTT GATTAACCCA GGCGTAGATA TGTGGGAAGT 6060 

TGAAGAATAT GTCCGCCGTC GTTGTAAAGA AGAAAATTTC CTTCCACTTC AGATTGGGGT 6120 

TGACGGTGCC ATGATGGACT ATCCTTATGC TACCTGTTGC TCTCTTAACG ATGAAGTGGC 5180 

TCACGCTTTC CC7CCTCA77 A7ATCTTGAA ACATCGTGAT 7TGCTCAAAC TTCA7AT3GT 62 40 

TTTGGCAGGT CCCATTCCTA AATC7GACCT AAATGTCTCA AAATTAAACT TCAACAATGT 6300 

TGAACAAATG AAAAAATACA CTCAGAGCTA TTCTGGTGCT TTACCAGACT CATCTTGGGC 5 3 60 

TTATGCTGTT GGTACACCGT CCGAACAAGT CAAAAACTTG ATGGATGTAA CCAAAGAAGC 6420 

TATGTACAAG GGTATTGAGC AAGCTCTTGT TCGAAATCGT ATCGGTGATA TCGGTCCCGC 64 80 

TATTCAAGAA TACGCTGAAA GTCGTGGTTA CGGTGTAGTG CGTGATTTGG TTGGTCATGG 6540 

TGTTGGCCCA ACTATGCACG AAGAACCAAT GGTTCCTAAC TATGGTATTG CAGGTCGTGG 6600 

ACTCCGTCTT CGTGAAGGAA TGG7CTTAAC CATTGAACCA ATGATCAATA CAGGCGATTG 6660 

GGAAATTGAT ACAGATATGA AAACTGGTTG GGCGCATAAG ACCATTGACG GTGGATTGTC 6720 

ATGTCAGTAT GAACACCAAT TTGTCATTAC GAAAGATGGA CCTGTTATCT TGACTAGCCA 6780 

AGGTGAAGAA GGAACTTA7T AA7AAAAAGT GAAAAGACTA CTGGAAGTTT ATTTTGATAA 6840 

AAAATCCAGT AGATCTTTTC ATAATAAAAC GCATTGTATC AAGTGTTAGC GGCTGATATC 6 900 

ATGCGTTTTT CTGCTTTTAA GATTTTTTCC AACTCTCTTT GTAAGCGCAT CATAACAAAG 6960 
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GGTCTAGGAT TCAGGGCTCT CCTCCTATAT ACTATTAGTA AAGTAAAACT AAGGGAGGAT 7 020 

ATTTTAGTGT CGCAGTCTAT TGTTCCTGTA GAGAT7CCAC AATATTGTCG TTTTGATTCT 7080 

AAAAAGAGAA ATGCAA7TC? GTTTAATGTT CGTATTGCCA ATCTTAAATT TACTTTTTTA 7 HO 

TATTATACTT CCTGCCAAAC AAAATATGGT ATAGTAGTTC TATGAATGAT GAAGCAAGTA 7200 

AACAACTAAC TGATCCACGA TTTAAGCGTC TTGTTGGTGT TCAGCGTACC ACTTTTGAAG 7260 

AGATGTTAGC TGTATTAAAA ACAGCTTATC AACTTAAACA CGCAAAAGGT GGACGAAAAC 7 320 

CTAAATTAAG CCTAGAAGAC CTTCTTATGC CCACTCTTCA ATAGTGCGAG AATATCGAAC 7 3 80 

TTATGAAGAA ATTGCGGCTG ATTTTCCTAT TCACGAAAGC AACTTTATCC GTCGCAGCCA 74 40 

ATGCGTTGAA ATAACTCTTG TTCAAAGTGG TTTTACCGTT TCAAGAACTC CTCTCAGTTC 7 500 

TGAGGACACG GTAATGATTG ATGCGACGGA AGTAAAAATC AATCGCCCTA AAAAAACAA? 7560 

TAGCGAATGA TTCTGGTAAA AAGAAATTTC ACGCTATGAA GCCTCAAGCG ATTGTCACAA 7 620 

GTCAAGGGAG AATTCTTTCT TTGGATATCG CTGTGAACTA TAGTCATGAT ATCAACTTG7 7 680 

TCAAAATGAG TCGTAGAAAT ATCGAACAAG CTGGTAAAAT CTTGGCTGAC AGTGGTTATC 7 740 

AAGGGCTCAT GAAGATATAT CCTCAAGCAC AAACTCCACG TAAATCCAGC AAACTCAAGC 7 800 

CGCTAACAGC TGAAGATAAA GCCTATAACC ATGCGCTATC TAAGGAAAGA AGCAAGGTTG 7 860 

AGAACATCTT TGCCAAAGTA AAAACGTTTA AAATATTTTC AACAACCTAT CGAAATCATC 7 920 

GTAAACGCTT CGGATTACGA ATGAATTTGA GTGCTGGTAT TATCAATCAT GAACTAGGAT 7 980 

TCTAGTTTTG CAGGAAGTCT ATTGAGGTAT TGAGCTAGr? TATGAAAAAA TTCCCTGAAA 8040 

AGTCGAGTGT TTTAGAAACC CACAGTGTAG TATTCTAGTT TCAATCCACT ATATTTTGCT 8100 

ACTCCCCGTA AAGTTTCTAT TT7CCCTGAT TTCTGATATA ATAGAAATAT TGACTTCAAG 8160 

AGTAAGGAAG AGAAGATGAA CGCATTATTA AATGGAATGA ATGACCGTCA GGCTGAGGCG 8220 

GTGCAAACGA CAGAAGGTCC CTTGCTAATC ATGGCAGGGG CTCGTTCTGG AAAGACTCGT 8 230 

GTTTTGACCC ACCGTATCCC TTATTTGATT GATGAAAAGC TGGTCAATCC TTGGAATATC 8 34 0 

TTGGCCATTA CCTTTACCAA CAAGGCTGCG CGTGAGATGA AAGAGCGTGC TTATACCCTC 8400 

AATCCAGCGA CTCAGGACTG TCTGATTGCG ACCTTCCACT CCATGTGTGT GCGTATTTTG 84 60 

CGTCGCGATG CGCACCA7A? TGGCTACAAT CGTAATTTTA CAATTGTGGA TCCTGGTGAA 85 20 

CAGCGAACGC TCATGAAACG TATTCTCAAA CAGTTCAACT TGGACCCTAA AAAATGGAAT 8 5 80 

GAACGAACTA TTTTGGGGAC CATTTCCAAT GCTAAGAATG ATTTGATTGA TGATGTTGCT 8640 

TATGCTGCCC AAGCTCGCGA TATGTATACG CAAATTCTGG CCCAGTGTTA TACAGCCTAT 37 00 
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CAAAAAGAAC TTCGTCAGTC TGAATCCCTT GACTTTGATG ATTTGATTAT GCTGACCTTG 37 60 

CGTCTCTTTG ATCAAAATCC TCATGTTTTG ACCTACTACC AGCAAAAATT CCAATACATC 8 820 

CACGTTGATG ACTACCAACA TACCAACCAC GCTCAGTACC AATTGGTCAA ACTCTTGCC? 8830 

TCCCG7TTTA AAAATATCTG TGTGGTTGGG GA7GCGGACC AG7CTATCTA CCCTTGCCGT 8940 

GGTGCTGATA TGCAGAATAT CTTGGACTTT GAAAAGGATT ACCCCAAAGC CAAGGTTGTT 9000 

7TGTTGGAGG AAAATTACCG CTCAACCAAA ACCATTCTCC AAGCGGCCAA CGAGGTTATT 9060 

AAAAATAATA AAAATCCCCG TCCTAAAAAT CTCTGGACTC AAAACGCTGA TGGGGAGCAA 9120 

ATCGTTTACT ATCGTGCCGA TGATGAGCTG GATGAGGCTG TATTTGTAGC CAGAACCA7C 9130 

GATGAACTTA C7CGCAG7CA AAAC7TCC7T CATAAGGATT TTGCAGTTCT CTATCGGACT 9 240 

AATGCCCAGT CCCGTACAAT TGAGGAAGCC CTGCTCAAGT CTAACAT7CC TTATACCATG 9 300 

GTTGGCGGAA CCAAATTCTA CAGCCGTAAG GAAATTCGCG ATATTATTGC TTATCTCAAC 9 3 60 

CTTATTGCTA ATTTGAGTGA CAATATTAGT TTTGAGCGTA TTATCAACGA GCCTAAACGT 94 20 

GGAAT7CG7C 7ACGTACACT TGAGAAAATC CGTGATTTTG CAAATTTGCA AAATATGTCT 94 80 

A7CCTGGATG CTTCTGCTAA TATTATGTTG TCTGGTATCA AGGGTAACGC AGCCCAATCT 9540 

ATCTGGGATT TTGCCAATAT GA7GC7TGAT TTGCGGGAGC AGCTAGACCA CTTAAGCATT 9600 

ACAGAG7TCG TTGAGTCCGT CCTAGAAAAA ACAGGTTATG TCGATATTCT TAACTCCCAA 9660 

GCGACTCTAG AAAGCAAGGC ACGGGTTGAA AATATCCAAC ACTTTCTTTC TGTTACGAAG 97 20 

AACTTTGATG ACACCACGGA TGTGACAGAA GAGGAAACTG GTCTGGACAA ACTGAGTCGT 97 80 

TTCTTAAATG ACTTCCCTTT GATTGCCGAC ACAGATTCAG GTAGTCAGGA GACATCAGAA 9840 

GTGACCTTGA TGACCCTGCA TGCTGCCAAA GGTCTCGAAT T7CCAGTTG7 CTTTTTGATT 9900 

GGGATGGAAG AAAATGTCTT TCCACTTAGT CGTGCGACTG AAGATTCAGA TGAATTAGAA 9960 

GAAGAGCGCC GTCTAGCCTA TGTAGGTATC ACGCGTGCAG AGAAAATTCT CTATCTGACC 10020 

AATGCCAACT CACGCTTGCT TTTTGGTCG7 ACCAATTATA ACCGTCCGAC TCGTTTTATT 10080 

AACGAAATCA GTTCACACTT GCTTGAGTAT CAAGGTCTGG CTCGTCCTGC AAATACAAGC 10140 

TTTAAGGCAT CATATAGCAG TGGTAGTATT TCCTTTGGTC AAGGTATGAG TTTGGCTCAC 1C200 

GCTCTTCAAG ACCGTAAACG CGGTGCTGCC CCAAAATCAA TCCAGTCAAG CGGTCTTCCA 102 60 

TTTGGTCAAT TTACAGCTCG CGCAAAACCA GCATCTAGCG AGGCAAATTG CTCCATTGGT 10 320 

GATATTGCTC TCCACAACAA ATGGGGAGAG GGAACCGTTC TGGAAGTTTC AGGTAGCGGT 1038C 

GCTAGGCAGG AATTGAAAAT CAATTTCCCA GAAG7AGGTT TGAAAAAAC7 TT7AGCCAGT 104 40 

GTGGCTCCAA TTGAGAAAAA AATCTAA7TT TCCATCCTTC TCACGAATAA TAAAGTGAGG 1050C 
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AGGATTTTTA TGTACAG7AT TTCATTCCAA GAACAT7CAC TATTACCAAG AGAAAGGCTG 10560 

GCCAAGGAAG GAGTTGAAGC GCTTAGTAAC CAAGACTTCC TAGCTATTTT ACTCAGGACA 10 620 

GGAACACGTC AAGC7AGCGT TTTTGAAATT GCCCAAAAAC TCTTGAACAA TC77TCAACC 10680 

CTAACGGATT TCAAAAAAAT GACCCTGCAG GAATTGCAGA G7TTG7CTGC TATTGGGCGT 10740 

GTTAAGGCCA TACAATTACA AGCTATGATT GAACTGGGGC ATCGTATTCA CAAACACGAG 10800 

ACTCTTCAAA TGGAAAGT AT TCTCAGCAGT CAAAAGTTGG ~CCAAGAAGAT GCAGCAGGAA 10860 

TTAGGGGATA AAAAACAAGA GCACCTGGTG GCACTCTATC TCAATACTCA AAATCAAATC 10920 

ATCCA7CAGC AGACCA7T7T TATCGGGTCT GTAACTCGTA GTATCCC7GA ACCGCGAGAG 10980 

ATTCTTCACT ATGCAATCAA CCATATCCCG AC7TC7CTTA TCTTCCTCCA CAATCATCCT 11040 

TCAGCAGCCG TAGCGCCTAG CCAAAATGAT GATCATGTCA C7AAAC7TCT TAAAGAAGCC 11100 

TGCGAATTGA 7GGGGA7TGT TCTCTTCGAC CATTTGATTG TC7CTCATTC TAATTACTTT 11160 

AG7TA7CGTG AAAAGACAGA TTTAATCTAA AGTTCATTAA CGACATAGTC AAAGAGTTTT LI 220 

TTATCTTTGG GACGATTTTC AAAAAGAAGT TCTGGATGCC ATTGGACACC GAGAAAGGCG 11280 

ACATCATCCG TACTCATGAC AGCCTCAATG ATACCATCTT TAGGATCATG AGCCACAACT 11340 

TTTAAATTTG CTGCTAACTC CTTGATGCTC TGGTCGTGGA AGGAGTTGAT ATGAGAGATT L1400 

TCTCCATAGA TTTCTTGGAG AACGGTATCT GGTTCTGTTA CCAAGCGTTG AGTTGTGTAC 11460 

TCAACAGAAG AA7CCTCCCA ATGGTCTTCG ATATC77GGT ACAAAG77CC ACCCATGCCA 11520 

ACGTTAAACA GTTGCGTACC ACGGCAGACA GAGAAAATGG CCTTTTTC7G TTTAATAGCT 11580 

TCC7TCATCA GGGCCAGTTC GAAGATATCT CTTTGAAGGT CATACTCA7C ACTATCAATG 11640 

gttttgggt: CGCCATAAAA 7T7TGCATCC ACATTTTGCC CACCTGTCAA GATCAGC77C 11-700 

TCAATCAAAC TGATATAGTG CCAGCCCATT TCTTGATCAC CAATCGGTAG GATGATGGGA 117 60 

ATCCCTCCAG CA7CTTTAAC GCC7TCAACA AAGCC7T77G CTGCGTACCT CA7CATGA7G 11920 

TCA7CATCTG GATGAGTTTT 7TCG7T7CCT GTAATCCCAA TAAC7GG777 TTTCATAAAA 11880 

TGATTTTCGC TTTCTAATCC TCTTTTCGCA TGAAGTAGAG GAGGGT77GG AGTTCAC7TG 11940 

TCAAATCGAC ATACTGAACG ACCACGTCT7 7TGCTAAATG CAGATGCAC7 GGTGAAAAAC 12000 

TGAGAATTCC TTTCACACCA GCATCAACCA AGAGATTAGC AACCTCTTGT GACT7GACGC 12060 

TGGGAACAGT TAGGATAGCA GTCTTCACAT CAGCATCC7T GATTTTA7CC TTGATCTGAC 12120 

AAATCCCGTA AATGGGAATC CCGTCAGGAG T7TGGG7ACC CAC7TCACCA TGGTCGTCTA 12130 

GGTCAAAGGC CATGA7AA7C T7CATCTTG7 TACGTTCG7G GAAGCGG7AG 7CCACAAGCC 12240 
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CATGGCCCAT ATTTCCAATA CCAACCAGCA TGACATTGGT AATAGAGTTG TCATTGAGCA 12300 

AATCGGCAAA AAA7CTCATT AG7TTT7TCA CA7CATAGCC AAAACCACGA CGACCAAGTT 123 60 

CACCAAAATA CGAAAAATCA CGACGTACGG TCGCTCAATC AATACCGATA GCCTCTGCAA 124 20 

TTTGCTTAGA GTTGGCACGT TCAATCTTTT CTGCATGAAA TCTCTTAAAA ATTCGATAGT 12480 

AGAGAGAGAG 7CT7TTTGCT GTAGCTTTTG GAATAGCAAA CTGTTTATCT TTCACAAAAT 12540 

CACAACCTTT CTATTCTTCT ATTTTATAGA AACATTGTGA AAAAATCAAC AAAAATAAGA 12600 

AAAAACTAAG AAAAATCTTA GTTTTGATGT AAAAAA7CTG CATGAGATAG AAAACGGTAG 12 660 

AGGTCTCCGA CCAGCCCCTG ATAAACTTTT TTGCCCCTAA AAGTCAGAGA AGTCACATAA 12720 

AGTGTATCTG GTAAGGTTAC ACATCCTGAC AAAGTCAACA 7GAGACCCTC ATGATCCTCA 12730 

TACTTGAGAG TACGCTCTAC ATGATAGCAG TCCTTATAGG TCAGTTCAAA CATTTTGGCT 12 840 

CTATCTTTCC GA777TC7AA AGACACCACG TTCTACCAAG CTA7CCA7CA GGAAGTAGAA 12900 

TTTTTCCTGA TGAATATGGT GGTCTTCTGA TTTCAAAATA TCAACTAGAC GAAGGCCAAA 12960 

CTTGTCAGTG ATATTGATTT TAGCCCCTGT AAGTTCCTTG TTAATGATGA TTTTGAGTTG 13020 

GAAGCC7TCA CCGCTGTTTG GCACTTTTTC CAAAAGGCGA GTCACT7CAT AGTTACCAAC 13080 

CTTAGTTTCA AAAAAGGTGT TATCTTTGAG GGTGAATTTT TTAACAGAAG GGCTAAGAGT 13140 

C7AATCGTAA CGACAATTTT TTAACTGAAT GATTTTTTCA AATGCCATAT GGCTAACCTC 13200 

CGATAATTTC TTTTAAGGTT TTTGCGAGGG TTTGTAGGTC TTCAACGGTA TTTTCTGGCG 13 260 

ACAAACTGAT GCGAAGGGAT TCCTTCAAGC GTTCTGAATT TCCGCCATAC ATGGCTTCAA 13 3 20 

GAACATGGCT GGATTGGACA ACGCCTGCAG TACAGGCTGA GCCAGTAGAG ATTGAAATTC 13 3 80 

CAGCTAAATC TAGCCGAAGG AGTAAGAGGT CATTTTTCTG ACCAGGAAA7 CCAATATTGA 13440 

GAACATAAGG GAGATGATGT TTTCCTCTAT TCAGGTAATA CTGAA7CCCC TCCAGCTCTG 13500 

CCAGAAAGGC AGTTTCTAGA TTTTGTACAT GTTGAAAATG TTCTTCTTGT TTTTCTAGCT 13560 

CTTCTTTTAG GGCTGCAACC ATCCCTACAA TGGCAGGCAG ATTTTCAGTT CCTGCACGTT 13620 

TTTTCTGTTC CTGGTCTCCG CCATGTAGAT AGGAATCAAA GTCCATGCTA GATGCGTAGA l3 680 

GAAAACCGAT TCCCTTAGGA CCATGGAATT TGTGGGCAGA AGCACTGAGA AAATCAATGC 13 740 

CCAATTCTTC TGAATGAAT7 GGGATTTTAC CAATAGCCTG AACTGCATCA ACATGATAGG 13800 

CAGCAGGGTG TTGCTTGAGT ATTTGGCCAA TTTCAGCGAT GGGCAGTAGG TTTCCTGTCT 13 860 

CA7TATTGAC AAACATGGTA GAAACCAAAA TCGTATCG7C ACGTAAAGCC TTTTGAATTT 13 920 

GCTCGGC7GT GATTTCTTGA TTTTCTGGCT GGATAA7GG7 7GCTTCAAAC CCAAAGTCT7 139 8 0 

GAACCAAGTA ATCAATTG77 TCAAGGACAG CA7CG7CC7C GA7GGCAGT7 G7GATGATAT 14040 
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CTTTTCCTTC 


T7CT7GGTCA 


CGAAGACAG7 


ACCCAATGA7 


GGTAG7A77A 


TTGCCTTCAG 


14100 


TCCCACCAGA 


AGTGAAAAAG 


ATATGTTGAG 


GTTTTGTCCT 


TAGTAACTGG 


GCTAGTTCCT 


14160 


GACGGGCTTC 


TCGCAAGAGT 


TTGCCAGCTT 


GACGACCATG 


ACCATGAATA 


CTAGAAGCAT 


14220 


TTCCGTGCCT 


TTCTTGCATA 


ACCTTGGTCA 


TAGCTGAAAT 


AGCAACTGCT 


GACATAGGAG 


14280 


TCGTTGCAGC 


ATTGTCCAAA 


TAAATCAAAG 


AATCACCTTA 


TTTCTTTTTA 


TTGTAGGCAA 


14340 


~ agagtgggct~ 


GACTGGTTTT 


CTTTCGTCAA 


TACGGACGAT 


AGCATCACCA 


ATTAACTCAC 


14400 


TACCAG7GA7 


GTAGCATACA 


TTTTTAGGAG 


*f»^*p 777 G77 7 


TGTTGCTACT 


GAATCAGTCA 


14460 


CAAGAATTTC 


TTTAATATTA 


GTAT7CTCAA 


GAAGCTCAGC 


AGCTCCCTCG 


ACGAAGAGAC 


1452C 


CGTGGCTAGA 


AACAGCATAA 


ATTTCTG7AG 


CTCCTTCACG 


TTCAACGATT 


TTAGAACCTT 


14580 


CAGAGAAGGT 


ACGTCCTGTA 


TTTAAAATAT 


CATCAATCAA 


GATAGCTTTC 


TTACCTTCAA 


14640 


CATCACCAAT 


AATATAACCT 


TCGTTACGAG 


TTGCATCGTC 


TTGAGGGTAG 


7CGA7AA7GG 


14700 


CGATAGGAGC 


ATCAAGATAT 


TCAGCCAGGC 


TACGCGCACG 


TTTGACACCT 


CAATTTTTAC 


14760 


CGCTAACCAC 


AACAACATCT 


GAACCAAGCA 


ATCCTT7ATC 


GCAC7AATG7 


777GCGAA7A 


14320 


GGGGAACAGT 


GAAAAGATTA 


TCCAC7GGAA 


TATCAAAGAA 


ACCTTGAACC 


TGAACCGCAT 


14880 


GCAAATCAAG 


AGTCAGGATA 


CGATCAACTC 


CAGCCTTAAC 


CAGCATATTG 


GCAACTAGTT 


14940 


TTGCTGTAAG 


TGGCTCACGA 


GGACAAGCAA 


TCCCGTC77G 


ACC7CCATAG 


CCAAAA7A7G 


15000 


GAAGGACAAC 


CTTGATACTG 


7GGGCACTTC 


CACGCACACA 


AGCATCGACC 


A7GATTAACA 


15060 


ATTCCATTAG 


G7CCTTG7TG 


ACAGGGAAAC 


77C77CATTG 


GATGATGTAA 


ACATCATAAC 


15120 


CACGGACACT 


7TCT7CGA7A 


TTTACTTGGA 


777C7CCC7C 


7GAAAA77GA 


CG7GA7GATA 


15130 


GTTTTCCAAC 


TGGGACACCA 


ACAGCTTGCG 


CAATTTTTTG 


TGCAATCTCT 


7CG7TAGAGT 


15240 


7GAGTGCCAA 


AAGTTTCA7G 


TTT7TTC7A7 


CTGACATTA7 


AGACCGTCCT 


CTGTAAAC77 


133CO 


TATAAATCCT 


At» TIAlAl * . 




TATGAACTGG 


GATTTGTGTA 


T7777 A7C77 


15360 


TTCTATTTTA 


CCAAAAAATG 


GAGATTATTT 


CAGCTATTTT 


TCATACTTTT 


GACAAA7CGA 


15420 


ACCAAT77TG 


AAGCAGCTTT 


TTGATAGGAA 


ATCTGATTTT 


7C7C7AAAAA 


77C7CGAAAA 


15490 


7CC7GT77GC 


CTTCCTCATG 


ATTTTCCACT 


TCAAGCTCCA 


ATTCGTAATC 


TG77ATATCA 


15540 


AAGTATCGGC 


7C7GATCCAG 


TGCCATGAGA 


CCAATAGCTG 


TTTTCATTTC 


A7AGCGAAGC 


15600 


GTTGTTAGAC 


AACCAAGAAC 


CTGCCAGT7C 


TTACTTTGGA 


TACCATGTTT 


CGCCAATTCA 


1 5 6 6 C 


TCCAGTACTA 


CCCC7TGACG 


AAGTTCTTCC 


7TAC7CAGAT 


AGTTCTCAGC 


ATC77T7AGT 


15720 


TGCAATTTTT 


GG77GTA7TC 


CATGTTTCCA 


ACAC7CTGCG 


CGACT77GAG 


TG7CAACTCA 


15780 
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GCCCAGTCTT CAAAGCTTCG AATCCGCATA GCGACTTTCT TWS 'CAG TTCAAAATCA L5340 

GGCGTGTCGA TGTAGTAATT TGTTTGAAGA ACAGGAGTGA CACCTGTGAA CTGGTCTTTT 15900 

AGACGATTGT ATTCATCTTT TTTCAATAGT GTTTTCAATT CAATTTCTAA ATGTTTCATT 15960 

TTTCTTACCT TTTTTTATCG TTGAAAGCGG ATTTATCGTA TAATAAGCAT TGTATTTATT 16020 

GTATATGAAT CTGGAGAAAA AATCAAAGAT ATTTTTGACG GATAATATGA GAACAAGGGA 16080 

GAATATATGA CCTTAGAATG GGAAGAATTT CTAGATCCTT ACATTCAAGC TGTTGCTGAG 16140 

TTAAAGATTA AACTTCGTGG TATTCGTAAG CAATATCG7A AGCAAAATAA GCATTCTCCA 16200 

ATTGAGTTTG TGACCGGTCG AGTCAAGCCA ATTGAGAGCA TCAAAGAAAA AATGGCTCGT 162 60 

CCTGGCATTA CTTATCCGAC CTTGGAACAC GATTTGCAGG ATATTCCTGC CTTACGTGTG 16 3 20 

ATGGTTCAGT TTGTAGATGA CGTCAAGGAA GTAGTGGATA TTTTGCACAA GCGTCAGGAT 163 80 

ATGCGAATCA TACAGGAGCG AGATTACATT ACTCATAGAA AAGCATCAGG CTATCGTTCC 16440 

TATCATGTGG TAGTAGAATA TACGGTTGAT ACCATCAATG GAGCTAAGAC TATTTTGGCA 16500 

GAAATTCAAA TTCGTACTTT GGCCATGAAT TTCTGGGCAA CGATAGAACA TTCTCTCAAC 16560 

TACAAGTACC AAGGGGATTT CCCAGATGAG ATTAAGAAGC GACTGCAAAT TACAGCTAGA 16620 

ATCGCCCATC AGTTGGATGA AGAAATGGGT GAAATTCGTG ATGATATCCA ACAAGCCCAG 15680 

GCACTTTTTG ATCCTTTGAG TAGAAAATTA AATGACGGTG TAGGAAACAG TGACGATACA 167 40 

GATGAAGAAT ACAGGTAAAC GAATTGATCT GATAGCCAAT AGAAAACCGC AGAGTCAAAG 16800 

GGTTTTG7AT GAATTGCGAG ATCGTTTGAA GAGAAATCAG TTTATACTCA ATGATACCAA 16860 

TCCGGATATT GTCATTTCCA TTGGCGCGGA TGGTATGCTC TTGTCCGCCT TTCATAAGTA 16920 

CGAAAATCAG CTTGACAAGG TCCGCTTTAT CGGTCTTCAT ACTGGACATT TGGGCTTCTA 169 80 

TACAGATTAT CGTGATTTTG AGTTGGACAA GCTAGTGACT AATTTGCAGC TAGATACTGC L7 040 

GGCAAGGGTT TCT7ACCCTG TTCTGAATCT GAAGGTCTTT CTTGAAAATG GTGAAGTTAA 17100 

GATTTTCAGA GCACTCAACG AAGCCAGCAT CCGCAGGTCT GATCGAACCA TCGTGCCAGA 17 160 

TATTGTAATA AATGGTGTTC CCTTTGAACG TTTTCGTGGA GACGCGCTAA CAGTTTCGAC 17 220 

ACCGACTGGT AGTACTGCCT ATAACAAGTC TCTTGGCGGT GCTCTTTTAC ACCCTACCAT 172 80 

TGAAGCTTTG CAATTAACGG AAATTGCCAG CCTTAATAAT CGTGTC7ATC GAACACTGGG 17 3 40 

CTCTTCCATT ATTCTGCCTA AGAAGGATAA GATTGAACTT ATTCCAACAA GAAACGATTA 174 00 

TCATACTATT TCGGTTGACA ATAGCGTTTA TTCTTTCCGT AATATTCAGC GTATTCAGTA 174 60 

TCAAATCGAC CATCATAAGA TTCACTTTGT CGCGACTCCT AGCCATACCA GTTTCTGGAA 17 5 20 

CCGTGTTAAG GACGCCTTTA TCGGCGAGGT GGATGAATGA GGTTTGAATT TATCGCAGAT 17530 
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GAACATGTCA 


AGGTTAAGAC 


CTTCTTAAAA 


AAGCACGAGG 


TTTCTAAGGG 


ATTGCTGGCC 


17640 


AAGATTAAGT 


TTCGAGGTGG 


AGCTATTCTC 


GTCAATAATC 


AACCGCAAAA 


TGCAACGTAT 


17700 


CTATTCCACG 


TTGGAGACTA 


CGTTACCATT 


GACATTCCCG 


CTGAGAAAGG 


CTTTGAAACC 


17760 


TTGGAGGCTA 


TTGAGCTTCC 


ATTAGATATT 


CTCTATGAGG 


ATGACCACTT 


TCTAGTCTTG 


17820 


AATAAACCCT 


ATGGAGTGGC 


TTCTATTCCT 


AGTGTCAATC 


ACTCTAATAC 


CATTGCCAAT 


17880 


TTTATCAAGG 


GTTACTATGT 


CAAGCAAAAT 


TATGAAAATC 


AGCACGTTCA 


CATTGTTACC 


17940 


AG ACT AG AT A 


GGGATACTTC 


TGGCTTGATG 


CTCTTTGCCA 


AGCACGGTTA 


TGCCCATGCA 


13000 


CG ATT AG AC A 


AGCAGTTGCA 


GAAGAAATCT 


ATCGAGAAAC 


GCTACTTTCC 


TTTGGTTAAG 


18060 


GGAGATGGAC 


ATTTGGAGCC 


AGAAGGGGAA 


ATTATTGCTC 


CGATTGCGCG 


TGATGAAGAT 


13120 


TCCATTATTA 


CCAGACGAGT 


GGCTAAAGGC 


GGAAAGTATG 


CCCATACTTC 


ATACAAGATT 


13180 


GTAGCTTCTT 


ATGGAAATAT 


TCACTTGGTC 


TATATTCACC 


TGCACACTGG 


TCGAACCCAT 


13240 


CAAATCCGAG 


TCCATTTTTC 


TCATATCGGT 


TTTCCTTTGC 


TCGGAGATCA 


TTTCTATGGT 


13300 


GGTACTCTGG 


AAGATGGTAT 


TCAACGTCAG 


GCTCTGCATT 


GCCATTACCT 


ATCCTTTTAT 


13360 


CATCCATTTT 


TAGAGCAAGA 


CTTGCAGTTA 


GAAAGTCCCT 


TGCCGGATGA 


TTTTAGTAAC 


13420 


CTTATTACCC 


AGTTATCAAC 


TAATACTCTA 


TAAAAACTGT 


CTCAGAGTAT 


AATTATTATC 


13480 


TTAAAGGAGA 


AAACTCATGG 


AAGTTTTTGA 


AACTCTCAAA 


GCCAACCTTG 


TTGGTAAAAA 


18540 


TGCTCGTATC 


GTTCTCCCTG 


AAGGGGAAGA 


GCCTCGTATT 


CTTCAAGCAA 


CAAAACGCTT 


18600 


AGTAAAAGAA 


ACAGAAGTGA 


TTCCTGTTTT 


GCTTGGAAAT 


CCTGAAAAAA 


TTAAAATTTA 


18660 


TCTTGAAATT 


GAAGGAATCA 


TGGATGCTTA 


TGAGGTCATC 


GACCCTCAAC 


ATTATCCTCA 


18720 


ATTTGAAGAA 


ATGGTTTCTG 


CCTTGGTGGA 


GCGTCGCAAG 


GGCAAAATGA 


CTGAAGAAGA 


13780 


TGTACGCAAG 


GTTTTGGTTG 


AACATGTCAA 


CTACTTTGGT 


GTGATGTTGG 


TTTACTTGGG 


13840 


CTTGGTTGAT 


GGAATGGTGT 


CAGGAGCGAT 






TTCGCCCAGC 


18900 


TCTACAAATC 


ATCAAAACTC 


GTCCAAATGT 


AACTCGTACT 


TCAGGAGCCT 


TCCTCATGGT 


18960 


TCGTGGTACG 


GAACGTTACC 


TATTTGGAGA 


CTGTGCCATT 


AACATCAATC 


CAGATGCAGA 


19020 


AGCCTTGCCT 


GAAATTGCCA 


TCAACTCAGC 


AATCACACCT 


AACATCTTTG 


GCATCGAACC 


19080 


TAAAATTGCC 


ATGTTGAGCT 


ATTCTACTAA 


AGGTTCAGGG 


TTTGGTGAAA 


GCGTTGATAA 


19140 


GGTCGTTGAA 


GCAACTAAAA 


TTGCTCACGA 


CTTGCGTCCT 


GACCTTGAAA 


TCGATGGTGA 


19200 


GTTGCAATTT 


GATGCAGCCT 


TTGTTCCTGA 


AACTGCAGCT 


CTGAAAGCTC 


CTGGAAGTAC 


19260 


GGTACCTGGT 


CAAGCAAATG 


TCTTCATCTT 


CCCAGGTATC 


GAGGCAGGAA 


ATATTGGTTA 


19320 
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CAAGATGGCT GAACGCCTGG GTGGCTTTGC GGCTGTAGGA CCTGTTTTGC AAGGTTTAAA 19 3 80 

CAAGCCAGTT AATGATCTTT C7CGTGGATG TAATGCAGAT CATCTTTACA AGTTGACCCT 194 4 0 

CATCACAGCA GCTCAAGCAG TTCATCAATA GTGAAAACTA TAAAGTGATA TACTATGCTA 19500 

TACTGTAGTT ATGAAACTAT GTACGAAAAG CACTGCCATT AATTCCTGAG AACTAAATTA 195 60 

CTGATTGGTG TCAAAAAGGA AAACTTCCAA GCGATGATAT CCTGTCTATA CACGACCTAT 19620 

AGAAATCTGT AATATACATA TCCGTAAAAC GATAAATTCC CTTTTTGATT TTAAATGAGT 196 80 

ATGAAAAGAG AATTTTTTGG CTCTTTGTCA ACTGTAGTGG GTTGAAGAAA AGCTAAGCTC 19 740 

GAGAAAGGAC AAATTTCATC CTTTCTTTT7 TGATATTCAG AGCGATAAAA ATCCGTTTTT 19800 

TGAAGTTTTC AAAGTTCCGA AAACCAAAGG CATTCCCCTT GATAAGTTTC ATGAGATTAT 19860 

TGGTCCCTTC CAGTTTGGCG TTAGAATAGT GTAGTTGAAC GGCGTTCATA ATCTTTTCTT 19 920 

TATCTTTGAG GAAGCTTTTA AAGACAGTCT GAAAAATAGG ATGAACCTGC TTAAGATTGT 19980 

CCTCAATAAG TCCGAAAAAT TTCTCTGGTT CCTTATTCTG GAAGTGAAAA AGCAAGAGTT 2 0040 

GATAGAGCTG ATAGTGGTGT TTCAAGTCTT CCGAATAGCT CAAAAGCTTG TTTAAAATCT 20100 

CTTTATTGGT TAAGTGCATA CGAAAAATAG GACGATAAAA TCGCTTATCA CTCAGTTTAC 20160 

GGCTATCCTG TTGAATGAGT TTCCAGTAGC GCTTGATAG 20199 

12) INFORMATION FOR SEQ ID NO: 7: 

<i> SEQUENCE CHARACTERISTICS : 

(A> LENGTH: 19702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ACCCGATGTA TCAGCGGATA TTTACTCTAT TTTTCAAACG ATGTTATACC CACAATAAAA 60 

GAAAAAAGAC CCTAAGGTCT CCT7TGCTTT TATTATTAAA CGCGTTCAAC TTTACCTGA? 120 

TTCAAAGCAC GAGCTGAAGC CCAAACTTTT T7AGCTTTAC CA7CGATAAG AACAGTAACT 18 0 

TTTTCAAGGT TTGGTTTTAC GGCACGTTTT CTTTGC7TCA TCGCGTGTGA ACGGTTGTTT 2 40 

CCTGATACAG TCTTACGACC TGTAAAGTAA CATACTTTAG CCATTCTGTT TTCCTCCTA7 300 

TAGATCTAAT ATAGCGGATG TGCTAGCACC ACATACCGTA CTA7CTTATC ACATTTTCTT 3 60 

CTTTTTTGCA AGGGAATTGG AAGATTTT7T ATTTG7GTCT 7AAA7CAGGT CTTGCGTGAC 420 

ATTTcTGCTC TCCACA7GCC ATCGTTGATT AACAGAACAC CAGAATTAAA AT7ATGTG7A 4 80 

TAAAAATCAT CTC7AACTGC AGCTAAGGGT ATAGCCCTCA AGTCCAAATC CCACAGCTCA 540 
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TCTATCGATT TTCTTACAAC 
TACTTTGGAA TAAAATACCT 
AACCTTTCAG AATATTACTG 
GTCTTTTTTC CATCAATTGG 
TAAGATTATA ATGATGAACA 
CATTATCTGC ACCTAAGACA 
TTTATTCCCA T CAT ATT ATT 
TATTTTCGCC TATTCGTTCG 
CGGTTTACAA TTTTTACATT 
TAGTTAGTGA GGCAGTTAGC 
TTTAGTTCCA ATTCTTGGTA 
TGTCCGCAAA CCAGCGGAGG 
GGTCTTTGGC TAAGATATTG 
TCCAACGAAC GGTCTTTTTA 
CCATGCGGTG TTTAAAGACT 
CTGTTTGGTA ATTCTTATAA 
TGTCCAACCA TTTTTCCTTC 
TAAAGGTTGG CAGGGGTTCA 
CCTGATAAGT ACCGGTATCG 
7CTCACGACT CTCCGCCATA 
GACGGAGATT GACACTCATG 
TACGCTCACG GTGACGAGGG 
CCTTGTCATA AGGATCCACA 
CAAACTTGAG GAAGGTTTCA 
AAAAGACAGG CGTCAATTCT 
CATTTAAAAG CTCAATGTCA 
TGTCCCCGTC TTCTAGACTG 
TATAGAGGTC ATACAAGCCC 
AGCTAGCAAT GCCCAAGATT 



a95 

AATATCTGAA TCCAAATACA GTACACGAGA CTCGCTTACA 600 

AAAAAAGCCG CATATGAAAG TCCCTCAAAG GGGAGACGAT 660 

TCAATCTAAA CATTCACAAT CTCACTATTC AAAGTCTCTA 720 

AACCATTCTC GCGGAAGGTC ATCATTAAAA ACATAAAACT 780 

CAAAGAGATT TTATTGTTGT TTCAACTTTA TCCATATAAG 840 

ATCCCTTTTT TCTCTTCTTT CACTTTTTAT CTCATTTCTT 900 

CCCATCATAT GTTTCCCATC ATATGTTTCT ACGTAACCAT 960 

TAAAACCATA CCAGTGGAGA TTTTAGATGA AGTCCCATTA 1020 

ACGACACGGA GTTTTACAAA TCGATTTCAT TTGCCAAACG 1080 

TAGTTCCCCA AATAGCGACT AGCGTCCAAC AATTTGGAAC U4C 

CTGAGTCACA TCTTCTCCTC TAACTCTACG TCTGGATACT 1200 

GCAAAGTCAT TTTCAAAGAG AAAGACTGGT TGGTCAAAAC 1260 

CCACTTGACC ACATCCGTTC ATCCAAGTCC TCAGGCTTGA 1320 

CCCATTGGGT TCATAACTAC TTCCGCATTG TACTCCCCTT 1390 

TCAAACTGGA GTTGACCTAC AGCGCCTAGC ATGTACTCAC 1440 

AGCTGAACGG CTCCTTCTTG CACCAATTCC TCAATCCCCT 1500 

ATAACATTCT TAGCAGAAAC TTTCATGAAA ATCTCAGCTG 1560 

AATTCAAACT TGTTTTTTCC AACCGTCAAG CTATCCCCAA 1520 

TAAACCCCGA TAATATCACC TGCCACGGCA TTGGTCACAT 1680 

AACTC-GGT \.\ CATTAGATAG T7*?A0CC"7~ 77A7~AGTAC 174C 

CCCCGCTCAA ATTCGCCAGA TACGATACGG ACAAAGGCAA 1900 

TCCATGTTGG CTTCCATTTT AAAGACAAAG CCTCAGAAAT 18 60 

ATTTCACCGT CTGTTTTCTT GTGACCATGT GGTTCTGGAG 19 20 

AGGAAGGTCT GCACACCAAA GTTTCTCAGG GCTGAACCGA 1980 

CCAGCCAGAA TAGCTTCCTC 7GAAAACTCA 7TCCCGGCTT 204 C 

TCCTTGACTT GCTCGTAGAA AGGATTCCTA CCAAACAGTT 2100 

GCAAAACGCT CATCCCCTTT GTAAAGCTCT AAACGTTGGT 2160 

TCAAAGGCTT TCCCCATCCC CATAGGCCAC T7CATAGCCT 2 2 20 

TCTTCCAATT CTTGCAAGAG ATCCAAAGGC TCACGACCGT 2 280 
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CACGGTCCAG CTTGTTCATA AAGGTAAAGA CTGGAATCCC ACGATGTTTC ACAACCTCAA 2 340 

ACAATTTCTT GGTTTGAGCC TCGATCCCCT TGGCAGAGTC CACGACCATG ACCGCAGCAT 24 00 

CCACCGCCAT CAAGGTACGA TACGTATCTT CTGAGAAGTC CTCGTGCCCT GGCGTGTCTA 2 4 60 

AGATATTCAC GCGCTTGCCG TCGTAGTCAA ATTGCATAAC AGATGAAGTA ACAGAAATCC 2 52 0 

CACGTTGCTT CTCGATATCC ATCCAGTCAG ATTTAGCAAA AGTCCCTGTT TTCTTCCCTT 2530 

TTACCGTACC ACCCTCACGA ATCTCACCCC CAAAGTAGAG TAACTGCTCA GTGATCCTTG 2 64 0 

TTTTCCCCGC GTCCGGGTGG GAGATAATGG CAAAGGTACG ACGTTTCTTA ATTTCTTCTT 2 700 

GAATATTCAT AAGTTCTCTT TCTTTGATTC TCTATTTTTC TTGTTTCAAT AGCTGAGAAT 27 60 

GATTTTTACA TTGGATTTTA CCATTCCTTT CAACACTCCA TTATATCCGA TTTTAGCATT 2 820 

TTTTTCAATT TCTATTTCTT TTCACTTCCC CCTCCCTTAT TTATAGGAAA ATATGGTAAA 2 880 

ATAGAACACA CTAAAAATCA TCATTTCACG AAAGGATGCA AGATGAAAAT TACGCAAGAA 2 94 0 

GACGTAACAC ACGTTGCCAA TCTTTCAAAA TTAAGATTCT CTGAAGAAGA AACTGCTGCC 3000 

TTTGCGACCA CCTTGTCTAA GATTGTTGAC ATGGTTGAAT TGCTGGGCGA AGTTGACACA 3060 

ACTGGTGTCG CACCTACTAC GACTATGGCT GACCGCAAGA CTGTACTCCG CCCTGATGTG 3120 

GCCGAAGAAG GAATAGACCG TGATCGCTTG TTTAAAAACG TACCTGAAAA ACACAACTAC 3180 

TATATCAAGC TGCCAGCTAT CCTAGACAAT GGAGGAGATG CCTAATGACT TTTAACAATA 3240 

AAACTATTGA AGAGTTGCAC AATCTCCTTG TCTCTAAGGA AATTTCTGCA ACAGAATTGA 3 3 00 

CCCAAGCAAC ACTTGAAAAT ATCAACTC7C GTCACGAACC CC7CAATTCA TTTGTCACCA 3 3 60 

TCGCTGAGGA GCAAGCTCTT GTTCAAGCTA AACCCATTGA TGAAGCTGCA ATTGATGCTG 3 4 20 

ACAATGTCCT TTCAGGAATT CCACTTGCTG TTAAGGATAA CATCTCTACA GACGGTATTC 3 4 80 

TCACAACTGC TGCCTCAAAA ATGCTCTACA ACTATGAGCC AATCTTTGAT GCGACAGCTG 3 540 

TTGCCAATGC AAAAACCAAG GGCATGATTG TCGTTGGAAA GACCAACATG GACGAATTTG 3 600 

CTATGGGTGG TTCAGGTGAA ACTTCACACT ACGGAGCAAC TAAAAACGCT TGGAACCACA 3 660 

GCAAGGTTCC TGGTGGGTCA TCAAGTGGTT CTGCCGCAGC TGTAGCCTCA GG AC AAGTTC 3720 

GCTTGTCACT TGGTTCTGAT ACTGGTGGTT CCATCCGCCA ACCTGCTGCC TTCAACGGAA 3780 

TCGTTGGTCT CAAACCAACC TACGGAACAG TTTCACGTTT CGGTCTCATT GCCTTTGGTA 3840 

GCTCATTAGA CCAGATTCCA CCTTTTGCTC CTACTGTTAA GGAAAATGCC CTCTTGCTCA 3 900 

ACGCTATTGC CAGCGAACAT GCTAAAGACT CTACTTCTGC TCCTG7CCGC ATCGCCGACT 3 9 60 

TTACTTCAAA AATCGGCCAA GACATCAAGG GTATGAAAAT CGCTTTGCCT AAGGAATACC 4020 

TAGGCGAAGG AATTCA7CCA GAGGTTAAGG AAACAATCTT AAACGCCGCC AAACACTTTG 4080 
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AAAAATTGGG TGCTATCGTC GAAGAAGTCA GCCTTCCTCA CTCTAAATAC GGTGTTGCCG 4140 

TTTATTACAT CATCGCTTCA TCAGAAGCTT CATC AAA C7T GCAACCCTTC GACGGTATCC -12 00 

GTTACCGCTA TCGCGCAGAA GATGCAACCA ACCTTGATGA AATCTATGTA AACAGCCGAA 4 2 60 

GCCAACGTTT TGGTGAAGAG GTAAAACGTC GTATCATGCT GGGTACTTTC AGTCTTTCAT 4 3 20 

CAGGTTACTA TGATGCCTAC TACAAAAAGG C7GCTCAAGT CCGTACCCTC ATCATTCAAG 4 380 

ATTTCGAAAA AGTCTTCGCG GATTACGATT TCATTTTCGG TCCAACTCCT CCAAGTGTTG 4 44 0 

CCTATCACTT GGATTCTCTC AACCATGACC CAGTTGCCAT GTACTTAGCC GACCTATTGA 4 50C 

CCATACCTGT AAACTTGGCA GGACTGCCTG GAATTTCGAT TCCTGCTGGA TTCTCTCAAG 4 5 60 

GTCTACCTGT CGGACTCCAA TTGATTGGTC CCAAGTACTC TGAGGAAACC ATTTACCAAG 4 6 20 

CTGCTGCTGC TTTTGAAGCA ACAACAGACT ACCACAAACA ACAACCCGTG ATTT7TGGAC 4 680 

GTGACAACTA ATGAACTTTG AAACAGTCAT CGGACTTGAA GTCCACGTAG AGCTCAACAC 4 7 40 

CAATTCAAAA ATCTTCTCAC CTACTTCTGC CCACTTTGGA AATGACCAAA ATGCCAACAC 4 300 

TAACGTGATT GACTGGTCTT TCCCAGGAGT TCTACCAGTT CTCAATAAAG GGGTTGTTGA 4860 

TGCCGGTATC AAGGCTGCTC TTGCCCTCAA CATGGACATC CACAAAAAGA TGCACTTTCA 4 92 0 

CCGCAAGAAC TACTTCTATC CTGATAACCC CAAACCCTAC CAAATTTCTC AGTTTGATGA 4980 

ACCAATCGGA TATAATGGCT GGATTGAAGT CAAACTAGAA GACGGTACGA CCAAGAAAAT 5040 

CGGTATCGAA CGTGCCCACC TAGAGGAAGA CGCTGGTAAA AACACCCATG GTACAGATGG 5100 

CTACTCTTA? GTTGACCTCA ACCGCCAAGG GGtTCCCTTG ATTGAGATTG TATCTCAGGC 5160 

AGATATGCC7 TCTCCTGAAG AAGCCTATGC TTATCTGACA CCCCTCAAGC AAGTTATCCA 5 2 20 

GTACCCTGGC ATTTCTGACG TTAAGATGGA GGAAGGTTCG ATGCGTCTGG ATGCCAACAT 52 80 

CTCCCTTCGT CCTTATGGTC AAGAGAAATT CGGTACCAAG ACTGAATTGA AGAACCTCAA 5 3 40 

CTCCTTCTCA AACGTTCGTA AAGGTCTTGA ATACGAAGTC CAACGCCAGG CTGAAATTCT 5400 

TCGCTCAGGT GGTCAAATCC GCCAAGAAAC ACCCCGTTAC GATCAAGCGA ATAAAGCAAC 54 60 

CATCCTCATG CCTCTCAAGG AAGGGGCTGC 7GACTACCGC TACTTCCCAG AACCACACC? 5 52 0 

ACCCCTCTTT GAAATTTCTG ACGAGTGGAT TGAGGAAATG CGGACTGAGT TGCCAGAGTT 5580 

TCCAAAAGAA CGTCGTGCGC GTTATGTATC TGACCTTGGT TTATCACACT ACGATGCTAG 5 64C 

TCAGTTGACT GCTAATAAAG TCACTTCTGA CTTCTTTCAA AAAGCTGTTG CCCTAGGTGG 5700 

TCATGCCAAA CAAGTCTCTA ACTGGCTCCA AGGGGAAGTC GCTCAGTTCT TGAATGCTGA 57 60 

AGGTAAAACA CTGGAACAAA TCGAATTGAC ACCAGAAAAC TTGGTTGAAA TGATTGCCAT 5820 
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CATCGAACAC GCTACTATTT CATCTAAGAT TCCCAAGAAA GTCTTTGTCC ATCTAGCTAA 
AAATGGCGGT GGCCCGCGTG AATACGTGGA AAAAGCAGGT ATGGTTCAAA TTTCAGATCC 
AGCTATCTTG ATCCCAATCA TCCACCAAGT CTTTGCCGAT AACGAAGCTG CTCTTGCCCA 
CTTCAAGTCA GGCAAACGTA ACGCCGACAA GGCtTTACAG CATTCCTTAT GAAGGCAACC 
AAAGGCCAAG CCAACCCACA AGTTGCCCTT AAACTACTTG CACAGCAATT GGCGAAGTTG 6120 
AAAGAAAACT AGACAGAACA AAACCAGCCC TAAGGTTGGT TTTTTCTTCT CTACCAACTC 6180 
CCAATAACTA TTTTGGCTTT ATTTCCAGAG TATTTTATGG TAAAATGAAG AGTAATAATA 
TTTATTAAAG AGCTAAAAAC ATGATTGAAG CAAGTACCTT AAAAGCTGGT ATGACCTTTG 
AAACAGCTGA CGGCAAATTG ATTCGCGTTT TGGAAGCTAG TCACCACAAA CCAGGTAAAG 
GAAACACGAT CATGCGTATG AAATTGCGTG ATGTCCGTAC TGGTTCTACA TTTGACACAA 
GCTACCGTCC AGAGGAAAAA TTTGAACAAG CTATTATCGA GACTGTCCCA GCTCAATACT 
TGTACAAAAT GGATGACACA GCATACTTCA TGAATACAGA AACTTATGAC CAATACGAAA 
TCCCTGTAGT CAATGTTGAA AACGAATTGC TTTACATCCT TGAAAACTCT GATGTGAAAA 
TCCAATTCTA CCCAACTGAA CTGATCGGTG TCACCGTTCC TACTACTGTT GAGTTGACAC 
TTGCTGAAAC TCAACCATCT ATCAAAGCTG CTACTGTTAC AGGTTCTGGT AAACCAGCAA 
CGATGGAAAC TGGACTTGTC GTAAACGTTC CAGACTTCAT CGAAGCACCA CAAAAACTCG 
TTATCAACAC TGCAGAAGGA ACTTACGTTT CTCGTGCCTA ATCTCTAGAA AGAGGTCATT 
CTATGGGAAT TGAACAACAA CTTCCCGAAA TCGTTATCGC CCCACGTC7A CTTGAAAAAA 
TCATTGCTAT CGCTACTGCA AAGGTAGAGG GTCTTCACTC TTTTT C AAAC AGATCACTC7 
CTGATACCCT TTCAAAACTT TCACTCGGCC GTGGCATTTA 7CTTAAAAAC GTGGACGAAC 
AACTCACAGC AGATATCTAT CTCTACCTTG AGTACGGAGT AAAAGTTCCT AAGGTAGCGG 
TTGCTATCCA GAAAGCTGTC AAAGATGCCG TCCGTAATAT GGCTGATGTA GAACTCGCTG 
CTATCAATAT TCACGTTGCA GGTATCGTCC CAGATAAAAC ACCAAAACCA GAATTGAAAG 
ATCTATTTGA CGAGGACTTC CTCAATGACT AGTCCACTAT TAGAATCTAG ACGCCAACTC 
CGTAAATGCG CTTTTCAAGC TCTCATGAGC CTTGAGTTCG GTACGCATGT CGAAACTGCT 
TGTCGTTTCG CCTATACTCA TCATCGTGAA GATACGGATG TACAACTTCC AGCCTTTTTG 
ATAGACCTCG TTTCTGGTGT TCAAGCTAAA AAGGAAGAAC TAGATAAGCA AATCACTCAG 
CATTTAAAAG CAGGTTCGAC CATTCAACCC TTAACGCTCG TGGAGAGAAA CCTCCTTCGC 
TTGGGAGTCT TTCAAATCAC TTCATTTGAC ACTCCTCAGC TGGTTGCTGT TAATGAAGCT 
A7CCACCTTC CAAAGGACTT CTCCGATCAA AAATCTGCCC GTTTTATCAA TGGACTGCTC 
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AGCCAGTTTG TAACAGAAGA ACAATAAGGC TC7TTCTCAA CTGTAGTGGG 7TCAAAAAAA 7630 

CCTAACC7CG AGAAACCACA AATTTCGTCC TTTCTTTTTT GATGTTCAAA GCGATAAAAA 774 0 

TCCGTTTTTT GAAGTTTTCA AAGTTTCCAA AACCAAAGGC ATTGCCCT7C ATAAGTTTGA 7300 

TGACATTA7T GGTCGCTTCC AGTTTGGCAT TAGAATAGTG TAGTTGAAGG GCCTTCACAA 7 360 

TCTTTTCTTT ATCTTTGAGG AAGGTTTTAA AGACAGTCTG AAAAATAGGA TGAGCCTGCT 7920 

TAAGATTGTC CTCAATAACT CCGAAAAATT TCTCTGGT7C CTTATTCTGG~"aAGTGAAACA 79 90 

CCAAGACC7C A7AGAGCTCA TAGTGGTGTT TCAAGTCTTG TGAA7GGC7C AAAAGC7TGT 804C 

C7AAAATC7C TTTATTGGTT AAGTGCATAC GAAAAGTAGG ACCATAAAA7 CGCTTATCAC 8100 

TCAGTCTACG GC7A7CCTGT 7GAATGACT7 TCCAGTAGCG CTTGATATCC TTC7A7TCAT 3160 

GGCATTTTCC ATGAAACTGA TTCATGATTT GGACACGCAC ACCACTCATC GCACGGCTAA 8220 

GATGTTGTAC AATGTGAAAG CGA7CAAGAA CCATTTTAGC A7TCCGCAC7 GAAACAGTCT 8280 

GGCAGACTC7 TTCAGCCTGA GCCTAGCAAT TTGAAAGCGA ACCTC77TAC CCAAGTCATA 8340 

GTAACGGCTA AACATA7CCA TAGTAATAAT TTTGACGCCA CA7CGGACAA CTCTATCGTA 8400 

GCGAAGAAAG TGATTTCCAA TGATACCTTG TGTTCTACCC TCAACAACAG TGATGATATT 84 60 

CAGATTCTTA AAA7CTTGCG CAA7GAAGCT CATCTTTCCC 7TTC7AAAAC CATACTCATC 3520 

CCAAGACATA ATCTCAGGAA GACAAGAAAA ATCATGTTTA AAGTGAAAA7 CATTCACCTT 8 580 

ACCAA7AACA GTTGAAGTTG AGATGGAAAG C7CATGGCCA ATATCAGTCA TACAAATCTT 3640 

TTCAATCAAC TTT7GAGCAA TCTTTTGGTT GATGAT A CG A GGCATT7CG7 CATTTTTCTT 3700 

GACGATAGAA GTTTCAGCGA CCA7CA7TT7 7GAACAGTGA TAGCACTTGA ATCGACCC77 37 60 

7CTAAGCAGA ATTCTAC7AG GCATACCAGT CG7T7CAAGA TAAGGAATTT TACAAGCTTT 8820 

TTCAAAGTCA TATT7C7TCA ATTGGTTTCC CCACTCAGGG CAAGATGGGG CGTCGTAGTC 8880 

CAGTTTGGCG ATGATTTCCT TGTGTGTATC CTTATTGATG A7C7CTAAAA 7CTCGA7ATT 3940 

AGGGTCTTTA ATGTCTAGTA ATTTTGTGA7 AAAATGTAAT TCT7CCATAT CAATCTTTCT 9000 

AA7CACT7CT TTTGTCCC77 7TCATTATAG GTCA7ATCGG ACT77TTT7C TACAATAAAA 9060 

TAGCCTCCAT AATATCTA7A GGGGATTTAC CCACTACAAA 7A7TATAGAG CCAACAATAA 9120 

AAAGAAAAAG TG7TTGATAG ATATCAAACA CTTTTTTC77 7GCC7CCCAC 7ATC7AAAAA 9X80 

AA7GA7AATA CA7A7AATTC TAAACAAAAA 7CCAGATAGG TT77GCA7GA 7TGAGAAAG7 9240 

7AAAAAAACT A7GGCAGAGA ATCGTTAA7C TCAGATTCTC GG7AGAACCA TAAACAAGGG 9300 

CAAAAAAGAA ACCAATCAGA C7A7AATATA ATAAAC7AA7 TGCA7CTC7G TG AG AT ACTA 9 3 60 
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TCAAATGCCT AATCCCAAAG ATGATAGCAG ATACGA7AAC ATCCAAATAG TACTTGGACT 34 20 

AGGGAAAGAA CCTATTCATA AAATACCCTC TATCAAGAGT CTCCTCAAAA ACAGGACCGA 9480 

TGATTACAGG CAGGACAAAA GATAAGATAG TCCATAAAAA GGTTGGTTGT CCATTTGAAA 9540 

AAAGCACGGT AAAATACTCA TCATGAATAT TCCTATGATT AATCAAATGA GCATAGCGTG 9600 

CCCAAAAATT ACCGAGAATC TGATAAACCA CATAAGTTGC AASTAACTAG AAGACAAATG 9660 

ACCAGTTCCA GCTCTTTTTC TCAAAGATAA AGACCATCTT TTTCTT TTT T AACCTCCAAA 9720 

TTAATAGAAG GAAACTTCCC ACTAATCCCA TTCTTAAAAT AAGAGAATAG ACATCAGCTC 9780 

CTAACCCTAA AATGATCGTC ACATACAATC CAATTGTTTG TGGTAAATAG GTAGATAGTA 9840 

AAATAATAAG CAAAAATAT? CCAAATTCTC TTAGTTTTTT TGTGTTTCTC ATCCTACTTT 9900 

TTTGAAAGAT TACCCTGCTC GGAAGCCGTA CTTCCAAGCA TCTATATAAC AATTAAGTGC 9960 

CCCTTGCCTC ATATAGGGAG CAAATTCTCT ATAATATAAC CATCTACTAT ATCCATCTTC 10020 

CCAAACAGCA AGACCACCTG AAGTTTGCTC CAAGTCCTCA GTTGAAAGAA CTGTAAATGT 10080 

ATTTGTACCT GTCATTGCAA GTACCTTCTT AAAATAGATT GTTGTAGGCT CACATTTATA 10140 

GTATATTTCT TTTTTTGTCT ATTTTATAGC CCATCTCCTC AACTGGCAAT TTTTCGACCT 10200 

GAATTACATT TTTCCATAAA AAATGAGACC TTTCTAGTCT CATTTAGTCA TTCTTAGTAT 10260 

TTTCTAAATC GTTGATAGCG TTCTTCCAGC AACTCTTCTA GCGGTTTTTG TGAAAGTCTA 10320 

GCCAGCTCCG TTTGGAGTTC TTTTTTGACA CTCT7AATCA GTTCTTTACT AGAAAGTCCT 10380 

ATTTCAGAAA TCACCTTATC CACCACGTCC ATTTCTAACA GTTCATGCGA AGTGATTTTC 10440 

ATCAGTTCTG CTGCTTCCAT AGCGCGAGTA CCGTCCTTCC ATAAAATGGA AGCAAAGCCT 10500 

TCTGGACTGA GAATGGCATA GATAGAATTT TCCAGCATCC AGACACGGTC CGCGACAGCT 10560 

AGAGCCAGAG CCCCGCCTGA ACCACCTTCA CCGATAATAA TGGCGATAAT AGGAACTTTC 10620 

AGGTCACTCA TTTCCATGAG ATTGCGAGCG ATAGCTTCCC CTTGACCACG TTCTTCCGCT 10680 

CCGACACCAG GATAAGCACC TGCTGTATTC ATAAAGGTCA CAACTGGACG GCCAAATTTC 1C7 40 

TCACCCTGTT TCATCAACCG CAGTGCCTTT CGGTAGCCTT CTGGATGTGG TTGGCCAAAA 10800 

TTCCGTTTGA GGTTGTCTTG CAAACTCTTG CCTTTTTCCA TACCAACCAC TGTTACAGCT 10360 

TGGTCTCCAA GCCAACCAAT ACCACCAACA ACTGCACCAT CATCACGAAA AGAACGGTCA 10920 

CCATGTAATT GGATAAATTC AT C AAAAATG CCTGTCGCAA AGTCCAAGGT TGTCAAGCGA 10980 

CTCTGCTCAC GCGCTTCTC7 GACTATTTTT GCAATATTCA TCTAGGACTC CCTCCATGCA 11040 

ATCTGACTAG GCTAGCAATC GTATCTGGTA AGTCTCTTCT TTTGACAATA GCATCCACAA 11100 

AGCCATGT7C TAATAGGAAT TCTGCCTTTT GGAAA7CCTC AGGCAAGCTT TCACGAACCG 11160 



WO 98/18931 



PCI7US97/19588 



201 

TATTTTCAAT CACACGACGC CCAGCAAAAC CAACCAAGCT CTGTGGTTCA GCCAGAATGA L 1220 

TATCGCCTTC CATACCCAAA GAAGCTGTCA CACCACCAC7 CGTTGGATC7 GTCAAAATGG 11280 

TCAGGTAAAA GAGACCAGCA TTTCAATGCC CTTTAACCCC CGCAGAGATC TTACCCATCT 11340 

G CAT GAG ACT CATGATTCCT TCCTGCATAC GGGCTCCACC AGAGGCTGTG AATAGGACAA 11400 

CTGGCAATTT TTCGACAGTC GCATACTCAA ACAAACGAGT GATTTTTTCA CCTACAACCG 114 60 

TACCCATAGA AGCCATGATA AAGTTAGAAT CCATAATCCC AAGAGCCACA GTCTGACCTT 11520 

TAATAAGAGC AGTTCCTGTC ACAACGGCTT CATGCAGACC TGTTTTTTCA CGCATAGATG 11 580 

CCAGTTTCTT TTGGTAACCA GGGAAATGCA AGGGATCCTT GCTTTCAATC CCTGTAAACA 11640 

ATTCTTTGAA GGTTCCCATA TCAATCGTCA AAGCCAAGCG T7CTTGGGCA GAAATACGAA 11700 

AGGTATAGCT ACAGTGCGGA CAGATACGTT CACTTCCCAG ATCCTTCTGA TAGATGGTAT 11760 

GCTTACAGCC TGGACACTCG GAAAATAATT CATCTGGAAC CTCTGGCTTA GCTTGAGGTT 11920 

TTTCCCTAAC CGAACGATTG GGATTGATTC GAATATACTT A7CTTTTTTA CTAAATAGAG 11880 

CCATTGATTC CCCTTTTCGG TTTAAACTCT TAAAGTCATT TTATTCTTTT TCTTGATATT 11940 

TAGGTAAGAA GGTTTCCATC AAGAAGGAAG TATCATAATC CCC AGCAATG ACATTGCGAT 12000 

CTGAAATGAG GTCAAGCTGG AAATCTGCAT TGGTCTGCAC TCCTTCAATT TCTAATTCAT 12060 

AGACGGCACG TTGCATTTTC ATCAAGGCGT CAAAACGATT TTCGCCGTCT ACTATGATTT 12120 

TGGCAATCAT ACTATCATAA TAAGGCGGAA TGGTATAACC TGGATAAACT GCTGAATCCA 12190 

CGCGCAAGCC AACTCCACCA CTTGCCAGAT AGAGAT7AGT AATCTTACCT GG ACT? GG AC 122-10 

CAAAGTTAAA GCCTCGCTTT TCTGCATTGA TACGACACTC GATGGCATGA CCGCGTAGGA 12 300 

CAATATCTTC TTCCTTAACA GACAAAGGCT GACCTGCCGC AATGCAAATC TCTTCCTTAA 12 3 6C 

CGATATCAAC ACCTGAAACA AACTCTGTTA CTGGATCTTC TACCTGAACA CGAGTATTCA 12420 

TCTCCATGAA ATAGAAATTG CTACTTGCTT CATCAAGAAG AAATTCAATG GTTCCTGCAT 12 4 80 

TCTCATAGCC AACAAACTCT GCCGCTCGAA CAGCAGCAGC ACCTATTTCA TGACGCAGCG 1254 0 

TTTTTCCGAT TGCAATCGAG GGACTTTCTT CCAAAACCTT TTGGTTATTC CTTTGAAGAG 12 600 

AACAATCCCG TTCACCCAAG TGAATCACAT GTCCATGCTC ATCACCTAGG ATTTGAACCT 12 660 

CAATGTGCCG AGCTGGATAG ATAACCCGTT CTATGTACAT GGCACCATTG CCATAATTGG 127 2 0 

CCTTCGCCTC ACTAGAGGCA GTTTCAAAGG CAGAAACGAG GTCATCTGGT TTTTCAACCT 127 80 

TACGAATCCC TTTACCACC7 CCACCTCCTG AAGCCTTGAG CATAACAGGA TAGCCAATTT 12 34C 

TTTCAGCAAC AATCAAAGCT TCTTCAGAGT TATGCACTTC TCCATCTGAA CCTGGTATAA . 12 900 
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CAGGCACACC 


TGCTTTAATC 


ATCTGAGCAC 


GCGCATTGAT 


CTTATCCCCC 


ATCATATCCA 


12960 


TAACATCACC 


AGATGGACCG 


ATAAACTTGA 


TACCTACTTC 


TTCACACATG 


GTCGCAAATT 


13020 


TCGAATTTTC 


ACTGAGAAAT 


CCAAAACCAG 


GGTGAATAGC 


TTCTGCCTCA 


GTCAAGACTG 


13C80 


CAGCTGATAG 


AACTGCATTA 


ATATTGAGAT 


AAGACTCTGT 


TGCCTTCCCA 


GGACCAATAC 


13140 


AAACTGCTTC 


ATCTGCCAAA 


AGCGTATGAA 


GAGCTTCCTT 


ATCAGCAGTT 


GAATAAACCG 


132CO 


CTACCGTCGC 


AATCCCCAAT 


7CACGTCCCG 


CACGGATAAT 


ACGAACCGCA 


ATTTCACCAC 


13260 


GATTGGCAAT 


TAAAATTTTT 


CGAAACATGG 


AGAACCTCCT 


TAGTTCCCAA 


TTGCAAAAG? 


13320 


AAGGGTACCA 


CTGGCTGCAA 


GCTTGCCATC 


CACTTCAGCC 


TTTGCTTCAA 


CCACAGCTAT 


13380 


GGTGCCACGA 


CGTTTTACAA 


AAGTCGCTGT 


CATAACCAAT 


TGGTCGCCTG 


GTACAACTTG 


13440 


CTTCTTGAAC 


TTAACCTTGT 


CCATACCAGC 


GTAAAAGACC 


AGTTTTCCTT 


TATTTTCAGG 


13500 


TTTTGATAAC 


TCCAACACAC 


CGGCACTTTG 


CCCCAACGC? 


TCCATAATCA 


CAACACCTGG 


13560 


CATAACTGGG 


TATTGAGGAA 


AGTGGCCGTT 


AAAGAAAGGC 


TCGTTGATGG 


TCACATTTTT 


13620 


GATAGCAACA 


ATGGTATCCT 


CGCTCACTTC 


CAAGACACGG 


TCCACTAGAA 


GCATAGGATA 


13680 


ACGGTGGGGA 


AGAGCTTCTT 


TGATTCCTTG 


AATATCGATC 


ATTTGATACG 


TACCAATCCT 


13740 


TTACCAAACT 


CAACCATTTC 


TTCGTTAGAG 


ACGAGAATTT 


CCGTTACCAC 


ACCATCCTTA 


13800 


GGAGCTGGGA 


TTTC ATTC AT 


GACTTTCATG 


GCTTCGATAA 


TTACCAATGT 


TTGACCTTTT 


13860 


TTGACACTAT 


CACCAACTGT 


AACGAAGGCA 


GGTTTATCTG 


CTCCAGCACC 


CAAGTAAACC 


13920 


ACTCCAACAA 


GTGGACTCTC 


TACAACATTT 


CCCTCAGTAG 


CCACACTTGC 


TTCAGCTGGA 


13980 


GCTGGAACTT 


CTTCTGCTAC 


AGTCTCTGCT 


GGACCAGATG 


TAGGAGCTAC 


TGGACTCGG7 


14 040 


GTTGCTAGAA 


CGGGTGCTGG 


AGCGACTTGA 


GTTGCAACTT 


CAGGCACAGG 


TCTTGCTTCA 


14100 


TTCTTGCTAA 


ACTGCAACTC 


ATCCGTCCCA 


TTTTTATAAC 


AAAATTCTCT 


CAAACTTGAC 


14160 


TCC7CAAATT 


GAGTCATCAA 


GTCTTTAATA 


TCGTTTAAAT 


TCATACTTAT 


CTATTCTCCC 


L4220 


AACGTTTGAA 


AGCAAGAACT 


GCATTGTGGC 


C7CCAAAACC 


AAAAGTATTT 


GAAATAGCOT 


14230 


ATGGAATTTC 


TTTCTCCAAG 


CCTTGTCCAT 


AAACGACATT 


AGCTT CG ATA 


TAATCTGATA 


14340 


CTTCACTTGT 


CCCAGCTGTC 


ATTGGTACAA 


AGTTATCACC 


CATAGCTTCG 


ATGGTGACGA 


14400 


TAGCTTCTAC 


TGCACCCGCA 


GCCCCCAGCA 


AATGTCCTGT 


AAAAGACTTG 


GTTGATGATA 


1446C 


CAGGTACTTC 


CTTACCAAGA 


ACAGCTACGA 


TAGCACCACT 


TTCTCC7TTT 


TCATTGGCAG 


14520 


GAGTTGACGT 


TCCGTGAGCA 


TTGACATAGG 


CTACTTGCTC 


TGGAGAAATC 


TCAGCTTCTT 


14580 


CCAAGGCTAC 


TTTGATGGCC 


TTGATAGCTC 


CCTGACCTTC 


TGGATGTGGA 


GAAGTCATGT 


14640 


GGTAGGCATC 


ACAAGTATTT 


CCGTAACCAA 


CCACTTCAGC 


CAGGATAGTA 


. GCTCCACGTT 


14700 
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TTTCAGCGTG TTCAAGACTT TCTAGAACCA ACATCCCTGA ACCT7CACCC ATAACAAACC 14 7 60 

CATTGCGATC CTTATCAAAT GGGATCGAAG CACGAGTTGG ATCCTCTG7A GT AG AG AG AG 14820 

C7CTTAAGGC TTGGAAACCA GCGATGGCAA AAGGTGTGAT AGAAGCTTCT GTTCCTCCCA 14 8 80 

CCAACATCAC ATCTTGGAAA CCAAACTTAA TGGAGCGGAA GGCATCCCCA ATCGCATCAT 14940 

77GATGAACA GCAGGCAGTA TTGATAGATT TACAAACACC GTTTGCACCA AAACCCATGG 15 000 

CTACATTCCC AGAAGCCATA TT7GGTAAAG CTTTTCGAAG AGTCATTGGT TTGACACGTT 15060 

TGCGTCCTTT TTCATGAAGG CGAAGTACCT GATC77CAAT TTCCTTGATT CCACCAATAC 1512 0 

CAGATGCAAC GATAACACCA AAACGATCCC TATTAAGAGC CTCTACATCA AGATTGGCAT 15130 

CATTTACACC CTCTTGGGCT CCATACAAGG CATATAAAGA ATAGT7ATCA AAACGGTTGG 15240 

TATCTTTTTT TACAAAGTAT TTATCGAACG GAAAATCTTG GATT7CTGCC GCATTATGCA 15 30C 

CATCAAAGTC ACTATGATCA AATTTTGTAA TGCCACCAAT GCCGATTTTC CCAG7TGCTA 15 360 

AACTATTCCA AAATTCTTCT GGTGTATTTC CGATTGGAGA TGTTACTCCA 7AACCTG7TA 154 20 

CCACTACTCG ATTTACTTTC ATTCTTTTCA CCTCTAGCTT TCGCTACATA CTTAACCCAC 15430 

CATCAATGCC AACCACTTGT CCAGTTAGAT AATCTTGGCC TGCTAAAAAT ACTGTCAAAT 15 540 

CTGCAACCTG CTCTGCCTGC CCAAATTCTT TCATCGGAAT CTGAGCTAGT GTAGCTTCCT 15600 

TAATCTTATC TGACAGGATA GCCCTCA7AT CAGACTCAAT CATTCCTGGA GCAATCACAT 15660 

TCACTCC7AT ATTCCCACTA GCGACCTCGC GTGCCACAGA CTTCGTAAAG CCAATCAAGC 15*720 

CAGCCTTAGA AGCAGCATAA TTAGCTTGAC CAATATTCCC CATCAAACCA ACAACACTAG 157 90 

ACATATTAAT GA7AGCACCT TCTCTGGCTT TCATCA7CCC 7T7CAACAC7 GATTGTGTCA 15840 

TATTAAAGGC ACCAGTCAGA TTGACCTTGA GCACTTTTTC AAAATCTCCT TCTGTCA7CT 15900 

TGAGCATAAG AGTATCT7CC GTAATCCCTG CATTGTTCAC CAAAACATCT ACTGAACCCA 15960 

GTTCTGCAAT AGCTTGATCA A7CATACCC7 7ACCCTCTGC AAAATCTG A T ACATCTCCTG 16020 

AAATGGGAAC CACCTTGATA CCATAGTTTG AAAACTCAGC GAGCAATTCT TCTGAGATTC 16080 

CCCCACGACT CTTTAAGACA ATGTTGGCTC CTGCTTCAGC AAACTTGTGG GCGATGGCAA 16140 

GACCAAT7CC ACGACTCGAA CCTGTAATAA AGATA7TTTT ATGTTCTAGT TTCATTTTTT 162 00 

TCC7TTCAAA ACTTCTACTT AT77TAC7CT ATT7TTCTAA AAGTGCTACT AAACTCGCT7 162 60 

GATCTTCCAC ATGAGCTAAG TGAGCAGTTT GA7CAA7T7T TT7AACAAAA CCTGACAAGA 163 20 

CT77CCCCGG TCCAATC7CG ATAAAG7TCC T7ATCCCTGC TTCTTGCATG ACCCCAATAC 16 3 80 

TT7CATAGAA ACGAACGGGT 7CCTTGACCT GACGCGTCAA GAGCTGAGCA ATCTCCTCTT 16440 
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TTTGCATCAC 


AGCAGCTTCT 


CTATTGCCGA 


CTAGGGGACA 


AGTAAAATCT 


GAAAAACTTA 


16500 


CCTGACCTAG 


AGTTTCAGCT 


ACTTTCTGGC 


TAGCAGCTTC 


AAGGAGAGCG 


GTGTGAAAGC 


15560 


GACCTGACAC 


CTTAAGAGGA 


ATCAAGCGTT 


TGGCACCTCC 


TTCTTGCAAA 


AGTTCAACCG 


16620 


CTCGATCAAC 


TGCAACCACT 


TCTCCAGCAA 


TGACGATTTG 


TGCAGGTGTG 


TTATAGTTGC 


16680 


CTGGAGTAAC 


CACTCCAAGT 


TCAGAAGCTT 


TTTGACAGGC 


TTCTTCAATG 


ACCTCTACTG 


16740 


GCGTATTGAG 


AACTGCTACC 


ATCTTGCCAG 


AGTCAGCAGG 


AGCCGCTTCT 


7CCATATAGG 


16300 


CTCCACCCTT 


AGCTACCAAG 


GCAACCGCAT 


CTTCAAAATC 


CAAGGCGCCA 


CTTGCCACCA 


16860 


AGGCAGAGTA 


TTCTCCAAGA 


GACAAACCAG 


CAACCATATC 


AGGCTGATAG 


CCCTTTTCTT 


16920 


GCAATAAACG 


GTAGATACCA 


ACCGAAGTCG 


CTAGAATGGC 


TGCTTGCGTA 


TACCGGGTCT 


16980 


GATTGAGTTT 


GTCTTCTTCC 


GTATCGATGA 


GATAACGCAA 


ATCATAACCG 


AGCACCTGGC 


17040 


TCGCTCGATC 


AATCGTTTCT 


TTAACAATCG 


GA7ACTGATC 


ATAGAAATCC 


CGTCCCATCC 


1710C 


CTAGATACTG 


GGCACCTTGA 


CCAGCAAATA 


AAAAGGCTGT 


TTTAGTCATT 


TCTTACAACT 


17160 


CCTGTCCAGC 


GAGAGGCTTC 


TTCTTGAATT 


TTCTTAGCGG 


CTCCGTAATA 


CAAATCTTTT 


17220 


AGGATTTCTT 


CAGCTGTTTC 


TTCTTTAGAA 


ACAAGCCCTG 


CGATTTGACC 


TGCCATAACA 


17280 


GAGCCACCAT 


CCACATCACC 


GTGAACAACT 


GCTTTGGCTA 


GAGCACCTGC 


TCCCATTTGT 


17340 


TCAAAGATTT 


CTAAATCAGG 


ATCTTCTTGC 


TTAAAGGCAT 


CTTTTTCAGC 


CAGTTCAAAA 


17400 


TCTCTAGTCA 


ACTGATTTTT 


AATACCACCA 


ACAGCATGAC 


CAAAGTGCTG 


AGCTCAAATC 


17460 


GTAGTATCAA 


TATCCCTTGC 


TTTTAAAATT 


TTCTCCT7CT 


AGTTTGGATG 


GGCATTCGAC 


17520 


TCTTTTGCAA 


CTACAAACCG 


TGTCCCCACC 


TGTACAGCCT 


CTGCACCTAG 


CATAAAGCCA 


17530 


CCCCCAGCAC 


CTTCACCATC 


CGCAATTCCT 


CCTGCAGCAA 


TAACAGGAAT 


AG AT AT AG CT 


17640 


GTGGCTACCT 


GTCGCACCAA 


GGTCATCGTT 


GTTAATTTAC 


CGATATGCCC 


CCCAGCTTCC 


17700 


ATTCCTTCTG 


CAATAACAGC 


GTCTGCACCG 


ATTTTTTCCA 


TGCGTTTAGC 


TAAAGCGACA 


17760 


CTAGGAACAA 


CACGAATAAC 


GATTATCCCA 


GCTTCATGGA 


AACGTTCCAT 


ATACTTGCTT 


17820 


GGATTTCCTG 


CTCCTGTTGT 


GACAACTTTA 


ACACCTTCTT 


CAATAACCAG 


ATCCACGATG 


17880 


TCTTCCACAA 


AGGGAGATAA 


GAGCATCATG 


TTGACCCCAA 


ACCGTTTATC 


AGTCAATGAT 


1794C 


TTGATTTTAT 


CAATATTGGC 


CTTGACAACT 


TCTTTCGGGG 


CATTTCCCCC 


ACCGATAATT 


1800C 


CCTAATCCTC 


CAGCCTTGGA 


AACAGCCCCT 


GCCAAATCAC 


CATCAGCAAC 


CCAGGCGATC 


18060 


CCTCCTTGGA 


AAATAGGATA 


ATCAATCTTC 


AATAATTCTG 


TAATACCCCT 


TTTCATAGTG 


18120 


CCTCCAACCT 


TCCTTGC7TA 


CGTAATAGTT 


CCATTTCACC 


ATAATTTGAC 


AGTCAAACTA 


18180 


TTACCTAAAC 


AAGAGGGAGT 


GGCTTTCTCC 


CTAC7CCT7C 


TACTAATATT 


'"*^GG W *T ATTT 


1824C 
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TGCTTCCTCT 


TCAACGTAAG 


CAACCAAGTC 


ACCAACTCT7 


TTCAAGTCAT 


TTTCTGCTTC 


13300 


GATTTGGATA 


TCAAAAGCAT 


CTTCCATTTC 


TG AG ATT ACT 


TGGAACAAGT 


CCAATGAATC 


13360 


^GCGTCCAAA 


TCATCAAAAG 


TTGATTCAAG 


TGTTACTTCT 


GATGCGTCTT 


TTCCAAGTTC 


13420 




ATTTPTTG T A 
A i 1 1^4 I vj . /\ 


PTT^TTP A AA 


TACTGCCATG 


ATAGGACTCC 


TTTAAAATAA 


18480 


r\ 1 rt.L» 1 . 11 i t 


"•ATAAPAATC 


TGTTCAP.CAC 


ATGATTACCT 


AAATTGTAAG 


AATGAGCGTG 


18540 


p p p p a ggt p a 

L,L.L,L.AL)Ll 1 L.A 


ALiL.L> I 


P, AAGC CT G A* 11 


AGAAGAACAG 


TCTGGCTACC 


ATCTAAAGGG 


1860C 




TTTfTArAfA 


PTp*pp, AAAGT 


AAAATCGGGA 


TACTGGCTGC 


ACTGGTATTG 


13660 


L.L.A1A1 » L 


TPATATTGGP 


TGCAAGTTTC 


GCTCGGTCAA 


CACCAATTTT 


TCTAGCCATC 


18720 




TaPGGTPAT*" 


GGCTTGATGA 


AGTAGCAGAT 


AATCCAAGTC 


TGTCACCTCT 


18780 


A 1 AOLrALt A i 1 


TATfAATACT 
\_/\ 1 L./vrt t rtw i 


CTCCTTGATA 


GACTTGGCTA 


CATCTCGAAT 


GGCAAAATCA 


18840 


AAwAL- . La 1 LiL. 


PTPPA.TPPAT 


PTTPAAAAAC 


GAATCTGCAC 


TTTCTTGATC 


TGAAAATGGA 


13900 


vjfV/\ I Li I AAAL. 


PTGAATP.PPP 
L. 1 LiAA A Li^-i— *w 


ATAAGTTAAA 

n 4 AAVJ 1 4 rw^ 


CACTCCCTGC 


GACTTCCATC 


GCTATTGAGA 


13960 


L, 1 L. I L- ALiL, 1 A 


aP A & ATP.PTP 


TTGPTPGCTA 


GCTTCTAACA 


AGACACCACC 


AGCACCATCT 


19020 


f*r- h fi. liP a IPS 
L, L, AAAL. AAL. A 


/■^ aPPTPTfP & 
L- AL>L. 1 Lj 1 I LiA 




CAATCGACTG 


CCTTAGAGAG 


GGTTTCACTA 


19080 


L.L.AA i L. AL.L.A 


ALiL.^ . 1 . ilvj 


AAAGPGACCA 


GAAGCGATAA 


ACTTTTCAGC 


AGTTGAAAGA 


19140 


LiCAAATAl,AA 


a i"p/T" ft r'^p a 

A 1 LLAl . Li*-A 


APPPGPGP.TT 
AL?L ^-VjL.L*w * * 


AAGT CAAAAG 


CAAAGGCTTT 


ATTAGCACCA 


19200 


n. Al 1 rtuU 1 - 


GAAPAPGACC 


AGCTGTAGAG 


GGCATCATCG 


AATCTGGACT 


AATGGTAGCT 


19260 


AGGATCATAA 


AATCCAGTTC 


TTCTCCTGTT 


ATTCCAGCTT 


TTGCCATCAG 


TTTCTTAGCA 


19320 


ACCTCTGTAC 


CCAAATCACT 


CGTAGATTCT 


GTTCTTGAAA 


TATGCCTTTG 


TCGTATTCCC 


193 80 


GTTCCACTTG 


AAA7CCACTC 


ATCATTGGTA 


TCCATAATCT 


GAGCCAAGTC 


GTGATTTGTA 


19440 


ACCACTTGCT 


CTGGCACA7A 


ATGAGCAACC 


TGACTTATTT 


TTGCAAAAGC 


CATTATTTCA 


19500 


AATCCTCCAA 


AAATTGGTAA 


AGATTAGTCA 


AACCTTTACC 


CATGACAGCA 


ATTTCTTCCT 


19560 


CGCTCATGCC 


ATCAATAATT 


TTTTCTACCA 


TGGCCTTGTG 


GAAGCGTTTA 


TGCAGTCTAT 


19620 


GAATCAAGCG 


ACCCTTCTTT 


GTCAAATGCA 


GATGCACCAC 


ACGACCATCC 


TGTTCTGACC 


19680 


GAACTCGCTC 


AATGTAGCCC 


GG 








19702 


{ 2 } INFORMATION FOR SEQ ID NO: 8 











<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 
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<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GAAAATTTCC TCTCTTCTCT TGAAAAATTT TGAAAAAATG GTATGATAGT AACAACTTAT 60 

TTTTAAGACG AAAGAAAGGG GAATAATGGA GAAAATCAGT TTAGAATCTC CTAAGACGGG 120 

GTCGGACCTA GTTTTGGAAA CACTTCGTGA TTTAGCAGTT GATACCATCT TTGGTTATCC 180 

TGGTGGTGCG GTTTTCCCTT TTTATGATGC CATATATAAT TTTAAAGGCA TTCGCCACAT 24C 

TCTAGGGCGC CATGAGCAAG GTTGTTTGCA TGAAGCTGAA GGTTATGCCA AATCAACTGG 300 

AAACTTGGCT GTTCCCCTCG TCACTAGTGG ACCAGGAGCA ACAAATGCCA TTACAGGGAT 360 

TGCGGATCCC ATGAGCGATA GCGTTCCCCT TTTGCTCTTT ACAGGTCAGG TGGCGCGAGC 420 

AGGGATTCCG AAGGATGCCT TTCAGGAGGC AGACATCGTG GGAATTACCA TGCCAATCAC 4 80 

TAAGTACAAT TACCAAG7TC GTGAGACAGC TGATATTCCC CGTATCATTA CGGAAGCTGT 540 

CCATATCGCA ACTACAGGCC GTCCAGGGCC AGTTGTAATT GACCTACCAA AACACATATC 600 

TGCTTTAGAA ACAGACTTCA 7TTATTCACC AGAAGTGAAT TTACCAAGTT ATCAGCCGAC 660 

TCTTGAGCCG AATGATATGC AAATCAAGAA AATCTTGAAG CAATTGTCCA AGCCTAAAAA 720 

GCCAG7CTTG TTAGCTGCTG GTGGAATTAG TTATGC7GAC GCTGCTACGG AACTAAATGA 7 90 

ATTTGCAGAA CGCTATCAAA TTCCAGTGGT AACCACTCTT TTGGG AC AAG GAACGATTGC 840 

AACGAGTCAC CCACTCTTTC TTGGAATGGG AGGCATGCAC GGGTCATTCG CACCAAATAT 900 

TGCTATGACG GAAGCGGAC7 T7ATCATTAG TATTGGTTCT CGTTTCGATG ACCGTTTGAC 960 

CCGGAATCCT AAGACTTTCG CTAACAATGC TAAGG7TCCC C AC ATT GAT A TTGACCCAGC 1320 

TGAGATTGCC AAGATTATCA GTGCAGACAT TCCTGTAGTT GGACATCCTA AGAAGGCCTT 1080 

GCAAATGTTG CTAGCAGAAC CAACAGTTCA CAACAACACT GAAAAGTGGA TTGAGAAAGT 1140 

CACTAAAGAC AAGAATCGTG TTCGTTCTTA TGATAAGAAA GAGCGTGTGG TTCAACCGCA 1200 

AGCAGTTATT GAACGAATTG GTGAATTGAC GAATGGAGAT GCCATTGTGG TAACAGACCT 12 60 

TGGTCAACAC CAAATGTGGA CAGCTCAGTA TTATCCCTAC CAAAATGAAC GTCAGTTAGT 1320 

CACTTCAGGT GGTTTGGGAA CAATGGGCTT TGGAATTCCA GCAGCAATCG GTGCTAAAAT 138C 

TGCTAACCCA GATAACGAAG TAGTCTTCTT TGTTGGGGAT GGTGGTTTCC AAATGACCAA 1440 

CCAGGAGTTG CCTATTTTGA ATATTTACAA GGTGCCAATC AAGGTGGTTA TGCTGAACAA 1500 

TCATTCACTT GGAATGGTTC GCCAGTGGCA GGAATCCTTC TATGAAGGCA GAACATCAGA 1560 

CTCGGTCTTT GATACCCTTC CTGATTTCCA ATTGATGGCG CAGGCTTATG GTATTAAAAA 1620 

CTATAACTTT GACAATCCTG AGACCTTGGC TCAAGACCTT GAAGTCATCA CTGAGGATGT 1680 
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TCCTATGCTA ATTGAGGTAG ATATTTCTCG TAAGGAACAG GTGTTACCAA TGGTACCGGC 17 40 

TGGTAAGAGT AATCATGACA TCTTCCGGCT GCAGTTCCAT GCGTAGAATG TTAACAGCAA 1800 

AACTACAAAA TCGTTCAGGA GTCCTCAATC GCTTTACAGG TCTCCTATCT CGTCGTCACG i960 

TTAATATTGA AAGCATCTCT CTTGGAGCAA CAGAAGATCC GAATGTATCG CGTATCACTA 1920 

TTATTATTGA TGTTGCTTCT CATGATGAAG TGGAGCAAAT CATCAAACAG CTCAATCCTC 1930 

AGATTGATGT CATTCGCATT CG AG AT ATT A CAGACAAGCC TCATTTGGAG CGCGAGGTGA 2040 

TTTTGGTTAA GATGTCAGCG CCAGCTGAGA AGAGAGCTGA GA7TTTACCC ATTATTCAAC 2100 

CTTTCCGTGC AACAGTAGTA GACGTAGCGC CAACCTCGAT TACCATTCAG ATGACGGGAA 216C 

ATGCAGAAAA GAGCGAAGCC CTATTGCGAG TCATTCGCCC ATACGG7ATT CGCAATATTG 2 220 

CTCGAACGGG TGCAACTCGA TTTACCCGCG ATTAAAAATC CAACTTAAAT TTATTAAACC 2 280 

AGCCTAAAAG GCAATAAATA ATAGAAAAGA GAGAAAAGCT ATGACAGTTC AAATGGAATA 2 340 

TGAAAAAGAT GTTAAAGTAG CAGCACTTGA CGGTAAAAAA ATCGCCGTTA TCGGTTATGG 2 400 

TTCACAAGGG CATGCGCATG CTCAAAACTT GCGTGATTCA GGTCCTGACG TTATTATCCG 24 60 

TC7ACGTCCA GGTAAATCTT TTGATAAAGC AAAAGAAGAT GGATTTGATA CTTACACAGT 2520 

AGCAGAAGCT ACTAAGTTGG CTGATGTTAT CATGATCTTC GCCCCAGACG AAATTCAACA 2580 

AGAATTGTAC GAAGCAGAAA TCGCTCCAAA CTTGGAAGCT GGAAACGCAG TTGGATTTGC 264 0 

CCATCC7TTC AACATCCACT TTGAATTTAT CAAAGTTCCT CCCCATCTAC ATGTCTTCAT 2700 

G7CTCCTCCT AAAGGACCAG GACAC7TCG7 ACGTCGTACT 7ACGAAGAAG GATTTGGTGT 2 7 6C 

TCCAGCTCTT TATGCAGTAT ACCAAGATGC AACAGGAAAT GCTAAAAACA TTGCTATGCA 2 820 

CTGGTGTAAA GGTGTTGGAG CGGCTCGTGT AGGTCTTCTT GAAACAAC7T ACAAAGAAGA 2 880 

AACTGAAGAA GATTTGTTTG GTGAACAAGC TGTACTTTGT GGTGGTTTGA C7CCCCTTAT 29 40 

CGAAGCAGGT TTCGAAGTCT TGACAGAAGC AGGT7ACGCT CCAGAATTGG C7TACTTTCA 3 000 

AGTTCTTCAC GAAATGAAAT TGATCGTTGA CTTGATCTAC GAAGCTGGAT TCAAGAAAAT 3060 

GCCTCAATCT ATTTCAAACA CTGCTGAATA CGCTGACTAT GTATCAGGTC CACCTCTAAT 3120 

CACTGAACAA GTTAAAGAAA ATATGAAGGC TGTCTTGGCA GACATCCAAA ATGGTAAATT 3180 

TGCAAATGAC TTTGTAAATG ACTATAAAGC TCGACCTCCA AAATTGACTG CTTACCGTGA 3 2 40 

ACAAGCAGCT AACCTTGAAA TTGAAAAAGT TGCTGCAGAA TTGCGTAAAG CAATGCCATT 3 3 00 

CGTTCGTAAA AACGACGATG ATGCATTCAA AATCTATAAC TAATTAGAAA TATATAGCGC 3 3 60 

TGGAGATGAT TTTATGAAAA AGATTATGAG AAAAATTGCA TCCT7ATTAT TGGTTCTAGT 3 4 20 
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TGTATAATGT AATTACACCG TCGCTAATAG TGCTAGCAGA CCAAAATAAA GCAGATTGGT 3 4 90 

CGTATCATGA AAATGCTGTA ATTAACATTT ATGATGATGC TAA7TTTGAA GATGGTAGGT 3 54 0 

TGCATATGAA CTTTGAACAA TTCTTCAAAT TGGCACAAAT AGCTAGAGAA GAAGGTCTTG 3 600 

AAATTCATTC TCCGTTTGAG AGAGCTGGTG CGACTAAATC TGCTCGTTAT ATAGCGAAAT 3660 

GGATTT7GAG AAATAAAAAA CATTAACAAA TATAGTTGGT AAATCATTAG GACCTAAATC 3 7 20 

AGCTGTTAGA TTCGGAGAAG CTTTATCCTA TATTGAAGGT CCTCTTCGCA GAATAAATGA 3780 

GACGATAGAT GGCGG7TTAT ATCAAATAGA GCAAATTATT GCATCTGGAT TGAAAGAATC 3 840 

CGGTTTAAAT GACTGGACTG CGAAAACTTT AGCTTCAGCT ATTCCTGGGA TATTAGATGT 3 900 

ACTTATTTAG CGCTTGAAA7 CATATGAATA TTACCAATTT GTTTTCTATC AAGACAGGAT 3 9 60 

GTGATGAAAC TGATAGGCAA CTGCAAAAAC TATTTTTTCA GTTGGATTTA CAATTGGGAG 4C20 

AATTGACAGA TCAACTAAGA AAATTAGATT CTAATTTTGT TCCTCGTAGT CAATTTGTAG 4 080 

ACACGTTGCA TTTGAATGAT GTAGAATATA AAGAAATTTT AAACTATTTT ATCTTCCATC 4140 

GTAATGATAG TCAACAAACT TTGGTAGAAT GGTTATATGA TTGGATTTCC ACAAATCGTT 4 200 

ATCAACTTCC TAAAGAGTTT TCGATTCGTA TGGCTCATAA ATACCATGAA AGTGTTACTG 4 260 

AAGTTTTCGG AGATGAATAA CTAAAAAACA GTCATTAGTG ACTGTTTTTT ATAGAAAAAG 43 20 

AGGTTTTATA TGTTAAGTTC AAAAGATATA ATCAAGGCTC ACAAGGTCTT GAACGGTGTG 4 380 

GTTGTGAATA CTCCACTGGA TTACGATCAT TATTTATCGG AGAAGTATGG TGCTAAGATT 4440 

TATTTGAAAA AAGAAAATCC CCACCGTGTT CGCTCCTTTA AAATTCGTGG TGCCTATTAT 4 500 

GCCATTTCCC AGCTCAGCAA GGAACAACG? GAACGTGGGG TAGTCTGCGC TTCTGCGGGA 4 5 60 

AATCATGCGC AGGGAGTACC CTA7ACTTGT AATGAAATGA AAATTCCTGC TACTATCTTT 4 620 

ATGCCCATTA CTACGCCACA ACAAAAGATT GGTCAGGTTC GCTTTTTTGG TGGGGATTTT 4 680 

GTAACTATTA AACTAGTTGG AGATACCTTT GATCCCTCAG CCAAAGCAGC TCAAGAATTT 47 40 

ACAGTCTCTG AAAATCGTAC CTTTATTGAT CCTTTTGATG ATCCTCATGT TCAAGCAGGT 4800 

CAAGGAACAG TTCCTTATCA GATTTTAGAA GAAGCTCGAA AAGAATCGAT TGATTTTGAT 4860 

GCTGTCTTGG TTCCTG7TGG TGGTGGCGGT CTCATTGCCG GGGTTTCTAC CTATATCAAG 4 92 0 

GAAACAAGTC CAGACATTGA GGTTATCGGA GTAGAGGCGA ATGGAGCGCG TTCCATGAAA 49 80 

GCTGCCTTTG AGGCTGGAGG TCCAGTAAAA CTCAAGGAAA TTGATAAATT TGCTGATGGG 5040 

ATTGCTGTGC AAAAGGTAGG TCACTTGACC TATGAAGCAA CTCGTCAACA TATTAAAACT 5100 

TTGGTAGGTG TCGATGAGGG ATTGATTTCT GAAACCTTGA TTGACCTTTA CTCTAAGCAA 5160 

GGGATAGTCG CAGAACCTGC TGGAGCGGCT AGTATCGCCT CTTTAGAGGT TTTAGCTGAA 52 2 G 
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TATATTAAGG GGAAAACCAT TTGTTGTATC ATTTCTGCAC CAAATAATCA TATCAACCGT 5 2 80 

ATGCCAGAAA TGGAAGAGCG TGCCTTCATT TATGATCGTA TCAAACATTA CTTTGTGGTC 5340 

AATTTCCCAC AACGTCCAGG AGCTTTGCCT CACTTTGTAA ATGATATCCT CGCCCCAAAT 5400 

GATGATATCA CACGTTTTGA GTATATCAAA CGACCTAGCA AGGGAACAGG CCCAGTATTA 54 60 

ATTGGGATCG CTTTAGCAGA TAAGCATGAT TATGCAGGTT TGATTCGTAG AATCCAAGCT 5 52 0 

TTTGATCCAC CTTATATTAA" CTTAAATGCT AATGAAXCGC~TTTATAATAT GCTTCTCTGA ~ 5580" 

GGACTAATAA AAAAATATCA TACCTTCATT TTGATTTCCT ATCTATTCAC AAGCATAGTC 5640 

ACACTGTCTT TAATACTCTT CGAAAATCTC TTCAAACCAC GTTAGCTCTA TCTCCAACCT 5700 

CAAAACAGTG TTTTGAGCAA CTTGCCCCTA GCTTCCTAGT TTGCTCTTTG ATT7TCATTG 5760 

AGTA7AAGGT ATGATTTGAT T7CTTTTTCT TGACAAATAT ACT AT ATT AA AAAGATATAT 5 920 

AAGTAATTAA CTGAGCTTAT CTGTCTTGTC ATCTCTATTA AGGATGGTTT AGATAATCGG 5380 

GTCTCTCCTT CTAGGCTAGC ACCTCAATAT CCAAAGGACT GATGAATTTG AAGGACATAA 5940 

GGAATACCT A TCTCTCAGAT GATTTATTGA GGAAGAAAGA TAGGAGTTTT TGAGCTAGTG 6000 

AAGGCTTGGA TTTCTAAAGG TTAGAACTAT CATCTTCAGT TCTTAAATCG AAGAAATAAG 6060 

CTATCTTACG GAAATAGAGA AGCATTTTTT AAGAACTTGA ATAATTTCGC ACCTTAAGAG 6120 

GGTAATAATA CAGTATTTTT ATTAGCAAAT ATTTATGGTG TAGAGGCTAG CAAAACCTAT 6130 

ATATTATCGG ATTTAAAAAG GAAGTAAGAA A 6211 



(2) INFORMATION FOR 3EQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

t A) LENGTH: 79 3 9 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNES3 : double 

(D) TOPOLOGY: linear 



lxi> SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CCCGACTCCC CACGATTCTT CAAAATAACT GAGTATATTT CTATC7TCAT TTTCAGATAT 60 

AAATTCTTCC TTCTGTCCCC TCTTCTTACG CTTGAGAAGA GCTTCTCCGA CATGGCTTCT 12 0 

TCCTTACTGA GCAAAACCTT GAGCATAGAT AAGTTTGACT GGCAAGCGTG CTCTTCTATA 130 

TTTGGCTCCC TTCCCACTAT TCTGGATACC GAGGCGTCTT CTCATATCAG TCGTATAGCC 2 40 

TATATAGTAG GATCCATCAC GACACTCCAG AACGTACATA TAAGCCTTAT GATCCATAAT 3C0 

AAATCTCTTC GATTTCGGGC GTATAAGAGC CAT CATC ATT GTGGACAATC AAAGGAGGTA 3 60 
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AGACC7TAAA GCCACTTGTT GAGCCATCCT TGATCGCCTC 
CCTTTTCTCT TTTTGGATAA ACAAACTGCA GCCCCTTAGC 
ACGTATCCAA AATATCCAGA AGTCGATCAG GACGATGAAC 
ACTTGAGAAT ACTCTGGGCA CTACGACAGA TTTCTTCCAA 
GAGCCAAGAG ATAATG77CA CTC7CGTTCA GATTAGAATA 
G7GCA77ACA CAAAATCATA TCCACCTTAC TCCCCTGAAT 
AATCATCGCA GATGACCTGC ATT7GC7CCT CTAATCCATT 
CCATATCCGC CAAACGCTCC TGAATCTCAA CAGACAATAT 
TAGCAAAAAG CCCCACTCCT CCATTCCCAG CACAGAAATC 
GAAAACGTGG AAATCGTGAT AAGAGAACAC TATCCACCGA 
TTTGAATGAT T7TGATATC7 GTCGAAAAGA GCTGGTTAAT 
ATTCTTC7TC TTCCATGGTC CTATTATAGC AAATTCATAT 
AACTCTAAAC TACTTCTTCT TTTTTAAATG GTGCAGGGCT 
CATTCGTCGA AAGGGAGCAA AGCCGTAGTT AAAGCCGTCG 
AGGAAACTGG TACTTTTCTT CCTCCAAAGT GCGGATAGAA 
TTCATC7AAA TCCACTACCT GAACTTGAAC CTCTTCATCG 
ATTTTCAATA AATCCTGTCC GAATCTCTGA AATGTGAATC 
TAACTCAACA AACGCACCC7 AGGGCTGAAT CCCTGTAATA 
GAT7TTCATC TTAGTCCTCG ATTTCAATAG TT7CAA77AC 
CCATAGC7CC TG7CTCAACA GCACCAA7GG CATCCAAGAC 
CTAACTGACC AAAAACCGTG TGACGGCGGT CTAGGTGAGG 
AGATTTCTGC AATCGGTTCT GGCCAACCAC CACGAG7AAT 
GG7GT7GGTT TTGCACGA7A AAGAACTGGC TGCCGTTGGT 
7GGAAAGAGC ACCACGGA7A T7GTAAACCT CTTCTGAGAA 
CGTAGATTGA CTCGCCACCC ATACCAGTTC CAGT7GGG7C 
CC7TGATAAT ACGG7GGAAA ATGACACCAT CA7AGTAGCC 
AGTTAGCCAC TGTT77AGGA CCATGTTCAG GGAAAAGCTT 
TGG7C77AA7 AGTCGCAAGA GGACCT7C7A CTGT7TCAA7 
A7TCT7T7TC TACCATACCA AATACTTCTA AGGCAGCAAA 
T77TTGTAAT ATAA7CTGCT 7T77CTTTGA T777ATCA7G 



AA7CAAAAGC ATAT7GGC7T 420 

GGCTAGAT7A 7GTCGT7TTA 4 80 

CATGGCCAAA CGCCCATTAG 540 

A7TAGTCGTG ATTTCGTG7C 600 

AGGATTCACC TTGAAATAGG 660 

GTGAGCAGGC ATATTTTTCA 720 

CAAACGGACA GAGCGTTCAG 780 

CTGTGC7TGA G7ACGAGTGC 340 

CACAATCAAC CCCTTCTTAG 900 

ATAGCTAAAA ACCTCTCTAT 960 

GCGCTCTCCT GATTTTAATA L020 

TAACATTACA AAAAA7ATAA 1080 

TC7CCAGTCC AGATTGGTAG 1140 

C7TGAAAAGC GTCTCCCTC7 1200 

AGACTGCC7T 7CCC7GTAAA 12 60 

AC7TTCAAGG TTTCATGAAT 1320 

AGCCCCGTAT CACCCCTCTC 1380 

CGCCCCTTTA GCT7ATCACC 1440 

AACA7CT7CA ACTGGCTTGT 15C0 

AGCGTAAGAT GCTTCATCAG 1560 

TGTCCCACCT TCATTGGCAT 1620 

T7CT7TCTTA GAATAAGGTA 1680 

ATTTGCACCA GCAT7TCCCA 1740 

TTCATCC7CA AAAGATTCGC 1800 

TCCACCTTCG ATCATAAAGT 1860 

A7CT7TTGAA AGAGATACAA 1920 

GATACCTAAG TCTCCGTGAT 1980 

GTC7ACTTGT GGAAAATGCA 2 040 

AATGCCATC7 TCT7CTAATG 2100 

AGAAATTCCC ATGGCAACGC 2160 
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TCATTCCAGC 


ATAATCAAAG 


AGTTCCAAGT 


CGTTGAGACC 


ATCTCCAAAA 


ACCATGACCT 


2220 


TCTCTCGTTT 


CAAGCCAAGG 


TGTTCCACAA CCTTTTCCAC CCCCGTCGCT TTGCAGCCTG 


■ 2280 


AAATCGGCAC 


AATATCAGAC 


GAATGTTGAT 


GCCAACGAAC 


CATGCGAAGT 


TTGTCTGAGA 


2340 


GACTGTCAGG 


CAAGTGCAAG 


TCATCTCCCT 


TATCTTCAAA 


AGTCCACATC 


TG AT AG AT AT 


2400 


CTTCTTTTTC 


ATGGAAATCG 


GGATCTACAT 


CTAAGTCGGG 


ATAAATTGGA 


TTGATAGCTT 


2460 


CACTCATCAT 


ATCGGTGCGA 


GTCGACAACT 


TGGCATCATG 


ACTCCCAACC 


AAGCCATACT 


2520 


CAATTCCTTC 


TTGCTT AG CC 


CAAGAGATAT 


ACTCCTCAAC 


ATCTGACTTT 


TCAATCTGAT 


2580 


GCTGATAAAT 


GACCTGACCT 


TTTTTATCTT 


CGATATAAGC 


CCCATTCAAA 


GTTACAAAAA 


2640 


AGTCAGGCTT 


GAGATCACGA 


ATCTCTGGAA 


CAACACCAAA 


AATGCCACGT 


CCAGAGGCGA 


2700 


TTCCTGTTAA 


AATTCCTTTT 


TCACGCAACT 


GTTTAAAAAC 


AGTGGGAATT 


GTAGTTGGAA 


2760 


TAAACCCTGT 


CTTTGAATTC 


CGCAATGTAT 


CATCAATATC 


AAAAAACACA 


ATCTTCATCT 


2320 


TCTTTGCCTT 


GTATCTTAAT 


TTCGCGTCCA 


TCTCACTACC 


TCTTTCAATC 


TAACTCTTTC 


2880 


C ATT AT AT C A 


TAAAGTAGGC 


AAATCCCCTA 


TTTTCAAAAA 


GTTTATCATT 


TTTATTTTAA 


294C 


TTTCTTGGAT 


GAGAAAAGAG 


ACATATTTAT 


GAAAAAGCTC 


CATCGTGCTT 


TTAATGTGTT 


3000 


CTCTTGTTTT 


CAAACTCGTA 


AAAAGGGAGC 


CACTGATCCT 


AACTCGCTCT 


CTCATTTCAA 


3060 


AGCTTGTGAA 


AAAAGACCCG 


TTGGGGTCTT 


AATTCGCTTT 


CTTGTTTTCA 


AGCTCATGAA 


3120 


AAAGAGACCC 


AACTGGGTC7 


TTTCTTTAAT 


CTTCGTTTAC 


GAAAGGCATC 


AAAGCCATTA 


3180 


CGCGAGCGCG 


TTTGATAGCT 


GTTGTTACTT 


TACGTTGGTT 


TTTAGCTGAA 


CTTCCTGTTA 


3240 


CACGACGAGG 


AAGGATTTTC 


CCACGTTCTG 


AAACGAAACG 


GCTAAGAAGC 


TCAGTATCTT 


3300 


TGTAATCAAC 


ATATTCAATT 


TTGTTTGCTG 


CGATGTAATC 


AACTTTTTTA 


CCGCGTTTGA 


3360 


ATCCGCCACG 


ACGTTGTTGA 


GCCATGTTTT 


TTCTCCTTTA 


TAAGTTTAGT 


TGTCCATTAG 


3420 


AM 1 Lib 1 AAA - 




AATATCCAAT 


GGGTTTGTTG 


CTCCAAATGG 


ATTTTCATTA 


3480 


CGTGAAAACT 


CTGGTAGTGA 


ATTTGTAGGT 


GCTGAATAGT 


TTGCAGTTGG 


TGCAGACTAA 


3540 


GCTCCACCTG 


TGTGACCCTC 


ACGCACACTA 


CGGCTTTCCA 


ACATTTGGAA 


ATTCTCAGCC 


360C 


ACGACCTCTG 


TCACGTAGAC 


ACGTTGTCCT 


TGCTGGTTAT 


CGTAACTACG 


AGTCTGGATA 


3660 


CGACCTGTCA 


CCCCGATAAG 


TCAGCCTTTT 


TTAGCCCAGT 


TAGCAAGATT 


TTCAGCCTGT 


3720 


TGGCGCCACA 


, TAACGACATT 


GATAAAATCA 


GCCTCACGTT 


CACCATTTTG 


ACTCTTAAAT 


3780 


GTACCGTTTA 


, CTGCAAGAGT 


' AAAAGTCGCA 


ACTGCTACAT 


TTCATGGGGT 


ATAACGCAAC 


3840 


TCAGCGTCAC 


GTGTCATACG 


1 CCCTACAAGT 


ACAACATTGT 


1 TAATCATAGT 


TTACCTTCTT 


3900 
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ACGCGTCAAT TTTGACGATC ATGTGACGAA GAATGTCAGC CTTGATTTTT GAAAGACGGT 3 960 

CAAACTC7TT AAGAGCTGCA TCGTCATTTG CTTCAACGTT AACGATGTGG TAAACTCCTT 4 020 

CACGGAAATC TTGGATTTCG TATGCAAGAC GACGTTTTTC CCAAGTTTTT GATTCAACAA 4080 

CAGTTGCACC GTTGTCAGTC AAAATAGAGT CAAAACCTGC TACCAAAGCG TTTTTAGCTT 4140 

CTTCTTCAAT GTTTGGACGA ATGATATAAA GAATTTCCTA TTTAGCCATT GATATGTTCC 4 200 

TCCTTTTGGT CTAATGACCC CAAGACTTTG CAAGGGGTAA GTGAGGTTCG CTCACAATAA 42 60 

ACT ATT AT AC TACAAAAAAT TTTTTTACGC AAGTAAAAAC ACTAGAATTC GAAAAAACGC 4 3 20 

CACATGGGCG TTTTCCTGTT CTTATGGTTT GATACGGTGC AACATACGTG GGAATGGAAT 4 380 

ACCTTCACCC ATATGTTTTG TTCCTGCTGC GAACGTTACC ATACG7TCGA TACCGATACC 4 440 

AAATCCTCCG TGTGGAACTG TACCGTATTT ACGAAGGTCA AGGTACAATT CATATTCTGT 4 500 

ACGATCCATG CCAAGTTCAT CCATCTTAGC GACAAGGGCA TCGTAATCTT CCTCACGCAT 4 560 

AGACCCACCG ATAATTTCTC CATAGCCTTC TGGAGCAAGC AAGTCTGCAC AAAGCACGCG 4 62C 

CTCTGGATTT CCACGAACTG GTTTCATGTA GAAGGCCTTG ATGGCTGCTG GATAGTTCAT 4 680 

GACAAATGTT GGCACACCAA AGTGGTTTGA AATCCAAGTT TCGTGTGGTG ACCCAAAGTC 4740 

ATCACCATGC TCAAGATGCT CGTAGTCAGC ATCTTCATCA TTTTCATCCT CTTGCAAGAG 4800 

GTCAATGGCT TGATCGTAAG TGATACGTTT GAATGGCTCT GCAATGTAGC GTTTCAAGAG 4 860 

TTCTGTATCA CGTTCCAAGG TTTCCAAGGC TTGAGGCCCG CGGTCAAGAA CACCTTGTAG 4 9 20 

AAGAGCTTTC ACATAAGCTT CTTGCAAGTC AAGCGACTCA TCATGTGTCA AGTATGAGTA 4 980 

CTCAGCATCC ATCATCCAGA ACTCAGTCAA CTGACCGCGT CTTTT7GATT TTTCAGCACG 5040 

GAAAACTGGA CCAAAGTCAA AGACACCACC AAGACCCATA GCCCCTGCTT CTAGGTAAAG 5100 

CTGACCTGAT TGGCTCAAGT AGGCTGGCGT TCCGAACTAG TCAGTTTCAA AGAGTTCTGT 5L60 

AGAATCTTCT GCCGCATTTC CTGAAAGAAT TGGGCTGTCA AACTTCATAA AACCGTTCTT 52 20 

GTCAAAGAAC TCATAAGTTG CATAGATAAT AGCGTTACGG ATTTGCAACA CACCTACTTG 5230 

CTTACGAGAG CGTAgCCACA AGTGACGGTT ATC CATC AAA AAGTCTGTTC CGTCTTCTTT 5 340 

TGGTGTGATT GGGTAGTCTT GAGATTCACC GATCACTTCG ATGTCTGTGA TGTCCAACTC 5400 

ATAGCCAAAT TTAGAACGTT CGTCCTCTTT GACAATACCT GTCACATAAA CAGACGTTTC 54 60 

TTGGCTCAAC CGTTTGATAA CATCAAACTT CTCAAGTCCC ACTTCTTCAC CAAATTTTTC 5 520 

GACAAAGTTT GGTTTAAAAG CCACACCTTG AAAGAAGGCT G7TCCATCAC GCAATTGTAA 5580 

GAAAGCGATT TTTCCTTTTC CTGATTTGTT GGCAACCCAA GCGCCAATCG TCACTTCCTG 564 0 

ACCAACATAG TCTTTTACG7 CAATAATCGT TACACGTTTT GTCATTATTT TTCCTTTTCT 57 00 
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TTTTTATTCT TTATGGCAAA CCACCTCTAT AT7G7TCCCA TCCAGGTCAA TCATAAAAGC 57 60 

AGCATAGTAA ATCGGATGCT CACTTCGA7A ACCAGGACCC CCATTGTCTC GCCCACCTGC 5320 

C7CTAAGCCA GCCTCATAAC AAGCCTGAAC 77CT7CCTTA TTTTCTCCTA AAAAAGCAAA 5 3 30 

ATGAACAGGA TCTTGTGTTC CCTGAGTCAG CCAAAAATCA CCACCAGGAT GACGGCTGTT 5940 

CGGGGATAGA AAACTAATTA GAGAACTAGT CTTAAAAGCC AATT7ATAGT CCAAAGGAGC 6000 

GAGAAAAC7C CTATAAAATC CTTATGAAAT TTGTAAATCC TTTACCTTAA TCTCAAAATG 6060 

ATCAATCATT CTCACTACCC ATAAATGCTT TCAAGCGTTC GACTGCTTCT 77AAGCGTGT 6L20 

CTAGGTCTGT CGCA7ACCTC AGGCGGACAT TTTCTGGTGC TCCAAATCCA GCTCCTGTTA 6180 

CCAAGGCCAC TTCGGCTTCT TCTAAGATAA CAGT7G7AAA G7CTGTCACA TCCGTGTAGC 62 40 

CTTTCATCTC CATGGCCTTT TTGACATTTG CGAAGAGATA GAAGGCCCCT TGCGGTTTGA 6300 

CCACTTCAAA TCCTGGTACC TCTGCAAGGA GGGGATAGAT GG7ATTAACA CGTTCCTCAA 6360 

AGGCCTGACG CATGCTTTCT ACAGTATCTT GCTCACCTGA TAGAGCCTCA ACTGCTGCAT 64 20 

ATTGGGCTAC TGCTGACGGA TTCGAAGTTG TTTGACCTGC AATC7TCCAC ATGGCAGCGA 64 80 

TAATGTCTGC TTCTCCAACG CCA7AACCAA TCCGCCAACC AG7CATCGCA 7AAG77TTAG 6540 

ACACACCATT GATGACCACT CTTTGCTTGC CAATCCCTTC CGATAGGCTA GAAATCGGTG 6600 

TGAACTCATG ACCATTATAA ACCAAGCGGC CA7AGATATC GTCTGCTAGG ATGAGAATAT 6660 

CATTTTCTAC AGCCCAGTTT CCAATTGCCA AGACT7CC7C ACGGGTCTAA A7CA7ACCTC 6720 

TGGGATTAGA TGGCGAATTC AGCACCAAAA CCTTGCTCTT GTCAGTGCGA GCTGCTTCTA 6780 

ACTGCTCTAC GGTCACCTTA AAGTGATTGT CTTCCTTAGC AGAAACAAAG ACGGGAACGC 634 0 

CTTCTCCCAT CTTCACCTGA TCTCCATAGC TAACCCAGTA TGGGGTTGGG ATGATGACTT 6900 

CATCACCTGG ATTGACCACA GCCATAAAGA AGGTATAGAG AGAATATTTG GC7CCCCCAC 6960 

CGACTGTCAC TTGATTTGAC GCTACAGAAT AGCCGTAAAA GCGCTCAAAG TAGCTATTGA 7020 

CCGCCGCCTT AAGCTCTGGC AGACCTGAGG TTACTGTATA AAAAGAAGCA CGCCCATCTC 7030 

GAATCGATGC AATGGCGGCA TC7TGGA7AT TTTTGGGAGT AGTGAAATCT GGCTCACCCA 7140 

AGGTTAGAGA CAAAATATCT CTACCCTCAG CCT7CAGTGC 77TCGCACCG GCTCCAGCAG 7 200 

CCAAAGTCAC ACTTTCTTCC ATTTCTAAAA CACGGTTGGA 7AGTTTCATA GGCCC7CC77 72 60 

GT7GACCAAT GCTCCTGTT7 CAAAATCTAC TAGATAAAAA TCAGA7CC7G ACTTAACTTC 73 2 0 

CCAGAT7GGC 7TATCTTGAT AACCCCCAAA GGT7ATCT7C 7CAA7CTCGC CAGCTCCC7T 7 3 80 

7TCCT7AGAA ACCG7T7C7C CT77TTCT7G TGAAACACCC 7CATT7AGC7 GA7AAACGTA 7440 
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AATCTTATGG TCATCTTTAC CAATCAGGAC AGCAAGCGCT TCTTGC7CTT TGTTACGACC 7 500 

AAGAACGCTG TAATAAGATT CCAAGCCATT GTATAAATCA ACCTGATCAG CCTGCTCTAA 7560 

TCCTGCATAC TGCTGAGCTA ATTTTTCTCC TTCACTTTTA GCTGTTTGAT AGGGTTTCAT 7 620 

GCTAAGAGAA AC CAT AT AC A GAAAGGAACC ACTGATAACC ACAAACAAAA TCGTCATCCC 7 680 

T AG A CC AT AC TGCCACAGTA GATTATTTTT TCCTTTCTTT TGTCTTTTTT TCACTCGTCT 774 0 

ATTTTACCAT CTATTAAGCT TTATTACAAG TGAATATAAG AATACTCTTC GAAAATCTCT 7800 

TCAAACCACG TCAGCTTTAT CTGCAGACCT CAAAGCTGTG CTTTCAGCAA CCAATTCTAT 7 8 60 

TTCTCCCTTC AAACAAAACC GAT7TTGAAA GTGAAACAGT TCTTACTTTT TCAGTCACAA 79 20 

ATGATTAGAG TTTGCCGGG 79 39 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9897 base pairs 
(8) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
( 0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCGCTCTACC GTCAAATAAT TACCATTTTC TTTAATACCG AAATTTTTAT CTACTGAAAA 60 

TTCAGTTGGT CTGTTGGTAC GATCGTCGTA TACAGTACCA TTCTCACGAA TAGTATAATT 120 

GTAATCAGTA TCACCTTGTT TCCTTAATTT AAGGTAATAA TTACCATCAA TTTGTTTATA 180 

ACCTGAATC? 7TTCTAGTTC CTTCTCTAAA ACTTACTCCA GCAGGCATCA CATCAGCAAA 3 40 

CATGAGTACT TGTTTGTTCT TTTTTTCAAC AATAACAGAG TCAATATAGG TTCCACCACC 300 

GCTGATTTGT AAGTCACGTC CACCAACTTC ACGAGGCCAT TCTAATGGTA CTCCCCCAAA 360 

ATCATCGAAT GCCAATGTTA ATTTTGGTTT AGTCCATGTC TTACCATTAT CATC ACT ATA 4 20 

ACTTGTAGCA ATATTAATTT TATTCAAGAA ATCATGAGTT CCACCGTAAC GAGCGTCAAT 4 80 

GCTTGAAAAT ACCCGACCAT TGCTAAAAGT ATACAGAACT GGAATACGGA AATAGTTAGA 540 

ACCTGTTGTA TCATTAGCCG TATAAATTAA ATGTCCAGTA ACAGCGTTTG TTGTCATCTT 600 

TTTAACAGTT TCTTCATCCA ATGCACTATT AAAGAATTTG ATATTTTCTA GTGTTCCGTT 660 

AAAACCAAAC CCCCTTTTTC CTGCACGTTT CACTCCCCCA AGCATATAGT AATCAATACC 7 20 

TTTAATATCC TTGATGTTTA GGAAATTATC CACTTTCTTT TCTACTACTT TTGTACCAT? 7 3C 

TGCGTATAAA GAATATCTTT TTTTGACTGA ATCTGCTACT ACTGCAACAG TGTTAGTCAC 940 

AGCCTCTTCT TTGTACTTAC CCCAAACTGA AGCAGGTCTG GATACTACGT TATTTTTATT 900 
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GGAAGAAGTA 


TCACGCGCTT 


CCATCCCCAA 


CTCACCATTG 


TCTCTAAGGA 


AfAPATfTAr 


q ft rt 


ATAACTATTT 


TGTTGACCGG 


GTTTGGAATT 


AGATATTCCA 


AACAGAGCTT 


GT AAGr r*T* n * T ' 


J. U <i u 


CTCACTTCAC 


TGATTGTACT 


TAATCACTAC 


AGTAAAGTCA 


CCGCTAC m AA 






TAACTCTTTA 


GTAACATTTT 


CTCCGCCCCC 


TGTTAAAGTA 

4 W 4 4 J n VW^VJ 4 J~| 


AC ATT ATT m T 
rt^rt l irtl 1 . l 


-V*. m p*T« X A {* * p 
L 4 1 ^_ 4 sv\\j/tl_ 


1 1 A A 
1 1 4 U 


AGGAGTTTCT 


TCCGCTGTAG 


AAGATGGATC 


(TAAGAGTA 

W * l rtrt^»rtVJ I 


Ui 1 > \_rt/*A. 1 VJ 


L i ^.wMuu 1 1 U 




TACAGTAACT 


TCCGAAGAGT 


TATCCGATGT 

4, rt * W^VJrt 1VI1 


AGGTTGTACT 

rtv»w i i u i /as*- 1 


Tf^AA ATPf. 






TGCAACAGGT 


TGCACCAACT 


TGGTG TTG A 


TAGTTfAflAA 


r*T'"'TPA CTC** 


. uaUL 4 tsJ^. 


4 J^'J 


AACTGAGTTA 


GCAACAAATG 


CTGATAATAC 


CACT ACACIA 






■"ton 


AATATTTTTT 


TTCATTTTAT 


TTTTP-CTCGT 


TTAAAAGT'**T 


c at aac a a n*^ 

Ort 4 rtrt^.rt/\0 4 


till inALnu 


1 A A 1 


TTTCATCATT 


GCAATGAATC 


TTTGGTTGGT 


flAAflATfTTP 


4 1 ^ t*U\/\S\\j I l_ 


i P*P* llfiTST 


1500 


TCCCTGGAAG 


C AATT C AAC A 


ATTTGATAGT 


GTTTGPT ATP 

V— 4 4 4 4 J\ 4 *w 


GTAAAAAP.PA 

l rtrtrtrtrtU^rt 


ATlTrPTTrT 
nlnl I I ^_ 1 




CTTCGCTAAA 


AGGTACACGT 


GACTGGGCAC 


GAACTGGGGA 


agttaptp.pp 


nil U l 1 L rtVj 


i n t r* 


TATTTTCAAC 


AACAATATGA 


ATATCTAAAT 


ATTT PT r ' , A TCI 


AflTT^PAAAA 
rttj 11* \.niv\n 


A^ATTrrTP 
rt i Ai 1. tlj 


t can 


GAACTCCATC 


AGCTAGATAA 


GTCATACAAT 


TTGCAAAAAP 


A L l • 1 l-\_<«»s»Vj 


TP A AT ATP A A 
i ^rtrt i_rtrt 


1 "7 J A 


TTTTTCCATC 


AACTAAATCT 


GTCAAATTTG 


4 J* 4 4 4 4 \v 4 *\+\ 


AAAATPAPAr; 
rtrtrtrt 4 \«rt\_rtvj 


rtV_ til t Vafrtrtrt 


iOUU 


AATATTTATT 


GACAGAAGCA 


TATCGTTTAA 


AATCAGAT"*G 

**** i v«n\jn 4 * v^ 


TTCAGAAATA 
i 4 Lnunrtn 4 rt 


ATPATATTAT 

rtiVrtlrtl 1 l V 1 


i fl fin 


TTT CT C7T T7 


CTATTAGTGA 


CGAAC^^TCC 

^VJ/ u^Vpi * * W W 


AACTTGAA^P 


PGP"*TT A A' T " T ' 


i 1 u l rtrt i rt * 


i s *> n 
i j *; u 


CATGAATCGT 


TCTATA7TTA 


GGTGCAGATA 






^ A " A C A A* 7 * A 

Ort 4 ttL rtrt 4 t\ 4 


i 3flft 


AACCTGAAAC 


TAf^GATACA 


GAGA T TGAAA 


^CAATCAATA 


TGrfTAr.TAf^ 






TTGGAGGAAG 


GAAGTATTTA 


AT AAAT AC C A 


TGACGATGGT 


TGATACAATr 


Ap.pnrTPPAT 


n 1 

* iyu 


AAGCACCTTG 


TTT ATT TGC T 


TTTTTAGAAA 


CAAATCCAAG 


AATAAATACA 


CCACGAAGTA 


2160 


GACCAAGTAC 


AAGTCCCATG 


AAACTATTGA 


ACCATTCGTA 


TGCAGATTTA 


ATATCTGAGT 


2220 


GAGCCATGAC 


AATGGAAACA 


CCAATTGAGA 


ATAAACCTAC 


TGCTAGAGAT 


ACGAATTGTG 


2290 


CAATTTTCGT 


ACGACGATTG 


TCTGACATAT 


TTTTAGAAAT 


GACA7CTTGA 


ATATCCAATG 


2340 


TCCATGAAGT 


TGCAACAGAG 


TTCAAACCTG 


TTG AAAT AG T 


TGATTGACAT 


GCTGCATAAA 


2400 


TCGCTGCCAA 


GATCAAACCT 


GTGATACCTA 


CTGGTAACTG 


GTATGCAATA 


AAGT AC AT AA 


2460 


AGATTTGGTC 


T7GAGGGATA 


TTGCTAGCTG 


CACTATCTGC 


ATTTTGTACT 


TGATAGAATA 


2520 


CCTACAAGCC 


TGTACCAATC 


AAGT AAAAG A 


C7GTTCCACT 


TGCAAGTGAC 


AAAACACCCT 


2580 


TTGTGAACAA 


CATC7TATTA 


AGTTTCTTAA 


TATTTTGTG7 


TGTAGTAAAA 


CGTTGAACCA 


2540 
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AATCTTGAGA TGAAGCATAC GAAGACAAGA TTGTAAAGCC TGAACCCATC ACAATTAAAA 2 700 

AGATGGAGTT TGAAAGCAAG TTAGGATCGA AAAGTTTTTC A7TTCCAGCA AGGAATTTCC 27 60 

CGTTTGC7AA TGTTTCTGCT ACTCCACCAA AGCCACCTTT AATATTAGCA ATCAGTACAA 28 20 

ATAAAGCTAA AACGACACCA CTAATCAGAA TCACACCTTG AATAAAGTCT GTCCATAATA 2 88C 

CGGATTTTAG ACCACCAGTA TAAGAATAAA CAATTGCAAC TACACCCATC AAAATAATCA 2 940 

AAATATTGAT GTCAATTCCT GTCAATACTG ATAAACCAGC TGATGGGAGG TACATAATGA 3000 

TAGACATACG TCCCAATTGA TAAATAATAA ACAAGAGTGC TGAAATAATA CCAAGTGCTT 3 060 

T AG AATT AAA ACCTTTATCC AAGTAATCAT ATGCCGTATC GATGTCTATC CGTCCAAAGA 3120 

TAGGTAAGAT AAAACGAATT GTCAGTGGAA TAGCTACTAC CATCCCTAAT TGAGCAAACC 3130 

ATAAAATCCA GCTACCTCCA TAAGAGCTAC CAGCGAGTCC CAAGAAGGAA ATCGCACTGA 3240 

CCAT7GTCGC AAAAATGGAT ACCGAAGTAA CATACCAAGG AACCGAACCA TCTCCTTTAA 3 300 

AGAACTCTTT TCCTTTCATC TCTTTTTTAG AGAAATAGAT ACCTGCAACC AACACCCCAA 33 60 

GTAAATAAAC AATCAAGATA ATTAAGTCAA TTATTG7AAA TCCTGTTGTG CCCA7AACA7 34 20 

ATCTCCATAT TGATTTTATT TATTATAAAA ATTCTTTTCG TGCTTCTTGA ATAAGTTCTG 34 80 

CTGCTTGTTT TGCAACTTCC AAGTCACCTT CTGCCAATGC TTCTAAAGGT TGACGAACAG 3 540 

AACCTAAATC AAGTTTTTCA TTTAGACGCA AAACTTCTTT TGCTACAGCA TACATATTTG 3600 

CCTTACCTCA TATCATCTTA TAGATAACTT CATTGATAGC ATATTGAAGT TTTTTAGCTG 3 660 

7ATC7AAA7C 7CGTTCTTGA ATCAAACTTT CCAA7T7CAA CAACAAA7CT CGCA7AACCC 37 20 

CATAAGTACC ACCAA7ACCA CC7TC7CC7C CCATCAAGCC ACCACCAACA TATTGTTCAT 3 7 3C 

CTGGACCATT GAA7ACAATG TAATCTTCTC CACC7GCAGC TACAAACA77 TGAA7ATCTT 3 840 

GTACAGCCAT AGAAGAATTT T7AACTCCAA TCACACGAGG ATTTTGACGC ATTCTTGCAT 3900 

ACAAACTACC AGTCAACGCA ACCCCTGCCA ATTGTGGAAT A77A7AGATA ATAAAATC7G 3 960 

TATTTGACGC AGC7TCAC7C AT7GCATTCC AATATGCTGC GA7TGAA7AC TCTGGCAATT 4C20 

TGAAATAAAT ACCTCCCATA GCTGCAATAG CATCGACTCC AACACTTTCT GAATGTTTTG 4 080 

CCAA7TCGAT AC7ATCTTTC G7GTTATTAC ATGCAATATG GTTGA7AAC7 GTTAATT7AC 4140 

CTTTACCAAC TTCCATAACA GCTTCAATAA TTTGTTTACG A7CTTCTACA CTTTCGTAAA 4 200 

TACATTCACC TGAAGAACCA 7 7T AC A TAG A TACC7T7TAC ACCTTTG7CA A7GAAATAT7 4 2 60 

GTACCAGAGA TT7TACACGA TCTTCGC7AA TT7CACCATT 7TCATCATAG CAAGCATAAA 4 3 20 

ATGCAGGGAT AACGCCTTTG TATTTAGT7A AA7CTTTCAT CAGATTTCTC CTTTATATTG 43 80 

T7TTTTA7TT GATGACATTA ATAAATCCCT GAGCAATTTC TT7TGGACGT G7AA7CGCTC 4 4 40 
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CACCAATGAC TACACTCGTA ACACCTAAAC TATAACCTTT TTTTAATTGT TCTGGATAAT 4 500 

CAATTTTTCt TCCCCAATTA CCCGAATATT AAAATCACCC AATTTT7TCA TTAGTTCAAA 4 550 

ATCAGGCTCA TCTGATTGTA CACTTGTACT TGTGTAACC? GATAATCTTG TACCAACAAA 4 620 

ATCAACGCCT CATTTAAATG CATAGAGACC TTCATCTAAA TTACTTACAT CCGCCATCAC 4 680 

CAATTCATTC GGATATTTTT CTTTTATTTT TTTCATAAAT TCACTGACAA CTAAGCCATC 4 74 0 

ATATCTTGGT CTTAAAGTTG CATCAAATGC AATGACTGTT GTTCCGCATT CTACAAGTTC 4 900 

ATCTACTTCT TTCATCGTAG CAGTAATATA 7CCTTCTTGA GGTCGA7AAT CCCTTTTGAT 4 860 

AATTCCAATT ATTGGTAAAT CTACTACT77 CTGAATTCC7 TTAATATCAC GCACAGAATT 4 920 

TCCCCGAATG CCCACTGCTC CTGCCTCTAA AGCTCCTTTA GCCATAAAAG GCATCAAGCT 4 9 90 

AAATTCTTCA TTATAAAGGG CTTCACCACC TAAACCTTGA CAAGAAACAA TGACTCCACC 504 0 

TTGAACTTGG CTTATAAATT TTTCTTTAGT CCAAATTTGG CTCATTTTAT TATTCCTCCT 5100 

TATGGATAAT AG7TTGATTG TAATAATATT CTCTCTCTCG ACTTTCCAGA TAATTAGAGA 5160 

ATAAGCAGTC TCTAATTAAA AGTATTGGAA ACTGAGGTGA TATGCGATTG CCATACGAGA 5 22 0 

GATGATCCGT CGAAGCTAAT AACAATAGTT CATCAAAGAA ACAATCTTCT TCGTCAAATT 5280 

TTCTTGTAGT CATTAAAACT GTTTTAGCGC CTTTATCTGC AGCTTTTTGT AGACCTTCTA 5 340 

GTACAATATC AGTTTGACCT GAAATGGATG CTCCAATGAC AAGGCAATTT TCATTAAGTA 5400 

GTAAGCTACT CCACAAAATC ATATCCTCGT CTCATAATAC TTCACCAATC ACTCCGAGAC S4 6C 

GCATAAATCT CATCTTCATT TCTTGTAAAG CAAGAACACA AC7TCCTTTA CCGTAGAGAT 5520 

ATACACGCTC ACCAGTTTCT ATCATCTCAG CAATACCCTC AAGTTGAACT TCATCAAGAA 5590 

CCGTGTAAGT TTTTCTCAAC A7TTCCTCAT AGTCGGATAA AACTTTTTCT C7TCCCTCTG 5640 

TATATAATGC CAA C TTTTCT TTCTCATGAA TCATCTCT7C GTATTTGAAA ATGAATTGTC 57C0 

TAAAACCTTT AAAACCACAT TTTTTCGCAA ATCGAGTCAA TGTTGC7TTG GAT AC ATT AA 57 60 

GGTATTCGCA CAATGCTTTA GATGAATAAT CATTCAGAGG TTGCTCTTTT AAGAAGAATT 5820 

TAGCAATGTC TTTTTCAGCA TATGCCATAT TTGCTAACTT AGCTTCTATC ATTGGAATTA 5880 

GTTCTTTTTG CAGTAAC ATA TGAGCTCCTT AGTTGAAGTA AACGTTTACA T7CTTTATTT 594C 

TAACACTTTT TTTTTTTTTC AATATTTTTC ATAAATTAGA AACTAGTTTC CAATTTCTTT 6C0C 

CGTTTCATAA CAGAACAACA AACATAAAAA TATAATAGTT TTTATTCTTT TTATCGTAAT 6060 

TATATGTATT GTAAGAACGT TTATCACTAA TAATATGTTC ATATTAAAAT ATTTTAGTAA 6120 

TATTTTATTT TGGTTTTATT ATTTCTTTTC GCAA7T7CTA TATAATAT7T 7AT7TCTAAA 6180 
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AAAATTGAAA 


AAATATTTCT 


AGTTTCTTTA 


TTTTATATAG 


r.TAAT?VTATT 


TTATTTCTAA 


6240 


ATTAAAACAG 


AATCCCATAA 


AAACTACAGA 


TTTATGAGAT 


AAATCAGGTC 


ACCTATTTTA 


6300 


AAAAAGCAGC 


AAACTATAAA 


CTAAAAAGTT 


CCACACCAAA 


TGTAACCCCA 


TACTTCCCCA 


6360 


TAAGTCAGAT 


TTATAGCGCA 


CCATACCTAA 


AAACATTCCA 


AGTGAAACGT 


ACAGACACCA 


6420 


AGCTAGAATG 


GTTCCTGGAT 


GATGTACTAA 


GGCAAATAAA 


ACACTTGTCA 


AAGCAACTCG 


6480 


AATATCTAAT 


TTTCTAACCA 


AGTTCCATAA 


AATTTCACGA 


TACAGAAATT 


CTTCAACCAT 


6540 


ACTCGCATTG 


ATTAAGAACA 


ATAAAAATCA 


AAACCAAGGA 


ACTTGATGTT 


GAACGCCAAT 


6600 


TAAATTTGTT 


TCATTCGTGC 


TTCCTTGAGC 


ATGAATCAGG 


G7AAAACATA 


GACTTATAA7 


6660 


CAGTAGACTA 


GCTAGTCCAA 


TACCAACGCA 


TTTCATCCTA 


GTTTTCATAT 


TGACCTTGAC 


6720 


CACTTGTTTT 


CGTTGACCAT 


ACATCCATAA 


AAAAGAAAAA 


AGAGACGCAC 


CATAGAGAAC 


678C 


CTGTAGTATA 


GTTAACTCAC 


CCATACAAAG 


AAATTTCAAT 


AAGTATAGAG 


ATACCAATAG 


6840 


GACATTTACT 


TCTTGGAATA 


TATAAACTGG 


AATTATTCTT 


TTCATAGTTA 


CCTCCGAAAT 


6900 


AAATCTTCAT 


AATCTAAATC 


TAATATCTGC 


ACAATCCTTT 


CTACCCATGG 


ACTTTGAGGC 


6960 


ATTCGTTGTT 


CCATCTTGTA 


GTGGCCAATC 


TTTTGATATA 


AACGATTCAA 


TTCACTTGGA 


7020 


TAGTGAAACT 


CTCCCGCAAA 


CATTTTTCTG 


GTTAACTCAA 


TCCAGCTGAT 


ATTTCTTTCA 


7030 


GCCAAAATAA 


TGGACAAGTT 


CTCCCAAAAT 


CGTTCAGCCA 


TATTrCTTCT 


CCTTTAGTTA 


7140 


GATAAATAAT 


GTGTTTGyGC 


CATGTAAATC 


AATTGTTTCG 


TATCTCTTGG 


CAATAGAGCT 


7200 


CTACCCTCTT 


CCAAATTCAG 


ACTTGCATAA 


ACCCGCTTAT 


TTGAAACCAC 


AAAAGGAACT 


7260 


CCGATGGTTA 


GTTCAGGATT 


TTTTAAAATT 


ATCTCAACGA 


AATCCGT7AA 


TCTTAGATTG 


7320 


TCACGGTTCT 


TAAATCGTAA 


TAAATTGGGA 


GATAAAAACT 


CAAAACAATC 


TGAAGAATAG 


7380 


CTCATCATCT 


CAATTAATTT 


GTCCTTTGTC 


ATTTCAGAAA 


CTCAATGACA 


AGATACCTCA 


7440 


ATCCCATAGT 


TTTGGAAGAA 


GTCTAAAAGA 


AGTTGATTTC 


TTTGGCTATT 


TTTACTTAGA 


7500 


TAGAGATCAA 


TCATGGGAGA 


CCTCCAACAA 


ATTTGCrrCC 


ATTTGATATT 


CTGAGACGAT 


7560 


TAAGGAATCT 


AACAACTTTG 


AGAAGTTAAT 


CGATTTCTTG 


TCTTCATCAT 


AAGCTTTTAC 


7620 


AGTTACTTGG 


GTTGTAAGTA 


TCCCCTCTTT 


TCCCTCGGCT 


CGATAGTCTT 


GTCAATATAA 


7680 


AACAAAAACA 


AGATTCTGAT 


TATCATCTAC 


AAAGGCATTA 


ACTCCGTTCT 


TTATATCCTG 


7740 


ACTTTCAACG 


AATTCC AT AA 


CGTTTTGAAG 


ATAGGATTCA 


TAAAATAGTG 


GGTAATTATG 


7800 


TTTTTTATGG 


TAATCATCTA 


AAAATCTTAC 


CTCAAACTCA 


CATGGATAAT 


TGGGCATCAA 


7960 


AAA7ATTTGT 


TCATCCACCT 


GTTTGATTTC 


TGCATCATCT 


AATTCTGTTT 


C7AATTCATC 


7920 


ACAATCTAGT 


ATTGATTCTT 


TATTTAATGC 


TTTTATCTTT 


TTCCTCTATT 


TCTTTTAATT 


798C 
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TC777GCGAT TGCGGCAATC ACAGCAACGG TTACACTATT ACCAACTTGT 77ATAGAGC7 9040 

CACTATTAAT AGAGACTTTT CTAGCAGCT7 CAAAAGCCTA ATCAGGAAAG CCATGCAATC 3100 

GAAAACACTC T7TACCAGTG ATTCGTCGTA T7CTCAAACG GTAAAATTGT CCATCTATTA 8160 

AAACACCAGC TACTTGGTAA ACTTGTTTAT CTTCTCCTTC ATAGCTAGCC AC7ACTACTC 3220 

CCATTTGACC ACTAGTTGTT AACGTATTAG CTATACCTTT TCCAACTCTA CCACGACGAT 8280 

ACTGAGAACT TGGTCTTTCT AAATTGATTG AATCCCCAAT CTCTGCTTGA GCA7A7CCT7 834 0 

TT77CGT7GC TTCCCGTACT 7TTAGAAATT GG ATTCC7TC TGGAATTAGT ATTTTGGGGA 84 00 

77T7A7C7CC TCCTTGCATC GTAGTCAGTG T7GCACA7AA GCCC7CAC77 CCATAGACAC 84 60 

GACCTG7C7C CT7AAAGCTA GTCGGTAAAT CTCCAACAAC GACAATCCCA TAACGATCCT 8520 

GAGTATTTAA AG7AAACATC GGCTCTTGA7 TTTCCTTAAA GCGTCTCCCA TTTTGTCTCT 8580 

7GTCTAATC7 ATCTGGTGTC ATACAAGGAA TCGCAACTTT AAATCCTTC7 CCTTTACCAC 8640 

GAAC7AAGGT 7GGCGCAAGA CCTTCTGAAT AATAGAC7TT ACCGCTCATT CCACTTC77G 37 00 

ATGGATTCAA ATTTCCTAGT GCTTTCAAAG TCTCAGAGT7 AGTTCCTTGA CCTTCTCCTC 87 60 

TGAAAGGAAA TAAGAGTCTG GTACCTTTCT 7TCTAGAATG TCCGATAATA AACACCCTC7 3820 

CTCTGTTTTT GGGAACGCCA AAATCCTTAC 7GTTAAGCAC CTGCCACTCA ACATCAAACC 8880 

CCAACTCA7C AAGTGTCGTA AGTATTGTGG TGAACGTCCG TCCCTTATCG 7GATTGAG7A 8940 

GCCCTT7AAC A7TTTCAACA AAAAGAAAAC GTGGTTGGA7 TTC7TTCCCC CCCCGAGCAA 9000 

T77CAAAGAA CAAAGTTCC7 CTAG7ATCT7 CAAATCCCAA TCGTCT7CC7 GCGAT7GAAA 9060 

ATGC77GACA AGGGAATCCC CCACAGATGA CATCGACT77 CCCTCTAAG7 7TT7TAAA77 9120 

CG7CA7C7GA AACATC7CG7 ATGTCATGAA A77CTATTTC TCCTTCCGT7 TGAAAAA7GG 9180 

AC7TATAAGA T7TCCTACCA AATTTATCAA 7C7CACAAAA 7CCCAAGCAC 7CATGCCC77 92 40 

GAGC7TCCA7 TCCCATCCTA AAGCCTCCTA TCCCAGCAAA 7AAATCTAAA ACCCAAATCA 9 3 00 

77CATACCTC TCTCAACTAG ATGTAACTTA CAAAACCCCT GACC7CA7GA GCCAC7TTCT 9 3 60 

TCC7CCTCAT GAGGTCAGTT 7TACT7TC7G CTGTTCCAGT ATCGTT77TC C7CGCTAGAT 94 20 

T7CC7CAAAA GGGCAGACTC CTCCCTTGGT 7CGTCACACG ATTTT77CA7 CTCGAC7G77 94 80 

CTT7AATGCA TCATTAACGA CGCTTTTC7T CTAGGTGGTT CATAAGGAAC AGGAAGATTC 954 0 

AGGTTGACT7 TTCTAATCCT AGAATAAAGT GC7GAAAACA ATTCGGAATA GG CAT AG AG A 9600 

C7AGACAA77 7GAGGAGC7G CTTGCGTCCT G77CGAACAC ATTT7CCTAC CACGTGAAGA 9660 

AAAAGATGGC GGAAGCG77T GATTGTTAAA G77TGGAAGT CACC7CCAGC TAGA7CT77G 97 20 
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AG AAAAAG AT AGAGATTGTA GGCGATACAG CTCATCATCA TACGAACTCG TTTTTGATTA 97 80 

AGGTTGAACT ATCCCTTTTA TCGCCAAAAA ATCCCTCCTT CATCTCCTTG ATGAAATTCT 9 840 

CGGCTTGACC ACGTCCACGA TAAAGCTGAA ACTGCTC7TG CCTTCTTCCC GTACCGA 9897 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 8148 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xil SEQUENCE DESCRIPTION: SEQ ZD NO : 11: 

CCGTGGAACA AGCCAAGACC AGTTTCACCT TTATCGTGGA CGTGGTCAAC CCGAGAATTT 60 

CATCAAGGAG ATGAAGGAGG GATTTTTTGG CGATAAAACG GATAGTTCAA CCTTAATCAA 120 

AAACGAAGTT CGTATGATGA TGAGCTGTAT CGCCTACAAT CTCTATCTTT TTCTCAAACA 180 

TCTAGCTGGA GGTGACTTCC AAACTTTAAC AATCAAACCC TTCCGCCATC TTTTTCTTCA 240 

CGTGGTAGGA AAAT GTGTTC GAACAGGACG CAAGCAGCTC CTCAAATTGT CTAGTCTCTA 300 

TCCCTATTCC GAATTGTTTT CAGCACTTTA TTCTACGATT AGAAAAGTCA ACCTGAATCT 3 60 

TCCTGTTCCT TATGAACCAC CTAGAAGAAA AGCGTCGTTA ATGATGCATT AAAGAACAGT 4 20 

CGAGATGAAA AAATCGTGTG ACGAACCAAG GGAGGAGTCT GCCCTTTTGA GGAAATCTAG 4 80 

CGAGGAAAAA CGATACTGGA ACAGCAGAAA GTAAAACTGA CCTCATCAGC AGGAAGAAAG 5 4C 

TGGCTCATGA GGTCAGGGGT TTTGTAAGTT ACATCTAGTT GAGAGAGGTA TGAATGATTT 600 

GGGTAAATAC AATGAGCTTG AAAGAAGTAG CAAACTCACC AAGCGCCAAT TCTTTGAGAA 660 

TCAGATGCTG GATTATACCA TCATTGCGCA TGAGAGTTTT GAAATCATCC GTCATTCTGT 7 20 

CTACCAGACA GATGATCGTG AAGTGGAAAA TGCTCTGGC? TTTGAAGTGA AAAATGATGA 7 80 

AACAGACAAG CTGATTCTGT TATTAAGCGA GGATATTGGT GTAGGTGAAA AATTGTGCCT 840 

CGTTGACGGA ACAAAAATGC GTGGAAAATG TTTAGTATAT GATAAAATAA ATGAGAGAAT 900 

CATTCGCTTG CAGTGCTAGA AATAGGCATT TTGAATAGTG AATATGTTAT AATAAGTATT 960 

AGTAGGAGGT GTTTTAGATT GGAGAAGAAA CTGACCATAA AAGACATTGC GGAAATGGCT 1C20 

CAGACCTCGA AAACAACCGT GTCATTTTAC CTAAACGGGA AATATGAAAA AATGTCCCAA 1080 

GAGACACGTG AAAAGATTGA AAAAGTTATT CATGAAACAA ATTACAAACC GAGCATTGT? 1140 

GCCCGTAGCT TAAACTCCAA ACGAACAAAA TTAATCCGTG TTTTGATTCG TGATATTACC 1200 

AACAGTTTCT CAAACCAAAT TGTTAAGGGA ATTGAGGATA TCGCCAGCCA GAATGGCTAC 12 6 0 
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CAGGTAATGA TAGGAAATAG TAATTACAGC CAAGAGAGTG AGGACCGGTA TATTGAAAGC 13 20 

ATGCTTCTCT TGGGAGTAGA CGGCTTTA77 ATTCAGCCGA CCTCTAATTT CCG AAAATAT 13 80 

TCTCGTATCA TCCATGAGAA AAAGAAGAAA ATGGTCTTTT TTGATAGTCA CCTC7ATGAA 144 0 

CACCGGACTA GCTGGGTTAA AACCAATAAC TATCATGCCG TTTA7GACAT GACCCAGTCC 15C0 

7GTATCGAAA AAGGT7A7GA ACATTTTCTC TTGATTACAG CGGATACGAG TCGTTTGAGT 1560 

ACTCGGATTG AGCGGGCAAG TGGTTTTGTG GATGCTTTAA CAGATGCTAA TATGCGTCAC 1520 

GCCAGTCTAA CCATTCAAGA TAAGCATACG AATTTGGAAC AAATTAAGGA ATTTTTACAA 1580 

AAAGAAA7CG ATCCCGATGA AAAAACTCTG GTATTT ATCC CTAACTGTTG GGCCC7ACC7 17 40 

CTAGTCTTTA CCGTTATCAA AGAGTTGAA7 TATAAC7TGC CACAAGTTGG G7TGATTGCT 1300 

TTTGACAATA CGGAGTGGAC 7TCCT7TTCT TCTCCAAGTG TTTCGACCCT GGTTCAGCCC I860 

TCCTTTGAGG AAGGACAACA GGCTACAAAG ATTTTCATTG ACCAGATTGA AGGTCGCAAT 1920 

CAAGAAGAAA GGCAACAAGT CTTGGATTGT AGTGTGAATT GGAAAGAGTC GACTTTCTAA 1980 

AATGAAGGAA AATGACTTGC AATCTCTCTT AAGAAATAAA ATAATCCCAC CTAGAACAAG 2040 

CTAGGTGGGA TTATTTGCCT ATGAAATGAG AAATTATGGG AGCAAGCTCC TAAATCAACT 2100 

GTTTTTGATC TACTTCTTTA AC7ACTTGAT AAAAGTTATA GAAGTAGGCC AAACTTGAAA 2160 

TGATGGTTAC GACTAGGAAT ATTGAAAATT TCCATTGGAC AGGGTTGGTT AAAAGTTGTG 22 20 

GAAAGGATAT GAGGAGAAAG AAGAGCGCTC CCTTGAGGAC AGGTATCCGT TTTGATTGTA 22 30 

TTTTCTCAAG TCCTTTATTC AGCGCAGGAA GAAAGAGGAG 7AGGAGTAC7 AAAAC7CTA7 2 3 40 

GAGAAATAGC TCCTGAAGTA AGCCCCAAGA AAAGGAAAAT ACTGATAAAA ACATGAATGA 24 00 

TCAGTAGTCT AGCTAGTGAT TTCATAAGGC ACCTCCTAAT CCTGG7CTTT TTTAGCTCTT 24 60 

GCAATACGAA GTGAGTCGAC AATATGTATC ATCACTCCGA AAAAGAAAGC TCCCAGTATA 2520 

GTTTTAAAAA TATG7TTTGT ATTTAGAAGA GAACTGATAA AATTTGGATT TTCACTTGTT 2 580 

AGGGTATCAA TGAGTGGAAT TA7AAAAAAT ATCACTGTTC CATAAATCGA ACCTCCTTTC 2 640 

AGACCAGGAT AACGTAACTG TTTCTTTTCT TTTTTCATGA GTTTCCTCCT AATCCTCATC 2 7 00 

TTGATTTTTC TTAGTTTTTG CAATGCGACG GGACATCAGG AACTGTATGC TCGCTCCGAA 27 60 

GAAAATAGAA CCGAGAATAC TTGAT AC ACC ATTTCTTATA GTGAGAAGAG AATGAAAATA 2820 

GTCCTGACCT TCATCTATGA GTATCCTGAG AAGAGGAGTT ATAAAAAACA TCCATAGACC 2880 

AAAGAACAAA CCTGCTTTCA GACCTGGGTA GTGTACTTGC TTGC7TTCTT TCTCATTCAG 2 94 0 

CA7ATCTGGT 7CAA7GACTG 7GATGCCTGT T7T777CAT7 TGGTAGG7GA CATAGCCAGA 3 00C 
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AGCGATGAGG GCAATCACTA AAATCAGAGG AGGATAGATT AGAGCCACTT CTTGAGCGTA 1060 

TTTATAGGCC AGAAGGAGTG GAATAAGATT TCCGAAAATC ATCAGATAAA AGAGGATGAT 3120 

AAAGACTTGG TTCCCAATAC TATCGGCCTC ACGCCGTTTG TATTCGTCAA GGGGACCAGA 3130 

AATACCGTAT GTCCCTTTGA TCAGTTTTTC AGTGAAGGT? TCTTTTTTCA TCAGTTTGCT 3 240 

CC TTT TTTAA AAATCTTCCT CCCAAAAGAG ACTCTTCAGC TCACTTTGGA GGCTGCGGGC 3 300 

GAGATTGAGA CAGAGTTCCA AGGTTGGATT GTACTTGTCG TTTTCAATCA TATTGATAGT 3 3 60 

CTCTCTCGAG ACACCGATAT CCTTGGCGAG 7TCGAGCTGG GAAATACCCA ATTCCTTGCG 3420 

AAATTCTTTC ACACGATTCA TCTGTTCTCC TTTCTGATTT ATGTCGTATA TATTTGACTA 3480 

TATTATACTC TTTTAAACAT AAAGTGTCAA GTATTTTTGA CATATTTTTT GAAGAAATAG 3540 

TAGTCTCCTT CTCCTATTTG TCTGACAAGT GCAAGCTGGT CGGATTTGTG GTAAAATACA 3 600 

TAAGATATCA CAAAACAATT TCATCATGTA ACGGTCTTAC TCCACGAAAC GATTGATATG 3 660 

CTTGACGTAA AGCCTGATGG TATCTACGTT GATGCGACTT TGGGCGGAGC AGGACATAGC 37 20 

GAGTATTTAT TAAGTAAATT AAGTGAAAAA GGCCATCTCT ATGCCTTTGA CCAGGATCAG 37 80 

AATGCCATTG ACAATGCGCA AAAACGCTTG GCACCTTACA TTGAGAACGC AATCCTCACC 3 840 

TTTATCAAGG ACAACTTCCG TCATTTACAG GCATGTTTGC GCGAAGCTGG TGTTCAGGAA 3900 

ATTGATGGAA TTTGTTATGA CTTGGGAGTG TCTAGTCCTC AATTAGACCA GCGTGAGCGT 3960 

GGTTTTTCTT ATAAAAAGGA TGCGCCACTG GACATGCGGA TGAATCAGGA TGCTAGCCTG 4 020 

ACAGCCTATG AAGTGGTGAA CAATTATGAC TATCATGACT TGGTTCGTAT TTTCTTCAAC 4 080 

TATGGAGAGG ACAAATTCTC TAAACAGATT GCGCGTAAGA TTGAGCAAGC GCGTCAAGTG 4140 

AAGCCGATTG AGACAACGAC TGAGTTAGCA GAGATTATCA AGTTGGTCAA ACCTGCCAAG 4200 

GAACTCAAGA AGAAGGGGCA TCCTGCTAAG CAGATTTTCC AGGCTATTCG AATTGAAGTC 4 2 60 

AATGATCAAC TGGGAGCGGC AGATGAGTCC ATCCAGCACC CTATGGATAT GTTGGCTCTG 4 320 

GATGGTAGAA TTTCAGTGAT TACCTTTCAT TCCTTAGAAG ACCGCTTGAC CAAGCAATVC 4 3 80 

TTCAAGGAAG CTTCAACAGT TCAAGTTCCA AAAGGCTTGC CTTTCATCCC AGATGATCTC 44 40 

AAGCCCAAGA TGGAATTCGT GTCCCGTAAG CCAATCTTGC CAAGTGCGGA AGACTTAGAA 4 500 

GCCAATAACC GCTCGCACTC AGCCAAGTTG CCCCTCGTCA GAAAAATTCA CAAGTAAGAG 4 560 

GGAAAAAGAT GGCAGAAAAA ATGCAAAAAA CAGGTCAAAT ACTACAGATC CAACTTAAAC 4 620 

GGTTTTCGCG TGTGGAAAAA GCTTTTTACT 7TTCCATTGC TGTAACCACT CT7ATTGTAG 4 630 

CCATTAGTAT TATTTTTATG CAGACCAAGC 7CTTGCAAC? GCAGAATGAT TTGACAAAAA 4 74 0 

7CAATGCGCA GATAGAGGAA AAGAAGACCG AATTGGACGA TGCCAAGCAA GAGGTCAATG 4 8CC 
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AACTATTACG 


TGCAGAACGT 


TTGAAAGAAA 


TTGCCAATTC 


ACACGATTTG 


CAATTAAACA 


4860 


ATGAAAATAT 


TAGAATAGCG 


GAGTAAGATA 


TGAAGTGGAC 


AAAAAGAGTA 


ATCCGTTATG 


4920 


CGACCAAAAA 


TCGGAAATCG 


CCGGCTGAAA 


ACAGACGCAG 


AGTTGCAAAA 


AGTCTGAGTT 


4990 


TATTATCTGT 


CTTTGTTTTT 


GCCATTTTTT 


TAGTCAATTT 


TGCGGTCATT 


ATTGGGACAG 


5040 


GCACTCCCTT 


TGGAACAGAT 


TTAGCGAAGG 


AAGCTAAGAA 


GGTTCATCAA 


ACCACCCGTA 


5100 


CAGTTCCTGC 


CAAACGTGGG 


ACTATTTATG 


ACCGAAATGG 


AGTCCCGATT 


GCTGAGGATG 


5160 


CAACCTCCTA 


TAATGTCTAT 


GCGGTCATTG 


ATGAGAACTA 


TAAGTCAGCA 


ACGGGTAAGA 


5220 


TTCTTTACGT 


AGAAAAAACA 


CAATTTAACA 


ACGTTCCAGA 


GGTCTTTCAT 


AAGTATCTGG 


5280 


ACATGGAAGA 


ATCCTATGTA 


AGAGAGCAAC 


TCTCGCAACC 


TAATCTCAAG 


CAAGTTTCCT 


5340 


TTGGAGCAAA 


GGGAAATGGG 


ATTACCTATG 


CCAATATGAT 


GTCTATCAAA 


AAAGAATTGG 


5400 


AAGCTGCAGA 


GGTCAAGGCG 


ATTGATTTTA 


CAACCAGTCC 


CAATCGTAGT 


TACCCAAACG 


5460 


GACAATTTCC 


TTCTAGTTTT 


ATCGGTCTAG 


CTCAGCTCCA 


TGAAAATGAA 


GATGGAAGCA 


5520 


AGAGCTTGCT 


GGGAACCTCT 


GG AATGGAGA 


GTTCCTTGAA 


CAGTATTCTT 


GCAGGCACAG 


5580 


ACGGCATTAT 


TACCTATCAA 


AAGGATCGTC 


TGGGTAATAT 


TGTACCCCGA 


ACAGAACAAG 


5640 


TTTCCCAACG 


AACGATGGAC 


GGTAAGGATC 


TTTATACAAC 


CATTTCCAGC 


CCCCTCCAGT 


5700 


CCTTTATGGA 


AACCCAGATG 


GATGCTTTTC 


AAGAGAAGGT 


AAAAGGAAAG 


TACATGACAG 


5760 


CGACTTTGGT 


CAGTGCTAAA 


ACAGGGGAAA 


TTCTGGCAAC 


AACGCAACGA 


CCGACCTTTG 


5820 


ATGCAGATAC 


AAAAGAAGGC 


ATTACAGAGG 


ACTTTGTT7G 


GCGTGATATC 


CTTTACCAAA 


58S0 


GTAACTATGA 


GCCAGGTTCC 


ACTATGAAAG 


7GATGATCT7 


CGCTGCTGCT 


A7TGATAATA 


5940 


ATACCTTTCC 


AGGAGGAGAA 


GTC7TTAATA 


GTAGTGAGTT 


AAAAATTGCA 


GATGCCACGA 


6000 


7TCGAGATTG 


GGACGTTAAT 


GAAGGATTGA 


CTGGTGGCAG 


AACGATGACT 


TTTTCTCAAG 


6060 


GTTTTGCACA 


CTCAAGTAAC 


GTTGGGATGA 






GGAGATGCTA 


6120 


CCTGGCTTCA 


TTATCTTAAT 


CGTTTTAAAT 


TTGGAGTTCC 


GACCCGTTTC 


GGTTTGACCG 


6180 


ATGAGTATGC 


TGCTCAGCTT 


CCTGCGGATA 


ATATTGTCAA 


CATTGCGCAA 


AGCTCATTTG 


6240 


GACAAGGGAT 


TTCAGTGACC 


CAGACGCAAA 


TGATTCCTGC 


CTTTACAGCT 


ATTGCTAATG 


6300 


ACGGTGTCAT 


GCTGGAGCCT 


AAATTTATTA 


GTGCCATTTA 


TCATCCAAAT 


GATCAAACTG 


6360 


CTCCGAAATC 


TCAAAAAGAA 


ATTGTGGGAA 


ATCCTGTT7C 


TAAAGATGCA 


GCTAGTCTAA 


6420 


C7CGGACTAA 


CATGGTTTTG 


GTAGGGACGG 


ATCCGGTTTA 


TGGAACCATG 


TATAACCACA 


648C 


GCACAGGCAA 


GCCAACTGTA 


ACTGTTCCTG 


GGCAAAATGT 


AGCCCTCAAG 


TCTGGTACGG 


654C 
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CTCAGATTCC TGACGAGAAA AATCGTCCTT ATCTAGTCGG GTTAACCGAC TATATTTTCT 6600 

CGGCTGTATC CATGAGTCCG GCTGAAAATC CTGATTTTAT CTTGTATGTG ACGGTCCAAC 6660 

AACCTGAACA TTATTCACGT ATTCAGTTGG GAGAATTTGC CAATCCTATC TTGGAGCGGG 6720 

CTTCAGCTAT GAAAGACTCT CTCAATCTTC AAACAACAGC TAAGGCTTTA GAGCAAGTAA 6780 

GTCAACAAAG TCCTTATCCT ATGCCTAGTG TCAAGGATAT TTCACCTGGT GATTTAGCAG 684 0 

AAGAATTGCG TCGCAATCTT GTACAACCCA TCGTTGTGGG AACAGGAACG AAGATTAAAA 6900 

ACAGTTCTGC TGAAGAAGGG AAGAATCTTG CCCCGAACCA GCAAGTCCTT ATCTTATCTG 6960 

ATAAAGCAGA GGAGGTTCCA GATATGTATC GTTGGACAAA GGAGACTGCT GAGACCCTTG 7020 

CTAAGTGGCT CAATATAGAA CTTGAATTTC AAGGT7CCCG CTCTACTGTG CAGAAGCAAG 708C 

ATGTTCGTGC TAACACAGCT AT C AAGG AC A TTAAAAAAAT TACATTAACT TTAGGAGACT 7140 

AATATGTTTA TTTCCATCAG TGCTGGAATT GTGACATTTT TACTAACTTT AGTAGAAATT 7200 

CCGGCCTTTA TCCAATTTTA TAGAAAGGCG CAAATTACAG GCCAGCAGAT GCATGAGGAT 72 60 

GTCAAACAGC ATCAGGCAAA AGCTGGGACT CCTACAATGG GAGGTTTCGT TTTCTTGATT 7 3 20 

ACTTCTGTTT TGGTTGCTTT CTTTTTCGCC CTATTTAGTA CCCAATTCAG CAATAATGTG 7380 

GGAATGATTT TGTTCATCTT GGTCTTGTAT GGCTTGGTCG G ATTTTT AG A TGACTTTCTC 7440 

AAGGTCTTTC GTAAAATCAA TGAGGGGCTT AATCCTAAGC AAAAATTAGC TCTTCAGCTT 7500 

CTAGCTGGAG TTATCTTCTA TCTTTTCTAT GAGCGCGGTG GCGATATCCT GTCTGTCTTT 7560 

GGTTATCCAG TTCATTTGGG ATTTTTCTAT ATTTTCTTCG CTCTTTTCTG GCTAGTCGGT 7620 

TTTTCAAACG CAGTAAACTT GACAGACGGT CTTGACCGTT TACCTACTAT TTCCGTTGTG 7630 

ATTAGTTTGT CTGCCTATGG AGTTATTGCC TATGTGCAAG GTCAGATGGA TATTCTTCTA 7740 

GTGATTCTTG CCATGATTGG TGGTTTGCTC GGT7TCTTCA TCTTTAACCA TAAGCCTGCC 7800 

AAGGTCTTTA TGGGTGATGT GGGAAGTTTG GCCCTAGGTC GGATGCTGGC AGC7ATCTCT 7860 

ATGGCTCTCC ACCAAGAATG GACTCTCTTC ATTATCGGAA TTGTCTATGT TTTTGAAACA 7920 

ACTTC7GTTA TGATGCAAGT CAGTTATTTC AAACTGACAG GTGGTAAACG TATTTTCCGT 798C 

ATGACGCCTG TACATCACCA TTTTGAGCTT GGGGGATTGT CTGGTAAAGG AAATCCTTGG 8040 

AGCCAGTGGA AGGTTGACTT CTTCT7TTGG GGAGTGGGAC TTCTAGCAAG TCTCCTGACC 8100 

CTAGCAATTT TATATTTGAT GTAAGAATGG CACCCTGATG TTTCAGGG 8148 
(2) INFORMATION FOR SEQ ID NO: 12: 

< i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9909 base pairs 

(B) TYPE: nucleic acid 
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(Ci STRANDEDNESS: double 
(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

TACTCCACCC TTAATATCCC TTCCTGTAAA TACTTTACCG CTTTTAAGTT CATAGAATTC 60 

AACTTTTAAA TGCTTGTCT? CAAGCATCTT TTCCATCCAA TTTTTAGGAG TTTGACCAGC 120 

7TTAAATAAA AACCTTGCTG GGCTGAT7AG TATAGATTTA TCTGCGATTT TATAAGCTTC 130 

ATCAATAAAA TAGTGATATA TCCGCTCATC TCTGGCTTCT CCTGTTTCCT GATACGGAGG 2 40 

ATTTCCTATC ACGACATCAA ATTTCATTTC ACTTTCCTCG CTAGATAGGC GCTCAAAACC 300 

TATCATTCTA TTCTTTTTCC AGTCTTTGAT ATGGGTTTTA GATTCTTCTA CTTCTTGGAC 3 60 

TTCTAGCTCA TCCGCAAACA AACTCAATTG TTGAGATTGC TTTTGTTTAG CTGAATAAGG 4 20 

ACTACTTTTT TTCAATCCAT CCATCTGAAA GACATTGTAA GAGATAATAG TCGCAATTTC 4 80 

TTTCTTTTGC TCTAATCTTG GTTGATTTCC ACTCTTAGCT AGATAATAGT CCTCAAAAGT 54 0 

TCCCAAAAGA 7TCTCACGCG CCAAAAGGAG AGAATCTCCT TGATACTCAT AACCATACGA 600 

AGCATGATAA GCATCTTTTA CAAGTTTATA AAATGTGACT TCATCTCAAA CCTCACGACT 660 

AATCCGTTGC ACTTTTCTAT CAACAAAACC AACTCGCTCA GATAATGGAA TTTCCTCACC 720 

AGTTACGGTA TCATATCTCG TTACCATATA AGGTGCTTCA CCACAAGTTA CCTCTAACCA 780 

TCGTAAGTCC ACATACTCCT CAAGACTTAA CGAGCCTAAT TTCGATTCTA CATATCCATT '840 

TTGCTTTGCG ACCAACCACG TTGGTGTAAA CACTTCTGCC CTTATTTTTG TCCGA7CTT? 900 

TTGTTCATAT TTGGATTTTT CAGATCTGGG CTGAATCAAG TTGGCAAAGT TTCCAGTAAC 9 60 

CTTACTTGGA TTGATGCGAT CACTTGGACC AAATCCCTTT CCTAACAATT CATAAGAATG 1020 

CGTAnGCCAA ACAATTGATT TCTTTGTCGT TCGAT C TTT T AAAAGAATTT TTAATAAGTC 1080 

AGCCGATTCT TTAGCCAAAC TTTCTTCACT AATATCTATT GTCATCAGCA ACCTCTCTTA 114 0 

TATTGTAAGC CCTATTATAT CATATTTTAA AGAATGAAAA TTTACTTGAA AAAAGTAATT 1200 

CAATAAATAT CTCTCCGATG ACCAACTTCT AGAGTAGCAA CGACTAATTC ATCATCTACA 12 60 

ATTTGTACGA TAACTCGATA ATTACCAATT CTATAGCGCC ATTGACCAAC GCGATTACCA 13 20 

ACCAAAGCCT TTCCGTGTCG TCTTGGGTCT TCCAAAACAT TGGTTTGTAA ATAGTTTGTA 13 80 

ATTAGCTTCT GCGTATAACG GTCCAATTTT TTCAATTGCT TGATAAAACG TCTTGTTGCA 14 40 

ACTAATTTAT ACAAATTATT CATCCTTCAA GCCTAAATCA TGCATCATTT CTTCCCAAGT 1500 

AATGGGTTCA ACTCCTTTTT CCAACTCTTC TAAATACTCT TGATAGGC7A AATCTGCCAC 1560 



WO 98/18931 



PCT/US97/19588 



226 



ACGACCATCC 


7AT7CATC77 


CTAGGGCTTC 


AAGAG7TTTG 


GTGCGAATAA 


GTTCCGAAAG 


1620 


GGAAACTCCT 


TCAAACTTAG 


CCATTGCTTT 


CATAAATGTT 


TTATCACCTT 


CAGAAACTT7 


1530 


TAATGTAATA 


GTAGTCATCT 


TTTGTGCTCC 


CTTTTTTAAT 


GGTAACACCA 


TTGTATTACT 


1740 


TTTTAGGTGT 


TCAGTCAATA 


TAAAAAGAAC 


ACCTTCTCAG 


CGTTCTTTCT 


ATATCTCTGT 


1300 


CAATCG7GTT 


GCGGTATCTG 


GTGAGGTATC 


ATAAACCTTA 


AAGTCTACTC 


CGACTCCCAG 


1360 


ATCAGCTTGA 


GCCAGCTGAT 


TGACCATGGT 


CATATGAGCC 


AGTTCCTTGA 


TATTGTTTTC 


1920 


C7TACATAAA 


TGCCCAAGGT 


AAATCTTCTT 


AGTACGATTT 


CCTAGCG7CC 


GAATCATAGC 


1930 


TTCAGCACCG 


TCCTCGTTAG 


AAAGGTGACC 


AAGGTCAGAT 


AGGATTCGT7 


GTTTGAGTCG 


2040 


CCAAGCGTAA 


CAACCTCA7C 


GCAAAATC7C 


TACATCA7GG 


T7GGCCTCGA 


TAACATAACC 


2100 


ATCCGCATTT 


TCGACAATGC 


CCGCCATACG 


GTCACTGACA 


TAACCTGTAT 


CTGTCAAGAG 


2160 


GACAAAACTC 


TTATCATCCT 


TCATAAAGCG 


ATAGAACTGC 


GGTGCGACTG 


CATCATGGCT 


2220 


TACACCAAAA 


CTCTCGATCT 


CGATATCTCC 


AAAGCTTTTG 


GTTTTACCCA 


TTTCAAAAAT 


2230 


ATGCTTTTGC 


GAAGAATCCA 


CCTTGCCAAG 


ATATTTAC7A 


TTTTCCATAG 


CTTGCCAGGT 


2340 


CTTTTCATTG 


GCATAAACA7 


CCATACCATA 


CTTGCGACCC 


AAAACGCCTA 


CTCCATGCA7 


2400 


ATGATCTGAA 


TGCTCATGGG 


TAATCAAGAT 


GGCATCCAGG 


TCTTCTGGCT 


7ACGGTTAAT 


2460 


TTCAGCTAGC 


AGACTGGTAA 


TTTTCTTGCC 


AGACAACCCT 


GCATCTACTA 


AAAGCTTCTT 


2520 


TTTTGACGTT 


TCCAGATAAA 


AAGAATTTCC 


ACTGGAACCC 


GACGCTAAAA 


TACTGTATTT 


2580 


AAAGCCTATT 


TCACTCATTC 


TAGTCTTCTA 


CTTCATCCTC 


CCATACTTCT 


TCTTTCACTG 


2640 


CATCCT7A7C 


A7AACCGACT 


ACAATGGTAA 


AGGTTGAACC 


CTTGCCGTAT 


TCACTCTTCG 


2700 


CCCAAATAAA 


GCCCTTA7GT 


TGTT7GATAA 


TTTC7TTAGC 


G AT AG AC AG T 


CCTAGACCTG 


2760 


7ACCACCTTG 


TGCACGAC7T 


CTAGCACGAT 


CCACACGATA 


GAAACGGTCA 


AAGATACCTC 


2920 


GTAAATCCTG 


CTTAGGAATC 


CCCAAACCGT 


GGTCAGAAAT 


GGATAAAATC 


A7CTGGTC7T 


2330 


CACTTGTCTT 


CA7TCTGACA 


GTGATTTTAC 


CCCCATCTGG 


CGAATACTTA 


ATAGCATTAT 


2940 


TTAAAATATT 


GTCGACAACC 


TGCG7CATCT 


TATCTG7A7C 


AATTTCCATC 


CAGATAGAAT 


3000 


TGATGGGATA 


ATCTCTCACC 


AACTCATAT7 


7TTTC7CCTT 


T7CCTGTCCT 


TTCATCT7GT 


3060 


CAAAACGATT 


GAGGATAAAG 


GTAATAAAAG 


CAGTGAAGTT 


AA7CAGTTCC 


ACATCTAGGT 


3120 


GACTGGTAGC 


ATTATCAATA 


CGTGAAAGAT 


GGAGGAGATC 


CGTCACCA7G 


CGCATCATAC 


3130 


GGTTGC7C7C 


A7CAAGAGAA 


ACCTTCATAA 


AG7C7GG7GC 


TACAG7TTCA 


CACAAAGCCC 


3240 


CCTCATCCAA 


GGCTTCAAGA 


TAGGATTTTA 


CGC7AGTCAG 


AGGAGTCCGT 


AACTCATCGC 


330C 


7AACAT7GGA 


AACAAAGAGT 


CTTCGTTCGC 


CTTCT7CC77 


CTCCTGC7CC 


GTCGTA7CAT 


3360 
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GCAAAACAGC 


CACCAAACCT 


1 GAAATAAAGC 


' CAGACTCTCG 


1 ACGTA7CAAG 




i a ? n 


CTCGAAGGTT 


CAAATATTCG 


1 CCATTGATAT 


CTTGGGAATC 


TAGCAACAAT 




J *1 w V 


GGGTAATCAA 


ATCACGCAAT 


TCATAGTTTT 


C7TC7ATCT7 


GAGCAATTCC 


AAAATGPTTG 


J J H U 


TATTCAGAAC 


ATCTTCCTTA 


ACCAACCCCA 


GTTGCTTCTT 


CGCTGTATTG 






TAATCTGACC 


CCGACGGTTA 


GTCGCAAGAA 


CCCCATCTGT 


CATATAAAAP 


nUAn 1 AL i Ai 


JOOU 


TTAGCCTC7T 


ACTCTCTTCT 


7CTAGATT7T 


CCTGAGTGAG 


AC C AA A A C 


T rrnArAArT 


l "J n 


CATTCAAATT 


ATTGGTAATA 


TTGGTGATTT 


CAGACCCACC 






3780 


AATAATCTCC 


TGCAATCAAA 


TCTTTAACCT 


TTTGATTGAC 


TTGGT~*CAAG 


. UAA 1 Ai inl 




CACGTCTAT? 


TTCCAGTAAT 


AAGAGGGTCA 


CAACAAGGAT 


GAAACCTAAC 


AAAATCAGGA 


"i q fin 


TAAAGATAAA 


ATCTCTGGTA 


AAAATGGTTT 


CTTTCACTAA 


ATCAAGCATT 


i i. I \_ft 1 0 




TAATACCCTA 


CACCACGGCG 


CGTCAAGATA 


TACTCTGGTC 


GGCTCGCCC*^ 


A T GT m r* A A T P 




TTCTCACGCA 


GACGTCGTAC 


AGTCACATCA 


ACTGTACGGA 


CATGAGGAAA 


n • Mlj 1 UA i Aft 


4Ua U 


CCCCAGACAG 


TCTCAAGCAA 


GTGTTCGCGC 


GTGATGACTT 


G A GCTGT A f*G 


rpATrrTAA & 

I VjV. 1 AAA 


A ^ A f\ 


TGATACAAAA 


GCTCAAATTC 


ACGATGGGTT 


AAGTCTAGTT 


(M^GGPPATA 


1 I i * I * nWCu 


•* £ JU 


ACGTAGGCGT 


CTGGAACAAT 


TTCTAAATCC 


CCAATTTGGA 


T ACGTTG ACG 






GCTTCCTGAC 


CATCTACTGG 


CATAGGTTGA 


GAACGACGCA 




AAfArcrnrr 




TGCAACTCAC 


GATTGGAGAA 


GGGTTTTCTT 


ACATAGTCAT 


GTGGPPP A AG 




1 1 Q n 


ATAACCTTAT 


CAAATTCACT 


ATCTTTGGCT 


GAAACCATAA 


GAATGGCCAC 


AG***GG' T *TGTr 


jj in 


TTACGAATGG 


TCTTAGCAAC 


TTCTAAACCA 


TCAA7TTC7G 


GAAGCATGAA 


ATmra A m a 

A 1 \» K,t\\jt\J\ i M 




ATAATATCTG 


CTTCCTCTGC 


TTCAAATTGC 


TCTAGCCCTT 


CACGACCATT 


AAAACeACT*** 


i jOU 


ACAACTTCGT 


AACCTTCCTT 


GGTCATATTA 


AAC7TCATAA 


TATCCGAGAT 


TGGTTTCTCA 


*♦ O i u 


\ V-ATuTACAA 


TTAGTATTTT 


TTTCATATC7 


TCACCTTTTT 


CTCTACTA7T 


ATACCAAAAA 


4680 


AATAGTCAGA 


AGACACAATA 


GCTACTCTTG 


GCTACTGTCT 


AAGTTGGC7T 


GTCCATAAAC 


4740 


CTGCCAGATT 


TTTTCTTGGG 


GTTTGGCAAG 


TGGGTAAT7C 


TTGAATTCTT 


CTGGTGAAAG 


4800 


CCAGCGAACT 


TCCCTATCTC 


AAAAATCATG 


GAAGTCACTC 


ACCTGACCTG 


CTACAATCTG 


4860 


TACATGCCAT 


TTTCGATGAC 


TAAAAACATG 


CTGGACTGTA 


TCAAAACAAA 


CATCAAGCCA 


4920 


ATCAACATCT 


AGCTCATAG7 


CCTGCTGGAA 


ACTCTCTTCT 


GGACTGGGAC 


CAAAGTTCAC 


4980 


ACTTTCTTCC 


CCAACC7GAT 


GAAAGAGGTC 


AAACTGCTCT 


TCTTGCGAAA 


AGTTATCAAC 


5040 


TTCTATAAAG 


GGGAAATGCC 


AAAAACC7GC 


CAAGAGCT77 


TCGCTTTCAT 


T77TT7CAAG 


5100 
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TAAAAATTG? CCTTGAGAAT TTTTCACAAC TAACCCTTTA AGATAAATAG GAACCGGCTT 5160 

TTTCTTAGGA GATTTAATTG GA7AACCCTC CATGGTTCCA TTCTGATATG CCGCACTAAA 5220 

GTCCTTGACT GGGCTTTCTT CAGCTCTGGG ATTTACAGGA GACTCAATAT CAGACCCTAA 5280 

GTCCATCAAG GCTTGATTAA AATCACCCGG ACGATCCGGA TTAATCAACA TCTCCATCAT 5 340 

TGCCTGAAAA ATT7TTCGAT TACTTGGAAT CCCAATATCG TGGTTGACTT CAAACAGACG 54 OC 

CGCCAAGACC CGCATGACAT TACCATCTAC AGCTGGCTCA GGCAAGTTAA AAGCAATACT 5460 

CGAAATGGC? CCTGCTGTGT AAGGTCCAAT CCCTTTCAAC CTGGAAATTC CTTCATAGGT 55 20 

ATTTGGAAAT TGGCCACCAA AGTCAGTCAT AATCTGCTGG GCTGCAGCCT GCATATTGCG 5580 

AACTCGAGAA TAATAGCCCA AGCCCTCCCA AGCTTTCAGT AAACTCTCC7 CAGGCGCACT 5640 

TGCCAGACTT TCCACAGTTG GAAACCAGTC CAAAAATCTT TCGTAGTAAG GGATAACTGT 5700 

ATCCACCCTG GTCTGCTGAA GCATGATTTC AG AT AC CC AG ATCTCATAAG CATTTTTACT 57 60 

TCTCCTCCAA GGCAAATCTC TTTT GTT TTC ATCATACCAA GCGAGAAGTT TCTCACGGAA 58 20 

AGAAATGACT TTCTCCTCCG GCCACATGAC GATACCGTAT TCTTTCAAAT CTAACATATC 5880 

TCTAGTATAA CACAGAAGGT TTCACCTGTC TTTGTATCTG ATTTATAATA TTTTCAATAG 5940 

ATAGTATATA ACTTTTCTAT CTACTTATAC TCAATGAAAA TCAAAGAGCA AACTAGGAAG 6000 

CTAGCCGCAG GTTGCTCAAA ACACTGTTTT GAGGTTGTGG ATAGAACTGA CAGAGTCAGT 6060 

ATCATATAcT ACGGCAAGGT GAAGCTGACG TAGTTTGAAG AGATTTTCGA AGAGTATAAA 6120 

TCTTATTGAT GAACTGCTTG CAGTCTGAGA AAAAATGAGC TTGGATATTA TTTCCAAACT 6130 

CACTTAAAGT CAATTTCAAT CCACTAGAAC AAGCCTAGTA CAGTTCCATC CCTTTCAACA 62 4 0 

TCCATGTTGA GAGCTGCTGG ACGTTTTCGA AGACCTGGCA TGGTCATAAC ATCACCAGTT 6 30C 

AAGGCAACGA TGAAGCCTGC ACCTAATTTT GGTACCAATT CACGAATGGT AATTTCAAAC 63 60 

TTTTCTGGTG CTCCAAGCGC ATTTGGATTG TCTGAGAAAC TGTATTGAGT TTTACCCATA 6420 

CAGATTGGCA ATTTGTCCCA ACCCTTTTGA ACGATTTGAG CAATTTGTGT TTGAGCTTTC 6480 

TTCTCAAAGT TCACTT7GCT ACCACGATAG ATTTCAGTGA CAATTTTTTC AATCTTTTCT 6540 

TGGACAGAAA GGTCATTATC ATACAAACGT TTATAGTTAG CTGGATTTTC AGCAATTGTC 6600 

TTAACAACTG TTTCGGCAAG TGCTACTCCA CCTTCTGCTC CATCAGCCCA CACACTAGCC 6660 

AATTCAACTG GTACATCGAT TGAGCCACAG ACTTCTTTTA AGGCTGCAAT TTCAGCTTCT 6720 

GTATCAGATA CAAATTCGTT AATAGCTACA ACTGCTGGAA TACCGAACTT ACCCATATTT 67 80 

TCAACGTGGC GTTTCAAGTT AGCAAAACCT GCACGAACTG CCTCTACATT TTCTTCAGTC 5840 

AGAGCGTCTT TAGCCACACC ACCATTCATC TTAAGGGCAC GAAGGCTTGC GACAATAACA 69C0 



WO 98/18931 



PCT/US97/19588 



229 

AC7GCATCTG GAGATCTTGG CAAGT77GC7 G7CT7CATA7 CAAGGAATTT CTCAGCACCA 6960 
AGCTCCGCAC CAAAACCAGC TTCAGTAACA GTG7AA7CAC CCAAGTGAAG GGCTGTTGTC 7020 
GTCGCCAAAA CAGACTTACA GCCATGAGCG A7AT7GCCAA ATGGACCACC GTGTACAAAG 7080 

GCAGGTGTAC CCTAAA7TCT CTGAACCAAG TTTGGCTTAA TAGCATCCTT CAAAATCAAA 714 0 

GCCAAGGCAC CCTCAACC7G CAAATCACCT ACAGAAACAG GCGTACGGTC ATAGCCATAA 7 2 CO 

CCAATAACGA "7ATTCCCCAA ACGACGTTTC ~AAGTCCTCGA~ TGT^CGTTGC C^GCAAAGA 7 26 0 

ATTGCCATGA TTTCTGAAGC AACTGTAATA TCAAAACCAT CCTCACGTGG AATACCGTTT 7 3 20 

AGAGGACCAC CAACACCAAC AGTCACATCG CGGACCC7AC GGTCGT7CAA G7CCACAACG 73 80 

CCT77CCAGA GGATACGACG 77CATCAAT7 CCCAGC7CAT TCCCTTGGTG CAAGTGGTTG 7 440 

TCAATCAAGG CAGAAAGGGC ATTGTTGCCA GTTGTAATAG CATGCATATC 7CCAG7AAAG 7 500 

TGGAGGTTGA 7GTCTTCCA7 TGGCAGAACT TGTGCATACC CACCACCAGC ACCACCACCC 7560 

TTGATCCCCA TGACTGGACC AAGAGACGGT TCGCGCATAG CAATCATGGT TTTCTTGCCA 7 620' 

ATCTTGTTCA AGGCATCCGC AAGACCAATG CTAAGCGTCG AC7T7CCT7C ACCTGCAGGT 7 680 

G7TCGC77GA 7GGCAC7AAC CAAGATCAAT TTACCGACTG GATTGCTCTC AACTGCACGA 7740 

ATTTTATCAA AGCTCACT7T AGCCTTGTAC TTTCCGTACA ACTCCAAATC GTCATAAGAA 7 800 

ATACCAAGTT TCTCTACAAC ATCAACAATT GGCTTCAACT CAATACTCTG TGCGATTTCA 7 860 

A7ATCTG7T7 7CATTCAAAA TTCCTCTAAC CTCTTA7ATG A7AATTCAT7 ATA7CACAAA 7920 

ACAAGA777T 7AACA7CCTA AAACTCTC7A AACGTTCG7A AATA7C7CTG 777T7AAGAC 7 980 

TT77AGAG7C C77TCTTAAA 77TTATA7GG C777A7AC77 TGAAAC7ATA ATAAATC77C 80<10 

G7777TACCA AAAATTTA7C AC77TCATT7 TAC77ACCGC 7TA7777TG7 G7ACAATAGT 8100 

GC7A7GAAAA T777AC7TAC ATCGGGCGG7 ACCAG7GAAG CTATCGA7AG CGTCCGCTC7 3160 

ATCAC7AACC A77CTACAGG TCAC7TGGGG AAAA7TA7CA CAGAGACTT7 GC7T7CTGCA 32 20 

GGG7ATGAAG TTTGTTTAAT TACGACAAAA CGAGCTCTGA AGCCAGAGCC 7CATCC7AAC 82 80 

CTAAGTATTC GAGAAAT7AC CAA7ACCAAG GACC7TCTAA TAGAAATGCA AGAACGTGTT 834 0 

CAGGATTATC AGGTCTTGA7 CCACTCAATG GC7GT77C7G ACTACACTCC 7G7T7A7A7G 9400 

ACAGGGCT7G AGGAAGTTCA GGC7AGCTCC AATC7AAAAG AA777TTAAG CAAGCAAAA7 3460 

CA7CAGGCCA AGATTTC77C AACTGATGAG GT7CAGG777 7G7TCC7TAA AAAGACACCC 9520 

AAAATCA7AT CCCTAG7CAA GGAA7GGAA7 CCTAC7AT7C A7C7GATTGG TTTCAAAC7G 3 580 

C7GG77GATG T7ACCGAAGA 7CATCTGG77 GACAT7GCAC GAAAAAGTC7 7A7CAAGAA7 8640 
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CAAGCAGATT TAATCATCGC GAATGACCTG ACTCAAATTT CACC&GATCA GCACCGAGCT 3700 

ATATTTGTTG AGAAAAATCA GCTTCAAACA GTCCAGACTA AAGAACAAAT TGCAGAACTC 87 60 

CTCCTTGAAA AAATTCAAGC CTATCATTCT TAGAAAGGAA AACTATGGCA AACATTCTCT 8820 

TGGCTGTAAC GGGTTCAATC GCCTCTTATA AGTCGGCAGA TTTAGTCAGT TCTCTAAAAA 8 880 

AACAAGGCCA TCAAGTCACT GTCTTAATGA CTCAGGCTGC TACACAGTTT ATCCAACCTT 8 94 0 

TGACACTACA GGTACTCTCA CAGAATCCTG TCCACTTGGA TGTCATGAAG GAACCCTATC 9000 

CTGATCAGGT CAATCATATC GAACTTGGAA AAAAAGCAGA TTTATTTATC GTGGTACCTG 9060 

CAACTGCTAA CACTATTGCA AAACTAGCTC ACGGATTTCC GGACAACATG GTAACCAGTA 9120 

CAGCTCTAGC CCTACCAAGT CATATTCCCA AACTAATAGC TCCTGCTATG AATACAAAAA 9180 

TGTATGACCA TCCAGTAACT CAGAATAATC TGAAAACATT AGAAACTACG GCTATCAGCT 92 40 

GATTGCTCCT AAGGAATCCC TACTACCTTG TGGAGACCAC GGACGAGGAG CTTTAGCTGA 9300 

CCTCACAATT ATTTTAGAAA GAATAAAGCA AACTATCGAT GAAAAAACGC TCTAATATTG 93 60 

CACCCATTGC TATCTTTTTT GCTACCATGC TCGTCATACA CTTTCTGAGC TCACTTATCT 942 0 

TTAACCTTTT TCCATTTCCA ATCAAACCGA CCATTGTTCA TATTCCTGTC ATTATTGCCA 9480 

GCATTATTTA TGGTCCACGA GTTGGGGTTA CACTTGGATT TTTGATGGGA TTACTTAGCT 9540 

TGACGGTTAA CACGATTACG ATTCTACCGA CAAGCTACCT CTTCTCTCCC TTCGTACCAA 9600 

ACGGAAACAT CTACTCAGCT ATCATTGCCA TCGTCCCACG TATTTTGATT GGTTTAACTC 9660 

CTTACTTAGT CTATAAACTG ATGAAAAACA ACACTGCTCT GATTTTAGCT GGAGCCCTTG 9720 

CTTCcTTGAC AAATACTATC TTTCTCCTTG GAGGAATCTT CTTCCTATTT GG AAATGTTT 973C 

ATAATGGAAA TATCCAACTT CTTCTGGCAA CCGTTATCTC AACAAATTCA ATTGCTGAAT 9 840 

TGGTCATTTC TGCAATTCTA ACCCTAGCCA TTGTTCCACG ACTACAAACC TTGAAAAAAT 9900 

AAAAACAGG 9909 
(2) INFORMATION FOR SEQ ID NO: 13: 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1126 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(DJ TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TAATTTTCAT ATAATAGTAA AATAGAATGT GTGATTCAA? AATCACCTCA AATAGAAAGG 60 

AAATTCTATG TCAAATCTAT C7GTTAATGC AATTCGTTTT C7AGGTATTG ACGCCATTAA 120 
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TAAACCCAAC TCAGGTCATC CAGGTGTCGT TATGGGAGCG GCTCCGATGC CTTACAGCCT 190 

CTTTACAAAA CAACTTCATA TCAATCCAGC TCAACCAAAC TGGATTAACC GCGACCGCTT 240 

TATTCTTTCA GCAGGTCATG GTTCAATGCT CCTTTATGCT CTTCTTCACC TTTCTGGTTT 300 

TGAAGATGTC AGCATGGATG AGATTAAGAG TTTCCGTCAA TGGGGTTCAA AAACACCAGG 3 60 

TCACCCAGAA TTTGGTCATA CGGCAGGGAT TCATCCTACG ACAGGTCCTC TAGGGCAAGG 4 20 



GATTTCAACT GCTACTGGTT TTGCCCAAGC AGAACGTTTC TTGGCAGCCA AATATAACCG 480 

TGAAGGTTAC AATATCTTTG AC C ACT AT AC TTACGTTATC TGTGGAGACG GAGACTTGAT 540 

GGAAGGTGTC TCAAGCGAGG CAGCTTCATA CGCAGGCTTG CAAAAACTTC A7AAGTTGGT 600 

TGTTCTTTAT GATTCAAATG ATATCAACTT GGATGGTGAG ACAAAGGATT CCTTTACAGA 660 

AAGTGTTCGT GACCGTTACA ATGCCTACGG TTGGCATACT GCCTTGGTTG AAAATGGAAC 7 20 

AGACTTGGAA GCCATCCATG CTGCTATCGA AACAGCAAAA GCTTCACGCA AGCCATCTTT 7 80 

GATTGAAGTG AAGACGGTTA TTGCATACGG TTCTCCAAAC AAACAAGGAA CTAATGCTGT 840 

ACACGGCCCC CCTCTTGGAG CAGATGAAAC TGCATCAACT CGTCAACCCC TCGGTTGGGA 900 

CTACGAACCA TTTGAAATTC CAGAACAAGT ATATGCTGAT TTCAAAGAAC ATGTTGCAGA 960 

CCGTGGCGCA TCAGCTTATC AAGCTTGGAC TAAATTACTT GCAGATTATA AAGAAGCTCA 1020 

TCCAGAACTG GCTGCAGAAG TAGAAGCCAT CATCGACGGA CGTGATCCAG TCGAAGTGAC 1080 

TCCAGCAGAC TTCCCAGCTT TAGAAAATGC TTTTtCTCAA GCAACT U2 6 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2520 base pairs 
(3) TYPE: nucleic acid 
(C) STRAND EDNESS : double 
(O) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCGGCAACAA AAAAGAAAAA ATCAACAGTT AAAAAAAATC TAGTCATCGT GGAGTCGCCT 
GCTAAGCCAA GACGATTGAA AAATATCTAG GCAGAAACTA CAAGGTTTTA GCCAGTGTCG 
GGCATATCCG TGATTTGAAG AAATCCAGTA TGTCCGTCGA TATTGAAAAT AATTATGAAC 
CGCAATATAT TAATATCCGA GGAAAAGGCC CTCTTATCAA TGACTTGAAA AAAGAAGCTA 
AAAAAGCTAA TAAAGTTTTT CTCGCGAGTG ACCCGGACCG TGAACGACAA GCGATTTCTT 
GGCATTTGGC CCATATTCTC AACTTGGATG AAAATGATCC CAACCGTGTG GTCTTCAATG 
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AAATCACCAA GGATGCACTC AAAAATGCTT TTAAACAACC TCGTAAGATC GATATCGACT 4 20 

TGGTCGATGC CCAACAAGCT CGTCGGATCT TGGATCGCTT GGTAGCCTA7 TCGATTTCGC 4 80 

CTATTTTGTG GAAGAAGGTC AAGAAGGGCT TGTCAGCAGG TCGCGTTCAG TCCATTGCCC S40 

TTAAACTCAT CATTGACCGT GAAAATGAAA TCAATGCCTT CCAGCCAGAA GAATACTGGA 600 

CAGTTGATGC TGTCTTTAAA AAGGGAACCA AACAATTTCA TGCTTCCTTC TATGGAGTAG 660 

ATGGTAAAAA CATGAAACTC ACCAGCAATA ACGAAGTCAA GGAAGTCTTG TCTCGTCTGA 720 

CGAGTAAAGA CTTTTCAGTA GATCAGGTGG ATAAGAAAGA GCGCAAGCGC AATGCTCCTT 780 

7ACCCTATAC CACTTCATCT ATGCAGATGG ATGCTGCCAA TAAAATCAAT TTCCGTACTC 840 

GAAAAACCAT GATGGTTGCC CAACAGCTCT ATGAAGGAAT TAATATCGCT TCTGGTGTTC 900 

AAGGTTTGAT TACCTATATG CGTACCGA7T CGACTCGTAT CAGTCCTGTA CCGCAAAATG 960 

AGGCGGCAAG CTTCATTACG GATCGTTTTG GTAGCAAGTA TTCTAAGCAC GGTAGCAAGG 1020 

TCAAAAACGC ATCAGGTGCT CAGGATGCCC ATGAGGCTAT TCCTCCGTCA AGTGTCTTTA LO80 

ATACACCAGA AAGCATCGCT AAGTATCTCG ACAAGGATCA GCTTAAGCTA TATACCCTTA 1140 

TCTGGAATCG TTTTGTGGCT AGCCAGATGA CAGCGGCCCT TTTTGATACC ATCGCTCTTA 12 00 

AATTGTCTCA AAAAGGCGTT CAATTTGCTG CCAATGGTAG TCAGGTTAAG TTTGATGGTT 12 60 

ATCTTGCCAT TTATAATGAT TCTGACAAGA ATAAGATGTT ACCGGACATG GTTGTTGCAG 13 20 

ATGTGGTCAA ACAGGTCAAT AGCAAACCAG AGCAACATTT CACCCAACCC CCTGCCCCTT 13 80 

ATTCTGAAGC AACACTGATT AAAACCTTAG AGGAAAATCC GGTTGGACGT CCATCAACCT 14 40 

ACGCGCCAAC CATTGAAACC ATTCAGAAAC GTTATTATGT TCGCCTGGCA GCCAAACCTT 1500 

TTGAACCGAC AGAGTTGGCA GAAATTGTCA ATAAGCTCAT CGTTGAATAT TTCCCAGATA 1560 

TCGTAAACGT GACCTTCACA CCTGAAATGG AAGGTAAACT GGATGATGTC GAAGTTGGAA 1620 

AAGAGCAGTG GCGACGGGTC ATTGATGCCT TT7ACAAACC ATTCTCTAAA GAAGTTGCCA 1680 

AGGCTGAAGA AGAAATGGAA AAAATCCAGA TTAAGGATCA ACCAGCTGGA TTTGACTGTG 17 40 

AAGTGTGTGG CAGTCCAATG GTCATTAAAC TTGGTCGTTT TGGTAAATTC TACGCTTCTA 1300 

GCAATTTCCC AGATTGCCGT CATACCCAAC CAATCGTGAA AGAGATTGGT GTTGAGTGTC 1360 

CAAGCTGTCA TCAGGGACAA ATTATTGAGC GAAAAACCAA GCGTAATCGC CTATTCTATG 1920 

GTTGCAATCG CTATCCAGAA TGTGAATTTA CCTCTTGGGA CAACCCTGTT GGTCGTGACT 1980 

GTCCAAAATG TGGCAACTTC CTCATGGAGA AAAAAGTCCG TGGTGGTGGC AAGCAGGTTG 204 0 

TTTGTAGCAA AGGCGACTAC GAGGAAGAAA AGATGGCTC7 TTGTCAACTG TAGTGGGTTG 2100 

AAGTCAGCTA AGCTCGAGAA AGGACAAATT TTGTCCTTTC TTTTTTGATA TTCAGAGCGA 216C 
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TAAAAATCCG TTTTTTGAAG TTTTCAAACT TCCGAAAACC AAAGGCATTG CGCTTGATAA 2220 

CTTTGATGAC ATTATTGCTC GCTTCCAATT TGCCCTTAGA ATAGTGTACT TGAAGGGCGT 2280 

TGACGATTTT CTCTTTCTCC TTTAGAAAGG TTTTAAAGAC AGTCTGAAAA AGAGGATGAA 23 40 

CCTGCTTTAC ATTGTCCTCA ATGAGTCCGA AAAATTTCTC CGGTTCCTTA TTCTCAAAGT 2 4 00 

GAAACAGCAA GAGTTGATAG AGCTGATAGT GATGTTTCAA GTCTTGTGAA TAGCTCAAAA 2 4 60 

G CTTG T^SaA~~AAT CTCTTT A~ TTGGTTAAAT GCATACCAAA ~AGTAGCG"CGA TAAAAATGTT 2 520 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10993 base pairs 
(31 TYPE: nucleic acid 
(Ci STRANDEDNESS : double 
J D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TTTTCTCCAT AATAACTTCC ACCTTATTAT TTGGGATACC CTCCTCTTCT TCACCACCAC 60 

GTTCATAGTA GTCATCGCGA TAGAGAAAAG CTACGATATC AGCGTCCTGC TCAATAGACC 120 

CAGATTCACG AATATCAGAC AAGACCGGTC TCTTGTCCTC ACGTTGTTCT ACACCACGAG 130 

AAAGCTGACT CAGAGCGATT ACTGGAACCT TCAATTCCTT GGCTACTATT TTCAACTGAC 24 0 

GACAAATTTC AGAAACTTCT TCTTGACGAT TTTCTCGACC AGTTCCCGTG ATAAGTTGCA 300 

AATAGTCTAT CAAAATCAAA CCAAGATTTC CAGTTTCTTG AGCCAATTTA CCAGAACGAG 3 60 

AACGAATCTC TGTAATCCGA ATACCTGGCG TATCATCGAT AT AG AT AC TG CCCTTAGcTA 4 20 

GATTACCCTG AGCAATAGTA TATTTTTGCC ACTCCTCATC TGTCAATTGC CCTGTACGGA 4 80 

TAGAATGTGA CTCCACTAAG CCTTCTGCAG CTAACATACG ATCTACCAAG CTTTCCGCAC 54 0 

CCATTTCGAG TGAAAAAATA GCAACCGTTT TGTCCAACTT AGTCCCAATG TTCTGAGCGA 500 

TATTCAAGGC AAATGCTGTC TTACCAACTG CTGGACGAGC TGCTAAGATA ATCAACTCCT 66C 

CCTCATCAAC TCCTGTTGTC ATATGATCCA AATCACGATA ACCTGTCGCA ATACCTGTAA 720 

TATCGGTCGT TTGTTCCGAG CGAGCTTCCA GATTTCCAAA GTTGAGATTC AACACATCTC 7 80 

GAATGTTCTT AAACCCGCTT CGATTTGCAT TTTCACTGAC ATCAATCAAC CCTTTTTCTG 340 

CCTGAGCAAT AATTTCATCA CCTGGTTCTG ACCCTTCGTA ACCTTCCTTC ACAGACTCTG 900 

TCAACTTGGC AATTAAACGA CGTAGCATTG CTTTTTCTGC AACAATCTTA CCATAATACT 9 6.0 

CCGCATTAGC AGAAGTTGGC ACAGAATTAA CAATCTCAAC CAAGTAAGAC AAGCCACCAA 1020 
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TATTCTCTAA ATCACCTTGA TTATCAAGGA TAGTACGAAC CGTTGTTGCA TC7ATCCCAT 1080 

CACCACGATC GGATAAATCG ACCATGGCTT CGAAAATCAA ACGATGCCCA TACTTAAAAA U 40 

AGTCCCGAGA CTCAATGTAT TCTCGCACAA AAACAAGTTT ACTCTCATCA ATAAAGATAG 1200 

CCCCTAAAAC CCATTGCTCA GCTAAGATAT CTTGAGGTTG TACTCGTAAC TCTTCTACTT 12 60 

CTGCCATCAG ACTTCCCTTC CTTTTACAAT CTTGTCAAGA AGGTGTAAAC 7TATCCTTCT 1320 

TTCACACGAA GATTGATTAC ACTTGTGATA TCTTGATAGA TTTTCACTGG CACATCAATC 13 80 

AAACCAACCG CTCGAATCGG AGCTTGTACT TGAATATGAC CTTTATCAAT CTTAATTCCA 1440 

AATTGCTTTT GCAATTCTTC TCCAATCTTC TTATTGGTAA TAGAACCAAA GGTACGACCA 1500 

TCTGGACCAA CTTTTTCAAC AAATTCTACA ACAGTTTCTT CTGCTTCAAG TTGTGCTTTA 1560 

ATTCCTTTTC CTTCTGCAAT CATCTCAGCG TCAGCTTTT? CTTCCGATTT TTGTTTACCA 1620 

CGAAGTTCAC CTACACCTTC AGCAGTCGCT TCTTTGGCTA GATTCTTTTT GATAAGAAAG 1680 

TTTTGCGCAT ACCCTGTTGG TACTTCCTTA ATTTCGCCTT TTTTACCTTT TCCTTTAACA 17 40 

TCTGCTAAAA AGATTACTTT CATTCTTCTT TCTCCTTTTC CTTCATTTCA TTTAATACAA 1800 

TTTCTGTCAG TTTTTCACCT GCTTCTGACA AGGTTACATC TTTAATTTGA CCTGCTGCCA 1360 

AATTAAAGTG GCCTCCACCG CCTAACTCTT CCATAATCCG TTGTACATTC ACTTTAC7AC 1920 

GACTTCGAGC TGAGATAGAG ATAAATCCTT GTCTATTCTT CCCAAGAACA AAACTCGCTT 1980 

CAATACCTGA CATGGCTAAC ATGGCATCTG CTCCCTTACT AATAACAACT CTATCA7ACC 2040 

ATTTCATGTC CTTAGCCTCT GCTATTACTA CATCTCAACC TAATTTACCC CCCTGTAAAA 2100 

TAAGTTCATT GACCTCACGA TATTCTTCAA AATCTGTCGC AGCGATTTCC TGGATAGCAA 2160 

TACTATCACT TCCCCGCGTT CTCAGATAGC TAG CAA CATC AAATGTCCGA CTAGTTACTC 2220 

CCGAGCTGAA ATTTTTAGTA TCCAACATCA TACCAGCCAT CAAGACACTT GCTTCCA7AC 2 280 

GACTCAAACG ATTTTTCTTA GAATTCTGGA ACTGAATCAA TTCCGTTACC AACTCACTGC 2 3 40 

CACTACTTGC ACCACTTTCG ATATAAGTAA TAACCGCATT ATCTGGAAAA TCCTGATCCC 2400 

TTCTATGGTG CTCAATAACA ATGGTTTGGG TAAATAAATC ATAAAATTCT TTTGATAATG 24 60 

TTAAGGCTGT CTTTGAATCC TCTACAAGAA TCAACAAAGA ACGATTGGTC ACCATCCCCA 2 520 

TTGCATCCTT AACAGACAAC AACTTCGTAA CTCCTTCTTT TTCTATGAAT GAAACAGCTC 2 580 

GTTCAATATC TGGAGACATT TGTTCTTCAT CATAAAGAGC ATACCTATTT TCAATCACAT 2 640 

TGCTCGCGAA CAAC7GCATA CCTACAGCAG AGCCCAAAGC ATCCATGTCT AAATTTTTGT 27C0 

GACCGACTAC AAAAACCTGA TCTACACTCC GAATCTTATC TGAAATAGCT GT CAT CAT AG 27 60 

CGCGCGTACG ACTCCGTCTA CGCTTCATTG AAGCAGCAGA CCCACCACCA AAATAAACTG 282C 
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CATTTTTCCT TTCGTCCTTT TCCTTAACAA CCACCTCGTC GCCACCACGT AC7TCAGCCA 2880 
AGTTCAAATT GAGCAAAGCA ACTTTCCCTA TC7CATCATG ATTTCCATCG CCATAAGAAA 2940 
ATCCCATACT TAAGGTCAAG GGCAACTGTC TCTGTTTCGA CTCTTCTCTG AAACCATCAA 3 000 

TAACAGAAAA TTTATCATTC ATCAAGCCCT CAAGCACCGT GTAGTCAGTA AATAGATAAA 3060 
ATCCATCCAT ACTTACCCGA CGAGAAAACA TCATGTGTTT TTCTGAAAAC TCTGATATAA 3120 

"TAATTAGCTAC~ AAAACTATTG 7[ffT^CTAA _ fAfCTCACT^" AGAAGTT TCA TCCTCCAAAT 3 180 

CATCATAATT ATCCACAGAG ACAATCCCAA TCACTGGTCT ACTTGTTACC AATTCATCTG 3 2 40 

TTATGCCTTG TTCCCTGGAT ACATCTACAA AA7ACAAAAC ACCGGAAGAA GCATCCATAT 3 300 

GAACAGCATA ACGCTTCTCA CCAAGCTTGG CATAACTAGA CCGATTTCCT ACTGAAGCCT 3 360 

TGATAATCGT TTGAACAGCT TCTAAATCAA AATCACCATC TTCCTTGCTC AAAATCAATT 3 420 

CAGCATAGGG ATTAAACCAC TCAACCTCTC CACAAGATAA ATTCAATTTC ATAACACCTA 3 4 80 

CAGGCATCTG TTCCAATAGA GCTCTCAAAC TTTCTTCCGC TTGGTGGTTT ACATACTGTA 3 540 

TCTCTTCTAC ATCACTCCTT CTATAATGCA CTCTCACTTT CTTAAATAAA AAAACATAGC 3 600 

CTCCTACAAA AAGAAACAAA ATTAAAACCG TCAACAGATT ATTATTAACA AAAATAATGA 3 660 

AAGTGGATAA GACTCCAAAC GCAATCAATC CTACTAGAAT AGGAAAAATT GGACTTACAT 3720 

AAAATTTTTT CATTCAAAAC CTCTTGGCAC CCATTATACC ATAATACCCC TCAAAAAGCG 3780 

ACTTTTTAAA AGTGTAA7CA GTAATTCTAT CAATTATAAG AAAAAGGTAG TTTACAATTC 3840 

AGTAAACCTA CCTTTACACA TATTGAAATT AAGATTCTTT AACCTCTAAC AAACCAATTT 3900 

CGCCATCCTC ACGACGATAA ATCACATTGG TTGTCTGATC TTCAACATCC AC AT AG AT AA 3960 

AGAAATCATG CCCCAATAAA TCCATTTGTA GAATTGCTTC TTCCAAATCC ATTGGTTTTA 4020 

AATCAATTTG TTTTGAACGA ACAACTTTAG ACTGGACAAT ATTTGAATCT TCCACCAAAG 4 080 

CATCTGTAAA TAATTGACCA GTTGCTACCT TATTTTTATT TTTACGCTCG ATTTTTCTTT 414 0 

TATTTTTACG AATCTGACGT TCAATTTTAT CAGTTACAAG GTCAATTGAA CC AT AC AT AT 4200 

CTTGAGATAC ATCTTCTGCG CGGAGAGTAA TAGATCCAAG CGGAATCGTT ACTTCCACTT 4260 

TAGCCCTTTT -TTCACGATAA ACTTTTAAGT TAATTCGGGC ATCCAACTCT TCTTCTGGTT 4 320 

GGAAGTACTT TTCGATCTTT TCGAGTTTAG AAACTACATA ATCACGAATT GCTTCTGTTA 4 380 

CTTCTAGGTT TTCACCACGG ATACTATATT TAATCATATG AGTACCTTCT TTCTAAACAT 4 440 

TTTTGTTTTT ATGATTTTAT TATAACGCTT TCATTCTA77 TTTGCAAATT 7TTTCCTCAT 4 500 

CTTACAAGGG AAAATGTTTT TACATCCTTA GCACCAGCTT CTTCCAACAG TTTCTTAACA 4 560 
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CGATTTA7AG 


TTGCTCCTGT 


AGTATAGATA 


TCATCTATAA 


GTAGGATTTT 


TTTAGGAATA 


4620 


GTGACTCCAC 


TTTTAATAAA 


GAAAGGAAGT 


TCTGTCCCCA 


AGCGCTCTGA 


ACGATTTTTA 


468C 


GAAGAACTGG 


C7CTC7CTTC 


TCTTTTCTCT 


AATAAATCCA 


GATACTCAAA GCCTGCTGCC 


4740 


TCTACCAAGC 


CCTCAACCTG 


ATTAAATCCT 


CTATTAGCAT 


ATCTATCAGG 


ACT7AGGGGA 


4800 


ATTACAACAA 


ATTGATACTC 


TTTGTACTTT 


TTCAACTCCT 


CAC77AAAAA 


TGAAGCGAAA 


4360 


ACTTTTCTTA 


ACAGGAAGTC 


TCCATCAAAC 


TTATACCGAC 


TGAAAAAATC 


CTTCATAGCT 


4920 


TGATTGTAAG 


TAAAAATCGC 


7C7ATGACTG 


ACTTCAACTC 


CCTCTTTACA 


CCAAAGTTGA 


4980 


CAATCT7GAC 


ACTTTGTTGA 


CAACTCTGTT 


TTCATACAAT 


77GGACAG77 


C7CTTCCCCA 


5040 


ATTCTTTCAA 


AAGTAGAATC 


ACAGTCTGAA 


CAAAGACAAG 


AGTCATCATT 


CCTCAGAAG7 


5100 


AAGAGACTAC 


TAAAAGTTAA 


AACAGTCTTC 


ATAGTCTGCC 


CACATAACAA 


GCACTTCA7A 


5160 


GACCACCC7C 


CTTATTCATC 


ATCTGAATTT 


CCTTAATCGC 


CTTCTTGATT 


GAAGCATTTA 


5220 


ACCCATCATG 


GAAGAAAAGC 


AAATCTCCTG 


TCGGTCTATC 


CATGCTTCGT 


CCAACTCGTC 


5230 


CACCAA7C7G 


AA7CAAACTA 


GACTTCCTAA 


ACAAACGATG 


ATTGCCCTCT 


AC7ACGAAAA 


5340 


CATCCACACA 


AGGGAAGGTA 


ACTCCGCGCT 


CCAAGATTGT 


CGTACTGATA 


AGTATTGTCA 


5400 


GTTCTCCATC 


TCGAAAAGCT 


TGTACTTGCT 


CTAATCGATC 


CTCTGTTACA 


CAAGATACAA 


5460 


ACCCAA7T77 


CTCATTTGGA 


AATTCCTCCT 


GTAAGATTTC 


TGCTAACTGC 


TCCCCT7TC7 


5520 


TAATTTCTGA 


AGCAAAAATG 


AGTAACGGAT 


AAGCTGTCTT 


TCTCTGCTTC 


TCAATATAGG 


5580 


ACTTTAACTT 


TGGTGACAAA 


CGATTCTTGT 


CTAAGTAGCG 


ATTAAAATCC 


GATAACCAAA 


5640 


TTGGTTTTGG 


AATAATCAAC 


GGATTTCCAT 


GAAACCCTCT 


CCGTAAA77C 


AG7C7TTTTA 


5700 


GTTCTCCTAA 


ACGGACCTTT 


TTATCTAACT 


CATTGG7CGA 


AGTCGCTCTT 


AAAAAGATTC 


5760 


TCAATCCATT 


C7CCTT7ACA 


CTATTCTTCA 


CAGCGTGG7A 


AACCA7GGCA 


77A7CAACAT 


5320 


AAGGAAAAGC 


ATCTAC77CA 


TCCACTATCA 


CCAAA7CAAA 


AGCTTGATAA 


AAC7TCAA7A 


5880 


ACTGATGGGT 


7C7TCCAACA 


ACTAGTGGTG 


TTCGAAAATA 


AGGTTCCCA7 


7C7CCATG7A 


5940 


GCAAAGCTAT 


CCCGCAAGAA 


AAATCCTGTT 


GCAGGCGCTT 


G7ACAGCTCC 


AAACAAACAT 


600C 


C7ATGCGAGG 


ACTAGCCAAA 


CACACTGCAC 


CACCCCCA7T 


GA7CACTTTA 


CCCACTACTT 


6060 


GATAAATCAT 


TTCTGTCTTT 


CCAGCTCCTG 


TTACCGCATG 


AAC7AAGG77 


GGC7TTTGCT 


6120 


TGTCTACTAC 


TTGAAGCAAT 


CCCTCTGACA 


CC77C7CTTG 


AAAAGGAGT7 


AA7TGGCCGC 


6180 


GCCAT7TGAC 


AACATCTTGC 


TTTGGAAAAT 


CCTCCTGCGG 


AAAA7AGTAT 


AAAGTT7GAT 


6240 


CACTTCTGAC 


TCGCTTCA7C 


AGCAAGCACT 


CTCGACAATA 


GTAAGCACCG 


ATGGGCAAAT 


6300 


ACCATTCTTC 


' TAGAATAGTA 


, CTATTACACC 


CTTGACAGAA 


, AAG77TCCCC 


: 7TCTCCTT7C 


636C 
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TCATTGCTCG AAGTTTCTCC 
ATAAACGACC GAGATAATCT 
TTTAGATGAT TTTTTAGTAC 
AAGTCCAAGA AGAAATCAAA 
AAGAAGAGGC TCGTGACTTC 
ACTGCTCTGC CTTCATTATT 
AGCCTAGTGG TACTGCTGGT 
ATGTCTGTGT GGTCGTGACA 
TTCGTGCTTA CGCCGGCAGT 
AAGAACAGGC TGGCATTGCT 
TCCTTAAAGA ACATGGTCTC 
TGATTTATGT TGATAAAGAA 
ATGGAAAAGT CACTTTAACT 
TGTAAACAAT GAATAATACA 
AT AAAAAG AG CCCTACCAAA 
CGTTTTCCTT CACACCTATT 
CTTTGATCTA TGATATATAG 
AAGAGGTATT CATATGTCTA 
TGTTAAACTT AACAACATCG 
ATTTAATCCT GGTTCATCTG 
ACAAGATGGT ATTCTGAAAC 
TA7TGGACTT TCATGGGTAG 
AACTATGAGT GTAGAACGAC 
TCCTGGTAGC GAGGGAATGA 
TGATGGTTTC CTTCCTCTTC 
AACAGGAGCT GAG AT ACT AG 
AGTAGGTACT GGTGGAACGA 
CATTCAAGTT TTTGCAGTAG 
TCCTCACAAA ATTCAAGGTA 
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GCCAACTGAC GTTCTTCTTC TGTTAATTCA TTCTCAGTAA 64 2 0 

AAATTTACTT TCATACTTCT TTATTCGTAA AAACTAGCAC 64 80 

AATTAAATCA TGGAATTTAG GACAATTAAA GAGGACGGTC 6540 

AAATCTCGCT TTATCTGCCA TGCCAAGCGT GTTTATAGCG 6600 

ATTACTGCCA TCAAAAAAGA AC ACT AC AAA GCGACACATA 6 660 

GGAGAACCTA GTGAAATTAA ACGTACAAGT CATGATGGTG 6720 

GTTCCCATGC TTGGGGTACT AGAAAATCAC AATCTCACCA 67 30 

CGCTACTTTG GTGGTATTAA ACTAGGCGCT GGAGGACTAA 6840 

GTCGCCTTAG CTG7CAAAGA AATTGGTATT ATTGAAATAA 6900 

ATTCAAATGT CTTATGCTCA GTACCAAGAG TACAGTAACT 69 60 

ATGGAGCTGG ATACAAACTT TACAGATCAA GTCGATACGA 7020 

GAAAAAGAAA CTATTAAAGC TGCACTTGTG GAGTTTTTTA 7 080 

GACCAAGGTT TACGAGAGGT TGAAGTTCCT GTAAACTTAG 7140 

GCGTTTCGTT GACATTCTCA CAACTACTTT AGCGAGCAAA 7 200 

ATATACTAGA AAATGAAGCA ATTCAAACGA AACCTGATAT 7260 

TACTAGAATT AGCTGAACGC AATCACTTGA AAATTAATGA 7320 

AAATGGTATG GATAGCGTTA TACTAAACAT ATCTTATACA 7 380 

TTTATAACAA CATTACTGAA TTAATCGCTC AAACACCGAT 7440 

TGCCAGAAGG TCCTGCAGAC GTCTATATAA AGCTTGAAGC 7500 

TAAAAGACCG TATTCCCCTT AGCATGATTG AAAAAGCTCA 7 560 

CTGGTTCTAC TATTGTTGAA GCAACAAGTG GAAACACCGG 7 620 

GTGCTGCTAA AGGGTATAAA GTCGTCATCG TTATCCCTCA 7 680 

GTAAAATTAT CCAAGCTTAT GGTGCTGAAC TCGTCCTAAC 774 0 

AAGGTGCTAT TGCTAAGGCT CAAGAAATCG CTGCTCAACC 7 800 

AATTTGACAA TCCAGCTAAT CCAGAAGTAC ACGAAAGAAC 7 360 

CTGCTTTCGG TAAAGATGGA TTAGATGCCT TTGTTCCTGG 7 920 

TTTCTGGTGT TTCTCATGCA CTCAAATCAG AAAATTCTAA 7 980 

AAGCAGATGA ATCTGCTATT CTATCTGGTG AAAAACCTGG 3040 

TCTCAGCTGG ATTTATTCCT GATACACTTG ATACTAAAGC 8100 
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CTATGATGGT ATCGTTCGTG TAACATCAGA TGACGCTCTT CCACTCGCAC GTGAAATTGG 3160 

TGGAAAAGAA GGCTTCCTTG TAGGGATTTC CTCAGCTCCA GCTATCTACG GAGCCATCGA 3220 

GGTTGCCAAA AAATTAGGTA CAGGTAAAAA AGTCCTTGCC CTAGCACCAG ATAACGGTGA 82 80 

ACGTTATCTC TCTACAGCAC TTTATGAATT GTAACCGTCC AATAACGAAC TCTATTGAAA 8340 

AATCTCCAGA CTAGAGAACT CACGGATAGT TCCTAATCTG GACRTTTCTT ATTTGCACTT 8400 

TTCT7GTACA ACTTTAGTCC ATGGTAAATA GGCCTCTAAA ACCTCTTTGT TTACGAGAGT 84 60 

TTCCACGTTT GGAAGACATT CTAGAAGATA GGATAGATAT TTCTCACTAT T7ATAATGGA 8520 

TTGAAATAAG ATATGAACAA ATCGATTACA ACATGATGGT AAAGCGTAAT CCCTTGTTTC 85 80 

TCAGCTTTCC CAGACAAAAA AGTCCAATAG TAAGTCAGCT GACTATCACT CTCTAGCACC 3640 

CTATAAGAAG TTTCATCCCC ATGAAGTAAG GGCTGAGTCA ATAGTCTCTC TCGCAAGAGG 8700 

TTATAAAGGG GCTCCAAATA GTATTGACTC GTCTTGATAT GCCAATTAGA GATTTCCTTA 3760 

CGTGTGATTG GTAAACCCAT CCTAGCCCAA TCTTCTTCTT GGCGATAATT GGGTACCTTC 8820 

AGATTAAACT TCTCATCGAT GGTGTGAGCC ATAATAGAAG CTGAGCCAAA GTTA7GCGCT 8880 

AAAGGGCCTT TAGGAATAGG AGCTTTCACA AGCTTATCCA GATGATTATC TTTTACTCCT 8940 

TATGGACAAT GCTATATGGC ATAAATCAAG TACCTTAAAG ATTCCGACTA ATATTGCCTT 9000 

TGCATTTATT CCTCCATACA CACCAGAGAT GAACCCCATT GAACAAGTGT GGAAAGAGAT 9060 

TCGTAAACGT GGATTTAAGA ATAAAGCCTT TCGAACTTTG GAAGATGTCA TACAAGGACT 9120 

GGAGAAGGAG GTGATAAAGT CCATCGTTAA TCGGAGACGG ACTACAATCC TTTTTCAAAA 9130 

CAGATGAGTA TAAAAAGAAA GTCCTCATTT CAATAGAAAT CACGACTTTC TGATGAATTT 9240 

ATAGT AAAAT GAAATAAGAA CAGGATAGTC AAA? CCA ITT CTAACAATGT 7TTAGAAGCA 9300 

GAGGTGTACT ATTCTAGTTT AAATCCACTA TATTTGGGGA GTGATAGAAA AGCCCTTCAT 93 60 

CAGCCAATCT ACTTGTTCAG GTGCGAGAGC TTTCACATCC TTTTCTGTAC TGGACCAAGT 94 20 

CAGTTTTCCG TTCTCAAAGC GTTTATATAA TATCC AAAAT CCTTGACCAT CCCAGTAAAG 94 80 

AACTTTAAAG CGGTCTTTAC GTCCACCACA AAAGAGAAAG ACTTGATCGG AGAAAGGATC 9540 

CAATTCAAAG TGGGTTTTAA CTACATAGGC TAATGAGTCT ATTCCCTGCC TCATATCTGT 9600 

CTTCCCACAA ACAAGGTGAA CTTGACCTAA ATCACTTAGT TGAATTATCA TAGTACAATA 9660 

CCTTTCCTCC GATAATTATT TTTTATCTGG TATACTGGAA GTTGGGGAAT TAGGATAGAT 972 0 

ACCTTGTTAT GACGCGCTTA CTATGAATT7 GAAGTATAGT CTCCTAAATG CACTTAGCCC 97 30 

TTATTATAGG GCTTTTTGTT TTAATTATTC TAATCGAGTG AGACTGGGGA AAAAACAATT 9840 

TCAGGAAAAA TCTAAGCCCT ATACAAAAAA GGAAGCAATT TGCTTCCTTT CT ATT ATT AG 9900 
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TTATTCAACG CTGCTGCCAT TGTAGCTGCA ACTTCAGCTT CCAACTCGTT TGCAGCTTTC 9960 

TCGATACCTT CACCAACTTC AAAGCGAGCA AACTCAACTA CCGAAGCGTT AACTCATTCA 10020 

AGGTATGCTT CAACTGTCTT GCTGTCATCC ATGATGTAAA CTTGTGCAAG AAGTGTGTAA 10080 

GCTTGGTCAA CTTTAGTGTT ATCAAGCATG AAGCGATCCA TTTTACCTGG AATAATTTTG 10140 

TCCCAGATTT TTTCTGGTTT GCCTTCTGCA GCCAATTCAG CTTTGATGTC AGCTTCAGCT 10200 

TGAGCAATAA CATCATCAGT TAATTGAGCT TTTGATCCAT ACTTCAAGTG TGGAAGAGCT 10260 

GGTTTATTAA CCATTGCACG GC7TTCCTTG TCTTGGTCGA TAACGTGATT CAATTGTCCC 10320 

AACTCATCTT TAACGAATTG CTCATCCAAT TCTTTGTAAG AAAGAACTGT TGGTTTCATC 103 3C 

GCTGCGATGT GCATTGACAA TTGTTTAGCA AGTGCTTCGT CTCCACCTTC AACAACTGAA 10440 

ATAACACCGA TACGTCCACC CTTATGTTGG TATGCTCCAA AGTGTTGTGC GTCTCTTTT? 10500 

TCAATCAATC CAAAGCGACG CAATGAGATT TTCTCTCCGA TAGTTGCTGT TGCAGATACG 10550 

TATGCAGCTT CAAGAGTTTC ACCTGAAGGC ATTATCAAAG CAAGAGCTTC TTCGTTGTTA 10620 

GCAGGTTTTC CTTCAGCAAT GACTTTAGCT CTAGTATTTA CCAATTCAAC CAATTGAGCC 10680 

TTTTTTGCAA CGAAGTCAGT TTCAGCGTTT ACTTCAATAA CTGCTGCAAC ATTACCGTTA 107 40 

ACATAAACAC CAGTCAAACC TTCTGCAGCA ACACGGTCAG CTTTCTTAGC TGCCTTAGCC 10800 

ATACCTTTTT CACGAAGCAA TTCAATCGCT TTTTCGATGT CACCGTCTGT TTCTACAAGC 10860 

GCTTTTTTAG CGTCCATAAC ACCGGCACCA GA T TT TT CAC GCAACTCTTT TACAAGTTTA 10920 

GCTGTAATTT CTGCCATTTT AATTCTCCTA TATTTTTTGA AAATAGGAGA GCGCGGCTAA 10980 

GCCCCGCCTC CGG 1099 3 
(2) INFORMATION FOR SEQ ID NO: 16: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CGACGGGGAG GTTTGGCACC TCGATGTCGG CTCGTCGCAT CCTGCGGCTG TAGTCGGTCC 60 

CAAGGGTTGG GCTGTTCGCC CATTAAAGCG GCACGCGAGC TGGGTTCAGA ACGTCGTGAG 120 

ACAGTTCGGT CCCTATCCGT CGCGGGCGTA GGAAATTTGA GAGGATCTGC TCCTAGTACG 180 

AGAGGACCAG AGTGGACTTA CCGCTGGTGT ACCAGTTGTC TTCCCAAAGG CATCGCTGGG 2 40 
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TAGCTATGTA GGGAAGGGAT AAACGCTGAA AGCATCTAAG TGTGAAACCC ACCTCAAGAT 3 00 

GAGATTTCCC ATGATTATAT ATCAGTAAGA GCCCTGAGAG ATGATCAGGT AGATAGGTTA 3 60 

GAAGTGGAAG TGTGGCGACA CATGTAGCGG ACTAATACTA ATAGCTCGAG GACTTATCCA 420 
AAGTAACTGA GAATATGAAA GCGAACGGTT TTCTTAAATT GAATAGATAT TCAATTTTGA 4 80 

GTAGGTATTA CTCAGAGTTA AGTGACGATA GCCTAGGAGA TACACCTGTA CCCATGCCGA 540 
ACACAGAAGT TAAGCCCTAG AACGCCGGAA GTAGTTGGGG GTTGCCCCCT GTGAGATAGG 600 
GAAGTCGCTT AGCTTTAATC CGCCATAGCT CAGTTGGTAG TAGCGCATGA CTGTTAATCA 660 
TGATGTCGTA GGTTCGAGTC CTACTGGCGG AGTAATtGAT AAAAGGGaAC ACAGCTGTGT 720 
TCCTCTTTTT GTATCAATTT GTATCACCAA GCATTTTCAT AAGGAAGTCT GTTATTTCTT 7 30 

GAGAACTTTC TTTTTTTCCA TGTGCAATCC AAGTTTGGCA GACACCAAAA AGTGCATGAG 840 
TT AG AT AG AT GCTACTATAT TCTAATTCAG TGGTATTTAG ATTCAGTTGC ATAAATCGCT 900 
TTTGTAAATC TGTACTAAGC ATGATATGAA GTTTATTTCG TAAGAAATTT TGGATTTCTT 960 

TAGTCCCATT TTCAGAAAGA AGGGCAGCCA GAAGTGGTTC TCACTCTAGA TATTCAAAAA 1 020 

CTTCTAAAAT AGCGTCTCTT TTGTGATGAG CATGTTTTTG AAAAATATAT TCAAATGTAT 1080 

CGAATAGCTT GCTTTGATAG TGCTCAATCA TATCATACTT ATCCTTATAC TG ACT AT AG A 1140 

AGCTGGAACG ACTAATTCCG GCTTTTTCTA CTAATTTGAC AGTAGAAATT TTATCAAATG 1200 

GCTGTTCCAT CAGTAATTGT ACCATAGCAT TTTCAATAGT TCGCTTTGTT TTTAAGCGTT 1260 

TGTTACTTTC TTGCATATTT CCTCCTTCTA AACAAATTAG ACTATATGTC TAAAAATAGA 1320 

TTTTTTATCT TGTAATTTAG ATTTTTTAAT GTATAATCTA TTATATCAAA ATTTTACACA 1380 

ATATGTTTAA AAAAGC.VCAA ACTAAGTTTA AACAA7~~\> A~CAA'?'7TAA AAAAAACCAA 1440 

CCTTTATTAT TG7CATGATC GGGATTTCTC TTATTCCAGA TCTGTACAAT ATCATATTTT 1500 

TGTCATCAAT GTGGGATCCA TATGGGCAAT TCTCTCACTT ACCTGTCGCA GTTGTAAATA 1560 

ATGATAAAGA GGCTTCCTAT AATGGTAATA CTATGGCAAT AGGAAAAGAC ATGGTGTCCA 1620 

ATTTAAAAGA AAATAAAACC TTGGATTTTC ATTTTGTAGA TGAAGAGGAA GGAAAGAAGG 1680 

GATTGGAAGA TGGCGATTAC TATATGGTAG TGACTTTACC AAGTGATTTA TCTGAAAAAA 17 40 

CAACTACATT ATCCAATATT CAATCGACAG CAGCTTATCA ATCATTGACA AGTCAGCAAC 1800 

AAACTGAGAT AAGTGATTCT GTATCTCAAA ATTCAACTGA TAGTATTCAA TCGGCTCAGT 1860 

CAATTGTAGC TTTAGTACAA GATTTACAGG GAAGTTTAGA AAACTTACAA AATCAATCTT 1920 

CTAATCTTTC GACTTTAAAA AATCAATCTA ATCAAGTATC ACCTATTACT TCTACTTCTT 1930 

TGATAGGATT GTCAAGTGGA TTAACAGAGA TACAAGGAGA TGTTACTAGC AAATTAGTTC 2 040 
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C7CCCAGTCA GTCGATTGCA TCAGGTGTAA ACGCATATAC TACAGGTGTT CATAAACTT7 2100 

CTCAGGGCGC AACTCAACTA AGTGAAAAAA ATGCCACCTT CACACGTACT TTGGATAAAC 2150 

TAGTTTCAGC CTCAAACACC TTGACACAAA AATCTTCTAG ATTGACAGCA GGAGTTGGTT 2220 

AATTACAATC AGGATCTGGG CAATTAGCAG ACAAATCCAG TCAGTTACTT TCAGGTGCTT 2280 

CTCCATTACA GAATAGAGCT AATAAATTGG CAGATGGATC TGGGAAACTA GCAGAAGGTG 2 3 40 

GAACAAAGTT AACTTCTCGA TTGGAAGATT TACAGACAGG ACTTGCTTCT TTAGGACAAG 24 00 

GACTACGTAA TGCTAGTGAT CAACTCAAAT CAGTATCAAC AGAATCTAAA AATGCAGAGA 2 4 60 

TTTTGTCAAA TCCACTCAAT CTTTCAAAAA CAGACAATGA TCAAGTTCCT GTAAATGGAA 2520 

TCGCAATAGC TCCTTATATG ATATCAGTTG CTCTTTTTTT GCAGCAATAT CAACAAATAT 2580 

GATATTTGCG AAATTGCCTT CAGGACGTCA TCCAGAGAGC CCTTGGGCTT GGTTGAAATC 2640 

TTGAGCTGAA ATAAATGGTA TTATAGCTGT TTTGGCAGGA ATTTTGGTAT ATGGAGGAGT 2700 

TCAGCTTATT GGTTTAACTG CTAATCATGA GATGAGAATA TTTATTCTCA TCATCCTAAC 27 60 

AAGTTTAGTA TTCATGTCTA TGGTGACCAC TTTAGCAACG TGGAATAGCC CTATAGGAGC 2920 

TTTTTTCTCA CTTATTTTGC TTTTACTACA GTTAGCATCA AGTGCAGGTA CTTATCCACT 2880 

TGCTTTGACA AATGATTTCT TTAGATCTAT TAATCCCTGG TTACCAATGA GCTATTCAGT 2940 

TTCGGGATTA CGACAAACAA TCTCTATCAA CAAGTCATTT TCCTAGCTGT CATACTAGTT 3000 

CTATTTACTA GTTTAGGTAT GCTAGCCTAT CAACATAAGA AAATGGAAGA AGATTAAAAA 3060 

AATCGACCGA TTAACTGGTC GATTTTTTAT CCCTTAGATG ACTTTCGTCT CTCATTATAG 3120 

ATTCCAAATA GTAAGAGAGA AGTAAAGGAA CAGATTGCTC CAGTAATAAA ACCATTGGCA 3190 

TCAGCTTGTT TAATTTCTAT TTTCTTACCA TCTTCCTAGG CAGACCAACC TTTGTCATAA 3 300 

GGAATGGTGA AGAAAATACA TCTATCTTGT TCGACATCAT ATGTACCAAA AACCTTGTTT 3 3 60 

TTAGAAGTTC ATACTGTGAC AGGTTCTTCT TTAATTTTTT GAATTGCCTC GGTGAAAGTT 34 20 

7TGGTATCTA AACGATAGAA GGTAGGAGAT TCAAATGATA CTTGTGAATT TCCAGGGAAA 3 4 80 

CTAACATTGA TATTCAAAGT TTTTTTCTCT TTAGTATATC CTAGATTAAA GAAGGAGAAG 3 540 

ACATTATCAG TTCTAAAACT CTTTTTTTCA CCATTTACAA GGATGTCAAC CTTCTTTTGT 3 600 

TTATCGTTAG AAAAGTGAAG GTTTATGAAA GAG AG AT AAA CTTGGCTGTT TTCTGGAACT 3 6 60 

TCAATTTGAT ACTGGATTGC TGCATCTTCA TTTGAAGAAC TTGTGACACT AATCAAATCA 37 20 

TTAGTATTTT CTATTTTTTC TGTTTTTTCA TAAGGTATTG GAGAAAAATA ATCAAAATTG 3 7 80 
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ACGTTAGCAA GTTGATTTAA AAATGAGGCC TGATTATCCA AGGIAJ~TTC ATTGAACTTG 3 840 

ACATCATTGT AAACAGATTG ACTCGCAACT GCAATCGGAA GAGAGTATTG ATTTTCATAT 3 900 

AGGGTAAGAT TATCTTTTTG AT AG AT AT CT TTAAAGCCAT ACTTATCAAT AGGACTGTCT 3960 

GAGATATTGT ACTGGATACC AAATAAACTA TCAGCCAAAA TACTATTATT TGCATATCGG 4020 

AGATTGAGAT TAGTCCCAGA GGATTTAAAA CCAAGTTTAT CTAAAGTAGA GCTTGATGAA 40 80 

CGATTTCGAA CAGATGAAAA TTGAGACATT CCATTGTAGT TGAATTTCAT ACTGTCATTT 4140 

CCTGTCTGAG TTTGTAGTTT TTCAGTACGA GTAAATTGAT TTCCAATATA TGTTGAGAAA 4 200 

GATTCCATAG CTGGGATATC TCGACTATAA GCACTTCGAG AAGCAAATCC CCATTCCTTA 42 60 

GCAATTCCGT CCATTTGAGA TGAAGCATTT AAACTCATTT CAACCAGTAT AAATAAAGAG 43 2C 

ATTAGAATGG CAAATAGATT C AC AG AT AT A AACTTTTTGA TAACTCCAAG GAGTAAAAGA 4 380 

GAATAGACAA CCAAAAATTC AAGAGTAAGC AGAATATTCA AATCTGTTAA AAAAGAATAA 4440 

TGCGATTTTA GATAGATGGT AGCTAAAAAT CCTCCTACTA CAAGAAAAAG CGAAACTAAA 4 500 

AAATTCCAGA CTTTAAGTTC TTTCAGACGC TTTAAGACTT CTGCTCCTGT GTAAATTAAC 4 560 

AAGGTAGAGA AAATCCAAGC ATAGCGATGT AAAAACATGT TTGCACTATG CATGCCTTCC 4 620 

CAAAATAAGT CAAGAGCTTC TATGTAAAAC CTTGCAATTA GAAATGCAAA GAATATTACA 4 680 

TATATGAGTT TCACGTGAAA CTTAATAGAT TTCAGCGTAA AAAATAAAAT GGTCAAAATA 4740 

AAGGGAAATA GTCCAACAAA AATCATTGGG ATGGCCCCAT ACTTTGTTGT GTCAAAGGAA 4800 

CC AATGAATT GCTTAGCAAA GAGATCAAGA TACCAGCTAC TTTCACTTTC AAACTTTCTA 4860 

ACTTCAGTCA ATTTTTCCCC ATGTGTCTGT AAATCAAATA GAGTGGGAAG AGTCATAATC 4 920 

AAACTAGCCA TACCAGCTAA AAAGGACATA ACTATGAAAT CAAGAACACA TGATTTTCGA 4 9 90 

GTCTTAAAGT CCCACGAAAT TTGACAGAGA TACCAGAAAA TAAGAAACAA TAC7GTCATA 5040 

TATCCAAAAT AATAATTTTG AATAAATAAG ATTGACAGAC TTCTAAACTA CAATAGGAGT 5100 

TTCTTTTCAG TTATCAGTAC ATGTAAACCA CTTATAATTA AAGGAATCAA GATAAAAACA 5160 

TCTACCCAGG TTTTTATCTC TAATTGACTG ACAGTGAAAC TCATCAGAGC ATAGGAAGTA 5220 

GATAAGGCTA GTTTTAAAAT CTGAGGGATA GATTGAAACA ATTTATTCAA ACTAAAAAAG 5280 

GTTCACAGAC CAATCAATCC AAATTTTAAG AGAGTTGTCA GATAGATAGC ATCTGGCATA 5 340 

TTCGTTAGAT CAAAAAAGTA AACCAGAGGC GCGAGAAAAC TACCCAACTA ATAACTAGAT 5400 

AGGGCATAGA AGTTTACCCC TAGACCACTT GTAAAGGTGT AAAACACATT AC7ATTTCCA 54 60 

TGTAGGATAT TTCGTAAGGC TACATCAAAA ATAACGTATT GATGAAAGCC ATCTCCTAAT 5520 

AGAGGAGAGT TGTCCCTATT CCAGTAGATA CTTTGAGATA GATATACTCC AGACATAATC 55 80 
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ACTACAGGAA TGATGAAAGA AATAAAATAG GTTCGATATG TTTTTAAAAA TGATTTCATG 5640 

TTACCTCGTA GAATGATAGA AAACTCAGTT GGTTAACCCA ACTGAGTTTT GAAGTTTTAT 5700 

TTAGTCTTTC CAAAGTTCTT TAACTTTTGC TTGTACTTCT GCATTTTCTA GGAATTCATC 57 60 

GTAGGTTTCA TCGATACGGT CAATGACGCC ATTTTTAGAT AAGACAATGA TATGGTTAGC 5820 

CAAAGTTTGA ATAAATTCGT GGTCATGGCT GGCAAAGATC ATTGATTCTT TAAAGTTTTT 5830 

CAATCCATCA TTCAAGCTTG AGATAGATTC CAAGTCCAAG TGATTTGTTG CATCATCAAG 5 94 0 

TACAAGGACA TTTGATTTTA AGAGCATGAG TTTTGAAAGC ATGACACGAA CTTTTTCTCC 6000 

CCCTGACAAG ACATTTACAG GTTTGTTAAC TTCATCTCCA GAGAAGAGCA TACGGCCGAG 6060 

GAAGCCACGT ACCAAAGTAT TGTCATCTTC TTCTTTACTT GCGAATTGAC GCAACCAGTC 6120 

AAGAATTGAT TCTCCTCCTG CAAAATCAGC TGAGTTATCT TTTGGTAGGT AAGATTGACT 6130 

AGTTGTAACT CCCCACTTGA CAGTTCCTTC ATAGTCAATA TCTCCCATCA TTGCACGAAT 624 0 

TAATGCAGTC CTTTCAATAT CATTTTGTCC AATAAGTGCT GTCTTATCAT CTGGACGCAA 6 3 00 

GATGAAACTA ATATTATCCA AGATAGTTTC ACCATCAATC TTTACAGTTA AATTTTCTAC 5360 

TGTCAAGAGA TCATTACCAA TCTCACGTTC CG CTTTAAAG TTGATAAATG CATATTTACC 6420 

ACTAGATGGC ACAATCTCTT CTAGCTCAAT CTTATCAACC ATTCTCTTAC GTGATGTTGC 6480 

CTGCCTTGAC TTAGAAGCAT TGGCAGAGAA ACGAGCAACA AATTCTTGCA ATTGTTTAAT 6 540 

TTTTTCTTCT GCTTTACCAT TACCGTCTCC TAGCAATTTA GCAGCAAGCT CAGAACATTC 6600 

CTTCCAGAAG TCGTAGTTTC CCACATAGAG TTTGATTTTT CCAAAGTCAA GGTCGGCCAT 6660 

C7CAGTACAA ACTTTGTTTA AGAAGTGACG CTCG7GCGAT ACTACGATAA CTCTCTTATC 6720 

AAAGTCAATC AAGAAJ . . / : / \ACCAA-J7 J JATTCG ■ A : ...V- -.C CC77AGTAGG iO 

CTCGTCCAAG AGAAGAACAT CTGGTTTACC AAAAAGTGCT TTGGCGAGGA CAACCTTTAC 6 840 

TTTTTCACCG TTGGCCAATT CGCTCATGTT TTGGTAGTGT AATT CTTCTG GAATGTTTAG 6 9C0 

GTTTTGAAGT AGTTGAGAGG CTTCACTCTC TCCTTCCCAA CCTCCAACTT CGGCAAACTC 6960 

TCCTTCGAGT TCGGCAGCAC GAACCCCGTC CTCGTCTGAG AAATCTTCCT TCATGTAGAT 7 020 

AGCATCTTTC TCTTTCATGA TGCTATAAAG TT7TTCATTT CCCATGATAA CGACATCAAT 7080 

GGCACGTTCA TCTTCGTAGT CAAAGTGATT TTGACGAAGA AC AG AG AG AC GTTCATCTGG 714 0 

ACCAAGAGAG ATGTCACCAC TACTAGGTTC CATA7CTCCA GCTAAAATTT TTAAAAAGGT 7 200 

TGATTTTCCG GCACCATTAG CACCGATTAA TCCGTAAGTA TTTCCTTCTG TAAATTTGAT 7260 

ATTGACATCA TCAAAAAGTT TGCGATCACT AAAACCTAC7 GAAACATCAG ATACTGTAAG 7 320 
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CAATCTTTTT CTCCTATATC TGTAATATAT TTATTCTACT AGAAAATACA GAAATATTCA 7 3 30 

AATTTTTATT TGTCAATTTT GTGTAAATTA TATTTACAGT ATCCTTTACA CAAATCTGTA 74 4 C 

AAAAGCAAGG CTGATTTATT TTGATAAATT ACCGTTATTT CATTAAAAAA ATGCTATAAT 7500 

TGAAAGCACT ATATCGAAGG AG AAC AAAA T GACTAAACCC ATTATTTTAA CAGGAGACCG 7560 

TCCAACAGGA AAATTGCATA TTGGACATTA TCTTGGAAGT CTCAAAAATC GAGTATTATT 7 620 

ACAGGAAGAG GATAAGTATG ATATGTTTGT CTTCTTGGCT GACCAACAAG CCTTGACAGA 7680 

TCATGCCAAA GATCCTCAAA CCATTGTAGA GTCTATCGGA AATGTGGCTT TGGATTATCT 7740 

TGCACTTGGA TTGGATCCAA ATAAGTCAAC TATTTTTATT CAAAGCCAGA TTCCAGAGTT 7 800 

GGCTGAGTTG TCTATGTATT ATATCAATCT AGTTTCGTTA CCACGTTTGG AGCGAAATCC 7 860 

AACAG7CAAG ACAGACATTT CTCAGAAAGG ATTTGGAGAA AGCATTCCGA CAGGATTCTT 7920 

GGTCTATCCA ATCCCTCAAG CAGCTGATAT CACAGCTTTC AAGCCTAATT ATGTTCCTGT 7980 

TGGGACAGAT CACAAACCAA TGATTGAGCA AACTCGTGAA ATTGTTCGTT CTTTTAACAA 8040 

TGCATATAAC TGTGATGTCT TCGTAGAGCC GG AAGGTATT 7ATCCAGAAA ATGAGAGACC 8100 

AGGGCGTTTG CCTGGTTTAG ATGGAAATCC TAAAATGTCT AAATCACTAA ATAATGGTAT 8160 

TTATTTAGC? GATGATGCGG ATACTTTGCG TAAAAAACTA ATGAGTATGT ATACAGATCC 3220 

ACATCATATC CGCGTTGAGG ATCCAGGTAA GATTGAGGGA AATATGGTTT TCCATTATCT 8280 

AGATGTTTTT GCTCGTCCAC AAGATGCTCA AGAAATTCCT GATATGAAAG AACGTTATCA 83 40 

ACGACGTGGT CTTGGTGATG TGAAGACCAA GCGTTATCTA CTTCAAA7AT TAGAACGTGA 84 00 

ACTGGGTCCG G 3411 
(2} INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9064 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: L7: 

TGCCGTACTC AAGTACAGCC TGCGCTAAGT TTCCTACTTT GCTCTTTGAT TTTCATTGAG 60 

TATTAGTAAC CAAAATCCGA CCACATAGCC AGCCCCTATG AATATAGCCA TTAAAGCTAG 120 

CATGGAATTT AGGAAATTAA AAACCACCGC AGATACAAAG GTTAGCACAA AAA C ATT AAA 130 

AGCAATGGTG TCAGAAGCCA AGACTAGAAT ATAGGCTGTC AACCGATCTA AAGTTTTGGA 24C 

ATCTAGGAAA AATAACTGTT TATACATGAT GACCTCCTCT ATCCCTGAAA AGCAAGCCTT 300 
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TTCTTTTTTT ACCCCAAGAC CCTATG7AGA AAAGTGAGCA AAAACGGGAA GGTCGCTACA 3 60 

ATATTATTGA TCACATGCAC CGCATAGGAT GGATAAATGC TCTTGCTATA GCGGGTCAAA 420 

CCAGCAAAGA TCATTCCAAC TGTTGCAAAG ACGAAGATAT CTAACAGACT AGGCAGGCTT 4 80 

GAAAAATGAG GGAGAGCAAA TAAAATAGAA GCAACAAGCA AATCAAGACC AAATCGCGAA 540 

TGCTTAAAGA AAGCATGTTG CAGTAATCCT CTATAAATCA ATTCTTCCAT CAGTGGAACC 600 

AGAAAGAACA GGGCTATATA AATACCTAGC TCTGCAAAGT TAGTCCCACT ATAACCAATC 660 

AATACACCCC AACCTTCCGC AGTTGACTGA ACATGTTTAG CTGTCTGAAC GTTAAAAGAG 720 

ATCTGGAACA CTAGCACTAA TACTGTCAAA ATCCAATACC AAAGCCATTT TTTTCTTGGA 78G 

ATGCGGAAGA GATAACCATG CCCTGTCTTA ACAAGAACCA CAATCATGAC TCCAATAAAA 84 0 

AGTAAACTCA AGATATTTTG AATCCAGAAT AAATTGCCTA TCTGAGAAGA AAATTGCCAA 900 

TAGTTTTGGA CGATAACCCT CAGCTGAGAA AGACTAAATA CGAAAAATAA GTAAGAGAAG 960 

ACTGCACTTA TTTTGAATAC AAGTTGATAC TTTTTCATAG AAATCCTCCC TACTATGACC 1020 

TCACCTTCTC AGGCTCTACT GCTGTAAGAT TAAGAAGACA GTTTGTTTTT TTTAAGGCTA 1080 

ACCTGACTAC TAGATAATAG ATACATTAAG GCATTAAAGA CAATGAAAAT ATGTCCATAG 114 0 

AATAAAATCA ACCTCGCATC CAAACCAAGA TAAAGTTTGA TTATCAAAAA GATGAGCAAA 1200 

AGAATTTGAA ACCATAAGGT TTTTCCAAAA ATAAATTTAA AGCGATTTCG AATATCTACT 1260 

TCCTTGATTT TTACCGCCAC CCCTTTATTA GCAAGAAGGA AAACTCCTGC TTCAAACAAA 1320 

CCACTGTAAA GAACAAGCCA CCCAATAGAT AC GAT AG AG A TTTGTAAAAA TGTCCCTAAA 13 80 

AGAATATCCA ACACACTAC7 CAAGAAAATA ACAAAAAATA ATCTGTATTT CATATTAAAT 1440 

ACCTCCAT7C ATTTATTTCA CTAACAATTT AATAGAGCCT TCTACTCAAA TATCCTGTCA 1500 

GAAAAGGATA GAAAGCTACT TTTTATAATA CTTCAAGCCC CACATGACCA CAAGCCTCAT 156 0 

AAACAAGCAG AGAATACACC TATATAAGCG ATTAGTTGTT GATAGAATTC 7G7TTCTCAA 162'J 

ATACCTCTAT ACAAACAAAT GACAAACATA AAATCTGCCA AGCCGATAAA CATAAGTTGA 1680 

TTGGTTCTAG GACTAACCAA ATCATCATTT ACTTATATTT AAGAGTATCT CTTTTATTTT 174 0 

AATGTATGTT AGCACTGAAA AGCAAGACAG GCCAATAATA TTTAAAA7GA ACAGTAACGG 1800 

GGTTAAGTCT CTAAAAAAAT TATCTACTGA CACTACAAGA AA TACT AT AC ATATTATAGT 1860 

CCAAACTATC TTTTTCTTAT CCATAATTAT TTACTCCTTT CCTAACAAAT CCAGCTTATC 1920 

AATCAAGAGC GATTTTTAAC ATAATGTAGC AGCACCCGTT GCAACTTTGA CAAGTTTAGT 1980 

ATATCATTC7 TTTTTAAAAT TTTTCATCCA AATCTTGAAT TGTCATCGAA ACATCTTGAA 204 0 
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TTGTTAAAAA ATTTAAAAAG TAAGCATTAA AAA CAT ACT T TCCTCTTTAT ATTGTATTGA 2100 

TACCAACTTG TTTGTAGACT TTTCATCCTG CTATCACATA TCATTTTGAC AGGCGAAACA 2160 

ATATTAAAGA AACTCCCCTG TAAATTAAGC TAGCAAATAC AGGGGAGAAA TTTATTTTTT 2 2 20 

AGAGAGTACT ATCCGTATCC TTTTTGGAAG ATTTTGAAAA TATTTTTCTA ATTAAGTCAT 2 290 

CCATATAAGG ACCAAATATA CCAACTACTA AACCAATAAT AAAACTTTTA AAATCCATAA 2340 

TTACCACCAA CATATTGCTG CATAGGCTAC ACCTCCAAGT ATAGCTCCAC CTGCAGCACC 2400 

AGTTACACCT ATTCCTATAG CAAATGG7CC CAATAGAAAT GTCAAACCGT TGTTGCACAC 24 60 

CCATCAATTG CGCCATATGC AACCCCTGCT GCACAACTAA TTTTTCTTCC CCAATCAATA 2520 

TCTCCACCTT CAACGCAAGC AAGCATTTCA TTATCCATAA CTGCAAATTG TGACATCATT 2530 

TTTGTATCCA TATAGTGTAT CACTTTTCAG TTACGGAACA AGTTTAATAT AAAAATTATC 2640 

AAAAAAACAT AGGC AAT AAA GAGAAAAATT AATTTATCAT AGATTAGAAA TAATATGACA 2700 

AAACAATTCA ATGATGTTAA TTCAATAGTC TTTTGTTTTT TATCGGAGAT ACTTATGGAT 27 60 

AGATAAATAA GATAGGTTTG AAAAGCGAAG AGAATAATAA AGAATATAGC CTTCATAAAA 2820 

TTTACCTTTC ATTTTTATGA TGTAGCGGTA TAGGCTAAAT ATCCACAAAC CACTGCTCCT 2880 

CCAATTCCTC CTATTGCACC GCCCCATGGT CCTAGAACTC TCCCATATTT CACTCCACCC 294 0 

GCTGCACAAC CTAAAGCAGC AACTACAGCT GCTCCTCCGG AATTACCTCC ATAAACCTCA 3000 

CTCAGCATTG TTTCATTTAT ATT AC AAT AA GTATTCATAC AAGTCTCCTT TTATTAAAAT 3060 

CCACCCGTTG CCCCTGTTAC TCCTGCCCAA AGATCCACAC CAAATTTAGC TCCTATGTAT 3120 

CCACATGCTC CCATAAATGG TGCTCCAACA CCACTCGCAG CACAAATAGC TGTCCCTAGC 3180 

CCCCAGCCAC CAAAAGCAGC ACCACCACCT TCTAAGACAT TAGTTTGCCA ATTATTCTTG 3 240 

CCTCCTTCAA TACTAGATAA CATAGTTATA TCCATTTCAT GAAATTGTTC CATAATTTTT 3 300 

GTATCCATGA CAAATACTCT TTTTTATTTT TAATTTTTGT CTTCTTCTAA CTTTGACAAC 3 3 60 

TTTAGTATAT CATCGTTTTT TAAAATTTTT CATCCAGATT TTGAATAGTC ATCGAAACGT 34 20 

CTTCAATTGC AAAAATTACA TTAGACTTCC TGCAAAACTA GAATCCTAGT TCATGATTGA 3480 

TAATACCAGC ACTCAAATTC ATTCGTAATC CCAAGCGTTT ACGATGACTT CGATAGGTTG 354 0 

TTGAAAACAT TTTAAACCTT TTTACTTTGG CAAAGATGTT CTCAACCTTG CTTCTCTCCT 3 6CC 

TAGATAGCGC ATGGTTACAG GCTTTATCTT CAACTGTTAG CGGTTTGAGT TTGCTGGATT 3 660 

TACGTGAAGT TTGTGCTTGA GGATATATCT TCATGAGCCC TTGATAACCA CTGTCAGCCA 372 0 

AGATTTTACC AGCTTGTCCG ATATTTCTGC GACTCATTTT GAACAACTTC ATATCATGAC 3780 

AATAGTTCAC AGTGATATCC AAAGAAACAA TTCTCCCTTG ACTTGTGACA ATCGCTTGAG 384 0 
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TCTTCATAGC CTCAAATTTC TTTTTACCAG AATCATTCGC TAATTCTTTT TTTACGGCCA 3 900 

TTGATTTTTA CTTCCGTCGC ATCAATCATT ACCGTGTCCT CACAACTGAC AGGAGTTCTT 3 9 60 

GAAATCGTAA CACCACTTTG AACAAGAGTT ACTTCAACCC ATTCCCTCCG ACGGAGTAAG 4 020 

TTGCTTTCGT CAACACCAAA ATCAGCCGCA ATTTCTTCAT AAGTGCGGTA TTCTCGCACA 4080 

TATTGAAGAG TGGCCATAAG AAGGTCTTCT AGGCTTAATT TAGGTTTTCG TCCACCTTTT 414 0 

GCCTGTTTAA GTTGATAAGC TGTTTTTAAT ACACCTACCA TCTCTTCAAA AGTCGTGCGC 4 200 

TGAACACCAA CAAGACGCTT AAATCGTGCA TCAGTTAGTT GTTTACTTGC TTCATAATTC 42 60 

ATAGAACTAT AGTAAAATGA AATAAGAACA GGATAAATCG ATCAGGACAG TCAAATCGAT 4 3 20 

TTCTAACAAT GTTTTAGAAG TAGAGGCGTA CTATTCTAGT TTCAATCTAC TATACTATAC 4 3 80 

CATATTTTGT TTCGCAGGGA ATC7ATTATA AAAGGGTAAG TATTGCAAAA ACACTTACCC 4 440 

TTTTCTTTTA TACTTCATTA AGCTCTACTT TTTATAATAC TTCAAGCCCC ACATGAGCAG 4500 

AAGCATGATG ATTAACCAGA GAACAGCGCC AATATAAGCG ATTATTTGTT GGTAGGATTC 4560 

TCCTCCTGTG ATACCTCTAT ACAAACAAAT AATAGACATA AAACCTGTCA AGCCGATGAA 4 620 

CATAAGTTGA TTGGTTCTAG GACTAACCAA ATCATCATCT TCAAACTCTC TTATCCTCAT 4 630 

TTCCCTACTG AGATAAACAG TAACCAAAAT AGAAGCCAAG TTAATAACTA CTAAAAGAAA 4740 

TTGGAAAACT ACGGAAAAAT TTAAAAACTG ACGAGATAGA AATAGATAAG TAGAAACAAG 4800 

CAAGGGCAAC TGACCTAAGA ACAATCTCGC AAGGAAGATG TTCCGTTTTT TAGCAAGAAA 4860 

AC7TTTCATT TCTTT7CTCC TTTCTTTTTA TTGATAGCAA AATAGATCAT AACTGCAATC 4 9 20 

ACATAGGCTA TGGTATAAAA TAGCTGATAC CAAGCACTCT CCCTAAGCGG ATATAGAAAG 4 980 

ATGGACATGA TT AG AT AC AG AACGAAAATA ATCAGTATTT TTTTCTTCAT AAGATTTCCT 5040 

CCTAAATGTG CG AT TTATCT TAGTTGAGCA AGAACATTTA CACTGCTAGT ATAGCACTTA S100 

TTTTGACCTT GGATCACTCA AATCATAAAT GGTCATCAAA ACCTCTTGAA TTGTAAAAAT 5160 

TAAAAAAGCA AGCATGAAAA ACATACTTTC CTCTTTATAT TGTATTGATA CCAACTTCTT 5220 

TGTAGACTTT TCATCCTGCT ATCACATATC ATTTTGACAG GCGAAACAAT ATTAAAGAAA 52 80 

CTCCCCTGTA AATTAAGCTA GCAAATACAG GGGAGAAAT? TATTTTTTAG AGAGTACTAT 5 340 

CCGTATCCTT TTTGGAAGAT TTTGAAAATA TTTTTCTAAT TAAGTCATCC ATATAAGGAC 5400 

CAAATATACC AACTACTAAA CCAATAATAA AACTTTTAAA ATCCATAATT ACCACCAACA 54 60 

TGTTGCTGCA TAG GC? ACAC CTCCAAGTAT AGCTCCACCC GCAGCACCAG TTGCTGCACC 552C 

TTGCCATGTT CCTGTTTTAA TGCCTAGTTG AAGACCTCTT GCTGCTCCTC CTCCAACACC 5580 
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TGCTTTGGCA AAATCTCCCC AATTGCATCC GCCACCTTCA ACGCAAGCAA GCATTTCAGT 5640 

ATCCATAACA GAAAATTGTG ACATCATTTT TG7ATCCATG ACAAATACTC CTTTTTTAAA 5700 

AAACTAAAAT AAATCACAAT AGAATCCTCA TAATTTTACT ATAAGTCTTA CCAACTTAGT 57 60 

CCCAATTTAT CACCAACCAT ACCTCCTAAG CATGTTAATC CACCCCCAAT TGCACCAATG 5 820 

TGTGCTCCAA CAAATGCACC AGCAAGTCCA GCTACTCCTA AAGTGGCCAA ACCTGCTCCA 58 80 

GTTCCACCAG TTATAATTCC CGTAGTGACT CCTGTAATCA GTGCATTTTG ACAATCAGTG 5940 

GAGCTATACC CCCCTTCAAC TTTCGCAAGC ATTTCAGTAT CCATAACCTC TAACTCTCAC 6000 

AACATTTTTG TATTCATGAT GAATACCTCC TTTTTATTTT CAATTTGTTA CCAAAGTCTT 6060 

AAATTCAATA AACAAATAGA TTTTTTATAG TATCTTTTTG ATTTTCTTAA AAAAGTATAT 6120 

ACGTCTACTA TCTTCTTAAA GGTAGCAGTA CCTATTTTTT AGTCTAAGAT TTCAATAATC 6180 

TTGAGTATCT AAAATATCTT AATTTCGTTA TTCTCCTTCC AATAAAAAGT TTTACTATAC 62 40 

TATTTATTAA CTTGCAGAAA GCAAAAAATA TTAGTAAATA ATAGTTTATA GTTAAGTTTT 6300 

TTATTCCTAC CAATCCATCA ACTAAGTAAA GCATCAACGA TTACATAAAC GATTGATAAT 63 60 

ATAATTAAAA TTTTCCTAAC TATCTTATTC TCATCATTCT TAGATAACTT TGATATTTTG 6420 

TAAGTAAGTA AATAAGACAG TAAATTAATA GCGATAATAA TACTATATTT AAGAATCATA 6480 

ATCTTACAAA GAGGACATAA TTCCTGAACC TACACAAATA AGTGTTGCTG CTCCCCCAGT 6540 

TATCGGACCA GTCGCAGCAG CTAATAGTAC TGCTCCAATA CAACCACCGA 7TGCACATCC .6600 

TAAATTGCCT CTTCCTCCAC TAACTATTTC GAGT7CTTCA TTATCCATAA CAGAAAATTG 6 660 

TTCCATCATT TTTGTATTCA TGACAAATAC TCCTTTT7TC TTTTTTTATT T7TGTCTTC7 67 20 

TGTAACTTTG ATAAGTTTAG TATATCA7CC TTTTTTAAAA TTTTTCATCC ACATCTTGAA 67 80 

TTGTCATCGA AACCTCTTGA ATTAGCTTTT TTATTTCAAG CCACCTCTAA ATGTTTAAAA 6840 

AAAATAATTT CTAATCACTT TTTTACCATT CAGGAAGTTT TAATGACTAT TCAAGATTTC 6900 

ATAAAATATG AACTTAGTTT TATGACATAA TAGACCTATC CACTATATGA AAGGAATTGC 6960 

CAATGACTTC TTATAAACGT ACATTTGTTC CTCAAATAGA TGCGAGAGAC TGTGGTGTCG 7020 

CTGCCTTAGC CTCGATTGCT AAATTCTATG GTTCAGATTT TTCTCTAGCT CACTTGAGAG 7080 

AACTTGCAAA GACCAATAAA GAAGGGACCA CTGCTCTTGG CATTGTAAAA GCCGCTGATG 7140 

AAATGGGCTT TGAAACAAGA CCTGTTCAAG CAGATAAAAC GCTCTTTGAC ATGAGTGATG 7200 

TCCCCTATCC ATTTATCGTT CACGTTAACA AAGAAGGAAA ACTCCAACAT TACTATGTTG 72 60 

TCTATCAAAC AAAGAAAGAC TATCTGATTA TTGGTGATCC TGACCCTTCT GTAAAAATCA 7 3 20 

C7AAAATGTC AAAAGAACGC TTTTTCTATG AATGGACTGG AGTAGCTATT TTTCTAGCTA 7 3 90 
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CCAAACCCAG CTATCAACCC CATAAAGATA AAAAGAATGG TCTACTAAGC AAGCTTCCT7 7 4;10 

CCTCTGATTT TCAAACAAAA ATCTCTCATT GCTTACATTG TTCTCTCAAC CTTATTGGTC 7 500 

ACTATTATCA A7ATAGG7GG TTCTTACTAT CTCCAAGGAA TCTTGGATGA ATACATTCCA 7 560 

AATCAGATGA AATCAAC7TT AGGAATCATC TCAGTTGGTC TGGTTATCAC CTATATCCTC 7 620 

CAACAAGTCA TGAGCTTCTC CAGAGATTAT CTCCTAACCG TTCTGAGTCA GAGATTAAGT 768C 

ATTGATGTGA TTTTATCCTA TATTCGCCAT ATTTTTGAAC 7TCCCATG7C TTTCTTTGCC 77 4 0 

ACACGTCGTA CAGGAGAAAT CATTTCACGA TTCACAGATG CTAACTCTAT TATAGATGCC 7 800 

TTGGCTTCTA CCATTCTTTC TCTTTTTCTG GATC7T7C7A TTCTGATTCT 7G7AGGAGCC 7 860 

GTCTTACTGG CACAAAACCC 7AA7CTC77C CTTCTTTCTC TTATTTCCAT TCCTATATAC 79 20 

ATGTTCATCA TCTTTTCTTT TATGAAACCT TTCGAAAAAA TGAACCATGA TGTCATGCAA 7980 

AGTAATTCTA TGGTTAGCTC TGCCATTATC GAAGATATCA ACGGGATTGA AACTATAAAG 804 0 

7CGC7CACCA GTGAAGAAAA TCGCTATCAA AATATAGACA GCGAATTTGT AGATTATTTG 8100 

GAAAAATCCT TTAAGCTCAG TAAATATTCT ATTTTACAAA CGAGTTTAAA GCAGGGAACA 3160 

AAATTAGTTC TGAATATCCT TATCCTATGG TTTGGCGCTC AATTAGTCAT GTCAAGTAAA 82 20 

ATTTCTATCG GTCAGCTGAT TACCTTTAAC ACACTTTTTT CTTACTTTAC AACTCCTATG 8280 

GAAAA7A7TA TCAACCTCCA AACCAAACTC CAATCTGCGA AGGTCGCTAA 7AACCGTTTG 3 3 40 

AACGAAGTCT ATCTAGTCGA A7C7GAA7T7 CAAGTTCAAG AAAACCCTGT TCATTCACAT 3400 

77777GATGG GCGATATTGA ATTTGATGAC C7TTCT7A7A ACTA7GGT77 TGGATGAGAT 84 60 

ACCTTAACAG A7A77AA7CT CACGA77AAA CAAGGAGA7A AGGT7AGCC7 AG77GGACTT 8520 

AGTGGTTC7G GTAAAACAAC TT7AGCCAAA A7CATTC7CA ATTTCTTTCA ACCCTACAAA 3 580 

GGGCATA7TT CCATCAA7CA TCAGGA7AT7 AAAAACAT7G ATAAAAAAG7 CT7GCGCCG7 8640 

CATATTAATT ACCTACCCCA ACAAGCC7A7 A7CTTTAATG GCTCTA7T77 GG AAAACTT \ 87 00 

ACCTTGGGCG GTAATCA7AT GA7TAGTCAA GAAGATAT7C TAAAAGCT7G TCAAGTAGCT 87 60 <* 

GAAA7CCGTC AAGACAT7GA AAGAATGCC7 ATGGGCTA7C AAACTCACCT CTCTGATGGA 3820 

GC7GG7CTAT CAGGAGGACA GAAGCAACGA A7CGC7CTCG CTCG7GCTC7 7TTAACTAAA 8880 

TC7CC7GT7T TAATACTAGA 7GAAGC7AC7 AGCGGTCT7G A7CTCT7CAC 7GAGAAAAAG 3940 

GTTATAGATA A7CTTATG7C TCTAAC7GA7 AAAACCAT7C TCTTTGTAGC CCATCGTCTC 9000 

AGTATAGCCG AACGAACCAA CCGTGTCAT7 G7TC77GACC AGGGGAAAAT CATTGAAGTT 9060 

GG7A 9064 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 7780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 13: 
CTCCATTTTT TTGATTTCA? AAATAAACAA CCTCTCTCTT AATTTTGTAT AATTATAACG 60 

ATATCCAAGT TACTTGTCAA GTCTTTTTTA AATTTTTATC TCAAAAATAT TTTTTCGTTC 120 

AAAAAAAGGA GCCATCAGTT GATTTCAAGC TCCCTTTTAT ACAGAATTAA ACTATTTTAT 180 

AGTTCGACAA TCTTACCTGT TTCAAAGTAC ACAACCCATT CACAGATATT TTTACCATAG 2 40 

TCACCGATAC GCTCC AAGT A GGAAATAACT TGGAAATAAT CACCACCCGT AAC AATGGCT 300 

TCTGGATTTT TCTTAATCTC TTCAGTCGCA AGGTCACGGA TAGTTTCAAA ATAGTGGTTA 360 

ATTTGCTCAT CCATGGAGGC CACCCGGTAT GCGTCGTCAA CAGAACCATT AAGATAAAGA 4 20 

TCAAGTGCTG CTTCCACAAC GCTTTTAACT TCACGTCCCA TTTTTTTAAT TTCTTCCTCT 4 80 

ACAGCTGGAA TGCGCTCTTC CCCCTTCATA CGGATGGTTG CCTGGGCAAT GGCTACAGCG 540 

TGATCCCCCA TACGCTCCAC ATCTGATACA GCCTTAAGGA CAGTCAAGAC TGTACGCAAA 600 

TCTTGAGAGA CTGGTTGTTG GAGTGCGATC ATTTCAAATG ATTTCTTTTC CACTTTCACT 660 

TCGTATTCAT TTACTTCTGC ATCATCTTCG ATGACCTCTT TTGCCAGGTC ACGGTCATGC 720 

CTGACAAAAG CACGTACCGT AC G ATT G ATT TGTGAGAGCA CTTCTTGTCC CATACCGTAG 780 

AACTGGTTAT GTAATTTCTC TAAATCTTCT TCAAATTGAG ATCGTAACAT CTTTCATCTC 340 

CTTATCCAAA TTTTCCTGTA ATATAGTCTT CCGTTTCCTT GTGTTGGGGA TCAAGGAACA 900 

TCTGCTTGGT ATCATTAAAT TCAATCAAAT CTCCATCTAG GAAAAATCCT GTCTTATCAG 960 

AGATACGTGA AGCTTGCTGC ATGGAACGGG TTACCAGAAG CATGCTGTAC TTGTCTTTTA 1020 

GACCATACAA GGTTTCCTCA ATTTTACCAG CTGAAATCGG ATCCAAAGCC CAAGTTGGCT 1080 

CATCCAAGAG GATGATTTTA GGACTAGTTG CCAAGACACG GGCCACGCAG ACACGCTGCT 1140 

GTTGACCACC TGACAATCCA ATAGCTGAAT CATATAGACG ATCCTTGACC TCATCCCAGA 1200 

TAGAGGCACC TTGCAAGGCT TTTTCTACGG CTTCATCCAG AACCTGCTTA TCCTTAATTC 1260 

CATTGATACG AAGCCCGTAG ACAACATTCT CAT AG AT AG T CATAGGGAAA CCATTAGGTT 13 20 

GTTGGAAAAC CATTCCGATT TCCTTACGTA ATTCAACCGT ATCTGTACGC GGACTGTAGA 13 80 

TGTTGTGACC ATTGTACACC ACGGATCCAG TTGTGGTCAC CTCTGGATTG AGATCTCCCA 1440 
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TCCCGTTGAG AGACTTCAGG AGGGTTGACT TCCCTGATCC AGATGGACCA ATCAAGGCTG 1500 

TAATTTCCTT AGCTTGGAAA GATAGGGAAA CACTATTCAA AGCCTTCTTT TTATTATAAT 1560 

AAACGGACAG GTCTGATACC TGTAAAA7CC CA7CTGTCAT ACCC7TTCC7 TTCTAACCAA 1620 

AGTGACCAGA TACATAGTCA TTGGTGGACT GTAGCTTGGC ATTTTGGAAA ATAGTTGCAG 1680 

TC7TCTCATA CTCAATCAAA TCACCCAAGT AAAAGAAGCC TGTATAGTCA CTTCCACGAG 17 40 

CAGCCTCCTG CATATTATGCT GTTACAATGA TGATGGTAAA GTTTTTCT7G AGCTCAAACA 1800 

TGCTCTCTTC 7AG7TGCATC GTCGCAATCG GA7CCAAGGC TGAGGCTGGC TCATCCATTA 1860 

AGAGGATATC TGGCTTAACA GAGATGGCAC GAGCGATACA GACACGTTG? TGCTGACCAC 1920 

CTGATAAGGT CAAGGCTGAC TTGTGGAGAT CGTCTTTAAC CTCATCCCAG AGGGCAGCCT 1980 

GACGAAGGGA GGTTTCTACG ATTTCATCTA GGACTTCCTT A7CCT7AACT CCAGCACGTT 204 0 

CATGCGCAAA GGTAATATTA CGGTAAATTG ACTTAGCAAA TGGATTGGGA CGTTGAAAAA 2100 

CCATTCCAAT GTGTTTACCC ATTTCATAAA CGTTGATTTC TGGACGGTTG ACATCAATTC 2160 

CACGATAGAG AATCTGCCCA CTTACTTTAG CAATATCAAT AGTATCATTC ATGCGATTGA 2 220 

GACTGCGTAA CTAGGTACA7 TTCCCCCATC CCGACGGGCC AATCAAAGCT CTAATTTTAT 22 80 

T7CTTTCAAA TTGCATATCA ATCCCCTTAA TGCATTCATT TTTACCATAG TAAACATGGA 2 34 0 

CATCCTTAGT AGAAAGGGCT ACTTTTTCTT CAGGAAAGGT AAGGATATGC TTCTCATCCC 2400 

AGTTATATGT TGACATGGCT TCTCCTTTAG GCAGCGGTTA ATTTCTTGTG TAGATAGCTT 24 60 

CCGAACTTAC GAGCTCCAAA CTTAAAAA7C AGGATAAAGA 7 C AG GAG C AC AGCGGCAGAA 2 520 

CCTCCTGATA CAATGCT7CC ATCTGGAATA GTGCCTTCAC TATTGACTTT CCAGATA7GC 2 580 

ACAGCCAAGG TTTCTCCTTG ACGGAAGATA GAGATGGGGC TAGTCACACT GAGGATATTC 2 640 

CAGTTAGACC AGTCAAGAGC TGGCGCCGAT TGGCCTGC7G T AT AG AT C AG AGCTGCAGCT 2700 

TCGCCAAAGA 7ACGACCAGA TGCCAAGACG ACACCCGTTA CAATACCTGG AAGCGCTTCC 27 60 

GGAATAACAA CATGAACCAC TGTCTCCCAG CGAGAAATCC CAAGAGCCAG ACCAGCCTCA 2820 

CGTTGGGTAT GGTGAACG7G TTTCAAACTA TCCTCTACAT TACGCGTCAT CTGAGGCAAG 2 880 

TTAAAGACTG TCAAGGCCAA GGCACCTGAA ATGATTGAAA ATCCATACTC AAACTCCACT 2 94 0 

ACAAAGATCA AGTAACCAAA GAGACCCACC ACCACTGATG GTAAAGAGGA CAAAA7TTCA 3 000 

ATACAAGTCC GCACAAAGTT GGTAACAGGA CC7TTTTTAG CATATTCAGC CAAGTAAATC 3060 

CCAGCTCCCA TAGAAAGAGG TACAGAAATA A7CAAGGTAA TGACCAATAG GAAAAAGGAA 3120 

7TGTAAAGCT GAATGCCAA7 CCCACCACCT GCTTGAAAAG CAGAAGACCT TCCAGTCAAG 3180 
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AAAGACCAAG AGATATGGGG CAAGCCCCGA ACCAAGATAT AGAGAATCAA GGAAGCCAAG 3 240 

ATTGTCACAA TGATGCTAGC AATCGTATAG AGGACAGCTG TTGCAAGTTT ATCTAATTTC 3 3 00 

TTAGCGCGCA TAATTTTTCT TTCCTCTTTC TTTCGTAATC AATTTAATCA CACTGTTAAA 3 3 60 

AACTAAGCTC ATCAAGAGCA GTACCAAGGC CAGTGACCAG AGAACATTAT TATTTACAGT 34 20 

TCCCATGACA GTGTTCCCAA TTCCCATAGT TAATATAGAA GTTAAAGTTG CAGCTGGTGT 3 4 80 

GGTCAAGGAA CTTGGGATAA CAGCTGAGTT TCCGACAACC ATCTGGATAG CTAGAGCCTC 3 54 0 

ACCAAAGGCA CGCGCCATCC CAAAGACCAC TGCAGTGAAA ATACCAGAAC GGGCCGCCTT 3 600 

CAAGATCACA CGCCAGATAG TCTCCCAGCG AGTGGCTCCC AT AG CG AAAC TGGCTTCACG 3 660 

ATAATAACGA GGAACCGCAC GCAAGCTATC CGTTGTCATA AAGGTTACGG TCGGCAAAAT 3720 

CATGACAAAG AGGACGGAAA TCCCTGACAA AATCCCAAAA CCAGTCCCAC CAAAGACACT 3780 

GCGAACAAAG GGAACGACGA CTTGCAAGCC AATAAATCCG TACACTACTG AAGGAATCCC 3 84 0 

AACCAGGAGT TCAATAGCTG G7TGCAAAAT CTTCGCCCCT TTTGGTGATA CTTCGGTCAT 3 900 

AAAAACTCCT GCACCAATAG CAAAGGGTGT TGCCATAACG GCTGAGAGAA TGGTAACGAT 3 9 60 

AAAGGAACCC AAAATCATAC GAAGGGCACC AAATTCTTTA CTAGAAGGAT TCCAAGTTCC 4020 

TCCCAAAAGA AAGTCAAAGA TATTCACACC ATTGACAAAG AAGGTCGACA ACCCTTTTTC 4080 

CGCTACGAAA ACCAAAATCA TGGCCACAAG GATGACTATC AAAGAAAGAC AGGCAAACGT 4140 

CAAACCTTTT CCTAATTTCT CCAGACGAGA ATTCTTTGAT GGAAGCAACA TTTTCTTAGC 4200 

TAATTCTTC? TGATTCATTA TTGTCTCCCT TCCAACACTG TCACAGTTCC GGCAGCATCT 4 2 60 

TTTTCAACCT TCA7TTCC7T AATCGGAATA TACTTCAATC CTTTGACAAT CCCTTCTTGG 4 320 

GTCTCATCCG AGAGAACAAA ATTGAGAAAT TCTGCAGCCA ACTCATTGGG CTGCCCCAAT 4 380 

GTATACATAT GCTCATAAGA CCACAAGGGC CAATTATTGC TACTTATATT TTCTCGACTT 44 40 

AAGTCATAGC CATTCAACTT CATGCTTTTG ACCGAATCAT CTATATAGGT AAGAGATAAA 4 500 

TAAGAGATAG CTCCTGGACT TTTTGATACG ATTGATTTTA CCGCTCCATT TGAATCCTGC 4 560 

TCCTGACTTT GCATGGCAGA CTGACCTTCC ATAATGACAG TATCAAAGGT AGCACGAGAG 46 20 

CCAGAGCCGG CTGCCCGATT CATAACAGAC ATGGGTAAGT CCTTACCACC AACCTCTTTC 4 680 

CAATTGGTTA CCTCACCTAT GAAGATTTGA CGAAGTTGCT CTGTCGTTAG GTTATCAACA 474 0 

TCAACCTCCT TATTGACAAT CAGAGCCAAG CCAGCTACCG CGACCTTGTG GTCAACAAGA 4 800 

GCAGAAGCAT CAATTCCGTC TTTTTCCTCA GCAAATACAT CTGAGTTTCC TATATCAACT 4 860 

GCCCCAGACT GAACCTGGGA CAAGCCTGTA CCAGAACCTC CCCCTTGGAC ATTGACCGTT 4 920 

TTTCCAACAT GGATCGTGCC AAATTCATCT GCCGCTACTT CAACCAAGGG TTGCAAGGCA 4 980 
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CTTGAGCCAA CAGCCGTTAT GGATTCTCCA CGATCAATCC AGCTAGCACA GCCTACTAAA 504 0 

CAAGCCGTCA GCCAAAAAGC GATAAGAGAC AGAGCAAGCT TTTTTCTTTT TTTCACTGTT 5 LOO 

TTTCTCCTCG AAAATAATTA TGAATACTGT GAATTTTTTA AGTAGTTCTT TATGAGTTGA 5160 

CGCATGAATT CTTACCAAAT TTCTGCGCAA TTGATTATTT ATATAATATA GGCTATATTA 52 20 

CTCTTTCCTA ACCTCCTTTT TTCATATGTG GATAAAATCT CTTGTCTATC CCTTCCCCCA 52 80 

T7GTCACCCA TTATAGTCAT TTCGTGTCTC TTTTTCCCCT TTTTAATGCA AGGGAAATTA 5 340 

CTCTCCTTAG ATGATAATCC AAAAGCTAGA AAGGTATCTC AAACCTCTCT ACTCTCCCAG 5400 

ACTAGTTTAC AACTAAAAGG AAAAGATTCT ATTTTATGAG AAATCTAGTT TACAACCGGT 54 60 

AAGAACGCTA ATAACTAAAC TTCTTGTACT CTTTGAAAAT CTCTTCAAAC CAGTGTTTTG 552 0 

AGCTATCTAT GGCTACCTTC CTAGTTTGCT CTTTGATTTT CATTGACTAC TAAAACTACA 5 580 

TGTAATGGCA ATCAAGATAT CAAGAATCAT CCTACTAAAA AAATCCATAC TTTCACTATA 5640 

ACATAGAATA AGATATTTGA CTAGCATTTT CATTTGAATC TGAGGCCTTT TGGAAAATAA 5700 

TTTTTCAAAA CATTTCCAGT AACCTTTGCA AAGCCCAAGC CATTGCCTTT AACCAAAACT 5760 

TGGTACCAAC CATTTGGCAG ACTTTCTGCC AGCTCAACGG TTTCTCCAGC CGCATACTTG 5820 

ACAAACGCTT CTTGGCCAAT TTCAACCGAC TGTTCGACCT GACTCGCTTT CAAGGCTAAA 5880 

CCAAGAGCGA AACTGGGCTC AAAGCGTTTC TTCTTAAAAG TACCCAGATG CAGTCCATTG 5940 

CGAGCAATCT TGAGCTTCCA TAAATCTGGC AAAAGTTCTG GCAAGAGATA AACCTCGTCT 6000 

CCAAAAATCT GCAAGATACC CGGTAGATTG ACCTTCAAAT GGTTTTGGGC AAATTCCTGC 6060 

CACAAGGCAA CTTGTTCACG GCTGAGGTTA CTCTTACTTC CCTTAAATTT AGGAGCTGGA 6120 

TTGTTACCCT TAAACTGTAG ATGGGCAACA AACTGACCCT CTCCCTTAAA CTGATGAGGA 6180 

TACATCCGAG CCGTTTCTGG CAGGTCAATA CCAGCTACCA TTCCATTGAT ATGCTCTACT 624 0 

GGCAACAAGT CAAAATCATA CTCTTCCAGC AACCAATTGA CAATCTCT7C GTTTTCCTCG 6 300 

GGTGCCCAGG TACAGGTCGA ATAAACCAGA TGACCACCTT CAGCTAACAT GGTCACTGCA 6 3 60 

TCCTCCAGAA TTTCTCTTTG CAAGCTAGCA CATTGACTCG GATAATCTAA GCTCCAATAG 6420 

TCCATAGCAT CAGGTTGCTT ACGAAACATT CCTTCACCAG ACCAAGGGGC ATCAAGAACG 64 80 

ATTAAGTCAA AATAGCCTTT AAAGACCTTG ACCAAGCGGT CGGCAGATTC ATTGGTCACC 6540 

ACGACATTTG TCGCTCCAAA ACGCTCCATG TTTTCAACCA AAATCTTAGC CCGTTTGCTT 660C 

GAAATTTCAT TGGAAnCAAG TAGCCCCTCC CCTGCTAGAT AGGCTGCCAG TTGAGTTGAT 6660 

TTGCCCCCCG GTGCACCAGC CAAGTCCAAG ACCtTCATAC CAGGACTGGG TTCGCCTACT 67 20 
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TGAGCCACCA TTTCAGCAGC AGCTTCTTGC GAATAAACTA AACCTGTAGC ATGCTCACGC 6730 

GATTTCCCTG AAACCTTCCC ATAGTGGCCC CAAGGGGTTT GAGTAATGGC ATCAGAAAAG 6840 

GAAAGTTGCT CTTCTTTTAA GCGATTGACC CGAAAGGCCG AAACCGCTTC CTCCTCAAAA 6900 

GAGGCAAGAA AATCTCTTGC CTCATCTCCT AGTATCTCTT TATATTTTTC AACAAATCCT 6960 

TCTGGAAATT GCATTTAAGT TCTTTTCCTT TCGTAAATAT AGGACTGAAT TTCCTCCTGC 7020 

ATCTCAAGAG GC AC CATC AT GACCGGCTGT CTGGTTTGAA AATCACGAGC TTCACCAAAA 7 080 

AGGGTCACAA CCCGATAGCC CAGACTTTCC CCTAAAATAC TAGCTGCGGC ATAATCCCAT 7140 

GGTTGCAGAT AAGTGAGATA GGTCAACAAA CGCCCTGACA AAATCTTGGC AAAACTAATG 7 200 

GCCGCACTTC CATAGACACG AACACCAAGA ACCCCTCGGC TCAAATCAGC CAGCCCCCAT 7260 

TCATTGGTTT CCAGCATACC ACTATTCCCT GCAATGAGAA AATCTCCAAG TGGTTTAGTT 7 3 20 

TTAAAAGGAG CTAGGGACCT ATCATTTAGA CAAACTGGAA ATTCCCCACC ACCGTGGTAA 73 80 

CAATCCCCTT TGACCACATC ATAAATCACA CCAAACTGTC CCTGACCATT TTCAAAATAA 7440 

GCCATCATAA CAGCAAAATC TTCCTGCTGG GCTACAAAAT TATTGGTACC ATCAATGGGA 7 500 

TCAATGACCC AAACCTTCCC CTCTTGAACC GAGGCTCGCA GACAACCTTC TTCAGCACAA 7560 

ATCTTATCCT CAGGATAACG GGACAAAATC TCACCAACCA AGAGTTCCTG AACTTCTTTG 7620 

TCCAGTCTGG TCACCAAATC TGTTGGAGAG GACTTGGTTT CAACACCCAA GTCTTCCTGC 7 680 

ATATGGTCAA GAATGTACTG ACCTGCTTTC TTAACAAGCT CTTTAGCAAA TTCAAATTTA 7 740 

CTTTCCAAGA GAAATCTTTC CTTCCCCTTT TTCTTTGGGG 7 780 

(21 INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4820 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS : double - 

( D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GTAATGATAT AGGAACACCA GGTGACCTGA TGGGACGTCG TAAGCCTATG AACTACTAGC 60 

TGCTAAAGGC TTTAAACATG CTATGC7ACC ATATATCTCA AACCAATACG AACAAGAACC 12 0 

CAAACAAAAG GGCAAGACAA TCAATCTCTA CGGTAAAACA AGAGGTTTGG TTACACATGA 180 

CTTGGTTTTG GAAAAGGTAT TTAATAACCA ATATCATACT TGGAGTGAGT TTAAGAAAGC 240 

TATGTATCAA GAACGACAAG ATCAGTTTGA TAGATTGAAC AAAGTTACTT TTAATGATAC 300 

AACACAGCCT TGGCAAACA? TTCCCAAGAA AACTACAAGC AGTGTAGATG AATTACAGAA 3 60 
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ATTAATCGAC GTTGCTCTTC GTAAGGATGC AGAACACAAT TACTACCATT GGAATAACTA 4 20 

CAATCCAGAC ATAGATAGTG AAGTCCACAA GCTCAAGACA GCAATCTTTA AAGCCTATCT 480 

TGACCAAACA AATGATTTTA GAAGTTCAAT TTTTGAGAAT AAAAAATAGT GTCTACTATT 540 

AGGAAATAAA GTTTAAAAAG GTGATGAAGA ACAAACCAAG ATTCAAGCAG GAATTCCTAC 600 

TGATAATGAA GTAAGTTATG ATCTTATTTA TCAGCAGGAA ACTCTTCCTG CAACAGGTTC 660 

ATCAACTTCT GAGCTTACAG CTTTAGGCCT ATTAGCTGTT GGTAGTTTAG TTCTTTTGGT 720 

TCATAATATG ACGGGAACAG TTTTTTGCTC CCTCTGAAAA GTCATCATTT GATGGCTTTT 7 80 

TTCTATATAG GGTAAAAGAT AGGGTAAAAG GCTATCATCG GACAAAATAA AGAAGGCATG 840 

ATATAATATA AAGTAGATTT CTATGTCATA AAACAAGAAC TGTTTGGACA TCATTCATTT 900 

GAAAACTCTC TATG7TCAAA CAATAGTAAA ATAAAATAGG GGATCTAAAT CCTTGCTATG 9 60 

AAAGGAAAAA ACTCAATGGC TACT ATT CAA TGGTTTCCTG GTCACATGTC TAAAGCTCGT 102 0 

CGACAGGTGC AGGAGAATTT AAAATTTGTT GATTTTGTGA CGATTTTACT AGATGCACGC 1080 

TTGCCTCTAT CTAGTCAAAA TCCTATGTTG ACCAAGATTG TTGGTGATAA ACCAAAACTC 1140 

TTGATTTTAA ACAAGCCCGA CTTGGCTGAT CCAGCAATGA CCAAGGAATG GCCTCAGTAT 1200 

TTTCAATCAC AAGGAATCCA GACGCTACCT ATCAACTCCA AAGAGCAAGT GACTGTAAAA 12 60 

GTTGTAACAG ATGCGGCCAA GAAGCTCATG GCTGATAAGA TTGCTCGCCA GAAAGAACGT 13 20 

CCGATTCACA TTGAAACCTT GCGTACTATG ATTATCGGGA TTCCAAACGC TGGTAAATCA 1380 

ACTCTGATGA ACCCTTTCGC TGGTAAAAAG ATTGCTGTTG TTGGAAACAA GCCAGGGGTC 1440 

ACAAAACGTC AACAATGGCT TAAAACCAAT AAAGACCTGG AAATCTTGGA TACACCGGGG 1500 

ATTCTCTGGC CTAAGTTTGA GGATGAAACT GTTGCACTTA AGTTGGCATT GACTGGAGCT 1560 

ATCAAAGACC AGTTGCTTCC TATGGATGAG GTTACCATTT TTGGTATCAA TTATTTCAAA 1620 

GAACATTATC CAGAAAACCT GGCTGAACGC TTCAAACAAA TGAAAATTGA AGAAGAAGCG 1680 

CCTGTGATTA TTATGGATAT GACCCGCGCC CTCGGTTTCC GTGATGACTA TGACCGTTTT 1740 

TACAGTCTCT TCGTGAAGGA AGTCCGTGAT GGCAAACTCG GTAACTATAC CTTACATACA 1800 

TTGGAAGACC TCGATGGCAA CGATTAAAGA AATCAAAGAA TTCCTTGTCA CAGTCAAGCA 1360 

GTTAGAAAGC CCTATTTTTT TAGAGCTTGA AAAGGATAAT CGCTCAGGAG TTCAAAAGGA 1920 

AATCAGCAAG CGTAAAAGAG CCATTCAACC TGAATTAGAT GAAAATTTGC GCTTGGAATC 19 80 

CATGCTTTCT TATGAAAAAG AACTTTATAA GCAAGGATTG ACCTTAATTG CAGGTATTGA 2040 

TCACGTTGG7 CGTGGTCCTC TTCCTGGTCC TGTAGTCGCT GCGGCCGTTA TTTTATCTAA 2100 
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AAATTGTAAG 


ATTAAACCTC 


TCAACGACAG 


CAAGAAAATT 


CCTAAAAAGA 


AACATCTGGA 


2160 


GATTTTCCAA 


GCCGTTCAAG 


ACCAAGCCTT 


GTCGATTGGA 


ATTGGTATCA 


TAGATAATCA 


2220 


GGTCATCGAC 


CAAGTCAACA 


TCTATGAAGC 


AACCAAACTA 


GCCATGCAAG 


AAGCAATCTC 


2280 


CCAGCTCAGC 


CCTCAACCAG 


AGCACCTTTT 


GATTGATGCC 


ATGAAACTGG 


ACTTGCCCAT 


2340 


TTCACAAACC 


TCCATTATCA 


AAGGAGATGC 


CAACTCCCTC 


TCTATCGCAG 


CAGCATCTAT 


2400 


AGTAGCCAAG 


GTAACACGTG 


ATGAATTGCT 


GAAAGAATAC 


GATCAGCAGT 


TCCCTGGCTA 


2460 


TGATTTCGCT 


ACTAATGCAG 


GATATGGCAC 


AGCTAAACAT 


CTGGAAGGCC 


TCACAAAACT 


2520 


AGGAGTTACC 


CCAATTCACC 


GAACCAGCTT 


TGAACCCC7T 


AAATCACTGG 


TTTTAGGTAA 


2580 


AAAAGAAAGT 


TAATTGAAAG 


GAAATAACAT 


GGAGGAACAG 


TCGGAAATAG 


TCCGTTCTAA 


2540 


GAAAGAATTC 


GCCTTTGCAT 


CCAGCACTAT 


ACTATCCCAA 


GTTGGTCGAG 


GAATCATTCT 


2700 


CGGCCTCATC 


GTTGGAATTA 


TCGTCGGATC 


cnrcG TTTC 


TTAATTGAAA 


AGGGCTTCCA 


2760 


CCTGATACAA 


GGAGTTTATC 


AACATCAAGG 


GTACTTAGTG 


CGCAATCTTT 


TTGTACTGGT 


2820 


TTTGTTTTAT 


ATACTCATCT 


GTTCGCTCAG 


TGCCAAACTA 


ACACCGTCAG 


AAAAAGATAT 


2880 


TAAAGGCTCA 


GGAATTCCTC 


AAGTCGAAGC 


CGAACTGAAA 


GCCCTCATGT 


CCCTCAACTG 


2940 


GTGGCGCAT? 


CTTTGGAAAA 


AATATGTGCT 


AGGTATTCTT 


GCTATTGCCA 


GTCGACTCAT 


3000 


CCTGGGTCGA 


GAGGGACCCA 


GCATTCAACT 


TGGAGCAGTT 


GGTGGTAAAG 


GAATTGCCAA 


3060 


GTGGCTCAAA 


TCCAGTCCAG 


TAGAGGAACG 


TTCCTTGATT 


GCCAGTGGAG 


CTGCAGCAGG 


3120 


TTTAGCCGCA 


GCCTTTAATG 


CTCCTATTGC 


AGCACTTCTC 


TTTGTTGTAG 


AAGAAGTCTA 


3180 


TCACCATTTT 


TCGCGCTTTT 


TCTGGGTCTC 


AACTCTAGCA 


GCCAGCATCG 


T AGC AAACTT 


3240 


TGTGTCTCTA 


CTCATGTTCG 


GTTTGACACC 


AGTATTGGAT 


ATGCCAGATA 


ACATTCCTCC 


3300 


CATGACCCTA 


GATCAGTATT 


GGATATATCT 


CGTCATGGGA 


ATTTTCCTTG 


GATTTTCAGC 


3360 


TTTTCTCTAT 


GAGAAAGCTG 


TATTAAACGT 


TGGAAGAGTT 


TATGACTTCA 


TTGGTCAAAA 


3420 


AATCCATTTG 


GATAGGGCTT 


ATTATCCCAT 


CTTGGCTTTT 


ATCCTTATCA 


TACCAGTCGG 


3480 


AATCTTCT7A 


CCTCAAATCA 


TTGGTGGCGG 


AAATCAGCTT 


CTCCTTTCTT 


TAACTGAACA 


3540 


AAATTTTAGT 


TTCCAAGTTT 


TATTAGCTTA 


CTTTTTAATC 


CCCTTTATT7 


GGAGTATGAT 


3600 


TAGCTATGGA 


AGTGGACTGC 


CAGGAGGAAT 


TT7CCTCCCC 


ATTTTAGCTC 


TTGGTTCTTT 


3660 


GCTTGGTGCC 


TTAGTTGGTG 


TTATCTGTGT 


CAATCTTGGA 


CTTGTCAGTC 


AAGAGCAATT 


3720 


CCCTATATTT 


GTCA7TCTAG 


GAATGAGTGG 


CTA7TTTGGA 


GCCATATCAA 


AAGCTCCCTT 


3730 


AACCGCTATG 


ATCCTCGTAA 


CTGAGATGGT 


AGGAGATAT? 


CGCAACCTTA 


TGCCACTTGG 


3840 


TCTTGTCACT 


CTTGTTTCTT 


ATATTATCAT 


GGATTTGCTC 


AAAGGTACGC 


CAGTCTATGA 


3900 
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AGCCATGCTG GAAAAAATCC TTCCAGAAGA AGTATCTAGC GAAGGAGAAG TTACACTTAT 3960 

CGAAATACCA CTTTCTGATA AAATTGCTGG GAAACAAGTT CATGAACTCA ACTTACCACA 4 020 

CAACGTCCTC ATCACAACTC AAGTCCATAA TGGCAAGAGC CAAACAGTTA ACGGCTCAAC 4 080 

CAGAATGTAT CTGGGTCATA TGATTCACCT GGTTATTCCA AAAAGTGAAA TTCGAAAAGT 4140 

CAAAGATTTG TTGTTGTACT ATGAGTATTT ACATAATTTA TGTTATGTAA ATGATCAGTT 4 200 

TGATTTATTT AGAA/^CCGaT^ITCTCAGGAA TGACATCGGT TATTTTTTAC TGATGAGGAA" - " 4260~ 

TTTTACATAT AAATAATTGA ACTTTATTAA AAATAAGACT ATAATTAACT TAGAAATGAT 4 320 

AAAGTATAAA GCTAGAAAGG AGTTTACTCT ATCAAATCTG TACAGTAAGA TTAAAATCAT 4 380 

GAAAAAGAAA ACAATACCAA TTATATAGAG AAATGAAATA GAAATAGGAT AAAACAATCA 444 0 

GGACAATCAA ATCAATTTCT AGCAATGTTT TAGAAGTCCA GATGTACTAT TCTAGTTTCA 4500 

ATCTATTATA CAATGTGTTT TGTATC7CAT AGCTCCTTAT ATAGCTCTTC AGTTATGTAG 4560 

TATTAACAGA AGTTTAGTGG GTCAGATTTT TATTATTTTC CTTATTCTGT TTTGTTTGTA 4 620 

GGTCTAAGTC TTTTTATCAC TTTGAAAAAC TCCTATAACA TCTTTCCGAA AAACTATAAT 4 680 

TTTCTTGAAA AATATACAAG TCTATGCTAT ACTACTAGTA TACTTACTTA TGGAGAAAAT 4740 

ACATGAAACG TGAGATTTTA CTGGAACGAA TCGACAAACT AAAACAACTC ATGCCCTGGT 4 800 

AAGTTCTGGA ATACTACCAA 4 820 

(2) INFORMATION FOR SEQ ID NO: 20: * 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21338 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
{ D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CTACGACATC ATGATTAACA GTCATGCGCT ACTACCAACT GAGCTATGGC GGATAAAATA 60 
GTCCGTACGG GATTCGAACC CGTGTTACCG CCGTGAAAAG GCCGTGTCTT AACCCCTTGA 120 
CCAACGGACC TTCTATCTGT AGCAGATATA ACCATTATAT CAATTTCTTG CTAATTGTCA 180 
ATCACTTTTG AGATTTTTTC TCTAAAATAT CTTTTAATTT TCTAATTTTT AATCTTGAAA 240 
TAGGACAACG ATGGTCTTCA TAGAAAACAA TTTCTAACTT TTTTCGATCA ATTTCTCTGA 300 
TATTACCTAT ATTTACCAAA AATGACTTGT GAGGAGAATA AAATCGCTGA GTATGTTTGT 3 60 

CC7TTTCCTC AATATCTGTC ATGGTACCAT AAAACTCTTT TGCAAAATTC TTACCAATAA 420 
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TCCCCAATTT ATGAGATACC CCTGTTGTTT CAATATACAA AATATCATGG TAAGGAATTT 480 

TTAAATCATT TCCCTTGTAA TTGTAGTCGA AATAATCTAC AACATCTTCA TTTTCAAGTA 540 

ACATACTCTT CCTG7AGAAG ATATTTTGCT CAATTCTCTT CTT AAA CATC TCATCATTGA 600 

TATCCTTATC AACAAAATCT AGGGCTGATA CCTGGTATTT ATAGGTTAGA GTCGCAAACT 660 

CTGATCGACT AGTGATAAAG ACGATAATAG CGTAAGGATT GTAATGACGA ATGAGCTGAG 720 

CCACTTCAAA TCCCTTTTTC TCAATTCCAT CAATATCGAT ATCTAGGAAA TAAAGCTGAT 7 90 

TTACTTCATC ATTTTCAATG TATTCTTCAA ATTCACGGAC TTTTCCCGTT GTCTTGTATG 84 0 

ATATTGGAAT ATTCGATTCT TTCGAAATTT CATCCAATAT TCTCTCTAGT CTCACTTGAT 900 

GTTCAATAAC ATCTTCTAAA ATTAAAACTT TCATTCAAAT TCCCTCTTAA ATCTAATGAT 9 50 

TTGTCTAAAT GTACTGCCTT CCATCTCTGT 7TCTAAAATA ATATTGTTGT ACTTATCTAG 1020 

TACTTCTTTC ACATTATTTA ATCCGACTCC GCGATTTCTT CCCTTAGTGG AGAATCCTAA 1080 

GGCAAATAGA TCTCCTGAAG GAGTCATCGT CATTTTACAT GAATTCTCAA TCACAATAAC 1140 

TGTTTCAGTT TCCATCTTAA TAACTGCTAC TTCCATCTGC TTTTTATACC TATCAGCCGA 1200 

TCCTTCGACA GCATTATTCA ATAAAACGCT CATGATACGA ACCAAATCCA ATAGTTCAAT 1260 

TCGAAGCTTG GTAATCGTAT CTTTTACTTC CAGTGTAAAC TCTACACCAT TATTTCGAGC 1320 

ATAGACAATT GACTGAGCAA CCAAACTTCG TAAAGCTGAG TCTTCTATGT TGTTCAAATC 1380 

AAAGTAAGTG TACTTATCTG AACGCAATTT ATGATTTGCT TTCACTAAAA CTTCATTGTA 1440 

AATTCTGTCA ATTTCCTGTA AATTACCACT GTCAATTGCC ATCTGCATGC TGACAAGCAT 1500 

TCCAGCATAA TCATGTCCAA AACCACGGAT TTCATTATAC AGACCAACAA TTTCATCTCT 1560 

GTAATTCTGT AAATGTTTCT GTTCAAATTT CTTCTGCTTC AAAGCAATCT CTTTCTCCAT 1620 

TTGAACTTTA TGAGAATTCA TTGCAAAGAA GGTCAAAAGG AGAGAGATAA AGACAATAGA 1680 

TGACAAAATA CTTCCAAAAC TATTCAAATG TTTAATCGTA CTTACCATAT CTGAAACGAA 17 40 

AGATACAATA TGTAGCAATA GTAAAGCAAA AAATACTTTT TTCAAGAAAG GATAAAGGTA 1800 

CTCCTTCTCA AAATAGGCTA GTTCCAAATG GAAATAGTAA ATGATTTTTA ATGTAACAAA 1860 

ATAGGTTAAC ACCGTCACAA CGAAAAAGAA TGGGAAATGA TATTGTAAAA CAAAATTATC 192C 

TCCTGTTATA GAGGAGAAAA TTACGGACAG AAAGTTATCA GTGCTCTCAT ATAAAAGAGA 1980 

TAGTAGTAAA CTTAGGAA7A GTCCTCTATC CCTCTCATAC TGTTTCATCC ATCGAAAATA 2040 

GGAATATAAG CCCAAAGGAA ATAAAAATCT TTCAATCCCT ATTTTATCTA AATATAGAAG 2100 

ATAAAAGGAA AATTCAAGTA CTATTTCAGT TAGTAATGTA TAAGCACCAA AAACGTATAA 2160 

TTCTTTTCTA TTTATTCGAC CTTTACAAAT 7AAACGGTAA CTGTGACTAA TAATTAAAAA 22 20 
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ATGAACAATA 


ACTGTCCCAA 


ATCCAAGTAA 


ATCCATTACT 


CTTTCTCCTT 


ATTTCATTAC 


2280 


TTTTTTCGTA 


GGAAAAGAAA 


ATCAAGGATG 


ATTCTTGAAA 


TCCTCATCTC 


CCCACCTTTA 


2340 


ATCTTTTGTA 


AGTCTTTTTC 


CTTCAAAGCT 


ACAAACTGTT 


CCAATT7AAC 


TGTGTTTTTC 


2400 


ATAATAAAAT 


CTCCTAAAAT 


GTTTTTT CTT 


GTAAGCTAAC 


TTACAAAAAC 


CATTATACAA 


2460 


AATGGAATTT 


CGTTTTAGAT 


AAAATTCTCT 


CAACTGTCAT 


TTTTTTCTCC 


CAAAGTGTAC 


2520 


TTTTTTAAGA 


AAAAAGCCGG 


GAAAATTCCC 


"AGCTTTGCTA 


TTATATTGAT 


CCCAGCAGGA 


2 580 


TTCGAACCTC 


CGACCGTTCG 


CTTAGAAGCC 


GAATGCTCTA 


TCCAGCTGAG 


CTATGACACC 


2640 


TAATACAATT 


ATTCTACCAA 


AAATTCAATT 


AAAAGTCAAT 


TTTCTATTTA 


TGGTAGGGCA 


2700 


ATCCCTGCTG 


AATCGTAAAA 


GCGCGATAGA 


TTTGTTCAAC 


AAGAACTAGT 


CTCATTAACT 


2760 


GATCGGGTAA 


GGTTAGGCGA 


CCAAAACTGA 


CACAAAGATT 


GGCTCTATTT 


TTTACAGATG 


2820 


ATGATAATCC 


TAAACTTCCC 


CCAATAATAA 


AAGTAAGACT 


AGAAAATCCT 


TTTATAGAAG 


2880 


TTTCTTC7AA 


CTGCTTACTA 


AATTCTTCTG 


AGAAGAAAGT 


TTTCCCTTCA 


A7GCCTAACA 


2940 


CAATAACGAA 


ATCACCGTCA 


GCAATTTTTG 


ATAAAATTCT 


CTGACCTTCT 


ATTTCTAAAA 


3000 


TCTTTTGATT 


TTCTGATTCA 


CTGGCCTTAT 


CTGGTGTTTT 


TTCATCTGAT 


AACTCAATCA 


3060 


TTTCAAACTT 


AGCAAATCTA 


GAAATTCGT? 


TTGAATACTC 


TGCGATACCA 


TCTTTTAAAT 


3120 


ACTTTTCTTT 


CACTTTCCCA 


ACTGTTACAA 


CTTTAATTTT 


CATGACTCTA 


TTCTAACATA 


3180 


TTCTCTATTT 


TTTCACATCT 


TATTCACAAA 


ATAAAAAATA 


GATTTCAATT 


AAGAAAATCA 


3240 


CAA7TTCAAA 


AGAGTTATCC 


ACAGTTTC7C 


TAAAACTTTT 


G7CTTTAACT 


7A7AA7TAAG 


3300 


CTAGTCAGTT 


TATACTTTCA 


GTAATTCAAA 


CATATGCAGG 


CAAATATGAA 


ACATCTAAAA 


3360 


ACATTTTACA 


AAAAATCGTT 


TCAATTATTA 


CTCCTTATCG 


TCATTAGCTT 


TTTTAGTGGA 


3420 


GCCTTGGGTA 


GTTTTTCAAT 


AACTCAACTA 


ACTCAAAAAA 


CTAGTGTAAA 


CAACTCTAAC 


3430 


AAC AAT ACTA 


CTATTACACA 


AACTUCCTAT 


!\ JVf*R ff* A H. K 

AAUAALUAAA 


ATTCAACAAC 


AC AGG CTGTT 


3540 


AACAAAGTAA 


AAGATGCTGT 


TGTTTCTGTT 


ATTACTTATT 


CGGCAAACAG 


ACAAAATAGC 


3600 


GTATTTGGCA 


ATGATGATAC 


TGACACAGAT 


TCTCAGCGAA 


TCTCTAGTGA 


AGGATCTGGA 


3660 


GTTATTTATA 


AAAAGAATGA 


TAAAGAAGCT 


TACATCGTCA 


CCAACAATCA 


CGTTATTAAT 


3720 


GGCGCCAgCA AAGTAGATAT 


TCGATTGTCA 


GATGGGACTA 


AAGTACCTGG 


AGAAATTGTC 


3780 


GGAGCTGACA 


CTTTCTCTGA 


TATTGCTGTC 


GTCAAAATCT 


CTTCAGAAAA 


AGTGACAACA 


3840 


GTAGCTGAGT 


TTGGTGATTC 


TAGTAAGTTA 


ACTGTAGCAG 


AAACTGCTAT 


TGCCATCGGT 


390C 


AGCCCGTTAG 


GTTCTGAATA 


TGCAAATACT 


GTCACTCAAG 


GTATCCTATC 


CAGTCTCAAT 


3960 
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AGAAATGTAT CCTTAAAATC GGAACATCGA CAAGCTATTT CTACAAAAGC CATCCAAACT 4C20 

GATACTGCTA TTAACCCAGG TAACTCTGGC GGCCCACTGA TCAATATTCA AGGGCAGGTT 4080 

ATCGGAATTA CCTCAAGTAA AATTGCTACA AATGGAGGAA CATCTGTAGA AGGTCTTGGT 4 140 

TTCGCAATTC CTGCAAATGA TGCTATCAAT ATTATTGAAC ACTTAGAAAA AAACGGAAAA 4 200 

GTGACGCGTC CAGCTTTGCG AATCCAGATG GTTAATTTAT CTAATGTGAG TACAAGCGAC 4 2 60 

ATCAGAAGAC TCAATATTCC AAGTAATGTT ACATCTGGTG TAATTGTTCG TTCGGTACAA 4 320 

AGTAATATGC CTCCCAATCG TCACCTTGAA AAATACGATG TAATTACAAA AGTAGATGAC 4 3 80 

AAAGAGATTG CTTCATCAAC AGACTTACAA AGTGCTCTTT ACAACCATTC TATCGGAGAC 4 4 40 

ACCATTAAGA TAACCTACTA TCGTAACGGG AAAGAAGAAA CTACCTCTAT CAAACTTAAC 4 500 

AAGAGTTCAG GTGATTTAGA ATCTTAATTG ACATC7ATCT AAACAAAGCT TTACATAAGA 4560 

GAAAAGATGT GTTAGTGTAG AATCATGGAA AAATTTGAAA TGATTTCTAT CACAGATATA 462 0 

CAAAAAAATC CCTATCAACC CCGAAAAGAA TTTGATAGAG AAAAACTAGA TGAACTAGCA 4 680 
CAGTCTATCA AAGAAAATGG GGTCATTCAA CCGATTATTC TTCGTCAATC TCCTGTTATT . 47 40 

CGTTATGAAA TCcTTGCAGG AGAGAGACCC TATCGGGCTT CACTTTTAGC TCGTCTACGG 4 800 

TCTATCCCAG CTGTTGTTAA ACAGATTTCA GACCAAGACA TGATGGTCCA GTCCATTATT 4 8.60 

GAAAATTTAC AGAGAGAAAA TTTAAACCCA ATAGAAGAAG CACGCGCCTA TGAATCTCTC 4 920 

GT AG AG AAAG GATTCACCCA TGCTGAAATT GCAGATAAGA TGGGCAAGTC TCGTCCATAT 4 980 

ATCAGCAACT CCATTCGTTT ACTTTCCTTG CCAGAACAGA TTCTTTCAGA AGTACAAAA7 504 0 

GG C AAA CT AT CACAAGCCCA TGCGCGTTCC CTAGT7GGG7 TAAATAACGA ACAACAAGAC 5100 

TATTTCTTTC AACGGATTAT AGAAGAAGAT ATTTCTGTAA GG AAA TTAGA AGCTCTTCTG 5160 

AGAGAGAAAA AACAAAAGAA ACAGCAAAAA ACTAATCATT TCATACAAAA TGAAGAAAAA 522 0 

CAGTTAACAA AACTACTCGG ATTAGATGTA GAAATTAAAC TATCTAAAAA AGACAGTGGA 52 80 

AAAATCATTA TTTCTTTTTC AAATCAAGAA GAATATAGTA GAATTATCAA CAGCCTGAAA 5 3 40 

TAAGGCTGTT CTTTTATTTT TTTATCTCAC AAGGTTATCC ACTATG7TTT TCGATAAAAA 54 00 

GCTTAATAAA TCAATAATTT CTTCTTTTAT CCCCAACCTG TGGATAAAGT TTGGTAACAT 54 60 

TGTGGATTAT TTTTCACAGC TTGTGGAAAA TTCTTGCTAT CTATGGTAAA ATATCTCTAG 5520 

TATTAAACTT TTAAATAGTA AAGGAGGAGA AACGATTGAA AGAAAAACAA TTTTGGAATC 5580 

GTATATTAGA ATTTGCACAA GAAAGACTGA CTCGATCCAT GTATGATTTC TATGCTATTC 5640 

AAGCTGAACT CATCAAGGTA CAGGAAAATG TTGCCACTAT ATTTC7ACCT CGGTCTGAAA 5^00 

TGGAAATGGT CTGGGAAAAA CAACTAAAAG ATA7TATTGT AGTAGCTGGT TTTGAAATTT 57 60 
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ATGACGCTCA AATAACTCCC CACTATATTT TCACCAAACC TCAAGATACG ACTAGCTCAC 5820 

AAGTTGAAGA AGCTACAAA? TTAACTCTTT ATAACTATAG TCCAAAGTTA GTATCTATTC 5880 

CTTATTCAGA TACGGGATTA AAAGAAAAGT ATACCTTTGA TAACTTTATT CAAGGGGATG 5940 

GAAATGTTTG GGCTGTATCA GCCGCTTTAG CTGTCTCTGA AGATTTCGC7 CTGACCTATA 6000 

ACCCTCTTTT TATCTATGGA GGACCAGCCC TTCGTAAGAC TCACTTATTA AACGCTATTG 6060 

GAAATGAAAT TCTAAAAAAT~ ATTCCTAATG CGCGTGTTAA AT AT ATCCCT GCCGAAAGC? 6120 

TTATTAATGA CTTTCTTGAT CACCTAAGAC TTGGGGAAAT GGAAAAGTTT AAAAAGACCT 6180 

ATCGTAGTCT TGATCTTTTG TTAATCGATG ATATCCAGTC ACTCAGCGGA AAAAAAGTCG 62 4 0 

CAACTCAGGA AGAATTTTTC AATACCTTTA ACGCCCTTCA TGACAAGCAA AAACAGATTG 6300 

TCCTAACGAG TCATCGTAGT CCAAAACATC TAGAAGGGCT CGAGGAGAGG CTTCTCACCC 63 60 

GTTTTAGTTG GGGATTGACA CAAACTATCA CCCCCCCTGA CTTTGAAACA CGTATTGCCA 642 0 

- TTTTACAAAG TAAGACGGAA CATTTAGGCT ACAATTTCCA AAGTGATACT CTAGAATACC 6430 

TAGCTGGGCA ATTTGATTCA AATGTTCGAG ATCTTGAGGG AGCCATCAAC GACATCACTT 654 0 

TAATTGCCAG AGTAAAAAAA ATCAAGGATA TCACTATTGA TATTGCTCCA GAAGCCATTA 66C0 

GAGCCCGCAA ACAAGATGTT AGCCAAATGC TCGTCATCCC AATTGATAAA ATCCAAACTG 6 660 

AAGTTGGTAA CTTTTATGGT GTTAGTATCA AAGAAATGAA CCGAAGTAGA CGCCTTCAAA 672 0 

A7A7TCTTTT GGCCCGTCAA GTAGCCATGT ATTTATCTAG AGAACTAACA GATAATAGTC 6780 

TTCCAAAAAT TGCGAAGGAA TTTCGGCGAA AACATCATAC CACACTCAT7 CATGCCCATG 6340 

CCAAAATAAA ATCTTTCATT GATCAAGACG ATAATTTACC TTTAGAAATT GAATCAATCA 6900 

AAAAGAAAAT CAAATAATTT GTGGATAACT TTTAGTTTTT TATCTTTTTT ATCCACATTT 6960 

TTTAAACAAG CTAAAAAACT TGATATGACT TGTTT AAA GG CTGTTTTCCA CAGATTTCAC 7020 

AGACTCTATT ATTACTATTA TCTTTCTAAT ACTAAAAATA AATAAAGGAG AATCCATGAT 7030 

TCATTTTTCA ATTAATAAAA ATTTATTTCT ACAAGCATTA AAT ACT ACTA AGAGAGCTAT 7140 

TAGTTCTAAA AATGCCATTC CTATTTTATC AACACTAAAA ATTGACGTGA CCAATGAAGG 72 30 

TATTACTTTA ATTGGTTCAA ATGGTCAAAT TTCAATTCAA AATTTTATTT CTCAAAAAAA 72 60 

TGAAGATGCT GGTTTGTTAA TTACTTCTTT AGGTTCGATC CTTCTTGAAG CTTCTTTCTT 7 3 20 

TATCAATGTA GTATCTAGTT TACCTGATGT AACTCTTGAT TTTAAAGAAA TTGAACAAAA 7 3 80 

TCAAATTGTT TTAACCAGTG GCAAATCAGA AATTACCCTA AAAGGAAAAG ATAGCGAACA 7440 

ATATCCACGA ATCCAAGAAA TTTCAGCAAG CACTCCTTTA ATACTTGAAA CAAAATTACT 7 500 
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CAAGAAAATT ATTAATGAAA CACCCTTTCC TGCAAGTACA CAAGAGAGTC GTCCGATTTT 7 560 

AACAGGTGTC CACTTCGTAT TGAGTCAACA CAAACAGTTA AAAACAGTTG CAACAGACTC 7 620 . 

TCATCCCCTA AGCCAGAAAA AATTGACTCT TGAAAAAAAT AGTGATGATT TTGATGTCGT 7 680 

AATTCCTAGC CGTTCTCTAC GCGAATTTTC AGCGGTATTT ACAGATGATA TCGAAACTGT 77 40 

AGAGATTTTC TTTCCCAATA ACCAAATCCT CTTTACAACC CAAAATATTA CCTTCTATAC 7 800 

TCGTCTCCTA GAAGGAAACT ATCCTGATAC AGATCGCTTG ATTCCAACAG ACTTTAACAC 7 860 

TACT ATT ACT TTTAATCTGG TAAACTTACG CCAGTCAATG GAGCGTGCCC GTCTTTTATC 7920 

AAGTGCCACT CAAAATGGTA CTGTGAAACT TGAAATTAAG GATGGGCTTG TTAGCGCCCA 7 930 

TCTTCACTCT CCAGAAGTTG GTAAAGTAAA CGAAGAAATC GATACTGATC AGGTTACTGG 8040 

TGAAGATTTG AC C ATT ACT T TCAACCCAAC TTACTTGATT GATTCTCTTA AAGCTTTAAA 8100 

TAGCGAAAAG GTGACTATTA CCTTTATCTC AGCTGTTCGT CCATTTACTC TTGTGCCAGC 8160 

AGATACTGAC GAAGACTTCA TGCAGCTCAT TACACCAGTT CGTACAAATT AAGTGAAAGA 82 20 

GGTTGAGCCT GGCTCCCCTC TTTTATGATA TAATCGAAAA AGAAAAGGAG AGTAGTATGT 82 90 

ATCAAGTTGG AAATTTTGTT GAGATGAAAA AATCACACGC TTGTACAATC AAGTCGACTG 3340 

GTAAAAAGGC TAATCCTTGG GAAATTACAC GTGTAGGAGC ACATATCAAA ATAAAATGTA 84 00 

GTAATTGTGA GCATGTTGTC ATGATGGGGC GATATGATTT TGACCGAAAA ATGAATAAAA 8460 

TTATTGACTG AGAACCCTTA GTTAGAGGGT TAGCACTTTA TCCCTTTTTG TGTTATAATA 8520 

TTAGGGATTG AAATGAAAAC GGAGAATGAG AAATATGGCT TTGACAGCAG GTATCCTTGG 8580 

TTTGCCAAAC GTTGGTAAAT CAACACTATT TAATCCAA7T ACAAAAGCAG GAGCAGAGGC 8 640 

AGCAAACTAC CCATTTGCGA CGATTGATCC AAATGTTGGA ATGGTGGAAG TTCCAGATGA 8 7 CO 

ACGCCTACAA AAACTAACTG AAATGATAAC TCCTAAAAAG ACAGTTCCCA CAACATTTGA 37 60 

AT TT AC AG AT ATTGCAGCGA TTGTAAAAGG AGCTTCAAAA GGAGAGGGGC TAGGGAATAA 8 820 

ATTCTTCGCC AATATTCGTG AAGTAGATGC GATTGTTCAC GTAGTTCGTC CTTTTGATGA 8880 

TGAAAATGTA ATGCCCGAGC AAGGACGTGA AGACGCCTTT GTAGATCCAC TTGCAGATAT 89 40 

TGATACCATT AATCTCCAAT TGATTCTTGC TGACTTAGAA TCACTGAACA AACGATATGC 9000 

GCGTGTAGAA AAGATGGCAC GTACGCAAAA AGATAAAGAA TCAGTAGCAG AATTCAATGT 9060 

TCTTCAAAAG ATTAAACCAG TCCTAGAAGA CGGGAAATCA GCTCGTACCA TTGAATTTAC 9120 

AGATGAGGAA CAAAAGCTTC TCAAAGGTCT TTTCCTTTTG ACGACTAAAC CAGTTCTTTA 91 30 

TGTAGCTAAT GTGGACGAGG ATCTGCTTTC AGAACCTGAC TCTATCGACT ATGTCAAACA 92 40 

AATTCGTGAA TTTGCAGCCA CAGAAAATGC TGAAGTAGTC CTTATTTCTC CCCCTGCTCA 9 300 
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GGAAGAAATT TCTGAATTGA ATGATGAAGA TAAAAAAGAG TTTCTTGAAG CCATTGGTTT 93 60 

GACAGAATCA GGTGTAGATA AGTTGACGCG TGCAGCTTAC CACTTCCTTC GATTGGGAAC 94 2 0 

TTACTTCACA GCTGGTGAAA AAGAAGTTCG CGCTTGGACT TTCAAACGTG GTATGAAGGC 9480 

TCCTCAAGCA GCTGGTATTA TCCACTCAGA CTTTGAAAAA GGCTTTATTC GTGCAGTAAC 9540 

CATGTCATAT GAAGATCTAG TGAAATACGG ATCTGAAAAG GCCGTAAAAG AAGCTGGACG 9600 

C7TCCGTGAA GAAGGAAAAG AATATATCGT TCAAGATCGC GATATCATGG AATTCCGCTT 9660 

TAATGTCTAA AAATTAATAA ATGGTGTCAA TTAGGTTGGA AAAAAATTCC AACCCTTTTG 9720 

GCTTTTGAAA GGAAAAATAA ATGACCAAAT TACTTGTACG CTTGGGAAAT CCAGGGGATA 97 80 

AATATTTTGA AACAAAACAC AATGTTGGTT TTATGTTCAT TGATCAACTA GCGAAGAAAC 9840 

AGAATGTCAC TT7TACACAC GATAAGATAT TTCAAGCTCA CCTAGCATCC TTTTTCCTAA 9900 

ATGGAGAAAA AATTTATCTG GTTAAACCAA CGACCTTTAT GAATGAAAGT GGAAAACCAG 9960 

TTCATCCTTT ATTAACTTAC TATGGTTTGG ATATTGACGA TTTACTTATC ATTTACGATG 10020 

ATCTTGACAT GGAAGTTGGG AAAATTCGTT TAAGAGCAAA AGGCTCAGCA GGTGGTCATA 10080 

ATGGTATCAA CTCTATTATT CAACATATAG GAACTCAGGT CTTTAACCCT GTTAACATTC 10140 

GAATTGGAAG ACCTAAAAAT GGTATGTCAG TTGTTCATCA TGTTTTGAGT AAGTTTGACA 10200 

GGCATGATTA TATCCCTATT TTACAGTCTG TTGACAAAGT TGACGATTCT GTAAACTACT 10260 

ATTTACAACA GAAAAATTTT GAGAAAACAA TCCACACGTA TAACGGATAA ATGGTGACCT 10320 

TATTAGATTT ATTCTCAGAA AATGATCAGA 7TAAAAAA7G GCATCAAAAT 7TAACAGATA 10 3 30 

AGAAAAGACA ACTAATACTT GGTTTATCAA CATCTACTAA GGCTCTTGCA ATTGCAAGCA 1C440 

GTTTAGAAAA AGAAGATAGG ATTGTGTTAT TCACGTCAAC TTATGGAGAA GCAGAACGAC 10500 

TTGTTAGTGA TCTTAT7TCT ATCTTGGGTG AGGAACTCGT CTATCCATTT TTGG7AGATC 105 60 

ATGCTCCTAT GGTGGAGTTT TTGATGTCTT CACAGGAAAA AATTATTTCA CGGGTTGAAG 10620 

CCTTGCGTTT TTTGACTGAT TCATCTAAGA AAGGGATTTT AGTTTGTAAT ATCGCAGCAA 10680 

GTCGATTGAT TTTACCCTCT CCCAATGCAT 7CAAAGATAG TATTGTAAAA ATCTCAGTTG 10740 

GTGAAGAATA TGATCAACAC GCGTTTATCC ATCAGTTAAA GGAAAATGCC TATCGAAAAG 10800 

TTACTCAAGT ACAAACTCAG GGCGAATTTA GTCTTCGAGG AGATATTTTA GATATTTTTG 10860 

AAATATCCCA GTTACAACCT TGTCGAATTG AG7TTTTTGG TGATGAAATT GATGGTATCA 10920 

GGTCATTTGA AGTAGAAACA CAATTATCGA AAGAAAATAA CACAGAACTC ACTATCTTTC 109aC 

CAGCTAGTGA TATGCTTTTG AGAGAAAAGG ATTATCAACG AGGACAGTCA GCTTTAGAAA 11040 
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AACAAATTTC 


AAAAACTTTA 


TCACCTATTT 


TGAAATCATA 


CCTAGAAGAA 


ATTCTTTCAA 


11100 


GTTTTCACCA 


AAAACAAAGT 


CATGCAGACT 


CTCGGAAGTT 


TTTATCTTTG 


TCCTATGATA 
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AGACATCGAC 


TGTCTTTGAT 


TATATTGAAA 


AAGATACTCC 


AATATTCTTT 


GATGATTATC 


11220 


AAAAATTGAT 
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GAAGTCTTTG 


AAAGAGACTT 
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11280 


AATTACAGAA 


TAGTAAAGCA 


TTTTCTGATA 


TGCAGTATTT 


TTCTGATATT 
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11340 


ATAAAAAACA 
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CTAATCTTCA 
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TTCAATCAAT 
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11460 


CTTTTCTAAA 


AGAAGAAATT 


GAACGATATA 


AAAAAATGGA 


TTACACCATT 


ATTCTGCAGT 


11520 


CTAGCAATTC 


AATGGGAAGT 


AAAACATTGG 


ACGATATCTT 


AGAGGAATAT 


CAGATTAAAT 


11580 


TCGATTCTAG 


AGATAAGACA 


AATATCTCTA 


AAGAATCTGT 


AAACTTAATA 


GAGGGTAATC 


11640 


TCAGACATGG 


TTTTCATTTT 


GTAGATGAAA 


AG ATT TT ATT 


GATAACTGAA 


CATGAGATTT 
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TTCAAAAGAA 


ATTAAACCGT 


CGTTTTCGAA 


GACAACATGT 


TTCAAATGCA 


GAGAGATTAA 


11760 


AAGATTACAA 


TGAACTTGAA 


AAAGGGGACT 


ATGTTGTCCA 
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11940 
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AGGCCAAGCA 


AAACCTTAAG 


AACCAGGTAG 


AGGATATAGC 


TGATGATTTA 


ATCAAACTCT 
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ACTCTCAACG 


TAGTCAGTTG 


AAGGGTTTTG 
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TGATGATGAT 


GATCAAGATC 


12120 


CCTTTGATGA 


TGCTTTCCCT 


TATGTTGAAA 


CGGATGATCA 


ACTTCGTAGT 
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7CAAGAGCGA 


TATGCAGGCT 


TCTCAGCCAA 


TGGATCGACT 


TTTAGTTGGG 


GATGTTGGTT 
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TTGGAAAGAC 
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ATGCGTGCAG 
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AGTCAATCAT 


CACAAACAGG 


12300 


TTGTCATTCT AGTTCCGACG 


ACGGTTTTAG 


CGCAACAGCA 


CTATACGAAT 


TTTAAGGAAC 
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AATATTGATG 
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CTTTAGAACT 


AAAAAAGAGC 
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GTCTATGCTG 
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AAAAG AATG A 
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TTACAACAAA 


GTTGACACAA 
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_ 1284C 
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TGATTCCGGA GGCTTCGATT GGATATGTTC ATGGTCGAAT GAGTGAAGTC CACTTCGAAA 12 900 

ATACTCTATT AGACTTTATT GAGGGACAAT ACGATATCTT GGTGACGACT ACTATTATTG 12960 

AGACAGGGGT GGACATTCCA AATGCTAATA CTTTATTTAT TGAAAATGCG GACCATATGG L3020 

GCTTGTCAAC CTTATATCAG TTAAGAGGAA GAGTCGGTCG TAGTAATCGT ATTGCTTATG 13080 

CTTATCTCAT GTATCGTCCA GAAAAATCAA TCAGTGAAGT CTCTGAAAAG AGATTAGAAG 13140 

CGATTAAAGG ATTTACAGAA TTCGGCTCTG CCTTTAACA7 TGCAATGCGA GATCTTTCCA 13200 

TTCGTGGAGC AGGAAATCTT TTAGGAAAAT CCCAGTCTGG TTTCATTGAT TCTCTTCGTT 13260 

TTGAATTGTA TTCCCAGTTA TTAGAGGAAG CTATTGCTAA ACGAAACGGT AATGCTAACC 13 320 

CTAACACAAG AACCAAAGGG AATGCTGAGT TGATTTTGCA AATTGATGCC TATCTTCCTC 13 380 

ATACTTATAT TTCTGATCAA CGACATAAGA TTGAAATTTA CAAGAAAATT CGTCAAATTG 13440 

ACAACCGTCT CAATTATGAA GAGTTACAAG AGGAGTTGAT AGACCCTTTT GGAGAATACC 13500 

CAGATGTAGT AGCCTATCTG TTAGAGATTG GTTTCGTCAA ATCATACTTG GACAAGGTC7 13 560 

TTGTTCAACG TGTGGAAAGA AAACATAATA AAATTACAAT TCAATTTGAA AAAGTCACTC 13620 

AACGACTGTT TTTAGCTCAA GATTATTTTA AAGCTTTATC CGTAACGAAC TTAAAAGCAG 13 680 

GCATCGCTGA GAATAAGGGA TTAATGCAGC TTGTATTTGA TGTCCAAAAT AAGAAAGATT 13*740 

ATGAAATTTT AGAAGGTTTG CTGATTTTTG GAGAAAGTTT ATT AG AG AT A AAAGAGTCTA 13 800 

AGGAAGAAAA TTCCATTTGA TATTTTTCTT CTATAAAATA GATAAAAATG GTACAATAAT 13860 

AAATTCAGCT AATAAGGATG AGATTAGATA AATATTTAAA AGTATCGCGA ATTATCAAGC 13 920 

CTCG7ACAG7 CGCAAAGGAA GTAGCACA7A AAGGTAGAAT CAACGT7AAT GGAATCTTCG 13 930 

CCAAAAGTTC AACGGACTTG AAAGTTAATG ACCAAGTTGA AATTCCCTTT GGCAATAAGT 14C40 

TGCTGCTTGT AAAAGTACTA GAGATGAAAG ATAGTACAAA AAAAGAAGAT GCAGCAGGAA 14100 

TGTATGAAAT 7ATCACTCAA ACACGGGTAG AAGAAAATGT CTAAAAATAT TGTACAAT~G 14160 

AATAATTCTT TTATTCAAAA TGAATACCAA CGTCCTCCCT ACCTCATGAA AGAACGACAA 14 220 

AAACGGAATC GTTTTATGGG AGGGGTATTG ATTTTGATTA TGCTATTATT TATCTTGCCA 14 230 

AC77TTAATT TAGCGCAGAG TTATCAGCAA TTACTCCAAA GACGTCAGCA ATTAGCAGAC 14 340 

TTGCAAACTC AGTATCAAAC TTTGAGTGAT GAAAAGGATA AGGAGACAGC ATTTGCTACC 144 00 

AAGTTGAAAG ATGAAGATTA TGCTGCTAAA TATACACGAG CGAAGTACTA TTATTCTAAG 144 60 

TCGAGGGAAA AAGTT7ATAC GATTCCTGAC TTGCTTCAAA GGTGATAAAA TCGAAAATTT 1452 0 

A77AGACGTA ATAGAGCAAT TTTTCAGTTT GTCAGATGAA AAGCTGGAAG AATTGGCTGA 145 80 
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TAAAAATCAA TTATTGCGTT TACAAGAAGA AAAGGAAAGG AAGAATGCGT AAATTCTTAA 14 640 

7TATTTTCTT GCTACCAAGT TTTTTGACCA TTTCAAAAGT CGT7ACCACA GAAAAAGAAG 14700 

TCGTC7ATAC TTCGAAAGAA ATTTATTACC TTTCACAATC TGACTTTGGT ATTTATTTTA 14760 

CAGAAAAATT AAGTTCTCCC ATGGTTTATG GAGAGGTTCC TGTTTATCCG AATCAAGATT 14 820 

TAGTACTGCA ATCTGGGAAA TTGACTCCCA AAACAAGTTT TCAAATAACC GAGTGGCGCT 14 830 

TAAATAAACA AGGAATTCCA GTATTTAAGC TATCAAATCA TCAATTTATA GCTGCGGACA 14 94 0 

AACGAT7TTT ATATGATCAA TCAGAGGTAA CTCCAACAAT AAAAAAACTA TGGTTAGAAT 15C00 

CTGACTTTAA ACTGTACAAT AGTCCTTATG ATTTAAAAGA AGTGAAATCA TCCTTATCAG 15060 

CTTATTCGCA AGTATCAATC GACAAGACCA TCTTTCTACA AGGAAGAGAA TTTCTACATA 15120 

TTGATCAGGC TGGATCGGTA GCTAAAGAAT CAACTTCTGA AGAAGATAAT CGGATCAGTA 15180 

AAGTTCAACA AATGTTATCT GAAAAATATC AGAAAGATTC TTTCTCTATT TATCTTAAGC 152 40 

AACTGACTAC TGGAAAAGAA GCTGGTATCA ATCAAGATGA AAACATGTAT GCAGCCAGCG 15 300 

TTTTGAAACT CTCTTATCTC TATTATACGC AAGAAAAAAT AAATGAGGGT CTTTATCACT 15 360 

TAGATACGAC TGTAAAATAC GTATCTGCAG TCAATGATTT TCCACCTTCT TATAAACCAG 15420 

AGGGAAGTGG TAGTCTTCCT AAAAAAGAAG ATAATAAAGA ATATTCTTTA AAGCATTTAA 154 80 

TTACGAAAGT ATCAAAAGAA TCTGATAATG TAGCTCATAA TCTATTGGGA TATTACATTT 15540 

CAAACCAATC TGATGCCACA TTCAAATCCA ACATCTCTCC CATTATCGCA GATGATTGGG 15600 

ATCCAAAAGA AAAATTGATT TCTTCTAAGA TGGCCGGGAA GTTTATGGAA CCTATTTATA 15660 

ATCAAAATCG ATTTGTGCTA GAGTCTTTGA CTAAAACAGA TTTTGATAGT CAGCGAATTG 15 7 20 

CCAAAGGTGT TTCTGTTAAA GTAGCTCATA AAAT7CGAGA TGCGGATCAA TTTAAGCATG 157 90 

ATACGGGTGT TGTCTATGCA GATTCTCCAT TTATTCTTTC TATTTTCAC? AAGAAT7CTG 15840 

ATTATGATAC GATTTCTAAG ATAGCCAAGG ATCTTTATCA GGTTCTAAAA TGAGGGAACC 15900 

AGATTTTTTA AATCATTT7C TCAAGAAGGG ATATTTCAAA AAGCATCCTA AGGCGGTTCT 15960 

AGCTCTTTCT GGTGGATTAG ATTCCATGTT TCTATTTAAG GTATTGTCJA CTTATCAAAA 15020 

AGAGTTAGAG ATTGAATTGA TTCTAGCTCA TGTGAATCAT AAGCAGAGAA TTGAATCAGA 16080 

TTCCGAAGAA AAGGAATTAA GGAAGTTGGC TGCTGAAGCA GAGCTTCCTA TTTATATCAG 16143 

CAATTTTTCA GGAGAATTTT CAGAAGCGCG TGCACGAAAT TTTCGTTATC ATTTTTTTCA 16200 

AGAGGTCATG AAAAAGACAG GTGCGACAGC TTTAGTCACT GCCCACCA7G CTGATCATCA 162 50 

GGTGGAAACG ATTTTTATGC GCTTGATTCG AGGAACTCGC TTGCCCTATC TATCAGGAAT 163 20 

TAAGGAGAAG CAAGTAGTCG GAG AG AT AG A AATCATTCGT CCCTTCTTGC ATTTTCAGAA 16 3 80 
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AAAAGACTTT CCATCAATTT TTCACTTTGA AGATACATCA AATCAGGAGA ATCATTATTT 16440 

TCGAAATCGT ATTCGAAATT CTTACTTACC AGAATTCGAA AAAGAAAATC CTCGATTTAG 16500 

GGATGCAATC TTAGGCATTG GCAATGAAAT TTTACATTAT GATTTGGCAA TAGCTGAATT 16560 

ATCTAACAAT ATTAATGTGG AAGATTTACA GCAGTTATTT TCTTACTCTG AGTC7ACACA 16620 

AAGAGTTTTA CTTCAAACTT ATCTGAATCG TTTTCCAGAT TTGAATCTTA CAAAAGCTCA 16680 

GTTTGCTGAA GTTCAGCAGA TTTTAAAATC TAAAAGCCAG TATCGTCATC CGATTAAAAA L674 0 

TGGCTATGAA TTGATAAAAG AGTACCAACA GTTTCAGATT TGTAAAATCA GTCCGCAGgC 16800 

TGATGAAAAG GAAGATGAAC TTGTGTTACA CTATCAAAAT CAGGTAGCTT A7CAAGGATA 16860 

TTTATTTTCT TTTGGACTTC CATTAGAAGG TGAATTAATT CAACAAATAC CTGTTTCACG 16920 

TGAAACATCC ATACACATTC GTCATCGAAA AACAGGAGAT GTTTTGATTA AAAATGGGCA 16980 

TAGAAAAAAA CTCAGACGTT TATTTATTGA TTTGAAAATC CCTATGGAAA AGAGAAACTC 17040 

TGCTCTTA7T ATTGAGCAAT TTGGTGAAAT TGTCTCAATT TTGGGAATTG CGACCAATAA 17100 

TTTGAGTAAA AAAACGAAAA ATGATATAAT GAACACTGTA CTTTATATAG AAAAAATAGA 17160 

TAGGTAAAAA ATGTTAGAAA ACGATATTAA AAAAGTCCTC GTTTCACACG ATGAAATTAC 17 220 

AGAAGCAGCT AAAAAACTAC GTGCTCAATT AACTAAAGAC TATGCACGAA AAAATCCAAT 17 280 

CTTAGTTGGG ATTTTAAAAG GATCTATTCC TTTTATGGCT GAATTGGTCA AACATATTGA 17340 

TACACATATT GAAATGGAC7 TCATGATGGT TTCTACCTAC CATGGTGGAA CAGCAAGTAG 174C0 

TGGTGTTATC AATATTAAAC AAGATGTCAC TCAACATATC AAAGGAAGAC ATGTTCTATT 17460 

TGTAGAAGAT ATCATTGATA CAGGTCAAAC TTTGAAGAAT TTGCGAGATA TGTTTAAAGA 17 520 

AAGAGAAGCA GCTTCTGTTA AAATTGCAAC CTTGTTCGAT AAACCAGAAG GACGTCTTCT 17 580 

AGAAATTGAG GCAGACTATA CTTGCTTTAC TATCCCAAAT GAGTTTGTAG TAGCTTATGG 17 640 

T7TAGACTAC AAAGAAAATT ATCGTAATCT TCCTTATATT GGAGTATTGA AAGAGGAAGT 17700 

GTATTCAAAT TAGAAAGAAT AATCTTTAAT GAAAAAACAA AATAATGGTT TAATTAAAAA 177 60 

TCCTTTTCTA TGGTTATTAT TTATCTTTTT CCTTG7GACA GGATTCCAGT ATTTCTATTC 17 820 

TGGGAATAAC TCACGAGGAA GTCAGCAAAT CAACTATACT GAGTTGGTAC AAG AAATT AC 17 880 

CGATGGTAAT GTAAAAGAAT TAACTTACCA ACCAAATGGT AGTGTTATCG AACTTTCTCG 17940 

TGTCTATAAA AATCCTAAAA CAAGTAAAGA AGAAACAGGT ATTCAGTTTT TCACGCCATC 18000 

TGTTACTAAG G7AGAGAAAT TTACCAGCAC TATTCTTCCT GCAGATACTA CCGTATCAGA 18060 

ATTGCAAAAA CTTGCTACTG ACCATAAAGC AGAAGTAACT GTTAAGCATG AAAGTTCAAG 18120 
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TGCTATATGG 
ATTCTCTATG 
TAGTAAGGCT 
AGCTGAGGAA 
A77CACAAAA 
AGGTAAAACT 
CTCAGGTTCT 
TTTTGAGGAT 
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ATTAATCTAC 
ATGGGAAATA 
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7CGTATCCA7 
TGGGAGGAGG 
ATAAAGAAGA 
AACTAGTTGA 
GTATTCCAGC 
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AAATGTTTGT 
CAGCACCAGC 
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TGTGCCATTT 

CAATGGCCCT 

TATTAAAGTA 
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GGAAKCTAT 
AATCCAATCA 
AGATTTTCAG 
TTCTTAAAAG 
TTGGAGGGAC 
CG7GT7CCA7 
GCTAGTCC7C 
ATCGATGAAA 
GAACGTCAAC 
GGGA7TATCG 
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TCT7CTTCC7 
G77TTGGACG 
ATGTTGCTGG 
ATCCAAAACG 
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TACATGAATT 


GATTAGTAGA 


GAAGAAGGAC 


TGG7AGACGA 


TATTCCACCT 


TTAAGGAAAT 


19980 


ATTTCAAGAC 


CAACTTTCGA 


AATCGAAT7T 
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AGCGTAGATA 


CGATAAAGAA 


CCC7ATGAAG 
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CTCCCTAGA7 


GATTATTATC 


TCTTTCATGA 
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GAT TAT AG AA 


20160 


ACAAACAAAG 


TAAAGAGAAA 
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TAGAACGCG7 
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AGAGACTTAC 
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TATAATAG7A 


AGAGT7GAAA 


ATAACAACTC 
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TAAGACACCG 


CCTTTTCACG 


GCGG7AACAC 


GCG7TCGAAT 


CCCGTACGGA 


C7ATGGTATG 


20460 


TTGCGTCAGG 


ACCAC7TGAT 


GAAAAAAAGT 


TTAAAAAAAC 


TTAAAAATC7 


TCAAAAAAGT 


20520 


GTTGACAAGC 


GAAAGCAGTT 


GTGATATACT 


AA7ATAGTTG 


TCGCT7GAGA 


GAAGCAAGTG 


20580 


ACAAAGACCT 


TTGAAAACTG 


AACAAGACGA 


ACCAA7G7GC 


AGCGCGC7AC 


AACGTAAGTT 


20640 


GTAGT AC7GA 


ACAATGAAAA 


AAACAATAAA 


TCTGTCAGTG 


ACAGAAATCA 


GTAAGAACTC 


20700 


AAACTTTTTA 


ATGAGAGTTT 


GATCC7GCCT 


CAGGACGAAC 


GCTGGCGGCG 


7GCCTAATAC 


20760 


ATGCAAGTAG 


AACGCTGAAG 


GAGGAGCTTG 


CT7C7CTGGA 


TGAGTTGCGA 


ACGGG7GAGT 


20820 


AACGCGTAGG 


TAACC7GCC7 


GG7AGCGGGG 


GATAAC7A7T 


GGAAACGA7A 


GCTAATACCG 


20880 


CA7AAGAC7A 


GA7GTTGCAT 


GACATTTGC7 


TAAAAGGTGC 


ACTTGCATCA 


CTACCACATG 


20940 


CACCTGCCTT 


CTATTACCTA 


G77GG7CGGG 


7AACGGC7CA 


CCAAGCCGAC 


GATACATAGC 


21000 


CCACCTGAGA 


GGGTGATCGG 


CCACACTGCC 


ACTGACACAC 


GGCCCAGAC7 


CCTACGGGAG 


21060 


GCAGCAGTAG 


GGAATC77CG 


GCAATGGACG 


GAAGTC7CAC 


CGAGCAACGC 


CGCGTGAGTG 


21120 


AAGAACGTTT 


TCGGATCG7A 


AAGCTCTG7T 


CTAAGAGAAG 


AACGACTGTG 


AGAGTGGAAA 


21180 


GTTCACACTG 


TGACGGTATC 


TTACCAGAAA 


GGGACGGC7A 


ACTACC7GCC 


AGCAGCCCCG 


21240 


GTAATACGTA 


GG7CCCGAGC 


GTTG7CCGGA 


TT7AT7GGGC 


GTAAAGCGAG 


CCCAGGCGGT 


21300 


TAGATAAGTC 


TCAAGTTAAA 


GGC7GTGGCT 


7AACCA7A 






21338 


(2) INFORMATION FOR SEQ ID NO: 21 











<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 627 3 base pairs 
{ B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 
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(xi) 


SEQUENCE DESCRIPTION: 


SHQ ID NO: 


21: 






TGTTTTTAAA 


GAGCCGTGTC 


TGGATAGACT 


TTCGGACCCA 


ACGC7CTA7T 


AGATAATGAA 


60 


CTGCCTATAC 


ACAAGATTTC 


TAACCTTAGT 


CGACATGAGC 


TGAAACCTCT 


TATTTGTTAA 


120 


GTAGTTCACA 


AAATATTATA 


CACCTATTTT 


ATGAATAGTC 


AACTGTCTT7 


ACAGTAAAAT 


180 


TTTAGAAAAT 


CATGAAAATT 


TTCTCTTTCT 


TTCCATTTTA 


AGTGACATTC 


AGTCATTCTC 


240 


ACATCAAAAA 


AGCCCAGACG 


AAA7TGTCTG 


AGCATTCTTT 


TATC7AGTCG 


TTTAAGGAAG 


300 


TTGAGTTCAC 


TATGTTTAAA 


CTCTCTGTCC 


CATCATTTCT 


TCAACAAACC 


TTGTTCTTGG 


360 


AG AAACTC CT 


TGGCTACTTC 


CTTTGCTGAC 


TTGCCTTCAA 


CACCGACTTG 


GTAGTTGAGC 


420 


TGGCTCATCT 


GGCTTTCTGT 


AATCTTACCA 


GCCAA7GTAT 


TAACAACTCT 


TTCCAACTCT 


430 


GGGTGTTTCT 


TGAGAAGAGC 


TTCTTTCATG 


AGTGGAGCCC 


CTTCATAAGG 


TGGGAAGAGT 


S40 


TCCTTGTCAT 


CTTCCAAGAC 


CTGTAAATCA 


TAACGCTCCA 


AT7CCGCATC 


AGTCGAATAG 


600 


GCATCCGTGA 


TTTGAATATC 


CCCTGACTGA 


ATAGCCTGAT 


AGCGAAGGCC 


TGCCTCAATG 


660 


GTCGCTACAT 


TGAGATTCAG 


ACCATACATT 


GA77GCAAGC 


CC7TA7TTCC 


ATCTTCACGG 


720 


TCCTTAAACT 


CGAGTGTAAA 


ACCTGCCTTC 


AACTGCCCTT 


CCACTTTTTT 


CAAGTCTGAA 


780 


ATCGTC7TCA 


AGCCATATTC 


TTGAGCAATC 


77TT7CGGAA 


CAGCTACAGC 


ATAGGTGTTT 


840 


TGATAAGACA 


TGGGTTTGAG 


ATAGGCTACA 


TGATCCTGCT 


TAGCAATGCC 


ATCACGCGCC 


900 


ACCTGATAAA 


CCTGTTCTGG 


TTCATGACTC 


ACCTTGGGTG 


ATGGTTGAAG 


CAAACTTTCA 


960 


GTCACCGTAC 


CAGTAAATTC 


ACCATAGATG 


7CAA7ATCGC 


CTTT7TTCAG 


AGCTTCATAA 


1020 


AGGAAGCTTG 


TCTTCCCAAA 


ATTCCGTTTA 


ACAGTCGCAG 


7CATGCTGGT 


ATTTT^*T^CA 


1080 


ATCAGCAACT 


TATACATATT 


GGCCAAAATT 


TC7GGTTC7G 


CACCTATTTT 


CCCAGCAATA 


1140 


ACCAAGTTTT 


CCTTCTCTTT 


TTGAACCAAA 


AGAGCTGGAC 


TATAAGACAG 


ACCCAGTAAT 


1200 


AAAGCCACCA 


ACGCAAAACC 


TGAGAAAATC 


GTCCGTAATT 


TTGCTTTTTC 


CATCACTTTT 


1260 


AGTAGGAAGT 


TAAAGGCAAT 


CCCTAGCACT 


GCAGAAGAAA 


GTGCCCCAAT 


CAAAATCAAA 


1320 


CTGGCATTAT 


TACGGTCAAT 


TCCCAAAAGA 


ATAAAGGAAC 


CTAGTCCCCC 


TGCACCAATC 


1380 


AACGCCGCCA 


AGGTTGCCGT 


ACCGATAATC 


AAAACAGCTG 


CCGTCCGAAT 


CCCAGACATG 


1440 


ATAACAGGCA 


TGGCGAGTGG 


AATTTCAAAT 


TTCTTGAGAC 


CTTCCCA7CT 


GGTCATCCCA 


1500 


AAGGCAATCC 


CACCCTCTTG 


CAGCTTCGGA 


TCAATTCCC7 


TCAGCCCAGT 


GATAGTATTT 


1560 


TGCAAAATAG 


GGAAAATCGC 


ATAAATCACT 


AGAGC7GTCA 


AAGCCGGCAA 


GGTCCCAATT 


1620 


CCCATCAAAG 


GGATAAAGAG 


CCCCAACAAG 


GCCAGAGACG 


GGATGCTCTG 


GAAAATACCT 


1S80 


GCAATCTGCA 


ACACCCAGTC 


GGCCAGCTTC 


TCATGATAGC 


GAAGAAAAAC 


AGCCAAGGGA 


1740 
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ATCGCAAGCA AAATAGC7AG TAACAAGGTC AAAAGCGACA ACTGCAAATG TTGAGATAGA 1800 

GCTGTCAACC AATCACTAAA ACGATCCTCA AAACTTGCAA 77AAATTAGT CATGAACACT 1860 

ACCTCCAAAC AAGTC7GCTA CAAAGTCTGT TGCAGGCGCT 77TAAAATTC TCTCGGGATT 1920 

CGCTACCTGG CGAATTTCTC CATCCTGCAA GACAGCAATA CCCTCCGCCA ACTTCAAGGC 1980 

TTCATCCCTA TCA7CGG77A CAAAAATCGT TGTCATCCCA AACTCTTTAT GCAATTCTTT 2 040 

TG*f CAGAACC TGCAAC7G7T 7TCTCGAAAT AGCATCCAAG GCCGAAAAGG GTTCATCCAT 2100 

GAGGAAAATC TTGGCCTGAC CAATCATAGC 7CGGACAATA CCCACCCCTT GC7GTTCTCC 2160 

ACCAGA7AAT TCACTAGGTA AGCGATGCCC ATACTCGGCT ACTGGTAAAC CAACCTTAGC 22 2 0 

CAAAAGCTCT TCTG7TTTC7 TCGTAATTTC T7CCTTGC7C CACCCCTTCA TTTCAGGAAT 2280 

GAGAGCAATA TTTTCCGCAA CTGTTAGATT TGGAAAAAGA GCAATAGCCT GTAAAACATA 23 4 0 

ACCAGTAGAA AGACGAACT7 CACGCTCATC ATACTC7TTG ATGCGCTTCC CATCCATATA 2400 

AATATTTCCA TCAGTTGGTT CCAAAAGACG GTTAATCATC TTGAGCATGG TCGTCTTACC 2460 

TGACCCAGAA GGCCCTACTA AAACCATAAA TTCCCCATCC TCAATCTGTA AGTTGACATC 2 520 

TC7CAAGACA TCCT77TCTG TG7ACCCCAG TGCTACATTT TTCTA7TCAA TCATTCTTTG 2 580 

TCCTCAATTT AAAACTTCCC TCCATTGGTC AAGTCTTCTA CCTTAGGCAT AACTTCCTTA 2 640 

TTATCCCAAT GCTCCACAA7 TTTCCCGTTC TCTAAACGGA AGATATCGTA CTGGGCATAA 2700 

GCAACCCCAT CAATCTGAGT CTGACCATAG CTAACCACAT AGTTTCCTTG TCCTAAGAGT 27 60 

7GGAAAACAA AGTCAAAAGT GACACTATAT TCAGCCACAT AGTTTTTATA ACCAGCACTT 2 820 

CCTTGTCCAA TATCATGATT ATGCTGAATC AAATCGTCTG CCACATAATC ACTCCACTGC 2 880 

TC7AGC77CC CA77TTGCAA AA~77*Z7j77 .V.G.W.C'C: ~.\\C7ACCTT 777A7777C7 29-40 

GCTTTCTTAT CCAAA7CCTT GATTTCAAAA 7CTCCAAAAA TTTGATCTAG TTGGTCATTT 3000 

TCAGCTGTTC GATAGTAC7C AATGACATCC CAATGCTCAA CAATACAACC ATTCTCATCC 3060 

TCACGGAAAG TATCCG7CCT CACCCATTGA GCTTCTCCAC CATTCAGATA TTGATGAACA 3120 

TGAACAAAGA CCAGATTGCC ATCCTCAATG GTGCGGACAA TCTTAATC7C ACGCTCTGGA 3180 

TGACGCTCAA AGAAATCTGC AAAGAAGGCT GCAAATCCTT CTTTCCCCTC AGGAACACCT 324 0 

GTCGAA7CTT GGATATAGGT ATCCCCTACA CACTGCGC7T GAGCCTCAGC AACTCGTCCG 3 3 00 

TC77GAATGG CATGGATGTA TAGGTTGTGA GCATTTTTCA CTTGTTGTGA CATATTCTAA 3 3 60 

ACCTCATTTC CC7TC7C7T7 CAGATTCGCC AAAATTC7TT CT7GAAAACC TTCAAATTGG 3 4 20 

7GAATTTCTT CCTC7GAAAA TCCTTTGTAA AAGA7AGTA7 CCAA7T7CTG AC7GACACGA 3 4 80 
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TCCCCCACTT CTTTCTGGGA CTTGCCTAAC TCCGTTAAAA CTAAATACTT CTTACGCTTC 3 540 

TCTTTTCCAC ACGGACTAAC AATTACAAGC T7TTCTTCCT CTAGCTTTTT TATCATAGTC 3 600 

GTCAGCGTAT TATTCGCAAG TCCAGTCGCA AGCGCGATAT CTGTCGCAGT TGCGCAGCCA 3 660 

GTTTCACTAT TCCATAAAAC CGCTAAAATC TTGCCCTCTT CACCCCTATA AAGAGCCTCA 3 720 

GGATCTTGAC TCAGTAACTT TTGAAAAATC CGCCCATTCA ACAAACGAAT ATGATGGGCT 3 7 90 

AGCAAATGAC CATCTTTCAT AACACCTCCA ATTTATTTCG ATATCGAAAT GAATAAAACA 3 34 0 

ATTGTAACAC TCA7CGTTCT AACTGTCAAC 7ATTTCCATT TAGAAATAAT 7TT7GATAAT 3 900 

TATCCACACC ACCATACTCC GGCTCAACTA ACTTT7AACG AGAGTTTCTA AACTCCTTCG 3 960 

TCCTCCAG7C TACAAAAGCC TTCCATTCGT AC7A7CCTAT ATTTTATGAG GGGACACATT 4C2C 

TTTCCTATCA GACCATTTAT TTTAAAGATA GAAGTAAATC ATAATTCCTT CCATCTGTTC 4 08C 

TTTTATAGTA TATTGAAGT7 AGACTAGAGC ACTGTATCTT CTAAAACATT CATAGAAAGC 4140 

CATTTCAATT TCCCAATCAA TTTGTTCGTA TTTATAGCAT TTCGAAACTG GAATAGGACA 4 200 

CCATGAC7CC TAAAAGATTT CTATAAATTC ATTTAAT7TC CTCAATCAAT 7TCTTCATAT 4 260 

CTTATTTCAT TCCGCTATAA TTTCACCTTA CCCTATCTTT TTCGTAGCAC CCTTCAAACA 4 320 

GCCTATCCCC TACCCTTTGA CGATTCCTCA CTTCGCTCCA CTTCCATTAC AGAAGTTTCT 4 380 

TC ACT ACT AT CGCCTCGGC7 GACTTCTCAT GATTCCTTCT TACT ACT ATT TGAACGCTCA 4440 

C GAG AT AG AT CTTACAAAAA ATGCTTTGAT CC AC AATGG A ATCAAAGCAT TTT AAAG ACT 4 500 

TCCTCATACA TAAGCGCAGA AGTCGCAGTT CCTCTGTACT TGGCTTCTTC TCTTTTGACA 4 560 

AAGCGAGCCA AGT7GAGCAA C7CAGGTCCT CGATGTTTCC CATTTACGAC CAATTCACGA 4 62 0 

TTGACCAGGC C7GAGAGACG AACTGCCTCC AATTGCTCAT TTGTAGTAGG CACTT7TTTA 4 680 

GTAGTCTCTA GGAGAGCAGC AACTAAATCT TCACTCAAAT CATGTCGAGC ATGATTCTAA 4740 

AGATCTTTTA TAAGGCTTTC TAGGTTTGGT TCTACCATCC CTACCACCTC CCTTATGCTT 4 8C0 

TAATAATGTT TAATCAAATC AACCGTTGAA CGATCCAATT TCTTCACCAA GGCTTGTAAG 4860 

AAAGCTTGCG CTTCTAGGAA GTCATCCATT GCATAGAGGG TTTGGTCACA ATGGATATAA 4 9 20 

CGAGCGCAGA CACCGATAGT TGTTGATGGG ACACCACCAT TTTTCAGATG AGCTGCACCT 4 980 

GCATCTGTTC CGCCTTTACC ACAG7AGTAT TGGTAC77GA TACCACCTTC TTCAGCCGTT 504 0 

G7CAAAAGGA AATCCTTCAT CCCTGGGAGA AGCAAG7GAC CTCGATCATA GAAACGAATC 5 IOC 

AAGGTTCCAT CTCCAATCTT GCCTTGACCA CCGTAGACAT CACCTGCTGC TGAGCAATCA 515C 

ACTGCGAGGA AGACTTC7GG GTCAAACTTG GTTGTAGAGG TATGAGCGCC ACGCAGACCA 522 0 

ACT7CT7CTT GGACGTTAGA ACCCAGATAG AGT7CATTCC CGAGT7T77G ACCCGATAAA 5280 
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GCTTCAGCTA GCTCGCTTAC CATGAGGACA CCGTAGCGGT TATCCCAAGC TTTTGAGATG 5 3 40 

ATATTTTTTT CATTCGCTGT CAAAATTGCA GAACTATCTG GTACAATGGT ATCACCAGGA 5400 

CGGATGCCAA AACTTTCTGC CTCACCCTTG TCCGCAAAAC C AC CATC AAA AACGATATCG 54 60 

GCAATGGCTG GCATGGTTGG TCCCCCCTTT CCACCAGTCA AATGCGGAGG AACAGAACCT 5520 

GAAATCACAG GAATTTCATG ACCATCACGA GTCAAGAGTT TGAAACGTTG GCTGCTAACC 5 580 

ACCATGGGGT TCCAGCCACC GATTTCTACG ACACGGAAGG TACCATCTGG CTTGATTTCG 564 0 

CTGACCATAA AACCAACTTC GTCCATATGA GAAGCGACCA AGACGCGCGG TGCATCCACA 57 00 

CCTTCTGAAT G7TTGATACC AAAAATACCA CCCAAGCCAT CTGTCACCAC TTCATCCACA 57 60 

TGCGGTGTCA ACTTTTCACG AAGATAAGCA CGGACAGGCG CTTCATCACC TGAGACTGCA 5820 

GCAAGTTCTG TTACTTCTTT AATTTTTGAA AATAATG7TG TCATTTCAGT TCCTTCTTTC 5880 

TTTCATCCAT TTTACCACTT TTTATAGGAG AAGGATAGTG CCAAGGTGGA TTTCTAAGTT 594 0 

AGTATCTTAG TCCTGCTCTA TCTTAGAAAA GGATAGTATT CTCTTGCATG TAGTGCAAAA 6000 

TCTAGTAAAC ATTCCAAAAT TAACTCGAAT ATTTATTTCC AAACAAAAAA ACAATACACC 6060 

ATCAAAGTTG TTTGGATTT? TCATGAAATT TACAGAAAAT AGTTGACTTC CCTTTCTTCT 6120 

TTCTTTAAAT ATATAGTTGG TTGAGTTTGG AATAGTACGC TCTAGCTGCT AAAACATTTC 6180 

TAGAAATTAA TTTGACTTTC CTAATAGAGT TGTTCATATC TTATTTCAAT TTACTATAGT 6240 

ACAAAACTAG AAAAGGAAAA AATCATGACC AGG 62 7 3 

(2) INFORMATION FOR SEQ 10 NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28171 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ACAACCTTTT TCAAAAACTC ACCTTGGTAC GGAGATGTTT TCCTTTCTGC TATTATTTTC 60 

GGTTATATTC ATATCAATTT TGCTTTAACT CCTCTTGC7T TTTTCATTTA TGCTAGTGGA 120 

GGTCTTATTT TAGCTCTATT GTATCGCATG ACTAAAAATC TCTACTATCC AATACTAGTT 180 

CATATTCTCA TTAATATCAC TGCCTTCTGG GATGTGTGGT TGCTCCTATT TTCAGGAAGT 240 

TAGCTTACTA AAATAATG7C GGAACTTTCC GGCAT7T7CT TTT7TCACAA ATAGTCAACG 3 00 

TT7TTCTTTT CGATATTGTA GTGGTGTGTA TCCAGTTA7T TTTTTGAATT GATTTTGAAA 3 60 
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ATAAGGTTGA 


CTTGAGAAAG 


GCAGATAGTG 


AAGATAGTTA 


AGAAGAATAG 


GATGTTCTTT 


420 


TTTCCTTTTT 


GGAAAACTTC 


TAAAATATGG 


TATAATGAAA 


AGATAAAGAA 


GTTGGGGGTA 


430 


GAAGATGAAC 


ATTCAACAAT 


7ACGCTATGT 


TGTGGCTATT 


GCCAATAGTG 


GTACTTTTCG 


540 


TGAAGCTGCT 


GAAAAGATG? 


ATGTTAGTCA 


GCCGAGTCTG 


TCTATTTCTG 


TTCGTGATTT 


600 


GGAAAAAGAG 


TTGGGCTTTA 


AGATTTTCCG 


TCGGACCAGC 


TCXGGGACTT 


TCTTGACCCG 


660 


TCGTGGGATG 


GAATTTTATG 


AAAAATCGCA 


AGAATTGGTT 


AAAGGATTTG 


ATATTTTTCA 


720 


AAATCAGTAT 


GCCAATCCTG 


AAGAAGAAAA 


AGATCAATTT 


TCTGTTGCTA 


GCCAGCACTA 


780 


TGACTTCT7G 


CCACCAACTA 


TTACGGCCTT 


TTCAGACCGC 


TATCCTGACT 


ATAAGAACTT 


840 


CCCTATTTTT 


GAATCAACTA 


CTGTTCAAAT 


ATTAGATCAA 


GTGGCCCAAG 


GGCATAGTCA 


900 


GATTGCCATT 


ATCTACCTCA 


ACAATCAAAA 


TAAAAAGGGG 


ATTATGCAAC 


GCGTTGAAAA 


960 


ATTAGGTCTG 


GAGGTCATCG 


AATTGATTCC 


TTTCCATACC 


CATATTTATC 


TCCGTGAGGC 


1020 


TCATCCTTTA 


GCCCAGAAAG 


AGGAATTAGT 


CATCGAGGAT 


TTAGCGGATT 


TACCAACGGT 


1080 


TCGTTTCAC? 


CAAGAGAAAG 


ACGAGTACCT 


TTATTATTCA 


GAGAACTTTG 


TCGATACCAG 


1140 


CGCTAGCTCA 


CAGATCTTTA 


ATGTGACAGA 


CCGTGCCACC 


TTGAATGCTA 


TTTTCGAGCC 


1200 


GACGGACGCC 


TATGCGACAG 


GTTCTGGATT 


TTTAGATACT 


GACAGTGTTA 


ATGGCATTAC 


1260 


AGTTATTCGT 


CTCAAGGATA 


ACCTAGATAA 


CCGCATGGTC 


TATGTTAAAC 


GTGAAGAAGT 


1320 


GGAGCTTAGT 


CAAGCTGGGA 


CTCTCTTCGT 


AGAAGTCATG 


CAACAATATT 


TTGATCAAAA 


1380 


GAGGAAATCA 


TGAAAAAAAG 


AGCAATAGTG 


CCAGTCATTC 


TACTCCTTTT 


GATTGGGCTG 


1440 


CATCAGTTCC 


TCAAATCCTA 


TATCCTCCAG 


CAGATTCCAC 


TGGGTCAACT 


CCCCTCCTGG 


1500 


ATCCCCAAT? 


TCGTTACCTT 


GACCTACCTG 


CAAAATCGAG 


GTCCAGCCTT 


TTCTATCTTA 


1560 


CAAGATCAGC 


AGCTGTTATT 


CGCTGTCATT 


ACTCTGGTTG 


TCGTGATAGG 


TGCCATTTGG 


1620 


TATTTACATA 


AACACATGGA 


CGACTCATTC 


TGGATGGTCT 


TGGGTTTGAC 


TCTAATAATC 


1630 


GCGGGTGGTC 


TTGGAAACTT 


TATTCACACC 


CTCAGTCAGG 


GCTTTGTTGT 


CGATATGTTC 


1740 


CACCTTGACT 


TTATCAACTT 


TGCAATTTTC 


AATGTCCCAC 


ATAGCTATCT 


GACGGTTGGA 


1300 


CTCATTATTT 


TATTGATTGC 


AATGCTAAAA 


GACGAAATAA 


ATGGAAATTA 


AAATTCAAAC 


1360 


TGGTGGTCTG 


CGTTTGGATA 


AGGCTTTGTC 


AGATTTCTCA 


GAATTATCAC 


GTAGTCTCGC 


1920 


GAATGAACAA 


ATTAAATCAG 


GCCAGGTCTT 


GGTCAATGGT 


CAAGTCAAGA 


AAGCTAAATA 


1980 


CACAGTCCAA 


GACCCTGATG 


TCGTCACTTA 


CCATGTGCCA 


GAACCAGAGG 


TATTACAGTA 


2040 


TGTGGCTCAG 


GATCTTCCGC 


TAGAAATAGT 


CTACCAAGAT 


GAGGATGTGG 


CTGTCGTTAA 


2100 


CAAACCTCAG 


GGAATGGTTG 


TGCACCCGAG 


TGCTGGTCAT 


ACCAGTGGAA 


CCCTAGTAAA 


2160 
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TGCCCTCATG TATCATATTA AGGACTTGTC GGGTATCAAT GGGGTTCTGC GTCCAGGGAT 2 2 20 

7G7TCACCCT ATTGATAAGG ATACGTCAGG TCTTCTCATG ATTGCTAAAA ACGATGATGC 2280 

GCATCTAGCA CTTGCCCAAG AACTCAAGGA TAAAAAGTCT CTCCGCAAAT ATTGGGCGAT 2340 

7G7TCATGGA AATCTACCTA ATCATCGTGG TGTAATTGAA GCGCCGATTG GCCGGAGTGA 2 4 00 

AAAAGACCGT AAGAAACAGG CTGTAACTGC TAAAGGGAAG CCTGCAGTGA CGCGTTTTCA 2 4 60 

- " CGTCTTGGAA CGCTTTGGCG ATTATAGCTT AGTAGAGTTG CAAC7GGACA CAGGGCGCAC 25 20 

TCATCAAATC CGTGTCCACA TGGCTTATAT CGGCCATCCA GTCGCTGGTG ATGAGGTCTA 2 5 80 

TCGTCCTCCC AAGAC777GA AAGGACATGG ACAATTTCTT CATGCCAAGA C777AGGTT7 2 64 0 

TACTCATCCG AGAACAGGTA AGACCTTGGA ATTTAAAGCA GATATCCCAG AGATTTTTAA 2700 

GGAAACCTTG GAGAGATTGA GAAAGTAAGA ATGAAAAAGA AATTAACTAG TTTAGCACTT 2760 

GTAGGCGCTT TTTTAGGTTT GTCATGGTAT GGGAATGTTC AGCCTCAAGA AAGTTCAGGA 2820 

« AA7AAAA7C C ACTTTATCAA TGTTCAAGAA GGTGGCAGTG ATGCGATTAT TCTTGAAAGC 2 880 

AATGGACATT TTGCCATGGT GGATACAGGA GAAGATTATG ATTTCCCAGA TGGAAGTGAT 2 94 0 

TC7CCCTATC CATGGAGAGA AGGAATTGAA ACGTCTTATA AGCATGTTCT AACAGACCGT 3000 

GTCTTTCGTC GTTTGAAGGA ATTGGGTGTC CAAAAACTTG ATTTTA7TTT GGTGACCCAT 3060 

ACCCACAGTG ATCATATTGG AAATGTTGAT GAATTACTGT CTACCTATCC AGTTGACCGA 3120 

GTCTATCTTA AGAAA7ATAG TGATAGTCCT ATTACTAATT CTGAACGTCT ATGGGATAAT 3130 

CTGTATCGCT ATGATAAGCT TTTACAGACT GCTGCAGAAA AACC7G777C AGT7ATTCAA 3240 

AATATCACAC AAGGGGATGC 7CA77T7CAG TTTGGGGACA TGGATATTCA GCTCTATAAT 3 300 

7 A7G A AAA*? G ^_\AC7GAT7~ ATC~CG7GAA T7 AAAC AAAA TTTGGGATGA CAATTCCAA7 3 3 60 

TCCTTCATTA GCGTGGTGAA AGTCAATGGC AAGAAAATTT ACC7TGGGGG CGATTTAGA7 3 4 20 

AATGTTCATG GAGCAGAAGA CAAGTATGG7 CCTCTCATTG GAAAAGTTGA TTTGATGAAG 3 4 80 

7TTAATCATC ACCATGATAC CAACAAATCA AATACCAAGG ATTTCAT7AA AAATT7GAG7 3 540 

CCCAGTTTGA TTG7TCAAAC T7CGGATAGT CTACCTTGGA AAAATGGTGT TGATAGTGAG 3 600 

TATGTTAATT GGC7CAAAGA ACGAGGAATT GAGAGAATCA ACGCAGCCAG CAAAGACTA7 3 6 60 

GATGCAACAG TTTTTGATA7 TCGAAAAGAC GGTTTTCTCA ATAT7TCAAC A7CC7ACAAG 3 720 

CCGATTCCAA GTT77CAAGC TGGTTGGCAT AAGAGTGCAT ATGGGAACTG G7GGTATCAA 3 7 80 

GCGCCTGATT C7ACAGGAGA GTA7GC7GTC GG7TGGAATG AAATCGAAGG TGAATGG7AT 3 840 

TAC7TTAACC AAACGGG7A7 C7TGT7ACAG AA7CAATGGA AAAAATGGAA CAA7CATTCC 3 90C 



WO 98/18931 



PCT/US97/19588 



276 

T7CTATTTGA CAGACTCTGG TGCTTCTGCT AAAAATTCGA AGAAAATCGC TGGAATCTGG 3 960 

7ATTATTT7A ACAAAGAAAA CCAGATGGAA ATTGG7TGGA T7CAAGATAA AGAGCAGTGG 40 20 

TATTATTTGG ATGTTGATGG TTCTATGAAG ACAGGATGGC TTCAATATAT GGGGCAATGG 4080 

TATTACTTTG CTCCA7CAGG GGAAATGAAA ATGGGCTGGG TAAAAGATAA AGAAACCTGG 414 0 

7ACTATATGG ATTCTACTGG TGTCATGAAG ACACGTGACA TAGAACTTGC TGGTCAACAT 4 200 

TA77ATCTCC AAGATTCAGG AGCTATGAAG CAAGCCTCGC ATAAAAAGGC AAATGATTGG 42 60 

TATTTCTACA AG AC AG AC GG TTCACGAGCT GTGGGTTGGA TCAAGGACAA GGATAAATGG 4 3 20 

TACTTCTTGA AAGAAAATGG 7CAATTACTT GTGAACGGTA AGACACCAGA AGGTTATACT 4 330 

GTGGATTCAA GTGGTGCCTG GTTAGTGGAT GTTTCGATCG AGAAATCTGC TACAATTAAA 444C 

ACTACAAGTC ATTCAGAAAT AAAAGAATCC AAAGAAG7AG TGAAAAAGGA TCTTGAAAAT 4500 

AAAGAAACGA GTCAACATGA AAGTGTTACA AATTTTTCAA CTAGTCAAGA TTTGACATCC 4 560 

TCAACTTCAC AAAGCTCTGA AACGAGTGTA AACAAATCGG AATCAGAACA G7ACTACAAA 4 620 

AGAAGGTTTT AGGGCCTTCT 7TTTCCTATC AACTCTTTTC TA7TTCCTC7 TATTCATGTT 4 630 

ATAATGCATA AATATGAATA ATCGGAGTGA GACTATGAAA TACAAACGGA TTGTCTTTAA 4 740 

GGTGGGTACT TCTTCTCTGA CAAATGAGGA TGGAAGTTTA TCACGTAGTA AGGTAAAGGA 4 800 

TATTACCCAG CAGTTGGCTA TGCTGCACGA GGCTGCTCAT GAGTTGATTT TGCTGTCTTC 4860 

AGGTGCCATT GCGGCTGGTT TTGGAGCCTT AGGATTTAAA AAGCGTCCGA CTAAGATTCC 4 920 

TGATAAACAG CCTTCAGCAG CGCTAGGGCA CCCCCTTTTC TTGGAAGAAT A7ACAACCAA 4 980 

TCTTCTCTTG CC7CAAATCC TT7C7GCACA AATCTTCC7G ACCCAAGATG ACTTTGTCGA 5 040 

TAAGCGTCGT TATAAAAATG CCCATCAGGC 77TG7CCCTT TTGCTCAACC GTGGGGCAAT 5 ICO 

7CCTA7CATC AATGAGAATG ATAGTCTCGT TATTGATGAG CTCAAGGTTG GGGACAATGA 5160 

CACTCTAACT GCTCAAGTAG CCCCCATCG? CCAAGCAGAC CTTTTAGTTT TCTTGACAGA 5 220 

TGTCGACGGT CTCTATACTG GAAATCCTAA TTCAGATCCA AGACCCAAAC GCTTGGAGAG 5 2 8C 

AATCGAGACC ATCAATCGTG AGATTATTGA TATGGCTGGT GGACCTCGTT CG7CAAACGG 5 340 

AACTGGGGGT ATGTTAACCA AAATCAAGGC TGCAACTATC GCGACGCAAT CAGGAGTTCC 5400 

TGTTTATATC TGCTCATCCT TGAAATCAGA TTCCATGATT GAGGCGGCAG AGGAGACCGA 54 60 

GGATGGTTCT TACTTTGTTG CTCAAGAGAA GGGGCTTCGT ACCCAGAAAC AATGGCTTGC 5 520 

CTTCTATGCT CAGAGTCAAG GTTCTATTTG GGTTCATAAA GGGGCTGCGG AAGCTCTCTC 558C 

TCAATATGGA AAGAGTCTTC TCTTATCTGG TATCGTTGAA GCAGAAGGAG TCTTTTCTTA 5640 

CGGTGATA7C GTGACAG7AT TTGACAAGGA AAGTGGAAAA TCAC7TGGAA AAGGACGCGT 57 00 
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GCAATTTGGA GCATCTGCTT TGGAGGATA? GTTGCGTTCT CAAAAAGCCA AGGGTGTCTT 5760 

GATTTACCGT GACGACTGCA TTTCCATTAC TCCTGAAATC CAACTACTTT TTACAGAATT 5320 

TTAGAGGTAA ACTATGGTGA GTAGACAAGA ACAATTTCAA CAGGTACAGG CTGTTAAAAA 5880 

ATCGATTAAC ACAGCTAGTG AAGAAGTGAA AAACCAAGCC TTGCTAGCCA TGGCTGATCA 5940 

CTTAGTGGCT GCTACTGAGG AAATTTTAGC GCCTAATCCC CTCGATATGG CAGCGGCTAA 6000 

GGGGAAAATC TCAGATC7CA TGTTGGATCG TCTTTATTTG GATGCAGATC GTATAGAAGC 6060 

GATGGCAAGA GGAATTCGTG AAGTGCTTGC CTTACCAGAT CCAATCGGTG AAGTTTTAGA 6120 

AACAAGTCAG CTTGAAAATG GTTTGGTTAT CACAAAAAAA CGTGTACCTA TGGGTGTCAT 6180 

CGGTATTATC TATGAAAGCC GTCCAAATGT CACGTCTCAT GCCGCTGCTT TGACTCTTAA 6240 

GAGTGGAAAT GCGGTTGTTC TTCGTAGTGG TAAGGATGCC TATCAAACAA CCCATGCCAT 6300 

TGTCACAGCC T7GAACAAGG GCTTGGAGAC GACTACTATT CATCCAAATG TGATTCAACT 63 60 

GGTGGAGGAT ACTAGCCGTG AAACTAGTTA TGCTATGATG AAGGCCAAGG GCTATCTAGA 64 20 

CCTTCTCATT CCTCGTGGAG GAGCTGGCTT GATCAATGCA GTGGTTGAGA ATGCGATTGT 6480 

ACCTGTTATC GAGACAGGGA CTCGCATTGT CCATGTCTAT GTGGATAAGG ATGCAGACGA 654 0 

AGACAAGGCC CTCTCTATCA TCAACAATGC TAAAACCAGT CGTCCTTCTG TTTGTAATGC 6600 

CATGGAGGTT CTCCTCCTTC ATGAAAACAA GGCAGCAAGC TTCCTTCCTC GCTTGGAGCA 6660 

AGTCTTGCTT GCAGAGCGTA AGGAAGCTGG ACTGGAACCA ATTCAATTCC GCCTACATAG 6720 

CAAAGCAAGC CAGTTTCTTT CAGGTCAAGC AGCTGACACC CAAGACTTTG ACACCGAGTT 67 80 

TT7AGACTAT GTCCTTGCTG TTAAGGTTGT GAGCAGTTTA GAAGAAGCGG TTGCGCACAT 684 0 

TGAATCCCAC AGCACCCATC ATTCGGATGC TATTGTGACG GAAAATGCTG AAGCTGCAGC 6 900 

ATACTTTACA GATCAACTCG ACTCTGCAGC GGTGTATCTT AATGCCTCAA CTCGTTTCAC 6960 

AGATGGAGGA CAATTTGGTC TTGGTTGTGA AATGGGGATT TCTACTCAGA AATTGCACCC 7020 

GCGTGGTCCC ATGGGCTTGA AAGAGTTGAC CAGCTACAAG TATGTGGTTG CCGGTGATGG 7 080 

GCAGATAAGG GAGTAACACA TGAAGATTGG ATTTATCGGT TTGGGGAATA TGGGTGCTAG 7140 

CTTGGCAAAA TCTGTCTTGC AGACTAGGAC GTCAGATGAG ATTCTCCTTG CCAATCGTAG 7 200 

TCAAGCTAAG GTAGATGCTT TCATTGCAGA CTTTGGTGGT CAGGCTTCCA GCAATGAAGA 7 260 

AATGTTTGCA GAAGCAGATG TGATTTTTCT AGGAGTTAAG CCTGCTCAGT TTTCTGAACT 7 3 20 

GCTTTCTCAA 7ACCACACCA TCCTTGAAAA AAGAGAAAG7 CTTCTTTTGA TTTCCATGGC 7 3 80 

AGCTGGATTG ACCTTAGAAA AACTAGCAAG TCT7ATCCCA AGTCAACACC GAA7TATTCC 744 0 
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TATGATGCCT AATACCCCTG CTTCTATCGG GCAAGGAGTG ATTAGTTATG CCTTGTCTCC 7 500 

7AATTGCAGG GCTGAGGACA GTGAGCTCTT TTATCAGCT7 TTAGCCAAGG CTGGTCTCTT 7 5 60 

GGTTGAACTA GGAGAAAGTT TAATCGATGC ACCGACAGC7 CTTGCAGGTT GTGGACCACC 7520 

CTTTGTCTAT CT77T7ATCG AGGCCTTGGC AGATGCAGGT GTTCAGACAG CATTACCACG 7 6 90 

AGAAATAGCA TTGAAAATCG CAGCACAAAC TC7GG7AGGA GCTCGGCAA7 TGGTCCTTGA 7 740 

AAGTCAGCAA CA7CC7GGAG TATTGAAAGA CCAAGTCTGT AGCCCAGGCC GTTCGACTAT 7 800 

CGCTGGTGTA GCAAGCCTAG AAGCGCATGC TTTCCGAGGA ACAGTCATGG ATGCAGTTCA 7 360 

TCAAGCCTAC AAACGAACAC AAGAACTAGG TAAATAAGAG GTAGTTTTGA CTGCCTCTTT 79 20 

TA7GGTGGC7 GAAATGAGAA GACACAAAAA GA77G7CACA AACCCCTATT TTTTTGATAC 7980 

AATAGAAGTA GTAAAAAAGA AATGAGTTAG ACATGTCAAA AGGATTTTTA GTCTCTCTTG 8040 

AGGGACCAGA GGGAGCAGGC AAGACCAGTG 7T7TAGACCC TCTGCTACCA AT7TTAGACC 8100 

AAAAAGGAG7 AGAGGTGTTG ACGACCCGTG AACCTGGCGG AGTCTTGATT GGGGAGAAGA 8160 

7TCCGGAAGT GATTTTGGAT CCAAGTCATA CTCAGATGGA TGC7AAAACA GAGCTACTTC 32 20 

7CTATATTCC CAGTCGCAGA CAGCATTTGG TGGAAAAAGT 7CTTCCACCC CTTGAAGCTG 82 80 

CCAAGT7GGT CA7CATGGAT CGTTTTATCG ATAGTTCTGT TGCCTATCAG GGATTTGGTC 8340 

GTGGCTTAGA TATTGAAGCC ATTGACTGGC TCAATCAGTT TGCGACAGAT CGCCTCAAAC 8400 

CCGATTTGAC ACTCTATTTT GACATCGAGG TGGAAGAAGG GCTCGCTCGT ATTGCTGCTA 8460 

ATAGTGACCG CGAGGTTAAT CGTTTGGATT 7GGAAGGGTT GGACTTGCAT AAAAAAGTTC 8 520 

GTCAAGGCTA CC7TTCTCTT CTGGATAAAG AGGGAAA7CG CAT7G7CAAG A77GATGCTA 8530 

GTCTCCCTTT GGAGCAAGTT G7GGAAAC7A CCAAGGC7G7 C7TGT7TGAC GGAATGGGCT 3640 

TGGCCAAATG AAACAAGA7C AAC7AAAGGC 1TGGCAACCA GCTCAGTTTG ACCGTTTTCT 8700 

CCGTATCTTA GAACAAGACC AGCTCAATCA CGCC7ATC7C TTTTCAGGT7 TCTTTGAAAG 37 60 

C7TGGAAA7G GCGCAAT7TT TAGCTAAGAG CCTC7TT7GT ACGGA7AAAG TTGGCGTCTT 3820 

ACCATGTGAG AAATGCCGAA GT7GCAAGC7 GATTGAACAG GGAGAATTTC CCGATGTCAC 8830 

CTTGATTAAA CCAG7TAATC AGGTCATTAA GACGGAACGC ATTCGAGAAT TGGTGGGTCA 8940 

GTTTTCTCAA GCAGGGATTG AAAGCCAGCA ACACG7C777 A7CATCGACC AAGCGGATAA 9000 

AATGCATCCC AACGCAGCCA ATTCTCTGCT CAAGG7CA7C GAAGAACCCC AGAGTGAAG7 9060 

T7ATATTTTC TTCT7GACTA GCGATGAGGA AAAGA7GT7A CCGACAATCC GAAG7CGGAC 9120 

TCAGATCT7C CACTTTAAAA AGCAAGAAGA AAAAC77A7C 77ACTC77AG AACAAATGGG 9130 

ACT7G7TAAG AAAAAAGCCA C7C7TT7AGC 7AAG7TTAGT CAA7CGCGAG CTGAAGCAGA 9240 
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AAACTTCCC7 AATCAGGCAA GTT7T7GCAC CTTGGTCGAT GAAAGTGAAC GCCTGCTGAC 9 300 

TTGGTTAGTA GCTAAGAAAA AACAAAG7TA TCTACAGGTT GCCAAATTAG CCAACTTGGC 93 60 

AGATGATAAG GAAAAACAGG ATCAGGTTTT ACGGATTCTT GAAGTTCTCT GTGGGCAGGA 9420 

CCTCTTGCAG GTAAGAGTAA GAGTGATTCT ACAAGATTTA CTAGAAGCTA GAAAAATGTG 9480 

GCAAGCTAAT GTCAGCTTTC AAAATGCCAT GG AATATCTC GTCTTGAAAG AAATATAAAC 9 540 

TCAAAAATGA A7GATAAAGA AACGAAAGGG CTCTTTTATG GACAAAAAAG AATTATTTGA 9600 

CGCGCTGGAT GATTTTTCCC AACAATTATT GGTAACCTTA CCCGATG7CG AAGCCATCAA 9660 

GAAAAATCTC AAGAGCCTGG TAGAGGAAAA TACAGC7CTT CGCTTGGAAA A7AG7AAG7? 9720 

GCGAGAACGC TTGGGTGAGG TGGAAGCAGA TGCTCCTGTC AAGGCCAAGC A7C7TCCTCA 9780 

AAGTGTCCGT CGCATTTACC GTGATGGATT TCACGTATGT AATGATTTTT ATGGACAACG 9840 

TCGAGAGCAG GACGAGGAAT GTA7GTTTTG TGACGAGTTG CTATACAGGG AGTAGGCATG 9900 

CAGATTCAAA AAAGTTTTAA GGGGCAGTCT CCCTATGGCA AGCTGTATCT AGTGGCAACG 99 60 

CCGATTGCCA ATCTACATGA TATGACTTTT CGTGCTATCC AGACCTTGAA AGAAGTGGAC 10020 

TCCATTCCTC CTGAGGA7AC GCGCAATACA GGG CTTTTGC TCAAGCATTT TCACATTTCC 10080 

ACCAAGCAGA TCAGTTTTCA TGAGCACAAT GCCAAGGAAA AAATTCCTGA TTTGATTGGT 10140 

TTCTTGAAAG CAGGGCAAAG TATTGCTCAG GTCTCTGATG CCGGTTTGCC TAGCATTTCA 10200 

GACCCTGGTC ATCATTTAC? TAAGCCAGCT A7TCAGGAAG AAATTGCACT 7C7GACAG7T 10260 

CCAGGTGCCT CTGCAGGAAT TTCTGCCTTG A7TCCCAGTG GTTTAGCGCC ACAGCCACAT 103 20 

A7CTTTTACG G7TTTTTACC GAGAAAATCA GGTCAGCAGA AGCAATTTTT TGCCTTGAAA 10380 

AAA-ATTA-: —GAAACACA GA777T77A7 GAA7CACC7? A7737CTACC ACACACC77C 10440 

GAAAATATGT TAGAAGTCTA CGGTGACCGC 7CCGTTG7CT 7GCTCAGGGA AT7GACCAAA 10500 

A7C7ATGAAG AATACCAACG AGG7ACTATC 7CTCAG7TAT TAGAAAGCAT 7GC7CAAACG 10560 

CCACTCAAGG GCGAATC7CT TC7CATTCTT GACGGTGCCA CTCACGGTG7 GGAGGAAAAG 10620 

GACGAGGAAG ACTTGTTCGT AGAAATTCAA ACCCGCATCC AGCAAGG7G7 GAAGAAAAAC 10680 

CAAGCTATCA AGGAAGTCGC TAAGATTTAC CAGTGGAATA AAAC7CACCT C7ACGC7GCC 10740 

TACCACGACT GGGAAGAAAA ACAATAAAGG GAGACAGGAT G7AA7AAT7C 7G7C7G7TTC 10300 

TGTTTAAC77 AATTAGTGAT GA7AATATAA AGATG7ATCA C77GGTATAG AAGC7TTGGT 10860 

ATTAAGTT77 77A7TAAGCC CATACCCAAT ACCGATGCTT GGAGCAGCAC 77ATAGCGTT 10920 

CTTAGAAGG7 ATAAATAGAA AAATAACCTC A777TAAA7C AAAGGATTGA 7AAA7CAGAA 10980 
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AGAAGCTGAT TTTTTGCGAA CATACGAAAA TAAAGAAGAA CWWvGCTC AG AT AG AG AA 11040 

AACATTTGAG AAATATA1TT TAGAATTTGA TAATATTCCA GAAAATTTAA AAGATAAGAG 11100 

AGCTGATGAA CTTCACAGAA CTCCAGCAGA AAACCTTGCT TATCAGGTTG GTTGGACCAA 11160 

CTTGGTTCTT AAATGGGAAG AAGATGAAAG AAAGGGGCTT CAAGTAAAAA CACCATCGCA 11220 

TAAATTTAAA TGGAATCAAC TTGGTGAATT ATATCAGTGG TTCACAGATA CCTACGCTCA 11280 

TTTATCTCTG CAAGAGTTGA AAGCAAAATT AAATGAAAAT ATTAATTCTA TCTCTGCAAT 11340 

GATTGATTCG TTGAGTGAGG AAGAATTATT TGAACCGCAT ATGAGAAAGT GGGCTGATGA 11400 

AGCGACTAAA ACAGCGACTT GGGAAGTGTA TAAGTTTATT CATGTAAATA CGGTTGCACC 11460 

TTTTGGAACT TTCAGAACTA AAATCAGAAA ATGGAAGAAG ATAGTATTAT AAA TT AT ATT 11520 

TTTAACTTTA AAAAATTTCA TAAAAATGGT TACCAAAGGC GATAGAAGAA AAACTATCGT 11580 

CTTTTTCTTT GCAAATTTTT AACAAGGGAG GTGATCTTGC ATGGACTTTG AATATTTTTA 11640 

TAACAGAGAA GCGGAAAGAT TTAACTTCTT AAAAGTACCG GAG AT ATT AG TTCATAGAGA 11700 

AGAATTTCCG GCCTTATCAG CAGAAGCAAT TATCCTTTAT TCCATACTTC TTAAACAGAC 11760 

AGGAATGTCA TTTAAGAATA ACTGGATAGA CAAGGAAGGC AGAGTATTTA TCTATTTTAC 11820 

TGTCGAAGAA ATTATGAAAA GAAGAAATAT CTCAAAGCCA ACTGCCATAA AAAC ATT AG A 11880 

TGAGCTTGAT GTAAAAAAGG AATAGGACTG ATCGAAAGAG TAAGCCTTGG ACTTGGTAAG 11940 

CCGAACATCA TTTATGTTAA AGACTTTATG AGTATATTTC AGGTAAAAGA AAATGACTTA 12000 

CAGAAGTCAA AAAACTTAAC TTCAGAAGTA AAAGATTTTA ACCTCAGAAG TAAAGAAAAT 12060 

GAACTTCAAG AGCT7AACAA CCTTGACTCT AACTATATAG AGAATAATAA GAGTAAGTAT 12120. 

ACT AA GAG AG AATATACTTT TGGTGAAAAC GGACTTGGAA CATTTCAAAA TGTGTTTTTA 12180 

GCTGCTGAAG ATATATCGGA TTTACAAATC ATAATGAACT CACAGCTTGA GAATTACATT 12 2 40 

AGACTTCCTC CAAAACTAGA ATCCTAGTTC ATGATTGATA ATGCCAGCAA TCAAATTCAT 12 3 00 

TCCTAATCCG AAGCGTTTAC GATGATTTCC ATAGATTGTT GAAAACATTT TAAACGTTTT 12 3 60 

TACTTTCCCA AAGATGTTCT CAATCTTGCT TCTCTCCTTC GATAGCGCAT GGTTACAGGC 124 20 

TTTATCTTCA GCTGTTACCG GCTTGAGTTT GCTGGATTTA CGTGGAGTTT GTACTTGAGG 124 80 

ATATATCTTC ATGAGCCCTT GATAACCACT GTCAGACAAG ATTTTACCAG CTTGTCCGAT 12540 

ATTTCTGCGA CTCATTTTGA ACAACTTCAT ATCACGACAA TAGTTCACAG CGATATCCAA 12 600 

AGAAACAATT CTCCCTTGAC TTGTGACAAT CGCTTGAGCC TTCATAGCGT GAAATTTCTT 12 660 

TTTACCAGAA TGATTCGCTA ATTCTTTTTT TAGCGCGATT GATTTTTACT TCCGTCGCAT 12720 

CAATCATTAC CGTGTCCTCA GAACTGAGAG GAGTTCTTGA AATCGTAACA CCACTTTGAA 12 780 
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CAACAGTTAC TTCAACCCAT TGGCTCCGAC GGATTAAGTT CCTTTCC7GA ATACCAAAAT 12 840 

CAGCCGCAAT TTGTTCATAA CTTCGATATT CTCGCACATA TTGAAGAGTG GCCATAAGAA 12 900 

GGTCTTCTAG GCTTAATTTA GGTTTTCGTC CACCTTTTGC GTGTTTAAGT TGATAAGCTG 12960 

TTTTTAATAC AGCTAATATC TCTTCAAAAG TCGTGCGCTG AACACCAACA AGACCCTTAA 1302 0 

ATCGTGCATC AGT7AGTTGT TTACTTCCTT CATCATTCAT AGAACTACTA TACCATATTT 13080 

TGTTTCCCAG GAAGTCTATT GGAAAGTAAG AAATATTGAA GCTGAGGCTA TTAGAAGAAA 1314 0 

TTGTGAGCGT GGTGCTATTT TTTCAGGTAA AATAAAATAT CACGAAGATT CACAGTTTAA 13 200 

AGGAGATCAC TATGTTGAAT GTTATCCTGT TTTAGATAAT ACGGTTATAG CAAGAGATAG 13260 

AATAACAGTC CCTA7CGATC CGTTATGTGG AAAAGATTTT ATAGAGTAGC ATATAATTCA 13320 

TTCTTAACTG GAATACTCAC TATCTCTTTA CATCAAGAAA ATGACTAAAC AGCGAAGTTT 13 380 

CCCTTCTTCC CTTTTTTTGT TAT ACT ACTA GAAGAAAAAA TTAGAAAGAT TTGTGGGTGT 13440 

CAAACAGCCC AGTGGGGTGT TTTAATATCG ACTTAGGTCC CACCCAAACA GCTATTACTG 13500 

TCGTGTCTCA ATCTTATATC AATGTTATCG GTGC7GCTTT GGCACGTTCT GAAGCAGCTT 13560 

ACCAAATCGC AGAGCGTGGT ATTCCACTTA AACTATATGA AATGCGTGGT GTCAAGTCTA 13620 

CACCCCAGCA TAAAACAGAC AATTTTGCTG ACTTGGTTTG TTCCAATTCT TTGCGTGGGG 13680 

ATGCTTTGAC AAATGCAGTT GGTCTTCTCA AGGAAGAAAT GCGTCGCTTG GGTTCTGTTA 13740 

TCTTGGAATC TGCTGAGGCT ACACGTGTTC CTCCAGGTGG TGCCCTTGCA GTGGACCGTG 13 800'" 

ATGGTTTCTC TCAAATGGTG ACCGAAAAAG TTGCCAACCA CCCCT7CAT7 GAAGTGGTTC 13 960 

GTGATGAAAT TACAGAATTG CCG AC AGATG 77A77ACCCT TATCGCTACT GCTCCTTTGA 13920 

:a,\:7~A7GC C77GG~T0.--\ AAG.VT77.YTG 77':" V-.TC.^ :~~~?r:7~7 7777A77TC7 13330 

ACGATGCGGC AGCGCCTATT ATCGATGTCA ACACTATCGA TATGAGCAAG GTCTACCTCA 14040 

AATCACGTTA TGATAACGCA GAAGCGGCCT ACCTCAATGC CCC7ATGACC AAGCAAGAXT 14100 

TTATGGATT7 CCATGAAGCT TTGGTCAATG CAGAAGAAGC ACCCCTTACT TC7TTTGAAA 14160 

AAGAAAAGTA CTTTGAAGGA TGTATGCCTA 7CGAAGTCAT GGCCAAACGT CCCATTAAAA 14 22C 

CTATGCTTTA TGGCCCTATG AAGCCAGTCG GTC7TGAGTA CCCAGACGAC TATACAGGAC 14 2 80 

CTCGTGATGG AGAATTTAAA ACACCTTATG CGGTTGTGCA ACT7CCTCAG GATAATGCAG 14 340 

C7GG7AGCCT CTACAATATT GTTGGTTTCC AGACCCACCT CAAA7GGGGA GAACAAAAGC 14400 

GTCTCTTCCA AATGATTCCG GGTCTTGAAA ATGCGGAG77 TGTCCGTTAT GGTGTGATGC 14 4 60 

A7CGCAATTC .TTACATGGAT TCACCAAATC TTCTTGAGCA GACTTACCCT TCTAAGAAAC 14520 
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AACCAAATCT CTTCTTTGCT GGTCAAATGA CGGG7GTGGA AGCCTATGTT GAGTCGGCGC 14 530 

CTTCACGCTT AGTTGCGGGA ATTAACGCAG CTCGTCTCTT CAAGCAAGAA AGCGAGGCTA 14640 

TTTTCCCCGA GACGACAGCG ATTGGAAGCT TAGCTCATTA CA7TACCCAT GCCCACAGCA 147 00 

AACATTTCCA ACCAATGAAT GTCAATTTTG GGATCATCAA GGAGTTGGAA GGCGAGCGTA 14 7 60 

TCCGTGATAA GAAGGCTCGT TATGAAAAAA TTGCAGAGCG TGCCCTTGCC GACTTAGAGG 14 820 

AATTTTTGAC TGTCTAATTT TTTTGAAAGA ATTGCTCATG ATACTATAAA AATCTTAGAA 14 980 

ATTGTGATAA AATAGGTAGG ATGAAAGAAC GAGAGTGAAA ATGGCGAATC CCAAGTATAA 1494 0 

ACGTATTTTA ATCAAGTTAT CAGGTGAACC CCTTGCCGGT GAACGTGGCG TAGGGATTGA 15000 

TATCCAAACA GTTCAAACAA TCGCAAAAGA GATTCAAGAA GTTCATAGCT TAGGTATCGA 15060 

AATTGCCCTT GTTATCGGTG GAGGAAATCT CTGGCGTGGA GAACCTGCAG CAGAAGCAGG 15120 

TATGGACCGT GTTCAGGCAG ATTACACAGG AATGCTTGGG ACTGTTATGA ATGCTCTTGT 15180 

GATGGCAGAT TCATTGCAAC AAGTTGGGGT TGATACGCGT GTACAAACAC CTATTGCCAT 15 240 

GCAACAAGTG GCAGAGCCTT ATGTCCGTCG ACGTGCCCTT CGTCACCTTG AAAAAGGCCG 15 300 

TATCGTTATC TTTGGTGCTG GAATTGGTTC ACCTTACTTC TCGACAGATA CAACAGCGGC 153 60 

CCTTCCTGCA GCTGAAATCG AAGCAGATGC CATCCTCATG GCTAAAAATG CTGTCCATGC 154 2 0 

TGTTTACAAT GCCGATCCTA AGAAAGATAA GACAGCTCTT AAGTTTGAAC AATTGACCCA 15480 

CCGTGACGTT ATCAATAAAG GTCTTCGTAT CATGGACTCA ACAGCTTCAA CCCTCTCAAT 15540 

GGACAACGAC ATTGACTTCG TTGTATTCAA CA7GAACCAA CCAGGCAACA TCAAACCTC7 15600 

CGTATTTGGT GAAAA7ATCG GAACAACAGT TTCAAATAAT ATCGAAGAAA ACCAA7AAGA 15660 

AAGAATATGG CTAACGCAAT TATTGAAAAA GCTAAAGAGA GAATGACCCA GTCTCACCAA 15720 

TCACTTGCTC G7CAATTTGG TGCTATCCGT GC7GGTCGTG CCAATGCAAG C7TGCTTGAC 15790 

CGTGTACATG TACAATACTA TGGAGTCGAA ACTCCTCTTA ACCAAATCGC TTCAATTACG 15840 

ATTCCAGAAG CGCGTGTTTT GTTGGTAACA CCATTTGACA AGTCTTCATT GAAAGACATC 15900 

GAACGTGCCT TGAACGCTTC TGATATTGGT ATCACACCGG CTAATGACGG TTCTGTGATT 15960 

CGCTTGGTTA TCCCAGCTCT TACAGAAGAA ACTCCTCGTG ACCTTCCTAA AGAAGTGAAG 16020 

AAGGTCGGCG AAAATGCTAA AGTGGCTCTC CGCAATATCC GTCCCGATCC TATGGACGAA 16080 

GCTAAGAAAC GAGAAAAAGC AAAAGAAATC ACTGAAGACG AATTGAACAC TCTTGAAAAA 16140 

G AC ATT C AAA AAGTAACAGA CGATGCTCTT AAACACATCG ACGACATGAC TGCTAACAAA 16200 

GAGAAAGAAC T7TTGGAACT CTAAAAATAA ACAGAAAAAC TCAGTTGGCA TTGCTGGCTG 162 60 

AGTTTTATTC GAAAGAAGGA AATATGAATA CAAATCTTCC AAGTTTTATC GTTGGACTGA 163 20 
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TCATCGATGA 


AAACGACCGT 


TTTTACTTTG 


TGCAAAAGGA 


TGGTCAAACC 


TATGCTCTTG 


16330 


CTAAGGAAGA 


AGGCCAACAT 


ACAGTAGGGG 


ATACGGTCAA 


AGGTTTTGCA 


TACACGGATA 


15440 


TGAAGCAAAA 


ACTCCGCCTG 


ACAACCTTAG 


AAGTGACTGC 


CACTCAGGAC 


CAATTTGGTT 


16500 


GGGGACGTGT 


CACAGAGCTT 


CGTAAGGACT 


TGGGTGTCTT 


TGTGGATACA 


GGCCTTCCTG 


16560 


ACAAGGAAAT 


CGTTGTGTCA 


CTCCATATTC 


TCCCTGAGCT 


CAAGGAACTC 


TGGCCTAAGA 


16620 


AGGGCGACCA 


ACTCTACATC 


CGTCTTGAAG 


TGGATAAGAA 


AGACCGTATC 


TGGGGCCTCT 


16680 


TGGCTTATCA 


AGAAGACTTC 


CAACGTCTTG 


CTCGTCCTGC 


CTACAACAAC 


A7GCAGAACC 


1674C 


AAAACTGGCC 


ACCCATTGTT 


TACCGTCTCA 


AGCTGTCAGG 


AACTTTTGTT 


TACCTACCAG 


163C0 


AAAATAATAT 


GCTTGGTTTT 


ATTCATCCTA 


GCGAGCGTTA 


CGCAGAGCCA 


CGTTTGGGGC 


16860 


AAGTATTAGA 


TGCGCGCGTT 


ATTGCTTTCC 


G7GAAGTCGA 


CCGCACTCTG 


AACCTCTCCC 


16920 


TCAAACCACG 


CTCCTTTGAA 


ATGTTGGAAA 


ACGATGCTCA 


GATGATTTTG 


ACTTATTTGG 


16980 


AAAGCAATGC 


CGGTTTCATG 


ACCTTAAATG 


ACAAGTCATC 


TCCAGACGAC 


ATCAAGGCAA 


17040 


CCTTTGCCAT 


TTC7AAAGCT 


CAGTTCAAGA 


AAGCTTTAGG 


TGGTCTTATG 


AAGGCTGGTA 


17100 


AAATCAAGCA 


GCACCAGTTT 


GGGACAGAGT 


TGATTTAGGG 


ACGCTTATCA 


GAAAATCATT 


17160 


TTACACTTCG 


CTCATGACCG 


AGCGCAATCC 


TAAAAGTAAC 


AGTCCCAAAG 


CAATTTTGGC 


17220 


AGACCTCGCT 


TTTGAAGAGT 


CAGCCTTTCC 


AAAACACACA 


GATGATTTTG 


ATGAGGTCAG 


17280 


TCGCTTTTTG 


GAGGAGCATG 


CCAGTTTCTC 


TTTTAACCTA 


GGACATTTTG 


ACAGCATTTG 


17 340 


GCAGGAATAT 


CTAGAACACT 


AGCATTTATT 


CATTGGGTTT 


CGGCTAGTAA 


TTTCTCCATC 


17400 


CCTCTGCTAT 


AATAAAAAGA 


AATAAAAGGA 


TTAGAGAGGT 


TCTTTATTTG 


AAGGAACATT 


17460 


CAATAGACAT 


TCAACTGAGT 


CATCCAGATC 


ACO"*G* w ^ w *'"* ^ 


TCTTTTTGGT 


TC~AATOAAC 


17520 


GCCATCTTCG 


TTTGATGGAA 


GAAGACCTTG 


ATGTTGTCAT 


TCATGCTCGT 


ACGGAGATTG 


17580 


TCCAGGTTTT 


GGGAGAAGAG 


TCTGCCTGTG 


AGGAAGCCCG 


TCAAGTTATT 


CAGGCTTTGA 


1764C 


TGGTCTTGCT 


AAATCGTGGG 


ATGACCGTTG 


GTACGCCAGA 


TGTAGTCACT 


GCGATTAGCA 


17700 


TGGTCAAAAA 


TCATCAAATT 


GACAAGTTTG 


TCGCCCTTTA 


CGAAGAAGAA 


ATTATCAAGG 


17760 


ATAATACTGG 


GAAACCTATC 


CGTGTCAAAA 


CCCTAGGGCA 


AAACCTTTAT 


GTGGACAGTC 


17820 


TCAAACAGCA 


TGATGTCACC 


TTTGGAATTG 


GGCCAGCAGG 


TACAGGGAAG 


ACCTTCCTTG 


17380 


CAGTCACCTT 


GGCAGTGACT 


GCCCTTAAAC 


GTGGGCAAGT 


CAAGCGAATT 


ATCCTAACTC 


17940 


GTCCAGCGGT 


GGAAGCGGGA 


GAGAGTCTTG 


GATTTCTTCC 


CCGTGATCTT 


AAGGAGAAGG 


18000 


TGGATCCTTA 


CCTTCCTCCT 


GTTTACGATG 


CCTTG7ATCA 


AA7TCTTGGC 


AAAGACCAAA 


18060 
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CGACTCCTCT 


CATGGAGCGT 


GAAATTATCG 


AAATTGCCCC 


CCTTGCCTAT 


ATCCGTGGCC 


18L20 


GGACCTTGCA 


TGATGCCTTT 


GTCATTCTCG 


ATGAGGCGCA 


AAACACGACC 


ATCATGCAGA 


13180 


TGAAGATGTT 


CTTGACGCGT 


TTAGGTTTTC 


ATTCTAACAT 


GATTGTCAAT 


GGAGATATTA 


18240 


GTCAGATTGA 


CCTGCCACGT 


AATGTCAAGT 


CCGGTTTCAT 


TGATGCTCAA 


GAGAAACTCA 


133CO 


AGAACATCCA 


TCAGATTGAC 


TTTGTTCATT 


TTTCACCCAA 


GGATGTGGTT 


CGCCATCCTG 


18360 


TTGTCGCTCA 


GATTATCCGA 


GCCTATGAAT 


ATTCTACTGA 


AGTTGCACAC 


GACTGATTTT 


18420 


GAGGAAGTTC 


GCCTGCAAAA 


GAATAGACTT 


GTTCCGTAAC 


TGTAAAAAGT 


C7TATACTAT 


18480 


TTTTATGCAA 


ACAGTATACG 


ACAAAGCACA 


AAAACTTAAC 


TCAAAAAACT 


TCAAACTATT 


18540 


GATTGGTGTC 


AAAAAGGAAA 


CCTTTCAACT 


CATGCTAGAA 


CACCTGAATT 


CAGCCTATCA 


18600 


GATTCAGCAC 


CGAAAAGGTG 


GACGTCCACG 


TAGTCTGCCC 


ATGGAAGACC 


AGCTCATTAT 


18660 


CACCCTCCCT 


TACTTGCGAT 


ATTATCCCAC 


TCAGCGTCTG 


CTGGCCTTTG 


ATTTTGGCGT 


18720 


CGGTGTAGCT 


ACGCTAAATG 


CCATCATCAC 


TTCGGTGGAC 


GATACACTTC 


GTGCGTCAGG 


18790 


TAGCTTTGAT 


TTGGACCATT 


TAGAAGCCCC 


GAGTCCTGCT 


GTGGCTATTG 


ACGTGACCGA 


18840 


AAGTCCGATT 


CAGCCTCCAA 


ACAAAACCAA 


AGCAAAAATT 


ATTCTGGTAA 


AAAGAAACCA 


18900 


CACACCTTAA 


AAACTCAAAT 


TATCCTGGAT 


TTGACGACAC 


ATAAAGTCTG 


TCAAATCGCC 


18960 


TTTTCTGACG 


CACATACCCA 


TGATTTTACT 


CTCT7CAAAC 


AAAGTATTGG 


ACAAAGTTTC 


19023 


CCTGAAACGA 


CCCTTCCCTT 


TGTTGACCTA 


GGTTATTTAG 


GCATCTTGAA 


ATTTCATGAG 


19080 


AATACTTTCA 


TTCCTGC7AA 


AAATTCCAAA 


AATCGCCGCC 


TGAGTCAGGA 


TGATAAGCAG 


19140 


77AAATAAAG 


ACATGTCACC 


CATACGAATT 


GAAATTGAAC 


ATTTTAACGC 


7AAATTCAAC 


19200 


ACCTTCCAAA 


TCATGTCAGT 


CCCTTATCGT 


AACCGCAGAA 


AACGTTTCGA 


GTTACGGGCG 


19260 


GAATTAATTT 


GTGCCATCAT 


CAATTATGAA 


GTGAACTAGA 


TTCCGAACAA 


GTCTAATATA 


19320 


CTTTTGAGAG 


AGGAAAATCC 


AGTTCTATAG 


GCTAAAGGTT 


TTATCCAAAG 


GTCTGAGACA 


19380 


ACGATTAGGC 


ACGATGGAAA 


GAACTTTTAT 


GTGGCTGATG 


ACGATCAGTG 


CATCTTCCTG 


19440 


TGTCATAATC 


ACAGGGCACA 


AGAAAGTAGG 


AATTTGAAAA 


GATGATTGAC 


CAACTATCTA 


19500 


AGTATTACAG 


TTGTAGGATA 


CTAACTGAAA 


AGGATATTCC 


AAGTATTTTA 


TCTTTATATG 


19560 


AAAGTAATCC 


TCTGTATTTT 


CAGCATTGTC 


CACCAGAGCC 


AAATTTTGCA 


ACTGTAAAAG 


19620 


AGGACATGCT 


TTGTCTACCT 


CAAGCTAAAG 


CTAAGCCTGA 


TAAGTTTTTT 


GTTCGATTTT 


19680 


GGAATGGATC 


TGACCTTGTG 


GCTGTTATGG 


ATTTTCTCTA 


TCCA7ATCCT 


GATGAGGAGA 


19740 


CTGTTTTTAT 


TGGTTTGTTT 


ATGGTTGATC 


AAGCCTATCA 


GAGAAAAGGG 


A7TGGTAGTC 


19900 


ATATTGTGAC 


AGAAGCACTA 


GCTTATTTTG 


CTAAGAACTT 


TCGAAAGGCA 


CGTTTGGCTT 


19860 
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ATC7TAACCC AAATCCGCAA TC7CAGCA77 7T7CGGAAAA GCAGGGCTTT AAATCAATTG 19 920 

GATGCGAGGT TAAGCAAGAA C7CTATACGG TTGTTATCGC 7GAACAGAGC CTAGAAGATT 19980 

AGAAATGGCA TCAAGTAAGA ACTATTTGGA ATTTGTTTTG GAACAATTAT CAGGATTAGA 20040 

TGATGTGACT TACCGTTCCA TGATGGGGGA G7A7A77CT7 TACTTCCGCG CCAAGATTAT 20100 

TGGCCGCA77 TATGACGATC GC77777AC7 7AAACCCC7C CAAGCAGTCT 7AGA7AAGA7 20160. 

7GACCAA7C7 TCTTTTGAGT TTCCATACAA AGGTGCCAAA GAAATGATTT GAGTGGAAGA 20220 

ACTTGATAAT AAGATGTTTC TATAAGACCT AATTTTAGCT ATGTATAACC AACTGCCAAC 2 0280 

GCCCAAACCT AAAAAGAAAA AGCAAGGGTG AACGAAGTAA AAAAGAAGTC TGCTAAGGCC 20340 

CTCTCTTTGC ACGGGTAAAA TTTTATATAT AAAAAGAAGC TGGGACTAAA GAGCTCAGCT 20400 

TCCTTTGGTT TATATAA7TC TCATTACAAG ACGAACTCC7 TCCGCGAAAC 7C7CT7GAC7 204 60 

TTATTCAATT TAGAGTTTCT TATGCACAAT TGAGTCTGGA ACGAAAGTCT CCAGTTCCAA 20520 

« AGTATACACT ACAATAAACC AACGATGTAA TAGCTGATGA CACAAAGCAC AGTGGGTAGG 20580 

AC77CCGAAG TCACCCTTTT C7T77CAAAA TTTATACTAA ATCATTGATA TCAGTGTACT 20640 

CACGATTAAG TCCTTGAGCA ACTGGTAGGT TAGTCAAGTA ACCTTGATAA GTAGTCACAC 20700 

CTTGACGCAA GCCTTCATCT TCAGAGATTG C7TG7CCGAA TCCTTTGCCA GCCAAAGCTT 207 60 

CCATATAAGG AAGAGTGACA 7TGC7TACGG CGATGGTTGA AGTGCGAGCA ACCGCACCAG 20820 

. GGATATTGGC AACGGCATAG 7CCACAACAC CGTGTTTTTC ATAGACGGCT 7CATCCTCCG 20880 

7TG7CACACC GTCAGCTGTT TCCATAACCC CACCTTGGTC AACAGCAACC TCAACGATAC 20940 

AGAGCCTGGA CGCATTTCTT TGACCA7CTC ATCTGTCACC AATTCCGGTC CTTTTGCACC 21000 

AGGGATGAGA ATGGCTCCAA TCACCACATC AGCATCTCTC ACACTTGCT7 CAATC77GAA 21060 

7GAATTAGAC ATAAGAG77T GAAT7TGACT TCCAAAGACT 7C77C7AGAA C7GAGAGACG 21120 

C7TGGAACTA A7A7CTAAAA TAGTCACT7G AGCACCAAGA CCAAGGGCGA TGCGGGCA1C 21180 

A7G7G7ACCG ACGACACCAC CACCGA7GA7 AG77ACTT77 CC777TGGAA CACCTGGTAC 212 40 

ACCACCAAG7 AGAACACCAG AGCCACCAGC 7TGC77AGTA AGGAAG7GAC C7CCCA7T7G 21300 

AACAGCCATA CGACCTGCAA CCTCACTCA7 AGGAACGAGG AGCGGTAGT7 G7CC77GAT7 21360 

GTCACGAACA G777CAG77G 77T77GC7G7 7AACA7AGCA 7CTGC7AA77 CTGGAGCAGC 21420 

GGCCA7G7GC AAGTAGGTGA AGAGAAGAAG A7CCTCGCGC AAG7AACCG7 A77CAGAAC7 214 80 

7AAAGA77CT T77AC7TTCA CAACCAAC7C 7GC7GCCCAA GCT7CACCAG CAG7AGCGAC 21540 

AATCTCACC7 CCT7CC7777 GATAGTCAGC A7CAG7AAAG CCAGAACCGA CACCAGCA7T 2 1600 
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TGTTTCGATA ACCACACGA7 GACCACGACT AACTAAGCTA TGAACACCTG CAGCTGTGAG 21560 

GGCGACACGG TTTTCGTTAT TTTTAATTTC TTTTGGGATT CCGATTAACA TTGAGATAAC 21720 

CTACCTTTCA ATTGACGGTC TTGTTTTGGT TGTCACATTC CAGTTCATAA ATCAAAAATG 21780 

TGACGGTTTC ATTGTATATG AAACCGCTTC AAAAATCAAG AAAAACTTGT CATCCAAATT 21340 

TTTTTATGCT AGACTAGTGA AAATCAAGCT C7AATGGAGG GAAAAGTATG GAATCAATAT 21900 

TTGTGAAATT TGCCCAGTAT CCGTCTATAG AAACGGAGCG TTTATTCCTC AGACCTGTAA 2196 0 

CTTTGGATGA TGCGGAAcAA TGTTTGACTA TGCCTCGGAC AAGGGTAA7A CACGTTACAC 22020 

7TTTCCAACC AATCAAAGC? TGGAAGAAAC CAAGAATAAC ATTCCTCACT TCTACTTGGC 22030 

TAATCCCTTG GGACGTTGGG GAATAGAACT AAAAAGCAAT GCTCACT7TA TTGGAACCAT 2 2140 

TGACTTGCAC AAGATTGATT CTGTTCTTAA GAAGGCAGCT ATTGGCTACA TTATCAATAA 22 200 

AAAGTATTGG AATCAAGGAT TAACGACAGA ACCCAATCCT GCTGTGATTG AGCTAGCTT7 222 60 

TGAGAAGATA GGGATGAATA AGTTGACTGC CCTTCACGAT AAGGCTAATC CCGCGTCAGC 22320 

AAAGG7CATC GAGAAATCAG CCATGCGTTT TTCCCATGCA GAACCATATC CTTGTATGGA 22 380 

CCACCA7GAA AAAGGCCGAA TCGTGACAAG AGTTCATTAT GTCTTGACCA AGGAAGACTA 224 40 

TTTTGCAAAT AAATAAGCAG TTGAAAAGAA ATTTTTCGAC TGTTTTTTCT TCCTCTTACG 22500 

AATAATCTAA GAGAGGAGAA AATATGGAAG CAATTATCGA GAAAATCAAA GAGTATAAAA 225 60 

TCATCGTCAT CTGTACTGGT CTGGGCTTCC TTGTAGGAGG ATTTTTCCTG CTAAAACCAC 22620 

CTCCACAAAC ACCTGTCAAA CAGACGAATT TCCAGGCTGA AGTTGCAGCT CT7TCCAAGC 22680 

ACTCATCGAC CGAAAAGGAA GTGAAGAAGG AAGAAAAGGA AGAACCCCTT GAACAAGATC 227 40 

TAATCACAGT AGATGTCAAA CCTCCTCTCA AATCGCCAGG CAT7TA7GAC TTGCCTGTAG 22S0O 

GTAGTCGAGT CAATGATGCT GTTCAGAAGG CTGCTGGCTT GACAGACCAA GCAGACAGCA 22860 

AGTCGCTCAA TCTAGCTCAG AAAGTT AGTG ATGAGGCTCT GC777ACCTT CC7ACTAACC 22 920 

GAGAAGAAGC ACTTAG7CAA CAGACTGCTT CGGGGACAGC TTCTTCAACA AGCAAGGAAA 22 980 

AGAAGGTCAA TCTCAACAAG GCCAGTCTGG AAGAACTCAA GCAGGTCAAG GGACTGGGAG 23040 

GAAAACGAGC TCAGGACATT ATTGACCATC GTGAGGCAAA 7GGCAAC77C AAGTCAG7AG 2 3100 

ACGAGCTCAA GAAGGTCTCT GGCATTGGTG GCAAAACAAT AGAAAAGCTT AAAGACTATG 23160 

TTACAGTGGA TTAAGAATTT CTCTATTCCC CTAATTTACC TGAG7TT7CT A77AC7TTGC 2 32 20 

CTTTATTACG C7AT777C7C AGCATCTTA7 CT7GCTT7G7 7GGGC7T7GT T77TC7GC7A 2 3280 

G7C7GTCTCT TTATCCAA7T TCCG7GGAAA TC7GC7GG7A AAG77C7AA7 AA7TTGCGGA 23 340 

A7CTTTGGA7 7T7GC77TG7 TT7TCAAAA7 TGGCAACAGA G7CAAGCGAG TCAAAA7CTG 2 3 400 
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CCGGATTCTC TTGAAAGGGT ACGGATTTTG CCTGATACTA TTAAGGTTAA TGGTGATAGT 2 34 60 

CTATCC7TTC GTGGCAAGTC TAACGGTCGT GCTTTCCAAG TCTATTATAA ACTCCAGTCC 2 3520 

GAGGAGGAGA AAGAAGCCTT TCAAGCTTTA ACTGACCTGC ATGAGATAGG ACTAGAAGGG 23580 

AAGCTTTCGC AGCCAGAAGG GCAGAGAAAT TTTGGTGGCT TTAATTACCA AGCCTATCTG 2 3 640 

AAGACTCAGG GAATTTACCA GACTCTCAAT ATCAAAACAA TCCACTCAC7 TCAAAACATT^ 2 3 700. 

GCCAG7TGGG ATATAGGAGA AAACTTGTCC AGTTTACGTC GAAAGGCTGT GGTTTGCATT 2 3 760 

AAGACGCACT TTCCAGACCC TATGGGCAAT TACATGACAG GACTCTTGCT GGGACATCTG 2 3 820 

CACACCCACT TTGAGGAGAT GAATGAGCTT TATTCCAGTC TAGGAATTAT CCACCTCTTT 23 980 

CCCCTATCTG GCATGCAGGT AGGTTTTTTC ATGAATGGAT TTAAGAAACT TCTCTTGCGA 2 3 940 

TTGGGCTTGA CCCAAGAAAA GTTGAAATGG CTGACTTATC CCTTT7CCCT TATCTATGCG 2 4000 

GGACTAACTG GATTTTCAGC ATCGGTTATT CGCAGTCTCT TGCAAAAGCT ACTGGCTCAA 24060 

CATGGGGTTA AGGGCTTGGA TAATTTTGCC TTGACGGTGC TTGTCCTCTT TATTGTCATG 2 4120 

CCAAACTTTT TCTTGACAGC AGGAGGAGTC TTCTCCTGCG CTTATGCTTT TATCCTGACC 24180 

ATGACCAGCA AAGAAGGGGA GGGGCTCAAG GCTGTTACTA GTGAAAGTCT ACTCATCTCC 242 40 

TTCGCCATAT TGCCCATTCT ATCCTTCTAT TTTGCGGAAT TTCAACCTTG GTCTATCCTT 24300 

TTGACCTTTG TCTTTTCCTT TCTTTTTGAC TTCGTCTTCT TACCGCTCTT GTCTATCTTA 24 3 60 

TTTCTCCTTT CCTTTCTCTA TCCAGTCATT CACCTGAACT TTATCTTTGA ATGGTTAGAC 244 20 

GGCATTATTC GCTTGGTCTC GCAGCTGGCA AGGAGACCAC TTCTCTTTGC TCAACCCAAC 24480 

GCATGGCTTT TAATCTTATT GTTAATTTCC TTCGCTTTGG TCTATGATTT GAGGAAAAAC 2 4 540 

ATTAAAGGAT TAACAGTATT GAGTTTATTG ATTACAGGTC TCTTTTTCCT TACCAAGTAT 2 4 600 

CCACTGGAAA ATGAAATCAC CATGCTGGAT GTOGGGCAAG GAGAAAGTAT TTTCTACGGG 24 660 

ATGTAACTGG GAAAACCATT CTCATAGATG TAGGTCCTAA GCCACAATC7 TATAAGAAAA 24720 

TCAAAAAATG GCAAGAAAAG ATGACGACCA GCAATCCCCA CCGAACCTTG ATTCCCTATC 247 80 

TCAAAAGTCG AGCAGTACCT AAGATTGACC ACCTAATTTT GACTAACACG GACAAGGAGC 2 4B40 

ATGTTGGAGA TTTGTCAGAG ATGACCAAGG CTTTCCATGT AGCGGAGATT CTAGTATCAA 2 4 900 

AAGACAGTCT GAAACAGAAC GAATTTGTGG CAGAACTACA GGCGACTCAA ACAAAGGTGC 2 4 960 

GTAGTATGAT AGTAGGGGAG AACTTGCCCA TTTTTGGAAG TCAGTTAGAA CTTCTATCTC 2 5020 

CAAGGAAAAT GGGAGATGGA GGACACGATG ATACCCTAG7 TCTGTATGGG AAATTCTTGG 2 5030 

ATAAGCAATT TCTCTTCACG GGAAATTTGG AGGAGAAAGG AGAGAAGGAC 7TGCTGAACC 25140 
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ACTATCCAGA 


C7TGAAACTA 


AATGTTTTGA 


AAGCTACCCA 


ACATGGCAAT 


AAAAAATCAT 


25200 


CAAGTCCAGC 


CTTTCTAGAA 


AAACTCAAAC 


CAGAGCTTAC 


TCTTATCTCA 


GTTGGAAAGA 


25260 


GCAATCGAAT 


GAAACTCCCC 


CATCAGGAAA 


CATTGACACG 


ACTCGAAGCT 


ATCAATAGCA 


25320 


AAGTTTATCG 


AACTGACCAG 


CAAGGAGCTA 


TACGTTTTAA 


GGGGTTGGAT 


AGTTGGAAAA 


25380 


TCGAAAGTCT 


TCGATAGGAA 


GGATAAATGT 


TGTAGATTAG 


TGAAATAAAC 


TAAAAATTTG 


25440 


TTGCATAATA 


ATGATAAAAA 


TGGTATAATG 


AAAACGTATT 


CAATATTGAG 


GATATAAAAT 


25500 


CATTAAAAAT 


CAGCAAAAGT 


TGTTTTATTA 


GTTAGTTTAT 


AATCTATTGG 


TCTTCTTCAG 


25560 


TCCAGTGTAT 


CTGCTGTGAC 


AGTCACTAAA 


AGTTACAAGT 


ATGATTGGAA 


TACGGTTTGG 


25620 


GAATATAGTA 


CCAACTATCA 


CGACCATCAG 


TATCCTTCCA 


TTCCGTCATG 


CTCTCGTTAT 


25680 


GACAGCTATT 


CTCACTATAA 


AGTTGGCGGA 


GGCTGGAACT 


ACCCTCGTTA 


TGAGGTCATA 


25740 


AACTATTACA 


GCGGAGGCTA 


TTAATTCTTA 


AAGAGTGAGA 


AAAAGGAGGG 


CTAGATATGT 


25800 


TGCAGCTTAC 


TCATGTGACC 


TTAAAAACGC 


GACAAGTCAT 


CTTGCAAGAT 


GTGGATTTCA 


25860 


CCTTTAAAAA 


GGGTAGGCTT 


TATGGTCTTC 


TTGCTATCAA 


TGGCTCTGGA 


AAGACGACCC 


25920 


TGTTCCGTGC 


CATTAGCAAT 


TTAATTCCCA 


TAAGTAGTGG 


AAATATCGCA 


GCCCCTCCTT 


25980 


CTTTATTTTA 


TTATGAGAGT 


ATTGAATGCC 


TCGATGGAAA 


CTTAACTGGC 


ATGCACTACC 


26040 


TTCCTCTTAT 


CAAAAACATC 


TGCAAGTCAG 


GTCTGAACTT 


GAGGGATGAA 


ATCGCCTATT 


26100 


GGGAAATGTC 


TGACTATATC 


AGTCTTCCCA 


TTCGCAAGTA 


TTCCTTAGGC 


ATGAAGCAAC 


26160 


GCTTGGTGAT 


TGCCATGTAT 


TTCCTCAGTC 


AGGCCAAATG 


CTGGCTCATG 


GATCAGATTA 


26220 


CAAATGGCTT 


AGATGAGTAT 


TA7CGACAGA 


AGTTTTTTGA 


TAGGCTAGCA 


CAAATCGATA 


26280 


GACAAGAACA 


GCTCGTTCTT 


TTAAGTTCCC 


ACTATAAGGA 


AGAGTTGG7T 


GATGTCTGCG 


26340 


ATAGAGTAGT 


AACCATTCAT 


CAGGGGCAGA 


TAGAAGAGGT 


TTAGTTTATC 


AAAGATGTTA 


26400 


GTCTATTTTT 


ATTGAAAAAA 


GTTTTCAAAA 


GCCCCTTAAA 


CTGGATTGTC 


TTAGCTTTAT 


26460 


TTGTATCTGT 


ACTCGGTGTT 


ACCTTTTATT 


TAAATAGTCA 


GACTGCAAAC 


TCACACAGCT 


26520 


TGGAGAGCAG 


GTTGGAAAGT 


CGCATTGCAG 


CCAACGACAC 


GCCTATCAAT 


GAAAATGAAG 


26530 


AGAAACTCTC 


CCAAATGTCT 


GATACCAGCT 


CGGAGGAATA 


CCAGTTTGCT 


AAAAATAATT 


26640 


TAGACGTGCA 


AAAAAATCTT 


TTGACGCGAA 


AGACAGAAAT 


TCTGACTTTA 


TTAAAAGAAG 


26700 


GGCGCTGGAA 


AGAAGCCTAC 


TATTTGCAGT 


GGCAAGATGA 


AGAGAAGAAT 


TATGAATTTG 


26760 


TATCAAATGA 


CCCGACTGCT 


AGCCCTGGCT 


TAAAAATGGG 


GGTTGACCGC 


GAACGGAAGA 


26320 


TTTACCAAGC 


CCTGTATCCC 


TTGAACATAA 


AAGC AC AT AC 


TTTGGAGTTT 


CCGACCCACG 


25880 


GGATTGATCA 


GATTCTCTGG 


ATTTTAGAGG 


TTATCATCCC 


AAGTTTGTTT 


CTGGTTGCTA 


26940 
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TTATTTTTAT GCTAACACAA CTATTTGCAG AAACATATCA AAATCATCTG GACACAGCTC 27000 

ACTTATATCC TGTTTCAAAA GTGACATTTG CAATATCCTC TCTTGGACTT GGAGTGGGAT 27060 

ATCTAACTGT GCTGTTTATC GCAATCTGTG GCTTTTCTTT TCTAGTGGGA AGTCTGATAA 27120 

GTGCTTTTGG AC AG TT AG AT TATCCCTACC CAATTTATAC CTTAGTGAAT CAAGAAGTAA 27180 

CTATTGGGAA AATACAAGAT GTATTATTTC CTGGCTTGCT CTTAGCTTTC TTAGCCTTTA 27 240 

"TCGTCATTGT GGAAG7TGTG TACTTGATTG CTTACTTTTT CAAGCAAAAA ATGCCTGTCC 2 7 300 

TCTTTCTTTC ACTCA7TCGC ATTGTTGGCT TATTGTTTGG TATCCAAACC ATTCAGCCTC 273 60 

TTCAAAGCA7 TCCACATCTG AT7CCCTTTA CTTACTTGCG TTCAGTGGAG ATTT7ATCTG 27420 

GAAGATTACC 7AAGCACATT GATAATG7CC ATCTAAATTG GAGCATGGGA ATGGTCTTAC 27 480 

TTCCTTGCCT GATTATCTTT TTCCTATTCG GAATTCTATT TATTGAAAGA TGGGGAAGTT 27540 

CACAGAAAAA AGAATTTTTT AATAGATTCT AGCTTTCCTA TAGGTAGGGA AAATAAGTAA 27600 

AAACTAACAT ACAGAGCGAA TCAACTTGAT TCTCTCTTTT TGATTCGAAA ACCAAACCAA 27660 

AATACAAACA CAAACTTTTC AAAAAATAAC TTTTTATCTT GACAAGAGCT AGAAAACTTG 2 772 0 

GTATCATATA AAAGTTGAGA AAAGCAGAAG TGAGAGCTTC TCCCCTTGTG ACATTAAGTT 27780 

GCCTGGCCCT ACGCATGAAA ACTTTCGAAG AAACGCTATC ATAACCTGCG GCCTTGTATA 27840 

TTTACAAGTC CGCTATTG7T TTTCTCTAAT AAAACAAAAG AGGTGAAAAC CATAGCAAAG 2 7 900 

CAAGACTTAT TCATCAATGA TGAGATTCCT C7ACGTCAAG TTCCCTTGAT TGGTCTTGAA 27960 

GGAGAACAGC TAGGTATCAA CCCACTCAGT GAAGCGCAAG CTTTGCCTCA TAACCCTAAT 28020 

GTTGACCTAG TATTCATTCA ACCCCAAGCC AAACCCCCTG TTCCAAAAAT TATGGACTAC 28080 

CG7AAC-7TCA AA7T7?:AG7 \ crAC.'-AGAA" ^-VAAAv.V-C AA-:G7AAAAA V,.VV\^CC77 2 3140 

GTTACTGTGA AAGAAGTTCG TCTAAGTCCC G 28171 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 7147 base pairs 
(8) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CCGCTCAACT TTTGCAATCA AGGCTAAGTA GACAGCAGCA AATTTCATAT TGTATAATTT 60 

CTGACTCATA C7TCTC7CTT TC7ATGTGTA CTAGTATAAA TAAGAAAAAG AAGGCCGTCA 120 
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AGCCTTCTTT TGATTTATTC TTCTGCTTCA TCTTCTGTAA ATTGACTATT GTACAAGTCA 130 

GCGTAGAAGC CACCTTCCCC CATCAGTTCC TCATAGTTGC CTTCCTCGAT GATATTTCCA 240 

TCTTTCATGA CCAAGATCAA GTCTGCATTT CGGATGGTTG ACAAGCGGTG GGCAATGACA 300 

AAGGATGTGC GTCCTTCCAT CAAACGGTCC ATGGCTTTTT GGATCAATTC CTCTGTCCGT 3 60 

GTGTCAACAG AAGAAGTCGC CTCATCCAAA ATCAAAACCG GTGCATCCTT AAGAAGGGCA 420 

CGAGCAATAG TCAATAGTTG TTTTTCTCTT ACAGACAAGG TCACGGTGTC ATCCAAGATG 480 

GTATCATAGC CATCTGCCAA GCTCATAATA AAGTGGTGAA TTCCCACAGC CTTACTAGCT 540 

TCCATCATTC GTTCATCACT AATCCCTATT TGATTATAGA TGAGATTGTC TCGAATAGTT 600 

CCTTCAAAGA GCCAGGTATC CTGCAAGACC ATTGAAAAGG CATCATCCAC TTCTGAACGC 660 

GTCATAGCCT TGGTATCCAC ACCATCAATG CCAATACTTC CCTTATCAAT CTCATAGAAT 7 20 

TTCATCAAAA GATTGACAAT GGTTGTCTTA CCAGCCCCAG TCGGCCCAAC AATGGCAACC 7 80 

TTTTGACCAG CATGAGCTGT CGCAGAGAAG TCATAGTCTT GAACATTGAC ACCCTCCACC 840 

AGAATTTCTC CTGCTCACAC GTCGTAGAAA CCTGGAATCA GATTCACCAG AGTTGATTTA 900 

CCAGAACC7G TTCACCCAAT AAACGCCAC7 GTTTGACCAG TTTCTGCTTT AAAGCTAACA 9 60 

TGTTCAATAA CTGCCTCCGA ATTTGCCGCA TAGCGgAAGG TCACATCCTT AAACTCGACC 1020 

TGACCTTTGA AGTTTTCATC AGTCAGCTGC ACTTGAACAG CGTTTTGGAT AGAAGAATGC 1080 

AAATCTAAAA CTTGATTAAT CCCCTTAGCA GAG AC CAT AG TTCGGGCAAG AACGATGAAG 1140 

AGTCCTCCCA TGAGAAGGAA GCCCATGACA ACCTACATGG CATAAGACAT GAAAACAATC 1200 

ATGTCACTAA AGAGAGGCAG ACGCGCTATC GCACCAGCCT CGTTAATCAC ATAGGCCCCA 1260 

ATCCAGTAAA TCGCCACACT CAAACCACTT GAAATCCCCA TCATGATAGG ATTCAAAATA 1320 

GCCATAAGAC GGTTGACAAA CAAATTCAAA CGGGTCAATT CATCATTTAC TGCTGCAAAT 13 90 

TTTTCATTTT GATAATCCTC TGCATTGTAG GCACGAACGA CACGAATACC TGTTAAACTC 1440 

TCACGAGTGA TACTGTTCAG TTTATCTGTC AGCCCCTGAA TCAAGGACTG TTTTGGAAAC 1500 

GCTAGCGTCA TCAAAACGGT CGTCATCACG ACGTTGATAA TCACTGCCAC AAGTACGGCC 1560 

CAGAGCCAGT ATTCTGAATG ACCTAAAATC TTCCCAATAG CCCAGATAGC CATAATTGAA 162 0 

CCACGCGTTA CCAC7TGCAA GCCCATAGTA ATCAACATTT GAACTTGAGT AATGTCATTG 1630 

GTAGTACGCG TCAACACGCT AGGAATTGAA AATTTCTTAA TCTCTCTCTC CGAGTAATCC 1740 

AAAACTCGC? TAAAAATATC ACTTCTCAGC C7ACTAGTAT AAGAAGCCGC CACTCGGGAT 1900 

GCAAAAAATC CAACTGCAAC TACGGACAAG AAGGCAAGAA AGGACATTCC CATCATCATG 1360 

CTTGCCGACT GCCACAACTC ATCTAAATTA GTT7CTTGAC TACCTAGCAA ATCCGTAATT 192C 
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TTCGAGATAT 


AGGTCGGCAC 


: TTCCAACTCT 


' AG AT AG AC CG AAAAGCAAGT 


1 AAAGAGAATG 


I960 


GCTAGTAAAA 


TCATCCCCCA 


TTCTTTTCTA 


CTAATTCTTT 


' TGGCTAATTT 


CTTTATTCTC 


2040 


TCC7CCTATT 


CCCTTGATAT 


TTTGCCTGTA 


GTTGACCGAG 


AACCTTCTCA 


AAAATCAGTA 


2100 


ATTCATCTTC 


ATCAATCTCT 


TCCATCAACT 


GCTTGTCTAT 


GCGTTCAAAA 


AAAGCCTTAA 


2160 


CCTGTTGCAT 


CTGAGAACGT 


GCTTTGTCCG 


TCAGACGAAC 


AAACTTAGCC 


CGCTTATCAA 


2220 


"CAGGACTCGC 


CTCCAATTCC 


ACCAAACCAT 


TTTCCAC7AT 


ACGCTTAACC 


AGATTACTAG 


2280 


CAACAGGCTT 


GGTAATATTG 


AGTTCCTGCT 


CCATATCTTT 


AATCAAGACC 


AAGTCTTGGT 


2340 


TTTTCTCGCG 


ATTATCCAAA 


AAACGCACAA 


CCTCACCTTG 


CGGCCCACCC 


ATAAATTCAA 


24C0 


TGCCGCAACG 


TTTGGCTTCC 


TTTTGCACCA 


TCAGGTGAAT 


TTGATGACCA 


AAACGCTTAA 


2460 


AGACTAACAT 


CCGTTTATCC 


ATAATCTCCC 


CCTTCTAAAT 


AAAAATACTT 


CTCTGGAGAA 


2520 


TAATTAAATT 


TCTATGAGAA 


CTATTTTCTT 


GATTAAAAAA 


ATCCCAAGTG 


ATTTTCTCAC 


2580 


TTAGGATCAT 


GTTCTATAGG 


TTAAATTAAA 


ACCCATCTAC 


GTTCGTATAA 


ATCTTTTGGA 


2640 


CG7CTTCGTC 


G7CTTCAAGA 


ACGCTGTAAA 


GTTTTTCAAA 


GGTTTCAAGG 


TCTTCGCCTG 


2700 


ACAATTCCAC 


TTCTGACTGA 


GCAATCATTT 


CCAA7TCAGT 


CACTTGGAAT 


TCTTCAATAC 


2760 


CAGACTCACG 


CAGGGCAACG 


ATAGCCTTGT 


GAAGGTCAGT 


TGGCGCTGTG 


TAAACTGTGA 


2320 


TTG7ACCTTC 


TTGTGCTTCT 


ACGTCATCCA 


CATCCACATC 


CGCTTCGACC 


AATTGCTCAA 


2380 


AGACTGCCTC 


CGCATCTTCA 


CCTCCAAATA 


CAATAACACC 


TTTGTTGTCA 


AAGAGGTAAG 


2940 


AAACAGAACC 


TGAAGCGCCC 


ATGTTTC CGC 


CGTTTTTACC 


AAAGGCTGCA 


CGGACATTGG 


3000 


CTCCTGTACG 


GTTGACGTTA 


GAAGTCAAAG 


TATCCACAAT 


TAGCATAGAG 


CCATTTGGCC 


3060 


CAAAACCTTC 


GTAACGTCC? 










3120 


TCGCTTTATC 


GATAATGTGT 


TTTGGCACTT 


GGGCTTGTTT 


AGCACGGTCG 


ATAACGAATT 


3180 


T C AAA fir T (1 A 






CACCTTTTTT 


AGCTGCTACA 


TAGATTTCTA 


3 240 


CACCAAATTT 


TGCATATACT 


TTACAGTTAC 


CTCCATCTTT 


AGCCGTTTTC 


TTGGCTACGA 


3300 


TATTGGCCCA 


TTTACGTCCC 


ATTAGGAATC 


TCCTTTT7TC 


ACATTTTAAT 


CTTTCTTATT 


3 36C 


ATAACACAAG 


TTTTTTTGA7 


TTTCACTAGA 


GGAAATGGAT 


TTTATTACCA 


AATCAAGCTA 


3420 


GGATAGCACT 


TTACCTGCTA 


AGATGGTCTT 


GCCTTTCTAT 


CTTTATCAAC 


AGGCACTCAT 


3480 


CCACATTCAA 


AAAACAAACT 


AGACCATTAT 


CTGCAAATAG 


AAACTTTCAG 


CCAAGTTTGA 


3540 


CAAAGTCAGC 


TCAAATTACT 


CTTTCAACT7 


TGTAGATATA 


AGCGACAAAA 


ACAATCATAC 


3600 


TGCACCTTTT 


GTTGACAGTC 


TACTCCAGAC 


ATATCATAG7 


TCAACTAAAT 


ACTTTGAAAT 


3660 
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TCAACACTTC TTATAGGCGC TATTGTATTC TAAGAAATCA ATAGAA^AG? TTCTAAGCAA 37 20 

ACCTCTAATA CTCAATAAAA ATCAAAGAGC AAACTAGAAA GCTAGCCTCA GGTTGCTCAA 37 80 

AACACTGTTT TGAGGTTGCG GATGGGGCTG ACATGCTTTG AACACATTTT CGAAGAGTAT 3840 

AATTTACCTG TTCCCAAGAT GGACAACTTA GACTAGTACA CTGGCACTTC TAAAACATTG 3900 

CTAGCAATTG ATTTGTTCAT ATTTAATTTC ATTTTTTCCA TAAATGGGTA TTAGATATAA 3 9 60 

ACAGCAAAAT ATTTCCGATA CGTGTCGTTC TTGAATTTCC AATCATCTAA AACAAGTAAA 4020 

GCATAATCAA TCCCCTGTAT ATCAAGGAAT TGGCTACCCT TTTTACTTTT TTACACATTC 4080 

TGTTTGATAG ATTCAT7TTA ACATCACGAG CATACTCCAA TGGAAATCGC TAGGCAAGAG 4140 

ATAAACTTTC AGATATCCGC AGAGAGATCA TCGCCTCTTT TTGTCGCAAG CATTCTCCTC 4 200 

TCCTAGTCAT TTTCTACC7T ATCTTCTACC TGAGGATAGA CACTTGTTCC CCAAATAGAA 4260 

ATCG7CCGCT TACGCACTAG TGGCAAATCG GTTTTTTCAT AAACCGTACG CCACCATTCC 4 3 2C 

CAGGCAAGCC CGGTACACTC 7CTAATTTTG ACACAGAGAT TACGAACATT CCCTT TT AAA 4 3 80 

GGAATACTAG TGGTAAAGTG ACCCGTTAAA TCCTGCC CAT 7TC7GTCCCA AGCCTTAGGA 4 440 

GTCAAGACTT CCTTACCTTG ATGATCATAG GATAATTCAT TCCAAGTAAT ATAATATTGG 4 500 

GCAACATAGG CACCACTATG ATCCAGCAGT AAATCTCCGT TTCTGTAAGC TGTAACCTTA 4 560 

GTCTCAACAT AGTCTGTACT ATTTTGAAAG GTCGCAACTA CATTGTCACG TAAAAAAGAA 4620 

GTTGTATAGG AAATCGCCAA GCCTGGATGA TCTCCTGTAA AGCGACTCCC TTCTTGAATC 4 680 

AAGTCCTCTA CCATATCCAC CTTGCCTGTT ACAACTCCGG CACCCGAACT TGGGTCGCCC 4 7 40 

CCTAAAATAA CCGCC7TCAC TTCTGTAT7C TCCAAAATCT GTTTCCACTC TCTCTGAGGA 4 800 

GCTACCTTGA CTCCTTT7AT CAAAGCTTCA AAAGCAGCCT CTACTTCATC ACTCTTACTC 4 8 60 

G7GG77TCCA ACTTGAGATA GACTTGGCGC CCATAAGCAA CACTCGAAAT AT AG AC C AAA 4 920 

GGACCCTCTC CAGAAATTCC TCTCTGTTTT AAATCCTCTA CCGTTACAGT ATCTTGAAAC 4 980 

ACATCTCCTG GATTTTTAAC AGCATCTACG CTCACTGTAT AATAAATCTG CTTAAAATTA 5040 

ACAATCTCAA TCTGCTTTTC GCCTGAATGG ACAGAGTTAA AATCAATATC AACAGAATTC S10C 

CCTGTCTTTT CAAAGTCAGA ACCAAACT7G ACCTTGAGTT GTTCCATGCT CTGAGCCGTG 5160 

ATTTTTTCAT ACTGCATTCT AGCTGGGACA TTATTGACCT GACCATAATC TTGATGCCAC 5220 

TTAGCCAACA AATCCTTTAC CGCTCCGCGA ACACTTGAAT TGCTGGGGTC TTCCACTTGG 5280 

AGAAAGCTAT CGCTACTTGC CAAACCAGGC AAATCAATAC TATAAGTCAT CGGAGCACCA 5340 

TCGACCGCAA GAAGAGTGGG ATTATTCTCT AACAAGGTCT CATCCACTAC GAGAAGTGCT 5400 

CCAGGATAGA GGCGACTGTC GTTGGTAGCT GTTACAGAAA TATCACTTGT ATTTGTCGAC 54 60 
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AAGCTCCGCT TCTTTCTTTC GATAACAACA AACTCATCCC CTAGCTGATT ACCCTCTTTG 5520 

ATGAAACGAT TTTCAATACT TTCTCCCTGA TGGGTCAAGA GTTTCTTTTT ATCGTAATTC 5580 

ATAGCTAGTA TAAAGTCATT TACTGCTTTA TTTGCCATCT TCTACCTCCT AATAAGTTCC 564 0 

TGGATTGAGT TGCATAAACT CAGACTTGTT CAGCGAAATC AGCCGTGGTT GGACTAAGTA 5700 

ATCCAAAATT TCCTCGTACA ATTCTTCTGA GACATTGCGT CGCCGTCTGG CTAAATAAGA 57 60 

AGTCGGAATG ACCGTATTAT CCAACATAAA TACCTTATCT AAGTCAATCA AGGTTGGTCT 5820 

TCTAAAAGCA TTACGAGCTA GATCCGGCTC TTCTATCATA AAGTTCTTGA CCAAACGTCT 5880 

GGTCAAGAGA GCTGGTTTGA AGGTCTGATT TTTAACCAAC TCTTTGTTTT TAGTCATGCT 5940 

GTTGTCAATA CAGATATACA TATGATTCTT CACAGCCAAA TCGCTACTAA TAGTCGGAAA 6000 

AGGCAAATAA AGAGCTACAA CATCTCCTCT CTTAATCAAG CAAGAGCACC CCCTTTTCTC 6060 

CTAATGTAAC ATAGACAGGA TTGACCAAGT CTTCTGATTG ACTCAGAATT TCCAAAGTTT 6120 

GAGTTTGGCG CGCTGTCAAT TTAGTAGCAT CTTGTCTCTT CAATACAAAA TGCTTGTCGC 6130 

CAATAACCTT GACAATATAA TCCTTCTCCA AAGCTGACTG GTAAATCCAC ATCAGATGTT 6240 

GTCTGTCCTG AGAACTCAAG AGAGAAGGAT TTTCAAGCCT CCCGATAGTC TGATAAAAAT 6300 

CAAAAACAGG AGCTAACTCC TGCCAATCTC ATTGCCTAGT TGTCAAGGCT AGAAAAAGGG 63 60 

CTTTGCGAGC TCATACTTCT TGGTTAGCCT TGAGAGTTAC TTTCCCCTCC AACTTTTTTA 6420 

GAAATCGGGA AACTCCAGAA AGCAAATTTT TCTCTAACTG CGAGAAATAA AAACCTTTCC 6430 

T7CCCACACA TAAGTCTTTC ATGTCGCTTT CTCTAGCAAA TAAGAGCTCA AACATTTGAT 6540 

AGTAAAAGAA AAATATCTGG CACTGGGTCG CGCTCATCTT TTCCTTATCG GCTTCTTTTT 6600 

TTAACCAGAG CAAGGGCGAC AGGTAGCTGG ATTGAGACAT TTCCTCTACC TCCTACTCTT 6660 

TTTTAACTGG AGCATCTGCA CTAGCTGCCA CTTCTTTTGA CTGGATACTT TCCCACTGGT 67 2 0 

TAATCTCCTC TGAGATAAGA CCTTCGCATG TCTTGACAAA TAGGGCAAAA GCCTTGGT^T 6780 

TTCCTGCATA TTTCTCCGTT TGGCATTGAT AGAGGAATTT TTCTTTCTCC AGGAGTTGCG 6840 

CACTTTTTTC GTAAGAAATC CAATTTTCCT TTGCATTATA CAAATTGATA ATCCCCTCAC 6900 

ACAGCAAGCC GAGACTGGAT AAGGCAACCG AAATCAAACG GTAGCGATCA CCTGGCATAG 6960 

GAATAGCACA AAAGACAGCT ATGAGGAAAC CTGCCACGAT TTCTCTTATT TTTAATACCT 7020 

TATAGCGCCT ACGATGTTGA ACGCTTTTCT TTAAAAAATG ACCTATCTGT ACGTCTAATC 7080 

GCTCTGTCAG GTACATTTCT TCTGGCGTCA TATTCGTAAC TCCTTTCATT TACTTTGATA 7140 

ATCAGGG 714 7 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i» SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 755 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

CCGCATGGGA TTGGTGTCCT TTTGGGCAAT CTCTTTGACC AAACTGGAAA CATGTTTTAT 60 

GCGCCTGCCT TTACTGCCCT TGTCGGCGGT ACGTCTATAT GATCCTAGTC GCAAAAGTTC 120 

CCCGCTTTGG AGCCATTACC ACTATCGGCC TTGTCATTGC CCTCTTTTTC TTGGGAACTA 180 

AACACGGTGC TCGTTCCTTC CTTCCTGGAA TTATCTGTGG CCTCCTAGCA GATGGAGTAG 240 

CTCATTTAGG AAAATACAAG GACAAAACAA AGAACTTCCT TTCTTTCATT ATTTTCGCCT 300 

TTAGTACAAC AGCACCAATC TTCCTTATCT CGATTGCGCC CAAAGCCTAT ATGGCTACTC 360 

TTCTGGCAAG AGGAAAATCC CAAGAATATA TCGACCGTAT CATGGTCGCT CCAAACCCTG 420 

GAACTCTCCT TCTATTTATC GCAAGTATTG TCATCGGAGC CCTAGTGGGT GCCTTGATTG 4 80 

CACAAGCCTT GAGTAAAAAA TTTCCCCAGA AAATCTGATC ACTTAAAAAG AGCCACGCGC 540 

CTCTTTTTTA TTTATGGCTC AATTTCTTAG TCAAGAAATC TCCCAAGAAT TGGATTGCAA 600 

AGATAATCAA AATGATAATA ATGGTTGCCA AGATGGTCAC ATCGTGATTG TAGCGGTTAA 660 

ATCCATAAGC GATGGCTACG TTACCGATAC CACCAGCTCC AACCGCACCG GCCATAGCTG 720 

TTtcCCAACA AGGGaAtCAA GGTcACAGTC GTCAC 7 55 



(2) INFORMATION FOR SEQ ID NO : 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3010 base pairs 
(BJ TYPE: nucleic acid 
(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xii SEQUENCE DESCRIPTION: SEQ ID NO : 25: 

TTCAATTGGT ATCTCAATCA ACGGTCTTCA CATGCTTTCA ACTGGTTTGA CTCTTGAAAA 60 

AGCGAAAGCT GCTGGTTACA ACGCAACTGA AACAGGCTTT AACGATCTTC AAAAACCAGA 120 

ATTCATGAAA CATGACAACC ATGAAGTAGC AATTAAGATT GTC7TTGACA AAGATAGCCG 130 

TGAAATTCTT GGTGCCCAAA TGGTTTCACA TGATATTGCA ATTAGCATGG GAATCCACAT 240 

GTTCTCACTT GCTATCCAAG AGCATGTGAC AATTGATAAA TTGGCATTGA CAGACCTCTT 30C 
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CTTCTTCCCA C ACTTCAACA AACCATACAA CTACATCACA ATGGCTGCCC TTACGGCTGA 3 60 

AAATTAAAAA TGAATGAGCT ATCTGGCCTT AAGTTAAGGT CAGATAGTTT TTAGCTAATT 4 20 

TGTCCCCATA CAATTATAGT TTTTTTATCT TGTCCTTCAT TCTGTTCTGA CTTAAAATGA 4 80 

AAAGGTAGCT ACCAATACAA ATGATGAGGA TAAAACAAAT GACTGAAAAT CGTTATGAAC 540 
TAAATAAAAA CTTGGCACAG ATGCTCAAGG GTGGTGTTAT TATGGATGTG CAGAATCCTG 500 
AACAGGCTCG TATCGCAGAA GCTGCTCGTG CGCCAGCTGT GATGGCCTTG GAACGAATTC 660 
CCCCTGATA? TCGTGCAGCT GGAGG AGTTT CCCGCATGAG CCACCCAAAC ATGATTAACG 720 
AAATCCAAGA AGCCGTTAGT ATTCCAGTAA TGGCTAAGCT CAGAATCGGG CATTTTGTTG 7 80 

AAGCTCAGAT TTTAGAGGCT ATTGAAATTG ATTATATCGA CGACAGTGAA GTTCTATCTC 840 
CACCTGATGA CCGTTTCCAT GTGGACAAGA AAGAATTCCA AGTTCCTTTT GTCTGTGGTG 900 
CTAAGGATTT CGGTGAAGCC TTGCGTCGTA TCGCTGAAGG TGCTTCCATG ATTCCTACCA 960 

AAGGAGAACC AGGGACAGGG GATATCGTCC AAGCTGTTCG TCATATGCGT ATGATGAATC 1020 

AGGAAATTCG CCGCATTCAA AACTTACGTG AGGACGAGCT TTATCTTCCT GCCAAGGATT -.080 

TGCAAGTCCC TGTAGAATTG GTCCAATATG TTCATGAACA TGGAAAATTG CCAGTTGTAA 1140 

ATTTCGCTGC TGGAGGTGTT GCAACGCCAG CAGATGCTGC GTTAATGATG CAATTAGGCG 1200 

CAGAGGGGGT CTTTGTCGGT TCAGGTATTT TCAAGTCAGG AGATCCTGTT AAACGAGCGA 12 60 

CTCCCATTGT TAAGCCTCTG ACTAACTTCC GTAATCCTCA AATCCTAGCT CAAATCTCTC 1320 

AAGATTTACC AGAAGCCATG GTTGGTATTA ATGAAAATGA AATCCAAATT CTCATGGCTG 1380 

AACGAGGAAA ATAGATGAAA ATCGGAATAT TGGCCTTGCA AGGGGCCTTT GCAGAACATG 1440 

CAAAAGTCCT AGATCAATTA GGTGTCGAGA GTGTAGAACT CAGAAATCTA CATCATTTTC 1500 

AGCAAGATCA GAGTGACTTG TCGGGTTTGA TTTTGCCTGG TGGTCAGTCT ACAACCATGG 1560 

GCAAGCTCTT ACGTGACCAG AACATGCTAC TTCCCATCCG AGAAGCCATT CTATCTGGCT 1620 

TACCAGTGTT TGGGACCTGT GCCCCCTTAA TTTTGCTGGC TAAGGAAATC ACTTCTCAGA 1680 

AAGAGAGTCA TCTAGGAACT ATGGATATGG TGGTCGAGCC TAATGCTTAT GGGCGCCAAT 1740 

TAGGAAGTTT CTACACGGAA GCAGAATGTA AGGGAGTTGG CAAGATTCCA ATGACCTTTA 1800 

TCCGTGGTCC GATTATCACT AGTGTTGGTG AGGGTGTAGA AATTTTAGCA ACAGTGAACA 1860 

ATCAAATTGT TGCAGCCCAA GAAAAAAATA TGTTGGTAAG 7TCT7TTCAT CCACAATTGA 1920 

CTGATGATGT GCGCTTGCAC CAGTACTTTA TCAATATGTG TAAAGAAAAA AGTTGAGATT 1930 

GAATTTCTCA ACTTTTTTAC ATGTAATAAA CAATAGCGA7 GTATTGAAGT GCGGACGCAG 2C40 
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CTACGATAAA GAGATGCCAA ATCATGTGCA AATAAGGTTT 7TTCTTGGCA TAAAATCCAG 2100 

CTCCAACTGT ATAACAGAGT CCGCCAGTTA CCATGAGACT CCAGAAAACG GGTGTCGTTT 2160 

GACTGATAAT GGCAGGAATG ATAGCCAGAA CCAACCAGCC CATAATCAGG TAAAGAGCAA 2220 

GGCTAAATTT CTCATTGACC TTTTTAGCAA AGATTTTATA GAGAATACCA AAGATGGTCG 2280 

TTCCCCATTG GATGACAATA ATCAGATAGC CAAACCAGTT ATTCATCAAG GTCAAGACAA 2340 

CGGGCGTGTA TGAGCCGGCA ATCGCAACG7 AAATCATAGA ATGGTCAATG ATTCGCAAAA 2400 

CATATTTGTG GGTCGAACCA TAGGCCATAG ACTGATAAAT GGTCGATGAT AGGAACATGA 2460 

GAAAGAGACT GATGACGAAA ATGGAAACGC CGATAGAGGA TAAAAATCCG TGTCCTTCAT 2520 

AACTATAGA7 GGATGAAATA GGCAGCAAGA TAAGCATGAT GACTCCACCC ACAGCATGGG 2 580 

TCACGCTATT AGCAATCTCC TCTCCAAAAC TGAGTTGTTT GCTGAGTTTA AGACTAGTCT 2 640 

TCATTGGATT ACCTCCTCTT GAGTATGATC GATTAAGTCT AGAGTTTGAT GATAGAGTTT 27 00 

AACCGTTTGG CAGCTGGTTT GGATAATAGG GTTAGCTGCG TCAATTCCTT CGTTCATGTA 2760 

GTCCACAAAA GCATCGTAGA GTTGGTCTGA ACTTGCTTGA GTTTCTAGAG TATTAAGTGT 2820 

CTCGGCTA7T TCTTCAATAG AAAATACAGA CTTGAGGGTT GTGATAGCAA TCAAACGCGC 2880 

AATCTGTTGG CGTTGGTATT TTTTTTTGTC AGCCTTTGTC AGGTAACCAT TTTTCACATA 2940 

ATTGTTGACC ATAGATGCTG TTAGGCCCTT GTCTTTATTA GGAGAGATAG GGGCGCAGAC 3000 

CTGATTGACA 3010 

(21 INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(Ai LENGTH: 15213 base pairs 
( 3 > TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 26: 

CATAAATCGG TGCAAATAAC TTAATAGTCA AGTAGCCATT TCTTTCCTAT TTACCTGAGC 60 

CATATTCCCT AGACGAAAGA AT ATT ATT AT CAATCAAATC ATTGAATGAA CGTAGTCTTT 120 

CAACTTCTTC TACTGTTAGA TTTCTGACAA CATTTGTTGC ATAGACCTTA TTTCCATCAG 180 

GATCAGGATG GTACTCATTT GTAACTTTTC TAAGAAGTTG TTGTTTTTGA TTCGTATCCA 240 

ATTTAAGAAT TGAATTTCCT TCGAGATATT CCAACATATA AACAACGTCA AACATGTTGT 300 

GGACATATTG CTTCAAATCA TCTGCATTAT TAAATCTTGT AGTTGGATCA AGTACTTGTA 3 60 

ATCGTCGACT TTCTGTACTA TCAGATTTTG AATGTTTCAA GATGGAGTTG ATGGTAATGG 4 20 
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TPPP ATP ATP 


tpp a tpptpt 


GCTGCTTCVTA 


ATAATCCTTT 


AGCAAAGAAC 


TCTGGTCCCA 


480 




'"PPAPPATAT 


CCTCCAAGAT 


AAATCTCCTG 


ATCTGAGTCA 


TCTGTCATCT 


540 


CATtA-OTA 1 A 


AO 1 AA 1 AUL I 


P P AT PPTT A T 


CCAACATTCG 


ATAACCCATA 


TAATAAACTG 


600 


CATCACCTGT 


ft ff A T ft ft f"V* ft 

AuC A l AAUUA 


lul 1 VJrt 1 


TATCCCCAAC 


TTT ATTTCC A 


ACAGGTCCAA 


660 


AGAAATGTTG 


CAT i vj(_AOUA 


TTTPPATTAT 


CAAAATCTGC 


CACTTCTGTA 


GCTTTCCCTA 


720 


CGGTATTATC 


* T /"* /"* X ft ft *P 

ATCGCCAAAT 


TT 1T1 APP AT 


PPT VAAGC AA 


AATATTTCTA 


TAAAGTTTTT 


780 


CACGTGCATT 


GTCGTCTAAA 


ft*P ft AT APP 


AATAATCGTA 


GTGATCTCGC 


TGACGTTTGG 


840 


CTGTTTCAC-G 


CGCATTTTCT 


TP A APAAAAT 


CATTGAGAGC 


CTTGCCCGCT 


TTATGGTCAC 


900 


TACTGCGGTA 


f l^f* »mr ft T A A 

GCGATCATAA 




PTACACTAGA 


CATGCTCCAG 


ATGACAAATA 


960 


CGGATCTCTC 


TGGCAAGGTC 


ft r^/* ft f ft cine a 


APAPPATATT 


GCGGTATTTC 


CATGTGGCAC 


1020 


TCGTGATACG 


» Am* ft ft ("* a 
A rt- AT AAA A 


PPP AT APAAT 


ACTTCGTGCC 


AGCTAACCCT 


TGCTTCGTTT 


1080 


TCACCTCTTC 


GAT AGTGGA I 


'PTTTrTTf*r ft. 


P AATGT AAGP 


CTTAGTCTCT 


GATTTAAACC 


1140 


AGTCATTATT 


GCTTGTATTT 


r*r*T» & ft ft ft ft A 


^*TTTTPPPT A 


ATGTTCCAGC 


GTGCTAAACA 


1200 


AATCTGTCGT 


TCGATU1 TO A 


nrrm apap 


TGATACCATA 


ACTATCGACA 


TTATTCTTAG 


1260 


CTAGAAGATT 


ft ft ft/*f*f"*A 

GTTAAAGCCA 


fftTTTAPPPA 


APTPAATCAG 


AGTATCTAAT 


GGTGAAGCAT 


1320 


TCCCCTTACC 


AAAGAAGTCC 


» & ft T , ^/~T* ft O ft 
AAA 1 uLt 1 


PAAPTAPGTP 


TTTGACATTC 


ACCTGACCAT 


1380 


AGCTAAAGTT 


ATACCACCGT 


TPPAPATAP.P. 


TPAAGCCAAG 


TACCAACGCT 


TCCTTGTTGC 


1440 


GTTTGATTTT 


ATCTACAAGA 


I AALL 1 1 Ua« 


TPApr;PGGT r ' 


AGCACTAGCC 


AGTCCAGCAT 


15C0 


CCGCTGACAA 


GAGTTTTTTC 


AAAV. 1 1» i v- 1 1 


PPACTTCTTG 


TTTTGTTTTG 


GCGAACTGGT 


1560 


CTTCTAGATA 


GAGCTCAGTT 


tj™ nwpni PPT 


TTGGAGAAAT 


ACCCAGCGTC 


TTTCTGATGG 


1620 


CTTCTGAATG 


* m r "ft ft /*/^ 

ATAGTCAACC 


TTTTPTA APT 
ill lu 1 1 


PACGTAAGAC 


TTGCTTGATG 


ATAGAGGTTT 


1680 


GGTCATACAG 


CAATTCCTTT 


GGCGTATAGA 


GAAGTCCAGT 


A7TGCCCACA 


CTATATTC^G 


L / n \J 


CTAATTTGGC 


GAAATCATTC 


TGGTATTTGA 


GATCCAGCTT 


CTCAGATAAA 


TCATCCTTCT 


1800 


AGTGAAGCAA 


GAGTTTGTTT 


GCAGTCTGTT 


TGTTAGAAAC 


AATGTCTGTG 


ATGACTTGGT 


1860 


TGTCCTTCAT 


CATGACTGCT 


GACAAGAGTT 


CTTTTTGATA 


TAAAAGACTC 


TTCTCATTGA 


1920 


CCAGGTTTCC 


GTATTTGACG 


ATGGTTGCCT 


TGTTGTAGAA 


AGGTAGCAAT 


TTTTCAATGT 


1980 


TTTtATAAGT 


CAAGTTGCGC 


TTAGCTTGAT 


AATAGGCCAC 


CTTAGAAAAA 


TCACTGTCTT 


2040 


TTTTGCCACT 


TGTTGAAAGT 


GGCTCCACTG 


TTGGTAAAAT 


GAGAGGATTG 


ATTTCTGCTT 


2100 


TTTTGCTTGC 


AATTTGAGAA 


GCATCTAGCA 


TTGTTCCTCT 


TTCTTCAAAG 


GATTCCTTGC 


2160 
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TGACGACCTC 


ATCCTTGACC 


AAGGTGACAT TGTAGACTCT 


GTTGGCCTTG 


CTGCTGAATG 


2220 


TGTCCTTTAC 


CTTCATTTCG 


TTATAGTCGT AACCAGTGAT 


GGCATTTCCG 


TTGGTTACAT 


2280 


TAACATCGCT 


GAGAACATTG 


GTCAAACTTC CAGCATGCCT 


AACATCACCA 


GAAGTTCGAT 


2340 


CCCACAAATT 


GCCTGCCACT 


CCAGCGACTC TACCAAAGTG 


CTTGACATTG 


TTGATATCAC 


2400 


CTTCAGCATA 


GC7ATCTTGG 


ATCTGTCCAT CTCGGTCTAC 


TAGGCCTGCA 


AGTCCACCCA 


2460 


CAGTCTGATC 


TGAAGTATTT 


GTGTTAGATG AAATGGCTAC 


TGTCGCTTTT 


GACTTAGTAA 


2520 


GTAAACCCTT 


GTCACCTGTC 


AAATGACCGA CCATACCACC 


GATATTGTAG 


GCAGCAGTCG 


2580 


TTTCATAAGT 


GTTCATAATT 


CTTCCCTTGA AACTGCTCTC 


TGTGATGCTT 


GATTGCTCAG 


2640 


CCTTAGCCAG 


CAAACCACCG 


ATACCACGTT CACCAGCCAG 


AACACCATCG 


A CGTG AACTT 


2700 


GCTTAATTTT 


TGTGTTATTC 


TGACCTTCAT TTGCCAGTGA 


ACCGATATCA 


TCTTTCCCTG 


2760 


AAATAGCAAC 


ATTTTTTAGA 


CTCAGTTTTT CTACTGTAGC 


ACCACTCAAG 


TTTTCAAACA 


2820 


GAGGTTTTTT 


CAAATTATAG 


ATAGCATAAT TCTTGCCATC 


TTTTTCACCG 


ATTAAACGAC 


2880 


CACTAAAGGT 


GTCCTTGATA 


TAGGATCTTT CATCAGGACC 


AAGCTCCACT 


TCGTTAGCAT 


2940 


TCAGGCTGGC 


CGCTAAATGA 


TAGCTTCCAG AGGGATTTTG 


GTTTATAGCT 


TTGACCAGAT 


3000 


TACTAAAGCA 


AGTAAAGTTT 


GTTCTTTCTT CTGTTCCCTT 


CTTAGCTAGA 


TAGAAGGTAA 


3060 


AATTATCTTT 


ATATCTGCTT 


TCTATCTCCT GCTGAAGCTT 


CTCTACTTTT 


GCTGTGATTT 


3120 


TATAAAGGAT 


TTTATCATTT 


TTTCTTTCCT CTGATATTGA 


TGCTACTGGT 


AGGTATACAT 


3180 


CTTTGAATGA 


AGAAGATTTC 


ACTTTAACAA AGTAGCTATT 


TGGATTGCTT 


GGAACTTGCT 


3240 


CTAACGAAAT 


GTGTTCTTTA 


TAAGTACCAT TTGACAAACT 


GTATAACTCT 


AGGTCGGAAA 


3300 


CATTTCTTAA 


TTCAAGTCTT 


TTCTCTGGTT CTTCTACCTT 


TTTATCAGGG 


TCTAGTTCAT 


3360 


TTTCTTGTTT 


AATTTCTTCG 


TTTCCATTTG AATTGGATGT 


GTTTGATTCG 


GTTGAAACAT 


3420 


CCTCAGTTGA 


ATTTCCGTTT 


GATGGTTCTG GTTCTGTTTG TCCATTCTCT 


GATGTTGTAT 


3480 


TACCTGAATT 


TTCTGGTTTT 


GTTGCAGTTC CGTTTTTTTC 


TGGTTGATTT 


GATTCTTCAA 


3540 


CTGGTGGTTT 


TGAATCACTA 


GGTTTATTGG ATACTTCTCC 


AGTATTTTCG 


TTAGCTATTT 


3600 


TCCCAGAGTT 


TGTTTGTGTT 


TCTTCTGCAG GTTGAACTCG 


TTTTTCTGTT 


TCTTGATTTG 


3660 


AGGTACCTTC 


TACTGTGCCT 


TCATTTGGAT TTACTGGAAC 


TTCTTCTACA 


GTTTTTTCTG 


3720 


AATTTTCATT 


TTTAGAGTCA 


TTATGTTCTG GTTTATTTGA 


TTCTCCAACT 


GAGGTTGTCG 


3780 


AATCACTAGG 


ATTAC7GGAC 


ACTTCCCCAG TATTTTTGCT 


AGATGTATCT 


GGTGATACTT 


3840 


TCTCTGAATT 


CCTTCTTCAT 


TCTTCTGCAG GTTGAACTGG 


ATTTTCTGCT 


TCTTGAATTG 


3900 


AGGTTCCTTC 


: TGTAGTACCT 


TCATTTGGAT TTACTGCTCT 


TTCTTCTGTT 


GGTTTTACTG 


3960 
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GAACTTCTTC ACTTTTTTCT GGACCTTGTT CTTTGGTCTT CTCAACCGGA CTTTCACCTT 4020 

TTACTTGCTC AATATTACCC TTATATTCTG GAACCGGTGC TACCTGCTC? GGTTCACCTT 4080 

TATCACTTAC CACAGTATCT GCCGACTCTG GTTGAACCTC AGTCTCACCT TTGTCGGTCA 4X40 

CAACTGCTTC GGGTAATGTA GGTTGAACTT CTGGTTCGCC TTTGTCACTT ACTACAGCTT 4200 

CGGGCAACTC AGGCTGAATT GCGGGTTCAA CAATAGCTCC AGACTGTACG TCCTTATGTT 4 2 60 

CTACACCACT CTCACCTTCT TCCTTTATAA CTTCAGTTTT TTTAGTACCT TTTTCGACTA 4 J 20 

TTCTTGGACT AGGCGCAGTC GTTGAAGTTG AAACAATTTC TCGCGAAACT TCTTCCTTGT 4 380 

TTACAGAGAA TATTCTGACG ATTTCAACTT TCTTACCTAA TTTACCTTCT TCTTTTACTC 4 440 

TTACAGTTCC TTCAGCTAAA TCAGGATTTT CTTGAATTTC TTCTTGAAAA TCTATTTTTG 4 500 

TCTCCATAGT TTCCTCACGA TATAAGAGTT CAGGTTTGTT CAATTGACCT GATAAAACTT 4 560 

CATCCTCTGG ATTTAATGTA TTTACCCCAG TCTTTTCTTT TGGAGAAATC TTCTCCTCTT 4620 

TCTTCGTTTC TAGATTCTTA TGTTCCGCTA ATTG7TCTTG AGAATCTGAA GATTGTTTCT 4 680 

CTTCTTTTCT TGGATTGATT AATTCAGTAG AGAAAGGTTT TTCAACTACT TGAACTTCTG 4 740 

TCGCCTTAGT TGAAGAAACA GGTGTTTGTT CCTGAATAGC TTGTACTGTT CATGGATGGT 4800 

CTACAAAATT CGGTGTAACA TTATAATCCA CCTTTTGTTG TTTTGTAGGA GTGGCAACTG 4 860 

AACTCTTTTG ATTACTTACT TCAGACTCAG AAGTCGTTTT TCCCTCTTTG ATATATCCAA 4 920 

TATAAGTGTA ACCTGAAATC TCTTTAGGAA GAGGTAATTT TTCTCCAGAG GTCAATTCAT 4 980 

AGTCCGTATT CTAATTTACC AAAACATGAT TTTCTAAAGC ATGGACTGAA ACTAAGACAC 5040 

CATTTCCTAT CCCTGCAACC AATACTAAAT GTAATACCGT TTTATTCTTA ACCTTTTTCT 5100 

TGGAAACAGC AAAAATTAAA ATTCCCATAC CAGCTAAGCT AGCACCACCA ACTAGGGCTT 5160 

GCCTCTCATT CTTGCTTCCA GTATTTGGCA ATTCCGCCAG TTGATTTTGA GAATTTAACT S220 

TATAAACAAG ATAATAAGTT TC AT CATC AT TCTCCACGTA TGTCGGAATA TCATAGACAA 5230 

CCTCCTTCTT TTCTTCTGAT GATAGCTCTG AATCTGCCAC ATATTTATAC TGAACTCCCG 5 340 

CAGTTTCTTG AGCATCCACA GATGAACTAG CTAATACAGA CATAAAAAAT AAACTTGAAA 5400 

TCGTTGCAGA TACAAGTCCT ACTGATAATT TTCTAAATGA AAAACGCTCT TGTTTTTCAC 54 60 

CAAAATACTT TTCCATTATT CCTCCTTGAA ATAAAATTTA TATATGTTAC AAAGACCTTT 5520 

ATT AT ATT AG TGTATTATCT ATTATCTATA GAAAAGGCAG TATACCTTAA TTATACTCTT 5580 

AATTTACAAA AAAGTCTTAA AATTGAGATG CGCTTTCATA CTTTGTTTTA TATTATTTGG 5 64 0 

AGGTACAATA ACACCTACCA TGAAATTTAC ACGGTAGGTG TTACTCATAT CACTAATCGT 5700 
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TCTAAAAATG 


GTTTGAGGCA 


GTTGAGGAGA 


ATTCCTTCTA 


TCCAGCTTCC 


TTGTGCTGAT 


5760 


GAGCGATGGT 


CTTCCTGCAG 


GcrrmrTT 


AGAAAATCTC 


CCACTTCTTC 


TGGTGCGATT 


5920 


TCAAATTCAA 


AGGCTTTCAT 


TTTATAGAAA 


AAGTCGATGA 


GATGATCTGA 


CAGGTATTCA 


5880 


GTTGAAAAGG 


GTACTTCACC 


ACTTTTTCTA 


TATTCTAATA 


AGAGTCTAGA 


AAATCGAGCT 


5940 


TTTTCTTCAG 


GAAGCTCACG 


AAAATAGGAA 


TTGAGGATCC 


AA'CTCTGCTT 


CTGTTTTCTT 


6000 


TCAATTGGAT 


CCTGACTGGC 


AATTCGTTGG 


TCTTTTTCCA 


CCTCT7TTTG 


GTATTGTTTG 


6060 


GCCTTGATAG 


CTCGTTCTGC 


TCTATTTTTA 


CCAAAAAGAA 


TTTTTTCCCA 


CTTGCGTTCT 


6120 


TCTTGAGTCA 


GGGTCTCTGT 


AAAGCCAAAG 


TAATCTTGAT 


AAGCACGCTC 


TGCGGGTCCC 


6180 


ATGGCTACAA 


CCAGATTGTC 


TGCATATTGC 


TTGGCGATTT 


TATCCCTCTT 


CTTGCGTTCT 


6240 


TTCTCTGCCT 


GGATACGGAG 


TTCTTGTTCG 


TAGTCAATTT 


TCTCCTTGCC 


TAGCTTGACA 


6300 


AGGTAGAGTT 


GGTCATCCGA 


TTTCCCAAGT 


AAAAAGGGTT 


TGATACACTT 


TTCAAGGACT 


6360 


TCTTCCATCC 


GAGCCTTTTT 


CTTTGGTTCC 


GCCTTGGTCC 


AACTTCCTCC 


CTGAAAGACT 


6420 


TCTAGGAAAA 


CCTGGTAGTC 


TCTCTCAGGC 


GCAAATTGAT 


TGCCACGATT 


GGGTTTGAAA 


6480 


ACACCTTTTT 


CCCAGAGCCA 


TTTTAGAAGT 


CGCTCGTCAA 


AGTTACTTTT 


ATTGACCTTG 


6540 


ATTTTTTCCT 


TTTTCTGAGC 


TTTTCTGGTT 


AGATTTTCAA 


CCTTTCTGAG 


CAGTTTTTCT 


6600 


TCCTCTTCCA 


ATTGCTGGTC 


AAGGGACAAT 


CGATGAAAAT 


GACGAACACA 


GTCGCTACCA 


6660 


ATTCGAAAGA 


GGCGTTGGCC 


TGTGACACCG 


TTAAAGAGTT 


CATAAGCGTA 


TTTGATGGCA 


6720 


TTTCCACAGA 


CACAATTGCT 


ACGGCCGATA 


CCGTTAAAAA 


TAAAGGAAAC 


TTCATTCCAT 


6780 


TCCTTGGTAG 


CTTGTTdCCA AGTATCCGCT TTCCAACCCT 


GTAAAACTGC 


ATCGTGCAGG 


6840 


GATTTTCTAA 


CTGGAAGTGT 


CATGAGGTCT 


CCTTTCTAAT 


ACTCAATAAA 


AATCAAAGAG 


6900 


CAAACTAGAA 


AGCTAGCCGC 


AATCAGCTCA 


AAACACTGTT 


TTGAGGTTGT 


AGATAGAACT 


6960 


GACGAAGTCA 


GCtCAAAACA 


CTGTTTTGAG 


GTTGTGGATA 


GAACTCACGA 


AGTCAgTAAC 


702C 


CATATATACA 


GCAAGGCGAA 


GCTGACGTGG 


TTTGAAGAGA 


TTTTCAAAGA 


GTATAAGTTA 


7080 


TACTTTTACA 


ACTTGAACCT 


CGTCTTTACC 


GAGTAAAATC 


AAGTATTTTT 


CAATATTTTC 


7140 


AATCGAATAG 


GCTCGTGATA 


AAGCCTCTTC 


GTATAGAGCT 


AACTGACCAC 


GATAGCGGTC 


7200 


TACGAGTTGA 


CTTGGTTCAT 


CATAGCGGTC 


TGTCTTGTAG 


TCGAACAGAA 


CAATTTTGTT 


7260 


TTCGTAAAGC 


AGATAGCCAT 


CAAGGATACC 


ACGGACAACA 


AAGTCTTCCT 


GACTCTTTTG 


7320 


GTCTCGTTTG 


AGCATGGAGA 


AAGGTTGCTC 


GCGATAAAGA 


TGGTCGG7AT 


TAGCAAGAAT 


7330 


TTCCTGACCG 


AGTACTGTGT 


CAAAGAAAGC 


AAGAATTTTA 


TCAAGAT7GA 


TCTTGTCTCT 


7440 


GACAGCTTGG 


CTAGTTTGAA 


CTTGTTTGAG 


TGTTTCTGTT 


AGGCTAGCAA 


GGG TTAGTTG 


7500 
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CTGGCTGAGG 


TCAATTCTCT 


GCATGAGTTC 


GTGAGTACCA 


CTACCAATCT 


CAGCTCCAGT 


7560 




TTGGTTGAAA 


AATCTGGCAA 


ATCGAACCTG 


A7TT7CT7GC 


CTACTGACTG 


7520 


ACCTTCACCA 


GCAATCTCGA 


CACCTTCCAT 


ATCCATAACT 


CGTTCGTAGA 


AT7TCTTCAT 


7680 


TTGACTTGGG 


GTTTGAACAC 


TAGCAAGTTC 


AATACCTGCG 


CCCTGAAGAG 


TATTATAAAC 


7740 


TTCCACCTCC 


TTCAGCATTT 


CCAGAGCTTC 


TTTGATGGTA 


TC7GACTGAC 


GATTGTCTGC 


7800 


TTCr-HACCTA 


"*CTTGGAGAG 


GACTCTTGGT 


TTCCAACTCT 


CCGATAGCTT 


CTCTGGTCAA 


7860 


\- i l \» * iy>j 


CCAATAAAAC 


GATAACTAAA 


GTTGACCTTG 


TCCTTAGTAA 


ACACTTTACT 


7920 




AGCCAATCTT 


GGAAATTCCG 


TCCTTGCACT 


CTAGTATTGC 


TATTTAGTTT 


7980 


rrr at t c t"t v t 1 fi 


GCTGCTGGGT 


ATTCCTTGGA 


TTCCAGCTTT 


TCACGAGAAC 


CCTTGCCGAC 


8040 


A AG AT AG AGP 




CCCGCGTCAT 


AGCAACATAC 


AGCAAACGCA 


TCTGCTCAGA 


3130 


ATAGGTTCGT 


AGCTGTAATT 


CCTCTTCGTT 


CTGCCTATAG 


CTCAGACTAG 


GAATGGAGAG 


3LS0 


*T*TTG ATGGTT 


TTAGGATAGT 


GGTCTTCTAC 


TGCCCCTGTC 


7CCATC7TCC 


CAATATATPT 


322C 




GGATTGTGAC 


GACTGAGAAT 


GACTTCTGAC 


ATAGAGTCTT 


GCTTGTTGAA 


8230 


ATCTTGATCC 


ATATTGAGGA 


TAAAGACGTA 


AGGAAACTCC 


AGCCCTTTAC 


TCTTCTCGAT 


8340 


r.r.Tr atg agg 


TC'ACTGCAT 


CTTTT GGCGG 


TGCGACGGCC 


ACGCTTGCCA 


AATCGTGCTG 


8400 




a rTT r.r.T r aa 


TCATACGAAT 


AAAACGCCAC 


AAACCTTTGA 


AATTGCTCrr 


8460 




TCAGCACGCA 


GTGCTAGGGC 


ATAGACATTC 


GCCTCCCTAG 


CAGGACCATT 


8520 




GGAAGATAGT 


CATAATAAAA 


ACGGTCGTTC 


TAAATCTTCC 


AAATCAAGTC 


35*0 




1 \JU\J 1 » * » v»f\J 


CATACAAGCG 


CCAAGAACCT 


AGGATATCCA 


TGAATTGCTT 


8640 


T AGTTTT T C A 


GCTAGAGCTG 


TGTCAATCAA 


GCCTTTTTGA 


C7ACTTCCCA 


TTTTTTGTGC 


8700 


ATGACTAGT 


TTCTCATAGA 


CATTTTCGTC 






GAAGGGACAA 


8760 


ACGTGC7AGC 


TCATCCTCAT 


CAAAACCAAA 


CATTGGAGAC 




GAAf 'CAAGCC 


8 3 20 


GTAGTCTTGC 


ACCGCATTCT 


GAATCACACG 


AAGAGTGTCT 


AGCATGACTT 


GCACTTCTAG 


8830 


GGATTGGAGA 


TAATTGTTTT 


GCTCTCCGTC 


AGTTTTGACA 


GGAATTCCGT 


ACTCAGACAG 


8940 


GGCGAGGACA 


ATCTGGTCAT 


7ACGACTCCC 


GCTGGAGGTC 


AGAAGGGCAA 


TTTCCTTAAA 


9000 


GGCAACACCT 


TTTTCTTCAT 


GAAGTTTCAG 


AATCTCCTTG 


A7AACTAAGC 


GCATTTCGCC 


9060 


TGTTAGTTTC 


GTTTCTGTTT 


CACTCTCTTC 


TTCCTCACCT 


GTATCCTCCT 


TCTCCTAGAC 


9120 


CACAAATGC? 


GCCTTCTTGT 


CTGGATTGGC 


AGTCAC7TTG 


G7ATTCGCAA 


AAACAAGCTG 


9180 


GTCCTTCTTA 


TCATAGTTGA 


TTTCGCCGAC 


CTCTTGGTCC 


A7CACACG77 


CAAAGACATC 


9240 
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ATTGCTTGCT 


GACAGCACTT 


CTGAACTACT 


ACGGAAATTT 


TCCTTGAGCA 


TAATGAGCCT 


9300 


GCCTTCTTGG 


GGATTTTGCG 


CATAGCGTTG 


GAATTTCTCA 


TTGAAAATCT 


GCGGGTCTCC 


9360 


CTGACGGAAA 


CGATAGATGG 


ATTCCTTGAT 


ATCTCCCACC 


ATAAAGCGAT 


TGTGGCCATT 


9420 


AGACAACAAT 


TCCAGCATCC 


GTTCTTGAAT 


ATGGTTGGTA 


TCCTGATACT 


CATCGACCAT 


9480 


GACTTCATGG 


AAGCGCTCCT 


GATAAGACTC 


ACGAACTTGT 


GGGAAATTCT 


CTAAAATCTC 


9540 


AATGGTGTAA 


TGGCTGATAT 


CAGCGAATTC 


GAAGGCATTT 


TCCTGTCGTT 


TTCTCTGACG 


9600 


ATAAGCCTCT 


ACAAAATCGC 


TCATGAAAGA 


TTGGAAGG7T 


TTAGCTACTT 


TCCAAGTGTC 


9660 


TCCATGATAA 


CGTTCTTGAT 


AGTCGAGAAT 


CGCTATCTGG 


TCTGATAATT 


GTCCTAGTTT 


9720 


AGCAAACTGG 


CTCTT7CTCT 


CTTCCTTGTA 


GGCATCAGCC 


AGGGGCTTCA 


AATCAGCCTA 


9730 


CGGCTGGCAT 


TAGTCAGAGC 


TCGACCGTTT 


TTCTCCT7AG 


AGATGGCGAC 


AACACGCGCA 


9840 


AGCACTGCCT 


GATAAGCCTG 


ACTATCGGAC 


TCCTGATTTA 


GGGAGCCAAT 


TTCATCCAGA 


9900 


ATTAACTGAA 


CATTTTCTAA 


ATACCCACCC 


TTTGCAAACT 


CCTTGGCATC 


GTTATCCAGA 


9960 


TGGTAACGGA 


AAAAGCTTTC 


CAAATCCCAA 


AGGGCTTGTT 


TGATTTGCTC 


CGTCAGTTTT 


10020 


TCTTTTTCAC 


TGGTAAAATC 


ACCTTTCTCA 


AATCCTTTGA 


GGAAAGATTC 


ACTCAGCCAC 


10080 


TTTTCAGGAT 


TACTGGTGGA 


TTGGAGGAAG 


TCATAGATTT 


TATAGACCTG 


CTGGCGCAGA 


10140 


CCCCGTTCGT 


CCTTGCCACG 


CCCAGCAAAG 


TTTTTCAGCA 


AATGACTAAA 


GGTCTCTTTC 


10200 


TGTTTACCTT 


GGTAATGCGC 


TTCAAAGACC 


TCATGAAAGA 


CTTCCTTTTC 


GAGAATAAGT 


10260 


TGCTCGCTTT 


GGTTTTGTAA 


AAT A CGG AAA 


TTAGCTGCAA 


TATCAAGCAG 


ATAACCATGT 


10320 


TTGCCAAGGA 


ATTTTTGTGT 


GAAAGAATCC 


ATGGTTCCAA 


TGGCAGCGTT 


GGCTAGGTCT 


10380 


GCCAACTGGC 


GACCCAAGTG 


TTGTTTGAGG 


TCGACATCAT 


CTGTTTCTTG 


GATTTTCTTC 


10440 


CTGATTTTTT 


TCTCTAAACG 


TTCTTTAAGT 


TCAGTTGCAG 


CCTTGACGGT 


AAAGGTTCAG 


10500 


ATAAAGAGTT 


GAGAAATTTC 


GACACCACGC 


CCCAATTGGT 


CCAGAATGCG 


CTCTGCCATC 


10560 


ACAAAGGTCT 


TTCCAGAACC 


ACCCCATGCT 


GAGACCAGGA 


TATTCTCGGC 


AGAAGTGTAG 


10620 


ATAGCTTCGA 


TTTGCTCGGC 


AGTTTTCTTC 


TGTTCCTTGC 


TCGAATTTGC 


TTCTGCTTCT 


10680 


TGCAGTTTTT 


GAATCTCCTC 


CTCACTTAAA 


AAGGGAATAA 


GCTTCATCCA 


TTCAACTCCT 


10740 


CTCTTATTTT 


TTCAAGCCAA 


GCTTGCTTGA 


GTTTTTCTCC 


GACCAGACGC 


TTGCCATCAG 


10800 


CTAGGTCCAA 


CTTTTCTAGG 


AAACGGGCTT 


GGCCCAGATC 


GTAATTGGCT 


TCAAAGCCTG 


10860 


TAATAGCCTG 


ATGTTGCTGG 


ACGTATGGGG 


CAATGCTTCT 


GCCATTTTCA 


GTATAAGGAT 


10920 


TGATGGCGAA 


CCGGCCTGCT 


AAAATCTTCT 


CAGCAGCTTT 


CTTGTAAAGA 


TAGGCATTCT 


10980 


AGTCCAGTAG 


GAGCTGAAAT 


TCCTCATCTG 


7CAGTTGATT 


AGCCTTGTTT 


TTGTTATAAA 


11040 
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ATTCGCCTAA ATAACTGCTT T C TTT TT CCA AGAAGAGCCC TTGGTATTTC ATACATTTGC LilOO 

TGCCTTCTAC CACTCC7CCT GCCAGACTTT TTACCGCCAT CAGAGATTGG ACAGGTTCAG U160 

CCATTTCCAA GTACATGGCG CCGAAAAACT TCTGCTCCCC TTCTCTTTTT AGGGCAGCAA 1L220 

CATAGGTTCG TAACTCACAA TTGAGCCCAT TAAAGAAATG AGGAAACTGG AACTGAGTCA 11280 

GACTGGATTT GTAGTCTACT ACTCCTATCG CTCCATTAGC TTTCAAACGG TCAATCCGGT 1134 0 

CCACCTTGCC TCCTACAAAG ACACTGCGTC CATTGTCTAA TTGAATAAAG GCTTGGTCTT 11400 

TTCCACCAAA ATTTGCTTCT TCTTTGATCC TTTCGATGGC TGGATTGTGT CGGAGAATAT 114 60 

GTCCAGTTGT CCGTGCAACA TCAAGCAAAA CTTCCTTGGT AAACTGGCCT 7CCAAACTTT 11520 

CTTGATAAAT ACCTTCAAAT TCGCGTTCTT GACTGGTTTC TTGAATAGCT TGTTCTAGAC 11580 

GTTGGTCAAA GGAATCTTCA TTAGGCAACT GTAAGGCGCG TTCAAACATA CGATGCAAGA 11640 

AATTCCCGTG ACTACGGGCA TCAGGATGCA AACGTAATTC CTCCTGCAAG CCTAAAACGT 11700 

AGCGTAGGAA ATAACTGTAT TCATTGCGAT AAAACTCTGT CAAACCCGAC GTAGACAGGT 11760 

AAAACTCCTG TTTGGCAGGA TAGAGACCTT GCAAGGTGTC CTTGGCTAAG GTCTTGCTGC 1182 0 

TTGGACTGG7 TGGGATAGCT GGATTTTCCA GACCTTCCTG ATCTAGTTTT TTACCTATGA 11880 

CACGCGACAG AACCTTGACA AAAGTCAAAT CTTGCTCAGT ATCGCTCATC TCACCCTGCT 11940 

GGTGATAGGC AACCAGACTA GACAAAAGAC TCTGATAGGA CCCCATATCC TCCTTAGACA 12000 

CTCCTTTGTG ATTCATCCTC TTCTCTCTCC CCCTAAATCC AAAATGGATC AACTCTTCAA 12060 

GATAGGCAGA TTCCTTACTT TCACTTTCGT TAAAAAGGCT TGGAGCCGAC AAGAACAACT 1212 0 

CCTTACGAGC AGAATTGACC AAGGAAAGCA TAC7GTAGCG ATTTTTCTTG AGATTTTCAC 12190 

TCCTGGCAAT CAGT .WTTGA ACGCCTTCTT C^CTTGCTTC GTTT." ."TTT "3TCTT7CTT 12240 

CATCTGTCAG AAGACTGGTG TTTTGAGAAA TTTTTGGTAA ATTGTCCTGA GTTAGTCCAA 12 300 

TAG CAT AG AC AAAGTCACCA GTCAATGGTC CAATCAAATC GTAACTCTGC ACCAGAACAG 12360 

TGTCCACTGT TGCTGGAATG GTACGGTATT GGGACAAACT CATTCCAGAA TGGAGCAAGG 12420 

CTAGGAAGTC TTCCAGACTA ACCTGTGAAC CAGCAAAAAC AGTCGCAAAT TGTTCTAAAA 12480 

CATGGCAGAA AGCCTTCCAA ACTTCGCCTT GTCTTTCCTG TTCTACACCT TCCAAAGTGG 12540 

TTGTCAAATC TTGTAACTGC TTGGTCACAG CTCCTTCTTT TAGAAAGACA CTCCATTTTT 12600 

GTAGGAGTTT TTCAGCCTTT TCTTTTCGGC TGGCAAAGAG GGTTTCAAGA GGTCCTAAAA 12 660 

TTCTCAGGCG GAGGACATTC AAACGCTCAA GATTAAATTT TCCATGG7GG CATTTGGTGA 12720 

AGGTTTGCTG AAAGGCTGGC AAGCCATTGA TACCAACATA GCGGATATAT TCCTCAAAAC 12780 
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CATCAATATC AGACTGACTG AGCTCAGTAT ACAAATCACT TCTAAGAAGA TTAATCAAAT 12840 

CCTCCTGACG AAAACGG7AA CGTTTTAAAG CTAAAATAGA CTCGACAAAC TGAGTCAAGG 12900 

GATGATGAGC CATGGCTTCG CTTCTACCAA GATAAAAAGG AATCTGATAC TGGTCAAAAA 129 60 

TGGTTTTGAG AGATAACTGG TAAGAAGCTA CATCCCCCAA GAGAATACGA AAATGCTTGT 13 020 

AGCTCAGGTC TGAGTTCTCA TGTAATTTCT GACGAATACT ACGGGCTACT AGCTCCAACT 13080 

CCTCCTTTTG CGTCAAACAA GACCAGATTT GTAAATTTTC ACGGTCTTTC TCATCGACAT 13140 

CCAAAGCGAG TTCTGAAAAG TCATAAGAAG ACTCCAACAA ACGAGAGGCC TTGTCAAAAC 13 200 

TATCCATCTT CTCATGAGTT TGAGAACAGT CCTGAGCAGG CGTTTGGTAT TTAGAAGCCA 13 2 60 

GATGATGGAG AAATTTTACG CTGGCTTGGT AGAGATTGCC CTCGCTAAAA GGACTGGTAT 133 20 

AGGCTTTCTT ACT AG CAT AA GCCCCGATAA CAATCTCAAC ACCTTTGCCG TGAAGTAAGT 13380 

CCACAACCCG CTCTTCCTCA GCAGAAAAAC GAGTAAAGCC GTCAATGACC AAGGCGATTT 134 40 

GATTAAAATC ACTACTTACC TTGTCATTCT CAATAGCCTC AATCAAATGG GACAACTGAC 13500 

TTTCCTGGGC TAACTGACCT TGATTAAGAT AGGCTGTTAC TTTCTCAAAA ATCAAGAGTA 13560 

AATCCGCCCT CTTATCCTCA TCTGTTAAAT TCTCCAAGTC CAAAAAACTC ATCTGAGATT 13 620 

TGGTCATCTC ATGGTAAACC TCAATTAACT GCTGGATCAA TTGAGGATCC TGCTTAATAG 13 680 

CGCCATAAAC ACGCAAGTCC TTCGGATCCA GTTCGGCAAG GCATTTGTAA AAGGCCAACC 13740 

CAAGACCGAT ATCATCAACA GTACTTTTAG CTGGTAAATC ATTCAAGACC AGATAGCGAG 13800 

CCATTTGAGC AAAGCGCGTG ACGGTAATCG AAAAAGAAGC CTCCTGGGAC AAGTATTCCA 13860 

GCACGGCGCG TTCCT7TTCA AAAGAAAGAG AGTTGGGGGC AATGTAGAAG ACCCGCTTGC 13920 

CAGCTGCAAC TAGCTCTTCT GCCTCTCTTC TTAGAATTTC TGTCAAAGAA GTCCGAATAT 13 980 

CAGTATAAAG TAATTTCATC TCACCCTCGT TGGAATTTTT CATCACCCTA TATTATACCA 14040 

TGATTAGCCT CGTAAATCTG TTAAAATATT TAGGCCATCC TTTCTTTTCT TCATCATCTG 14V00 

CTAAATCTTA AATACTTACC TTTACTTGTA TTAGATAGAA TAAGTCTGGC TACTCAAAAT 14160 

CACATAATAA AAAAGCCTCG GTAACAAGGC TTTGAGTTTT ATGATTGTTT CTTAGGTACG 14220 

GAATACACTT CAATGTGTTG TCCCAGTATC TTAATGTCGA CTGGTAGATT GTCTCATTTA 142 80 

TCGCCATCAA CATCGGACTC TAATTCGATA TCAGAAGAAG TTTTAATATT ACGTGCCTTT 14 340 

ATATATTCAA TATTCTTGAT AGAATGATTG AACTATAGTA AATTGAAAC7 ATAATAGTAC 14400 

ACCGTGGATG CTAAAATATT TCTAGAAATT AATTTGATTT CCCTAATCAA GCTATTCGTA 144 6C 

TCTTATTTCA ATCTACTATA ATAAAATGAA CCAAAAATAG TACACAATGT GGTATAATCT 14520 

TCTTATGGCA TATTCAATAG ATTTTCGTAA AAAAGTTC7C TCTTATTGTC AGCGAACAGG 14530 
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TAGTATAACA GAAGCATCAC ACGTTTTCCA AATCTCACGT AATACCATTT ATGGCTGGTT 14640 

AAAGCTAAAA GAGAAAACAG GAGAGCTAAA CCACCAAGTA AAAGGAACAA AACCAAGAAA 147C0 

AGTTGATAGA GATAGACTTA AAAACTATCT TACTGACAAT CCAGATGCTT ATTTGACTGA 14 7 60 

AA7AGCTTCT GACTTTGGCT GTCATCCAAC TACCATCCAC TATGCGCTCA AAGCTATGGG 14820 

CTACACTCGA AAAAAAGAAC CACACCTACT A7GAACAAGA CCCAGAAAAA GTAGCCTTAT 14880 

TTCTTAAGAA TTTTAATACT TTAAAGCACC TAGCACCTGT TTAGATTGAC GAAACAGGAT 14940 

TCGATACTTA TTTTTATCGA GAATATGGTC GCTC ATT AAA AGGTCAGTTA ATAAGAGGCA 1SO00 

AAGTATCTGG AAGAAGATAT C AC A GC ATT T CTTTGGTTGC ACGTCTAACA AATGGTGAAT 15060 

TAATCGCTCC AATGACTTAC GAAGAGACGA TGACGAGCGA CTTTTTTGAA GCTTGGTTTC 15120 

AGAAGTTTCT CTTACCAACA TTAACCACAC CATCGGTTAT TATAGTAAAA TGAAATAACA 15180 

ATAGGGCGGG GGGGCGAGGG GGGGGGAGGG AGA 15213 



!2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

TTATTACCTG AAA C ATT AAA TTTAATTGCA CATCCCGTTA TCAATTTTAT AATATCATCA 50 

AGATTTTTAT TATCTGATTC AGGAATTTTA TCTGATATAA CAACACCATT TTCAAGATAG 120 

TTCATTAAAT 7ATTTGA7TC ACTAACATTA CTGTTTTGAT CTC rATC'AAC CCAAAAATAA 130 

TGGTTATCGG AATCTAAATA CGATGAGTTT AAAATATTAT TACAAATTAT TTGATTTGC7 24 0 

CCACCAGGAA TATATCTCAC TACTAAATTC TGTTTAAGAT TCTCACTACC TGAATGAGTG 300 

ATAACAAACT CTAGAATATA TTTAGCTAGT CTATCTTCAA CATAAATCAT CTTCCTAGAA 3 60 

TGATACACAT CACCTAATTC AAAAAATGCA TCCTGATAAT CAATATTTTC AATAACATCT 42C 

ACCTTTTCTC CGTTTTTCAC TAAAAGTTTC ACCCCTTCTC TAGGAAAATC TTTTATAAGT 480 

TGTGTAGAAT GTGTAGTGAT AATAATTTGA TGTTTTTTAT TTAAACACTC TTGAAGTAAA 540 

AACTCTTTAA ATTTATAGAT TGCACTCGGA TGAAGTGAGA TTTCAGCTTC ATCTATTAAT 60C 

ATTAATGAAT TTGATTGCGC ATTTACTATA TCATTTACTA ACAAAATAAT TCTAGCCTCA 66C 

CCTGTTCCTG CAAAAGCCTC GGAATATTCT TTTCCAGATT TTTTCATCCA AATAGTTTTC 72 0 
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GAAGCTTTTA TATCATCACC TTTTGAATAC AACTTATGTG TTAAAATTTG AATGTCTGTA 7 30 

TAAGATTCAT CCATTATTTC ACTAATAATT TCACAAACTT TATCATCAAC TTTAACATTA 840 

TCTATAACCA TTTCCTTTTT ATAACGCGTA TAGCTACTTG TATTATTCTT TAAAATATCA 900 

GCAACTGCCT TAGATCGTAA TCTTATAAAA TCTTGTTTAC TACGTTGAGT AGAAATTTTT 960 

TTAAAATTAT AGTGATAGAA AAATAAATCA AAAGCAGAAA CATATTCTTT ACAATCACAA 1020 

AAGACAACAT TTTTTTCAAT GCCATCCCAT CTGTCTGTCG AAGAACTTCC AATATATTTA 1080 

TTTTTGGGTA ATCTTTCCAT CTCATATTGT TTTTGAGGAG CATATGGTTC CCAATAATCT 1140 

AATCCTTTT? TTCTTCCAGA ACCGCCTTTA AGAACTTCTA CATTTCTAGA AGCTTTAATG 1200 

TTATAATATG AATACATTAA ACATTGTTTC CCATCCACTT CA7CTATTTG ATCAACATTT 12 60 

GTACTAAACC AATATTCAGA CACACTTTTA TTCGCTCCAG AACCATATAA AGCTTGTAAA 13 20 

ATTGAAGTTT TATTTACTCC ATATCTATTA CAGACACCTC ACGATTATTT AACTTATAAG 13 80 

TTTTAACAGC TACGGAATCA ATTTCAACAG CAACTTGAAC ATCTATGCCT GATTTTTTAA 1440 

CGCCACTTGT AGTGCCACCT GCACCGTTAA ATAAATCAAT ACCAACAATT TTCCCCATAG 1500 

TATTCTCCTA AAGTTTCTCC TTTTTATTAT AACATTATCA AATGTAAAAC CCAACCCGAT 1560 

AGGGTTAGGT TTTTAACATC ATTTCACCAA CTTCTTCATC TCATCAATAC GTGCGACCGT 1620 

CCCCTCATAT TTACCTTGGT AGTCAGCTTG TTTGTCGCA? TCTTTTTGGA CGACTTCTGG 1680 

TTTGGCGTTG GCTACGAAGC GTTCGTTAGA GAGTTTCTTA CCAACCATGT CCAGTTCTTT 1740 

TTGCCATTTA CCAAGTTCCT TGTCGAGACG GGCCACTTCT TCTTCAACAT TGAGGAGATC 1300 

CCCCAGTGGC AGGTACATTT CTGCTCCTGT GATCACACTT GACATAGCCA GTTCAGGTGC 1860 

AGGGATGGTT GATGCGATTT CCAACTCTTC TGGATTTCTA AAGCGT7TGA TATAGTTGAC 1920 

ATTGCTCTTA AAGAACCCTT CCAACTCGCT ATCGCTTGTC TTAACAAGGA TGGTGATAGG 1980 

CTTGCTTGGT GCTACATTTA CTTCCGCACG CGCATTCCGA ACAGCACGAA TCAACTCTTT 204 0 

GAGACTTTCC ACACCAGTGT GAGCCGCAAG GTCTTCAAAG GCTAGATTAA CAGTTGGGTA 2100 

TGCAGCTGTC ACGATAGAAC CTTCTGAGAT TTGTCCAAAG ATTTCCTCTG TCACGAATGG 2160 

CATGATTGGG TGAAGGACAC GAAGGATCTT CTCCAGCGTA TAGAGGAGAA CAGA7CGACT 2220 

AATGACCTTA TCGTCTTCAT TGTCGCTG7A TAGAACTTCC TTGGTCAACT CAACATACCA 2230 

GTTGGCAAAT TCTTCCCAGA TGAAGTTGTA AAGGATATGA CCAGCCACAC CAAACTCGAA 2 340 

CTTATCAAAG TTTTCAGTAA CTTTTGCAAT GGTTTCGTTG AGATTGTCGA GAATCCAGCG 2400 

GTCCGTCACA TTACCACCCT CACCTGTTGC AACTTTTGTG ACATTGTCAT GCGCCACATC 24 60 

CAGCGTCAAA CCTTCATTGT TCATGAGCAT A7AGCGAGAA ATGTTCCAAA TTTTG7TAAT 2 52C 
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AAAGTTCCAT CAAGCATCCA TTTTCTCGTA AGAGAAACGA ACGTCTTGAC CTGGTGCGGA 2580 

ACCCTTTCAA AGGAACCAAC GAAGGGCATC AGCACCGTAT TTCTCGATGA CATCCATTGG 2640 

GTCAATCCCG TTACCGAGAG ATTTAGACAT CTTGCGTCCT TCCTCCTCAC CGA7GAGACC 2700 

GTGGATAAGC ACGTTTTGGA ATGGCTGACG ACCAGTAAAT TCCAAGGACT GGAAGATCAT 2760 

ACGAGACACC CAGAAGAAGA TGATGTCGTA ACCTGTTACC AACGTTGAAG TTGGGAAATA 2 820 

ACGTTTAAAG TCTTCTGACT CGACTTCAGG CCAGCCCATG GTTGAAAATG GCCAGAGGGC 2880 

AGAACTGAAC CAAGTATCCA AGACGTCTTC GTCCTGAGTC CATCCGTCAC CTTCTGGAGC 2 940 

TTCTTCGCCG ACATACATTT CACCATCAGC ATTGTACCAG GCAGGGATTT GGTGACCCCA 3000 

CCAAACCTCA CG AG AG AT AA CCCAGTCGTG GACATTTTCC ATCCATTGAA CGAAGGTATC 3060 

GTTCAAACGA GGTGGGTAGA ATTCGACCTT GTCCTCTGTG TCTTCCTTAG CAATGGCGTT 3120 

CTTAGCCAAT TGCTCCATCT TGACGAACCA TTGAGTAGAC AAGCGTGGCT CAACTACGAC 3130 

ACCTGTACGT TCTGAGTGAC CAACACTGTG GACACGTTTT TCGATTTTGA CAAGGGCACC 3210 

GATTTCTTCC AACTTAGCAA CGACTGCCTT ACGAGCTTCA AAACGATCCA TGCCTGAAAA 3 300 

TTCAAAGGCA AGCTCATTCA TAGTTCCGTC GTCCTTCATG ACGTTGACTT GTGGCAAGTT 3 360 

ATGACGTTGG CCAACCAAGA AGTCATTTGG ATCGTGGGCA GGTGTGATTT TCACGACACC 3 420 

AGTACCAAGC TCAGGATCTG CGTGCTCATC TCCAACGATT GGGATGAGTT TATTAGCGAT 3 480 

TGGAAGGATG ACGTTTTTAC CAATCAAGTC CTTGTACCCC CCCTCTTCTG GATTAACCGC 3 54 0 

AACCGCAACC TCCCCAAACA TACTCTCAGG ACGACTTCTA GCAACTTCAA GGGCGCGTGA 3 600 

ACCATCTTCC AGCATGTAAT TCATGTGGTA GAAGGCACCT TCTACATCCT TCTCAATCAC 3 660 

CTCAATATCA CAAACCCCTC TCCCACCTGC T~CG?CCCAC TTGATTATAA ACTCACTACG j^CC 

ATAGATCCAG CCTTTCTTGT AAAGGTTCAC AAAGACCTTA CGAACAGCTT TTGACAAACC 3780 

TTCATCAAGA GTGAAACGCT CACGAGAATA GTCTACAGAA AGCCCCATCT TGCCCCATTG 3840 

TTCCTTGATG GTAGTGGCAT ATTCGTCTTT CCATTCCCAG ACCTTCGTCA AGAAAGACTC 3 900 

ACGACCTAGG TCATAACGCG TAATACCCTC ACCACGTAAG CGCTCCTCAA CCTTAGCCTG 3 960 

AGTCCCAATA CCAGCGTGGT CCATACCTGG AAGCCAAAGG GTATCAAACC CTTGCATGCG 4020 

TTTTTGACGG ATGATGATAT CCTGCAAAGT CGTATCCCAA GCGTGACCAA GGTGAAGTTT 4080 

CCCAGTTACG TTTGGTGGTG GAATCACGAT TGAATAAGGC TTAGCCTTTT GATCGCCTGA 4140 

AGGCTTGAAA ACATCCGCAT CAAGCCATTT TTGGTAACGA CCAGCCTCAA CCTCCGCTGG 4 200 

ATTGTATTTA GGTGAAAG7T CTTTAGACAT GTGTGTGTCC TTTCTCTATT TTGTTTATTT 42 60 
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TATTTTGAAT TTGCTTAGCA GCTTCTTCTC CAGACAAATT CGTATTATTT ATTTTAAAGT 4 320 

AGTGGTGCAA CTCATTCGGT TGATGTTGGG AATTTAATTG AAGTGTTTCA GCGGTCTCTA 4 3 80 

AAATTTCTCT TTCAGATACC TCAATATGTC GTTTTAAGCC TTTGTGCTTT AATCGATTCT 444 0 

CCGTTCGATT TCGACGTATG CACTCTTCAA GACTTCTT7C CAATTCAACA AACAGAATCT 4500 

CTTGATGAAA GTTATCCAAT AAATCCTGAA TTTGCTTTAA ATACATCAGC TGGTACTGAT 4560 

TTGAAAAATC AATTACCTCT GTTAAAATTA CTGATCGCTG ATTTCTTGCA CTTGCTCCAA 4620 

GGAAAGAAAA GGTAATTCCA CGAACAAATT CCCACATCTC CTCGGTATAA TCCTCATAGA 4680 

TCTCTAGTGC AAAATCAATG GCTTGATGGT TATAAAATAG GGTAGCATCC GTCAGTCGAG 4 7 40 

ATAATTCTTG ACCAATGGTC ATTTTTCCTG ATGCTGGAGC ACCAATGATG AAAAGATGCA 4800 

TCAAATCACC TCCCACTCAC TCCTCACCAA GCCATATCTC AAATCATCAC AGCAGTTGCC 4 360 

TTGAGCATCT TTCCCGTCTC TTATCCGAGC TTCGAGGGTA AAGCCAAGC? TTTCCGAGAC 4920 

TCGTTGACTT TGAAGGTTAT ATCCAAAGCA AGTTAGTTCA ATCTTGTGAA GACCAAGTTC 4980 

TTTAAAACCT AGATCAATCA AGGAACACGC TGCTTCTGGA ACATAACCTC CACCCCAATA 5040 

GTCTCCCTCC AAGGTATAGC CAAGCTCTAG CACATCATCC GCATGAAGAT GGTTGAAGTC 5100 

AACAGAACCA ATGACTTTAT CGGTTCCTTT GACGACAATC CCATAGCCAG CTGCGAGATT 5160 

TTCCTTTTGA GTACGCTCCG GAAGAATGTG CTCCAGATAA TAAATCTCAT CTTCCAAGAT 5220 

CTTGACTGGA GGAAAACCTG CTGGATAGGC GACCTCTGGC AAACTAGCGT AGGTATGGAT 5280 

ATCCTCAGCA TCCACCACTG TGCGGACTCG TAAAACGAGA CCTTCTGTTT CGATTTTATC 53 40 

TGGCAGCTCA GTTCTTCCCA TCCTTCTTCC TCGCTTTTTT GATGAAACTG CCCTTCATAT 5400 

CTACACGCTT GTCCAGATAG CGATAAACGC GCTGATATCC ATCTCCCATG AAATAGGTTG 54 60 

GGGCAAACAG TTGATTTTTA AAATGTCCCT TTTCATCCAG GAGTTCTGGG GCAACAAGTC 5520 

GCTCAAGAAT CTTGGCAAAG ATGTGGC AAA TACCGTCTTC CTCAACAATC CTATCTACCC 5580 

CACAATCTAA AACAAGTGGA CAGGCGTCTA AAATAGGAGT CTGAGTTCGT TCAGAAAT IT 5640 

CATAATGCAC TCCCAAACGT TCCAATTTCT CCTGATGACT GATAAAACCA GCCTCCTCCA 5700 

TCGCAAGCAT AGAAGTTTCA TCAGAAAT AT TCACACTAAA TTTTTGATAC TGTTTGATCT 57 60 

GCTCTGCGGC ATTCTCTCTC GCAACGACTC CAATCACAAC CCAATCTCCT AGACTATAAG 58 20 

AGGAACTACA GGTCGTGATG TTATAGCCAA AATTCTAATC TTGATATCCT AAAATAAAAA 5880 

CAGGAAAACC ATAATATAGT 7TACTTCTGT TAAAAGATTG CTTCATAACA ACCCCCTTTG 5940 

ACTAAGACGT AAAAGAAAAG CCCTGCCATC TACATGACAG GGACGAATCT GTTTATCCGC 6000 

GGGG 6004 
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(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5857 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
{ D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
TGTAGAATTC ACGACAATGC TTCGTTGATT TCTGCGTTCA TTTCGTCGCG TTCTGGCAAG 60 

CGAGTCAATG AACCAAAAAT AGTACACAAT GTGGTATAAT CCTTTTATGG CATATTCAAT 120 

AGATTTTCGT AAAAAACTTC TCTCTTATTG TGAGCGAACA GGTAGTATAA CAGAAGCATC 180 

ACACCTTTTC CAAATCTCAC GTAATACCAT TTATGGCTGG TTAAAGCTAA AAGAGAAAAC 240 

AGGAGAGCTA AACCACCAAG TAAAAGGAAC AAAACCAAGA AAAGTTGATA GAGATAGACT 300 

TAAAAACTAT CTTACTGACA ATCCAGATGC TTATTTGACT GAAATAGCTT CTGACTTTGC 3 60 

CTGTCATCCA ACTACCATCC ACTATGCGCT CAAAGCTATG GGCTACACTC GAAAAAAGAA 420 

CCACACCTAC TATGAACAAG ACCCAGAAAA AGTAGCCTTA TTTCTTAAGA ATTTTAATAG 4 80 

TTTAAAGCAC CTAACACCTG TTTAGATTGA CGAAACAGGA TTCGATACTT ATTTTTATCG 540 

AGAATATGGT CGCTCATTAA AAGGTCAGTT AATAAGAGGC AAAGTATCTG GAAGAAGATA 600 

TCAGAGGATT TCTTTGGTTG CAGGTCTAAC AAATGGTGAG TTAATCGCTC CAATGACTTA 660 

CGAACAGACG ATCACGACCG ACTTTTTTCA ACCTTCGTTT CAGAAGTTTC TCTTACCAAC 72 0 

ATTAACCACA CCATCGGTTA TTATTATGGA TAATGCAAGA TTCCATAGAA TGGGGAAGCT 780 

AGAACTCTTG TGTGAAGAGT TTGGGTATAA ACTTTTACCT CTTCCTCCCT ACTCACCTGA 340 

GTACAATCCT ATTGAGAAAA CATGGCCTCA TATCAAAAAC CACCTCAAAA AGGTATTACC 9 CO 

AAGTTGCAAT ACCTTTTATG AGGCTTTTTT GTCTTGTTCT TGTTTCAATT GACTATATAA 960 

ATTGTCTAAG CGAAACAACC GATAAGAATT GGCACAAAAG CGACCGTATT TTTCTTACCA 1020 

ATACAGGAAA AACAGTTCAT AGTTCTATCT TGACCAAGTC TCTCCACCGA GCAAACGAAC 108C 

GCCTTAAAAA ACCAATTCCC AAACATCTGT CCCCTCACAT CTTCAGACAC ACCACTATTA 1140 

GCATCTTATC AGAAAATAAA ATTCCTTTAA AAACAATCAC GGACAGGGTT GGTCATCCCG 1200 

ACTCTGAAGT CACTACTTCC ATCTACACCC ACGTCACAAA GAACATGAAA GATGAAGCAA 12 60 

TCAATGTACT GGATAAAGTT ATGAAAAAGA TTTTTTAAAA AGTTTTGTCC CTTTTTTGCC 13 20 

CTCTAAATAC AAAAATAGCC C7TCGGATAA AATCCGAGGG GCTAGAAACG TTGTTAAATC 1380 
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AACGGCCGAA CTTTTCAATT TCATCGTTCG GCATAAAATA GTTCACTGAA CTATTTTATT 1440 

TTTTAAGGTT ATCATAATAT CAAATAGTTC AATTAAATAC GCTAAATTAC TAATATACTT 1500 

TTTACCTTTT TCATTCTAAA ATGTAAAGTA CAAACAATTA CAATATACTA GAGGGGGAGT 1560 

AAAAAAGGTA TTAAATCCAT GAGTTCAGCA GGCAAGAAAA TAGCACCTTT ACGGGTGCTA 1620 

TTTTTTAATT AACGCCACGT TAACTTTTGA TTGATGAATT TTATTGTTTG GCACTTCTTT 1680 

CATTTCACGG TAAACATCGA TGAAATTCTT TCCAACATTA TTTTTGGAGT TAACTGCATT 1740 

TATTTTTGTA TTAATAACTT TTTTAGTATC GAAAGAATGG TTTAAGAAAT CCATAACTAA 1800 

CTCTCCTTTC TCATCCTGTA ATCAAGATTT TTATCAATGT CAAAATAGTA TTTTCTATCA 1860 

ATCCAAATTG CTCCTTCTCC TTTAGAAATA GCAAGTACAT CTACCGGACC TCCTAC7GTT 192 0 

TCAAGAGTGT TGACAATTTT TCTCTTAAAT GAACTTAATT CAATAAATGT TTTACCTGTA 1980 

CTCGCCATTT CATTAAGTGG TTCCATTCCA ATAAGGTCTA TTATAGGATT TATATAATAT 2040 

TTTTGCTGTA TAGATGATAT ATTTTCAAAT ATATTCTCAA TTTCATCACC CAATCCATTT 2100 

TTCTCCATAA CTGATGATAC TTCCTCTGCG AT AT AT AC AT TTAAGTTAGG ATCTATACCA 2160 

TTCATAATCG TCTCAACCAT CTCTGACTGT GCAAAAGGGA TTATATGACA ACTTTTATGA 2220 

TGATTTATCA CACTTTCATT AATAACTTTC CAAATTAATC GTTTACAAAA AATTCCATAT 2280 

AATTCAATTT GTCTTATAGA TGGAAATATC TCGTCTGTAC CATAACCTGC TATAACTAAT 2 340 

CCAGTTATGT TTGTTGAGTC ATATCCAATG AAAATCGCTT TATATAAACA TTTAGCAATA 2400 

ACTTCAACCT CATCATCAGT ATGAGGAAAG CATTTAAAAA CATCGTCTAC AATGCTTTTT 24 60 

ATTAACTCTA ACTCAGCTTC AAAAAATTCA AAATTACTTT CAGCTTCTAC TTTTGAAATT 2 520 

TCTAAACTAA AATTAGTTAT AGCATTTAAT AAAATTTTAT TAAAATCATC TAGAGTGATG 2 580 

GTTTCACCAT TAGAAACTCT TAAATCAGCT GTTTCTTGCG CTTCATAGCC AATGCTGTCC 2 640 

AAAATACTTC TTGTACTTCT GACAATATAA TTTCTTAATA AATCCTCAAC TTGTAGATGT 2700 

TTAAAGGAAA TTAAAAATTC TATTAGCTTT TCAACGTATT GGGCAGTATT ATCTAATAAA 2 7 60 

TCTGTGCCAA TAGCCTGCTT AAACTCATTT AAAATTACCT CCCACGGAAT TTCCATAAAC 2 820 

GAAGCGTTCC CATATATCAT GATCCCCACG GAATGTTCTT TTGATAAAGT GAATAATTTT 2880 

CGGGCGCTAT TAAAAACTTT TGAATTTTTC CCGTCTGATA AGGTTACAGC GCTATCAGAA 2940 

GCCAATACAA CACCATTTTT ATTTAATATT CCAATTTCTG CTGTCAAAAT ATCACCTAAA 3 000 

CTTTCTAAAC CTGCTCATGC TCTAATCCTA CAACAGCTAA GGTCTTACCA AGAC7TGCCA 3060 

ACACTTTTAA TACTGTATCA AGTTGTGGGC TTCTCTTTCC TGTTTCCATT CTAGCGATAA 312C 

CTGGCTGACT AACACCGCTC ATCTCCTCTA GTTTCTTCTG ACTAATACCC TTTTCATTTC 3180 
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TAGCCTCCAT AAGCTCACTC A7GATACCCA CGCGCATATC AC77TCCAAA ATTTCCTCTT 3240 

7GC7GAA7AA TTCACC7C7T ACATCTTTCC AGTTACTACC AA7AGCATTA TTTTTCATTG 3300 

TCTAAACCTC T7TCTTTTAA ATCTGCAAGT TCACCTTTAG CTTGCTCAAT CTC7CTTTTG 3 3 60 

GCTGTTTTCT G7G7CC7TTT CATAAAATGA 7GCAG7AAAA CAAAACTACC ATCCATCCAA 3 420 

GCAACAAATA AAA7TCTATC TCTAAGTGGT C7CAGCTCCC AAATTTCAGC ATCTAAATGC 3 480 

7TAA7ATATG GT7CGCC7CC GCGTGTTCCA TG7TGGCTTA ACAACTCAAT ATAATCATTA 3S40 

ATTTTA77AA GCTTAATTCT GCTATCTTTC CCTTTTTTAC TGGTAAGCTC TCGCATATAA 3 600 

TCAAAAACAG GCTCATTGCC GTTTTTATCC TTGTAAAAAT AGATAT7ATG CACTATTAAC 3 6 60 

ACCTCTTCCT MTAACPJKTT ATAACCTAAA AG77A7TGTT TGTAAATACT TTTAACTTA7 3^20 

TAAAATAAAA AGCACCTAGT TTCCTAGATG CTAGCACAAT GACACGGATT CGCACCGTGC 3780 

CTACCTCTAT CAAGGTGTAC TCCTTCTATA CTATCCCTTG TGCTTTAGAA TATTATACCA 3 840 

CACAATCAAC TAGATACCTA CCA7C7CA7G ATATACCCCC ATTTTGGGCA AGGGTACAAC 3 900 

GCTAAAATAC AAATCAGAAT AGATATTAAA CCACTTATTT AACTTATCAT AAGCTGGTGA 3 960 

TTGACTGATA AATAATATCC GCTGACAAGC TCCGATAACA TTCATGTGAT TGTACACATA 4020 

AACCTCTTTT ACAGCCTCTA AAATGTCAGC CTCACTTGTT TGTACCCTAA TATCTGTTAT 4080 

CTGCTTGATA GTTGCGTATT TTTGATAAGC TAGCATATCT TGATTTTTAG CAGCATCAAA 4140 

CATTTTACGC TCAAGGACAC TATACTTAGG TTGTTCTTTA TCTCGCATGA AATACCACTT 4 200 

GAGCCATAAA ATC7TTTC7C GGTGTATTAC AGAAATACGC TCAATTTTCT 7CTTTG7CA7 4 2 60 

7GC7ACCTCC TAAATCATCA ATTTAACAAT TCTAACCACT CACTTTTAGA AATAGTTCCA 4 320 

7AGA7C77CT TCGATGTATG ATACAAAGGT TCTAAATCTT TTTCCACCCT AATATAGTTC 4 380 

ATCTTATCCT CATGAGTAGG AAAG7ATAG7 ATTTCCCTTT CATCCTCGTT TAGCATACGA 4440 

77CCACCAA7 CATCAATAAT AACTGGCACT 7CCCAC7CAC CCCATTTTTT AAGGTTTTCT 4 500 

AAAAGTTCAT T AT C ACT AAA TAGCTCGCCA TCTATTTGGA AAAATTCCCC TAAG7CA7TG 4 560 

T77CCT7CAA CAA7AA7AAA CTCTGGCATA 7T7CTAT7AC TTAATAACTC CTTGAG77CT 4 620 

TGTAAC7CT7 TGA7TTCCTT TAGATACTTC CTCAAT77CC AACCTCAATT C77CAA7CTG 4 680 

CCTTACTACT CCAAAAAT7T CATGGGTCT7 ATAAGATTG7 TCAAG7ATAG CCTT7GCTGC 4 7 40 

TTGAG7TC7T ATAAACGGGT TGACCTTAC7 G7CCATCATA ATATCATTGA G7ACAGAAAC 4 800 

AGCGTTAGAT GATGC7AAAT AAAGCATT7G AG7TG77TTA 7CCA7CATCT CATCTTGCTT 4 360 

7ATCC7CAAT GTC7T7TTAA CCGC7GCAAC 77TTAGA7AC 77ATGACC7G 7TGCGCG7GA 4 920 
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TACCCCTGCT TTTTGACATG CTTTGTCTAT CGTTGCCTCG 

TTTAATTTGC TTGGACCTAA GGTTATCATT TTCATTTCCT 

TATCAAAA7A AAGGGTTGCC CCTTTATTTC CCTATGCTAG 

CCATTGCCTC TGAATTGCCC TCAACAATCA T7TCATGCTC 

CGTTAATAAG TAAACCACCG TGGAAATAAT CAATTTTTCT 

TTTCAAGGCG TTGCTGTTGG CTGAATTCCT CCATGTCAAT 

TATCATTATC CATAATATCT TCTAATTTTC TAAGAGCTAG 

CTAGGTATTC TCTCATTTCT GCCACTGTTA ATTTGATACT 

CTGCATCATC TGCTGTAATA CCCTCTTCTT TTGATTCATG 

TCTCTTTTTC TAGTTGCTGA TACAATAGCT GAGCAGTATT 

CTTTTTTATA TTTTAAAAGT TCTTGCTCTG CATACACTTT 

AAACTAATTC CCCATCTTGA GCTTTTAGCT TAATACTCCC 

ACTTAATTAT ACCATTTTTT GAGTATAAAA CAAAGCCTTT 

TATCATCCTT GTTTTCAGTC ATGCTTTTCT CCTTTATTTC 

CCCCTACTCT ATTTATTTCA CTAGGTTTTT AGGCTTCGTA 

TGTGTACCTT ATGGCTGACT TTTCAAATTG GTTAGTT 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10254 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



GTAAGCATGG 
GCCATCTATT 
ATAATTCTGC 
TACTAAATCA 
ATCAAGGAAA 
TTCGATATAA 
AGGTTTATTT 
AGATAATAAA 
CTTTGCTAGT 
TTGGGAATAG 
CCCGATAATC 
ATGCTCTGGA 
CTCCATCATT 
ATTTTATTAT 
TGCTAAAATA 



CATCTATGAA 
ACCTCCTCAT 
AATTCTGCAT 
ATCTTATCTC 
TCTACTAGCT 
GCAAGGGTAG 
TTATATTTTT 
CTTAGTTCAG 
TCAGCATTTT 
TTTTCGCCCT 
ACTTCCTTAT 
ATTTCAATAT 
TTTAATAATT 
AATCTGAATA 
CTACCCTTTT 



4980 
5040 
5100 
5160 
522C 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5857 



(xi> SEQUENCE DESCRIPTION : SEQ ID NO: 29: 
AAAATGATAG CAGGAGAGTT TTCCCGTCCA TCACACCCAC AACTGAGAGC 
GCTTCTCGCC AAAAACAGGC CGCCTTTAAC AAGGAAGAGA ACCCCTTGAA 
ATCATCAAGA CTTGGTTTGC CTCAACCGGG AAAAATCTTT ACATCAACAC 
GTGGACTACG GTGTCAACAT CCATCTAGGG GAAAATTTTT ATTCTAATTG 
ATGCTGGATA TCTGTCCCAT TCGTATCGGG GACAATGCTA TGATTGGTCC 
TTTTTGACAC CCCTCCATCC ACTAGATCCA CAGGAACGCA ATTCAGG7AT 
AAGCCTATCA CAATCGGAGA TAATTTCTGG ACTGGTGGTG GCGTCATTGT 
GTGACACTGG GAAATAATGT CCTTCCACCA GCAGGGGCAG TAATTACCAA 



CTTAGCTCAG 
GGGAGCCGAA 
TCGCTTGATG 
GAACTTGACC 
TAATTGTCAG 
CGAGTACGGA 
CCTTCCTGGA 
ATCTTTTGGC 



60 
L20 
130 
240 
3G0 
360 
420 
480 
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GACAACGTTG TCCTAGCTGG CAATCCTGCG CGCGTGATTA AGGAAATACC TGTTAAATAG 540 

AAGTAAAAAG GAACAGCTCC GGTTGTTTCT TTTTTGTAGG TTT CATC ATT TTTTACCCAG 600 

TTCACATTTA CCTACTCTAT CTCTTAGCAA GTCTGTTTCA TTAAGCAAGT TCAAAGCATC 660 

TCGTAAGTGG GATGTTTTTC TCCTCAGTTC ATCAGCTTCC TCCTTGACAC TCGGTCAGAT 720 

TTTGATACAA TACTACAAAA TTAGAGGAGG CAGGCTATGA TTCAGAAACA TGCGATTCCT 780 

ATTTTAGAGT TTGATGACAA TCCTCAGGCG GTTATCATGC CCAATCACGA GGGGCTGGAC 840 

TTGCAGTTGC CAAAGAAGTG TGTTTATGCA TTTTTAGGTG AGGAGATTCA CCGCTATGCG 900 

AGGGAAGTAG GGGCGAACTG TCTTGGCGAA TTTGTTTCTG CCACCAAGAC CTATCCAGTT 9 60 

TATGTCGTGA ACTACAAGCA CGAGGAGGTC TGTCTGGCTC AGGCTCCTGT TGGCTCCGCT 1020 

CCAGCAGCCC AGTTTATGGA TTGGTTGATT GGCTATGGTG TGGAGCAGAT TATCTCTACT 1080 

GGGACCTGTG GTGTCCTAGC TGATATAGAG GAAAATGCCT TTCTAGTCCC TGTTCGCGCT 1140 

CTGCGAGATG AAGGAGCCAG TTACCACTAT GTGGCACCTT GTCGTTATAT GGAAATGCAG 1200 

CCAGAGGCTA TTGCTGCTAT TGAGGAACTT TTGGAAGACA GAGGGATTCC TTATGAAGAA 12 60 

GTCATGACCT GGACGACAGA CGGTTTTTAC CGAGAAACGG CTGAAAAGGT CCCTTATCCT 13 20 

AAGGAAGAAG GCTGTGCTGT TGTCGAGATG GAGTGTTCTG CTCTTGCGGC AGTAGCTCAA 13 80 

TTGCGTGGGG TTCTCTGGGG TGAATTCTTG TTCACAGCAG ATTCTCTACC GGACTTGGAC 1440 

CAGTACGACA GTCGTGACTG GGGCTCGGAA GCTTTTAATA ACCCGCTAGA ACTGAGTTTA 1500 

CCAAGTCrrC ACCACCTTTA CTTGTACTCG CAAAGGATTT CTTTTATCAT AAAATGTCTA 1560 

GCTCATACTT TTCAAAAATA TGTTTAAACG AGGTCACCTT CCTCTTGTCC TAGGCATGTT 1620 

GAGGTTGGGA AAAATCTTTA AAATCAGAAA AACGTATCAT ATCAGGTCAT GAAAACTTTG 1680 

ACACTATGCG TTTTATGTCG ATAAGATTTA GAGTGAGATG AAATGATACT CTTCGAAAAT 1740 

CTCTTCAAAC CAGGTCACCT TCACCTTGCC GTAGGTATAT GTTACTGACT TCGTCAGTCT 1800 

TATCCGGCAA CCTCAAAACG GTGTTTTGAG CTGACTTCGT CAGTTCTATT TGCAACCTCA 1860 

AAACAGTGTT TTGAGCAACC TGTGACTAGC TTTCTAATCG ATGCCTTGGT TTTCATTGCC 1920 

TATAATCAAA AAGAGAAATT TTCTCCTGAA AAGCATATAG ACTAGCTGGC GTTAAAAGCT 1980 

CCTGTCTTGC TTTTTTGACC TATAGTCACA TCTATCAAGT ATTGTTCTTG CCTAAGCTAT 2040 

CAATAAAAAG GTGGCATTTT TTAGGCTTGG TGTTAGTAGA TTTTGCCTTA TCCTATCTAA 2100 

GTCATTTCGA ACTTTTTATG GTACAATGGA AACATGTTAT TCAAATTATC TAAGGAAAAA 2160 

ATAGAGCTAG GCTTATCTCG TTTATCGCCA GCCCGTCGTA TTTTTTTGAG TTTTGCCTTG 2 2 20 
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GTCATTTTAC TAGCCTCTCT TCTTTTGAGC TTGCCCTTTG TCCAAGTTGA AAGCTCACGA 22 80 

GCCACTTATT TTGATCATCT TTTCACTGCT GTCTCTGCAG TCTGTGTGAC GGGTCTCTCA 2 340 

ACCCTTCCAG TAGCTCACAC CTATAATATC TGGGGTCAAA TAATCTCTTT GCTCTTGATT 2400 

CAGATCGGTG GTCTAGGGCT CATGACCTTT ATTGGGGTTT TCTATATCCA GAGCAAGCAA 2460 

AAGCTTACTC TTCGTAGCCG TGCAACTATT CAGGATAGTT TTAGTTATGG AGAAACTCGA 2 520 

TCTTTGAGAA AGTTTGTCTA TTCTATTTTT CTCACGACCT TTTTGGTTGA GAGCTTGGGA 2 580 

GCTATTTTGC TTAGTTTTCG CCTTATTCCT CAACTTGGCT GGGGACGTGG TCTTTTTAGT 2 64 0 

TCCATTTTTC TACCGATCTC AGCCTTCTGT AATGCCGGTT TTGATAATTT AGGGAGCACC 2700 

ACTTTATTTC CTTTTCAGAC CGATTTACTG GTCAATCTGG TGATTGCAGG CTTGATTATT 2 7 60 

ACAGGCGGCC TTGGTTTTAT GGTCTGGTTT GATTTGGCTG GTCATGTAGG AAGAAAGAAA 2820 

AAAGCACGTC TCCACTTTCA TACGAAGCTT GT ACT ATT AT TGACTATACG TTTGTTGTTA 2880 

TTTGGAACAG CAACTACTCT CTTTCTTGAG TGGAACAATG CTGGAACGAT TGGCAATCTC 2940 

CCTGTTGCCG ATAAGCTTTT AGTTAGCTTT TTTCAAACAG TGACGATGCG AACAGCTGGC 3000 

TTTTCTACGA TAG ATT AT AC TCAGGCTCAT CCTGTGACTC TTTTGATTTA TATCTTACAG 3060 

ATGTTTCTAG GTGGGGCACC TGGAGCAACA CCTCGCGGAC TCAAGATTAC GACATTTTTT 3120 

GTCCTCTTGG TCTTTGCACG AAGTGAGCTT CTAGGCTTGC CTCATGCCAA TGTTGCGAGA 3180 

CGAACGATCG CGCCGCGAAC GGTTCAAAAA TCCTTTAGTC TCTTTATTAT CTTTTTGATG 3240 

AGCTTCTTGA TAGGATTGAT TCTCCTACGG ATAACAGCCA AAGGCAATCC TCCCTTTATC 3 300 

CACCTCG7AT TTGAAACCAT TTCAGC7CTT AGTACAGTTG GTGTAACGGC AAATCTGACT 3 3 60 

CCTGACCTTG GGAAATTGGC TCTCAGTGTT ATCATGCCAC TTATCTTTAT GGGACGAATT 3 4 20 

GGTCCCTTGA CCTTGTTTGT TAGCTTGGCA GATTACCATC CAGAAAAGAA AGATATGATT 3 4 80 

CACTATATGA AAGCAGATAT TAGTATTGGT TAAGAAAGGA AAGAGCATGT CAGATCGTAC 3 540 

GATTGGAATT TTGGGCTTGG GAA T TTT T GG GAGCAGTGTC CTAGCTGCCC TAGCCAAGCA 3 600 

GGATATGAAT ATTATCGCTA TTGATGACCA CGCAGAGCGC ATCAATCAGT TTGAGCCAGT 3 660 

TTTCGCGCGT GGAGTGATTG GTGACATCAC AGATGAAGAA TTATTGAGAT CAGCAGGGAT 3 720 

TGATACCTGC GATACCGTTG TAGTCGCGAC AGGTGAAAAT CTGGAGTCGA GTCTGCTTGC 3780 

GGTTATGCAC TGTAAGAGTT TGGGGGTACC GACTGTTATT GCTAACCTCA AAAGTCAGAC 3 84 0 

CGCTAAGAAA GTGCTAGAAA AGATTCCAGC TGACTCGCTT ATCTCGCCAG AGTATGAAAT 3900 

GGGGCAGTCT CTAGCACAGA CCATTCTTTT CCATAATAGT GTTGATGTCT TTCAGTTGGA 3960 

TAAAAATGTG TCTATCGTGG AGATGAAAAT TCCTCAGTCT TCCCCAGC7C AAAGTCTGAG 4020 
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TAAATTAGAC CTCCGTGGCA AATACAATCT GAATATTTTG GGTTTCCGAG AGCAGGAAAA 4080 

TTCCCCATTG GATGTTGAAT TTGGACCAGA TGACCTCTTG AAAGCACATA CCTATATTTT 4140 

GGCAGTCATC AACAACCAGT ATTTGGATAC CCTAGTAGCA TTGAATTCGT AAAGAGGGAT 4 2 00 

GACCCCTCTT TTTTGATGCC TAAGATGCCA AATAGAGACA GAAGCCCCTT CTCTTCTAGT 4 2 60 

AAAAGTTCTT CAAAGGCTGG ACTTTATGGT AAAATAGAAA GAAGTGACAA GAGAGAGTAA 4 320 

TACTCAATGA AAATCAAAGA TCAAACTACC AAACTAGCTA CGGGCTCCTC AAAACACTGT 4 3 80 

TTTGAGCTTG CAGATAGAAC TGACGAAGTC AGTAACATCT ATACGGCAAG CCGACGTTGA 44 40 

CGCGGTTTGA AGAGATTTTC GAAGACTATA AGAAAAAATC AGTCCCCTAA AGGAGTAGAT 4 500 

TATGAACTTA TTGTCTATCG CAATTTCTAG CTATAATGCA GCAGCCTATC TTCATTACTC 4 560 

TGTGGAGTCG CTAGTGATTG GTGGTGAGCA AGTTGGGATT TTGATTATCA ATCACGGGTC 4 620 

TCAGGATCAG ACTCAGGAAA TCGCTGAGTG TTTAGCTACC AAGTATCCTA ATATCGTTAG 4680 

ACCCATCTAT CAGGAAAATA AATGCCATGG CGGTGCGGTC AATCGTGGCT TCGTAGAGGC 4 7 40 

TTCTGGGCGC TATTTTAAAG TAGTTGACAG TGATGACTGG GTGGATCCTC GTGCC7ACTT 4800 

GAAAATTCTT GAAACCTTGC AGGAACTTGA GAGCAAAGGT CAAGAGGTGG ATCTCTTTCT 4 860 

GACCAATTTT GTCTATGAAA AGGAAGGGCA GTCTCGTAAG AAGAGTATGA GTTACGATTC 4 9 20 

AGTCTTGCCT CTTCGGCAGA TTTTTGGCTG GGACCAGGTC GGAAATTTCT CCAAAGGCCA 4 980 

GTATACCATG ATGCACTCGC TGATTTATCG GACAGATTTG TTGCGTGCTA GCCAGTTCTA 504 0 

ACTGCCTGAA CATACTTTTT ATGTCGATAA TCTCTTTGTC TTTACGCCCC 7TCACCACCT 5100 

CAAGACCATG TACTATCTGC CTGTCGATTT CTATCGTTAT TTGATTGGGC GTGAGGACCA 5160 

GTCTGTCAAT GAGCAAGTGA TGATTAAGTG CATTGACCAG CAACTCAAGG TCAATCGAC7 5 220 

CTTGATAGAC CAACTTCATT TCTCCCAAGT GAGTCATCCC AAAATCCGAG AATATCTCCT 5 2 80 

GAATCATATT GAACTCACGA CGGTGATTTC CAGTACCCTG CTCAACCCAT CTGGAACAGC 5 3 40 

GGAGCATCTG GCAAAAAAAC GCCAATTGTG GACCTATATT CAGCAGAAAA ATCCAGAACT 54 CO 

CTTTCAGGCT ATTCGTAAGA CCATGTTGAG CCGTTTGACC AAACATTCTG TCTTGCCACA 54 60 

TCCCAAACTG TCCAATGTCG TC7ATCAAAT CACCAAATCT CTTTATGCAT TTAATTAATA 5 5 20 

TAAGTGTTTT ATAAGAGGGA TTTAAGAAAA ATTTTAACTT TTTCTTAGTC CTTTTTAATT 5 580 

TCAGGAGATT AT ACT AG AG T CATCAAATAA AGAAAGACTC TAAGGAGAAT CCTATGAAAT 5640 

TCAATCCAAA TCAAAGATAT ACTCGTTGGT CTATTCGCCG TCTCAGTGTC GGTGTTGCCT 5700 

CAGTTGTTGT GGCTAGTCGC TTCTTTCTCC TAGTTGGTCA GCCAAGTTCT GTACGTGCCG 5 7 60 
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ATGGGCTCAA TCCAACCCCA GCTCAAGTCT TACCTGAAGA GACATCCGGA ACGAAAGAGG 5820 

GTGACTTATC AGAAAAACCA GGAGACACCG TTCTCACTCA AGCGAAACCT GAGGGCGTTA 5880 

CTGGAAATAC GAA7TCACTT CCGACACCTA CAGAAAGAAC TGAAGTGAGC GAGGAAACAA 5940 

GCCCTTCTAC TCTGGATACA CTTTTTGAAA AAGATGAAGA AGCTCAAAAA AATCCAGACC 6000 

TAACAGATGT CTTAAAAGAA ACT GT AG AT A CAGCTGATGT GGATGGGACA CAAGCAAGTC 6060 

CAGCAGAAAC TACTCCTGAA CAAGTAAAAG GTGGAGTGAA AGAAAATACA AAAGACAGCA 6120 

TCGATGTTCC TGCTGCTTAT CTTGAAAAAG CTGAAGGGAA AGGTCCTTTC ACTCCCGCTC 6180 

TAAACCAAGT AATTCCTTAT GAACTATTCG CTGGTGATGG TATGTTAACT CGTCTATTAC 624 0 

TAAAAGCTTC GGATAATGCT CCTTGGTCTG ACAATGGTAC TGCTAAAAAT CCTGCTTTAC 6 300 

CTCCTCTTGA AGGATTAACA AAAGGGAAAT ACTTCTATGA AGTAGACTTA AATGGCAATA 63 60 

CTCTTGGTAA ACAAGGTCAA GCTTTAATTC ATCAACTTCG CGCTAATGGT ACTCAAACTT 64 20 

ATAAAGCTAC TGTTAAAGTT TACCGAAATA AAGACGGTAA AGCTGACTTG ACTAATCTAC 6480 

TTGCTACTAA AAATGTAGAC ATCAACATCA ATGGATTAGT TGCTAAAGAA ACAGTTCAAA 6540 

AAGCCGTTGC AGACAACCTT AAAGACAGTA TCGATCTTCC AGCAGCCTAC CTACAAAAAC 6600 

CCAAGGGTGA AGGTCCATTC ACAGCAGGTG TCAACCATGT GATTCCATAC GAACTCTTCG 6660 

CAGGTGATGG CATGTTGACT CGTCTCTTGC TCAAGGCATC TGACAAGGCA CCATCGTCAG 67 20 

ATAACGGCGA CGCTAAAAAC CCAGCCCTAT CTCCACTAGG CGAAAACGTG AAGACCAAAG 67 80 

GTCAATACTT CTATCAAGTA GCCTTGGACG GAAATGTAGC TGGCAAAGAA AAAC AAGCGC 6840 

TCATTGACCA GTTCCGAGCA AAyGGTACTC AAACTTACAG CCCTACAGTC AATGTCTATG 6900 

GTAACAAAGA CGGTAAACCA GACTTGGACA ACATCGTAGC AACTAAAAAA GTCACTATTA 69 60 

ACATAAACGG TTTAATTTC7 AAAGAAACAG TTCAAAAAGC CGTTGCAGAC AACGTTAAAG 7 020 

ACAGTATCGA TGTTCCAGCA GCCTACCTAG AAAAAGCCAA GGGTGAAGGT CCATTCACAG 7080 

CAGGTGTCAA CCATGTGATT CCATACGAAC TCTTCGCAGC TCATCGTA7C TTGACTCGTC 7140 

TCTTGCTCAA GGCATCTGAC AAGGCACCAT GGTCAGATAA CGGTGACGCT AAAAACCCAG 72 CO 

CCCTATCTCC ACTAGGTGAA AACGTGAAGA CCAAAGGTCA ATACTTCTAT CAATTAGCCT 7 2 60 

TGGACGGAAA TGTAGCTCGC AAAGAAAAAC AAGCGCTCAT TGACCACTTC CGAGCAAACG 7320 

G7ACTCAAAC TTACAGCGCT ACAGTCAATG TCTATGGTAA CAAAGACGGT AAACCAGACT 7 3 80 

TGGACAACAT CGTAGCAACT AAAAAAGTCA CTATTAACAT AAACGG7TTA ATTTCTAAAG 74 4 0 

AAACAGTTCA AAAAGCCGTT GCAGACAACG TTAAGGACAG TATCGATGTT CCAGCAGCCT 7 500 

ACCTAGAAAA GGCCAACGGT GAAGGTCCAT 7CACAGCAGG TGTCAACCAT GTGATTCCAT 7 56C 
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ACGAACTCTT CGCAGGTGAT GGCATGTTCA CTCGTCTCTT GCTCAAGGCA TCTGACAAGG 7 620 

CACCATGGTC AGATAACGGC GACGCTAAAA ACCCAGCTCT ATCTCCACTA GGTGAAAACG 7 680 

TGAAGACCAA AGGTCAATAC TTCTATCAAG TAGCCTTGGA CGGAAATGTA GCTGGCAAAG 7740 

AAAAACAAGC GCTCATTGAC CAGTTCCGAG CAAACGGTAC TCAAACTTAC AGCGCTACAG 7900 

TCAATGTCTA TGGTAACAAA GACGGTAAAC CAGACTTCCA CAACATCGTA GCAACTAAAA 7860 

AAGTCACTAT TAAGATAAAT CTTAAAGAAA CATCAGACAC AGCAAATGGT TCATTATCAC 7920 

CTTCTAACTC TGGTTCTGGC GTGACTCCGA TGAATCACAA TCATGCTACA GGTACTACAG 7980 

ATAGCATGCC TGCTGACACC ATGACAAGTT CTACCAACAC GATGGCAGGT GAAAACATGG 8040 

CTCCTTCTGC 7AACAAGATG TCTGATACGA TGATGTCAGA GGATAAAGCT ATGCTACCAA 8100 

ATACTGCTCA GACTCAAACA TCAATGGCAA GTATTGGTTT CCTTGCGCTT GCGCTTGCAG 8160 

GTTTACTCGG TGGTCTAGGT TTGAAAAACA AAAAAGAAGA AAACTAATCA GCTAAGGAAA 8220 

TAAATGATCG ATAGTGGGCT GACTAAGATT AGTTTAACAA CTCAATCAGC AATCAGGACT 82 90 

TTCTTTCAAT AGCAGATTAA AATCATCGTA AAACAATAAA AATAGTGTTA TACTTAAAGC 8340 

AGTATAGCAC TCTTTTTATC AAAGGAGAGA CAGATGGGAA AGACAATTTT ACTCGTTGAC 84 00 

GACGAGGTAG AAATCACAGA TATTCATCAG AGATACTTAA TTCAGGCAGG TTATCAGGTC 84 60 

TTGGTAGCCC ATGATGGACT GGAAGCGCTA GAGCTCTTCA AGAAAAAACC GATTGATTTG 8 520 

ATTATCACAG ATGTCATGAT GCCTCGGATG GATGGTTATG ATTTAATCAG TGACGTTCAA 8 580 

TACTTATCAC CAGAGCAGCC TTTCCTATTT ATTACTGCTA AGACCAGTGA ACAGGACAAG 864 0 

ATTTACGGCC TGAGCTTGGG AGCAGATGAT TTTATTGCTA AGCCTTTTAG CCCACGTGAG ,8700 

CTGGTTTTGC GTGTCCACAA TATTTTGCGC CCCCTTCATC GTGGGGGCGA AACAGAGCTG 8760 

ATTTCCCTTG GCAATCTAAA AATGAATCAT AGTAGTCATG AAGTTCAAAT AGGAGAAGAA 8820 

ATGCTGGA7T TAACTGTTAA ATCATTTGAA TTGCTGTGGA TTTTAGCTAG TAATCCAGAG 8880 

CCAGTTTTCT CCAAGACAGA CCTCTATGAA AACA7CTGCA AAGAACAC7A CGTGGATGAC 8940 

ACCAATACCT TGAATGTGCA TATCCATGCT CTTCGACAGG AGCTGGCAAA ATATAGTAGT 9000 

GACCAAACTC CCACTATTAA GACAGTTTGG GGGTTGGGAT ATAAGATAGA GAAACCGAGA 9060 

GGACAAACAT GAAACTAAAA AGTTATATTT TGGTTGGATA TATTATTTCA ACCCTCTTAA 912 0 

CCATTTTGGT TGTTTTTTGG GCTGTTCAAA AAATGCTGAT TGCGAAAGGC GAGATTTACT 9180 

TTTTGCTTGG GATGACCATC GTTGCCAGCC TTGTCGGTGC TGGGATTAGT CTCTTTCTCC 9240 

TATTGCCAGT CTTTACGTCG TTGGGCAAAC TCAAGGAGCA TGCCAACCGG GTAGCGGCCA 93 00 
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AGGATTTTCC TTCAAATT7C CAGGTTCAAG GTCCTGTAGA ATWSA^CAA TTAGGGCAAA 93 60 

CTTTTAATCA GATGTCCCAT GATTTGCAGG TAAGCTTTGA TTCCTTCGAA GAAAGCGAAC 94 20 

GAGAAAAGGG CTTGATGATT CCCCAGTTCT CGCATCATAT TAAGACTCCT ATCACTTCGA 9480 

TCCAAGCGAC GGTAGAAGGG ATTTTGGATG GGATTATCAA GGAGTCGGAG CAAGCTCATT 9540 

ATCTAGCAAC CATTGGACGC CAGACGGAGA GGCTCAATAA ACTGGTTGAG GACTTGAATT 9600 

TTTTGACCCT AAACACAGCT AGAAATCAGG TGGAAACTAC CAGTAAAGAC AGTATTTTTC 9 660 

TGGACAAGCT CTTAATTGAG TGCATGAGTG AATTTCAGTT TTTGATTCAG CAGGAGAGAA 97 20 

GAGATGTCCA CTTGCAGGTA ATCCCAGAGT CTGCCCGGAT TGAGGGAGAT TATGCTAAGC 9780 

TTTCTCGTAT CTTGGTGAAT CTGGTCGA7A ACGCTTTTAA ATATTCTGCT CCAGGAACCA 9840 

AGCTGGAAGT GGTGGCTAAG CTGGAGAAGG ACCAGCTTTC AATCAGTGTG ACCGATGAAG 9900 

GGCAGGGTAT TGCCCCAGAG GATTTGGAAA ATATTTTCAA ACGCCTTTAT CGTGTCGAAA 9960 

CTTCGCGTAA CATGAAGACA GGTGGTCATG GATTAGGACT TGCCATTCCG CGTGAATTGG 10020 

CCCATCAATT GGGTGGGGAA ATCACAGTCA GCAGCCAGTA CGGTCTAGGA AGTACCTTTA 10080 

CCCTCGTTCT CAACCTCTCT GGTAGTGAAA ATAAAGCCTA AAACCCCTTT ACAAATCCAG 10140 

CTATTCATGG TACAATAGA? TTTGTGTGAA ATATCAGCAG GAAAGCATGA AGCTCGTCAA 10200 

CAGGTGTCTT ATGACAAGTA ACCTTGGCTG TTTAGGCGAA GGGCATCTGC ACGG 10254 
(2) INFORMATION FOR SEQ ID NO: 30: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



CCGGCGACTA 


TCGATAACAC 


TTGACTTGGT 


AGCCCCACAT 


TTTGGACAAC 


GCATCCTTTC 


60 


CCTCCTTATC 


GTTTTCTTTT 


CATTATACCA 


TTTTTTAAGC 


GATTCCCAAA 


ACAATTCTTC 


120 


TTTTTCCTTG 


ACAAGTTTTT 


TGTTTTGTTG 


TATTATTTAA 


TTAAGACAAC 


AAGGTAAAAG 


180 


AAAGGAGACT 


AAGATGTCCT 


GGACATTTGA 


CAACAAAAAA 


CCCATCTATT 


T ACAG ATT AT 


240 


GGAGAAAATC 


AAGCTTCAGA 


TTGTTTCCCA 


TACACTGGAA 


CCCAATCAAC 


AACTTCCAAC 


300 


CGTGAGGACC 


TAGCTAGCGA 


GGCTGGTGTC 


AATCCCAATA 


CCATCCAAAG 


AGCCTTATCA 


360 


GACCTTGAAC 


GAGAAGGATT 


TGTCTACAGC 


AAGCGAACAA 


CTGGACGATT 


TGTGACTAAG 


420 


GATAAGGAGC 


TAATCGCCCA 


GTCACGCAAA 


CAATTATCAG 


AACAAGAATT 


GGAACACTTC 


480 
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TCACCCATT 


TGGCTATGAA 


AAAGAAGAAC 


TACCAGGCGT 


AGTCAGTGAT 


540 


TATATTAAAG 


ft A tTFTT A A£SC 
unu lit rtAUV* 


CTATGTCATT 


ACTAGTATTT 


GAAAATGTAT 


CCAAATCATA 


600 






AAAATGTTTC 


TCTTGACATT 


CCAGCTGGAA 


AAATTGTCGC 


660 






CAGGAAAAAC 


AACCCTGATT 


AAACTAATTA 


ATGGCCTCTT 


720 




pa Ann app/tp. 

V_ AJMjVjA*-*j * 


TCCTCATCAA 


CGACATGCAC 


CCAAGCCCAG 


CAACCAAGGC 


780 


V.L> 1 I vj 1 AljL. i 




ATACGACCTA TCTCAATGAG 


CAAATGAAGG 


TCAAAGAAGC 


840 


1 AAv_\- 1 AL. 


1 1 l»AA\jAV«.t. * 


TCTATAAAGA 


TTGTCAGATC 


TTGAACGCGC 


CCATCATCTA 


900 


r TTr.rAnArr i 


TP.PP.PATTGA 


TGAAAATAGT 


CGTCTCAAGA 


AACTATCAAA 


AGGAAACAAA 


960 




r\M<— 1 1111 


CCTTATGAGC 


CGTGATGCTC 


GTCTCTATGT 


TTTGGACGAA 


1020 


L.v_V. A i i VjVj 1 \j 


rt(T.Tr.r. atpp 


AGCAGCCCGT 


GCTTATATCC 


TCAATACCAT 


TATCAACAAC 


1080 


L At. 1 LnV_L/\rt 


U 1 1 v 1 /W.V.U 1 


TTTGATTTCT 


ACCCACTTGA 


TTTCTGATAT 


CGAGCCAATC 


1140 




TTflTP TTP PT 
I lulLl Ivv. I 


AAAAGACGGA 


AAAGTCGTCC 


GTCAAGGAAA 


TGTAGATGAT 


1200 




Ad 1 Inuu l Vji\ 


ATCCATTGAC 


CAACTCTTCC 


GTCACaATTT 


AAGGCCTAAG 


1260 


L~AAAIjwAIjA 1 


lnl 1 IA1U1 * 


TTGGAATTTA 


GTTCGCTACG 


AATTTAAAAA 


TGTTAACAAG 


1320 


TLtOT A Ll. l Ati 




AGCCGTGCTA 


CTCCTTTCTG 


CCCTCATCCG 


AATACAGACA 


1380 


L AALKJ CllTA 


& & & ATPTAPP 


TT At. L. AAVjAA 


AP TP A CICC*** A 


CTATGCTACT 


TTTTCTAGCT 


1440 


A(_AOTt_ I nb 




GCTTACACTT 


GGGATTTCAA 


CCATTTTCTT 


GATTATTAAA 


1500 


LLiLTi IL,AAAL» 


vj I nu >Vj iL In 


CGACCGACAA 


GGCTATCTGA 


CTTTGACCTT 


CCCAGTTTCT 


1560 


vjAALALLA I A 




CAAACTAATC 


GCTGCCTTTA 


TCTGGTCATT 


GATTAGCACC 


1620 


O*- . J l n 1 . 


C***^*T AAGTCC 




CTICCTTTAA 


CACCTCCAGA 


ATGCATTCCT 


1630 


CTTTCTTATG 


TrAT^AfATT 


TGTAGAAACA 


CATCTCCCTC 


AGATCTTTCT 


TACAGGTATA 


1740 


TCCTTCCTAC 


TAAATACTAT 


TTCAGGAATC 


CTCTGCATCT 


ACCTGGCTA t 


T rv_LAl l\j<jA 


1 ana 


CAGCTTTTCA 


ATGAATACCG 


TACAGCACTC 


CCTGTTCCAC 


TCTACATTCC 


TATCCAAATC 


1860 


GTCATTGGAT 


TTATTGAACT 


TTTCTTCAAT 


CTTAGTTCTA 


ATTTCTATGT 


CAATTCACTG 


1920 


GTAGGACTCA 


ATGACCATTT 


CTATATGGGA 


GCAGGTATAG 


CCATTGTTGA 


ACAACTCATA 


1980 


TTCATAGCTA 


TCTTTTATCT 


CGGAACCTAC 


TACATCTTGA 


GAAATAAGGT 


TAATTTGCTT 


2040 


TAAATAATTT 


TTACCTAGAT 


ATGTAACATA 


CTCATAGAAC 


AAAAGAGACC 


AGGCAAAAAG 


2100 


TCTTTAAAAT 


TAGAAAACGC 


ATAGTATCAG 


GTGTTGAATA 


TGTACTCCcC 


CCCAAAAGTT 


2160 


AGATTTTTTC 


TGTCTAACTT 


TTGGGGGCAG 


TTCAT.AAGAA 


CCTTGGTAAT 


ATGCGTTTTT 


2220 
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TGTGAGCTGA 


CTTATTTCCT 


TTCACTATAT 


CGCAAAATGA 


AATAAGAACG 


GAACGATGGG 


2280 


AT7TTGCAAT 


TCAAATCAAT 


T7ATAAGAA7 


GTTTTAGAAG 


TAATATTATC 


CTATTCCACA 


2340 


TTCAGTTCAC 


TATACAATTG 


AGTTTTCAAG 


CAACCTGTTT 


ACATAATGTG 


TACATAATTA 


2400 


GGTTCGTGAT 


TCCACCCTTT 


7CACCTTTAA 


AAACCTCGCT 


TTCGCAAGGC 


TCTTCTATTT 


2460 


ATAAGATAAG 


GCACGTTTAA 


AGGTTTTCCA 


AATCCCTAAA 


TCATCCGTTT 


GAAGAACGAG 


2520 


ACTAGCATAC 


ATGCGTCCGA 


TAAATCCTGT 


TGCTACCACC 


GCAAAAATCA 


CTGTAATAGC 


258C 


AAGTGAAATC 


CA7CC7TC7G 


CTCCCCCCGC 


ATAG7CATTA 


ATCGTTCCAA 


ACGGCATAAA 


2640 


GAAGGTCGAA 


ATAAACGGAA 


TATAAGAACC 


AATCTTCAAG 


AGGAGATTGT 


CACCACCTCC 


2700 


ACCTAGAGCT 


GTCACTCCAA 


AAAAACCACC 


CATAATCAAA 


ATCATCAAAG 


GCGACAAGGC 


2760 


TTTCCCTGAG 


7CC7CAGGAC 


GAGAAACCAT 


AGATCCTAGG 


AAGGCTGCCA 


AGACTACGTA 


2820 


CATGAAAAGA 


CTGATCAAAA 


TAAAGAGCAA 


CGTATTCAGT 


GAGATAGCAT 


CTCCCAAGTG 


2880 


ATCCAAAATA 


CCACACTGAG 


CCAAGAATGG 


CAAATCTTTA 


AAGAGCAAAA 


CGGCAGCCAG 


2940 


ACCACCTACA 


ACATAGA7CC 


CAATATGCGT 


TAAAATCACT 


AGAAACAGAG 


CCATCATCCG 


3000 


CGCATAGAAA 


TAGTGACTTG 


CCCTTATGCT 


AGAAAAAACG 


ACTTCCATAA 


TTTTGGTCCC 


3060 


TTTTTCACTG 


GCAACTTCCT 


GAGCTGTTAC 


ACCCGCATAG 


GTAA7CAGAA 


TCATATAAAG 


3120 


AAACAATCCT 


AAGGCACCTG 


CTGCAATTGT 


7TGAATAAAC 


TTTTTATTTT 


CCTTGGCTTC 


3180 


A7CAATCTT7 


TCTCTCAATT 


GAATTGTCTG 


CGCTAAGCCT 


TTTTCC7GC7 


CTTGAGACAA 


3240 


GGAAGCAGTT 


GAACGATTAA 


GCTGATTTTG 


CAGTTCATTG 


AGTGTACC7C 


TAACCTCAAA 


3300 


TTTAATTCCA 


TTTTCAAGCG 


ATGTTTCGCC 


A7GATAAAC7 


GCCT77AGAA 


CACTATCTTC 


3360 


TTGATCAATG 


GTCAAATAAC 


CTTTTAATTT 


7TCT7CTTTA 


ATTGCTTCTT 


TGGCACTTGC 


3420 


TTCCTCTTTA 


TAG7CGAAGT 


TAACACCATT 


7ACA7777TC 


AGTCCT7CTG 


CTACAGATGG 


3480 


CACTGTTGTC 


ACTACTGCCA 


CTTTATTATT 


TTTAGCCATA 


GAAGAACC77 


GGAGATGCCC 


3540 


AATTCCTACA 


GAGATTCCTA 


AAAAGAGGAA 


CGGCGAAATC 


ACCATAAAGA 


AGAAACTCCA 


3600 


TGACTCGACA 


TGTCGAAGAT 


AGGTTTCCTT 


GATTACAACC 


CACATA777C 


TCATACTTCC 


3660 


ACTCC7GATT 


CTAGTTTAAA 


GATTTCATCG 


ATAGTTGGCG 


CTTGTTGGTC 


AAATGTTGCG 


3720 


ATATATTGAC 


CTTGAGTCAA 


GATTGAGAAG 


AGTTCCCTTC 


CAGCGCTCTC 


ATCC7CCAAA 


3730 


ATCAATTTCC 


AACTGCCTTG 


TTTGGTCAAG 


CTCACC7GTT 


TGACATGACG 


AAGATTTTCC 


3840 


AATTC7TCC7 


TGCTTCGT7C 


ACTTGAAACA 


AAGAGACGCG 


TTTTCCCG7A 


77GATTGCGG 


3900 


ACATCCTGAA 


CTGGTCCGTG 


CAAGACCACA 


CGGCCATCTC 


GGATCA7CAG 


AA7ATCGTCA 


3960 


CAAAGTTCCT 


CAACATTGGT 


CATGACATGG 


TCAGAAAAGA 


TAA7GGT7GT 


CCGCGCTCT7 


4C20 
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TTTCCTGAAA AA7GACTTGT TTGAGCAATT CTGTATTAAC TGGGTCCAAT CCACTAAAAG 4080 

GCTCATCCAA GATAATCAGG TCTGGTTCAT GAATCAGAGT AATAATGAGC TCAATCTTCT 4140 

GCTGATTTCC TTTTGACAGA CTCTTGATTT TATCTGTCAG CTTTCCTTTC ACTTCCAACC 4 200 

TCTTCATCCA TTGAGGGAGT TTTTCTTTGA CTTCTTTGGC ATCCATGCCT TTTAGAGTCG 42 60 

CCAAGTAGCG AACTTGTTCA AGAACTGTCA ATTTAGGCAT GAGATGCGTT CTTCAGGCAG 432 0 

ATAACCAATC CCAGCATAGG TCTCCTGACG AATATCCTGA CCATCCAGAC CCATTTCTCC 4 38 0 

CTGATATTCT AGGAATTTCA AAATACTATG GAAAATCGTT GTTTTTCCAG CACCATTTTT 44 40 

TCCGACTAGT CCCAAAATAC GACCTGGTCG CGCTTGAAAG TCAATACCAA ACAAAACT7G 4500 

CTTGGATCCA AAACTTTTCT CTAGACTTCT TACTTCTAGC ATCTTTCACC TCCGAAATTT 4 560 

CTTGCACTCA 7TATACTCCT TTTTGATAGC CTTTACAATG TTTTTTCTCC ATTTTTAGAA 4620 

GACTATTGCT GTGTAAAATA TGGCCTGGAG CACTTTTATA CTCAATGAAA ATCAAAGAGC 4680 

AAACTAGGAA GCTAGCCGTA GACTGCTCAA AGTACAGCTT TGAGCTTGCA GATAAAACTG 474 0 

ACCAAGTCgA CTCAAAACAC TGTTTTGAGG TTGTGGATAG AACTGACGAA JcCrTAaCTAT 4800 

ATCTACGGCA ACGCCAAcTG ACGTGGTTTG AAGAGATTTT CGAAGAGTAT TAGTGATAAA 4860 

TCCATTATAC AGCAGCAAAC TTAATTTATA CCTTCCGCTC CTCAACTCTC TATTTTTAAT 4920 

CCTGAATTGT TATTTGAGTA ACTCCTTTTT CCTCGTAAAG TTTTCTTCCT CTAAAACTTC 4 980 

TGGAAAAAGG CTAATACTTT CAGACAACAT TTTTATAAGA AACAAGTTCA TCTGTCATTT 5040 

CAAGAAGGAG TAATCCTTTA TCTACTAATG GACGGAACAG AATTCAACCG CTTGTCCGAT 5100 

ATGTTTTCTA AGGATTATAT AGTAAAATGA AATAAGAACA GGACAAATTG ATCAGGACAC 5160 

TCAAATTCA7 TTCTAACAAT C7TTTAGAAG 7AGA7CTA7A C7A7TCTAC7 77CAATC7CC 52 20 

TATATCTATT ATGCACACCC CTATAGGATC TAATGAAAAT CACAACAGGC TCATTCATAG 5280 

ATGGTTACCT AAGCCTAAGG GAACTAAGAA AACGACTACC AAGGAAGTCG CATTCATCGA 5 340 

AAAGTAGATT AACAACTATC CTAAAAAATG CTTGAACTAC AAGTCCCCCA GAGAAGACTT 5400 

CTGGATGACT AACTTGAACT TGAAATTTAG CAATAATTAA TTCACTATCT AACTATATTT 54 60 

AGTAATTATT TCAGAACTGA TTAATATTAA AATTAACTAA CAATTCAAAG GATTCATACT 55 20 

AGCCATAAAT TACGTCCATC AG AG AG AG AC TCTTACTACT TTTAGATTTT AGTCTTTCTA 5580 

GCTTCAGAAT ACATCTAAAC TTTAGGGAAA ATGACTATTC GAAAGCGCGA ATGCCTCAAA 5640 

ATTATCTCAG ATAAGCTATT CGAAACTTAG AATGCTTTTA AATTTATGGA ATTGCGATTA 5700 

7TCGAAACCT AGAATGCATA TAACCTTTAG TTGACAGACC TATTCTAAGT CTCGAAGGGC 5760 
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TATTTACTTT CTATTCCTTA TCAAAAAAGA CTCATTCCCC CTTTCTCCTC CAAAATATGG 5820 

TATAGTAGAA ATATACTATC TATGAGGAGT TTACATGTCA CAGGATAAAC AAATGAAAGC 5880 

TGTTTCTCCC CTTCTGCAGC GAGTTATCAA TATCTCATCG ATTGTCGGTG GGGTTGGGAG 5940 

TTTGATTTTC TGTATTTGGG CTTATCAGGC TGGGATTTTA CAATCCAAGG AAACCCTCTC 6000 

TGCCTTTATC CAGCAGGCAG GCATCTGGGG TCCACCTCTC TTTATCTTTT TACAGATTTT 6060 

ACAGACTGTC GTCCCTATCA TTCCAGGGGC CTTGACCTCG GTGGCTCGGG TCTTTATCTA 6120 

CGGGCACATC ATCGGGACTA TC7ACAACTA TATCCCCATC GTGATTGGCT CTCCCATTAT 6180 

CTTTTATCTA GTGCGCCTAT ACGGAGCTGC CTTTGTCCAG TCTG7CGTCA GCAAGCGCAC 62 40 

CTACGACAAG TACATCGACT CCCTACATAA GGGCAATCGT TTTGACCGCT TCTTTATTTT 6300 

TATGA7GATT TGGCCCATTA GCCCAGCTCA CTTTCTCTG7 ATGCTGGCTG CCCTGACCAA 63 60 

GATGAGCTTC AAGCGCTACA TGACCATCAT CATTCTGACC AAACCCTTTA CCCTCGTGGT 64 20 

TTATACCTAC GGTCTGACCT ATATTATTGA CTTTTTCTGG CAAA7CCTTT GACACGTAAA 64 80 

AAATC CGTTT GGTTTCCCAA GTGGATTTTT AAAGCGTAGA TTAACTATAG CTTGATACTA 6540 

AATATACTTT GGTATGGAAA TCATGCATAT TTTTCCATAG TGACGCGAGG ACTTACCTAG 6600 

CCTTTCCGCC GTGATAGAAA CACCTGAAAT CTAATGGTTT CAGGTATTCG GAAACTTTGA 6660 

GCCTAGTGTC TCAAAGTTTA GCTATGCAAT TTTGAAGAAA GTCGCTACCG TCCGTAATCA 6720 

CTTAAGGAAA GGCTCAAAAA TATTGTTTTC AACCACAAAA TCCGTTTGGT TTCCCAAGCG 6780 

GATTTTGTGC TTTATTTTGA AACTTCTTT7 GCAAGAACAA AGTTCCCAAG TGTGGCAGAA 6840 

CCATTTCCTG CCACTGC7CC CGTCACGATA TAGTCACGCA CATCTGGTAC TGGTAGGTAA 6900 

CCATTAAGAA GAGATGTAAA TTTCTCACGG ACACGGTCCA GCATATGTTG TTGAGCCATG 6960 

ACCCCTCCAC CAAAGACAAT CACGTCTGGG CGGAAAGTCA CTCTCGCATT AACCGCAGCT 7020 

TGACCGATAT AGTAGGCTTG AACATCCCAA ACAGGGTTGT TGAGTTCAAT ACTTTCCCCA 7 080 

CGTACACCTC TACGAGCTTC CAAACTTGGA CCAGC7CCA7 AACCTTCTAG ACATCCCTTA 7140 

TGGAAAGGAC AAACACCC7T AAACTCTTTT TCAATATCCA TTGGGTGTCT AGCAACATAA 7200 

TAATGACCCA TTTCACCCTC ACCCACACCA CCGATAAACT CACCACGTTG GATGACGCCT 72 60 

GCACCGATAC CTGTACCGAT TGTGTAGTAA ACCAAGTTTT CGATACGACC ACCAGCATTG 7 3 20 

TTACGGGCAA CCATTTCACC GTAACCAGAG CTCTTTACGT CTGTTGTGAA GTACATTGGC 7 380 

ACGTTTAGGG CGCCACGAAG GGCACCAACC AAGTCTACA7 TTGCCCAGTT TGGTTTTGGA 74 40 

GTCCTCGTGA TAAAGCCATA AGTTTTTGAG TTTTTGTCAA TATCAATCGG CCCAAATGAA 7500 

CCAACTGCAA GACCAGCAAG GTTATCGAAT TTTGAGAAGA ACTCAATGGT TTTATCGATT 7 560 
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GTTTCCATTG CAGTTGTTCT TGGAAATTGT GTTTTTTCTA CAACGTTAAA GTTTTCATCA 7 620 

CCGACAGCAC AGACAAACTT TGTACCGCCC GCTTCCAAGC TTCCATATAA TTTTGTCATC 7 630 

ATAAACCTCT TGTTTTTATT TTCTTTATTA TAGCATACTT CGAAAGTCTA AATGTCTCTA 7 740 

TTTTTTAGAT TTTCCTCTGT AAATCTTACT ATCTAATAAA AACGAACAAA CATGTCATTT 7800 

GTTCGTTTTC AC ATT AG AG A GGATTGATTA GATTTTCACT TCGATCACAG CATCCCCCTT 7860 

AGCAACTGAA CCTGTTGCGA CTGGAGCTAC TGAAGCGTAG TCACCTGTAT TTGTAACGAT 7920 

AACCATTGTT GTATCATCAA GTCCAGCTGC AGCGATTTTG TTTGAGTCAA ATCTTCCAAG 7980 

AACATCGCCA GCTTTCACCT TATTACCTTG ACCAACTTTT GTTTCAAAAC CGTCACCGTT 8040 

CATAGATACA GTATCAATAC CAACATGAAT CAAAACTTCA GCACCATTTC TTCTTTTCAA 8100 

ACCAAAAGCG TGCCCTGTTG GAAAGGCAAT TGAAACTTCA GCATCAGCTG GTGCATAGAC 8160 

CACGCCTTGG CTTCGTTTCA CAACGATACC TTGTCCCATA GCTCCACTTG AGAAGACTGG 8220 

GTCATTGACA TCAGCAAGAG CGACAACATC ACCGACGATA GGAGTTACAA GTGTTTCATT 8 230 

TTGAAGAGCT GCTGGCGCAA CTTCTTCTTT TTCTTCAGCC ACTTCAGCTC GTTTTGCACC 8 340 

TGCAGTTGCG TCTACTTCAT CTTCGTAACC AAACATGTAA GTAAGAGCAA AACCAAGGGC 8400 

AAATGATACA GCTACCATAA GAAGGTATTG TGGAACTTCT CCGTTACCAA CATAAAGCAT 84 60 

TGTACCAGGG ATGATGGTGA TACCATTACC AGTACCAGCA AGTCCAAGGA TAGAAGCCAA 8520 

TCCACCACCG ATTGCACCAG CAATCAATGA AAGGAAGAAT GGTTTACGGA AGCGCAAGTT 8580 

CACCCCGAAG ATAGCAGGCT CTCTAATACC 7AGGAAGCCTA GAAACAGCAG CCGGGAAAGC 36hC 

AAGTGTTTTC AGTTTTGGAT TTTTTGTTTT AACACCAACC GCAACAGTAG CAGCACCTTC 8700 

AGCTGTCATA GCAGCTGTGA TGATAGCCTT GAATGGGTTA GCATGGTCAG CAGCAAGTAA 87 60 

TTGCACTTCA AGCAAGTTGA AGATGTGGTG CACACCTGAC ACGACCATCA ATTGGTGAAC 8820 

CCCACCAATC AAGAAACCAC CAAGACCAAA TGGCATGCTA AGAATCGCTT TTGTAGCAAT 8880 

AAGGATGTAG TTTTCAACAA CGTGGAAAAC TGGTCCAATG ACAAAGAGTC CAAGGATAGA 8940 

CATGACCAAA ACTGTCACGA ATGGTGTTAC CAAGAGGTCA ATGACATCTG GAACAACTTG 900C 

CGGACAGCTT TTTCAAATTT AGCTCCGACA ACCCCGATGA TGAAGGCTCG AAGAACGGAA 9060 

CCTTGCAAAC CAACAACAGG GATGAAACCA AAGAAGTTCA TCGCTGTTAC TTCACCACCT 9120 

TGAGCAACTG CCCAAGCGTT TGGAAGTGAG CCAGAGACAA GCATCATACC AAGAACGATA 9130 

CCAACGGCAG GATTTCCACC AAATACACGG AAGGTTGACC ACACAACCAA ACCTGGCAAG 92 40 

ATCATGAAGG CTGTATCTGT CAAGATTTGT GTGTAAGTTG CAAACTCACC TGGAAGTGGC 9 3 00 
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ATTTCAAGAG CGTTGAAAAG ACCACGCACA CCCATGAAGA GACCTGTCGC TACGATAACT 9 3 60 

GGGATGATTG GAACGAAAAC ATCACCAAAA GTACGGATAG CACGTTGGAA CCACTTCCCT 94 20 

TGTTTAGCAA CTTCTGCTTT CATGTCATCC TTAGATGATG TTGGTAATCC AAGTACAACA 9430 

ACTTCATCGT ACATTTTGTT AACTGTACCT GTACCAAAGA TAATTTGGTA TTCCCCTGAG 9S40 

TTAAAGAAAG CACCTTGAAC TTTTTCCAAG TTCTCAATCA CTTCTTTATT GATTTTCTCT 9600 

TCATCTTTGA CCATGACACG TAGACGAGTC GCACAGTGGG CAACACTATT GACATTTTCA 9660 

CGTCCGCCCA AGGCATCGAT GACTTTTTTT GCAATTTCCT GATTGTTCAT TTGCAAAAAT 9720 

CTCCTTATAT AACATTTTGT TCTT C TTTCA AAGCGATTTT ATTCGCCGG 976 9 



(2) INFORMATION "OR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A} LENCTH: 3149 base pairs 
(3) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 31: 

CGCTTGAGTG CTAATTCATA GTTCTATTCT ATCACTTGGT CAGAAATAAT CAAGAAAAAA 60 

GTCTGACTTT CTCAAGATAA AAAGC CTG AG ACCAACTCAG ACTTTTTAAT TCTTAAAATG 120 

GCAATTCTTC CTCTTCCAAG ACCAAATCTG CCAAATCTTG GCCTGCATTA TTTTCACGCA 190 

TAGCACGTTG CCCACGACTT TCCAAGAGTT GGAATCCTGT GACAAGTACT TCGGTCACGT 240 

AGTTCATTTG GCCATTTTTC TCAAAGCGAC GGCTACGCAA TTCTCCATCA ACGGAAATGA 300 

GACTACCTT? GGTTGCGTAC TTGCCAAAGT TTCTGCTAGT CTGCCCCATA GGACCATATT 3 60 

GACAAAATCA GCTTCACGTT CACCGTTTTG CTCTTTCTAA CGACGGTTCA CAGCGATAG? 4 20 

TGCTCGCGCT ACCCACTTCT CATTGTTGGT TTTGTCCAAT TCTGGTGTAG ACGTTAAACC 4 80 

TCCAATCAAG ATAACTTTAT TATACATATT TTCTTCCTCC TACTTATCTA TTCGTAGG/A 540 

ATCAAAAAAA GTTACAGAAA TTTGTAACTT TTCGAGAAAA TTTTT7ATTT TTTATGAACC 600 

ATGAAACCTG TCGCCTGTTG ATTGGCCATA ATGGTCATAT CTGTAATCTG AACACGACGA 6 60 

GGTTGACTAG TCACATAGAC TACTGTATCT CCAATATCCT GAGCTTGCAA AGCTTCTATT 720 

CCTTGGTAAA CGGACGCAGC TCGTTCTTTA TCACCATGAA AACCCACTC7 AGAAAAATCT 780 

GTTTCGACAA TTCCAGGCTG AATGGTCGTC ACCTTGATAT CCC7TCCGAT GG7ATCAATT 840 

CGCAGTCCAT CTGAAAAGGT CTTAACTGCC GCCTTGGTCC CTGAG7AAAC AGCTGCACCA 900 

GCATAGGCAT AAATTCCTGC GGTTGACCCC A7ATTGATAA TATCACCTTG ATTCCC7TT7 960 
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ACCATTGC7G GCAAGAAACA GCGAGTGACT GCCATCAAAC CTTTGACATT GGTATCCAAC 1020 

ATGGTCAGCA TATCCAACTC TTCATAGTCT TGATAGGGAG CTAAGCCAAG AGCCAGTCCT 1080 

GCGTTATTGA CCAGGATGTC AATCTGACCT ATCGTTTCTA AAATATCAGA GCAGACAGTC 1140 

TTTACCATTG TCATATCCGT GACATCTAGG AGAAAAGTCC AAACTGTTTG ATTTGGAAAA 1200 

GTTTCTGCAA ACTCCGCCTT AAGAGCTTCT AGTCTGTCTA TCCGTCGTCC TGTTAGAACG 1260 

~ ACATCC7CAC CC7GCTCCAG ATAAGCACGC GCAATCGCTT CACCGATTCC TGATGTCGCT 13 20 

' CCTGTAATCA CAACA777T7 TGCCATCTTA TTTCCTTCTA GCTGC7C7AT CAGATATTAA 13 80 

CAACTTCT7A GGCAGTCCAG TC7TTCGCTG GGTCGAACGG TGTTCCGACA ACTTGGTCTT 14 40 

CTGATAATTC AAGCACCCCA CGTTTTTGTG GAGCATTTGG CAGATGCAAT TCACGAGGAC 1500 

TGCACA7CAT ACCAAAACTC TTTTCACCAC GAAGTTCACC TGGGAAAATG AGATTCCCTT 1560 

TTCGCATCAT AGCTCCAGGA AGCGCGACAA TGGTTTTCAA CCCCACACGC GCATTGGGAG 1620 

CTCCTGCAAC GATTTGTACA CTC77ATCAC TTGCGACTGC AACTTGGCAG ATGTTGAGGT 16 80 

CG7CAC7ATC TGGATGGGCT ACCATCTCAA CAATTTCACC TACAACAAAC TTACGTTCCT 17 4 0 

TA7CATTAAC AATTTCTTCT GTAAAACCTT CCGCCTGCAA CTCTTGGTTC AAACGAGCGA 1800 

CTTGCTCATC TGTCAAAAAG ACTTGACCCC GCTCTGCAAT TTCAAATAAA CTTGAAACTT 1860 

CGAAAATATT CCAAGCCACT GTTTCCCCAT TATCTTTGAG AAAAACACGG GCTACCTTCC 1920 

CTTTGCGCTC CACATCCAGT TTGGCATCTC CGCTATTTTT CACGATGACC ATAAGGACAT 1930 

CACCCACA70 TTTTTTATTA TATGTAAAAA TCATTG77TC CTTTTTC7CC 7A7T7CAGTC 204 0 

CTGCTAAAAA GTCATTGATT TGTTGCTTGC TTT7ACCG7C GCGATTGACA AAACGACCCA 2100 

7TTCCT7GTC CTT7TCTAGA ACAACAAGGC 7AGGAATTCC GTAAACATCC CAGAGTTTGG 2160 

CCAAATCCAT ATACTCATCT CGGTCCATTC GAATAAACGT GAACTC7GCA TTGGTCTCCT 222 0 

CAA7CTCTGG TAAGGCAGGA TAAATATAAC GACAATCGCT ACACCAGTC7 GCCACAAAAA 2 280 

TGAAGACCTT CTTGCCCGCT TTTTCCACTA AAGATGCTAA TTCTTCTAAA CTTGCTGGCT 2 340 

GTATCATAAG ACTTCCTCCT CATAGACTAG GTCTTCATTT TCA7AGACAA AGGTATAATG 2400 

ACGGCCATCC TCAAAAATGA CGCCACCAAC CAACCTCTCC AGAC7GC77T CG7AAACTTG 2 4 60 

AACA7AAAGG G7CGCAAT7T CCCCCA7GTC GGAAAAA7GG TCTCGCACAA TC7CTGTCAA 2 520 

CTCTTCCTGA GTCTTCATGA GCTTACGGTC A7C7GCAAC7 7TT7TCGTAG CAAGAGCAAG 2 580 

GCTTCCGATA CCTAGCAGAG CCAAGCC7GC CATCCACA7T 7T7TTAGCTT 7CATACCAT7 2 640 

CA7TTTAACA CAAAAAAGGC 7TCAGGACAA A7GAGGAAGC AGCAGAAAAG CAAGTAAAAA 2 7 00 
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GCCTCTTCCT TTAAGGAAAA GGACTTCTTA TACTCAATGA AAA7CAAAGA CCAAACTAGG 27 60 

AAGCTAGCCG CAGGCTGCTC AAAGCACTGC TTTGAGGTTG TAGATAGAAC TGACGAgTCa 2820 

CTCAAAACAC TGTTTTGAGG TTGTGGATGA AGCTGACCTG GTTTGAAGAG ATTTTCGAAG 2 880 

AGTATTATTC TTATTGCCAG GCACCTAAGT TGCCAACGTA GTAACTATCA GGTGTGTAGG 2 940 

TATTGCGAGC ATCTTACCTG ATGAAGCCAG ATAATACTAC TTGCCATTGT CTTTGACCCA 3000 

ATCATTCGCA ATCATGGAAC CAGAAGAACT TACATAATAC CATTCTCCCT TGTCATAAAC 30 60 

CCAACTACTG ACTTTCATGG TTCCTGAGCA ATTAAAGGCA AAAAAACTG? CCAATAACAT 3120 

TCGTTTTTTA AAAGCATTTG AC ACT AC AT 3149 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



CCAAAAATTC 


AACCTTTAAG 


GGCAGTCCAG 


AGAGACTCAC 


AAGGTGTCAG 


ATAAAAGAAT 


60 


GGTGCAATTT 


TCTAGAGGAG 


ACTTTTTGAG 


TGTGCTCTCT 


TGTGTTGTAC 


GATTTTAACT 


120 


GAGGCCTTGC 


ACTAGCAAGG 


TCTTTTCTTT 


ATCTGGTCCC 


CTTAAAATTT 


AAGGAGGAAA 


180 


AGTTATGAAT 


CCCACATGTA 


AGAAGCGTTT 


GGGTGTCATT 


CGGTTGGAAA 


CCATGAAGGT 


240 


GGTTGCACAA 


GACCAAA7CC 


CGCCACAATC 


TTTGAATTAG 


TCCTAGAAGG 


AGAAATGCTT 


300 


GAAGCCA7GC 


GAGCAGCCCA 


ATTTCTTCAT 


CTGCGTGTAC 


CGGACGATGC 


CCATCTCTTA 


360 


CGTCGTCCTA 


TTTCAATTTC 


GTCTATTGAC 


AAGGCAAACA 


AGCAGTCTCA 


CCTCATTTAT 


420 


CGGATTGACG 


GAGCTGGGAC 


TGCAATTTTT 


TCAACCTTAA 


GTCAGGGAGA 


CACTCTTCAT 


480 


GTGATGGCCC 


CTCAGGGAAA 


TCGTTTTGAC 


TTGTCTGACC 


TTGATGAGCA 


GAATCAGGTT 


540 


CTCCTTGTTG 


GTGGTGGGAT 


TGGTGTTCCA 


CCCTTGCTTG 


AGGTGGCCAA 


GGAATTGCAT 


600 


GAACGTGGAG 


TGAAAGTAGT 


GACAGTCCTC 


GGTTTTGCTA 


ATAAGGATGC 


TGTTATTTTG 


660 


AAAACGGAAT 


TGGCTCAGTA 


TGGTCAGGTC 


TTTGTAACGA 


CAGATGATGG 


TTCTTATGGC 


720 


ATCAAGGGAA 


ATGTTTCCGT 


TGTTATCAAT 


GATTTAGACA 


GTCAGTTTGA 


TGCTGTTTAC 


780 


TCGTGTGGGG 


CTCCAGGAAT 


GATGAAGTAT 


ATCAATCAAA 


CCTTTGATGA 


TCACCCAAGA 


840 


GCCTATTTAT 


CTCTGGAATC 


TCGTATGGCT 


TCTGCCATCG 


GAGCTTGCTA 


TGCCTGTGTT 


900 


CTAAAAGTAC 


CAGAAAACGA 


GACGGTCAGC 


CAACGCGTCT 


GTGAAGATGG 


TCCTGTTTTC 


960 
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CGCACAGGAA CAGTTGTATT ATAAGGAGAA AATTATCACT ACAAATCGAT TACAAGTTTC 1020 
TCTACCTCGT TTCGATTTCA AAAATCCGAT TATTCCAGCA TCACCC7CTT TTGGCTTTGG 1080 
ACAAGAGTAT GCCAAGTACT ATGATTTAGA CCTTTTAGGT TCTATTATGA TCAAGGCGAC 1140 
AACCCTTGAA CCACCTTTTC GGAATCCAAC TCCAAGAGTG GCAGACACGC CTGCTGGTAT 12 00 

GCTCAATGCA ATTGGCTTGC AAAATCCTGG TTTAGAGCTT CTTTTGCCTC AAAAGCTACC 12 60 

7TCCC7CGAA AGAGAATATC CAAATCTTCC TATTATTGCC AA7GTAGC7C G77TTTCAAA 13 20 

ACAAGAGTAT GC AG CTGTTT CTCATGGGAT TTCCAAGCCA ACTAATGTAA AAGCTATCGA 13 80 

CCTCAATATT TCTTGTCCCA ATGTTGACCA CTGTAATCAT GGACTTTTGA TTGGTCAAGA 14 40 

TCCAGATTTG GCTTATGATG TGGTGAAAGC ACCTGTGGAA CCCTCAGAAG TGCCAGTTTA 1500 

TGTCAAATTA ACCCCGAGTG TCACCGATAT CGTTACTGTC GCAAAAGCTG CACAAGATGC 15 60 

GGGAGCAAGT GCCTTCACCA TGATCAATAC TCTCCTTGCA ATCCCCTTTG ACCTCAAAAC 1620 

TAGAAAACCA ATCTTGGCCA ATGGAACAGG TGGAATGTCT GGTCCACCAC TCTTTCCAGT 1680 

AGCCCTCAAA CTCATCCGCC AAGTTGCCCA AACAACAGAC CTGCCTATCA TTGGAATGGG 1740 

AGGAGTCGAT TCGGCTGAAC CTGCCCTAGA AATGTATCTC GCTGGCCCAT C7GCTATCCC 1900 

AGTTGGAACA GCTAACTTTA CCAATCCTTA TCCC7CCCCT GACATCATCG AAAATTTACC 1860 

AAAAGTCATG GATAAATACG GTATTAGCAG TCTGGAAGAA CTCCCTCAGC AAGTAAAACA 1920 

GTCTCTCACC TAAACTGCAA TCAATCTGTT CTTCATTTTT TATTAGTTTC TAATATGAAT 1980 

TTACCAGAAT TTTGGTACAA TAAAATAAAT AAGAACACAG OAAGAAGGTT AATCAACAAA 204 0 

CTAAGATTTA TTTTTTTACC TCTCCTATTT TTCTTAGCTA GTCCAGAGGG TGCAATGGCT 2100 

ACTGATGCTA CTTGGCAAGG AAAAC AGT AT CTGAAACAAG ATGGCAGTCA AGCAGCAAAT 2160 

GAGTGGGTTT TTGATACTCA TTATCAATCT TCGTTCTATA TAAAAGCACA TGCTAACTAT 2H2C 

GCTGAAAATC AATGGCTAAA GCAAGGTGAC GACTATTTTT ACCTCAAATC TCGTGGCTAT 2 28C 

ATGGCCAAAT CAGAATGGGT AGAAGACAAG GCAGCCTTTT ATTATCTTGA CCAAGATGGA 2 340 

AACATGAAAA GAAATCCTTC GGTAGGAACT TCCTATGTTG GTGCAACAGG TCCCAAAGTA 2 4 00 

A7AGAAGACT GGGTCTATGA TTCTCAATAC GATGCTTCG7 TTTATATCAA ACCAGATGGA 2 4 60 

CAGCACGCAG AGAAAGAATG GCTCCAAATT AAAGGGAAGG ACTATTAT7T CAAATCCGGT 2 52 0 

GGTTATCTAC TGACAAGTCA GTGGATTAA7 CAAGCT7ATG 7GAATCCTAG TGCTCCCAAA 2580 

G7ACAGCAAG GTTGGCT77T TGACAAACAA 7ACCAA7CT7 GGTTTTACAT CAAACAAAAT 2640 

GGAAAC7ATG CTGATAAAGA ATCCAT7TTC GAGAATGG7C ACT ATT ATT A TCTAAAA7CC 27 00 
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GGTGGyTACA 


TGGCAGCCAA 


TGAATGGATT 


TGGGATAAGG 


AATCTTGGTT 


TTATCTCAAA 


2760 


TyTGATGGGA 


AAATrCCTGA 


AAAAGAATGG 


GTCTACGATT 


CTCATAGTCA 


AGCTTGGTAC 


2820 


TACTTCAAAT 


CCGGTGGTTA 


CATGACAGCC 


AATGAATGGA 


TTTGGGATAA 


GGAATCTTGG 


28S0 


TTTTACCTCA 


AATCTGATGG 


GAAAATAGCT 


GAAAAAGAAT 


GGGTCTACGA 


TTCTCATAGT 


2940 


CAAGCTTGGT 


ACTACTTCAA 


ATCTGGTGGC 


TACATGGCGA 


AAAATGAGAC 


AGTAGATGGT 


3000 


TATCAGCTTG 


GAAGCGATGG 


TAAATGGCTT 


GGAGGAAAAA 


CTACAAATGA 


AAATGCTCCT 


3060 


TACTATCAAG 


TAGTGCCTGT 


TACAGCCAAT 


GTTTATGATT 


CAGATGGTGA 


AAAGCTTTCC 


3120 


TATATATCGC 


AACGTAGTCT 


CGTATGGCTA 


GATAAGGA7A 


GAAAAAGTGA 


TGACAAGCGC 


3130 


TTGGCTATTA 


CTATTTCTGG 


TTTGTCAGGC 


TATATCAAAA 


CAGAAGATTT 


ACAAGCGCTA 


3240 


GATGCTAGTA 


AGGACTTTAT 


CCCTTATTAT 


GAGAGTGATG 


GCCACCGTTT 


TTATCACTAT 


3300 


GTGGCTCAGA 


ATGCTAGTAT 


CCCAGTAGCT 


TCTCATCTTT 


CTGATATGGA 


AGTAGGCAAG 


3360 


AAATATTATT 


CGGCAGATGG 


CCTGCATTTT 


GATGGTTTTA 


AGCTTGAGAA 


TCCCTTCCTT 


3420 


TTCAAAGATT 


TAACAGAGGC 


TACAAACTAC 


AGTGCTGAAG 


AATTGGATAA 


GGTATTTAGT 


3480 


TTGCTAAACA 


TTAACAATAG 


CCTTTTGGAG 


AACAAGGGCG 


CTACTTTTAA 


GGAAGCCGAA 


3540 


GAACATTACC 


ATATCAATGC 


TCTTTATCTC 


CTTGCCCATA 


GTGCCCTAGA 


AAGTAACTGG 


3600 


GGAAGAAGTA 


AAATTGCCAA 


AGATAAGAAT 


AATTTCTTTG 


GCATTACAGC 


CTATGATACG 


3660 


ACCCCTTACC 


TTTCTGCTAA 


GACATTTGAT 


CATCTCCATA 


AGGGAATTTT 


ACCTCCAACC 


3720 


AAGTGGATTA 


AGGAAAATTA 


TATCGATAGG 


GGAAGAACTT 


TCCTTGGAAA 


CAAGGCTTCT 


3780 


GGTATCAATC 


TGGAATATGC 


TTCAGACCCT 


TATTGGGGCG 


AAAAAATTGC 


TAGTGTCATG 


3840 


ATGAAAATCA 


ATGAGAAGCT 


AGGTGCCAAA 


GATTAGTACT 


ATAAGTGAAT 


ATCATTTGAG 


3900 


TGAATAGTAA 


GTTAAAAATC 


CTGATTTCAA 


GTAAAATCAG 


GATTTTTTCA 


TGGATGCAAT 


3960 


TTTTTTGGAG 


TCTGGTGTGA 


CGCGGAGGGT 


CTTTTGTCCT 


GTGTAAGTGA 


CAAAGCCGGG 


4020 


TTTTCCACCA 


GTTGGTTTAT 


TGAGTTTTTT 


GACTTCAATC 


ATATCTACCT 


GCACCAGArr 


4080 


CCACACGCGC 


CCTTGAGAGA 


AGTAGGCAGC 


TAACTCTGCT 


GCGTCTGTCT 


TGACTGCATC 


4140 


AGATGGGTCA 


AGATTTCCTG 


AGATGACAAC 


ATGGCTTCCA 


CGAATGTCCT 


TAGCATGGAA 


4200 


CCAAAGTTCC 


TCCTTGCGGG 


CCATTTTAAA 


GGTCAATTCC 


TCATTTTGAA 


GATTGTTTCG 


4260 


TCCGACATAG 


ATGATGGTTT 


TGCCATCGCT 


TGCTAGATAT 


TGTTCTACTT 


TTTTGCGTTT 


4320 


CTGGATTTTC 


TCCCGTTGTC 


TTCTGCCGAT 


AAAACCTGTT 


TGAATCAATT 


CTTCACGGAT 


4 380 


TTCAGCGATT 


TCTTCCAGTC 


CAGCTTGGTT 


GAGGACGGTT 


TCTACACTTT 


CC AG AT AG AG 


4440 


AATAGTGGCT 


TTGGTTTCTT 


CAATCAAATC 


AGTCAAGTAT 


' TTGACAGCTT 


1 CTTTGAGTTT 


4500 
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CTCATACCCT TTAAAATAGC GTTGGGCATT CTGGTTGGGA GTCAGAGCCT TATCAACCGC 4560 

AATCATGATA GGTTGGTTGC TATAGTAGTT GTCTAGGATA ACCTGGTCTT GGTCGTTAGG 4620 

CACTTGGTGG AGGAAGGTTG TCAGCAATTC TCCTTTTTGA CGAAATTCTT CAGCGTTGTC 4680 

TGTCGCCAGT AACTCTTTTT CCTGTTTTTT GACTTTGTGT CGGTTTT TCT GAACT7CAT7 47 4 0 

TTCAACACGA CGAATCAGT7 CACTGGCCTG CTGTTTGACG CGGTCGCGCT CAGCC1TATC 4 800 

C7TATAGTAG GTGTCCAACA AATCAGAAAG ATTTGCAAAA GGCTCTCCCA CCTGATT7CC 4860 

AAAAGGAACT GGACTGAAGG AACTCTCACT CAAGCATGGC TTGCTTTCTT GATTGAAAAA 4 920 

ATTTCGGAAA GCGGAAAGTT TTTCACTAAC CAGTATCCTT TCCAATTCAT TTGCCGTATC 4930 

GCGTCCCAGA CCTTGAAAGA GGCTTTGAAG ATTTTTTGCT GTTAGTTCTT GGGTTTGCAG 5040 

GATTTCAAAG AG CT I TT CAT CCTTGATAGT AAAAGGATTG ACACATTTTG TACTTGGCGG 5100 

AG CG AT AT AG GTCGATCCTG GAAGTAAGGT CCCGTAGCTA TTTTGTGAAA AGCCGACGTG 5160 

TTTGATAACT TCCAGGATTT TA7GACTGCT TTTATCGACC AGTAGAATAT TACTGTGTTT 5220 

CCCCATAATT TCGATAATCA AGGTAGCCTG GATATGGTCT CCAATCTCGT TTTTATTGGA 5280 

AACTGTAATT TCCACAATAC GGTCATTTTC CACTTGCTCA ATCGACTCAA TCAGGGCCCC 53 4 0 

CTGCAAATAC TTTCTCAAAA CCATGATAAA GGTAGAAGGT TGAGCTGGAT TTTCAAAACT 5400 

CGTTTCGGTC AGCTGAATGC C7CCAAAAAC TCGATGCGCA GAAAGGAGCA GGCGATGGCT 5460 

TTGGCCATTC CTGCGGATTT GCAAGACCAA CTCTTGTTCA AAAGGCTGAT TGATTTTCTG 5520 

CATCCGACCA TTCACTAA7T CCCTTCGCAA TTCCTCAACT ATGTGGTGTA AAAAAAATCC 5580 

GTCAAATGAC ATCGTTCTCT CCTTGTCATT GTATTCCATA GTATTATATC AAAAAGGT AG 564 0 

AATAAAATCA TGGAAATGTG GTATAATAAA GCCAAGTAAA GAGAAACGAG AAGCACATGT 5700 

ATATTGAAAT GGTAGATGAA ACTGGTCAAG TTTCAAAAGA AATGTTGCAA CAAACCCAAG 57 60 

AAATTTTGGA ATTTGCAGCC CAAAAATTAG GAAAAGAACA CAAGGAGATG GCAGTCACTT 5820 

TTGTGACCAA TGAGCGTAGT CA7GAACTTA ATCTGGAGTA CCGTAACACC GACCGTCCGA 5380 

CAGATGTCAT CAGCCTTGAG TATAAACCAG AATTGGAAAT TGCCTTTGAC GAAGAGGATT 594 0 

TGCTTGAAAA TTCAGAATTG GCAGAGATGA TGTCTGAGTT TGATGCCTAT ATTGGGGAAT 60C0 

TGTTCATCTC TATCGATAAG GCTCATGACC AGGCCGAAGA ATATGGTCAC AGCTTTCACC 6060 

GTGAGATGGG CTTCTTGGCA GTACACGGCT TTTTACATAT TAACGGCTAT GATCACTACA 6120 

CTCCGGAAGA AGAAGCGGAG ATGTTCGGTT TACAAGAAGA AATTTTGACA GCCTATGGAC 6180 

TCACAAGACA ATAAACGAAA ATGGAAAAAT CGTGACTTGA TATCCAGTTT AGAATTTGCT . 624 0 
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TTGACAGGTA 


TTTTTACTGC 


TATCAAGGAA 


. GAACGCAATA 


, TG66WW^CA 


CGCAGTGACG 


53C0 


GCTCTAGTGG 


TCATCCTTGC 


ACCTTTTGTT 


TTTCACGTGT 


CACGAATCGA 


ATGGCTCTTT 


6360 


CTCCTATTCA 


GTATTTTCTT 


GGTAGTAGCC 


TTTGAGATTA 


TCAACTCTGC 


TATTGAAAAT 


6420 


GTGGTGGAT7 


TGGCCAGTCA 


CTATCACTTT 


TCCATGCTGG 


CTAAAAATGC 


CAAGGATATG 


6430 


GCGGCCGGCG 


CGGTATTAGT 


GGTTTCTCTT 


TTCGCAGCCT 


7AACACGCGC 


ATTGATTTTT 


6540 


CTCCCACGAA 


TCTGGGATTT 


ATTATTTTAA 


ACAGTAAGAG 


GAAATTATGA 


CTTTTAAATC 


6600 


AGGCTTTGTA 


CCCATTTTAC 


GACGTCCCAA 


TGTTGGGAAG 


TCAACCTTT? 


TAAATCACGT 


5660 


TATGGGGCAA 


AAGATTGCCA 


TCATGAGTGA 


CAAGGCGCAG 


ACAACGCGCA 


ATAAAATCAT 


6720 


GGGAATTTAC 


ACGACTGATA 


AGGAGCAAA7 


TGTCTTTATC 


GACACACCAG 


GGATTCACAA 


6780 


GCCTAAAACA 


GCTCTCGGAG 


ATTTCATGGT 


TGAGTCTGCC 


TACAGTACCC 


TTCGCGAAGT 


6840 


GGACACTGTT 


CTTTTCATGG 


7GCCTCC7GA 


TGAAGCGCGT 


GGTAAGGCGC 


ACGATATGAT 


6900 


TATCGAGCGT 


CTCAAGGCTG 


CCAACGTTCC 


TCTCATTTTG 


GTCGTGAATA 


AAA7CGATAA 


6960 


GGTCCATCCA 


CACCACC7C7 


TGTCTCAGAT 


TGATGACTTC 


CGTAATCAAA 


TCCACTTTAA 


7020 


GGAAATTGTT 


CCAATCTCAG 


CCCTTCAGGG 


AAATAACC7C 


TCTCGTCTAG 


TGGA7ATTT7 


7080 


GAGTGAAAAT 


CTGGATGAAG 


GTTTCCAATA 


TTTCCCGTCT 


GATCAAATCA 


CAGACCATCC 


7140 


AGAACGTTTC 


TTGGTTTCAG 


AAATGGTTCG 


CGAGAAAGTC 


TTGCACCTAA 


CTCCTGAACA 


7200 


GATTCCGCAT 


TCTGTAGCAG 


TAGTTCTTGA 


CTCTATGAAA 


CGAGACGAAG 


AGACAGACAA 


7260 


GGTTCACATC 


CGTGCAACCA 


TCATGGTCGA 


GCCCGATAGC 


CAAAAAGGGA 


7TATCATCGG 


7320 


TAAAGGTGGC 


GCTA7GC77A 


AGAAAATCGG 


TAGCA7CGCC 


CGTCGTGATA 


TCGAAC7CAT 


7380 


GCTAGGAGAC 


AAGGTCTTCC 


TAGAAACCTG 


GGTCAACGTC 


AAGAAAAACT 


GGCCCGA7AA 


7440 


AAAGCTAGAT 


TTGGCTGACT 


TTGGCTATAA 


TGAAAGAGAA 


TACTAAGTAG 


AGG7AGGCTC 


7500 


ATGCCTGCTT 


CTTGTTTTTA 


CAGAAGGAGG 


ACTTATGCCT 


GAATTACCTG 


AGG7TGAAAC 


7560 


CGTTTGTCGT 


GGCTTAGAAA 


AATTGATTAT 


AGGAAAGAAG 


ATTTCGAGTA 


7AGAAATTCG 


7620 


CTACCCCAAG 


ATGATTAAGA 


CGGATTTGGA 


AGAGTTTCAA 


AGGGAATTGC 


C7AG7CAGA7 


7680 


TATCGAGTCA 


ATGGGACGTC 


GTGGAAAATA 


7*TT G CT7T T T 


TATCTGACAG 


ACAACG7C77 


7740 


GATTTCCCAT 


TTGCGGATGG 


AGGGCAAGTA 


TTTTTACTAT 


CCAGACCAAG 


GACCTGAACG 


7300 


CAAGCATGCC 


CATGTTTTCT 


TTCATTTTGA 


AGATGGTGGC 


ACGCTTG7TT 


ATCAGGATGT 


7860 


7CGCAACTTT 


GGAACCATGG 


AACTCTTGGT 


CCC7GACCTT 


TTAGACGTCT 


AC777ATTTC 


7920 


TAAAAAAT7A 


GGTCCTGAAC 


CAAGCGAACA 


AGACTTTGAT 


TTACAGGTCT 


77CAATCTGC 


7980 


CCTTGCCAAG 


TCCAAAAAGC 


CTATCAAATC 


CCATCTCCTA 


GACCAGACCT 


TGGTAGC7GG 


8040 
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ACTTGGCAAT ATCTATGTGG ATGAGGTTCT CTGGCGAGCT CACCTTCATC CAGCTAGACC 3100 

TTCCCAGACT TTGACAGCAG AAGAAGCGAC TGCCATTCAT GACCAGACCA TTGCTGTTTT 8160 

GGGCCAGCCT GTTGAAAAAG GTGGCTCCAC CATTCGCACT TATACCAATG CCTTTGGGGA 82 2 0 

AGATGGAAGC ATGCAGGACT TTCATCAGGT CTATCATAAG ACTGGTCAAG AATGTGTACG 32 80 

CTGTGGTACC ATCATTGAGA AAATTCAACT AGGCGCACCT GGAACCCACT TTTGTCCAAA 3 3 40 

CTGTCAAAGG AGCGACTGAT GGGAAAAATC ATCGGAATCA CTGGGGGAA7 TGCCTCTGGT 8400 

AAGTCAACTG TGACAAATTT TCTAAGACAG CAAGCCTTTC AAGTAGTGGA TGCCGACGCA 84 60 

GTCGTCCACC AACTACAGAA ACCTCGTGGT CGTCTGTTTG AGGCTCTAGT ACAGCACTTT 8520 

GGGCAAGAAA TCATTCTTGA AAACGGAGAA CTCAATCGCC CTCTCCTAGC TAGTCTCATC 85 30 

TTTTCAAATC CTGATGAACG AGAATGGTCT AACCAAATTC AAGGGGAGAT TATCCGTGAG 864 0 

GAACTGGCTA CTTTGAGAGA ACAGTTGGCT CAGACAGAAG AGATTTTCTT CAT GG AT ATT 87 00 

CCCCTACTTT TTGAGCAGGA CTACAGCGAT TGGTTTGCTG AGACTTGGTT GCTCTATGTG 37 60 

GACCGAGATG CCCAAGTGGA ACGCTTAATG AAAAGGGACC AGTTGTCCAA AGATGAAGCT 38 20 

GAGTCTCGTC TGGCAGCCCA GTGGCCTTTA GAAAAAAAGA AACATTTGGC CAGCCAGGTT 3880 

CTTGATAATA ATGGCAATCA GAACCAGCTT CTTAATCAAG TGCATATCCT TCTTGAGGGA 8940 

GGTAGGCAAG ATGACAGAGA TTAACTGGAA GGATAATCTG CGCATTGCCT GGTTTGGTAA 9000 

TTTTCTGACA GGAGCCAGTA TTTCTTTGGT TGTACCTTTT ATGCCCATCT TCGTGGAAAA 9060 

TCTAGCTCTA GGGAGTCACC AAGTCGCTTT TTATCCAGGC TTAGCAATTT CTCTCTCTCC 9120 

TATTTCCGCG GCGCTCTTTT CTCCTATTTG GGGTATTCTT GCTGACAAAT ACGGCCCAAA 9180 

ACCCATGATG ATTCGGGCAG GTCTTGCTAT CACTATCACT ATGGGAGGCT TGGCCTTTGT 9 24 0 

CCCAAATATC TATTGGTTAA TCTTTCTTCG TTTACTAAAC GGTGTATTTG CACGTTTTGT 9 3 00 

TCCTAATCCA ACGGCACTGA TAGCCAGTCA GGTTCCAAAG GAGAAATCAG GCTCTGCCTT 9 3 60 

AGGTACTTTG TCTACAGGCG TAGTTGCAGG TACTCTAACT GGTCCCTTTA TTGGTGGCTT 94 20 

7ATCGCAGAA TTATTTGGCA TTCGTACAGT TTTCTTACTG GTTGGTAGTT TTCTATTTTT 94 80 

AGCTGCTATT TTGACTATTT GCTTTATCAA GGAAGATTTT CAACCAGTAG CCAAGGAAAA 95 4 0 

GGCTATTCCA ACAAAGGAAT TATTTACCTC GGTTAAATAT CCCTATCTTT TGCTCAATCT 9600 

CTTTTTAACC AGTTTTGTCA TCCAATTTTC AGCTCAATCG ATTGGCCCTA TTTTGGCTCT 966C 

TTATGTACGC GACTTAGGGC AGACAGAGAA TCTTCTTTTT GTCTCTGGTT TGATTGTGTC 9720 

CAGTATGGGC TTTTCCAGCA TGATCAGTGC AGGAGTCATG CCCAAGCTAG GTGACAAGGT 9 780 
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GGGCAATCAT CGTCTCTTCG TTGTCGCCCA GTTTTATTCA GTCATCATCT ATCTCCTCTG 9840 

TGCCAATGCC TCTACCCCCC TTCAACTAGG ACTCTATCGT TTCCTCTTTG GATTGGGAAC 9900 

CGGTGCCTTG ATTCCCGGGG TTAATGCCCT ACTCAGCAAA ATGACTCCCA AAGCCGGCAT 9960 

TTCGAGGGTC TTTGCCTTCA ATCAGGTATT CTTTTATCTG CGAGGTGTTG TTGGTCCCAT 10020 

GGCAGGTTCT GCAGTAGCAG CTCAATTTCG CTACCATGCT GTCTTTTATG CGACAAGCCT 10080 

TTCTGTTCCC TT7AGTTGTC TCTTTAACCT GATTCAATTT CGAACATTAT TAAAAGTAAA 10140 

GGAAATCTAG TGCGAGTAAA AATCAATCTC AAATGCTCCT CTTGTGGCAG TATCAATTAC 10200 

CTAACCACTA AAAATTCAAA AACCCATCCA GACAgATTCA 102 40 
(2) INFORMATION FOR SEQ ID S r O: 33: 

(il SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(O) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



CGCTTTATCG 


TGGACGTGGT 


CAAGCCGAGA 


ATTTCATCAA 


GGAGATGAAG 


GAGCGATTTT 


60 


TTGGCGATAA 


AACGGATAGT 


TCAACCTTAA 


TCAAAAACGA 


AGTTCGTATG 


ATGATGAGCT 


120 


GTATCGCCTA 


CAATCTCTAT 


CTTTTT CTC A 


AACATCTAGC 


TGGAGGTGAC 


TTCCAAACTT 


130 


TAACAATCAA 


ACGCTTCCGC 


CATCTTTTTC 


TTCACCTGGT 


GGGAAAATGT 


GTTCGAACAG 


240 


GACGCAAGCA 


GCTCCTCAAA 


TTGTCTAGTC 


TCTATGCCTA 


TTCCGAATTG 


TTTTCAGCAC 


300 


TTTATTCTAC 


GATTAGAAAA 


GTCAACCTGA 


ATCTTCCTGT 


TCCTTATCAA 


CCACCTAGAA 


360 


GAAAAGCGTC 


GTTAATGATG 


CATTAAAGAA 


CAGTCGAGAT 


GAAAAAATCG 


TGTGACGCAC 


420 


CAAGGGAGGA 


GTCTGCCCTT 


TTGAGGAAAT 


CTAGCGAGGA 


AAAACGATAC 


TGGAACAGCA 


480 


GAAAGTAAAA 


CTGACCTCAT 


GAGGAGGAAG 


AAAGTGGCTC 


ATGAGGTCAG 


GGGTTTTG'l'A 


540 


ACTTACATCT 


AGTTCAGACA 


GGTATGAATG 


ATTTGGGATT 


AATCATTTCT 


TGTTTTAAAT 


600 


CACGAGAATA 


GTAACGATTT 


TTTCCTTTTT 


TGACGAACTC 


TATTCCGTAA 


CGATCAATCA 


660 


ATTTAATCAT 


GTACCTAATA 


TTAGAATTGT 


TTATCCCAAA 


TTTATTTGAA 


AGCTTCTCTA 


720 


AGCTATATCC 


TTGTTTTCTA 


AGTTCATAGA 


TCTGAACTTT 


ATCATCATAA 


GTTAGTTTCA 


780 


TAATAAAAAC 


ACCCCAAAAG 


TTAGATTTTT 


TCTGTCTAAC 


TTTTGGGGGG 


CAGTTCATTC 


340 


AACACCTGAT 


ACTATCCC7T 


TTTCTTATTT 


GAAATACTTT 


TTACTCAACC 


TCTTTATACT 


900 


CAATGAAAAT 


CAAAGTGCAA 


ACTAGAAAGC 


TAGCCTCAGG 


CTGCTCAAAA 


CAGTGTTTTG 


960 
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AGGTTCCAGA 


TGGAAGCTGA 


CGTGGTTTGA 


AGAGATTTTC 


GAAG AGTA * T 


ACTTAATCTT 


1C20 


CTTGATACTT 


TGACTAAGAA 


TAAATCCTAC 


AATCATCCCT 


ACCATATTTT 


Vjt-ATAAAAi 1 


1080 


CGGTAGAATT 


TCTGGGAGGG 


CTGCTGCCCA 


GCCATTCATC 


AAAGCAGAAC 


CCAAGGCGTA 


1 1 A 


GCCTCCTACC 


ATGGCAATAG 


TTCCTAAAAT 


AAGGCCTAAC 


CACTGACTTT 


TTCCTTTAAA 




TCCTGCGAAA 


AATCCCTGCA 


AGCCATGGTT 


GACCAAGCTA 


AAGAACATCC 


ft y^m^ ft ft 

ACTGAGCGTA 


1 ^ OU 


GCCTGATAAG 


AGGTCAATCA 


AGAAACTTGC 


TAGTCCTCCG 


ACTACCGCTC 


CTTCACGACT 


1320 


ACCAAAGTAA 


AAGGCCGCAA 


AGAAGACACC 


AGCATCTAAA 


AGAGTTAGAA 


TTCCTGTAGG 


I J OU 


TGTTGGGATT 


TTTAAGAAAT 


AACCTAGAAC 


CACAGAAAGG 


GCCGTTAATA 


O ft m ft ft ft 

GGGATACAAG 


1440 


GGCGATTTTA 


GTTGTTTTTG 


TTTGCTTCAT 


ATTGTCTTAC 


TCCATACTGA 


TCTGCTTGTG 


1 c rt f\ 
1300 


CAATAGCACG 


ATAAACGAAA 


GCCTTAGAGC 


TTTCTACTGC 


TGGCAAAAGT 


7TATCACCTT 




TAACCAGGTG 


ACTGGCAATG 


CTAGAGsCAA 


AGGTACAACs 


TGCACCAGCA 


TTTTGGCCTT 


1 £ T rt 
1 0 J U 


GGATAACTGG 


ATTTTCTAGG 


ATAGTAAAGG 


TCTCTCCATC 


ATAAAAGACA 


1*^^ ft ^ ft ^F+^^fr 

TCCACAGCC T 


: £ fl f\ 


TGTCCTGACT 


AAGACGATTG 


CCTCCCTTGA 


TAATGACTCt 


GGCGCTCCTA 


AATCATGCAA 


1 / 4 vJ 


TTTCTGCGCT 


GCAGTTTTCA 


TCTCTTCCAA 


GGTTTTAATT 


TCCTGACCGG 


ft Jft ft ft ^ ft ft > fMf ^ 

ATAATAATTC 


1 QrtA 

ltJUU 


TGCTTCTGGG 


AGATTAGGCG 


TAATCACACT 


GACATAAGGG 


AAAAAGCGAA 


TCAACTC irG 


1 O OU 


GCAGAGCTCA 


CTGACAGCTA 


CATCATGCGT 


TTCCTTGCAG 


ACCAAGACAG 


CATCCAACAC 




CACACGTACT 


CCTCGGCGTT 


GTTTCATAAA 


GTCCAAGGCC 


TTCTCAGCCA 


ft /"^ ft / * *^ 

CCCTGACAGT 


1 a an 


ACGGAGAAGA 


CCAATCTTAA 


TTCCCCCAAA 


TTCCACA7CA 


CGCAAGCTAT 


/ ft ft r|Mf « ^ fi 

CT AA r "CAT G 


"J O 4 A 
i U 1 U 


TTGAAAAATG 


GTATCATCAG 


TTGGAAAGAC 


TTCAAATCCT 


TTTTCTGTCA 


AGGCTG7 CAA 




\7.\ag7cact 


GCTACAAACC 


CATGCAACCC 


GTTCAACG7\ 


TAG 37." T- _ ^ A 


AA 7 CAG CTG A 




CAGTCCACCA 


CCACTAAAAA 


TATCATTTCC 


AGAAAGTGCT 


AAAATACGAT 


TA*TCTTCAT 




AACGAATCTC 


CTTTAAATAC 


AAACCATTTG 


GTGCTGCAGT 


GGGACCTGCA 


AG7TGCCTGT 


2280 


CCTTCTTCTC 


CAAGATCAGA 


TCAATCTGCT 


CTACTGGCAT 


GCGGTTGTTA 


CCGATTTTGA 


2340 


GAAGAGTCCC 


CACCATATTG 


CGAATCTGTT 


TATACAAGAA 


ACCATTTCCT 


GAAAAGGTAA 


2400 


AGGTCAAAAA 


TTGTCCTGTC 


TCATCGACTA 


TTAAACTAGC 


7TCTGTGATG 


GTGCGAACCT 


2460 


TATCCTCTAC 


ACTAGTCCCA 


GAGGCTGTAA 


AACCGGTAAA 


ATCATGGGTT 


CCCTCTAGCT 


2520 


TTTTGATTGC 


AA7CTCCATT 


CGTTCCACAT 


CGAGTGGGTA 


GGGAAAGTGG 


GTGGCATAGT 


2580 


GACGGCGCAT 


CCCATTTTtG 


GGACGTCCTC 


TATCCACAG7 


AAACTCATAG 


GTCTTGCTAT 


2640 


GCTTGGCATA 


ACGGCAATGA 


AAATCATCTG 


CCACAACC77 


AA7CGAAA7C 


ACATCAATAT 


2700 
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CTTCAGGAGA CTGGGTATCC AAGGCAAAAC GGAGTTTCTC CTCATCCATC TGATAAGGCA 27 60 

GGTCAAAATG AATCACCTCT CCCAGGGCAT GAACCCCACT ATCTGTCCTA CCAGCACCGT 2820 

GAACAGTAAT GGCTTGCCCT TTATTTAATC TGGTCAAGCT TTTTTCAATT TCTTCCTGAA 2 880 

CGCTACGCGC ATGAGGCTGG CGCTGAAAGC CAGCAAAGGC ATAACCATCA TAGGAAATAG 2940 

TTGCTTTATA TCTCGTCATA GCCTCTATTT TATCAAGAAA TTACTCTGTA AACAAGGACC 3000 

TAAAACAAAT ATTGTATGGG TATAAAAATC TCATACTCTT CGAAAATCTC TTCAAACCAC 3060 

GTCAGTTTCC ATCTGCAACC TCAACACACT ATTTTGAGCA ACCTGCGGCT AGCTTTCTAT 3120 

AGTAGATTGA AATAAGATAT GAACAACTCT ATTAGGAAAG TCAAATTAAT TTCTAGAAAT 3180 

ATTTTAGCAG CTACACCGTA CTATTCCAAA CTCAATCAAC TATAGTTTGC TCTTTGATTT 32 40 

TCATTGAGTA TCAAAAGAAA AACTTAGGAA TCAATCCTAA GCTCTCTTCT GAAGTAGGTA 3 300 

CATGACAAAG ATAGAGATTA CAATCAACCA ACCTCCTAAC ATACTAAAGA CCAACATCCC 3360 

ATTGTGAGTT AGTAAGCCAA TTGCACCTAG AACGAATGGG GTCGTAAAGG CTCCGAAACT 3 4 20 

ACAGCCTAAT ACAGCAAATG AAGTTGCTTG ATTGAGGAGT TTAGCTCGAA TTCGTTCAGA 3480 

GACAAGTTCA AAGACCGTCG TCAAGACTAC ACTATAGCCA AATCCAGCCA GAACACTTCC 3540 

TGCTACTACC ACCCACAAGG ATGAAGACAA GGCAATCACG ATTTGCCCCA AGCCAAAGGT 3600 

AATACCAGAC CAGAGGAGCA GTTTCTCTTT AAAGATACAA ATCAACAAAG AAAAACTCAC 3660 

CCCAGCCACA ATCCCGATCA ACTGCATGAT ACTAAGAACA AAACT AG ATA ACTGGGCATC 3720 

CCCCAATCCT CTTTCCACCA TCAAACTTGG AATACGGATG GTAATAGCTG TATTGGTACA 37 80 

AACTACAACT GCCGCTTCGA TACCTAAGGT AAAAATCAAG CCTTTCA7TT CTCCACTTAA 3 840 

ACGACTTGCT TCCTTCCCTC TTTTCTTGAC TTCTTTCTTT GATTTTCCAT AAGGGACAAA 3 9 CO 

OAGCAGATAA AGGGGCAGCA CCAAAAATCC AGCACTATAG CCTAGAAACA TACCTGTCCA 39 60 

ACCAAAGGCC AACAACTGAC CGACGGCCAA GGTAATGAGA GAAGCTCCAA CGACCTCTGC 402 0 

AGAAGCGCGT AGCCCTAACA TCTGAATTCG CCTTTTTCCT TGGTAGCGTT CACTGATAAT 4080 

AGAAATGGCC TTGCCATTGA TCATCCCAAG ACCCAAACCA AAGAGAAGCC GTGTTCCAAA 4140 

GACAAAGGGA TAGCCTTGGT ACCAGAAGGG AGCTGTACCG CTCAATGATA AAATCAGCAA 4200 

GCCCAAACTA ATCTGTAAGC GCTCAGGAAA TATTTTTTCT AAG AAACC AT TTAGCAGTAA 42 60 

CAT CATC ATG ATTCCAAAGG AAGGCAAGCT CACCAAGACC TCAATTTGTT CCTTAGAATA 4 3 20 

ACCCTGATAA TAGTCAAACA TGGCTGGTAG GGCACTCGAA ATGGAAAAGG ACGTAATCAA 4 3 80 

AACGAGGGAG AGAGCCAAAA TGCTGGCCCG TTCTAAAAAT TGTTTCATGA AATCTCTTTC 44 4 0 

TATATTTCTC TTAATCTTCT ACTTTTTTGA TAGTTATCAA ATAAGCAAGA AAAGAAGAAG 4500 
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CCTCATTCGT 


TTGTAGACTC 


CTTCTTAAAT 


' TCGAAAATGA 
i wyAAAn i LA 


A 1 LLw .ILIA 


l CTTAT ACTC 


4 5 60 


AATGAAAATC 


AAAGAGCAAA 


CTAGGAAGCT 


AGCCG C A GGT 


lull LAAAAL 


AGTlTTTTGA 


3 62 2 


GGTTGCAGAT 


GGAAACTGAC 


GTGGTTTGAA 
vj t vj'Vj ill lma 


GAGATTTTGG 
LMLrt 1111 


SAOi f~"T i TT A 
AAVjAt* 1 Ai r A 


GGATGACTTT 


4630 


CTCTTGATTT 


GCTTRATAAA 


GTAGAAAATA 
L t nun/v\n 1 A 


AA IvLlA 


lla rA rALLL 


AACAAAGATA 


4740 


ATCAGACACC 


ACTTAAACAC 


AAG ATTGG A A 
ftftUn 1 i L L/\/^ 


L \_ i lul 1 S.A 


A T^T*^ A A A A A 

*-A I I laAAAA 


GAAGTAAGGG 


4800 


AAAGGATTAT 


rrrGGr att 


tggaatat"»g 


r\L 11*1 ALAA 


L L AAt»L L ATT 


AAAAAGAGCA 


4 8 60 


AACATCATAT 


AGAGAAAGGG 
rtL rtL rSJyJy LLL 


1 AAnA L LL 1 L 


LALALTllTG 


GATCCCAAAT 


CTTGTATTGA 


4920 


CCCTGTTTGT 


CAAAAAACAG 


ggtatggggt 


n/W\nL L A 


I lLl* AAl vjA i 


ATAGTGGCAA 


4980 


AGGAAATTTT 


CTAGGGTA^A 


GAAATTACTG 


GPAATGGGGG 


rpft A/* A*™^* A A 

L L AAL All AA 


ATGGTAAATC 


5040 


ACACAGGTAA 


TCATGATAG* 7 * 


GATGGTGAGG 


^- w LI 1 HA 


All (jl AAl AG 


ACTTGGCCTT 


5 100 


TGCCAATTTT 


CACCTACACG 


GCTCA^AAGG 


TTTAGAAGAT 


a a irrrTs a a 
AAALLL I AAA 


AAT AG T T AC C 


5 160 


AAGAGGTTGG 




GTAATAGAGA 


AGPATGGPA A 


AAllAl^A I L 


L l i AGTAATT 


5220 


TCAAGAT AAA 


CTCCCGT AAA 


AGGCGGTAGA 


AAlw rtrtO rtALl rt 


I ALlLl . A * A 


AAATACAAGT 


5230 


TTATAGTGTT 


TTGAGATGG'?* 


TAAATfTTrf 


TPlPia a fTf 


1 LAI I T AAl T 


TT C ATGGC AC 


5 340 


CAAAACCATC 


AATGT T AC AG 


TGGATATTGT 
i lls\ i A * lui 


p.nTrrrrTTP 


lALLATLLLL 


ATATTTTTCA 


5400 


cccccctccc 

W W W V« \J 1 


T^rT^TP AAA 


ILL 1 i iuLLL 


lall rn tau 


TTTCAAGTCC 


TTGATGAGAG 


5460 


TTACTGTATC 




A A^^*^ AH*^^nv * 


ppTTrpr a T*r* 

v-L» 1 1 libLft i t. 


latalllAla 


AGACCTTCTT 


5520 


CTACTTCTGC 


AACTTGAGGA 


GGAfTGGAGT 

Ouni I 1— L/\L L 


i L»AVjL ALA 


L 4 L I LLLL AA 


AC*- AG i AGGvj 


5580 




GT AG AC AT AG 


1 L i vjrYL l l At. 


nil i . UynLn 


K^'^r^f^mA A A 

A i rTLGTAAA 


TTGTTCATGG 


5640 




TCATCATTCA 


CT %TTCTTT j 




• • . v» . - ..A.-.C 


AllAAlTATT 


5 ** 0 C 


ATACCCTAAA 


ATGAGGATT*" 


TG AG AAATT** 1 


AG AAA AAA AG 


praTiTrs A"» 
LLA * A I LAA i 


L i A 4 LLLL I T 


57 60 


TTCTACATTT 


ACATTCTTTT 


TTCAGCTTCT 


GCTTTGATTT 


TTTCAACTAC 


TTCTTCAATG 


5320 


TTCAAACCAG 


TTGTATCAAG 


GTAGACAGCA 


TCCTCTGCTT 


GTTTGACAGG 


AGAAGTCTCA 


5880 


CGATGACTAT 


CCTTGTACTC 


ACGCGCAGCA 


ATTTCCTTTT 


TTAGGCTTTC 


AAGGTCTGTT 


5940 


TCAATTCCCT 


TGGCAATATT 


TTCCTTGTAA 


CGACGCTCTG 


CTCTCTCATC 


AACAGAAGCT 


6C00 


ACTAGGAAAA 


TTTTCAATTC 


TGCTTGTGGC 


AATACAACAG 


TTCCAATATC 


GCGACCATCC 


6C60 


ATCACAATCC 


CCCCTTGCTG 


GGCAATTTCT 


TGTTGGAGAG 


AAACCACTTT 


CTCACGCACT 


6120 


TCACGAATTG 


CTGCAATAGC 


AGAAACATGA 


TTGGTCACTT 


CATTTTCACG 


GATAGGATGG 


6130 


GTAA7ATCCA 


CATCTCCTAC 


AAAAACAAGC 


TGGTC7CCAG 


TTTCTGAACG 


TCCAAAGCTG 


6240 
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ATTGGATCCT GGTCCAACAA GGCTAGAAGG CCTTCGAC7T CTTCAACTCC TAATTGGTTC 6 300 

TTAAGAGCCA TATAGCTCCC TGCACGATAC ATAGCTCCTG TATCAAGCTA GGTGAATCCA 6 3 60 

AAATCCTTAG CAATAATC7T TGCGACCGTA CTCTTACCCC TGGAAGCAGG ACCATCAATA 6-1 20 

GCAATTTGAA TTGTTTTCAT ATCGGCTCCT ATTTTATTTT TATAACATCA CCTGGATTAG 64 30 

CAAACCAAGA TCCTGTAGCC ATCTGCCCAG GATTCAACCC CTCTAACTCA GCAATGGAGA 6540 

TTCCTCCACC AGCGGCAATA GCTGCTTCCC CTTCTCCTGC GAGAACTTTA ATCGTTCCTT 6600 

CAGGATTAGC AGCTTCTTCT GAACTACTAG AAGTAGATTC TGGCTCTGAA CTCTGCTCAG 6660 

GCTGAGAACT ACTTGAAGAT GAGATTTGTA CTACACTGGC ATCAGAATCA TGAAAGCCTT 67 20 

TTAACGCTGC TGTCCGATTA CTCCCCCCCG ATGATAGATA GATGAGAACC ATGACCATCA 6790 

CCACCACAAT TACAAAGAAA ATACTAGCTA GGATCGTCAA AATACGATTA GCCATCCTAT 6840 

CACCCCCTCC GTCCTTTCGA TGCCGACGCT CTCCTCTTGA TTCTTCTTGA T CAT AG AT AT 6900 

CTTCTTGCCA CGCTTCTTTT GCCATACCTT ACTCCTTGTT TTTTTTTACT TTTCTTATTA 6960 

CAATATAAAT ATGAACATGA AAATCACACT TATACCTGAA CGATGTATCG CCTGTGGGCT 702 0 

TTGCCAAACT TATTCTGATT TATTTGATTA CCACGATAAT GG AATCCTGC CTTTTTACCA 7080 

TGACCCTGAC CAACTGCAAA AAGAAATTTC TCCTAGTCAG CATATCTTAG AGGCTGTTAA 7L4 0 

AAATTGCCCA ACTCGCGCCC TGATTGGAAA CCAGGAAGCC TAAATCAATG GCGATAATCC 7200 

ACTCCCTCTA GTTTAGCACA TTTCCATGTA AAATTATAGT CTTTTCACTT TATTTTTTTC 7 260 

TGTAAAATCA CGAAGGTCAC TTTTTTCTTT GATAAGATAA AGTGCTCTTT TTTTAGTCTC 73 2 3 

7 AAA? AAA TC TTACTGATAT ACTTGCCGAG AATCCCAATG GTCAAGAGTT GAATGCCTCC 7 3 80 

AAGAAAGAGA ATAACAGCCA TCAGAGAGGT CCAACCAGAT GTCGCATTGC CCAAAATGAG 7 4 40 

GGTCCCAACC ACAACAAAAA AGGTCATCAG CAGAGAAAGA AAACAAGATA GGAGACCAGC 7 500 

TACAAAGGCT ATAATCAAGG GAAAATCTGA AAAATTAATA ATCCCTTCAA TGGAGTAGAA 7 560 

AAAGAGTTGC CTAAAACTCC AACTTGTCTT GCCAGCCTGC CTTTCGACAT TTCGATAGTC 7 62 0 

CAAATAGTAG GTTTTGAAAC CCACCCAGGC GAAGAGCCCC TTTGAAAAAC GATTGCACTC 7 680 

GGTCAAGCTT AAAATGGCAT CGACTACAGA CCTTCTCATC ATACGAAAAT CACGGACACC 77 40 

CGACGGCAGA GCTACTCGGC TGATTTTTTG CATGAGGCGA TAAAAGAGAA CAGCACAGAA 7 800 

ACTGCGAAAG AAGGGTTCTC CCTCCCGACT AGTTCTCCGT GTCCCAACGC AGTCCAAGTC 7 860 

TACAT7TTTG TCTAATACAT TTT7CATCTC AAACAACATA CTAGGAGGAT CTTGGAGGTC 792 0 

TGCATCCATC ACCACCACCA AATCTCCTGT CGCATATTGC AAGCCTGCAT AAACCGCTCC 7 980 

TTCTTTGCCA AAATTTCCAC AGAAAGAAAT ATAATGCACT GCCGGATTTT GCTCCCGATA 804 0 
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GGCCTTTAAG 
TTCTGTTTCC 
CTCTTCGTTT 
GATGCTTGCT 
CCCAACCGAG 
GGCGACGTTC 
TAATTTCAAC 
CTTTTTGGAT 
AGTCACGAAC 
CTAATTTTCC 
GATTTTCTCC 
CAACATTCCA 
TAGTACGACC 
GCGGATCCCC 
CCATGGCTTT 
TACGTCTTGA 
CCTGCTTGAA 
TCTCAACTCC 
TATCCACCAC 
TAAAACC jOC 
ATTATTTTTT 
TGATAAAATT 
TCGCCTTGCC 
GTTTTGAGCT 
CGGCTAGTTT 
TGACATAGTT 
GTTGGGCTAG 
TGGCTGCATA 
TGTAGGCACG 



AGTTCCAAGG TCCCATCACT 
AAATCTGGAA CTAAAGCTTC 
AAACAAGGGA CGATGATTCA 
TTGCCAACGC CATGCGTCTT 
TTCTCCTTTA GCTTTTGCCG 
TACGATGCGG TAAGGAATAG 
AACTGAGTAA CCTTTACCAG 
TTTTTTCAAA GCTGCAACGT 
ACCTGTTCCA TCTTCCGTAT 
AACGGCTACT TGAGTCACAT 
CAAATCACCA CTCTCATGGG 
TTCTGAGTCT GCTTTGTAAA 
GTATGGGTTG GTCACTGAAA 
GTAAACTCTC GCAGAAGAAC 
CAAAAGGCTG ACACTTCCAG 
TTCGCCAACA GCCTTCAAAC 
AATATCTCTG AGGGTATCTC 
TCTGATTCCT TCAACAACTT 
AACAACTTGA TGACCTGCTT 
ACCACCAGTT ACCAAAATCT 
CTTATTTTAC CATTTTTGAC 
TCAGTAAAAT GCTTATACTC 
ATGGGTATGG TTACTGACTT 
GACTTCGTCA GTTCTATCCA 
CCTAGTTTGT TCTTTGATTT 
TTCAATTGGG TAATTTAGAG 
ATGGTAACCA ATGATAGGAC 
GACACCAGTT AAGTCAGGCA 
GATTCCAACA CGCTCAGATT 
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TGA7CCATCA TCGACAAAGA 
CAGAGCCTGA TAAAAAAGAG 
AATCATCATC TTAGTCTTCA 
CACACATTTG GGTGATGTCG 
GGTCTGAGTA GCAGCCAGCC 
GACGGCCCAC CGCTTTTTCC 
TTCCAAGGTT ATAAACGTTT 
GACCCTTAGC CAAA7CGACA 
CGTAATCGTC TCCAAACACT 
ATCGCAAGAG ATTGTTTGGA 
CTCCGATTGG GTTAAAGTAA 
TATCAGTCAA AATTTCCTCT 
GTGGGAAATC TTCCAAGATG 
TGAAGATGAT GTTTTTACAG 
CGATATTGTT GTCATAGTAG 
CAGCAAAGTG AATGACACCA 
TGTCACGAAT ATCTGCCTCA 
CTAAACTCTT ACGATTGCTA 
GGATCAATTC AATAACAGTG 
TTTCTTGCAT - T T7TTT C CT 
AGGGAATGTC ATTTGCCATC 
TTCGAAAATC CAATTCAAAC 
CGTCAGTTCT ATCCACAACC 
CAACCTCAAA GCAGTGCTTT 
TTATTCAGTA TTATTCCCTT 
CGTCCAACC7 CAACTCCTTC 
CAGTTGTGAG GCCTGATGAA 
CCTGCCCAAA GAAAGGAGAG 
TTGAAGTAGC TTCAGCCAAA 



CATACTCGAT 8100 

GAAGTACTTC 8 1 SO 

AATCCATTTG 9 2 20 

AGTTCTGCTT 8230 

ATATCACCTC 8 3 40 

ATCTTTTGCA 84 00 

AGTCCTGAAC 84 60 

ACGTGGATAT 8 520 

TGCACTTGCT 8580 

ATACCGTTTC 864 0 

CGAAGCAAGA 8700 

AGCATGAGCT 87 60 

GGCACTGTGT 3 820 

TTGTTTTCTT 88 80 

GCAAGAGGGA 894 0 

GTCGGTTCTT 9000 

TAGAAAGGAA 9060 

TTCACAAGAT 9120 

TGGGTTCCAA 9180 

CTAAACTACC 9 300 

TACGTCAACG 9 360 

TCAAAACAGT 94 20 

GAGTAACCCG 9480 

TTTACTCGTT 9 540 

TCTTGGATCA 9600 

CCTAGTCCAC 9660 

AAATCACTGG 9720 

ATCAGATAGT 97 30 
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GAGTCAAGGT 


GGCCTCCTCC 


AT77GT7GCA 


GCAACGTTTC 


ATCTACCGTC 


AAATCAAATC 


9840 


CCATGTCATT 


TTCGTGGGTA 


GCGCCTAAGG 


ATAATTTCCC 


ACCTGCAAAG 


GGAATCAAAT 


99C0 


CCCACTCCCC 


T7CTGGCATC 


ACAACAGGGT 


AATCTTCCAT 


GTCTTGGGCA 


ACCTGATAAT 


9960 


CTCCTAGT7G 


TCCTTTTTGA 


GGACGGACAT 


CCACTTCATA 


ACCTAAAGGC 


TCTAACATGT 


10020 


CCCCCAACCA 


AGCTCCCGTC 


GCCAAAATAA 


CCTGCTCAAA 


CTCCTCTTCA 


CCAATCTGGT 


10080 


AGCCTGATGC 


TAACGGTGTC 


AGAGTCACTT 


TT7C7TTCAC 


CAGCTTGACA 


TGACTCACTT 


1014C 


CCAGCAAACG 


AGTCACTAAA 


AGTTGCCCAT 


CTACTCTCGC 


TCCACCAGAA 


GCATAGAGCA 


10200 


GGCGGTCAAA 


7CCCTGCAAA 


CCAGGGAATA 


ATTCATTAGC 


TGAGGCTTGG 


T7CAGAATCG 


10260 


CTAATTGCCC 


TATCAAGGGA 


GATTCTTCTC 


TGCCCTGGAG 


GGCCAGTTGA 


TAAAGTTCTT 


10320 


CCAAATTGGA 


TTCATCCTTT 


TTCAAGAGAA 


AGACTCCCGA 


ACGCTGGTAA 


AAGTCGATTT 


10380 


CTTGTCCTGA 


TTTCTCTAAA 


TCAGCTAATA 


AATCCACATA 


AAAA7CAGCC 


CCCAAGCGCG 


10440 


CCATCTTGTA 


CCAGGCTTTA 


TTACGGCCTT 


TGGAAAACCA 


AGGACTGATA 


ATTCCTGCTG 


10500 


CGCCCTTGCT 


GCCTTGACCT 


7CCTCATGC7 


CAAAAACGGT 


CACCTCTAGG 


TCACTTTCTC 


10560 


TCGACAGGTA 


GTAGGCAGCT 


GTTGCTCCCA 


CAATTCCTGC 


TCCAATAATG 


GCAACTTTTT 


10620 


TCATTCTCTT 


CACTTTCTAA 


CTAGATATGA 


TGGAAAGGAT 


TGGTTGATGC 


CTGACTAGGC 


10680 


AAGATATCAA 


TAGACCACCC 


CTTATCTTCC 


TTCCATTCAC 


TAAGAACTGC 


TGCGATTTTT 


10740 


7CTACAAAAA 


TCACTTCGAT 


A7ACTCACCT 


GCG7CCAATC 


CAAGCAACCC 


ATCAGATAGC 


10800 


ATATCCTGAG 


CAGTATGGTA 


GTAGATATCA 


CCAGTGATAT 


AGACATCTGC 


CCCCTTTGCC 


10860 


AAAGCATCCT 


7A7AGAAAGA 


CTGCCCGCTT 


CCACCACAAA 


7TGCTACTCT 


TGAAATAGGC 


10920 


T7CTCCAAAT 


CATCCTCTTG 


ATAATGCACC 


ATTCGAAGCC 


TATCTAGGTC 


AAAGACTTGC 


10980 


TTGACCTGTT 


GGGCCAATTC 


CCAAAATG7C 


7GAGGCTGAA 


TA7TCCCAA7 


ACGTCCAATT 


11040 


CCACGTTCTG 


GACCTG7T7C 


CTGCAGATAA 


GTCGTCTCCT 


CGATTCCTAG 


CATCTGACAA 


11100 


AACCAGTCAT 


TGAGCCCATT 


TTCAACGATA 


TCAATATTGG 


TATGGCTGAC 


A7AAACTGCC 


11160 


ATATCATGCT 


TAATCAGGTC 


GATGTAAATC 


TGATTTTGCG 


GACGGCTGGC 


AACCAACTCC 


11220 


TTGATAGGAC 


GAAAGATACG 


CGCGTGCTTG 


ACGATAATCA 


AGTCCACACC 


CTTTTCAATG 


11280 


GCCTCTGCCA 


CTGTCTCTTC 


ACGAATATCG 


AGGGCAACCA 


TGACCCTTTG 


GATACCCTTG 


11340 


7C7AAAG7GC 


CAATTTGCAG 


ACCACGCC7C 


TCTCCCTCCA 


TAGAAAATTC 


CTGAGGGCAA 


11400 


AAGGCTTCAT 


AAGC77CCA7 


CACTTCACTT 


GCTAACATGG 


AGCACCTCCT 


7GA7AGCTTG 


11460 


AATCTTATCT 


ACTAGAACTT 


GACGTTCTTC 


CAGATTTTT7 


TC7GGGATTT 


G7CCGAGGCC 


11520 


GAACTCTAGC 


TTCTCAGCTT 


CTTTTTGCCA 


TTTTTGGACA 


AATACTGGAC 


TGAC7TCTTT 


11530 
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GGALAAGAAG 


rmrn a an;'* 




GGCTGATAGC 


TTCATTTGTC 


CTGCTTCCAC 


1164C 


CACs-AAAA I *- 


I V- A 1 AAAftl- * 




TTCTAAGATG 


C7TTCTCCTA 


CAATCTGGAA 


11700 


TCCATGATCC 


TG TAG C GAGA 


Tarr.PA aptp 


\J * \^ 1 4 \* « > *" 


TTATTGGGCT 


GGAGGATCAA 


11760 


ACGCTCTACA 


TTAGCTAACT 


TPPPPa & Af*f* 


TTCTTr'TAAA 


ATCCTAGCAA 


TCAAACGACC 


11820 


ACCCATGCCA 


GCAATGGTAA 




* i ^) w 1 ^ i» W * W 


TCTTCAAAAG 


CTGCCAAGCC 


11880 


ATTGGCTAAA 


CGGACTTGGA 


r i i iv.4 v.*. l i 




GCCTCAACAT 


TTTTAACCGC 


11940 


AGACTGATAG 


GGACCTTCCA 




THfAATAGCG 


CTTTTGATTT 


GGCCTCTCTC 


12000 


AACCAACTCG 


ATAGGCAGA I 


a anc ATYtnTf' 


ACTTCCCACA 


TCTAGTAAAA 


TAGCCCCCTG 


12060 


TGACACAAAG 


GAAGCTACCA 


1L1 M i \- 1 


(TT' T, G AAA^C 

L 1 1 « OAA/A . V_ 


ATCTTCTCTC 


AC TTT CC AAA 


12120 


ACTCTATTAC 


CTCTTATTA I 




AATC^TCAAC 


T7CCCAGTAA 


TATAAGCACC 


12180 


TCTGGCGAAA 


GAAGTi IGAA 


Tf"'TPf" T, AAAG 


TAATAAGTGA 


ATCCAATTGA 


AAGATTTTAA 


12240 


ACAATTTGCA 


AAAATGTCAA 


a A A&TAAAAA 


ATAAACAGTT 


TATTCAGAAA 


ATTCTTCACA 


12300 


TATAAAAACA 


CATGGTAGAA 


I A I AA 1 i Aun 


AAnTTACIAAA 


AAATAAAAGT 


TTGACTAAAA 


12360 


TTTGTATTTG 


AAGGTGGTGT 




AA TTT AGT C A 


GACGAACCAC 


GAATTTGCTC 


12420 


TATGCTTTCT 


GGAATTTATC 


at a & r* Annan 


flATACACTCA 


TGGAACAAAC 


ATTGTTTGAA 


12480 


TTAGAACTAC 


TTCCAGAGGA 


AGATATuAT. i. 




TrrCTAAGTA 


TTGTTCTTTT 


12540 


ACTTGTTTAA 


TTACAGGTCG 


t_ * AG I Lnlftt 


TTT AT AT AAA 


ATAAGTAGCT 


TTACTTACGC 


12600 


AATACCCTAG 


TGCTGTG - w . 




TTAATAATTA 


GGAGTTTGTT 


ATGGATTTAT 


12660 


TAGAGAAAGA 


ATGTTTAAAA 


tv-tp a t a a a a 


ATTTrCAACA 


GGGTGATATT 


TGGAATTACT 


12720 


ATTATTTATC 


AGATAAGATG 


CCTGCAGAAG 


nnTfinAAAAT 


ACACATAAGC 


TCCCAAATAA 


12780 


AAGACCCTGT 


AAATATTTTT 


AAGATTGTGT 


ATAAACTATC 


CCAACTAAAT 


AATTGT. AGC - 


12 84 0 


TTAAAGTTGT 


TAAAAATTTA 


GAGGAATTAA 


AAAAAATTAA 


TTCCCCTAGG 


GAAATGAGCC 


12900 


CTACTGCTAA 


CAAATTTATA 


ACTCTATATC 


CTAAGTCAGA 


ATCTGAAGCT 


AAGAGTATGA 


1296C 


TTTGTAATCT 


TACGAATAGA 


CTGTCAGAAT 


TTAAGGCTCC 


AAAAATACTA 


TCTGACTATC 


13020 


AATGTGGAAT 


GCATTCTCCA 


GTTCATTATA 


GATATGGGGC 


TTTTTTAAAA 


AAACAAGCTT 


13080 


ATGATGAAAA 


AAATAAAAAA 


GTCATCTATT 


TATTGCTAGA 


TGAAAAAAGG 


AAGAACTATG 


13140 


TAGAAGATAA 


GAGACAAAAT 


TTCCCTAGTC 


TTCCTAGCTG 


GAAAATGGAT 


TTATTTTCAG 


13200 


AAGAAG 












13206 



(2) INFORMATION FOR SEQ ID NO: 34: 
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(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13104 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


34: 






CCGGATCCAG 


CGAAAAATAT 


GCTCTTTGAT 


GCTGTAACTG 


GTCAAAAAGA 


TGCTAAAACA 


60 


GCTGCTAACG 


ATGCTGTAAC 


ATTGATCAAA 


GAAACAATCA 


AACAAAAA7T 


TGGTGAATAA 


120 


AAAATTTGTT 


CAAGGGGCGT 


GGAAATCAAA 


TCCCCCTTTG 


AATTTATCAA 


TAGAGACACA 


iao 


AATAATTTAC 


CTT7CTTATA 


AAAAAGTAGT 


ATCCTATGAA 


AGGAGTTAAT 


ATGGAAAAGC 


240 


AACAACCTAG 


TAAAGCAGCC 


CTGCTGTCTA 


TCATTCCTGG 


GTTAGGACAG 


ATTTACAATA 


300 


AACAAAAAGC 


CAAAGGTTTT 


ATCTTCCTTG 


GTGTAACCAT 


CGTATTTCTC 


CTTTACTTCC 


360 


TAGCACTTGC 


AACCCCTGAA 


TTGAGCAACC 


TCATCACTCT 


TGGTGACAAA 


CCAGGTCGTG 


420 


ATAATTCCCT 


CTTTATCCTG 


ATTCGTGGTC 


CCTTCCATCT 


AATCTTTGTA 


ATCGTTTATG 


430 


TACTCTTTTA 


TTTCTCAAAT 


ATCAAAGATC 


CACATACGAT 


TGCAAAACGC 


ATTAACAATC 


540 


GAATTCCAGT 


TCCACGCACA 


CTCAAAGACA 


TGATCAAAGG 


GATTTATGAA 


AATGGCTTCC 


600 


CTTACCTCTT 


GATCATTCCA 


TCTTATGTTG 


CCATGACCTT 


CGCGATTATC 


TTCCCAGTTA 


660 


TCGTAACCTT 


GATGATCGCC 


TTTACCAACT 


ACGACTTCCA 


ACACTTGCCA 


CCAAACAAG7 


720 


TGTTGGACTG 


GGTTGGTTTG 


ACCAACTTTA 


CAAACATTTG 


GAGC7TGAGT 


ACC7TCCGTT 


730 


CTGCCTTTGG 




7CTTGGACTA 


TCATTTGCCC 


TTTGGCAGCT 


TCTACTTTAC 


34C 


AAATCGTAAT 


TGCTATCTTC 


ACAGCTATCA 


TTGCCAACCA 


ACCATTTA7C 


AAAGGAAAAC 


900 


GTATCTTTGG 


TGTTATTTTC 


CTTCTTCCTT 


GGCCTGTCCC 


AGCCT7CATC 


ACTATCTTGA 


960 


CATTCTCAAA 


CATC7T7AAC 


GATAGTGTCG 


CTGCTATCAA 


CACTCAAG7A 


TTGCCAATCT 


1020 


TCGCTAAAT7 


CCTTCCTTTC 


CTTGATGGAG 


CTCTTATTCC 


TTGCAAAACA 


GACCCAACTT 


1080 


GGACTAAGAT 


TGCCTTGATT 


ATGATGCAAG 


GTTGCCTCCG 


ATTCCCATAC 


ATCTACGTTC 


1140 


TGACCTTGGG 


TATCTTCCAA 


TCTATTCCTA 


ACGACCTTTA 


CGAAGCACCT 


TATATTGACG 


1200 


GTGCCAACGC 


TTGGCAAAAA 


TTCCGCAACA 


TCACTTTCCC 


AATGATTTTG 


GCTGTTGCGG 


1260 


CACCTACTTT 


GATTAGCCAA 


TACACCTTCA 


ACTTTAACAA 


CT7CTCTATC 


A7GTACCTCT 


1320 


TCAATGC7GG 


AGCACCTCGT 


AGTGTCGGAG 


G7GGAGC7GG 


TTCAACCGAT 


A7CTTGATCT 


1390 


CATGGATCTA 


CCGTTTGACA 


ACAGGTACAT 


CTCC7CAATA 


CTCAATGCCG 


CCACCTGTTA 


1440 


CCT7GATTAT 


CTCTATCATT 


GTCATCTCAA 


TCTCTATCA7 


CGCATTCAAG 


AAACTACACG 


■500 
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CATT7GATAT GGAGGACGTC TAAGATGAAT AACTCAATTA AACTCAAACG TAGACTGACT 1560 

CAAAGCCTTA CTTACCTTTA CCTGATTGG? CTATCAATTG TAATTATCTA TCCACTGTTG 1620 

ATT AC C ATT A TGTCAGCCTT TAAAGCAGGT AACGTCTCAG CCTTTAAACT AGATACTAAT 1680 

ATCGACCTCA ATTTTGATAA CTTTAAAGGC CTCTTCACTG AAACCTTGTA CGGTACTTGG 1740 

7ACCTCAACA CTTTGATTAT CGCCTTAATT ACCATGCCTG TTCAAACAAG TATCATCGTA 1800 

CTTCCTGGTT ATGCTTACAG CCGTTACAAC TTCTTGGCTC GTAAACAAAG TTTGGTCTTC 1860 

TTCTTGATCA TCCAAATGGT GCCAACTATG GCCGCTTTGA CACCCTTCTT CCTTATGGCC 1920 

CTTATGTTGA ACGCCCTTAA CCACAACTGG TTCCTCATCT TCCTCTACGT TGGTCGTGGT 1980 

ATCCCGATGA ATGCTTGGCT CATGAAAGGC TACTTCGATA CAGTGCCAAT GTCTTTAGAC 2040 

GAATCTGCAA AACTAGACGG TGCAGGACAC TTCCGCCGCT TCTGGCAAAT TGTTCTACCA 2100 

CTTGTTCGCC CAATGGTTGC CGTACAAGCT CTCTGGGCCT TCATGGGACC TTTCGGGGAC 2160 

TACATCCTCT CTACTT7CTT GCTTCGTGAG AAAGAATACT TTACTGTTGC CCTACCTCTC 2220 

CAAACCTTCG TTAACAATGC GAAAAACTTG AAGATTGCCT ACTTCTCAGC AGGTGCTATC 2 280 

CTCATCGCCC 7TCCAATCTG TATTCTCTTC TTCTTCCTAC AAAAGAACTT TGTTTCAGCA 2 340 

CTTACAAGTG GTGGCCACAA GGGATAATTT ATCCCCGCCA CCCTTTTTCA TTTTATACTC 2400 

TTCGAAAATC TCTTCAAACC ACGTCAGCTT TATCTCCAAC CTCAAAGTTG TGCTTTGAGC 2460 

AACCTGTGGC TACTT7CCAC TTTGATTTTC ATTGATTATT ACCAATTCTC ACTGTAAATA 2520 

ATATCCTTGT AGCAAGCAAT TTTTCTCCTA GACTTGAAAT AAAGCGCATT TCTCTATATA 25 80 

ATAATACTCA TATAGAAAAC ACCTTTTAGA AAGATACCTA TGCTTCCATA TCCATTTTCC 2 540 

7A77777CAA G7A777CGGG CGTTCCTAAG CCC77C7CCA AACG7TTCCA GCTCAACTCG Z TOO 

TT7CAACTTC TCTTTACCAG TA7CTTCCT7 ATCAGCTTGT CTATGGTACC CATTGCTATC 2760 

CAAAACAGCT CCCAGGAGAC CTATCCGCTA GAAACTTTTA TCGATAATGT CTATGAACCT 2 82C 

CTCACAGATA AGG7TGTCCA GGATCTCTCT GAACATGCTA CAATTGTCCA TGGCACATTA 2 380 

ACTTATACTG GAACAGCTAG TCAAGCCCCT TCTGTTGTGA TTGGTCCAAG TCAAATCAAG 2 94C 

GAATTACCTA AGGACTTGCA ACTGCATTTC GATACAAATG AGCTAGTCAT CAGCAAGGAA 3 000 

AGCAAGGAAC TGACCCGCAT CTCTTACCGA GCCATTCAGA CTGAGAGTTT CAAAAGCAAA 3060 

CACAGCTTGA CCCAAGCAAT TTCTAAAGAC TGGTACCAAC AAAATCGTGT CTATATCAGC 3123 

CTCT7CCTAG TTCTCCGTGC GAGCTTCCTC 7TTGGTTTCA A7TTC7TTAT CGTCTCTCT7 3130 

GGAGCTAGCT TTCTCCTTTA TATCACCAAA AGATCACGCC 7CTT77CATT 7AATACCTT7 3 24 0 
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AAAGAGTGCT ACCATTTTAT CTTGAACTGT TTAGGATTGC CGACTCT \\T TACACTTATT 3 300 

TTGGGATTAT TTGGCCAAAA TATGACAACC CTGATTACTG 7ACAAAATAT TCTTTTTGTT 3 3 60 

CTGTATCTGG TCACTATCTT TTATAAAACA CATTTCCGTG ATCCAAATTA CCATAAATAG 34 20 

GAGATTTTTA TGCCCGTTAC GATTAAACAC GTGGCCAAGG CTGCTGGTGT TTCGCCTTCA 3430 

ACCGTAACCC GTGTTA7TCA AAATAAATCA ACCATTACCG ACGAAACAAA AAAACGTGTT 3540 

CGCAAAGCTA TGAAGGAACT CAACTACCAC CCAAACCTCA ACCCTCCTAG CTTGGTAAGC 3 600 

AGCTATACTC AGGTTATCCG ATTACTTCTT CCTGATGACT CAGACGCCTT CTACCAGAAT 3 660 

CCTTTCTTTC CATCGGT7CT ACGTGGCATC TCTCAACTCG CATCTGAAAA CCACTATCCC 3720 

ATT C AG AT AG CAACAGGGAA ACATGAGAAG GAGCGTCTCA ACGCTATTTC ACAAATGGTC 3730 

TACGGCAAGC GTGTAGATGG GCTAATTTTT CTCTATGCCC AAGAAGAAGA CCCTCTCGTA 3840 

AAACTCGTCG CAGAAGAACA GTTCCCCTTC CTTATCTTAG GTAAATCTCT ATCTCCTTTC 3 900 

ATCCCACTTG TCGACAACGA CAATGTTCAA GCTGGTTTTG ATGCGACTGA ATATTTCATC 3960 

AAAAAAGGCT GCAAACGCAT TGCCTTTATC GGAGGAAGTA AAAAGC7CT7 CGTGACCAAA 4020 

GACCGTTTAA CAGGCTATGA ACAGGCGCTT AAACATTACA AACTTACCAC TGACAACAAT 4080 

CGCATCTACT TTGCCGACGA GTTTCTGCAA GAAAAGGGCT ATAAATTTAG CAAGCGATTA 414 0 

TTCAAGCACG ATCCACAAAT TGATGCTATC ATCACAACCG ATACCCTCCT AGCTGAAGGT 4 200 

CTTTGTAACT ATATTGCCAA ACACCAGCTG GATGTCCCTG TTCTCAGCTT TGACTCGCTT 4 260 

AATCCCAAGC TCAACTTGGC AGCCTATGTC GA7ATCAATA GTTTAGAGCT TGGTCGTGTT 4 3 20 

TCCCTTGAAA CTATTCTCCA CA7TATTAAT GATAATAAAA ACAATAAACA AATTTGTTAC 4 3 80 

CGTCAATTGA TCCCCCACAA AATTATCGAA AAATAACACA CTGGG CAAAA AGTCGTTAAA 4440 

AGCAAAAACG CATACTATCA GGTATTGAAA AAAC7TGA7A C7ATCCCTTT TATTGTGGGA 4500 

ACATTTACTT CCTTTTCTAC TGAAATTGAG TCTTTTCCCA AGATCTTTT7 ATAC7CAATG 4 560 

AAAATCAAAG TGCAAACTAG GAAGCTAGCC GCAGGTTGC7 CAAAACACTG TTTTGAGGTT 4 620 

GTAGATGAAA CTGACGAAGT CAG7AACCAT ACCTACGGCA AGGTGAAGC7 CACCTGCTTT 4 630 

GAAGAGAT77 TCGAAGAGTA TTAATCACTA AT7A7CTATC TCAACAAA7C TTCCTAGAAT 47 40 

ATGAACATTT TCCGAGACAG AGACAAAGGA GCTTGGATCC ACTTCTGTCA TAATCTGTTT 4 800 

AAATTCATTA AACTCTGCAC GTGTAATGAC AG TG ATT AAA ACTCCCTTTC 7CTCGTGATT 4360 

ATAGGTTCCT TCTGCATCGT GCA7CATGGT TCC7CCGCGG TGCAATT77T TATGGATTTT 4920 
TTCAATTACC T7CTCTGGAT GATTTGTCAC AATCATGGCC TGCA7ACCCT TTTGC77AG7 4 9 30 

AAAGACTGCG TCTGTCACAC GGC7AGAGAC AAAGATCGTA ATCATAGAAT AAAGAGCGTA 5040 
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T77CCAACCA AAGGTCAAAC CTGCTATCAG CATGATAGTT CCATT7ACCA AGAAAGAAAT 5100 

ACTACCGACA TTC7TACCCC TTTTCTTACG AATAGTCAGG CTGACCATA7 CCGTCCCACC 5160 

ACTGGAGATA TTGTTTCGAA GAGCAAAACC AATCCCCAAA CCCA7AACAA CACCCCCAAA 52 2 0 

AAGGGAATTG ATAATGGGAT CCTCTGTCAA GGTTGCCACA GGGACAAACT GGATAAAGAA 52 80 

GGAAC7CATA GATACCGTGA TAAAGGTAAA GACGGTGAAC TTATGGCCAA TCTCATACCA 5 340 

AGCTAAGACC ATCAAAGGGA AC7TAATGCC GTAGAAGCTT AGCGAAATCG GAATATGAAA 5400 

ACCAAACCAG TGATTACTCA AGGCAGAGAT AATCTGTGCC AGACC7CTTC CACCACTCGA 54 60 

ATACACATGC CCTGGTTGGA AAAAGAAAT7 AACTGCTACT GCTGATAAAA AACCATAGAC 552 0 

CAGAGAGGCC GAAATCTTCT CATCATACTT 7TCTCGACAG ATACTTTGTA AGACACGTAA 5530 

AATTTTTATC TGATAACCAA AGCGGCGCAG ATAATAGCGC CACCGCTTAA TTCGTTTTGT 5 640 

TTGTTTCATC TTCTTCTACT TGTAAGCTGA GTTCCTCTAG TTGTTTGAGA GCGACTGTTG 5700 

ATGGAGCTTG TGTCATTGGG TCAGTTGCCT TGTTCTTC7T AGGAAACGCA ATGACTTCAC 57 60 

GGATATTTTC TTCTCCAGCA AGCAACATGA CAAAACCGTC AAGCCCGATA GCCAAACCAC 5820 

CGTGTGGTGG GAAACCATAG TCCATGGCTT CAAGAAGGAA ACCAAACTGG TCATTGGCTT 5880 

CTTCAGTTGA GAAACCAAGA GCCTTGAACA TGCGTTCTTG AAGGTCTTTT TGGTTGATAC 5940 

GAAGGCTACC ACCACCAAGC TCATAACCGT TCAAGACGAT ATCGTAAGCA ATGGCACGAA 6000 

CCTTAGCCAA ATCACCTTCT AATTCATGAC CAGTCTCTTC CTGTGGAAGT CTGAAAGGAT 6060 

GGTGGGCGCT CATGTAGCGG CCTTC7TCTT CAGACCATTC AAACATCGGC CAGTCAACCA 6120 

CCCAAAGGAA GTTGAACTTA TCATTATCAA TCAAGCCAAG CTCTTTAGCA ATACGTCCAC 6190 

GAAGGGCACC CAGTGTTCCA T7ACCCACTT CAAGCGTATC CGCCACAAAG AGAACCAAGT 624 0 

CCTTATCTTC AAGAACAAGC GCTGTTGTCA ATTCTTCTTC GATACCAGTC AAGAACTTGG 6 300 

CAACTGGTCC GTTTAATTCT CCATCAACCA CCTTGACCCA AGCAAGACC7 T7GGCACCAT 63 60 

ACTGTTTGGC TACT7CCGTC ATCTTGTCGA TGTCTTTACG TGAATAGTTG TCCGCAGCTC 64 20 

CTGTGACCAC AATCGCTTTT ACAGCAGGTG CTTCTGAAAA GACTTTAAAG TCTACACCTC 64 80 

GGACCACTTC TGTCAACTCC TGAAGCAACA TGTCAAAACG AGTATCTGGC TTGTCAGAAC 654 0 

CGTAAAGAGC CATAGCATCA TCGTAT7TCA TACGAGGGAA TGGTAGCGTT ACTTCGATGC 6600 

CTTTTGTTTC CTTCATCACG CGCGCCATCA AGCTTTCTGT AATATCTTGG ATTTCTTGCT 6660 

CAG7AAGGAA GGACGTTTCC AAGTCGACCT CAGTAAATTC AGCCTGGCCG TCTCCACGCA 67 20 

AG7CCTCGTC ACGGAAACAT TTAACGAT7T GGTAGTAACG G7CAAAACCA GC ATT CATC A 67 80 
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AGAGCTCTTT CGTGATTTGT GGACTTTGAG GAAGAGCGTA AAAATGCCCC TTATTAACAC 6840 

GAGACGGCAC TAAATAATCA CGCGCCCCTT CAGGCGTTGA CTTACAAACC AATGGTGTCT 69 00 

CCACGTCGAT AAACTCCAAC TCATCCAAGT AGTTGCGGAT AGAGTGGGTC ACCTTGGCAC 6 9 60 

CAAGTTTAAG ATTTTCCAAC ATTTCTGGAC GACGAAGGTC AAGGTAACGG TAACGCAAAC 7 020 

GTGTATCGTC ATTTGCCTCA ATGCCATCCT TAATCTCAAA TGGTGTTGTC TTAGCTGTGT 70 8 C 

TAAGCACAAT AAGAGCTGTC ACGTTTAACT CAACCGCACC AGTTGGCAAC TTATCATTGG 7140 

CTTGTCACGC GCAGCGACCT GACCAGTCAC CTCAATAACA AATTCGCTAC GAAGGcTTTC 7200 

AGCTGTTGCC ATAACCTC7G CAGATACTTT TTCAGGGTTG ATAACCAACT GCATGATTCC 72 60 

TTCACGG7CA CGAACATCGA TAAAGATCAA ACCACCAAGG TCACGACGAC GGCCAACCCA 7320 

TCCTTTCAAG G77ATTTCTT GTCCGATGTG TTCCTCACCA ACACGACCAG CATACATACT 7 3 80 

ACGTTTCATT A7TTCTCTCC TCTTTTATTC TGTTACTATT TT AC CAT AAA AGCGCAGC7C 74 40 

TTCATGAAAA TCATCAGAAA AGTTTGCCAG TCTTTAAAAG TCAGGTGAAA GCCCTAAAAA 7 500 

TTAGCGCTAA TACTCTTCGA AAATCTCTTC AAACCACGTC AGCCTCGCCT TACCGTATGT 7560 

ATCG7TACTG ACTTCGTCAG TTTCATCTAC AACCTCAAAA CCATGTTTTG ACCTGACTTC 7620 

GTCAGTTCTA TCCACAACCT CAAAACAGTG TTTTGAGCAA CCTGCGGCTA GCTTCCTACT 7680 

TTGCTCTTTG ATTTTCATTC AGTATAATAC AAAAATCCGA TGAACTTCAC CGGACTCTTT 774 0 

TATTTTGAAT TTTTGCCTGC TTTACGCTTT TCAGCGATTT CGGCTGCCTT TCGAGGCAAG 7800 

ACAATTTCCG T7ATG7AACC CGTCCCAAAA CCCAGTACAC CTGCAATAGG AGCAAAGACA 78 60 

ACTCCTACAT AGTTATACAA GAAATCGCCT TTCAAGCCAT AACCTAGCGC TCCAATGATG 7920 

AAAAATAGAA CCACTGCCTG AATCACTGCT AATAAAATTA CTCGTTTCA7 GTGACCTCCT 7980 

GACTCTATTA 7AGCATGACA ATCATCAAAA AGCCGACTAA ATTATTCAAA GCGTGAAGAG 8040 

AAATACTGTA GACCAGACCT TTTCTGCTAA TGTAAGCCAA ACCCAAACTA AAACCAAGGC 3100 

TAAAATAGAC AAAAAATTGT TGCACATCAC CTGGAAAA7G AA7CAAGGCA AATAGAAGAC 8160 

T AG AT AC C AG AAGAAAAATC AGCCTTCGTT TACTATTGTC CTCCTTAGGA AAGACATAGC 32 20 

GTGCTAACAT CCCTCTAAAA ACAATCTCTT CCGTCAAAGG AGCAAAAATA ACCACAGCAA 82 90 

AGAATGAGAA AAGTGGTTGA GACAAGGTCA AGTCTGTCGC TATTTGCTGA TTTACTGAAG 8 340 

GATCATCTGG CAAGAAGAAT TGAACGACCA GAGATAAGAA CCAAACCAAG ACAGGAAGCC 8400 

AAATAAATCG ATTAAAGCCG C7CTTCTCAA TATGAACAGG AGCCTTCTGA TACCATTTGT 84 60 

AAATGCCGTA CACATATAC7 CCAGCCAAGG CC AC AT AG AG TAGAGTAACA GCATAGGGTG 8520 

AAGCGCCTAA AGCAAGCCAC GCAGTCGCGA GCCCCTGAA7 AAAGCCATAG ATAAA7AAAA 8 580 
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AGGATAGAAG GGCTAGAAGA ATCCAGCCAA GGTTTT7AAG TAATTTCATA GATAACTCCT 8 64 0 

T7ATTTGAAA TAACGTTTTA CCATAGGTAA CTGCATCACA TTGATATAAA CATGGATGGC 8700 

TCCTACAAGC AAGAAAGCTA GTAACTGAAT CTCTCCTGTC AAGAAAGAAA TGATAATAAG 8760 

AAAAA7ATAT AAGGCTGGTA ACACATATTG GTGTAATTGG AATAAAATTC GAAAACTCTG 8820 

TTCCAAATTA GCCTGACGCT CCCCTTCATC ATAAGAATTT ATATAGTTCA AGACATCCTT 3880 

TGGTGTAGCG AAAAATTCCA AATCAAAC7G ACGAACAATC GCAATGGTTT TAAAAAGAGA 8940 

TTTTTGAGCG ACTAAGAATA CCACAAAGAG TAAGAAAGAA AGGAAAAATG TTTGAGGGTT 9000 

TGTATGCAAT ATAATCACCT CACTTAATGA AATAAAAATA GCCAATGGAA TCGCTACACC 9060 

TGTAATATTA AAAGCAATGG TTCCAAACTC AAGATTCCGA TACATTTGCA CATAATACGT 9120 

TTCATTCAGA TCGTCATCCA TTTCCTCTTG ATACAAAGAA TGAAATTTTC TGCTTTTCTT 9180 

TAAGAAATTG AAAGTCAAAA ACATACTAAT GAAACCTATC AGTAAACAAA TAGCTGATAT 92 40 

CCATGGCATC AAGGCTTTTA CATCTAAAAT AATTTCGTGG GATTCGACAC GTGCCTTAAA 9300 

CATCCCTACA AACATGCCCA AGAACCCCCC AAGACAATAG ACATCAAAAA TAACAATCTA 9 3 60 

CCTTTCTTTT TCATATTCAT TCTCCTTTTT CACTTGCTAG ATTTTTGGAT TTCTTTTCAA 9420 

TCCATTCAAT TACTGGGATG AGAGCAAAGT AGACCCAAAC AAATTGGTCG CTTTGATAGG 9480 

GATTAAACCA GCTTAGCTCC ATCCCAATCA GTAGAAATAC GCTGACTAAT AAAGCTATGA 9540 

CC ACT AC AT A ATAAATCACT TTATACTTCT TCATCACTCG TCCTCCTCCA AACGAAATAC 9 600 

CGATTCGACT GTTTCGTTGA AAATTTGAGA TATTTTCAGG CCAATGATAA TGGATGGCGT 9 660 

GTACTCATCC CGTTCTAGTA GGCTAATGGT CTCTCTGGAA ACCCCTGCCA GTTTGGCTAG 9720 

GTCCG7TTCA TT GAG AC CAT COCGAGCTCG AAGCTCTTTT AGACGATTTT TTAGTTGCAT 9780 

GTTACACACC TACTCTCCGT CAAATTCAAC GCTTTGGATA TCCTCAATAC GTTGCAACTT 9840 

GAATTTTTCT TTTCCCGTAT TATCTACACG TCGTAGCTTT ACCCATTCCT CATCAACATC 9900 

CACAACTTCC CAGTTATCTG GCCCAATATA CACTCCCGTT ATAATTGGTT CCTTTCCAAT 9960 

CATTTCTTGT AATAATCTCG ACATTTCTCC GTTTCCTTTC TCTTTTCGCT CAAGTCTTTT 1002 0 

CATTTTATTC TCTAGTTTCT TGATTTTTTT AGAATTATTA GAATAAAAGA AAATCATAAA 10080 

TAGTATAAAT CCTAGTACCC ACATTATAAC TCCTTTCTGC TTCCTATTTC TTAACTTGAA 10140 

TTCATTGTAA CATATCTTTT TCTTTTTGAC AAGTATAGTT GTCAAAAAAA TTATGATTTT 10200 

TGTCATTTTG CAAAAGAAAA AGGTCAGGAG TACGTTCCTG ACCACTTTAT CTATCATTAA 10260 

TACTCTTCTA AAATCTCTTC AAACCACGTC AGCT7CACCT TGCCGTAGGT ATGGTTACTG 103 20 
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ACTTCGTCAG TTTCATCTAC AACCTCAAAA CCATGTTTTG AGCTGACTTC GTCAGTTCTA L0330 

TCCACAACCT CAAAACCATG TTTTGAGCTG ACTTCGTCAG TTCTATCCAC AACCTCAAAA 10440 

CCATGTTTTG AGCTGACTTC GTCAGTTCTA TCCACAACCT CAAAACAGTG TTTTGAGCAA 10500 

CCTGCGGCTA GCTTCCTAGT TTGCTCTTTG ATTTTTATTG AGTATAAAAT CCTACTTTTT 10560 

CAAAGATTTC TGAGAAGTTT TGGCTGATTG TCTCAAGTGA CACTTGCACT TCTTCTCGCG 10620 

T7TGGTTGTT CTTGACCGTC ACTTGTCCGC TTTCGACTTC GCTCTCTCCT AGGGTGATGA 10690 

GGGTCTTAGC CGCAAAGACA TCGGCTGACT TGAACTGAGC TTTTAGTTTA CGGTTGAGGT 10*? 40 

AATCACGCTC TGCTTTGAAA CCTTGTTGGC GAAGAGCCTG TACCAATTCC AAGGCCTTCA 10800 

TATTTCCCCC TTCGCCCAAG ACTGCGATAT ACACATCTAG CCCGTTTTCC ATAGGGAGGG 108 60 

TCACACCTTG CTTTTCAAGG ATGAGAAGCA GGCGCTCTAC ACCAAGTCCA AAACCAAATC 10920 

CAGCAGTTTC AGGGCCTCCA AAGTAAGCAA CCAAACCATC GTAGCGACCA CCCGCACAGA 1C980 

CCGTCAGGTC ATTGCCCTCA ATCTCTGTGA TAAACTCGAA AATGGTGTCG TTGTAGTAGT 11040 

CCAGACCACG CACCATATTG GTATCGATGA TGTAATCTAC TCCAAGATTT TCCAACATCT 11100 

GACGCACAGC ATCAAAATGA GCTTGGCTTT CTTCATCAAG AAAGTCCAAG ATACACCGCG 11160 

CATTCTC7AC TGCCACCTTG TCTTCTTTTT CCTTAGAGTC CAAGACACGA AGAGGATTTT 11220 

CCTCCAAGCG ACGTTGGCTA TCCTTAGACA ACGTCTCCTT GAGCGGTGTC AAATAGTCAA 11280 

TCAAGGCTTG GCGGTAGGCT GCACGGCTCT CAGGATTTCC AAGAGTGTTG AGGTGCAATT 11340 

TGACACCTTC AATACCGATT TCCTTCAAAA AATGGGCTGC CATACCGATT GTTTCCACAT 11400 

CGCTAGCTGG ATTGCTAGAG CCAAAACACT CAACACCAAT CTGGTGGAAT TGGCGCAAGC 114 60 

GCCCTGCCTG TGGACGCTCA TAACGGAACA TAGGTCCCAT GTAGTACAAC TTGCTTGGCT 11S20 

TTTCCACTTC TGGGGCGAAA AGTTTATTTT CCACATAGGA ACGGACAACG GGTGCAGTTC 11580 

CTTCTGGACG GAGGGTAATA TGACGGTCAC CCTTGTCATA AAAATCGTAC ATTTCCTTGG 11640 

TTACGATATC CGTTGTATCT CCGACAGAGC GACTGATAAC CTCGTAATGC TCAAAAATAG 11700 

GCGTGCGCAC TTCTGCATAG TTGTAGCGTT TGAAAATCTC ACGGGCAAAG CCCTCAACGT 11"? 60 

ACTGCCACTT AGCAGACTCA GCAGGTAAAA TATCCTGCGT TCCTTTTGGT TTTTGTAATT 11820 

TCATAGGGAA TCCTCTTTAA ACTTAATAGT CTTATTTTAC CATAAATAGA CCCATTAAAA 11880 

CAGTAAGAAA AAAATTAGGA TTTAGATATC ATTTTTGACA TTAAGAATTG TCAAAAAAAT 11940 

AGCTAGCAAG GAAAGACCAA CAAATAGCAT CCAAGTCAAC TGTATATTCC A7ACGGCTAC 12000 

TAGTGAAAAA CAAGCTGTTC CCACAGGTAT GGATAAGGTA AACAATAGAC CTAAAAAATT 12060 

ACTAGTACGA GCTAGAACCT CTGGAGCTAG ATTTTTCATG AGCATGGCAC TAATCTTTGG 12120 
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TTGAACTTTA CCAGACACAT ACAGAGTAAA GAAGAGAAAT AGCAAACCAA GCACGACTTG 12 190 

ATTGAATAAA TTAGCCAAAC CAACTAGACT AAGTCCTACG CTCTCCCACA TCATCAATCT 122 40 

AGGCAAGGAC TGCTTCCCAA AATAATCATT GCCCGTAAGG CTACTGATGA TGACTGATAC 12 300 

TAAAACACAG AATTGATTGA TAAATAGTGC CTCTGTATAA GAAAAATTCA AGAGAGAATG 123 60 

GCTCAAAAAG AAGATATTAT AAATTCCACC CAAAGCGCCA CCCAAGGAAT TAATAACCAA 12420 

GACAGCAAAG AGCATAAAAC CAAAGTTTTT CTGTCCACTT TTAAGAAAAA CGAGACGTAA 12 480 

ATTTCGGTAA ATTGTTAGGA ACTGGTCTTT GATAGAAAGC TTCTCATTTT TTAAGTTTTC 12 54C 

ACCATCACCA GATGACATTG ACAGGCTCAA TTTGCTTTTT CCTAAAAAGA GGATAGTGGC 12 600 

TGATACTAGG AAAAAGCAGG CATTGATTCC CGCAACGAGA GAAAAATTGT TGACCGATAG 12660 

AGCTAAGAGC CAGACTCCGA AAGCTTGACC ACCAATACCT GAAATATAGG TGATGAACTG 12720 

TGAAAAAGAA TAAGCCTCCA TCAGATCATC TTCAGCTACT TTTTCCTTAA TAAGAGGCAT 12780 

ACGCAGGCCA CCTGCAAAAT CACTGATGAT ATCACTAATG ACATTGATCA AACACACCCT 12 840 

AGAAAAGGCA AAGAGACTAG CTTGCTGAAC AACTAGGGCT GCTAGAAAAA ATAGAACCGC 12900 

CTGAAACAAA CCGCTATAGA CCATCCATTT GACCTTGTCC CTCGTGTAAT CTGCCCGAAT 129 60 

CCCTGCAAAA ACTGTAAAGA GCCTCGCAAG AATCATCACA ATATTCGCCA TACCAACACC 13020 

AAAAGATGCT TGTGACAAGG TCGATGCATA GACGATAAAG ACCACGTTGA AAATCGAAAC 13080 

ACCAAAAGCA TTGAAGAAGC GTGG 13104 

(2) INFORMATION FOR SEQ ID NO: 35: 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19250 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CCGGGCAAAT AGTTTTGAAC TTTTCATCAT TTTCTCCTTT AAAACTTTCT CTCCATTATA 60 

GACTCTTTTC AGAAAGTTGT CAACAGAATT TTCAGAATTT TTGAAAATTA TTTTTCAAAC 120 

AACATCTTTG CAAAAAATAT GAATATCGTA AGCGCGTCA7 AACAAGGTAT CTATCATTCA 19C 

TGGAGCTCCT CCTGTATACT ATTAGTAAAG TAAATATTGG AGGATATTTT AATGCCACAA 240 

CCTATTGTTC CTGTAGAGAT TCCACAATCT CGTCGTTTTG ATTCTAAAAA GAGAAATGAT 300 

ATTCTrCTTA AAATTCGTAT TGGCAAGCTT GAAGTAAGTT TTTTTCAA7C TCTCAATCTC 3 60 
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GAAATGATAG 


AACAGCTTTT 


GGATAAGGTG 


TTGCTCTATG 


ACAATTCATC 


TATCTAGCCT 


420 


AGGCCAGGTC 


TATCTCCTGT 


GTGGGAAAAC 


TGATATGAGA 


CAAGGAATCG 


ATTCACTCGC 


480 


TTATCTCGTT 


AAAACCCAC? 


TTGAATTGGA 


TCCTTTCTCC 


GGTCAAATCT 


TTCTCTTTTG 


540 


TGGTGGACGT 


AAAGACCGCT 


TTAAAGTCCT 


TTACTGGGAT 


GGTCAAGGAT 


TTTGGCTACT 


600 


ATATAAACGC 


TTTGAGAACG 


GCAGACTGAC 


TTGGCCCAGT 


ACAGAAAAGG 


ATGTCAAAGC 


660 


TCTCGCACCT 


GAACAAGTAG 


ATTGGCTGAT 


GAAAGGCTTT 


TCTATCACTC 


CAAAAATATA 


720 


GTAGATTGAA 


ACTAGAATAG 


TACACCTCTG 


CTTCTAAAAC 


ATTCTTAGAA 


ATCGATTTTA 


780 


CTGTCCTGAT 


CCATTTGTCC 


TGTTATTATT 


TCATTTTACT 


ATAAATCCAT 


CAGAAAGTCG 


840 


TGATTTCTAT 


TGAAATGAGG 


ACTTTCTTTT 


TATACTCATC 


TGCTTTCAAA 


AAGCACTCTA 


900 


GTCCATCTCC 


GATTAACGAT 


GGACTTTATC 


ACCTCCTTCT 


CCAGTCCTTG 


TATAACATCT 


960 


TGAAGTTGAT 


TCATGACATC 


TTCCAAAGTT 


CGAAAGGCTT 


TATTCTTAAA 


TCCACGTTTA 


1020 


CGAATCTCTT 


TCCACACTTG 


TTCAATGGGG 


TTCATCTCTG 


GTCTGTATCG 


AGGAATAAAT 


1080 


GCAAACCCAA 


TATTACTCGG 


AATCTTTAAG 


GTACTTGATT 


TATGCCATAT 


ACCATTGTCC 


1140 


ATAACGAGTA 


AAAGATAATC 


ATCTGGATAA 


GCTTGTGAAA 


GCTCCTATTC 


CTAAAGCCCC 


1200 


TTTATAACCT 


CTTGCGAGAG 


ACACTATTGA 


CTCAGCCCTT 


ACTTCATGCG 


GATGAAACCT 


1260 


CCTATCGGGT 


TCTAGACAGT 


GATAGCCATC 


TGACCTACTA 


TTGGACTTTT 


TTGTCAGGTA 


1320 


AAGCAGAGAA 


ACAAGGGATT 


ACGCTTTACC 


ACCATGATCA 


GTCTCGAACT 


GCTTCAGTAG 


1380 


TACAAGAATT 


CCTAGGAGAT 


TATTCTGGCT 


ATGTTCATTG 


TGATATGTTG 


CCGCAGTAAC 


1440 


TTAGGACTTT 


AGTCCTCTAG 


TTCTCCCTAT 


GCGATAGCAG 


TCCAAGGTTT 


AGGAGTAAGG 


1500 


CGACGCTAAG 


CTTGGTAAAC 


TGCGAACAGC 


TAGAAGCTTA 


TCCTCAACTG 


GAAGAAGCTG 


1560 


CACTTGTTGG 


ATGTTGGGCG 


CATGTGAGAA 


GGAAGTTTTT 


TGAAGTGCCC 


CCCAAGCAAG 


1520 


CAGATAAATC 


ATCCTTAGGA 


GCTAAAGGTT 


TACCCTATTG 


TGATCAGTTA 


TTTTCCTTGG 


1680 


AAAGAGACTG 


GGAGGCTTTG 


CCAGCTGATG 


AACGGCTACA 


GAAACGTCAA 


GAACATCTCC 


1740 


AACCCCTACT 


GGAAGACTTC 


TTTGCTTGGT 


GCCGTCGTCA 


GTCAGTTTTA 


TCGGGTTCAA 


1800 


AACTAGGAAG 


GGCAATTGAA 


TACAGCCTCA 


AGTATGAAGA 


AACCTT7AAG 


ACCATTTTAA 


1860 


AAGACGGACA 


TCTGGTCCTT 


TCCAATAATC 


TAGCTGAACG 


CGCCATTAAA 


TCATTGGTTA 


1920 


TGGGACGGAC 


TAAAAGAGTC 


CAGTGGACTC 


TTTTAGCCTA 


AGCTCAGTTT 


AAAAAAACGA 


1980 


GGGTGGTTAT 


TTTTAAAAAA 


GCGAGGGTGG 


TTATTTTCTC 


AAAGTTTTGA 


AGGAGCTAAA 


2040 


GCAAGAGCTA 


TT ATT AT GAG 


TTTGTTGGAA 


ACAGCTAAAC 


GTCATCAATT 


ATAGTGCGTT 


2100 


CAATCTATAA 


CAGTACGCA? 


CGACTGCTAA 


AATATTTCTA 


TAAATCAATT 


TTCCTTTCCT 


2160 
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AATCGATTTC 


TTCATATCTT 


ATTACAATCC 


ATTATAAATA 


GCGAGAAATA 


TCTATCCTAT 


2220 


CTTCTAGAAT 


GTCTTCCAAA 


CGAGGAAACT 


CTCGTAAACA 


AAGAGGTTTT 


AGAGGCCTAT 


2230 


TTACCGTGGA 


CTAAAGTTGT 


ACAAGAAAAG 


TGCAAATAAG 


AAATCTCCAG 


A7TAGCAACT 


2340 


ATATATGAGT 


TCTCTAGTCT 


GGAGATTTTT 


CAATAGACTT 


CGTTATTGGG 


CGGTTACTTT 


2400 


CGAAACTTTG 


AAAACTTCAA 


AAAACGGATT 


TTTATCGCTC 


TGAACATCAA 


AAAAGAAAGG 


2460 


ACGAAATTTG 


TCCTTTCTCA 


AGCTTAGCTT 


TTCTTCAACC 


CACTACAGTT 


GACAAAGAGC 


2520 


CC TTTATTCT 


ATCAAACATC 


AAGCGCAAAA 


ACAAGCCAAA 


AATCCGATAG 


AATGGCTATC 


2580 


CCTCGACTAT 


CAAGTAAGAC 


ATTTCCATCA 


AATACGTTCA 


ATTTT A C TCT 


TGTTCTACTA 


2640 


AGAATTAATC 


ATCTCGTTTT 


GATTTATTAA 


AAATATACAA 


TTCAGCTTTT 


CCTCCAAACT 


2700 


ATTTTATCCA 


CTATCCCTGT 


ATAGCTCTGT 


ATTATCTTAA 


CAACTTTAGT 


AGAGACATTT 


2760 


TCCTCAACAT 


AATCCGGAAC 


CGGTAATCCA 


AAATCCTCAT 


CTTGTGCCAA 


GCTAACAGCA 


2820 


GTTTCAACTG 


CTTGAACAAG 


AGAATTTTCA 


TCAATGCCTG 


CCAAAATAAA 


TCCTCCCTTA 


2880 


TCTAAGGACT 


CAGGACGTTC 


TGTACTTGTA 


CGAATACATA 


CAGCGGGAAA 


AGGATAACCT 


294C 


TGACTAG7AA 


AGAAACTACT 


TTCTTCCGGT 


AAAGTTCCCC 


AATCAGATAC 


TACAACAAAT 


3000 


GCATTCATCT 


GTAAACAATT 


ATAGTCATGG 


AATCCTAGTC 


GCTCATGCTG 


AATCACACGT 


306C 


rr *T AT C T AG T*T 


TAAAACCGCT 


CTCTTGTAGC 


CTTTTCTTTG 


ATCTAGGATG 


GCAAGAATAT 


3120 


AAGATTGGCA 


TATTATACTT 


TTCAGCTAAT 


TGATTAATTG 


CTGTAAAGAG 


AGAAATAAAA 


3180 


TTTTTATCTG 


TATCAATATT 


TTCCTCACGG 


TGAGCTGAAA 


GTAAGATATA 


ACCTCCTTTT 


3240 


TTCAATCCCA 


AACGTTCATG 


GATATCTGAA 


GACTCAATAG 


CAGATAAATT 


TTTATGTAAC 


3300 


ACTTCTGCCA 


TAGGAGAACC 


AGTTACATAT 


GTGCGCTCTT 


TACCTAAACC 


ACACTCATGT 


3360 


AAATACTTAC 


GTGCATGTTC 


AGAGTATGCT 


AAGTTAACAT 


CTGAAATAAC 


ATCAACAATC 


3420 


CGACCATTAG 


TCTCTTCCGG 


TAGGCACTCA 


TCTTTACAIjC 


l L Vj^_ ^ AO^w 




3480 


AAAATTGGAA 


TATGTAAACG 


CTTGGCAGCA 


ATAGCTGATA 


AACAAGAATT 


TGTATCCCCT 


3540 


AAAATCAATA 


AACCATCTGG 


TTTAATTTGA 


TTCATCAATT 


TGTATGAAGT 


ATTAATAATA 


360C 


TTCCCTACAG 


TAGCACCAAG 


ATCATCTCCA 


ACAGCATCCA 


7GTATACGTC 


CGGAGTGTCT 


3660 


AACCCTAAAT 


TATCAAAGAA 


AATACCATTT 


AAATTGTAAT 


CATAGTTTTG 


TCCAGTATGT 


3720 


GCCAAAATAA 


CATCAAAATA 


CTTTCGACAT 


TTAGTGATAA 


CACTACTTAG 


ACGTATAATC 


3780 


TCTGGACGTG 


TTCCCACAAT 


AATCAATAAC 


TTAAGTTTGC 


CATTATCTTT 


AAAGTGAATA 


3840 


TCACTATAA? 


CTCTCTTAAT 


TTTCATTTAT 


TTCTCCACTT 


GTTCAAAAAA 


AGTATCTGGA 


3900 
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TGTCTAGGAT CAAATGACTC ATTAGCCCAC ATCACAGTAA TTAGATTTTC TCTATCAGAA 3960 

AGATTAATAA TATTATGTGC ATAGCCCGGT ATCATATGTA TTGCTTCAAT CTTATCGCCC 4020 

GACACTTCAA AGTTCAGAAT AGGATACTCT TGACCGTTTT CATCCAGCCC TATCCTACGC 4080 

TCTTGTATTA AAGCACGACC AGAAACAACC ATGAAAAATT CCCACTTAGA ATGATGCCAA 4140 

TGTTGCCCTT TGGTAATGCC AGGTTTAGAA ATATTAACAG AXSATTGACC CGTATTTTCT 4200 

GTTTTTAATA ATTCCCTAAA ACTACCTCGT TCATCTATAT TCATTTTTAG AGGAAACTTA 42 60 

AACTTA7CTA CTGGTAAATA AGATAGGTAG GTAGAATACA ATTTCTTTTT AAACGATCCC 4 320 

TGAGGAATTT CAGGCATAAC TAAACTATCA GGCTGTTTTT TAAATGTTTC TAATAGAGAG 4 3 80 

ACAATCTCTC CTAAGGTTGC ACGATGAGTC GTTCGTACGT AGCAGTAGTT TCC7CATCCG 4440 

CTAGGTAAGA TTTGTAATCC ATCTAGATTA CAACGATGAG GATTTCCTTC CAATGCAGTT 4500 

AGACACTCTT GTATCAAATC ATCAATATAC AGCAACTCCA ATTCTACACT TGGATCATTT 4560 

ACTTGAATAG GTAAATCGTG AGC7ACATTA TAACAGAAAG TTCCTACACC AGAATTGTAG 4620 

TTAGGACGCC ACCACTTCCC ATAAAGATTC GGGAAACGGT AAACTAAGAC AGGTGCTCCC 4680 

CTTTTCTTTC CATATTCAAA GAAGAGTTCT TCCCCTGCTA GCTTAGATTG TCCATATATA 4740 

GAGTTTGAAA ATCGGCCTTC TAAACTAGCT TGAGTAGAAC TTGAGAGTAG AACAGGACAA 4800 

GTGTTTTCAT ACTTTTCTAA AATCTCCAAT AATCTACTTG AAAAACCGTA ATTTCCCTCC 4 860 

ATGAATTCAT CAGGATTCTG TGGACGATTG ACACCAGCTA AATGGAATAC CAAATCCGCC 4920 

TTCTTACAAT ATTCATCTAA TAAAATCGGA TCTGTATCAC GATCATACTG AAAAATCTCT 4 980 

CCAATCTCTA AATTAGCACG AGTCCTATCT CGTCCATCTT TCAAAGCTTC C AG ACT AC AG 5040 

ATAAGATTTT TTCCTACAAA TCCTTTCGCT CCTGTGATTA AAATATTTTT AATCATGCCC 5100 

CCTCCTTATT TTATATGCTG TTTTAATAGT TAACTCTCTC GACAATACAT GAT AC ATT AT 5160 

ATATCCTTGA TAATTTTAAT GTATCTTAAA AGATTTTACA TCTCTTCCTC TGCTACCATA 5220 

TCACGAATTG CTGTCTGTAT TTCATCTAAT TCTAGCAACT TTCTTTTAAC TTGCTCTACA 5230 

TCCATCAAAT CGCTATTATT ACTATTGAAT TCTGTCAACA AATTTCTATT CG7ACTACCA 5 340 

TCTTTGAAAT ACTTATCATA GTTAAGATTA CGATTATCAC TAGGAACTCT ATAAAAATCA 5400 

CCCAAATCAA TTGCATTTCC GCACTCTTCG TTAGTTAATA GTGTTTCATA CCTTTTTTCT 5460 

CCGTGTCTAA TACCTATAAT CTTAATATCT TGTTCTGAGG CAAAAATTTC TGATACAGCC 5520 

TTAGCCAACA CTTCAATCGT ACATGCTGGT GCTTTCTGAA CTAGTATATC TCCAGATTTC 5580 

CCTTCTTCAA ATGCAAATAA AACCAAGTCT ACTGCTTCTT CCAATGTCAT CACAAAACGT 5640 

GTCATGCTAG GTTCAGTAAT TGTAAGAGCA TTTCCTTCCT TAATTTGCTC AATCCAAAGA 57 OC 
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GGAACGACAG ATCCACGCCT ACACAGAACA TTCCCATACC GAGTCACACA TATCTTTGTA 5760 

TGCTCAGGAT TTACCGTCCT GGACTTAGCA ACAGCAATCT TTTCCATCAT AGCC7TCGAT 5820 

GTTCCCATAG CATTGACAGG ATAAGCCCCC TTATCTGTAG AAAGACAGAT AACTTGCTTT 5880 

ACACCAGCTT CGATAGCCGC AGTCAGGACA TTCTCCGTTC CCAAAATGTT AGTTTTTACC 5 940 

CCTTCTACAG GGAAAAATTC ACAAGAAGGT ACTTGTTTAA GAGCAGCAGC GTGAAAAACA 6000 

TAATCCACAC CATGCATAGC ATTTTTTACC GAAGCTAAGT CACGCACATC TCCAAGGTAA 6060 

AAACGGATTT TCCCAGCCAC TTCTGGTACT TTTACCTGAA ACTCATGACG CATATCATCT 6120 

TGTTTCTTTT CATCTCGCGA AAATATACGA ATCTCTGAGA CATCTGTTTC TAAAAAACGC 6180 

TTGAGAACCG CATTCCCAAA TGAACCTGTC CCTCCTGTAA TTAGGAGAGT TTTTCCTGTA 62*10 

AATTGTGACA TATATTACAC TTCTCCTTCT AGTATGTCTG CAATTTTCTT ACAAGCCGTT 6300 

CCATCTCCAT ATGGATTTGA AGCTTGACTC ATTCCTTGA7 AAACTGAATC ATTTTCTAAT 6360 

AATTCTTTAA AATGCCTATA AATATTATTT TCATCAGCAC CTACAAGTTT CAAAGTCCCT 6420 

GCTTCAATTC CCTCTGGACG TTCACTTGTA TCTCTCATAA CCAAAACAGG TTTTCCTAAA 6480 

CTTCGACCCT CTTCCTGAAT ACCACCACTA TCTGTTAAAA TTAAATAACT TCTTGATAAA 6540 

AAATTGTGAA AATCTAATAC TTCTAAAGGT TCGATCATCT TGATACGTTC ACAGCCACTT 6600 

AGTTCTTCCT CAGCAATTTG GCGAACACGA GGATTCATAT GGATAGGATA AATAGCCTTG 6660 

ACATCTGAAT ATTCTTCAAT AATCCTTCTA ATTGCTCTAA ACATATGTCT CATCGGTTCA 6720 

CCAACATTTT CACGACGATG AGCTCTAATT AGAATAAACC TGCTTTCTCC TATCCATTCT 6^80 

AACTCAGGAT GCGTATAGTC CTCTTCAATT GTAGTTTGTA AAGCATCAAT CGCCGTATTA 6840 

CCTGTCACAA ATATGCTCTC TGGAGtTTTT CrrTCTCTT* AAACAT7A7C 7777CAAACT 5900 

TCTGTTGGTG TAAAATGATA CTGACCCAAA ACCCCAACTG CTTGACGATT AAACTCTTCA 6960 

GGATATGGTG AATAGATATC GTAAGTGCGC AAACCAGCTT CAACATGACC AATTGGAATC 7020 

TGTAAATAAA AGCCCGCCAG TGAACTAGCG AAGGTCGTAC TTGTATCCCC A7CAACTAAC 7080 

ACCAAATCAG GTTTTTCTGA CTCTAAAATA GCCTTCATTC CTTCCAAAAT GCCAATGGTC 7140 

ACATCAAATA AAGTTTGTTT ATCTTTCA7A ATAGACAAA7 CAAAA7CCGG AATAATCCCA 7 200 

AATGTGTCCA AGACCTGATC CAACATTTGA CGGTGTTGGC CCGTAACGCA AACTAATCT7 72 60 

TCAATATTCT TACGTGTTCT TAACTCTTTG ACCAAAGGAC ACATCTTGAT CGCTTCTCGA 7 3 20 

CGAGTTCCAA ATACTACAAC TACTTTTTTC ATATATTTAC TTACTCCTAA CAAATAATGA 7 380 

ACGGTTCTTA AAATAAATTA GATAACGGCT AATCCATAAC ACCACCTCAG ACATACTTCA 74 40 
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ACAAATAGCT AATCTTACTA AACTAAAATT ATCAGACAAG ATAAATATTC CTAATCCCAA 7500 

AGTTTGGACA ATCGAAGCTA ATATAGTTGT CATTGTACTT TCTTTCACTT TATCAATAGC 7560 

TCCTAACACA GGCCATCCGT AAATCATAGA ATAAAAACTA GCAACAAAAG CGGGT AATAA 76 20 

GTACTTAAGA AAATCTGCTG AAACGGTATA TTTTTCACCA CCAATTATAC AAAGAATTTG 76S0 

ATTTGAAAAG AATAAAACTA TCAAAACTCC AAAGATAATA GGAATAAACA TAATCCGATT 7740 

AATACTCTTA ACCGATTGTA TATCTTTAGT ACCTATCATA TGCGGATATA AACTATTCGC 7 800 

TATAGGATTA TACAATGATT TTGCTGCTGA AAGCAGTTGC ATTCCTATCC CCCAAAAGGC 7 860 

TATCTCTTGA CTTTGTAAAT AAAAACCCGA AATGACTGTC GTAAAGACGC CAAAAATAGT 7 920 

AGTTGCAAAA TTGGATAAAA AATAAATAGA GGATTCCTTT AAATCTTTAA CCCAAACAGA 7980 

CAGATAAGAA AATGATAATT TAATTCCATA ATAATGAAGG AATCTATAAG AAACTACTGC 8040 

AGCAACTAAA TTCCCAATTC CTTCCAATAT AGGAATCCAT AAAATAGAAG AATCATCTTT 8100 

TACTACAATA AATGTCAAAA TTGTAATGAT AGTTTTAGAA ATAATATAAG GAATTGCAAC 8160 

TGCATGCATC TTTTCAATTC CACGAAATAA AAAGTCAAAG ATAAAAATAT TGGTCACTGT 8220 

AGCTAACAAA TAAAAAACTG AAAAAAGAAT ATTCTCTCTC ATTATTGGCA TTTGCCACAT 8280 

CAATATGGTG TAAATTAGAA TCGAAATGAT AGATAAAAAT ATTTTTTCAA CTAGAGTATC 8340 

TCCAACTATC CTTCCAATCT TTGAGGGAGT AGTACAAGCA TTTACAATAT TTTTTGTAGC 84 00 

TGATATCATG AAACCAAAAT CAATCACCAG TTGAACATAA GCTATTAACG CTTTAACATA 8460 

AATAACCATT CCATACGCGT CTACCGAAAC CACCCTTGTC AAATACGGGA GTGTT AATAA 3520 

AGGAAATAGT AATTTAACAA TATTCAGAAT AT AC AG AG AA CTTGTATTTT TTATAAATGA 3580 

AATTCTATCA ACTTTCACGA ACTAGTCCTT CCAAAAAAAG ATCTAAATAG TCCAAACTAC 8640 

TTCTCGCTTT CAACACCAAT TCTGAAGGTA TTGTTATCGG TTTTAGATGA AAAGTTTCAA 8700 

GTTTCTTTAC AATACTATTA ACACTTGAAT CAAATAAAGA TTCACAACG? TGTAACTCTC 87 60 

CAATTGCTCC ATAATAACGT GCTGTTTTTT CTGGATGGCA TGCAATGGCA ATCACAGAT V 8820 

TATTAAAACA TGTTGCCACT ACCCCAACAT GTAATTTACA AGTTAAAACC ACATCTACCA 'J880 

TTTTCAACAA TGATGTCATT TCTGCAGGAG AATGATACTT GAATTGAAAA CAATCCTCAG 8940 

TTCTAACTAA TTTTCTAAAT TCCTGATAAT AAGCATCTTC ATAAGGTAGA ATGGAATCCG 9000 

AAGTTACTAC AACATAATAG TTAGGATTGT TTTCTAGAAA AAGACTAATT GATTCCGCAA 9060 

ATTTTTCAAG AGCTTTTTTG GAATGATTAT AGTGAACAAG AATTATCTTC TTATCTTTAC 9120 

CTTCTCTTTT CAATTGACAC AGCTGCTCTG TTTTTTCTTC TCTTAATTTA CTTGAAATAA 9180 

TTAAATCAAA GGTTTCATGC ACTGGAGCCG AAGGCGACAA ATGCTTCAAA GAATCAAATG 92 40 
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ATTCTCCATC ACGAACTGTA ATAAATTGAG CATGATTAAT AATTCTCTTT ATACCATAAT 9 300 

TCATCAAAGA ATCGTTATTA GGCCCTGCAC CAATACCTAA TACTCCTATA GGCTTTTTAA 9 360 

AATATGAAGC CCAAATTCCC AAAGGTAAAA ATCGTTTAAA TTGGATTAAA TTATCACGAA 9420 

AACGTGCATT ATGCCCTTCC CCAAAATATC CTCCCGGGA? ATACAAAATA GCATCTGCTT 94 80 

GTTTTTTAGT AAAACTTTGT TTTTGGCGAT ATTCTTTCAA GTACATTTGA AAGAAATCTG 9540 

ATGG ATTAT A ^CaAAGAAACT~ TCAT ATCCTT TAGATTCTAA TAAATCATAG ACAATCTCAC 9600 

CGTAAAGATA ATCACCG7AA TTACTTGAAC CATAATCCGT TGCACCATGT AACATAATTT 9660 

TTTTCACCAC TATTTTTTCA ACCTCCTAAA AATAAATA7C ATAATCAAAC TATACATAAT 97 20 

AGGACGATAA ACATCTATTG AACTACTTCT CACTAAAAGC AATAGTTGAG AAATTACCGA 97 80 

AAAATAAATA ACTTTTGAGA TTTTACTTGT TTGAAAAGCT CTGAAA7TTA ATCGCCATCC 9840 

ACTAAATATT CCCAAAACAA AACTCCAAAA AACACCACCA TAGTAACCAA AGTTCCAAAA 9900 

TAATTCTTCC ACAAAAGAAG AGCCTACAGG TAACCCCAAA AATTTATTAA TAACAACCGT 99 60 

CGCTGATGCT TTATCAAAAA AATCACCAAC TAACCATCCA ATAGGAAAAA TTGATAGGAT 1002 0 

AGTGCCTAGA AATGTCATCC CATATTCATA TGGAATGCTA CTAGGCACAA CAGTTACAGC 10080 

AGAAGCTACT GTTAGGCTGG TCACTCCCGA CTCTGAAAAT ACTTCCCCTA GTATATTCTT 1014 0 

TACAAAATC7 AATGAAGAAA AGGAATCAAA TAAGTATATA CCTATAGTAT TCAAGTCGAA 10200 

ACGGTGCCCC CTAATAACAA CTAATACATT TAATAGAAAT ACACTTACTA TTAAAAATAC 10260 

AAGTACTCTT TTCTTCGAAA AAGTAATCCC TAAAGATTGT GTGTATACTA AAACCAACGC 10320 

CAAGATTGAA AACACCTGGA TTTTACGACT TCCTGTTAGG ATCATTATCA AAATTAGGTA 10380 

AAACAACATT ACCCAAAAAA TAGTACCCTT TATAACTCGC GACAGCTTAT CTGAATAAAA 104 40 

CAAGGAGAAC ACACCAGGAA GCATAAGTAC TCCTAAATCA TCTATTATTC CTGAACTACC 10500 

TGCCTCTGAA TATGCTGAAT AGCTATTCGC CGCTCTAACT GCTAGTACTG TTTTAGAATC 10560 

AGTTATTACC CTAGAAATAA AGCCCACTCC TGTTAAAATC CTACCCGCAT TGTACAAAAT 10620 

TTTCTCTTCA TTTTCCTGAT AATTTTGTAC TTCTGAATCA TAATGTACCT TTCCATCACT 10680 

ATAAAAAAAT AAATAGCCTA CAGAATAACA AAACAAAATC CAAATTATAA AAATATATGA 1074 0 

ATGAAATAAT TCTTCATTAT TATAGAAGTT ACTAGGCCTC CACAGCAGAG TTGTTTGAAA 10800 

CCCCATATAC TCATTGAAAA TTAATCCAAA CATAAAAAAA TAAGATAAAA TCACATACCA 10860 

TACAGAAAAA TCAT AT AT AC TAACTTTTTG TAAAATAAAA CCAGTAATTT GAAAAATAAT 1C920 

TAGAAAGCAA ACCCATATAA ATATAGACGG AACATAATTA GATATAAGAA AACCATTATT 10980 



WO 98/18931 



PCT/US97/19588 



354 

CCAATTATCG AGAGTCCAGA ACAAGTAACA GAAAGCAAAT ATAA ^ACTTA ATGTCACTAG U040 

TGTCACTCTA CAAA7ATACT TTGTCTGCAT CTATATCTCC TTTATTACAC ACATTTCTTG 11100 

ATAACGATTC AATAATTTAC TAGCTTGATA ACAAATATCA TAGAGTCCAT CTGTCATACT 11160 

GTTATTTATT TCAAAACGAT TGCATTCCTC AGATGTTAAA GACACTACT? TATCTTTCCA 11220 

TAGCAACACA GACTCTTCGT TGATAGGTAA GTAACTAATG TTTTTGGTCA CATCTACTTC 112 BO 

TTGCGTCACT GTATCTGACG ATAAAATTTC TAATCCCGAT GCCTGAGCCT CTACTAGAGA 11340 

AACAGGCAAC CCCTCATATT TAGACGGAAG CAAAAAAACA TCCATCGCAG ATAATAAATC 11400 

AGAAATATCA GTCCTTCTCC CTAAAAATAG CACATATGGG GTCAGATTTA GTTCTAAAGC 114 60 

TTTCTGTTTT AATTTCTGCT CATCCTCACC ATTACCAACT AGGAGTAAAA TAACATTTGG 11520 

TTTGATTAAA ATGAGTTCTT TTAAAACGTT AAATAAATAA CTTTGGTTTT TTTGATCTGA 11580 

TAGGCGAGCT ATATTTCCTA AT AC G AACTT ATTTGACACA TCTAATTCTC TACGACATTT 11640 

TTCTCTAACA TCTGACAAAA ATTGATACTT TTTCAAATCA ATTCCATTAA AAATAATTTC 11*7 00 

AATTTTTCCG TCTTTATACG CTTTCTCTCC ATATAACCAC TTACCCGAAT CTTCCCCACA 11760 

TGCAAACCAA TGAGTTGCTA AGATTTTTAC CAAAATTGTT ACTAATTTAC GCAATACTTT 11820 

TTGAAAACTG TTTTCTGTTA CATAAGCCAT ATGACTATGA ATAATTCTAA TTTTACAACC 11980 

AATTATTTTA GAT AAGATC A GACCAATTGC AGATTTATAG CCATGGCAAT GAACTATATC 11940 

ATAATCTCCT TTCTTTATTA TTCTAGCAAG AGAGAGAAAC TGATCTAGAG GCTTTTTCCT 12000 

TAATAGAGGC ACATGATAAA CCTTTGCACC CAATTCTTTC ATTTTATCCT CTAAAAATCC 12060 

TTGTTCTTTT CCAGGCACAA TAAAATCAAA TTGAATTTTT TTTCTATCAA TGTGAGAATA 12120 

ATAGTTGAAT AGAAAACTTT CTACTCCACC ACTATCTAGT CTTCTAAATA GATGTAATAC 12180 

777AATCA77 C77C77CC77 AAGCTTAAGA TTCGCT7C7C TAATTCTA77 7CTGTTTTTT 12240 

GTT7TTCTAA ACTAATTCTG TCCATGAAGT TATCACAATT CTTAATTAGC TCTTTCCTGT 12 300 

CAAGGTTTTG AATATACAAA GCCAAACAAT CTTTTTCCGA 7TCATCCTTC ATAGGTAAAA 12360 

CGAAACCAAA ACCATTCTCT ATTGACACTT TTTCCATATA AGTATCTTCA CAAACTAAAA 12 420 

TAGGTTTATA CAACAATGCA GCAAAGTAGA GTTTATTAGA CAAAGCATAG TCTAGTAAGG 12480 

GACTCTGATT CCCGTATAAA TTCAAAACAA CATCTGTATT CTTATAAAAA GACATGGTAT 12540 

CTTTAGGCTG GAATGTGTCC ACCAAGTTAA CATTGCTGAT ATTTTTTTCT TGACAAAATT 12600 

CCCTTAATTC TCCTGCATTA GTACCTATAA AATTCAACTG AAATCGACTG TCATTTGCAA 12660 

AAAAATCGAT TATTTTTTTA TTTTGTTCTT GAAAACGAAT TAAACCAATG TAGGAAAGTT 127 20 

GAATTGGAAA CGTACTA7TA 7TTTTTAACT GC777ACCTC G7TTAAT7CT ATCATATTGG 12780 
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GTAGGTTATG CCTAGTAAAA TACTCTCCCA TTGGTAAAAA AAATTTATAC CCGTCTGAAG 12 840 

AAACGATATT CATTAAAGAA TTTTTCACCA ATTGTTTCTG AACCAAACGA TAAACCAAAA 12900 

ATTTTTCATA ACTGTAATCA CGAATATCAT AAATATATCT ATTTTTAAAT GAAAAGAGAA 12 960 

GAAAATCTAC TAAAATGAAA GACACAATAC TATGTAACGG CAATATCATA TCATAATCAT 13020 

TTTCTTTTAG CTTCTTTTTA ATTTCTTTTC TGAATTTTAC ATAACC7AAT ATCTTACTTA 13080 

ATTTTC CTTT ACCAGAAAAA GAAATACGAT AGTAGTTTTG TTT7GTAATA ATCTCGTTAA 13 HO 

TATTCTTATC CCAATATATA ACATCGTAAC TAATAGACAG TTTCTTCAAT AATTCTTTAT 13 200 

AAAAATTGAA GTAAGGAGTT AGATATATAT TATCAGATAG TA7AAACAG7 ACTCTCATTA 13 26C 

AATTATTCTT TCTTACTTTC CCTCTCTAAA CATGTCTCCA GTTCGAGCAT AAACTGCTCT 13 320 

TTTGAAAAGT G ATTTTC AT A GTAACAACGA GCTTTCTTTC CTAACTCTCT TTGTCTCTTA 13 380 

ATAGATAACA TACTAAATTT ACAAATATTT TTTGCCAATT GTTTTACATC TCGTTCGGGA 13440 

CTAACATATC CACAATTTGC TTCTTCTACA ATTATTTTAG CATC7CCTCA AATTGCACCT 13500 

ATAATTGCTT TGCCTGCCGC CATATAAGAk TGTACCTTCC CAGGTATAGT ACGAGAAACT 13560 

ATCGAGTCTC CTATTAAAGA AACTAACATA GCATCTGATT TTTTATACAA GGATGGCATT 13 620 

TCCTCCAAAG AACGTCTTCC ATAGAAGGAA ATATTCTTTA ACTCCAATTC ATGAGCTAAT 13 680 

CCTTTCATGC TTAACAATTC CGTACCATCT CCAACAAAAT GAAAATCAAT TTTCTrGCCT 13 740 

AAATTGGTAT TCTTCTCTAT CAAACTGGCA GCTTTCAAAA TACTTTCCAA ATTTTGTGCT 13 800 

TTGCCAATAT TACCAGCAAA AGTTAGGTCA ACACT7TCTT TATTAACTAT AGATTCATCA 13860 

GGGATAAAAA CATCTTCTCC ATATTGTGGC AAATATG7AA TCTTTTGTTC GGATATGTCA 13 920 

AA77GCTTCA CAAAA7AATT 777AAATCA7 CGACTACTCA CAAA7ATA7A A7CAC7ACCT 13 9 3 0 

CGGTAAACTT TTTTTGAGAT AAATTTAAAC AGCTTGAAAA TCAAGCCATC TTGTTTCACT 14 040 

CCACCTACGG TTAAACTATC TGGCCAAACA TCCATACAAT ATAGAAACAT CGCTTTCTTA 14100 

TATTTTTTTT TATAAGCCAT ACCAGCCCAT GCCATCATAA CTGGAGACAA TTGGTTAACG 14160 

AATACACAGT CAAAATTCGA TCCATCTTTC GTTTTATACC TCCCCAATAA AACTCCTAAA 14 2 20 

GTAGAACTAA TTGCAAAGCT AAAATAATTC AACAATCGAA ATACAACACT TTTTTTTCTA 14 2 80 

GGGATTGTAT AAGAACGATA TATCGTAACA CCTTCTATAA TCTCACGTCT TTTTTTATTA 14 340 

TGACGATAAT CTGCATATAT CTTCCCTTCA GGGTAATTAG GAATCCCAGC CAAAACAGAG 14400 

ACTTCATGCC CTTTTCGAAC TAAATCTTCA CAAATATCTG ACAACCTGAA TGCTTC7GGC 144 60 

TTATAATGTT GGCAAACAAA TAGT ATTTTC AT7GTCCAAT TTAACTTTCT 7TCTTACCAC 14 520 
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TACCCTCTAC 
TATCCATTAT 
CAGACAAAGT 
TTGCTCCATA 
GACCTACAAT 
ATGTTTTTCG 
GAGGCGCCAC 
ATTC7TTATG 
TAATCGCAAT 
ATAATATATC 
TATTTCCATT 
C AG AAACTC A 
CTACGTCAGA 
GTTCTCAATA 
TTTGAAGACA 
ATGCTCTGTA 
TTTTT ACC AT 
T7ATTTAAAA 
TGGATATCAC 
TGAGTGATAA 
TCAAAATATT 
GTAGGGTTTG 
CATAAACCGT 
TCACGAGATT 
GTTTTATAGC 
TCCCCAGGGT 
TTTGTACACA 
CCTATTTC7A 
CCCAAAAGTG 
AAAAGTTCAA 



AATACCTTTT 
CAAAGACAGA 
ATCACCCCCG 
CTTATCTCTC 
ACTCATATTA 
CAAGAAAGCC 
ATTTTTAGGT 
AATACCAAAG 
TGCAATTATC 
ACCTAATCGT 
TTTCATTCTA 
ATATATATTT 
TAAAACTTTT 
ATGAATTAGA 
TATCGGTTTC 
AGAATGCCAG 
AATTTCCATA 
CAACTCCTAG 
GTTTATTCGC 
TTGCCGCATC 
TACGCAATGT 
GTGATACAGA 
GAGATAAATC 
TAAAAACTCC 
CTGCACGCGC 
TAACAGAAGT 
AGGCATTGTA 
ATGTCGGCAT 
TCATCTGCAA 
GATGAAGAAC 



CGTTTCAGTA 
TCTTTAACAT 
TTAATTTGTG 
TCTGCAATCA 
CCAACAAGAA 
CCTACTTTTG 
GCATCTATTT 
CGTTTTTGCT 
ATAAAAACCG 
TTTATTATAC 
TTTCCATTTG 
GGTCACTCAA 
CTCCATCTAA 
AGGTCCAGTT 
TGAAATTGCA 
AGTTATATTG 
GG AACCGTAA 
G AAC AGTTTC 
CTCACCTGTT 
AATAACAATT 
TTCAATCATA 
TCCCGATTGA 
AGCTGTCCCA 
TAACATAACT 
AAACGACCAT 
AACGGAAATT 
ATATTCTTCT 
CCTTCTCTCC 
TGTATTTTCG 
TATAACACTA 
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CGTAAGGTAT 

AGTAGCCATC 

CCCATCCACT 

AATCTAGTTC 

TATTAAACAA 

TAATCyATTG 

TCATAGACCT 

TAAATATAAC 

GACACAATAT 

CGTACATAAA 

ACAAATTAAA 

TGATTTTCAG 

ACAAAATTTA 

CAATTATTCT 

ATCAGTACAT 

TAGTCCCCTT 

ACTCCATACT 

CCTGTTTGTT 

GCTGTTACCA 

CCAATACGCC 

TCATTAAAAT 

ACTACAAATA 

GATAAAAATT 

GAATTTCGAG 

GCTATATTTA 

ACTT7TAGTT 

GCCTTCTTAA 

TATTTCAACT 

ATATCTTCCG 

CTTCCAATCA 



7GTCTTAACT 
TAACTCCGTC 
TAACCCTGCC 
ATTTATACCC 
TTGTCGTAGT 
CTCTGGATTA 
AAATTTCAAA 
CGGACCTTCT 
TATTATCCCT 
CAACCTCCAA 
7CAGGCAGTA 
AAATATAATT 
TTTGTTTCAG 
TCCAAATAGA 
AAGCTAATAA 
CC AT ACT AT A 
TATTAACCGA 
TTAATTCTTG 
AGATGCACGC 
GTGTATCAAT 
TTTTACTTTG 
AATTTTCAAT 
CTGTTAGCCC 
TATCGCCATC 
TGGAAGTAGT 
TATCTCCGCT 
TGAACTCCAG 
TACCCAAGTT 
GACGTTTCAC 
CCCCTGCCAA 



ATACATCTAA 
TTCATCTCAA 
AAGATATCAT 
GCTGGTCTAG 
TCATCCAAAG 
TATAAGTTTC 
ATATAGAAGT 
GAATCAAGTT 
ATTAAAGATA 
CTATAAATTC 
CATGCAACTA 
CTTTTATCCT 
TAATATATGA 
CCGAATATTA 
ACTGATAAGT 
TTCATTTTAT 
GATATCCAAT 
"*TT C G GrT"^ T 
ATCACACTTT 
AATGATATAA 
TAACAAGGCT 
ATTTCTATCA 
TGTAATT1 IT 
GATCAAAAGA 
7GTTTTTCCT 
CAACTGTATA 
TTTTTTTTGT 
TGGCACAACT 
ACGAGTATCC 
AAAACCAATT 
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AGTCTATTGC CTTTAATATT TGGCGAACAC GGGGATATCG CCGGCCTTGC CTCCTCCAGT 
G7TGTCACGT CAGAAACACG AGTAATACTG ATAATTTTTT GAGCACCTAC TTCTCTCAAA 
GAGTTAGCGA TACGGCTTGC CTCTTCAGGA ACTCGATCAT TAACTGAAAT AGAGACAATA 
CGGGTATCAA CTGGTACTGT CACTTTAATT TTATTAGCCA AACCTTTTGG CCTCAAATCT 
AGTTTCAAAT CAGAAACAAC TTCCTCCAAA ACATCCTGCG AAAGGATAAT CTCACGGTAG 
TCTTTTACCA GATAAGTTCC TGCCTGCAAA TCCTGATTTG TCAACCCCGG CTTCTCTCCT 
TCATTGCGAT TCACTACGTA AATTCGCGTG GTACTCGTAT ATTCTGGCTT AACAATAAAA 
GTGCTATATG CAAAAGCCCC CGCACCTCTC ACAAGTGCCA CTATTAAAAT CATTAGCTTG 
CGTTTCCACA AGCTTTTAAC TAATTGAAAT ACATCGATTT CTATCGTATT TTGTTCTTTC 
ATCATTTCTC CTAAATTAGT TGATCCATTA CAATTTTTCG AGGATTGTCT ATAAAAAGTT 
CCTGACCCTT CCCTTCTCCG TATTTTTGGG TAACAAGGTC ATATGCTTCT GCCATATGAC 
GAGGTCTACC GTCTAGATTG TGCATATCAC TTGCAATGAC ATGAACCAAA TCCTGCTCTA 
AAAAATACTG AGCTCTTTTT TTCATGAATT TATAACGTTC GCCAAAAAGT TTGGGTTTGA 
GGACATGTGA ACTATTTACT TGCGTGTAAC AGCCCATATC GATCAGTTCT CGAACGCGTT 
TTTCATTATT TTCAAGAGCA TCATAGCGCT CAATGTGGGC AATGACTGGA GTAATTCCCA 
ACATCAAGAT CTTGCTCAAG GCGCTATGAA TATCGCGATA AGGAGTGTTC ATACTAAACT 
CTATCAACGC ATAACCACTA TCATTGAGGG TCGGAATCCG CTTTTTTTCC ACCTTATCCA 
GAACATCTGG TGTGTAATAA ATTTCACCCC CGTAAGCAAT GACCAAGTCA CTCGCCACTT 
CCTTAGCTAT TTCCCGAACC TGAAGAAAGT TTTCTGCTAT CTTCTCTTCC GGAGTTTCAA 
ACATGCCCTT GCGACGGTGA GAGGTAGAAA CAATGGTTCG CACCCCCTGT CTGTAGCATT 
CTGCCAAGAG AGCCTTGCTT TCCTCTCTTG ACTTGGGACC GTCATCTACA TCAAAAACCA 
TATGCGAATC GATGTCTATC ATTTCATCTA CCCTCCATCA CATCCTGTAT AGCTGCTTTA 
ACTACAGCTA AACTACTATC ATCTATTTCC ATCACATAGA CCTTACTCTC TGGCATTGCA 
TAAGAAGGAA GATCCATCCG ACCTCTCCCT TTTAAATCTT GAGAATTTAC TTTATAATTC 
CCTCCACTTT CTAACTGAGC ATTGACCAAA TTTATCATGC TCTCAAGTGG CATATTTGTT 
TGGATAGAAT CTTGCAACCT ATTAATGATC GTACTATAAT TTTTCAGCAC TTCGGTTGAC 
GTTAATTTTT GAAGGATAGC CACAATCACC TTTTGTTGAT GGCGCCCGCG GTCACGATCG 
CCATCTGCTA GGGAGTAGCC CTCACGAACA AAACCGAGAG CCTGTTCTGA ATCAAGATGA 
ACATTGCCTG CAGGGTAATA CTTTCCATTC GTATGGGCAG TAAATTCTTG ATCATTATAA 
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ACATCAATTC CACCCAACAA ATCAATCAAT TTCAAAAACG AAGTGAAGTT CAATCGCACA 13120 

TAGTAATTGA TATCCACTCC ATAGAGATTT TCTAAGGTGT GAATGGACGA ATCAACTCCA 18180 

TAAATGCCCG CATCAGTCAA TTTATCTTTT TGATTATTTC CACCATCTGC GATTGGTACA 13240 

TAGGCATCAC GTGGCGTTGT GGTCAAGAGG ATTTTCTTGG TATCTCGATT GACAC7CATC 18300 

AGGATGTTGA CATCTGATCG CGACACCGAA CTAATAGGAC CATAGGTGTC AATTCCACTA 13360 

ACATAGATAT TGAAAGACTG ACTCTTACAC GTCTTAGGAG CTTCTACTTT TTTAGTGAAT 184 20 

CCCTTAGTAT AAATCTTTTT TATCTTCGAT GCGTAGTCTG GATACTCTGA CTCGATGATG 134 80 

7TTTCAAAGA CACTATTTAG GACAATGGCC TTAGTCTCCC CTGCAATCAA ACTCTTGTAA 13540 

CCTGCCAAGT AAGACGAACT CTCGTTCACC GTCAAATCGG TATTCTGACT TGACTTGATA 18600 

TCAGCTAGTA ATTTCTCAAT ATTTTCATTA TTAGTCCCAG TCGGTGCTGT CACACTCGTC 13660 

AGTTGCGTAA CATTTTCGAT CTCACTATCT GCTAAAACAG CGACACTGAT TGAATATTCT 18720 

GAGTAATTAG AAGTCGCATT TAAACGATTG GTCAGTCCAA CAAACTCCTC TACTGCAAAG 13780 

AGCGACACAG AGCTGACAAG GATAGAGAAC ACCAACACAA AAATACTAAA CTTTTCAGCT 18840 

TTTTTATAGA TAATCAAGAC TAGCCCTACC AAGGCAACTA GTAGGACTAA CGCAGTTACC 18900 

ACTAGATTAA GATATCTAAA AGCAAGGATA TTCTACTTAA AGATTAAGAA CAATAAAAAA 18 9 60 

CAAACTAACA ATAAATAAAT AGTCAGCAAA AC TAT ATT AA CACTTCGCTT CACTTTCTGT 19020 

GAACGTGATT TTTTAAAACG TCTACTCATG ATTAATACCT ATACATTGAA CATTATACGA 19080 

TTATATCACT TTTTTACGGT AATGTCTACA CCTTTATTTT TACTATCTGC ATCTTTAAGT 19140 

ATCTTAGTAG ACTTCCCGCC AAACAAAAAT ATAGTAAAAT GAAATAACAA CAGAACAAAT 19200 

CC7TCAGGAC AGTCAAATCG ATTTCTAACA ATGTTTTAGA AGCAGAGGTG 1925C 
(2) INFORMATION FOR SEQ ID NO: 36: 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

AAAGTTGAAA GACTGCTAGC TGTTTTTGAT ACCAATCGTT TCCAACTACA GAGCAAACAG 60 

TATACAAAGT TTCTTTTTGC ATGTAAGCTT CTTGATGGAC AATTCCAAGA AAATCAAGAA 120 

ATTGCTGACC TTCAATTTTT TGCCATTGAC CAACTGCCGA ACTTATCTGA AAAACGCATT 180 

ACCAAGGACC AAATAGAGCT TCTTTGCCAC GTTTATCAAG GTCATAGGGG GCAATATCTT 2 40 
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GACTAAGAAG ATGATTATCG TATTTCTAAA TCCATTTTTA ACAAC7ACCA TGGTATAATA 300 

ATATGCAGGA AAATTTTGAA TTATGAGGAA GACTAGATCA ATTTATGGGA TATTTT C TTT 3 60 

ACGACTCAGG CAACCGAGCC GCCCAAATTT GACCTTTTTT GGTATGTTAG CCTATTTACC 4 20 

CTCTTAGCCT TAACCTTTTA TACAGCCCAT CGCTATCCTG AAAAGAAGGT TTACCAACGA 4 80 

TTTTTCCAAA TCTTGCAGAC TGTTCAGTTA ATCCTTCTTT ATGGTTCGTA CTGGGTCAAT 54 0 

CATATGCCAC TGTCAGAAAG CCTACCCTTT TACCATTGCC GTATGGCTAT GTTTGTGGTA 600 

CTCTTGCTTC CTGGTCAATC CAAATATAAA CAATACTTTG CATTATTGGG AACATTTGGG 660 

AC ATT AG C AG CCTTTGTTTA TCCACTGCCA GATGCTTACC CTTTTCCACA TATCACCATT 720 

CTA7CCTTTA TC7TTGGTCA TTTAGCACTC TTGGGGAACT CTCTAGTTTA TCTATTGAGA 780 

CAGTATAATG CGCGATTGCT GGATGTGAAG GGAATTTTTC TCATGACCTT TGCCCTAAAT 84 0 

GCCTTGATTT TTGTGGTCAA TTTGGTGACA GGTGGCGATT ACGGATTTTT GACAAAACCG 900 

CCATTCGTTG GGGATCACGG TCTAGTAGCT AATTATTTAC TTGTTTCAAT TGTGCTGGTA 960 

GCTACTATCA CTTTCACTAA GAAAATCTTA GAATTCTTTT TAGCTCAAGA AGCAGAAAAA 1020 

ATCATTGCAA AGGAACCTTA ACACAGAGCT TTCTTTTTTG CTCTTAGAGA GTTTTTACAA 1080 

GCAGCTTATA AAATAAGAAT TTCTGAATAG ACAAACTCAA AAAATGGCTG GCAAATTTAG L140 

GAAAAAAGCA AGCACGATTA AATTTTTTGT GTTATAATAT TTTGTGAATA GCTATGCCTA L200 

TGTTTAGCTA TGGAATAATA CGAAGTGCGA AACTTGGAAG ATAGAGAGGA ACCGATCTAA 12 60 

TGCCTAGAGA ACGCTTTTTT ACAGGTCTAG ATATTGCAAC AAGCTCTGTC AAGGTGCTTG 13 2C 

TGGCCGACCA GAGAAA7CCT GAATTAAATG TAATTGGCGT GAGTAATGCC AAAAGTAAAG 13 80 

GTGTAAAGGA TGGAATTATT GTTCATATTG ATGCAGCACC AACTGCTATC AAGTCAGCCA 144 0 

TTTCCCAAGC GGAAGAAAAG GCAGGCATTT CGATTAAATC AGTGAATGTC GGCTTGCCTG 1500 

GTAATCTTTT GCAGGTAGAA CCAACTCAGG GGATGATTCC ACTAACATCT GA7ACTAACG 15 60 

AAATTACGGA TCAAGATGTT GAAAATGTTG TCAAATCACC TTTGACAAAG AGTATCACAC 1620 

CTGACCGTGA AGTCATTACC TTTATTCCTG AAGAATTTAT 7CTGCATGGT TTCCAACGGA 1680 

TTCGTGACCC ACGTGGCATG ATGGGCGTTC GCCTTGAAAT GCGTGGTTTG CTTTATACAG 1740 

GACCTCGTAC TATCTTGCAC AATTTGCGTA AGACGGTTGA GCGTGCAGGT GTTCAGCTTG 1300 

AAAATGTTAT CATTTCACCA CTAGCAATGG TTCAGTCTGT TTTCAACCAA GGGGAACGTG 18 60 

AATTTGGTGC TACAGTGATT GATATGGGGG CAGGTCAAAC GACTGTCGCT ACAATCCGTA 19 20 

ATCAAGAACT CCAGTTCACA CATATTCTCC AAGAAGGTGG AGATTATGTA ACTAAAGATA 19 80 
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TCTCCAAGGT TTTGAAAACC TCTCGCAAAT TAGCGGAAGG CTTGAAACTG AATTACGCGG 20-10 

AAGCC7ATCC GCCTCTTGCA AGCAAAGAAA CCTTCCAACT AGAGGTTATT GGAGAAGTAC 2100 

AAGCAGTCGA AGTCACGCAA GCCTACTTCT CAGAAATTAT TTCTGCACGA ATCAAGCACA 2160 

TCCTTGAACA AATCAAGCAA GAATTAGATA GAAGGCGTCT ATTGGACCTC CCTGCTCCTA 2220 

TTGTCTTAAT CCG7GGGAAT GCCATTTTAC CAGGTATGGT TGACCTTGCT CAGGAAGTCT 2280 

TTGGCGTCCG TGTCAAGCTT TATGTTCCAA ATCAAGTTGG TATCCGTAAT CCAGCCTTTG 2 3 40 

CGCATGTGAT TAGT77ATCA GAATTTGCGG GTCAATTAAC AGAAGTTAAT C77TTGGCTC 2 400 

AGGGAGCGAT AAAAGGTGAG AA7GACTTAA GTCATCACCC AATTAGTTTT GGTGGGATGC 24 60 

TGCAAAAAAC AGCTCACTTT GTACAATCAA CGCCTGTTCA ACCAGCTCCT GCTCCAGAAC 2520 

TAGAGCCGGT GGCGCCTACA GAACCAATGG CGGATTTCCA ACAAGCTTCA CAAAATAAAC 2530 

CGAAATTAGC AGATCGTTTC CGTGGATTGA TCGGAAGCAT GTTTGACGAA TAAAGAGGAA 2 64 0 

AAATAAATTA TCACATTTTC ATTTGATACA GCTGCTGCTC AAGGGGCAGT GATTAAAGTA 2700 

A7TCG7C7CC GTGGAGGTCG TGGCAATGCC ATCAACCGTA TGGTCGACCA AGGTGTTACA 2 760 

GGCGTAGAAT TTATCGCAGC AAACACAGAT GTACAAGCAT TGAGTAGTAC AAAAGCTGAC 2 320 

ACTGTTATTC AGTTGGGACC TAAATTGACT CGTGGTTTGG GTGCAGGAGG TCAACCTGAG 2880 

GTTGGTCGTA AAGCCGCTGA AGAAAGCGAA GAAACACTGA CGGAAGCTAT TAGTGGTGCC 2940 

GATATGGTCT TCATCACTGC TGGTATGGGA GGAGGCTCTC GAACTCCAGC TGCTCCTCTT 3000 

ATTGCTCGTA TCCCCAAAGA TTTAGGTCCG CTTACAGTTG GTGTTGTAAC ACG7CCCT7T 3060 

GGTTTTGAAG GAAC7AAGCC TGGACAATTT GCTGTAGAAG GAA7CAA7CA ACTTCGTGAG 3120 

CATGTAGACA CTCTATTGAT TATCTCAAAC AACAATTTGC TTGAAATTGT TGATAAGAAA 3180 

ACACCGCTTT TGGAGGCTCT TAGCGAAGCG GATAACGTTC 7TCC7CAACG 7CT7CAAGGG 3240 

ATTACCCATT TGATTACCAA TCCAGGATTG ATTAACCTTG ACTTTGCCGA TGTGAAAACG 3 3CC 

GTAATGGCAA ACAAAGGGAA TGCTCTTATG GGTATTGGTA TCGGTAGTGG AGAACAACC7 3 3 60 

GTGGTAGAAG CCCCACCTAA GGCAATCTAT TCACCACTTC TTGAAACAAC 7A7TCACCG7 342C 

GCTGAGGATG 77ATCG7CAA CGTTACTCGT GGTCTTGACT 7AACC77GAT TGAGGCAGAA 34 80 

GAGGCTTCAC AAA7TG7GAA CCAGGCAGCA GG7CAAGGAG TCAACATCTG GCTCGCTAC7 3 54C 

TCAATTGATG AAAGTATGCG TGATGAAATT CGTGTAACAG TTGTTCCAAC GGGTGTTCGT 3600 

CAAGACCGCG TAGAAAAGGT TGTGGCTCCA CAAGCTAGAT CTGCTAC7AA C7ACCGTGAG 3660 

ACAGTGAAAC CACCTCATTC ACATGGCTTT GATCGTCAT7 T7GA7ATGGC AGAAACAGTT 372C 

GAATTGCCAA AACAAAA7CC ACGTCGTTTG GAACCAACTC AGGCATC7GC T7TTGG7GA7 37 30 
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TGGGATCTTC 


GCCGTGAATC 


GATTGTTCGT 


ACAACAGATT 


CAGTCGTTTC 


TCCAGTCGAG 


3840 


CCCTTTGAAG 


CCCCAA77TC 


ACAAGATGAA 


GATGAATTCG 


ATACACCTCC 


ATTTTTCAAA 


3900 


AATCGTTAAG 


TAAATGAATG 


TAAAAGAAAA 


TACAGAACTT 


GTTTTTCGAG 


AAGTTGCAGA 


3960 


GGCTAGTCTG 


AGTGCTCATC 


GAGAGAGTGG 


TTCCGTCTC7 


GTCATTGCAG 


TTACCAAGTA 




TGTAGATGTA 


CCGACAGCGG 


AAGCCTTGCT 


TCCGCTAGGT 


GTCCATCATA 


TCGG7GAAAA 


4 080 


TCGTGTAGA7 


AAGTTTCTGG 


AAAAATATGA 


AGCTTTAAAA 


GATCGAGATG 


TGAC7TGGCA 


4140 


777CATTGG7 


ACCTTGCAAA 


GACGTAAGGT 


GAAAGATGTC 


ATTCAATACG 


77GATTA7TT 


4 200 


CCATGCATTG 


GACTCAGTAA 


AGCTAGCAGG 


GGAAATTCAA 


AAAAGAAGTG 


ACCGAGTCAT 


4 260 


CAAGTGTTTC 


CTTCAAGTAA 


ATATTTCTAA 


AGAAGAAAGC 


AAACACGGT7 


77TCGAGAGA 


4 320 


GGAACTGCTG 


GAAATCTTGC 


CAGAGTTAGC 


CAGACTAGAT 


AAGATTGAAT 


ATGTTGGTT7 


4 3 80 


AATGACGATG 


GCACCTTTTG 


AGGCTAGCAG 


TGAGCAGTTG 


AAAGAGATTT 


TCAAGGCGGC 


4440 


CCAACATTTA 


CAAAGAGAAA 


TTCAAGAGAA 


ACAAATTCCA 


AATATGCCTA 


TGACCGAG77 


4 500 


AAGTATGGGA 


ATGAG7CGTG 


ATTATAAAGA 


AGCGATTCAA 


TTCGGTTCCA 


C7TTTGTTCG 


4 560 


TA7AGCTACA 


TCATTTTTTA 


AGTAGGAGAG 


AACCATGTCT 


TTAAAAGATA 


GATTCGATAG 


4 620 


ATTTATAGAT 


TATTTTACGG 


AGGATGAGGA 


TTCAAGTCTC 


CCTTATGAAA 


AAAGAGATGA 


4680 


GCCTGTGTTT 


ACTTCAGTAA 


ATTCTTCACA 


GGAACCGCCT 


CTCCCAATGA 


ATCAACCTTC 


4740 


ACAGTCGGCT 


GGCACAAAAC 


AGAACAATAT 


CACCAGACTT 


CATGCAACAC 


AACAGGAATT 


4800 


CGCAAATCAG 


AGTCAGCGTG 


CAACGGATAA 


GGTCAT7A7A 


GATGTTCGTT 


A7CCTAGAAA 


4860 


ATATGAGGAT 


GCAACAGAAA 


TTGTTGATT7 


ATTGGCAGGA 


AACGAAAG7A 


7CTTGATTGA 


4920 


7777CAC7AT 


A TO AC AG AC G 


7CCACCC7CC 


TCGTT'jTTTG 


GAC7A777GG 


ATGGAGCT7G 


4980 


TCATGTTTTA 


GCTGGAAATT 


TGAAAAAGGT 


AGCTTCTACC 


ATGTA7TTGT 


TGACACCAGT 


5040 


GAACCT7A77 


GTAAATGTTG 


AAGATATCCG 


TTTACCAGAT 


GAAGATCAAC 


AGGGTGAGT7 


5100 


CGGTTTTGAT 


ATGAAGCGAA 


ATAGAGTACG 


ATAATGATT7 


TTTTAATTCG 


7A7GATTTAT 


5160 


AATGCAGTGG 


ATATTTACTC 


CCTGATTTTG 


GTAGCCTTCG 


CTGTCATGTC 


TTGG7TTCCA 


5220 


GG7CCCTACG 


AATCCAGTTT 


AGGTCGTTGG 


A77G7ACCC7 


TGGTGAAACC 


AG7CCTTGCT 


5280 


CCCTTGCAAC 


GCCTGCCTTT 


ACAGATAGCG 


GGTCTTGATT 


7ATCTGTTTG 


GGTTGCGAT7 


5340 


G7T7TCGTTC 


GATTTTTAGG 


AGAAAACCTA 


GTGCGTTTTC 


7GCCGATCA7 


ACGATGAATA 


5400 


AACCCA7TTA 


TCAGCATTTC 


TCCATAGAAG 


ATCGTCCATT 


TCT7GACAAG 


GGAATGGAAT 


5460 


GGATAAAGAA 


GGTAGAAGAT 


AGCTATGCTC 


C7T77T7AAC 


7CC^7"^ , *^A* T, C 


AA7CC7CATC 


5520 
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AGGAGAACCT ATTAAAGATT TTGGCCAAAA CCTATGGTCT TCCTTCTAGC AGTAGTGGGG 5580 

AATTCGTCTC GAGTGAGTAT GTTCGAGTTT TATTATACCC ACATTATTTC CAACCAGAGT 5640 

TTTCAGATTT TGAAATATCT CTCCAGGAAA TTGTGTATTC CAATAAATTT GAACATTTAA 5700 

CGCATGCTAA GATTTTAGGG ACAGTCATCA ATCAATTAGG GATTGAACGG AAACTTTTTG 5760 

GAGATATCCT AGTAGATGAA GAACGGGCGC ACATTATGA? TAXTCAGCAG TTTCTTCTTC 5S20 

TCTTTCAAGA TGGACTAAAG AAAATTCCTC GTATACCTGT TTCGCTGGAG GAACGTCCTT 5880 

TCACCGAGAA AATAGATAAG CTAGAACACT ATCGAGAACT GGATTTATCT GTGTCTAGTT 5940 

TTCGATTAGA TGTTCTTTTA TCAAATGTTT TGAAACTATC TAGGAATCAA GCAAACCAGT 6000 

TGATTGAAAA GAAACTTCTC CAAGTAAATT ATCATGTGGT AGACAAATCA CATTACACTG 6060 

TTCAAGTTGG AGACTTGATT AGTGTGAGAA AATTTGGTCG CTTGAGATTA CTTCAAGATA 6120 

AGGGACAAAC GAAAAAAGAG AAGAAAAAAA TAACCCTCCA GTTATTATTA AGTAAGTGAG 6180 

GAATAGAATC CCAATTACAT CATTAGAAAT AAAGGACAAG ACTTTTGGAA CTCGATTCAG 6240 

AGGTTTTGAT CCAGAAGAAC 7CCATGAATT TTTAGATATT GTGGTTCGTG ATTACGAAGA 6300 

TCTTGTGCGT GCGAATCATG ATAAAAATTT GCGTATTAAG AGTTTAGAAG AGCGTTTGTC 6360 

TTACTT7GAT GAAATAAAAG ATTCATTGAG CCAGTCTCTA TTGATTGCTC AGGATACAGC 6420 

TGAGAGAGTG AAACAGGCGG CGCATGAACC TTCAAACAAT ATCATTCATC AAGCAGAGCA 6480 

AGATGCGCAA CGCTTGTTGG AAGAAGCTAA ATATAAGGCA AACCACATTC TTCGTCAAGC 6540 

AACTCATAAT GCTAAGAAAG TCCCTGTTGA AACAGAAGAA TTGAAGAACA AGAGCCGTGT 66C0 

CTTCCACCAA CGTCTCAAAT CTACAATTGA GAGTCAGTTG CCTATTGTTG AATCTTCAGA 6660 

TTCCGAAGA? ATTCTCCGTC CAACAGCTAC TTATCTTCAA ACCAGTGATG AAGCCTTTAA 6720 

AGAAG7GGTT AGCGAACTAC TTGGAGAACC CAT7CCACCT CCAATTCAAG AAGAACCAAT 67 80 

TGATATGACA CGTCAGTTCT CTCAAGCAGA AATGCCAGAA TTACAAGCTC GTATTGAGGT 6840 

AGCCGATAAA GAATTGTCTG AATTTGAAGC TC AG ATT AAA CAGGAAGTGG AAGCTCCAAC 6900 

TCCTCTACTG ACTCCTCAAG TTGAAGAAGA GCCTCTGCTC ATCCAGTTGG CCCAATGTAT 6960 

GAAGAACCAG AAGTAGCTCC AATGCATCCG ATAGGTCCAA CACCAGCTAC AGAAACTGTT 7020 

GATTCAATAC CGGGATTTGA AGCACCGCAA GAATCTGTTA CAATTTTATA AGAAATATTC 7080 

TGAGAACAAT ATCTTATCCT TATATTTCCA GCGAGCAGGA GATGGTGTGA GTCCTGTAAT 7140 

CCCTATTGAT AAGATTATCC TCTCAAAAAC TCAAGTCTGA AGCTAGTAAG ATTTG AC GTT 7200 

TCCCACGTTA CGGGATAAGA GGGAGAAAGA CTAAATCTTT TTCCCAATAA AGGTGGTACC 7260 

ACGATTTTCG TCCTTTTTGG AAGTCGTGGT TTTTAAT7TC TTATTATTTA TAAAGGAGAT 7 320 
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ACCATGAAAC TCAAACACAC CCTTAATCT? GGGAAAACTG AATTCCCAAT GCGTGCAGGC 7 3 80 

CTTCCTACCA AAGAGCCAGT TTGGCAAAAG CAATGGGAAG ATCCAAAACT TTATCAACGT 7 440 

CGTCAAGAAT TGAACCAAGG AAAACCTCAT TTCACCTTGC ATGATGGCCC TCCATACGCT 7 500 

AACGGAAATA TCCACGTTGG ACATGCTATG AACAAGATTT CAAAAGATAT CATTGTTCGT 7560 

TCTAAGTCTA TGTCAGGAT? TTACGCACCA TTTATTCCTG G7TGGCATAC TCATGGTCTG 7 620 

CCAATCGACC AACTCTTCTC AAAACAAGGT GTCAAACCTA AAGAAATGGA CTTGGTTGAG 7680 

TAC7TGAAAC TTTGCCGTGA GTACGCTCTT TCTCAAGTAG ATAAACAACG TGAAGATTTT 77 40 

AAACGTTTGG GTGTTTCTGG TGACTGGGAA AATCCATATG TGACCTTGAC TCCTGACTAT 7 800 

CAAGCAGCTC AAATTCGTGT ATTTGGTGAG ATGGCTAATA AGGGTTATAT CTACCGTGCT 7 860 

GCTAAGCCAG TTTACTCGTC ATGGTCATCT GAGTCAGCAC TTGCTGAAGC AGAGATTGAA 7920 

TACCATGACT TGGTTTCAAC TTCCCTTTAC TATGCCAACA AGCTAAAAGA TGGCAAACGA 7980 

GTTCTAGATA CAGATACTTA TATCGTTGTC TGGACAACGA CTCCATTTAC CATCACAGCT 804 0 

TCTCGTGGTT TGACGGTTGG TCCAGATATT GATTACGTTT TGGTTCAACC TCCTGGTGAA 8100 

GCTCGTAAGT TTGTCGTTGC TGCTGAATTA TTGACTACCT TGTCTGAGAA ATTTGGCTGG 8160 

GCTGATGTTC AAGTTTTGGA AACTTACCGT GGCCAAGAAC TCAACCACAT CGTAACAGAA 8220 

CACCCATGGG ATACAGCTGT AGAAGAGTTG GTAATTCTTG GTGACCACGT TACGACTGAC 8280 

TC7CGTACAG GTATTGTCCA TACAGCCCCT GG7TTTGGTC AGGACGATTA CAATGTTGGT 8 3 40 

ATTCCTAATA ATCT7GAACT CGCAGTGACT CTTGATCAAC GTGGTATCAT GATGAAGAAT 8400 

GCTGGTCCTG AATTTGAAGG TCAATTCTAT CAAAAGGTAG TTCCAACTGT TATTGAAAAA 84 60 

CTTGGTAACC TTrTTCTTGC CCAAGAAGAA ATCTCTCACT CATATCCATT TCACTGCCCT 3520 

ACTAAGAAAC CAATCATCTG GCCTGCACTT CCACAATCGT TTGCCTCAGT TTCTAAATTC 8 580 

CGTCAAGAAA TCTTGGACGA AATTGAAAAA GTGAAATTCC ACTCAGAATG GGGTAAAGTC 8640 

CCTCTTTACA ATATGATCCG TGACCGTGGT CACTGGCTTA TCTCTCGTCA ACGTGCTTGG 8700 

GGTGTTCCAC TTCCTATCTT CTACGCTGAA GATGGTACAG CTATCATGGT AGCTGAAACT 37 60 

ATTGAACACG TAGCTCAACT TTTTGAAGAA TATGGTTCAA CCATTTCGTC GGAACGTGAT 3820 

GCCAAAGACC TCTTGCCAGA AGGATTTACT CATCCAGGTT CACCAAACGG CGAGTTCAAA 8880 

AAAGAAACTG ATATCATGGA CGTTTGGTTT GACTCAGGTT CATCATGGAA TGGAGTGGTG 894 0 

C7AAACCCTC CTGAATTGAC TTACCCAGCC GACCTTTACC TAGAAGGTTC TGACCAATAC 9000 

CGTGGTTGGT TTAACTCATC ACTTATCACA TCTGTTGCCA ACCATGGCGT AGCACCTTAC 9060 
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AAACAAATCT TGTCACAAGG TTTTGCCCTT GATGGTAAAG GTGACAACAT GTCTAAATCT 9120 

CTTGGAAATA CTATTGCTCC AAGCGATGTT GAAAAACAAT TCGGTGCTGA AATCTTGCCT 9180 

CTCTGGGTAA CAAGTGTTGA C7CAAGCAAT GACGTGCGTA TCTCTATGGA TATCTTGAGC 92 40 

CAAGTTTCTG AAACTTACCG TAAGATTCGT AACACTCTTC GTTTCTTGAT TGCCAATACA 9 300 

TCTGACTTTA ACCCAGCTCA AGATACAGTC GCTTACGATG AGCTTCCTTC AGTTGATAAG 93 60 

TACATGACGA TTCGCTTTAA CCAGCTTGTC AAGACCATTC GTGATGCCTA 7GCAGACTTT 94 20 

GAATTCTTGA CGATCTACAA GGCCTTGGTG AACTTTATCA ACGTTGACTT GTCACCCTTC 9480 

TACCTTGATT TTCCCAAAGA TGTTGTTTAC ATTGAAGGTG CCAAATCACT GGAACGCCGT 9 540 

CAAATGCAGA CTGTCTTCTA TGACATTCTT GTCAAAATCA CCAAACTCTT GACACCAATC 9600 

CTTCCTCACA C7GCGGAACA AATCTGG7CA TATCTTGAGT TTGAAACAGA AGACTTCGTC 96 60 

CAATTGTCAG AATTACCAGA AGTTCAAACT TTTGCTAACC AAGAAGAAAT CTTGGATACA 9 7 20 

TGGGCAGCCT TCATGGACTT TCGTGGACAA GCACAAAAAG CCTTGGAAGA AGCTCGTAAT 9780 

GCAAAAGTTA TCCGTAAATC ACTTCAAGCA CACTTGACAG 7TTATCCAAA TGAAGTTGTG 9940 

AAAACTCTAC TCGAAGCAGT AAACAGCAAT GTAGCACAAC TTTTGATCGT GTCTGAGTTG 9900 

ACCATCGCAG AAGGACCAGC TCCGGAAGC? GCCCTTAGCT TCGAAGATGT AGCCTTCACA 9960 

GTTGAACGTG CTACTCGTGA AGTATGTGAC CGTTGCCGTC GTATCGACCC AACAACAGCA LOO 20 

GAACGCAGCT ACCAGGCAGT TATCTGTGAC CACTGTGCAA GCATCGTAGA AGAAAACTTT 10080 

GCGGAAGCAG TCGCAGAAGG ATTTGAAGAG AAATAAGATT GAAAAGTCTA GGCAAAATTC 10140 

AATTTGAGAA GAAAAGACAA C7AATTTTAT AGTC7ATTAA ACGCA7TCTA TCACGTTTTT 10200 

GAATACC7GA 7ATGA7CCG7 TTTTTATTTA TTTTAAAAAT TTGCGAGGTA TGACTTTTTA 102 60 

TACTCAACAA GAATCAAAGA GAAACTTAGC AAGCTAACAG TAGTAACATA AAAT AGG AAT 103 20 

7TGATATTAC GGATAAGATT GGTAAATAGT GTAATATTTT TACAACAATA AAT77ATATA 10 3 80 

GTTATTTCTG GTTTCTGAAA AGTATTATAT TTTATTTCAT ATTATACAAA TTTTTATTT7 104 4 0 

ATAATATCAG AACATACTTT TTTTAAAAGC AAATATGATA CAATTTTATT TGAAAAAAAT 10500 

AAAAAAGGAG ATTTTATTAT AAAATTAAAA ACACT7CC77 TAATTAGTGC TATCGTCCGT 1356C 

CTTCTGGGAG GAATTTTACT 7C77AT7GG7 CCTTTTGTCT TGTTGGGAAT AGCGGTAAAC 10620 

ACAGCTGCTA CAACTCTTAA TCGAGGAGCT ACTGCAGGGG CTTTTTCAGG TGTAGCCTTA 10680 

CTCTTGAATG CCTTGAAGAT TGCAAATCTT GTTCTTGGTA TCATTGCTA7 TG7T7ACTA7 107 4 C 

AAAGGAGATA AGCGTGTAGG TGCAGCTCCG TC7C7AC7AA TGATTGTTTC TGGTGGAGTT 1080C 

AG7C7CA7TC 7AT7CCG77C TTAGGATGGC TTGGGGCCA7 TTTTGCTATT ATCGGAGGAT 103 6C 
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CTCTATTCCT TTCAACATTC AAGAAATTCA AATCAGAACA ATAAAAGGTA TTTTAGCATG 10920 

AAAAGAACAA AAAAGTTTAT CGGTATAGGA GTAGCTCTAT TATCTCTTTC TCTTCTAGTT 10980 

GCATGTGCAA CATAAAGTTC AAAGAATACT TCAACAAGTA ATGATGAGAA GACAGTAGCA 11040 

ACATCCAATA GTTCAAAAGA AACAATCACT TTCGATACAC CGGTTGTAAC AGACGATGCG 11100 

ATTGAATCAA TACGCACTTA TGCAGATTAT ATAGATCTTT ATAAAAATAT TTTTGATGAT 11160 

TATTTTACTA AAGCTGAGGA AGGTTTCAAA GGCATAGCTA TGGAAAATAA TGACTCGTTT 11220 

ACTAAACTAA AAGAGTCAAC TCAAAAATTA TTCGATGCGC AGAAAAAAAG GTTAAATAAT 112 80 

GAAGATAGAA TAGAAACAAC CAAAAACAAT GTGATTGCCA AACATTGTCA AACAGTCCTT 11340 

TCC T TTTTGG TTTTGACTAG CTTTTTTGTG AAAAATTGTG TAAAATAGAA TAGATAAACG 11400 

AGGGGAAACC TCGGAAAATT TAAAGGAGAA TCCATCTAAT GGTAAAATTG GTTTTTGCTC 114 60 

GCCACGGTGA GTCTGAATGG AACAAAGCTA ACCTTTTCAC TGGTTGGGCT GATGTTGATT 11520 

TGTCTGAAAA ACGTACACAA CAAGCGATTG ACGCTGGTAA ATTGATCAAA CAAGCTGGTA 11580 

TCGAATTTGA CC AAGCTT AC ACTTCAGTAT TGAAACCTGC TATCAAAACA ACTAACTTGG 11640 

CTCTTGAAGC TTCTGACCAA TTGTGCGTTC CAGTTGAAAA ATCATGGCGC TTCAACGAAC 11700 

GTCACTACCG TGGTTTGACT GGTAAAAACA AAGCTGAAGC TGCTGAACAA TTTGCTGATG 117 60 

AGCAAGTTCA CATCTGGCGT CGTTCATACG ATGTATTGCC TCCAAACATG GACCGTGATG 11820 

ATGAGCACTC AGCTCACACA GACCGTCGTT ACCCTTCAC7 TGACGACTCA GTTATCCCAG 11880 

ATGC7CAAAA CTTGAAAGTG ACTTTGGAAC GTGCTCTTCC ATTCTCCGAA GATAAAATCG 11940 

CTCCAGCTCT TAAAGATGGT AAAAACCTAT TCGTAGGAGC TCACGGTAAC TCAATCCGTC 12000 
CCCTTGTAAA AC AC AT C AAA GGTTTGTCAG ATCACGAGAT CATGGACGTG GAAATCCCTA ' 12060 

ACTTCCCACC ATTGGTATTC GAATTCGACG AAAAATTCAA CGTCGTTTCT GAATACTACC 12120 

TTGGAAAATA AAAAATTGTA AGTCTAGAAT TGATTTCTAG GCTTTTTATG TTAGTATGGA 12180 

AGTATGATAA GGAATAAAAA ACAAGATTAT GTACTGGCCT ACAAGCAACC AGCTTCAACC 122 40 

ACTTACATGG GTTGGGAAGA AGAAGCTTTA CCGATAGGCA ATGGTTCTTT AGGAGCAAAA 12 300 

GTATTTGGCC TTATAGGGGC TGAACGGATT CAATTTAATG AAAAAAGTCT CTGGTCTGGA 12 3 60 

GGTCCACTTC CTGATAGTTC AGATTATCAG GGTGGAAATC TTCAGGATCA GTATGTTTTT 124 20 

TTAGCTGAGA TTCGGCAGGC TTTGGAGAAG AGAGATTACA ATCTGGCTAA GGAACTGGCT 12 4 80 

GAGCAGCACC TAATTGGGCC AAAAACGAGT CAATATGGGA CCTATCTGTC TTTTGGGGAT 12 540 

ATTCACATTG AGTTCAGCCA GCAAGGTACG ACTTTGTCTC AGGTGACCGA CTATCAGAGA 12 600 
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CAGCTCAATA TTAGTAAGGC ACTTGCGACG ACTTCTTATG TCTATAAGGG AACGCGATTT 12 660 

GAACGTAAAG CTTTTGCGAG TTTTCCAGAT GATCTCTTGG TTCAATGTTT TACTAAGGAA 12720 

GGGTTGGAAA CTCTAGATTT TACTATAGAA CTATCCTTGA CCTGTGATTT GGCTTCTGAT 12 780 

GGAAAGTATG AGCAGGAAAA ATCTGATTAC AAGGAGTGTA AGTTGGATAT TACTGATTCT 12 8 40 

CATATCTTGA TGAAGGGAAG AGTTAAGGAT AATGATCTGC GGTTTGCTAG TTATCTAGCT 12900 

TGGGAAACGG ATGGAGATAT TACAGTTTGC TCAGATAGGG 7TCAGATATC AGGACCC AGT 12 960 

TATGCCAATC TCTTCTTGGC CGCTAAGACG CATTTTCCCC AAAATCCTGC TAGCAATTAT 1302 0 

CGCAAGAAAC TAGATTTAGA GCAACAGGTG ATAGACTTGG TGGACACAGC TAAAGAAAAG 13 080 

GGCTATACCC AATTGAAATC AAGGCATATC GAGGACTACC AAGCCTTATT CCAGCGTGTT 13140 

CAATTGGATT TGGAAGCTGA TGTTGACGCA TCCACTACAG ATGATTTGTT AAAAAATTAT 132 00 

AACCCACAAG AAGGGCAGGC TTTGGAGGAG CTGTTCTTCC AGTATGGACG GTATTTATTG 13 260 

ATTAGTTCGT CCAGAGACTG CCCAGATGCT CTACCAGCTA ACCTACAGGG AGTCTGGAAT 13 320 

GCCCTCGACA ATCCTCCTTG GAATTCGGAC TATCACTTAA ATGTCAATCT CCACCTGAAT 13 380 

TATTGGCCAG CCTATGTTAC CAATCTCCTA GACACGGTCT TTCCAGTCAT CAACTATGTA 13440 

GATGATTTGC GTGTCTATGG TCGTCTAGCG GCTGTAAAGT ATGCAGGAAT CGTCTCTCAG 13500 

AAAGGTGAGG AGAATGGTTG GTTGGTTCAT ACTCAAGCGA CTCCCTTTGG TTGGACGGCA 13560 

CCTGGTTGGG ATTACTATTG GGGTTGGTCA CCAGCTGCCA ATCCGTGGAT GATGCAAACC 13 620 

GTTTATGAAG CCTATTTATT TTATAGGGAC CAAGACTATC TCAGGGAGAA AATTTATCCC 13 68C 

ATGTTGAGGG AAACGGTTCC TTTTTGGAAT GCCTTTTTAC ATAAGGATCA GGAGGCGCAG 13740 

CGTTGGGTGT CTTCTCCGTC TTATTCCCCA GAACATGGGC CGATTTCGAT TGGCAATACC 13 800 

TATGACCAAT CTCTGATTTG GCAGTTAT7T CATGATTTTA TTCAGGCTGC TCAGCAATTG 13860 

GGACTGGATG AGGACTTGTT GACTGAGGTT AAGGAGAAGT CTGATTTACT AAATCCTTTG 13920 

CAAATCACTC AATCTGGTCG AATCAGGGAG TGGTATGAGG AGGAAGAGCA GTATTTTCAA 13 980 

AATGAGAAAG TGGAGGCCCA GCATCGGCAC CCTTCCCATC TAGTGGGACT CTATCCTGGC 14040 

AATCTCTTTA GCTACAAGGG ACAAGAGTAT ATTGAACCGG CGCGTGCTAG CCTCAATGAT 14100 

CGTGGAGATG GCGGCACAGG CTGCTCCAAG CCTAATAAGA TCAATCTC7G GGCGCGTTTG 14160 

GGAGATGGCA ATCGAGCCCA TAAATTATTG GCAGAGCAGT TAAAGACATC CACCTTCCAA 14 220 

AATCTTTGGT GTAGCCATCC TCCTTTTCAG ATAGATGGTA ATTTTCGTGC TACTAGTGGC 142 80 

ATGGCAGAAA TGTTACTCCA GTCTCATGCA GCTTATCTGG TACCTCTAGC TGCCCTACCT 14 340 

GATGCTTGGT CAACAGGTTC TGTTTCAGGC TTAATGGCAC GTGGACATTT TGAAGTGAGC 14403 
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ATGAGCTCGG AAGATAAAAA AC7CTTACAG TTGACCATTT TATCAAGGAG TGGAGGAGAT 14460 

TTGCGAGTTT CTTATCCAGA TATTGAGAAG AGTGTGATTA AAATGAATCA AG AAAAAAT A 14S20 

AAAGCGAAAT GCATGGGGAA AGATTGTATT TCGGTGGCAA CAGCAGAAGG TGATCTTGTT 14 580 

CAATTTTATT TTTAAGAAGA TGTTATAAGG CAGTAATTTG AAACTGCCTT TTAATAAGGA 14640 

TTTAAGAATA TAAGCAGTTT TCAACTAGTT GAAAAAACGT TATAATGATA ATAGGAAGTA 14 7 00 

ATACT CAATg"aAAA~TCAAAG" AG^CAAACT TgGAAGCTAG CCG C AG GTTG~~ CTC AAAAC AG 147S0~ 

TCTT7TGAGG TTGCAGATGG AAGCTGACGT GCTTTGAACA CAGATTTTCG AGGAG7ATAA 14 820 

TTTGTTTGAT AGAGGGTGGG TCTGATGGCT TATATTCAGA TGAAACACTG TTACAAGCGT 14 880 

TATCAGGTTG GGGACACGGA GATTGTGGCC AATTGTGATG TGAATTTTGA GATTGAAAAG 14 940 

GGGGAGCTGG TTATTATCC7 TGGTGCTTCA GGTGCAGGCA AGTCAACAGT TCTTAACCTT 15000 

CTTGGGGGAA TGGATACCAA TGATGAAGGG GAAATCTGGA TTGATGGTGT TAATATTGCG 15060 

GATTATAGTT CCCACCAGCG CACCAATTAC CGTAGAAATG ATGTGGGGTT TGTTTTTCAG 15120 

TTTTATAATC TAGTTTCTAA TCTGACAGCT AAGGAAAATG TGGAACTGGC TTCTGAAATT 15180 

GTGACAGATG CCTTGAATCC TGATCAGGCC TTGACAGATC TAGGTCTCGC TCATCGTCTC 15240 

AATAACTTTC CAGCCCAGCT TTCTGGAGGG GAGCAACAGC GAGTCTCCAT TGCACGCGCG 15300 

GTAGCCAAAA ATCCTAAAAT TCTCCTTTCT GATGAACCGA CTGGAGCCTT GGATTATCAG 153 60 

ACGGGCAAGC ACGTTTTGAA AATTCTCCAA GACATGTC7C GTCAAAAGGG AGCGACGGTC 15420 

ATCATCGTGA CTCATAATGG AGCTTTGGCG CCCAT7CCTC ATCCCGTCAT TCAAA7CCAC 15480 

GATGCCAGTG TCAAGGATGT GGTCCTCAAC CAGCATCC7C AGGA7ATTGA CAGTTTGGAG 15540 

TACTAGCATG ATCAAGCGAA AAACTTATTG GAAGGACTTA G7TCAGTCCT TCACAGGCTC 15600 

CAAGGGGCGT TTTTTATCCA TCTTGATCCT GATGATGTTG GGATCTCTAG CCTTAGTAGC 15660 

CCTCAAAG7A ACCAGTCCCA ACATGGAGGC GACACCTAAT CCTTATTTAA CAACTGC7CA 15720 

AACCTTGGAT TTGGCAGTCA TGTCTAACTA TGGCTTGGAT CAAGCAGACC AAGAAGAACT 15780 

AAAAC AG AC G GAGGGCGCAG AGGTCGAGTT TCGCTATTTC ACAGATGTGA CTATGGATAA 15840 

TGGGCAGGAT GCCATTCGCC TGTACTCCAA ACCAGACCGA ATTTCAACCT TTCAGCTAAC 15 900 

AAAGGGACGA CTTCC7CAGT CAGACAAGGA AATCGC7TTC GCCAC7CATT TGCAAGGCCA 15960 

ATACAGCGTG GGACAGGAGA TTAG7TTTAA AGAAAAAGAA GAGGGTCATT CCTCTTTAAA 16020 

AGACCATACT T AT AC C ATT A CTCCT7TTGT GGATTCGGCT GAAATCCTCT CCCAGCGAGA 16080 

TA7GGCCTAC GCAGGAAGTG GAAGTGGGAC TCTGACAGCC TATGGGGTGA TTTTACCTAG 16140 



WO 98/18931 



PCTAJS97/19588 



368 



TCAATTTGAT 


CAGAAACTCT 


ACAATATAGC 


TCGTTTGAAA 


TATCAAGATT 


TAGCGGGTTT 


16200 


AAATGCCTTT 


TCATCAGCTT 


ATGAAGAAAA 


ATCCAAGCAA 


CATCAAGAAG 


AGCTTGAACA 


16260 


AATTTTATCA 


GATAATGGCA 


AGGTACGTCT 


GCAACTTTTG 


AAAAAAGAAG 


GACAAGAGTC 


16320 


TCTAGACAAG 


GGGCAAGAGA 


CCCTTGACAA 


GGCTCAGACT 


AATTTGCAGG 


AAGGCAAGCG 


16380 


TCGTTTAGCA 


GCTGCTCAAC 


CTCGTATACA 


GGCTCAAGAA 


ACTCAACTAG 


CCTTGTTTCC 


15440 


TCAAGTTCAG 


AGAGAGCAGG 


CTAGTGCTCA 


ACTTACCCAA 


GCCAAGCAGG 


AATTGGGCAA 


16500 


GGAAGAGGAC 


AAACTAAACC 


AAGCTGAACA 


AAATCTAGCC 


CAAGAAAAGG 


AAAAATTAGA 


16560 


AAAACATCAG 


CAAGTCTTGG 


ATGATTTGGC 


GGAGCCAAGG 


TATCAGGTTT 


ATAATCGTCA 


16620 


GACCATGCCA 


GGTGGTCAGG 


GCTATCTTAT 


GTATAGCAAT 


GCTTCATCCA 


GTATTCGACC 


16680 


AGTGGGCAAT 


ATCTTTCCTG 


TGGTACTTTA 


TGCCGTAGCA 


GCCATGGTGA 


CCTTTACGAC 


16740 


CATGACTCGC 


TTTGTAGACG 


AAGAGCGAAC 


TCATGCAGGG 


ATTTTTAAGG 


CCTTGGGTTA 


16800 


TCGTAGTAAG 


GATATTATCG 


CCAAGTTTCT 


CCTTTATGGA 


CTACTACCTG 


GGACTGTCGG 


16860 


AACGCCTCTA 


GGTAGTATAC 


TTGGTCATTA 


TTTGCTAGCC 


AGTGTAATTT 


CAAGTGTCAT 


16920 


TACAAAAGGC 


ATCGTGGTGG 


GAGAAACTCA 


GATTCAGTTC 


TATTGGACCT 


ATACCTTAC7 


16980 


AGCTTTTGTC 


TTGAGCTTGT 


TGGCGAGTGT 


GTTACCACCC 


TATCTGGTCC 


CTTGGAGGCA 


17040 


ACTTCATGAC 


GAAGCAGCCC 


AGCTTCTACT 


TCCTAAACCT 


CCTGTCAAAG 


GAGCTAAAAT 


17100 


CTTATTGGAG 


CGTATCGGTT 


TTATCTGGCG 


TCGTCTCAGT 


TTTACTCATA 


AGGTAACAGC 


17160 


CCGCAACATC 


TTTCGTTATA 


AGCAGAGAAT 


GTTGATGACA 


ATCTTTGGTG 


TGGCAGGTTC 


17220 


TCTAGCTCTG 


CTC7TTGCAG 


GTTTGGCAAT 


CCAATCTTCT 


CTAGCAGGAG 


7TCCGTCTAA 


17230 


ACAGTTTCAA 


CAAATCCAAC 


AGTATCAGAT 


GCTTGTCTCT 


GAAAATCCTA 


GTGCGACCAA 


17340 


TCAGGACAAG 


GTAGAGCTAG 


CAGAAGTGTT 


GAAAGGGCAG 


GAGATACTAG 


CCTACCAGAA 


17400 


AATCTATTCT 


AAAGCGCTAT 


ACAAGGATT? 


CAAAGGCAAA 


GCTGGTCTTC 


AAAAC ATT AC 


17460 


TCTTATGATG 


ATAGAGAAGG 


AAGATTTGAC 


TCCCTTTATC 


CATCTTCAAC 


ATCATCACCA 


17520 


GGAGCTGACA 


TTAAAACATG 


GCATCGTTAT 


TACAGCTAAA 


CTCGCCCAGC 


TGGCAGGTGT 


•7580 


CAAGGTTGGG 


CAGACTTTAG 


AAATTGAAGG 


TAAGGAACTA 


AAGGTCGTTG 


CTATTACTGA 


17640 


GAACTACGTT 


GGTCACTTTA 


TTTATATGAG 


TCAGGCTAGC 


TATGAGCAAC 


TTTACGCACA 


17700 


GCTACCCCAA 


GCCAACACTT 


ATCTGGTCTC 


ATTAAGGGAT 


ACCAGTGCAA 


CTAGTATCGA 


17760 


AAGTCAGGCG 


GGCTTGCTTA 


TGAATCAATC 


TGCGGTGTCC 


AGCGTTGTCC 


AAAATGCTTC 


17820 


AGCCATTCGA 


CTCTTCGACT 


CTATCGCTAG 


CTCACTCAAT 


CAGACCATGA 


CCATCTTGGT 


17880 


CATCGTATCC 


GTTCTATTAC 


CTATTGTCAT 


CCTTTACAAT 


CTGACCAATA 


TCAACGTAGC 


17940 
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TGAGACAATC 


CGTGAACTCT 


CCACTATCAA 


GGTTCTTGGT 


TTTCATAATA 


ATGAAGTCAC 


18000 


CCTCTACATT 


TACCGTGAGA 


CGATTGTGCT 


CTCCCTTGTG 


GGAATCGTAC 


TTGGTCTGAT 


18060 


AGCTGGTTTC 


TATTTACACC 


AATTTTTGAT 


TCAAATGATT 


TCGCCTGCGA 


CTATTCTCTT 


18120 


TTATCCGCAG 


GTAGGCTGGG 


AAGTCTATGT 


AATCCCAGTG 


GCAGCAGTAA 


GCATCATTTT 


18180 


GACCTTGCTT 


GGTTTCTT C G 


TCAATTATTA 


TCTGAGAAAG 


GTTGATATGT 


TAGAAGCCCT 


18240 


GAAA^CTCTA 


GAGTAAGGTA 


GTTATTTTTA 


GCTGATTGAA 


CTTCTATTTA 


CTAATATTCA 


13300 


AAAATfCTTr 


'ITTTCAAAGA 

Jill 


GCAGGGAACT 


CTTTGTGACA 


GAGCATTTTT 


TCTATAGGGC 


18360 


f^TAGCAGCT 

1*4 oVJV_ i^^_IV_ 4 


GGAATTCCGG 


CTTCGAAGTT 


TGGCTCAGAA 


TTCATATTAT 


CCACGTATTC 


18420 


AACGTAGCGA 


ATCGTATTGT 


CAGTATCGAG 


GACAAAGACT 


GCGCGTGCTA 


ATAGGTGCCA 


18480 


T*P f* GT TG A T G 


AAGAGGGCAT 


AATCGCGCCC 


GAAAGAATGG 


TCAAAGTAGT 


CTGAAAGCAT 


18540 


AATGGCATTG 


TCAAGGCCTT 


CAGCACCGCA 


CCAACGTTTT 


TGAGCAAAAG 


G7AGCTCCAT 


18600 


TGAAACACTC 


AATACGACCG 


TGTTGTCCAG 


TCCAGCCAAT 


TCTTCATTAA 


AACGACGTGT 


18660 


TG AGTTG AG 

* 4 VJ/WJ 1 1 VJAVJ 


GACATCCC^G 


TATCGATAGA 


AGGAACGACA 


C7CAACACT7 


TTTTCTTCCC 


18720 


ATGAAAATGA 


GCC AGACATT 


TTTTAGAAAG 


ATCTGTTGT A 


GTAAGAGAAA 


AATCAAGCGC 


18780 


PTTGTCGCCG 


ACTTGTAGTT 


GTTTACCTGT 


AAAGCTCACA 


GGATTTCCGA 


GAAAAGTTAC 


18840 


GATAGGATAG 


T G G AAT G*TT T 


TTTCTTCCAT 


TTTAGCTGAA 


ACAGTCGGAA 


TTTTCCAATG 


18900 


ATTTGACGCG 


AAAT AT GGGC 


ATAGAAAAAA 


CGCCAGCTCA 


TGTGAGAATG 


ACGTTTTTCA 


18960 




TTCGCAATCC 


TTCAGCAATC 


TTGTCAAGGT 


TGTATTTCAT 


CATCCTGTAC 


19020 


m AGG m GTGGG 




TTGTTCTGCG 


ATAGAGTCAG 


TAAAGATTTG 


AGCGTAGATT 


19080 


GGGATG' pw * w G 


TGTCTTGAGA 


AACAGTTTTC 


ATTGGACGCT 


CATCCACACT 


TGATTCTACA 


19140 


AAG AGTG ATG 


GAACTTTTGT 


TTGGCGAAGT 


TTTTCAACCA 


AGGTCTTGAT 


TTGTTCAGGA 


19200 


GTTCCTTCTT 


CTTCAGTATT 


GATTTCCCAG 


ATGTAAGCAC 


TTGGGACACC 


ATAGGCTTTA 


19260 


GAGAAGTATT 


TGAATGCTCC 


TTCGCTGGTT 


ACAATGACTT 


TCTTTTCAGC 


AGGGATCTTA 


19320 


TTAAATTTAT 


CCTTACTTTC 


TTTATCAAGT 


TTGTCTAACT 


TATCAGTATA 


TTCTTTCACA 


19380 


TTTTTTTCAT 


AGAATTCTTT 


ATTGTTAGGG 


TCTTTGGCGC 


TCAATTGTTT 


GCCGATATTT 


19440 


TTAGCAAAAA 


TAATACCGTT 


TTCAAGGTTA 


AGCCAAGCGT 


GTGGGTCTTC 


TTTTCCTTTT 


19500 


TCATTTTGAC 


CTTCAAGGTA 


GATAACATCA 


ACGCCGTCGC 


TGACTGCGAA 


GTAGTCTTTG 


19560 


7TTTCAGTTT 


TCTTGGCATT 


TTCTACCAAT 


TTTGTAAACC 


AAGCATTGCC 


ACCTGTTTCA 


19620 


AGGTTGATAC 


CGTTATAGAA 


AATCAAATTA 


GCC7CAGAAG 


TTTTCTTAAC 


GTCTTCAGGA 


19680 
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AGTCGTTCCT ATTCGTCTGG GTCTTGCCCA ATCGGAACGA TACTATGAAC CTCAATTTTC 19740 

TCACCAGCAA TATTTTTAGT AATATCAGCG ATGATTGAGT TTCTAGCAAC AACTTTTAGT 19800 

TTTTGACCAC AAGTTGTATC '"' I TTTTTCCG CTAGCACATG CTACAAGAAT GATTGCAGAA 19860 

AGAAAGAGAA CGAGTAATGT ACCTAATTTT TTCATTAGAT CCTCCAATTT ATTAGGCCTT 19920 

TGCCCCTTAT TTTAACAAAT GTTTATTTTT CACTTTCAAA TATCGTTGTT TGGGAGCGAT 19980 

AAAGAACCTA ATGAGAAAGA AACTAGCAGC TGTAAGCACG ATACTACAAC CTGCCGCAAC 20040 

ATTAAAACTA TAGCCAATAA AGAGTCCCAA AACTGAAGCA GTAGCTCCGA AGGTTGAGGA 20100 

AAGGAAAATC ATACTTTTCA GACTATTAGC ATACAGATAA CCAGTTGCAG CTGGGGTAAT 20160 

CAGCATGGCT ACAATCAGGA TAGTTCCGAC ACTTTCCATC GCTCTCACAG ACACGAGAGT 20220 

CAGGAGTACC ATGAGAAGGT AGTGATAGAA ATTGACAGGC ATTCCCATGG CTTTAGCCAA 20280 

CAGTTCATCA AAGGAAGTTA TCAAGAGTTG CTTGAAGAAA ATCCACATTA ACAAGAGGAT 2 0340 

AGCTGCCCCC ACACCCA7AG TAATAAACAT ATCCCTATCT TGGACGGCCA GGATATTACC 20400 

AAAAAGGATA TGGAAAAGGT CAGTTGAACT TTTACCGACA CCAATCAAGA TCATACCGAC 204 60 

GGCTAAGAAA GAAGAAAAGG TAATGCCGAT CCCGGTATCG CTTTTGATAA TCGAGTTTCC 20520 

TTTGATGTAG GTAATGATGA TGCCACCTAC CAATCCAAAG ACAATGGCTC CGATAAAGAA 2 0580 

CTCAACGCCC AAGATGAAGG ATAGGGCTAC ACCTGGTAAG ACACCATGTG AAATGGCATC 20640 

TCCCATGAGT GACATCCCGC GTAGAATAAT GAAACATCCC ACACCTCCAC CTACAATCCC 20700 

GACGACAATA GCTGTTATCA AGGCATTTTG TAGGAAATCG AATTTTTGCA ATCCATCGAT 20760 

AAATTCTCCA ATCATAGGTC ACCTCCATTG AAAAAGACTT GATTACCGTA AGCTTCTTTT 20820 

AGATTGGTTT CGGTAAAAGT TTCTTTTGTT CGACCAAAGG CAATCACTTC TCCATTCACA 20880 

AGTAAGACTT GATCGAAGTA GTGGGGAATC TTGCTGAGGT CGTGGTGAAC GATGAGAACC 20940 

GTCTTCCCAG CTTTTTTCAA ATCTCTCACC GTATTCATGA TGATTTCCTC ACTGACAGAG 21000 

TCAATCCCAG CAAACGGTTC ATCCAAGAGG ATATAGTCGG CTTCCTGCAC CAAACATC7C 21060 

CCAATCAAGA CCCGCTGGAA TTCACCTCCA GACAGTTGAC TAATTTGACG TTCAGCGTAG 21120 

TCAGCTAGGC CGACGATTTC AAGGGCCTCT TGCACTTTCT TCCAATCTTT AGCCTTTAAA 21130 

CTTCGAAAGA GAGGAATAGA GGGAAATAGT CCTAACGAGA CGCATTCCTT GACCTTGATG 21240 

GGAAAGTTGT AGTCGATATT GATTTTTTGT TCGACATAGG CAATTCGGTG TAAGGATTTT 21300 

TTAACTTCCT TGTCATCGAG AAATGCCTGA CCTTGATGTC GGATAATTCC CAACATACCT 2136C 

TTTAATAGTG TTGATTTCCC AGCGCCGTTT GGACCAATGA TGCCGGTAAT TGTTGGTCCA 214 20 

TGGAGCACTA GTGAAATATC CTTAAGTGCC AACGTTTCTT TGTAGGAGAC ACTGAGGTTT 21480 
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TCGATACGTA TCATAAACTT GTATTCCTCC TGTCTCTTAA TATACATTAA AAAAAAAATT 21540 

AAGTCAAGTT AATTTTTGAA AAAATTAAAA TAATAACTCA AAAATAGATT CTAAAGATAA 21600 

CTTTCAGGAT AAATTTCTAA ATTATAAAAC CCATACTATC AAGTGTAAAA AACTTGGAAT 21660 

TATCCC7TTT ATCATGGAAA GATTTTTTAT AATAGCTAAA AAATAA 217C6 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE-CHARACTERISTICS: 

(A) LENGTH: 6171 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(Di TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GATCCCCAGG AAAAACCGAG GTTTTCCCAA TCAATCGTTA CTGTCATATT CCACTCCTTA 60 

TTCTAAAAAC CTATTTCTTA TATTCTACAC TATTTTTCTA AAATAGCAAG TATATTTTGT 120 

AATTTTCAGA AAATTTCTCC AATAAAAACC AACTCTTAGA ACTGATTCTT CATTTCACTT 180 

ATTTATCTTC AGTAACTACT TCCTGAAGAT AAGCGTCAAA AACTTCTTCA TCTGAAATCG 2 40 

TGTCAGAAAT GAAGCTTCCA TTGCTAGTGC GTTCTGACAA GTTCAAGTCT TGCAATCGCC 300 

TTTCATAGAT TGTTCCTTTA TTGGATTGGA CAAGCAGAGT TTGGTCGTTC ACATCCACTT 3 60 

CCGTACTGAA GAAATCGCCA ACAAATCCTT GCTCTGCAAC TCCTCCTGCC AAGAAGACAC 4 20 

GATGCGGTTT GTTTTTCAAC TCACGCAACA CTTGTAATCC TCGTTTGGCA CCGCTGGTTC 4 80 

CTAGAATTTC CTCAATGGAA ACACGTTTCA AGCTTCCACG CTGGCTCAAG AGGTAGAACG 540 

ACGAAGTATT AC AG AT AAA C CCAGATTGGA GG AC AT CATC TTCTTTCAAA TTCATAGCCT 600 

TGACACCTGC TGCCTTAGCA CCGACAACCG GAACCTCTTC GATATTGAAA CGCAGGGCAT 660 

AACCATTTTG ACTAACCAAG ACAACATCAT CTACTTTAAT CGGAGCCACT GCTACAATCT 720 

GATCTGTATC GTCTTTGAGC TTAGCATACT TGACAGACTT AGATCTATAG GTCCGCCATG 7 80 

GAGTGAATTC TTTTCGCTCT ACCCGTTTGA TTTGACCAAG GCGAGTCACT GCAAAGTAGG 840 

TTGTCGCATC GTCAAACTGA TCCAGTACTT CCACATAAAG GATTTCTTCA TTCGTTTCAA 900 

AGTTTGTGAT GGTTTGGCTC AGATGCTCTC CGATGTCCTT CCAACGAATA TCTGCCAACT 960 

CATGGATTGG TCTGTAGATG ACATTTCCAA GACTTGTGAA CATCAAGAGG TGCTGGGTTG 1020 

TCTTGGCAGA TTGAACAAAA ATCAAACGGT CATCATCACG CTTCCCAATT TCTTCCAAGG 10 80 

TGGAAGCCCC AAAGGAACGT GGACTGGTAC GCTTGATGTA ACCTGCCTTG CTCACGCTGA 114C 
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CGTAGGTATC TTCCTCAGCC ATAACACTAG CTGTATCAAT CTCAATTGCT 7TCGCACTGT 1200 

CTTCTAAAGA ACTCAAACGA GGAGTTGCAA ATTTCT7CTT GACCTCACGA AGTTCTTTCT 1260 

TCATGAGATT GTACATAGTC CTTTCATCAC CGATAATAGC CGCCAGCATA GCAATCTTCT 1320 

CACGAAGCTC TGCTTCTTCT TCCTGCAAGA CAACCACATC GGTATTGGTC AAACGGTACA 1380 

GTTGCAAAGT TACGATAGCC TCAGCCTGTT CTTCCGTAAA ATCATAGCTA ACTTTGAGGT 1440 

TTTCCTTGGC GTCCGCCTTA TTCTCAGAAG CACGGATAAG AGCAATGACT TCATCCAAAA 1500 

TCGAAATCAC ACGAATCAAA CCTTCGACCA TATGGAGACG TTTCTCAGCC TTTTCTTTGT 1560 

CAAAGCGTGA ACGCGCCAAA ATCACTTCTC GACGGTGAGC GATATAGCTA GACAGGATTG 1620 

CAACAATCCC AACCTGACGA GGTGTGAAAT TGTCAATCGC CACCATATTA AAGTTGTAGT 1680 

TCATTTGTAG GTCGGTGTAC TTAAATAAGT AGTTCAGAAC AAGCTCAGTA TTAGCGTCTT 1740 

TCTTAAGTTC GATAGCGATA CGAACACCAT CACGGTCAGA CTCATCACCA ACCTCAGCAA 1800 

TCCCACCTAC CTTGTTATTA ACACGAACAT CATCGATTTT CTTCACTACA TTGGCCTTAT 1860 

TGATTTCATA AGGAATCTCA ATAATAACGA TTTGTTCCTT ACCACCTTTT AGCTTTTCAA 19 20 

TTTCAGTCTT GGAACGAACA ACCACGCGCC CTTTCCCAGT CTCATAAGCT TTCTTCATTT 1980 

CATCACGACC CTGAATAATA GCCCCTGTAG GGAAGTCTGG TCCAGGCAAG AATTCCATGA 2040 

GTTTATCAAT CTTTGCAGTT CGGTGGTCAA TCATGTAAAC TGCAGCATCT ATGACCTCAC 2100 

CTAAATTATC GGGAGGAATG TCTGTGGCAT AACCAGCCCA AATCCCAGTC GAACCATTGA 2160 

CCAAGAGGTT TGGAAAGGCT GCTGCCAAGA CCGTTGGTTC TTTCTCCGTA TCGTCAAAGT 22 20 

TCCATGCAAA AGGAACTCTC TTTTTCTCCA TATCC7GAAG AAGGTAGCCT GCAATTTCAG 2280 

ACAAACCTCC CTCAGTATAA CGCATAGCCG CAGGAGGATC TCCGTCCATA GAACCGTTAT 2 340 

TACCCTGCAT TTCAACTAGA ATCTCACCAT TTTTCCAGTT CTGTGACATA CGAACCATGG 24 00 

CATCATAGAT AGAAGAATCC CCGTGTGGGT GGAAATTCCC CATGATGTTC CCGACTGACT 24 60 

7GGCCGACTT ACGGTAGCTC TTGTCAAAAG TATTGCTATC CTTATTCATA GAATAAAGAA 2 520 

TACGGCGCTG AACCGGCTTC AACCCATCAC GAATATCTGG CAAAGCCCGG TCTTCAATAA 25 80 

TCTACTTCGA GTAGCGACCA AAGCGCTCTC CCATGATGTC CTCCAGGGAC ATGTTTTGAA 2 640 

TGTTAGACAT AAGATACAAA GCCCATAAAA TACCAAGTGA AAATAGAAAA TTCTTGAAGT 2700 

AACCAAACTC ACAAGAGAAT TTATCTTTTT C AC AC ACT AT CTAGGGCGTG TTCAACTCCT 27 60 

TTCAAAGAA7 GTAGAGTAGC TTTTTATGCA GTAAAAGATA TTTTACGGGA ATTCCTCCCG 2820 

TGTTCAGTTA CGATAAGTAA CCAAACTATC CTGTTTGTAT TTTTCAATAT GAAAATCTGG 28 80 
TTTTCCAAAA TTAGTCTTAG TTTGTGTCTT AGCCGCTCCC TTAAGCGCCT CTTTGAGATA 2940 



WO 98/18931 



PCT/US97/19588 



373 

ACCACTCATA GCAGATTCTT CATTAATAAT CCTCCAATTT TTTCAAACCA AGATTTTCAA 3000 

ACTGCTTTTT CACATAGTCA TTCACATCCG ACTCTAATTT CCAGTTTACT AACATATTAT 3060 

TTTCTTTCAT TAAAACACTG TCGTTTCTTC TAGCGTAAAC TTGACATTAT CTTCAATCCA 3120 

TTTACGGCGT GGTTCTACCT TATCTCCCAT GAGAACATTG ACGCGGCGTT CGGCGCGCGC 3180 

TAAATCTTCA ATTGTGACAC GGATGAGGGT ACGTGTTTCT GGGTTCATGG TTGTTTCCCA 3 2 40 

GAGCTGGTCC GCATTCATCT CACCAAGTCC TTTGTATCGT TGGAGGGTAG CGCCTTTACC 3 300 

GAACTGTTTA CGGAGTTCTT CTAGTTCTCC GTCCGTCCAA GCCTAGGCCA CTTCTTCTTT 3 360 

CTTGCCTTTA CCTTTGGACA TCTTGTAAAG AGGTGGGAGG GCAATATAGA CATGACCTGC 3 420 

CTCGACTAGC GGACGCATGT AACGGTAGAA AAATGTCAAG AGCAAGGTCT GGA7ATGGGC 3480 

ACCGTCGGTA TCCGCATCGG TCATGATAAT GATCTTATCA TAGTTGGCAT CTTCAATAGA 3540 

CAACTCTGCT CCAACACCCG CACCAATGGT ATAAATCATG GTATTCATCT CTTCATTTTT 3 600 

GAGGATATCC GCCATCTTGG CCTTGGCTGT ATTGACAACC TTACCACGAA GAGGTAGAAT 3 660 

AGCCTGGAAC TTGCGGTCAC GACCTTGTTT GCCACAACCA CCGGCAGAGT CCCCC7CAAC 37 20 

TAGATACACT TCATTCTTAG CAGGATTCTT AGATTGGGC? GGGGTCAATT TCCCAGACAA 37 80 

CAAGCCCTTA TCTTTCTTGT TTTTCTTCCC ATTTCGGCTC TCATCACGCC CCTTACGTGC 3 840 

TGCTTCACGA GCATCACGGG CCTTGATAGC CTTGCGGATG AGGTTAGAAG CTAATTCCCC 3 900 

ATT7TCCATA AGGAAAAAGG TCAACTTATC AGCCACTATT CCATCCACAA CTGGGCGAGC 3 960 

TAGGGGGCTT CCTAGTTTAT CCTTGGTCTG TCCTTCAAAC TGCAAGTGTT CTTCAGGAAC 4020 

TAACATACAA AGAACGCCCG CTACTCCCTC ACGATAGTCT GAACCTTCAA GGTTTTTATC 4080 

T7TTTCCTTG AGAAGACCTG TTTTACGTGC ATAGTCATTC ATGACCTTGG TAATGGCAGA 4140 

CTTGAGTCCT GTCTCGTGCG TTCCACCGTC CTTGGTGCGA ACGTTATTGA CAAAAGATAG 4 200 

AATCTTA7CT GAGAATCCGT CATTGTACTG GAGGGCTACT TCCACTTGAA AACCATTGTC 4 2 60 

TTCCCCTTCA AAGTAAAGAA CTGGCGTCAA GATTTCCT7A TCTTCGT*TGA GATAAGAAAC 4 320 

AAAATCTTGT ACTCCATTCT CATAGTGGAA CTCAATCGCT TCATTTGTTC GCTTGTCCGT 4 380 

TAAAGACAAG GTCACATTTT TCAAGAGAAA GGCTGATTCA TTAAGGCGCT CTGAAATCGT 4 440 

ATTCTACTTG AAATCTGTCG TAGAAAATAT AGTCGCGTCA GGCATAAAAG TAACTTTGGT 4 500 

CCCTGTTTTA GACTTGGGTG CTGTACCGAT TTTCTTCAAA GTCGTGACAG GTTTTCCACC 4 5 60 

ATTTTCGAAA CGTTGCTTGT AAACTGCGCC ATCACGGGTA ATTTCAACTT CTAACCAGCT 4 620 

AGAAAGGGCG TTAACAACGG AAGAACCCAC TCCGTGAAGT CCACCTGATG TCTTATAGCC 4 630 
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ACCTTGACCG AATTTCCCTC CGGCATGAAC AATCGTAAAG ATAACCTCAA CAGTTGGAAT 4 740 

7CCCATAGCG TGCATACcTG TCGGCATCCC ACGTCCATGG TCTTGAACCG TTAGACTACC 4800 

GTCTTTATTG ATAGTTACAT CAATACGATC ACCAAACCCA GACAAGGCTT CATCGACTGC 4860 

ATTATCAACG ATTTCCCAAA CTAGGTGATG AAGACCAGCG CCATCGGTCG ATCCAATATA 4 920 

CATCCCTGGA CGTTTTCGGA CCGCATCCAA CCCTTCTAGC ACCTGAATAG CATCATCATT 4980 

ATAATTGTTA ATATTGATTT CCTTTTTTGA CACAAGGAAC CTCCTATTCG TTCATCTTTA 5040 

CTATTCTACA GGTTTTCCAA GGATTTTGCA AAATTTTTCT TTCTCCGATG TGACAATTTC 5100 

AGCAGAGATT CTCTGCTTTT CTTTCCCAAT TCATGATATA ATAGGAGTAT GATTACAATA 5160 

GTTTTATTAA TCCTAGCCTA TCTGCTGGGT TCGATTCCAT CTGGTCTCTG GATTGGACAA 5220 

GTATTCTTTC AAATCAATCT ACGCGAGCAT GGTTCTGGTA ACACTGGAAC GACCAACACC 52 80 

TTCCGCATTT TAGGTAAGAA AGCTGGTATG GCAACCTTTG TGATTGACTT TTTCAAAGGA 5340 

ACCCTAGCAA CGCTGCTTCC GATTATTTTT CATCTACAAG CCGTTTCTCC TCTCATCTTT 5400 

GGACTTTTGG CTGTTATCGG CCATACCTTC CCTATCTTTC CAGGATTTAA AGCTGGTAAG 5460 

GCTGTCGCAA CCAGTGCTGG AGTGATTTTC GGATTTGCGC CTATCTTCTG TCTCTACCTT 5520 

GCGATTATCT TCTTTGGAGC TCTCTATCTT GGCAGTATCA TTTCACTGTC TAGTGTCACA 5580 

GCATCGATTG CGGCTGTTAT CGGGGTTCTG CTCTTTCCAC TTTTTGGTTT TATCCTGAGT 5640 

AACTATGACT CTCTCTTCAT CGCTATTATC TTAGCACTTG CTAGTTTCAT TATCATTCGT 5700 

CATAAGGACA ATATAGCTCG TATCAAAAAT AAAACTGAAA ATTTGGTCCC TTGGGGATTG 5760 

AACCTAACCC ATCAAGATCC TAAAAAATAA AATGCCAGTT CTGTACTGCC CCCAAACAGT 5820 

TAGACAAATA ATTTATCCAA AGGATTTAGT TCTGTACTGC ACAGGACTAA GTCCTTTTAG 5880 

TTTTACCTTA ATTCGTTTGT TGTTGTAGTA ATCAATATAG TCTATAATGG CTTGTTCCAA 5940 

TTGATTAAGT GATTTAAATG TTTTCTCATA GCCATAAAAC ATTTCCGATT TTAAAATGCC 6000 

AAAGAAAGAT TCCATCCTAC CGTTGTCTTG GCTGTTGCCC TTACGTGACA TGGATGCTTG 6060 

AATTCCCTTA CTCTCTAGGA ACCGATGATA AGAATCGTGT TGGTATTGCC AGCCTTGGTC 6i20 

ACTATGGAGA ATCGTATTCT CCTAGTCCTT CTCTGTGAAT GCCTGTTCCA A 6171 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18475 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(Di TOPOLOGY: linear 
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(xil SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

TATTACAAAT AAAAAAACGC AGGAGTGCTT TATGAAAGCC TATACTTATG TTAAACCAGC 60 

ACTTGCTTCT TTTGTTGATG TAGACAAACC AGTTATTCGC AAGCCAACAG ACGCTATTGT 120 

GCGTATTGTA AAAACCACTA TTTGTGGAAC AGACCTCCAT ATTATCAAAG GGGATGTTCC 130 

TACTTGCCAA AGTGGTACCA TTCTTGGCCA CGAAGGGATT GGGATTGTTG AAGAAGTTGG 240 

GGAAGGAGTT TCCAACTTCA AAAAAGGTGA CAAGGTCTTG A1TTCTTGCG TCTGTGCCTG 300 

TGGTAAATGC TACTACTGTA AAAAAGGAAT TTATGCTCAC TGTGAAGACG AAGGGGGCTG 3 60 

GATTTTCGCT CACTTGATTG ATGGTATGCA GGCTGAATAT CTACGTGTCC CTCATGCAGA 420 

TAATACTCTT TACCATACTC CAGAAGACTT GTCAGATGAA GCTTTGGTTA TCCTGTCAGA 4 80 

CATTCTGCCT ACTGGATATG AAATTCGTC? CTTAAAAGGG AAAGTAGAAC CTGGTTGCAG 540 

CGTAGCCAT? ATTGGTTCAG GTCCAGTTGG ATTGGCTGCT CTTTTAACAG CCCAATTCTA 600 

TTCACCAGCT AAATTGATTA TGGTAGACCT AGACGATAAC CGCTTGGAAA CTGCCCTATC 660 

ATTCCGTCCG ACTCATAAGG TTAATTCTTC AGACCCTGAA AAAGCCATTA AAGAAATTTA 720 

TGATTTGACA GATGGTCGTC GTGTGGATGT CCCTATCGAA GCTGTTGGTA TTCCTGCAAC 7 80 

ATTTGATTTC TGTCAAAAGA TTATCGGTGT AGACGCAACG GTTGCCAACT GTGGTGTGCA 840 

TGGTAAACCA GTTGAATTCG ATTTAGATAA ACTTTGGATT CGCAACATCA ATGTAACAAC 900 

TCCTTTCCTA TCTACAAATA CGACTCCACA ATTGTTGAAA GCACTTGAAA G7CATAAGA? 960 

TGAACCGGAA AAATTGGTAA CTCACTATTT CAAACTCAGT GAAATTGAAA AAGCCTACGA 1020 

AGTCTTCAGT AAGGCAGCAG ACCACCATGC CATTAAGGTC ATTATCGAAA ACGATATCTC 1080 

AGAAGCCTAA GTAGTAAAAA TATTTTTGTA CATAAGTAAA TAGAAATTCA CTCATCCATC 1140 

AGATGGCTGG ATTTTTTATC AAAAAATTAA GAAATGAGCA TATTTCTTTC CTTGTCTGGC 1200 

GGAATTGCTT ATAATATACG GTACAAAGGA ATGAATCAAT ATGTATCCTC TTATAGAAAT 12 60 

GTACGGAGAT TTTGAACCGT GGTGGTTCTT AGAAGGTTGG GAAGAAGATA TTGTAGCAAG 13 2 0 

TAGAAAATTT GACCAGTATT ATGATGCTCT CAAATACTAC AAAACTTGCT CCTTTAGATT 1380 

GGAACAAGAA TCGCCTCTTT ATAAAAGTAG AAGCGACTTG ATGACCATTT TTTGGGACCC 144 0 

GGAAGACCAA CGCTGGTGTG ATGAATGTGA TGAGTATTTA CAACAATACC ATTCTTTGGC 1500 

TCTTTTGCAG GATGAGCAGG TTATCCCAGA CGAAAAACTA CGCTCAGGCT ATGAAAAACA 1560 

AACCAG7CAG GAAAGGAATC GTTCTTGCCC TATGAAATTA AAATAGAGAA AAGTAACTTT 1620 

TTTCGAGTTG CTTTTTTTAT TTTTCTAACT CTTTCCCAAT AGTATAGGTG AGGAGGTAAG 1680 
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TATGGTTCAA CAAATTCCAC AACAAATCAT TCGTTCAGCT CGGAAAAAAG GGACGCAGGA 174 0 

TATCTATTTT GTCCCTAAGT TAGACGCCTA TGAGCTTCAT ATGACGCTAG GAGACGAGCC 1800 

CTGTAAAATT GGTAGCTATG ATTTTGAAAA GTTTGCAGCC GTTATCAGTC ACTTTAAGTT 1860 

TGTGGCGGGT ATGAATGTGG GAGAAAAAAG ACGTAGTCAA CTGGGTTCCT GTCATTATGC 132 0 

CTATGACCAT AAGATAGCGT CTCTACGTTT ATCTACTCTA GGCGATTATC GGGGGCATGA 1980 

GAGTTTGGTT ATCCGTTTGT TGCACGATGA GGAGCAGGAC. CTGCATTTTT GGTTTCAGGA 2040 

TATTCAACAA TTAGCCAACC AGTACAGGCA ACGGGGACTC TATCTTTTTG CTGCTCCGGT 2100 

TGGGAGTGGT AAGACGACCT TGATGCATGA ATTGTCCAAG TCACTCTT7A AAGGACAGCA 2160 

AGTTA7CTCC ATCGAAGATC CTGTCGAAAT CAAGCAGGAC GACATGCTTC AGTTGCAGTT 22 20 

GAACGAAGCA ATCGGCCTAA CCTATGAAAA TCTAATCAAA CTTTCCTTGC GTCATCGACC 2280 

AGATCTCTTG ATTATCGGAG AAATTCGTGA CAGCGAGACG GCGCGTGCAG TGGTCAGAGC 2 340 

T AGTTTG AC A GGTGCGACAG TCTTTTCAAC CATTCACGCC AAGAGTATCC GAGGTGTTTA 2400 

TGACCGTCTG CTGGAGTTGG GTGTGAGTGA AGAAGAATTC GCAGTTGTTC TGCAAGGAGT 24 60 

CTGCTACCAG AGATTAATCG GGGGAGGAGG AATCGTTGAC TTTGCAAGCA GAGATTATCA 2520 

AGAACACCAA GCAGCCAAGT GGAATGAGCA AATTGACCAG CTTCTTAAAG ATGGACATAT 258C 

CACAAGTCTT CAGGCTGAGA CCGAAAAAA7 TAGCTACAGC TAACCAAAAA AATATCATCA 2 640 

CCCTATTTAA CAATCTCTTT TCTAGCGGTT TTCATCTGGT GGAGACTATC TCCTTTTTAG 2700 

ATAGGAGTGC TTTCTTCGAC AAGCAGTGTG TGACCCAGAT GCGTGTCGGC TTCTCTCAGC 27 60 

GGAAATCATT CTCAGAAATG ATGGAAAGTT TGGGATGTTC AAGTGCTAT7 GTCACTCAGT 2820 

TATCCCTAGC TGAAGTTCAT CGCAATCTCC ACCTGAGTTT GGGAAAGATA GAAGAATATC 2 880 

TGGACAATCT GGCTAAGGTC AAGAAAAAAT TGATTCAACT AGCGACCTAT CCCTTGATTT 2 940 

TGCTGGGTTT TCTTCTCTTA ATTATGCTGG GCCTACGGAA TTACCTGCTC CCACAACTGG 3 000 

ATAGTAGCAA TATTGCCACC CAAATTATCG GTAATCTGCC CCAAATTTTT CTAGGCATGG 30 60 

TAGGGCTTGT TTCCGTCCTT GCCCTTTTAG CACTCACTTT TTATAAAAGA AGTTCTAACA 3120 

TGACTGTCTT TTCTATCTTA GCACGCCTTC CCTTTATTCG AATCTTTGTG CAGACCTACT 3180 

TGACAGCCTA TTATGCACGT GAATGGGGGA ATATGATTTC ACAGGGAATG GACTTGACGC 3 2 40 

AGATTTTTCA AATGATGCAG GAACAAGGTT CCCAGCTCTT TAAAGAAGTC GGTCAAGATC 3 300 

TGGCTCAAAC CCTGAAAAAT GGCCGTGAAT TTTCTCAGAC GATAGGAACC TATCCTTTCT 3 3 60 

TTAGGAACGA ATTGAGTCTC AT CAT AG ACT ATGGGGAAGT TAAGTCCAAG CTGGGTAGTG 3420 

AGTTGGAAAT CTATGCTGAA AAAACTTGGG AAGCCTTTTT TACCCGACTC AACCGCACCA 3430 
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TGAATTTGGT GCAGCCACTG GTTTTTATCT TTGTGGCACT GATTA7CCTT TTACTTTATG 3 540 

CGGCAATGCT CATGCCCATG TATCAAAATA TGGAGGTAAA TTTTTAAAAT GAAAAAAATG 3 600 

ATGACATTCT TGAAAAAAGC TAAGGTTAAA GCTTTTACAT TGGTGGAGAT GTTGGTGGTC 3 660 

7TGCTGATTA TCAGCG7GCT TTTCTTGCTC TTTGTACCTA ATCTGACCAA GCAAAAAGAA 3 720 

CCAGTCAATG ACAAAGGAAA AGCAGCTGTT GTTAAGGTGG TGGAAAGCCA GGCAGAACTT 3780 

TATACCTTAG AAAAGAATGA AGATGCTAGC CTAAGAAAGT TACAAGCACA TGCACGCATC 334 0 

ACGGAAGAAC AGGCTAAAGC TTATAAAGAA TACAATGATA AAAATGGAGG AGCAAATCGT . 3 9C0 

AAAGTCAATG ATTAAGGCCT IT AC CATC CT GGAAAGTCTC TTGGTTTTGG GACTTCTGAC 3 960 

TATCCTTGCC TTGGGCTTGT CCGGCTCTGT CCAGTCCACT TTTTCAGCGG TAGAGGAACA 4020 

CATTTTC7TT ATGGAGTTTG AAGAACTCTA TCGGGAAACC CAAAAACGCA GTGTAGCCAG 4080 

TCAGCAAAAG ACTAGTCTGA ACTTAGATGG GCAGACGCTT AGCAATGGCA GTCAAAAGTT 4140 

GCCAGTCCCT AAAGGAATTC AGGCCCCATC AGGCCAAAGT ATTACATTTG ACCGAGCTGG 4200 

GCGCAATTCG TCCCTGGCTA AGGTTGAATT TCAGACCAGT AAAGGAGCGA TTCCCTATCA 42 60 

ATTATATCTA GGAAATGGAA AAATTAAACG CATTAAGGAA ACAAAAAATT AGGGCAGTGA 4320 

TTTTACTGGA AGCAGTAGTC GCTCTACCTA TCTTTGCCAG CATTGCCACC CTCCTTTTGG 4380 

GACAAATTCA AAAAAATAGG CAAGAGGAAC CAAAAATCTT GCAAAAGGAA GAAGTCTTGA 4440 

GGGTAGCTAA GATGGCCCTG CAGACGGGGC AAAATCAGGT AAGCATCAAC GGAGTTGACA 4500 

TTCAGGTATT TTCTAGTGAA AAAGGATTGG AGGTCTACCA TGGTTCAGAA CAGTTGTTGG 4560 

CAATCAAAGA GCCATAAGGT CAAGGCTTTT ACCTTGTTAG AATCCCTGCT TCCCCTCATT 4620 

GTCATCAGTG GGGGATTACT CCTTTTTCAA GCTATGAGTC ACCTCCTCAT TTCAGAAGTT 4 680 

CGCTACCAGC AACAAAGCGA GCAAAAGGAG TGGCTCTTGT TTGTGGACCA ACTTGACCTA 474 0 

GAATTAGACC GTTCGCAGTT CGAAAAAGTA GAAGGCAATC GCCTATACAT GAAGCAAGAT 4800 

GGCAAGGACA TCGCCATCGG TAAGTCAAAG TCAGATGATT TCCGTAAAAC GAATGCTCGT 4 860 

GGTCGAGGTT ATCAGCCTAT GGTTTATGGA CTCAAATCTG TACGGATTAC AGAGGACAAT 4 920 

CAACTGGTTC GCTTTCATTT CCAGTTCCAA AAAGCCTTAG AAAGGGAGTT CATCTATCGT 4 9 80 

GTGGAAAAAG AAAAAAGTTA AGGCAGGTGT TCTCCTCTAC GCAGTCACCA TAGCAGCCAT 5040 

CTTTAGTCTT TTGTTGCAAT TTTATTTGAA CCGACAAGTC GCCCACTATC AAGACTATGC 5100 

TTTGAATAAA GAAAAATTGG TTGCTTTTCC TATGCCTAAA CCAACCAAAG ATAAGGTTGA 5160 

GCAAGAAAGT GGGGAACAGT TTTTTAATCT AGGTCAGGTA AGCTATCAAA ACAAGAAAAC 5220 
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TGGCTTAGTG ACGAGGGTTC GTACGGATAA GAGCCAATAT CAGTTTCTGT TTCCTTCAGT 5290 

CAAAATCAAA GAAGAGAAAA GAGATAAAAA GGAAGAGGTA CCGACCCATT CAAGCGAAAA 5340 

AGTGGAGAAG AAAAAATCAG AAGAGAAGCC TGAAAAGAAA GAGAATTCAT AGTCAATTCA 5400 

ACTATAATGC GTTGAATCCA GAATAGTCCA CTGTAGTTTC TAGAAAATTG CTGGAAATGC 54 60 

ATGTTAAGCT CCAATTCATT TGTTTATATC TTATTTCAC7 TT ACT AT ACT TTGTGCTAAA 5520 

TTAAAGATAT GAAACATGAT TTTAACCACA AAGCAGAAAC TTTCGATTCC CCTAAAAATA 5580 

TCTTCCTCGC AAACTTGGTA TGTCAAGCAG CCGAGAAACA GATTGATCTT CTATCAGACA 5640 

AAGAAATTTT AGATTTCGGT GGTGGCACGG GTCTATTAGC CTTGCCCCTA ACCCCTAGCC 5700 

AAGCAGCC7A ACTCAGTCAC TCTTGTAGAC ATTTCTGAGA AAATGTTGGA GCAAGCTCGT 5760 

TTGAAAGTGG AGCAGCAAGC AATCAAGAAT ATCCAGTTTT TGGAGCAACA TTTACCGAAA 5820 

AATCCCTTGG AGAAAGAGTT TCATTGCCTT GCTGTTACTC GGGTTCTTCA TCATATGCCT 5380 

G ATTTGG ATG CGGCTCTCTC ACTGTTTCAT CAACATTTCA AGGAAGATGG GAAACTCATC 5940 

ATTGC7CATT TTACCAAGAC AGAAGCTAAT CATCATGGAT TTGATTTAGC TGAACTGGAA 6000 

AACAAGCTAA TTGAGCATGG TTTTTCATCT GTGCATAGTC AGATTCTCTA TAGTCCTGAA 6060 

GACCTGTTTC AAGGAAATCA CTCAGAATTC TTTTTAATAG TAGCCCAAAA ATCACTCGCC 6120 

TAGTCAGGGA GTGATTTTTC TATAAGGATG GAAAAAAGAA GGGAAATTTG GTAAGATAGG 6180 

AATATGGATT TTGAAAAAAT TGAACAAGCT TATACCTATT TACT AG AG AA TGTCCAAGTC 6240 

ATCCAAAGTG ATTTGGCGAC CAACTTTTAT GACGCCTTGG TGGAGCAAAA TAGCATCTAT 6300 

CTGGATGGTG AAACTGAGCT AAACCAGGTC AAGGAGAACA ATCAAACCCT TAAGCGTTTA 6360 

GCACTACGCA AAGAAGAATG GCTCAAGACC TACCAGTTTC TCTTCATGAA GGCTGGGCAA 6420 

ACAGAACCCT TGCAGGCCAA TCACCAGTTT ACACCGGATG CTATTGCTTT CCTTT7CCTC 64 80 

TTTATTGTGG AAGAGTTGTT TAAAGAGGAG GAAATTACTA TCCTCGAAAT GGGTTCTGGG 6540 

ATGGGAATTC TAGGCGCTAT TTTCTTGACC TCGCTTACTA AAAAGCTGGA TTACTTGGGA 6600 

ATGGAAGTGG ATGATTTCCT GATTGATCTG GCAGCT AGC A TGGCACATGT AATTGGTTTG 6660 

CAGGCTGGCT TTGTCCAAGG AGATGCCGTT CCCCCACAAA TGCTCAAAGA AAGCGATGTG 6720 

GTCATCAGTG ACTTGCCTGT CGGCTATTAT CCTGATGATG CCGTTGCGTC GCGCCATCAA 6780 

GTTGCTTCTA GCCAAGAACA TACTTACGCC CATCACTTGC TCATGGAACA AGGGCTTAAG 6840 

TACCTCAAGT CAGACGGATA CGCTATTTTT CTAGCTCCGA GTGATTTGTT GACCACTCCT 6900 

CAAAGTGATT TGTTAAAAGA ATGGCTGAAA GAAGAGGCGA GTCTGGTTGC T ATG ATT ACT 69 60 

CTGCCTGAAA ATCTCTTTGC TAATGCCAAA CAATCTAAGA CTATTTTTAT CTTACAGAAG 7 320 
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AATATAAAAT 


7140 


AGATTTTGTT 


ATAATAGTTG 


AAAACGCTTA 


AAAAGGGGTA 


TCATGTTATG 


ACAAAAACAA 


7200 


T TGCAATCAA 


MA * A A ft ft AM 

TGCAGGAAGT 


TCAAGTTTGA 


AATGGCAATT 


ATACTTAATG 


CCAGAAGAAA 


7260 


AAGTA TTGGC 


GaAAGGTTTG 


% ft ft^4**mft 

ATTGAACGTA 


TCGGTTTGAA 


AGATTCAATT 


TCAACTGTAA 


7320 


AATTTGACGG 


CCCTTCTGAA 


CAACAAATTT 


TGGATATTGA 


AAATCATATA 


CAAGCCG7TA 


7380 


AAATTTTATT 


GGATGACTTG 


ATTCGTTTCG 


ATATTATCAA 


GGCTTATGAC 


GAGATTACAG 


74<10 


G i. GTTGG AC A 


TCGTCT7GTT 


GCTGGTGGAG 


AATATTTCAA 


AGAATCAACA 


GTTGTTGAGG 


7 50C 


GAGATGTTTT 


AGAAAAAGTT 


GAAGAGTTGA 


GTTTGTTGGC 


TCCTCTACAC 


AACCCGGCCA 


7S60 


ATGCAGCAGG 


TGTTCGTGCC 


TTCAAGGAAT 


tgttgccaga 


CATTACCAGT 


GTAGTTGTTT 


7620 


TTG AT ACT i C 


CTTCCACACA 


ACTATGCCAG 


AGAAAGC7TA 


TCGCTACCCT 


CTACCAACAA 


7680 


AATATTACAC 


AGAAAACAAG 


GTTCGTAAAT 


ACGGTGCTCA 


TGGTACAAGT 


CACCAGTTTG 


77-;o 


TAGCAGGAGA 


AGCTG*-AAAA 


CTCTTGGGAC 


GTCCATTAGA 


AGACTTGAAG 


TTAATTACCT 


7800 


GTCATATTGG 


TAACGGAGGC 


TCAATTACAG 


CTGTGAAAGC 


CGGCAAATCT 


GTAGACACTT 


7860 


CTATGGGGTT 


CACTCCTCTT 


GGTGGTATTA 


TGATGGGAAC 


GCGTACAGGG 


GATATTGATC 


7920 


CAGCTATCAT 


TCCTTATTTA 


ATGCAATATA 


CAGAGGATTT 


TAACACACCA 


GAAGATATCA 


7980 


GTCGTGTTCT 


TAACCGTGAA 


TCAGGTCTTT 


TGGGAGTTTC 


TGCTAATTCT 


AGCGATATGC 


3040 


GCGATATAGA 


AGCAGCTCTA 


GCAGAAGGGA 


ATCACGAGGC 


TAGCTTGGCT 


TATCAAATCT 


8100 


ATGTTGACCG 


TATCCAAAAA 


CATATCGGTC 


AGTACCTTGC 


AGTGCTAAAT 


GGAGCAGATG 


8160 


CCATTGTTTT 


CACAGCAGGT 


GTCGGTGAAA 


ATGCAGAGAG 


TTTCCGTCG7 


GATGTAATCT 


3220 


CAGGGATTTC 


GTGGTTTCCT 


TGTGATGTTG 


ATGATGAAAA 


GAATGTCTTT 


GCCGTTACAC 


3280 


GAGACATCTC 


AACAGAGGCA 


GCTAAAATCC 


GTCTCTTGCT 


TATTCCAACA 


GATGAAGAAT 


834C 


TAGTCATTCC 


CCGTGACGTT 


GAACCCTTCA 


AAAAATAAGT 


GAAACTAAAA 


AAATATTCAA 


8400 


TACAAGGAGT 


TGGGAAAGTT 


ATTTTTCCAG 


CTTCTTT7TC 


TGATGAAATT 


GTCCAAAACC 


8460 


TTGCTATGAT 


TGCCTTTTTT 


GAAAAATATG 


GTATAATAGT 


AGTAATTTAA 


TAGATGGAGT 


852C 


TGAGTTTTGA 


AGAAAAACTT 


TCGTGTAAAA 


AGAGAGAAAG 


ATTTTAAGGC 


GATTTTCAAG 


9580 


GAGCGGACAA 


GTTTTGCTAA 


TCGCAAATTT 


GTGGTCTACC 


AATTAGAAAA 


CCAGAAAAAC 


864C 


CGTTTTCGAG 


TAGGTCTATC 


AGTTAGCAAA 


AAACTGGGGA 


ATGCCGTCAC 


TAGAAATCAA 


3700 


ATTAACCGAC 


GGATTCGGCA 


TATTATCCAC 


AATGCAAAAG 


GGAGTCTGGT 


AGAAGATCTC 


376C 
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GACTTTGTTG TCATTCCTCG AAAAGG ACT C GAAACCTTGG GAflVAGC TAG A GATGGAGAAA 8320 

AATCTACTCC ATGTATTAAA ATTATCAAAG ATTTACCGGG AAGGAAATGG CACTCAAAAA 8380 

GAAACTAAAG 7TGAC7AGTT TGCTAGGAC7 GTCTCTGTTA ATCATGACAG CCTGTGCGAC 8940 
TAATGGCGTA ACT AG CG AT A TTACAGCCGA ATCGGCTGAT TTTTGGAGTA AATTGGTTTA 
CTTCTTTGCG GAAATCATTC GCTTTTTATC GTTTGATATT AGTATCGGAG TCGGCATTAT 

TCTCTTTACG GTCTTGATTC GTACAGTCCT CTTGCCAGTC TTTCAGGTCC AAATGGTGGC 9120 

TTCTAGGAAA ATGCAGGAAG CTCAGCCACG CATTAACG£G CTTCGAGAAC AATATCCAGG 9180 

TCGAGATATG GAAAGCAGAA CCAAACTAGA GCAGGAAATG CGTAAAGTAT TTAAAGAAAT 9240 

GGGTGTCAGA CAGTCAGACT CTCTTTGGCC GATTTTGATT CAGATGCCGG TTATTTTGGC 9 300 

CCTGTTCCAA GCCCTATCAA GAGTTGACTT TTTAAAGACA GGTCATTTCT TATGGATTAA 9 360 

CCTTGGTAGT GTGGATACAA CCCTTGTTCT TCCGATTTTA GCACCAGTAT TCACCTTTTT 9420 

AAGTACTTGG TTGTCCAACA AAGCTTTGTC TGAGCGAAAT GGCGCTACCA CTGCGATGAT 94 80 

CTATGGGATT CCACTCTTCA TTTTTATCTT TGCAGTTTAT GCGCCAGGTG GAGTCGCCCT 9 54 0 

ATACTGGACA GTGTCTAATG CTTATCAAGT CTTGCAAACC TATTTCTTGA ATAATCCATT 9600 

CAAGATTATC GCAGAGCGCG AGCCCCTACT ACAGGCACAA AAAGATTTGG AAAATAGAAA 9 660 

AAGAAAAGCC AAGAAAAAGG CTCAGAAAAC GAAATAAATA AGGAGGAATC TGGTAGTGGT 9720 

AGTATTTACA GGTTCAACTG TTGAAGAAGC AATCCAGAAA GGATTGAAAG AATTAGATAT 97 80 

TCCAAGAATG AACCCTCATA TCAAAGTCAT TTCTAGCGAG AAAAAAGGCT TTCTTGGTCT 9840 

ATTTGGTAAA AAACCAGCCC AAGTGGATAT TGAAGCGATT AGTGAAACGA CTGTTGTCAA 9900 
AGCAAATCAA CAGGTAGTAA AACCCCTTCC GAAAAAAATC AATGATTTGA ACGAGCCTGT 996C 
GAAGACGGTT AGTGAAGAAA CCGTTGACCT TGGTCATGTG C7TCATGC7A TTAAAAAAAT 
AGAGGAAGAA GGTCAAGGTA TTTCTGATGA AGTCAAGGCT GAAATCTTAA AACATGAAAG 

ACATGCCAGC ACTATCTTAG AAGAAACTGG TCACATTGAG ATTTTAAATG AACTTCAAAT 10140 

CGAGGAAGCG ATGAGGGAAG AAGCAGGCGC TGATGACCTT GAAACTGAGC AAGACCAAGC 10200 

TGAAAGTCAA GAACTAGAAG ACTTGGGCTT GAAACTTGAA ACGAACTTTG ATATTGAACA 10260 

AGTAGCTACG GAAGTAATGG CTTATGTTCA AACGATTATT GATGACATGG ATGTTGAGGC 103 20 

TACACTTTCA AATGATTATA ACCGTCGTAG CATCAATCTA CAAATTGACA CCAACGAACC 10380 

ACGTCC7ATT ATCGGCTACC ATGGTAAAGT CTTCAAGGCC TTGCAACTGT 7GGCTCAAAA 10440 

TTATCTTTAC AACCGCTATT CCAGAACCTT CTACGTTACA ATCAATGTCA ATGATTATGT 10500 

CGAACACCGT CCACAACTCT TGCAGACCTA TGCGCAAAAA TTGGCGACTC C7CTTTTGGA 1C560 



10020 
10080 
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AGAAGGGCGC AC7CATAAAA CAGATCCAA7 GTCAAATAGC GAACGCAAGA TTATCCATCG 10620 

TATTATTTCA CGTATGGATG GCGTGACTAC TTACTCTGAA GGTCA7GAGC CAAATCGCTA 10680 

TGTTGTTGTA GATACAGAAT AAGTAAAATC AGGTTTATCC TGATTTTTTG CTAGTTAGAG 10740 

GAGGTTAAAC TGATGTTCAA TAAGATAAGA GACTATTTAG ACTTTGCTGG TTTGCAGTAC 10800 

CGTAATCCTG ATAAAGCGGG AGCAGAGCGA GAGAAGATGC TGGCATTCCG CCACAAAGGA 108 60 

CAAGAGGCCC GAAAGGTT7T TACAGAACTG GCCAAAGCCT TTCAAGCAAG CCATCCAGAA 109 20 

7GCCAACTCC AACAGACTAG CCAGTGGATG AATCAGGCCC AGCGTTTGAG ACCACATTTT 10980 

7GGGT7TA7C TACAGAGAGA CGGACAAGTG ACAGAACCTA TGATGGCCTT ACC7T7CTA7 U040 

GCGACA7CTA CTGACTTTGG AATTTCTT7G GAAGTCAGTT TCATCGAACG TAAGAAGGAT 11100 

GAGCAAACAC TGGGCAAGCA GGCCAAAGT7 TTAGACATTC CAACCGTTAA AGGGATTTAT 11160 

TATCTAACCT ACTCTAATGG TCAAAGTCAA CGGTGGGAGG CGAATGAAGA AAAGCGTCGT 112 20 

AC777ACGCG AGAAGGTGAG AAGTCAAGAA GTTCCAAAAG TTTTAGTGAA GGTAGATGTT 112 80 

CCTATGACAG AAAATTCGTC TGAAGAAGAA ATCGTAGAAG GCTTATTGAA GTCTTATTCT 11340 

AAAATTCTTC CCTATTATCT AGCTACGAGA AAATAAGATA ATTTGTAAAA CATCATAAAT 11400 

CATACAGTCC AAGAGTGAAC AGTCCGC7GT GTAATTCTTC GTCTTTTTGT TTGCGCTTTC 11460 

GCATTATATA ATAAACTTAC AAAAACAATT CAAAAGGAGA ACAATTATGG AACTCGTTTC 11520 

AAGTCTTCTA AATTGGTTTT CTAGCAATAT TTTCCACAAT CCCGCATTTT TCGTAGCTTT 11580 

ATTGGTGTTG ATAGGATATG CACTTTTGAA AAAACCTGCC CATGACGT77 TTTCAGGGTT 11640 

TGTTAAAGCA ACAGTAGGGT ATATGTTGCT TAACGTGGGT GCTGGTCGTT TGGTTACAAC L17 00 

CTTTCGTCCA ATCT7AGCAG CTCTTAACTA CAAATTCCAA ATTGGTCCAG CGGTTATCGA 11760 

CCCTTACTTT GGACTTGCTG CAGCAAACAA CAAAATTGTA GCAGAGTTTC CAGATTTTGT 11820 

TGGAACTGCA ACTACAGCTC TATTGATTGG TTTTGGAATA AATATCTTGC TCGTAGCTCT 11880 

7CGAAAGATT ACGAAGGTAA CAACCCTCTT TATTACTGGT CACA7CATGG TACAACAAGC 11940 

TGCAACAGTA TCTCTTATGG TTC7ATTC7T AG7ACCACAA TTGCGCAATG C7TACGGTAC 12000 

AGCAGCGAT7 GG7ATCATCT G7GGAC7T7A CTGGGCAG77 AGT7CAAATA 7GACTGTTGA 12060 

GGCAACTCAA CGCTTCACTG G7GGTGGCGG AT7TGCGATT GGTCACCAAC ACCAA7TTCC 12120 

AATC7GGTTT GTAGA7AAAG TAGCAGGACG CTTTGGTAAC AAAG AAG AAA G7TTAGACAA 12180 

7C7TAAA7TA CC7AACTTCC 7C7CAA7CT7 CCACGA7ACA GTTCTTGCA7 C7GCTACCTT 12 240 

GA7GC7CG7A TTCT7CGGAG CCAT7C7TTT AATC77GGGT CCAGACATTA TGTCTAATAA 12 300 
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AGAAGTCATC ACTTCAGGAA CTCTATTCAA TCCTGCTAAA CAAGATTTCT TTATGTACAT 12360 

TATCCAAACA GCCTTTACCT TCTCAGTTTA CTTGTTCGTT TTCATGCAAG GTGTCCGAAT 12420 

GTTCGTATCT GAGTTGACAA ACGCCTTCCA AGGTATTTCA AACAAATTGT TGCCAGGTTC 12480 

ATTCCCAGCG GTTGACGTTG CAGCTTCTTA TGGATTTGGT TCTCCAAATG CTGTCTTGTC 12540 

AGGATTTACC TTTGGTTTGA TTGGTCAATT GATTACAATT GTTTTGCTCA TCGTCTTTAA 12 600 

AAATCCGATT CTTATTATTA CAGGATTTGT ACCAGTGTTC TT7GACAATG CAGCCATTGC 12 660 

GGTCTACGCT GATAAACGCG GCGGATGGAA ACCCCCTCTT ATCCTTTCCT TTATATCAGG 12720 

TGTCCTTCAA GTTGCTCTAG GAGCTCTTTG TGTGGCCCTT CTCGATTTGG CATCTTATGG 12780 

TGGCTACCAT GGAAATATCG ACTTTCAATT CCCATGGCTT GGATTTGGAT ATATCTTCAA 12 840 

ATACCTTGGT ATTGTTGCTT ATGTACTTGT CTCTCTCTTC TTGCTTGTTA TTCCTCAACT 12900 

TCAATTTGCC AAAGCAAAAG ATAAAGAGAA ATATTACAAC GGTGAAGTTC AACAACAACC 12960 

TTAGTATCTA GAAAAGGAGA AATAAAATGG TTAAAGTATT AGCAGCGTGC GGAAATGGAA 13020 

TCGCTTCATC AATGCTTATC AAGATGAAGG TTGAAAATGC TCTCCGTAAG CTTAATCAAA 13 080 

CAGATTTTAC ACTCAATTCA TGCAGTGTCG GTGAAGCTAA AGGTTTACCA GTAGGATATG 13140 

ACATCGTAAT CGCTTCTCTT CATTTGATTC AAGAATTGGA AGGGCGAACT AATGGGAAGT 13 200 

TAATTGGGCT TGATAACTTG ATGGATGATA AAGAAATCAC CCAAAAACTC AGTCAAGCAC 13 260 

TACAGTAAAA GGTTGGAGGG GGCTGGACAG AAACTGAGAG TTATCGTTTC TGTCCTTCTC 13 320 

CCTCTT7AAA TAAAGGAGGC AGATATGAAT TTAAAACAAG C TTT AATTG A CAATGACTCG 13 380 

ATCCCACTAG GTTTAGAGGC TAACAATTCG AAAGAAGCAC TCAAGGTAGC AGTAGATCCC 13 440 

TT AATTG AAA GTGGGGCAAT TTTGCCACAC TATTACGATG CTATCATTGA ATCCACTGAA 13 500 

GAGTATGGGC CTTACTATAT CTTGATGCCA GGTATGGCTA TGCCCCACGC TAGACCTGAA 13 560 

GCAGGTGTGC AAAGTGATGC CTTTTCATTG ATTACCTTAC AAAATCCTCT TGTATTTTCA 13 620 

GATGGGAAAG AGGTATCTGT TTTGTTGGCA CTAGCAGCAA CAAGTTCAAA AATTCACACA 13 680 

AGTGTACCCA TTCCACAAAT TATTGCCCTA TTTGAATTAG AAGATTCTAT TGCACGTTTA 13740 

CAGGCTTGCC AGACTAAAGA AGATGTCTTG GCTATGATTG AAGAATCTAA GGATAGCCCT 13 300 

TATCTCGAAG GATTGGATTT GGAAAGTTAG AAAGAGGAAT AAAGAAATGA CAAAAAGAAT 13860 

ACCTAATTTA CAAGTTGCAT TAGACCATTC AGACTTGCAA GGAGCGATTA AAGCAGCTGT 13920 

TTCTCTTGGT CAGGAAGTAG AT ATT AT CG A AGCTGGAACT GTTTGCTTGC TTCAAGTTGG 13 980 

AAGTGAACTG GCTGAAGTCT TGCGTAGCCT TTTCCCAGAT AAGATTATTG TGGCAGACAC 14040 

AAAATGTGCT GATGCTGGTG GAACACTTCC TAAAAATAA7 GCGCTTCGTG GAGCAGACTG 14100 
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GATGACTTCT 


ATC7GTTGTG 


CAACCATCCC 


7AC7A7GGAA 


GCAGC7C7AA 


AGGC7ATCAA 


14160 


GACTGAACGA 


GGAGAACGAG 


GCGAAATCCA 


GATCGAGC7T 


TA7GGCGA77 


GGAC7TTTGA 


14220 


ACAAGCTCAG 


CTTTGGCTAG 


ATGCAGG7AT 


CTCACAACC7 


ATTTATCACC 


AATCTCG7GA 


14280 


TGCTCTTCTT 


GCTGGTGAAA 


CTTGGGGTGA 


AAAAGACCT7 


AATAAGG77A 


AAAAAC7CA7 


14340 


TGACATGGGC 


T7CCGTGTAT 


CTG7AACAGG 


TGGTCTAGAT 


G7AGATAC7C 


7CAAAC7C77 


14400 


7GAAGCTAT7 


GATGTCTTTA 


CC7T7A7CGC 


AGGTCG7GGA 


A77 AC AG AG G~ 


C7GTGGA7CC 


14460 ■ 


AGCAGGAGCA 


GCGCG7GCC7 


TCAAGGATGA 


AA7CAAACGA 


AT7TGGGGG7 


AAA7CA7GG7 


14520 


ACGTCCAATT 


GGAATTTATG 


AAAAGGCAAC 


CCCAACACAC 


7GTAC77GCC 


TAGAACG777 


14580 


AAATTTTGCC 


AAGGAGTTAG 


GC7TTGA77T 


TG7CGAGA7G 


7C7A77GACG 


AACGTGACGA 


14640 


GCGTTTAGCA 


AGACTTGACT 


GGAG7AAGGA 


AGAACGC7TG 


GAAG77G7CA 


AAGCAA7CTA 


14700 


TGAAACTGGT 


GTTCGTATTC 


GT7C7ATC7G 


T77T7CAGGC 


CATCG7CGCT 


ACCCAT7GGG 


14760 


TTCAAAAGA? 


CCAGTTCTAG 


AGGAAAAA7C 


TC7AGAAC7C 


A7GAAAAAA7 


G7ATCGAAT7 


14820 


AGCTCAAGAC 


TTGGGAGTTC 


G7ACGA77CA 


AT7AGC7GG7 


7ACGA7GTT7 


ACTATGAGGA 


14880 


AAAGTCACCC 


CAGACACGCC 


AACG7777AT 


CAAAAA77TG 


AGAAAAGCC7 


G7GAC7GGGC 


14940 


TGAAGAAGCT 


CAGGTGGTAC 


TTGC7A7TGA 


AA77A7GGAT 


GATCCT77CA 


7CAGTAGCA7 


15000 


CGAAAAATAT 


TTGGCTATAG 


AAAAAGAGAT 


7GAC7C7CCC 


TTCC7C777G 


7A7ATCCAGA 


15060 


TATTGGTAAT 


G7G7C7CCA7 


GGCATAA7GA 


TA7C7A7AG7 


GAGT777A7C 


T7GC7CA7CA 


15120 


TGCCATCGCA 


CC7C7CCA7C 


7CAAGGA7AC 


T7A7CCACTC 


ACAGAAAC77 


CAAAGGGCCA 


15130 


GTTCCGAGAT 


G7ACC77TCG 


GGCAAGGT7G 


TGTCAAA7GG 


GAAGAACC77 


7CGATAT777 


15240 


AAAGGAAACC 


AA7TA7AATG 


GACCTTTCC7 


AATCGAAATG 


TGG7CTGAAA 


A77C7GAAAC 


15300 


AGTAGAAGAA 


ACACGCGCAG 


CCATTCAAGA 


GCCGCAAGCT 


7T7C7C7ATC 


CACTCAT7AA 


15360 




I I OA 1 vj I AAu 




TAA7CAATCC 


7ATGCG7AAA 


CGAG7C7G7C 


15420 


ATGCCAATCA 


A7CA77GCCA 


AAACATGGAC 


TTG7CAAA7T 


TACC7GCGGG 


AA7GTA7C7C 


L5480 


AAGTTAATCG 


CGAAC7CGG7 


G7CATTG7TA 


TCAAACCATC 


AGGCG7GGAT 


TA7GACGAA7 


15540 


TGACACCTGA 


AAACATGG7A 


G7GACTGATC 


7AGA7GG7AA 


GA7CC7AGAA 


GGGGAT77AA 


15600 


GACCATCTTC 


CGACC7CCCA 


AC7CA7G7GC 


AAT7A7A7AA 


CACT7GC7CA 


GAAATTGGTA 


15660 


GTGTGGTTCA 


CACCCA7TCG 


ACAGAAGC7G 


TTGGTTGGGC 


TCAGGCACG7 


CG7GA7ATTC 


15720 


C777CTACGG 


AACAACCCA7 


GCAGAT7A77 


TC7ACGGT7C 


AATCCCTTGC 


GCCCG7AG77 


15790 


TGACCAAGGA 


CGAAG7AGAA 


G7GGCCTA7G 


AAAAAGATAC 


7GGCC7GG77 


A7CG7AGAAG 


15840 



WO 98/18931 



PCT/US97/19588 



384 



ACTTTGAACA 


TCGCCGACTT 


AACCCGGTTG 


AAGTACCAGG 


AATTGTTGTA 


CGCAATCACG 


15900 


GTCCATTCAC 


CTGGGGCAAA 


AATCCAGAGA 


ATGCTGTTTA 


TCACTCTGTC 


GTACTAGAGG 


15960 


AAGTATCAAA 


GATGAATCGC 


TTTACAGAAC 


AAATCAATGC 


AAGAGTTGGA 


CCTGCTCCCC 


16020 


AGTACATACT 


AGAAAAACAC 


TACCAACGTA 


AACATGGACC 


AAATGCTTAT 


TATGGTCAAA 


16080 


AGTAAGAACG 


ATGAAGGAGG 


AGAAAAAGAT 


AAATTTAGCT 


CCTCT7TTTA 


CATTTGATTT 


16140 


TTATTGAGAG 


TAAAGTTGGA 


GTTGAAGTAA 


TTTTAAAAGA 


TTTTTTAGAA 


ATAGCGCTTG 


16200 


ATATATATAT 


GGTAAAATAA 


AAAGAATTGC 


TGTGATATCA 


ATAGATTTGG 


GGGATTTTTT 


16260 


AATATGGTAC 


TGGATAAGGC 


AAGTTGTGAT 


TTCCTTCAA7 


ATTTGATGGA 


TCAAGAAACG 


16320 


TCCAAAACGA 


TTATGGCGAT 


TTCGAAAGAT 


TTGAAAGAGT 


CAAGAAGGAA 


AATTTATTAT 


16380 


CACATTGACA 


AAATCAATGC 


TGCTCTGGGT 


GACGAGGCGC 


TTCACATCAT 


TAGTATTCCA 


16440 


CGAATTGGTA 


TTCACTTAAC 


GGAAGAGCAG 


AGAGATGCTT 


GTTGTAAACT 


ATTATCGGAA 


16500 


GTAGATTCGT 


ACGATTATAT 


CATGAGTGCG 


CATGAACGTA 


TGATGATAAT 


GTTACTATGG 


16560 


ATAGGTATTT 


CTAAAGAACC 


TATTACGATT 


GAAAAATTGA 


TAGAGTTAAC 


AGAGGTATCT 


16620 


AGGAATACTG 


TTCTCAATGA 


TTTGAATAGT 


ATTCGTTATC 


AACTAACTTT 


GGAACAATAT 


16680 


CAGCTGATCT 


TGCAAGTGAG 


CAAGTCACAG 


GGATACAACC 


TTCATGCCCA 


CCCTCTTAAT 


16740 


AAAATTCACT 


ATCTTCAATC 


GCTTCTATAT 


CATATTTTTA 


TGGAAGAAAA 


. TGCCACTTTT 


16800 


GTATCTATTT 


TAGAAGATAA 


GATGAAAGAG 


AGGTTAGATG 


ATGAGTGTTT 


GCTTTCTGTT 


16860 


GAAATGAACC 


AATTTTTTAA 


GGAACAGGTT 


CCTTTAGTTG 


AACAAGATTT 


AGGG AAG AAA 


16920 


ATAAACCA7C 


ATGAAATAAC 


TTTTATGTTG 


CAGGTTCTAC 


CTTATTTCCT 


GTTAAGCTGT 


16990 


CATAATGTTG 


AACAGTATCA 


AGAAAGACAT 


CAGGATATAG 


AGAAAGAATT 


TTCTTTGATA 


17040 


AGAAAAAGAA 


TAGAGTATCA 


GGTGTCTAAG 


AAATTAGGAG 


AACGGTTGTT 


TCAAAAGTTT 


17100 


GAAATTTCT7 


TGTCAGGACT 


TGAAGTTTCT 


CTTGTAGCTG 


TTCTCCTCCT 


CTCCTATCGT 


17160 


AAAGATTTGG 


ATATTCATGC 


AGAAAGTGAT 


GATTTTCGGC 


AATT AAAACT 


TGCTTTAG/vA 


17220 


GAATTTATCT 


GGTATTTTGA 


ATCACAAATC 


CGAATGGAGA 


TTGAGAACAA 


GGATGATTTC 


L7280 


TTACGAAATT 


TGATGATCCA 


CTGTAAAGCC 


TTGTTATTTA 


GAAAGACTTA 


CCGTATTTTT 


17340 


TCTAAAAATC 


CTCTAACAAA 


ACAAATTCGA 


TCCAAGTATG 


GAG AATT ATT 


TTTAGTCACT 


17400 


AGAAAATCTG 


CGGAAATTTT 


AGAAGGAGCA 


TCGTTTATTC 


GGCTAACAGA 


CGATGATATT 


17460 


CCCTATTTGA 


CGATTCATAT 


TCCACGATTT 


TTAAAATATA 


CACCATCATC 


TCAAAAAAAT 


17520 


ATGAAAAAAG 


TTTATCTCGT 


TTGTGATGAA 


GGTCTTGCGG 


TTTCGAGACT 


TTTGCTGAAA 


17590 


CAATGCAAAC 


TTTATTTTCC 


AAATGAGCAA 


ATTGACACTG 


TATTTACAAC 


AGAACAATTT 


17640 
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AAGAGTGTCG AAGATATTGC ACAAGTTGAT GTAGTGATTA CTACTAATGA TGATTTGGAT 177 00 

AGCAGATTTC CGATTTTAAG GGTTAATCCT ATCCTTGAAG CAGAAGATAT TTTCAAAATG 17 760 

CTAGACTATC TTAAACACAA TATATTTCGT AATAAGAGCA AAAGTTTCAG TGAAAATCTT 17 320 

TCTAGTCTTA TTTCGTCTTA TATTGTAGAC AGCAAGTTGG CTAGTAAGTT CCAAGAAGAG 17 880 

GTTCAAACAC TTATAAATCA AGAAATAGTA GTTCAAGCTT TTTTGGAAGr TATTTGAAGG 17 940 

ACAGTCCAAT GATGAACACA AACCTGTGTk TTTCsTGGTC TTTTtTAGTG TTTTGAAGGG 18000 

TGGkATACTA ATCTCAAAGA TAACAATTAT ATCCAAAGGA GGCAACATAT GCCAAACGTC 13060 

AAAGAAATTA CAAGAGAGTC ATGGATTTTA GCCACTTTCC CACAGTCQGC AACATGGTTG 13120 

AACGAAGAAA TCGAAGAAGA AGTCGTACCT GAAGGCAACT TTGCCATGTG CTGGCTACGC 18180 

AACTGTGGTA CTT.GGATTAA GACACCAGCT GGTGCTAACG TTCTCATGGA CCTTTGGTCA 18 240 

AACCGTGGAA AATCAACCAA AAAAGTGAAA GATATGGTTC GTGGCCACCA AATGGCAAAT 18 300 

ATGGCAGGTG TTCGTAACCT GCAACCAAAC TTGCGTGTTC AGCCAATGCT TATCGATCCA 13 360 

TTTCCTATCA ACGAACTACA CTATTACTTA GTTTCACACT TCCACACTCA TCATATCCAC 18420 

C CAT AC AC AG CTGCAGCAAT TCTCAATAAT CCTAAGTTAG AGCATCTTAA CTTCG 13475 

( 2 } INFORMATION FOR SEQ ID NO: 39: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7186 base pairs 
(Bl TYPE: nucleic acid 
(C) STRANDEDNESS: douole 
( 0 ) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CCAGGATTTG GTACCGTTGC AACTGGTGTG CCTTTCCTCC TAAAGGAAAA TGGAGGAAAA 50 

ATCAATCAAT CAGCACATTC ACATATCAAA GTTGCTAAGG TATTGGTCAA GGATGAAGAT 12 0 

GAAAAAAATC GCTTGCTTGC AGCAGGGAAT GACTTTAACT TTGTAACCAA TGTGGATGAT 180 

ATTTTATCAG ACCAGGATAT TACTATCGTA GTGGAATTGA TGGGGCGTAT TGAGCCTGCT 2 40 

AAAACCTTTA TCACTCGTGC CTTGGAAGCT GGAAAACACG TTGTTACTGC TAACAAGGAC 3 00 

CTTTTAGCTG TCCATGGCGC AGAATTGCTA GAAATCGCTC AAGCTAACAA GGTAGCACTT 3 60 

TACTACGAAG CACCAGTTCC TGGTGGGATT CCAATTCTTC GTACTTTAGC AAATTCCTTG 4 20 

GCTTCTGATA AAATTACGCG CGTGCTTGGA GTAGTCAACG GAACTTCCAA CTTCATGGTG 4 80 

ACCAAGATCC TGGAAGAAGG CTGGTCTTAC GATGATGCTC TTCCGGAAGC ACAACGTCTA 540 
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GCATTTGCAG AAAGCGATCC GACGAATGAC GTAGATGGGA TTGATGCAGC CTACAAGATG 600 

GTTATTTTGA GCCAATTTGC CTTTGGCATG AAGATTGCCT TTGATGATGT AGCCCACAAG 660 

GGAATCCGCA ATATCACACC AGAAGACGTA GCTGTAGCTC AAGAGCTTGC TTACGTAGTG 7 20 

AAATTGGTTG GTTCTATTGA GGAAACTTCT TCAGGTATTG CTGCAGAAGT GACTCCAACC 780 

TTCCTACCTA AAGCGCACCC ACTTGCTAGT GTGAATGGCG TAATGAACCC TGTCTTTGTA 840 

GAATCTATCG CTATTGGTGA GTCTATGTAC TACGGACCAG GTGCGGGTCA AAAACCAACT 900 

GCAACAAGTG TTGTAGCTGA TATTGTCCGT ATCGTTCGTC GTTTGAATGA TGGTACTATT 9 60 

GGCAAAGACT TCAACCAATA TAGCCG7GAC TTGGTCTTGG CAAATCCTGA AGATGTCAAA 1020 

GCAAACTACT ATTTCTCAAT CTTGGCTCTA GACTCAAAAG GTCAGGTCTT GAACTTGGCT 1080 

GAAATCTTCA ATGCTCAAGA TATTTCCTTT AAGCAAATCC TTCAAGATGG CAAAGAGGGT 1140 

GACAAGGCGC GTGTCGTTAT CATCACACAC AAGATTAATA AAGCCCAGCT TGAAAATGTC 1200 

TCAGCTGAAT TGAAGAAGGT TTCAGAATTC GACCTCTTGA ATACCTTCAA GCTGCTAGCA 1260 

GAATAAGATG AACATTATTG TACCTGCAAC CAGTGCCAAT ATCGGGCCAG CTTTTCACTC 13 20 

GGTCGGTGTA GCTGTAACCA AGTATCTTCA AATTGAGGTC TGCGAAGAAC GAGATGAGTG 1380 

GCTGATTGAA CACCAGATTG GCAAATGGAT TCCACATGAC GAGCGTAATC TCTTGCTCAA 14 40 

AATCGCTTTG CAAATTGTAC CACACTTGCA ACCAAGACGC TTGAAAATCA CCAGTGATGT 1500 

CCCTTTGGCG CGCGGTTTGG GTTCTTCCAG CTCGGTTATC GTTGCTGGGA TTGAACTAGC 1560 

CAACCAACTG GGTCAACTCA ACTTATCAGA CCATGAAAAA TTGCAGTTAG CGACCAAGAT 1620 

TGAACGGCAT CCTGACAATG TGGCTCCAGC CATTTATGGT AATCTCGTTA TTGCAAGTTC 1680 

TGTTGAAGCG CAAGTCTCTG CTATCGTAGC AGACTTTCCA GAGTCTGATT TTCTAGCTTA 17 40 

CATTCCAAAC TATGAATTAC GTACTCGCGA CACCCGTAGT GTCTTGCCTA AAAAATTGTC 1800 

TTATAAGGAA GCTGTTGCTG CAAGTTCTAT CGCCAATGTA GCGGTTGCTG CCTTCTTCCC 1860 

AGGAGACATG GTGACCGCTG GGCAAGCAAT CGAGGGAGAC CTCTTCCATG AGCGCTATCG 1920 

TCAGGACTTG GTAAGAGAAT TTGCGATGAT TAAGCAAGTG ACCAAAGAAA ATGGGGCCTA 1930 

TCCAACCTAC CTTTCTGGTG CTGGGCCGAC AGTTATGGTT CTGGCTTCTC ATGACAACAT 2040 

GCCAACAATT AAGGCAGAAT TGGAAAAGCA ACCTTTCAAA GGAAAACTGC ATGACTTGAG 2100 

AGTTGATACC CAAGGTGTCC GTGTAGAAGC AAAATAAAGA ATAGAAGATA GGATGGGGAA 2160 

ACTCTTGACC AGAGGGGTTC ATATCCTTTT TGTGAAAAGA ACT TT AT ACT CAATGAAAAT 2220 

CAAAGAGCAA ACT AGG AAG C TAGCCGCAGG CTGCTCAAAA CAGTGTTTTG ACGTTGCAGA 22 30 

TAGAACTGAC GAAGTCAGCT CAAGACACTG TTTTGAGGTT GCAGATAGAA CTGACGAAGT 2 34C 
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CAGTAACCAT 
TAGTTAAAAA 
TTTTTGGGGC 
CTACGCTAAT 
AGAAACGGTC 



TTATTTCCGA 
A7ATCGAACT 
GCAGGAGCAG 
C7ACATTCTC 
TCATATCGAT 
TAGTCAAGAG 
TGCTACAGAG 
AGATATTACG 
TTGGCCAAGT 
CCATGGAATG 
TTTCACAGAT 
ACGCTTTGTG 
AAACAAATAA 
CCGATTTATC 
CCCCTTGTTC 
GATTGTTCAC 
GCTCCTTATC 
AAAGGCAGCA 
CTTGCCCAAG 
GGATACCTAC 
CATTATCGTT 
GTTCTTAGTC 
7CCTTTCTAC 
ATTGCAAGGG 



ACTACGGTAA 
CGTGATAAAG 
CTAGAGGAAT 
GGTCAAGTCG 
CAAGTGATTT 
GTTATCGATC 
GGGATTTATT 
GAACGCATGC 
GCTGAAGACT 
GTGACCGATG 
GTGTTGAAGG 
GC7CCATTTA 
ACAGGTGAGC 
TTTAGCCGTC 
GAGCGAATTG 
GGACCGCGGG 
GCCAAGGATG 
AACAGACAGT 
AAACACCTAT 
AAGCTGTTAG 
CAATCTTTAC 
TTTGCAGTAA 
GTAGGTTCTG 
GACAGCAGAG 
CAGATTCAGA 
TTTGGTGCCA 
TTGGTTGCCA 
AGTAGTCTCA 
ATGCGGGTTA 



GGTGACGCTG 
GAGAAATAAA 
ATTTTTCACG 
AAACGACCAA 
AC GAT GAG AA 
CTCTATCTAT 
ATCAGGATGA 
TGGGTCGAAA 
ACCACCAAGA 
CTGATAAGCC 
CCAGTCTATC 
CCAATGCCTA 
CACTCTTTTT 
CGATTTCCAA 
AAGTTCGTTC 
AGTTAGGCCG 
AGATGGAAAA 
GGGGCTTCCC 
TATCTTACTT 
AAGCTCTTTT 
CTCAGCGAGA 
TTGGCGTTTT 
CTAAGGAATT 
ACCGTCTGAC 
CTGGTATCAA 
TTTTTATGGC 
TTTTGACCAT 
GAAAGAAAAC 
TTCGTGCTTT 
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ACGTCCTTTC 
GATGGCAGAA 
CATTTCTGGA 
TTACCAGTTG 
GGAAGTGTCA 
CAATCAACAA 
AGCAGATTTG 
GATTGCAGTA 
CTATCTCAGG 
ATTGATTGAT 
TGAAGAGTCT 
TGACCAAACC 
TGCCAAGGAT 
AGAGTTGATT 
TCGTTCAGGC 
CCTCCGTTAC 
AGCAGGATAT 
ACTTTCTTCA 
CAAACCCTAC 
TGAGCTCTTG 
TCAAGGTCAT 
AGTGGCCTTG 
GACAAACGAT 
AACTTCTAGT 
TCAATTCCTG 
TTATCGAATC 
TGTCATTGTA 
GGACCAACTG 
TGGTCAAGAA 



AAGAGATTTT 
ATTTATCTAG 
GTGCTAGAAA 
CTCAAGGAAA 
CTCAGAGAGA 
GGGAATGACC 
CCAGCTATCT 
GAAGTGGAGC 
AAGAATCCTT 
GCAGCAAACT 
TATCGTGTCA 
TTTGAACAGG 
AAGTTTGCTT 
CATTATTACA 
AGTGCTCACT 
TGTATCAATT 
GGCTATCTAT 
TTTCTAGAAT 
ATCAAGGAAT 
GTTCCCATGG 
CTCTGGATGC 
ATAGCTCAAT 
CTTTATCGTC 
TTGGTCACTC 
CGTCTCTTTT 
TCAGCTGAGT 
GGGTTATCTC 
GTTCAGGAAA 
AAACGAGAGT 



CGAAGAGTAT 
CAGGTGGTTG 
CCAGTGTTGG 
CAGACCATGC 
TTTTACTTTA 



2400 
2460 
2520 
2580 
2640 



GTGGTCGCCA 
ACACAGTGGT 
AATTACCCCA 
CAGGTTACTG 
ATGAAAAGCC 
CACAAGAAGC 
GCATTTATGT 
CAGGTTGTGG 
AGGATCTGAG 
TGGGTCATGT 
CTGCTTCTTT 
TGCCTTACTT 
ATGAATACAA 
CAATTTTAGC 
TGATTGCTGG 
AGATTGGCCT 
TTTACTCAGC 
ATATTCTTTC 
GCTTGACTTC 
TACGAGCGCC 
TGACTTTCTG 
GATTGGTCAA 
CGCCCCAGCA 
TACAGATTTT 



2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 
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TCAAACCCTT 


AACCAAGTTT 


ATGCTAGATT 


ACAAGAAAAG 


ACAGCT77C7 


GGTCTAGTTT 


414C 


ATTAACACCT 


CTGACCTATC 


TGATTGTCAA 


TGGAACTCTT 


CTCG77ATTA 


TCTGGCAAGG 


4200 


CTATATTTCA 


ATTCAAGGAC 


GAGTCCTCAG 


TCAAGGTGCT 


C7CAT7GCTC 


TTATCAATTA 


4260 


CCTCTTACAG 


ATTTTGGTGG 


AA7TGGTCAA 


GCTAGCCATG 


TTGATCAATT 


CCCTCAACCA 


4320 


GTCCTATATC 


TCAGTCAACC 


GAATCGAGGA 


AGTCTTTGTT 


GAGGCTCCAG 


AGGATATCCA 


4390 


TTCAGACTTA 


GAACAAAAGC 


AAGCTACCAG 


AGATAAGGTT 


7TACAAG7CC 


AAGAATTGAC 


4440 


CTTTACCTAT 


CCTCATGCGG 


CCCAGCCTTC 


TCTGAGATAC 


ATTTCC7TTG 


ATATGACTCA 


4500 


AGGACAAATT 


CTAGGTATCA 


TCGGGGGAAC 


TGGTTCTGGT 


A.\ATCAAGCT 


TCGTGCAACT 


4560 


CTTACTTGGA 


CTTTA7CCAG 


TAGACAAGGG 


GAACATTGAC 


CTTTATCAAA 


ATGGACGTAG 


4620 


TCCTCTTAAT 


TTGGAGCAGT 


GGCGGTCTTG 


GATTGCCTAT 


GTACCTCAAA 


AGGTCGAACT 


4630 


CTTTAAAGGA 


ACCATTCGTT 


CCAACTTGAC 


TCTAGGTTTC 


AATCAAGAAG 


TATCTCACCA 


4740 


GGAACTCTGG 


CAGGCCTTGG 


AGATTGCGCA 


AGCTAAGGAT 


T7TGTCAG7G 


AAAAGGAAGG 


4800 


ACTCTTGGAT 


GCTCTAGTTG 


AGGCAGGGGG 


GCGAAATTTC 


TCAGG7GGAC 


AAAAACAAAG 


4860 


ATTGTCTATC 


GCCCGAGCAG 


TCTTGCGCCA 


GGCTCCGTTT 


CTCATCCTAG 


ATGATGCAAC 


4920 


CTCGGCACTG 


GATACCATTA 


CAGAGTCCAA 


GCTCTTGAAA 


GC7A7TAGAG 


AAAATTTTCC 


4980 


AAACACGAGC 


TTAATTTTGA 


TCTCTCAACG 


AACCTCAACT 


TTACAGATGG 


CGGACCAGAT 


5040 


TCTCCTCTTG 


GAAAAAGGTG 


AGTTGCTAGC 


TGTTGGCAAG 


CACGATGACT 


TGATGAAATC 


5100 


CAGCCAAG7C 


TATTG7GAAA 


TCAATGCATC 


CCAACATGGA 


AAGGAGGACT 


AGAATGAAAC 


5160 


GACAAAC7GT 


AAACCAGACG 


CTCAAACGTT 


TAGCCGTAGA 


T7TAGCAACC 


CATCCTTTCC 


5220 


TCCTTTTCCT 


AGCCTTTCTA 


GGAACTATTG 


CCCAACTTGG 


CTTA7CAAT7 


TACCTACCTA 


5280 


TTCTGATTGG 


GCACCTCAT7 


GACCAAGTCC 


TAGTGGCTGC 


77CA7CACCA 


GT7777TGGC 


5340 


AGATTTTTCT 


CCAGATGCTC 


TTGGTGGTAA 


TAGGAAATAC 


7C7GGTACAA 


TGGGCCAATC 


5400 


CTCTCCTC7A 


TAATCGTCTA 


ATCTTCTCTT 


ATACCAGAGA 


777ACGGGAG 


CGAATCATCC 


5460 


A7AAGCTCCA TCGTTTACCG 


ATTGCCTTTG 


TAGA7AGGCA 


AGGTAGTGGA 


GAGA7GG7TA 


S520 


GTCGTGTAAC 


CACGGACATC 


GAACAGTTCC 


CACCTCCCTT 


G AC CATC ATT 


TTTAACCAAT 


5580 


TTTTCATTGG 


TGTTTTGATG 


ATTTTGGTCA 


GTATTCTAGC 


CATGCTCCAA 


ATTCATCTCC 


5640 


TCATGACTCT 


CTTAGTCTTG 


CTGTTGACGC 


CACTGTCCAT 


CGTCATTTCA 


CGCTTTA7TG 


5700 


CCAAGAAATC 


CTATCATCTC 


TTCCAGAAGC 


AAACAGACAC 


CAGCGCAATT 


CAGACTCAGT 


5760 


TGATTGAAGA 


ATCGCTTAGT 


CAGCAGAC7A 


TAATCCAG7C 


CTTCAATGCT 


CAAACAGAAT 


5820 


TTATCCAAAG 


ATTGCGTGAG 


GCTCATGACA 


AC7AC7CAGG 


C7AT7CTCAG 


7CAGCCA7C7 


588C 
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TTTATTCTTC AACGCTCAAT CCTTCGAC7C GCTTTCTAAA TGCACTCATT TATGCCCTTT 5940 

TAGCTGGAGT AGGAGCTTAT CGTATCATGA TGGGTTCAGC CTTGACCGTC GGTCGTTTAG 6000 

TGACTTTTTT GAACTATGTT CAGCAATACA CCAAGCCCTT TAACCATATT TCTTCAGTGC 6060 

TAGCTGAGTT GCAAAGTGCT CTGGCTTGCG TAGAGCGTAT CTATGGAGTC TTAGATAGCC 6120 

CTGAAGTGGC TGAAACAGGT AAGGAAGTCT TGACGACCAG TCACCAACTT AAGGGAGCTA 6180 

TTTCCTTTAA ACATGTCTCT TTTGGCTACC ATCCTGAAAA AATTTTCATT AACGACTTGT 62 4 0 

CTATCGATAT 7CCAGCTGGT AGTAAGGTAG CCATCCTTCG TCCGACAGG? GCTGGAAAAT 53 00 

CAACTCTTAT CAATCTCCTT ATGCGTTTTT ATCCCATTAG CTCGGGAGAT ATCTTGCTGG 63 60 

ATGGGCAATC CATTTATCA? TATACACGAG TATCATTGAG ACAGCAGTTT GGTATGGTGC 6420 

TTCAAGAAAC CTCGCTCACA CAAGGGACCA TTCATGATAA TATTGCCTTT GGCAATCCTG 6480 

AAGCCAGTCG AGAGCAAGTA ATTGCTGCTG CCAAAGCAGC TAATGCAGAC TTTTTCATCC 6540 

AACAGTTGCC ACAGGGATAC GATACCAAGT TGGAAAATGC TGGAGAATCT CTC7CTGTCC 6600 

GCCAACCTCA GCTCTTGACC ATAGCCCGAG TCTTTCTGGC TATTCCAAAG ATTCTTATCT 6660 

TAGACGAGGC AACTTCTTCC ATTGATACAC GGACAGAAGT GCTGGTACAG GATCCCTTTG 67 20 

CAAAACTCAT GAAGGGCCGC ACAAGTTTCA TCATTGCTCA CCGTTTGTCA ACCATTCAGC 67 80 

ATGCGGATTT AATTCTTGTC TTAGTAGATG GTGATATTGT TGAATATGGT AACCATCAAG 6840 

AACTCATGGA TAGAAAGGGT AAGTATTACC AAATGCAAAA AGCTGCGGCT TTTACTTCTC 6 900 

AATAAGCCAT TCTCTTTTGA AAGTTTATGG ACGAAAAAAG TTCCC7TCCA CTCACTTTTT 6960 

TGTTACAATA GCTAGAAAAA TTGT7CACTC TAATACTCAA TGAAAATCAA AGAGCAAACT 7020 

AGGAAGCTAG CCCTAGGTTG C7CAAACCAC ACCTTTGAGG TTGTAGATAA GACTGACGAA 7 080 

GTCAGTTCAA AACACTGTTT TGAGCTTCCA GATAGAACTG ACGAAGTCAG CTCAAAACAC 7 140 

TGTTTTCAGG TTGCAGATAG AACTGACGAA GTCAGCTCAA AACAGG 7186 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14273 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
<DI TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ZD NO: 4C: 

CTGAAAATTC TAAAAAATTT ATAAGTAAGG AATTAATTAG 7TATTTTTGT GATAAAGTTT 60 
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ATCATGAAAT 


ATTTGTTGAA 


GAGGTAGTTC 


CGCACGT7TT 


TCTGCCATAT 


GAATCTGACT 


120 


TACTTCTTAT 


TTTACCAGCT 


ACGGCAAATG 


TGATTCCCAA 


AATTGCTAAT 


GGTATTGCTG 


180 


ATGATTTAGT 


TACAGCAACT 


GTTTTAAACT 


TTAATAAAAA 


AATAATTTTT 


TGTCCCAATA 


240 


TGAACTCTAC 


TATGTGGGAC 


AATCACATAG 


TTCAAAGAAA 


TGTATCAATT 


CTAAAGGAGT 


300 


TGGCACATAT 


ATTTTTATTT 


GAGTCTAAAA 


AAACATATGA 


GGTAGGATTG 


CGTAAAGCAA 


360 


TAGATTCAAC 


ATCTTCAATC 


TTACAACCAC 


AGTCGTTAGT 


AAAAGAACTT 


ATCAAATTAG 


420 


AAAATATTCT 


CCTTGAAGAG 


GGACATTAAA 


AACTACTCAG 


AATATTAATG 


AGGGGAAAAA 


480 


ATGGAAAATT 


CATCAATCGA 


TGTAGATATG 


C7GTTGGAAG 


AATTGACACA 


AGAAGCAATG 


540 


GTCGTTGTTG 


CTGTTGATAA 


GGACTGTTAA 


TTTAAACTTA 


TGGCAATATA 


TGAAAGGTTA 


600 


CTGGATGTTT 


TAAATTATGC 


AGGCAGTAGC 


CTTTTATTAT 


ATACAAATGG 


ATAAAGTAAG 


660 


GATAATACAA 


TGATTAATAA 


AAAAATACAA 


CAAGTTGTTT 


TGGAATCATT 


ACAGAATTTT 


720 


TTGAATGGGA 


ACTTCATTTC 


GCCTTGTGTA 


GTCTATGATT 


TTGCCTTGCT 


CGAAACTGTA 


780 


CTTGATGAAT 


TTAAAAATCA 


AATTCCTGTA 


ACATTCAATT 


ACCAACTTTT 


TTATGCCGTT 


840 


AAAGCAAATT 


CAAATGAGAA 


GATACTTGAA 


TTCTTAGTAC 


ATAAAATTGA 


TGGAGTTCAT 


900 


GTGGCGTCAT 


TATCTCAATT 


AGATGTGGCT 


AAAAAATTTT 


TCCCACCAAC 


TCAAATTTCT 


960 


GTTAATGGTC 


CCGCATTTTC 


TTATGAAACT 


TTATATAATC 


TGATTAAAAA 


ACAATATAAA 


1020 


GTTGATATTA 


ACTTTTTGGA 


ACATCTTCAA 


CAATTTTCCC 


CAAAAGAATC 


TGTTGGAATA 


1080 


AGAGTAACGG 


AGCCAGATGA 


ACTTAATAAT 


CGTATGAGTC 


GATTTCGAAT 


AAATATTTGC 


1140 


AGTGATAATT 


GGACTAGTAA 


TTTACAAAAT 


CCTTTAATTA 


CACGACTGCA 


TTTTCATTTT 


12C0 


GGAGAAAAAG 


ATGATAAATT 


TATTGTTAAG 


TTAGATAAAA 


TATTATTTAA 


GTTACAAGAA 


1260 


ATTAATAAAC 


TTAGAGAGGT 


TAGAGAAATA 


AATCTTCCAG 


GCGGTTTTAT 


GAAATTATTT 


1320 


ATGGAAAATC 


GTTTGAAAGA 


ATTTTTTCTA 


TCACTTATGG 


AAATCTATAA 


AAAGTACGAT 


1380 


ATTGATAGTA 


CTGTGACTAC 


AATAATAGAA 


CCAGGTAGTG 


CAATTACTTC 


ATTTTCTGCC 


1440 


TATATGATTA 


CTAGCCCAGT 


TAATGTTAGT 


GAGGTGAATG 


AGCAGCAGGT 


TATCACCTTA 


1500 


GACACATCAA 


TATACACCAA 


TACATTATGG 


TTTGTTCCGC 


AT ATT ATT AC 


AACGTTAAAT 


1560 


TCAAGTAGTA 


AAGAGCGTTA 


T ACT ACT ATT 


CTCTATGGTA 


ATACCTGTTA 


TGAACATGAC 


1620 


AAGTATAAAA 


TGAAAGTTTC 


GCTTCCAAGG 


TTAACTCAAA 


ATAGCAGTAT 


AGTGTTTTTT 


1680 


CCTGTAGGAG 


CTTATATAAA 


AACCAATCA? 


TCAAATTTAC 


ATCGTAATGA 


TTTTATGCGG 


1740 


GAGGTATATT 


TGTGGACAAA 


AAACTTGACA 


TATTAGATAA 


AGTTAAGGAA 


TATTTAGGAA 


1300 


ATAAAACTAC 


TCAAATTCTG 


GATAATCAAT 


ATAAAGAATT 


TTTGAAACTT 


1 AATGATATAA 


135C 
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GCCGACCCTT 
AATTTAATGA 
AGGAATGGAG 
CAGATGATTT 
GATATTTATC 
CTTTAACAGC 
TCTTCAAGAA 
TAAATAATCT 
AAGATAAAT7 
AAACTGATTT 
TTATTTTAGA 
TTAAATTTGC 
GATTACAATT 
TGAAAAATTT 
GCCTCTATGA 
CTGATTTAGT 
AAAA7TCTGA 
TAGCTGAACC 
ACCTTGGAAT 
ATTTTGGAAT 
GGTTAAGTTA 
CTACTCATGA 
TAACTTAGCA 
TTCGACCA7C 
CTTTTTTGGA 
GG7TCAATTA 
GATAATGAGT 
TCTCTTGATT 
TATTTCGTAT 



TCGTATTTCA GAAAAAGTAT 
TTTAATTAAT AACGAAAATT 
AAAAAAATGC TTTAATCATT 
TCTTAACACG AAGACATTGA 
GAAAAGTTCG ATAGGCGATA 
TAATGGTATG TCTGCTATTA 
GAAGATTGAT TTATTATATT 
TCCTAAATCA GGTATTAGTT 
TTATAATGTA TTCATGATGG 
CAAGATAGTA GAATATTTTG 
TATTTCATAT CAAGGTTCTA 
GAATGTAATT ATTTTTGTGG 
GACAAATGGG GGAATAATAG 
TATTGAAGAG GAATTCAATA 
ATACTGTTTG CTTGATAATT 
TATGAAATTT ACCAGTAATT 
TATTGAAATC ATCCA7GAAG 
TAAAAAAGAA TATGAAATGT 
CACATTGTAT GCTAGAAATA 
TATTGGGACA GAAACATATA 
TTATTTGATG AAATTTTTAT 
GGTTAATAGA TGAAAAATTT 
GACAGTGTAT TTTATATACT 
TTCTTAGGAA TATTTATTGC 
CCAGTTATTG ACAGAGTAAA 
GCAGTGGCTG TAATATTTTT 
CTAGTGTTTA TTTCAGTAAT 
CCTCAAGTGG TAGAATATGA 
AAAGTATTAG ATTCTATTTT 



391 

TAAACAATTC TTTTAATTTT 
ATTTATTCGA ATATGCATGT 
CTTATCGTTT TCTATGCTCA 
GAAGTAGCCA AATTGAATAT 
GAGCGGTTGA TGGCTTTGTT 
AACT ATCTCT ~TG AG AT ATT A 
CAACCGGATA TTATGAAACA 
GCTATGACGT AAGTAATTGT 
AACCCAATCG AGCCGATTTA 
TTAACTA7AA AAATAATTCA 
ATTTTAAATT AGTAGAATTT 
TACGATCTTT GATAAAATTA 
AAGTGTTTAT TCCTAATCAT 
AATTTAGAAA TTCTCACGGA 
CTTTAACTTT AAAAAATGAT 
TTTATGCTGA TATAAAAGAC 
AGGGACTACC TTTTGTATTT 
TTTTTCAATG GTTAAACTTC 
GTTTTCGGTT TCGGAATCTA 
TATTTAACAT TTGTCCAGGT 
TAAAATCTTT TTCAAATGAA 
GATAAAGTTG CTAATAATTA 
AGCATTGTGG CACGTTAGCA 
AGTAAATTAT CTACCGGATT 
TCCGCAAAAA ATTCTTATAA 
ATT ATT ATT A AACCAAATAT 
GGCTAGCTCC ATAAGTTACC 
TAAGATTCTA TTTGCAAATT 
TAATTCATTC GCATCATT7T 



ACGAGTAAAG 1920 

AGAATTAGAG 1980 

CCTATAATTA 2040 

AAATATGAGC 2100 

TCCTTCAATA n60 

AACTCTATTT " 2 2 20 

AGATTTTTAT 2 280 

GAATTGCATA 2 34C 

AC ATT AC AAA 2 400 

ATAAAAGTCG 2460 

TTAGAGAAAT 2520 

GATCAAATCG 2530 

TTGACAAACT 2 640 

CCTAATCTAA 2700 

TGG AACT ATT 2 7 60 

TTGTTCATGG 2820 

TTAGATTTAA 2 880 

TTTTACAAAC 2 94 0 

AC ACT AG ACT 3000 

GTTTATAAAC 3060 

TATTTAAAAA 3120 

CATTGATTGT 3 130 

ATAATTATTC 3 240 

TGTTACTAAT 3 300 

TATCAATTTT 3 3 60 

CATTTTGGGT 3 4 20 

TGATAGAAGA 34 80 

CTCTTTTTAG 3 54 0 

TACAGGTGGC 3 600 
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AGTACGATTT ATTTTATTGC TTAAGATAGA TATAGGCATA *PFW=.\CTTG CTCTATTTAT 3660 

ATTGTTGTTG TTAAAATTTA GAACTAGCAA TGCGAATATA GAAAACTTCT CTTTCAAATA 37 20 

TTACAAGAGA GAAGTGTTGC AAGGTACAAA GTTTATTTTA AATAATAAA? TATTATTTAA 3 7 80 

AACCAGTATT TCTTTAACGC TTATAAACTT TTTTTATTCA TT7CAGACAC TAGTTGTACC 3840 

GATTTTTTCT ATTCGATATT TTGATGGTCC GATTTTTTAT GGTATTTTTT TAACTATTGC 3900 

TGGTTTGGGT GGTATATTCG GAAATATGCT AGCGCCAATC GTAATAAAA? ATTTAAAATC 3 9 60 

GAATCAAATT GTTCGTGTAT TTCTTTTTTT GAACGGCTCA AGTTCGTTAC TAGCAATTGT 4C20 

TATAAAACAC TATACTTTAT CACTTATTTT ATTTTTCGTT TGTTTTATGT CTAAAGGAGT 4080 

CTTCAATATT ATTTTTAATT CGTTGTACCA ACAAATACCT CCACATCAAC TTCTTGGTAG 4140 

GGTAAATACT ACCATTGATT CTATTATTTC TTTTGGAATG CCAATTGGTA GTTTAGTTGC 4 200 

AGGAACGCTT ATTGATTTCA ATATTGAATT AGTGTTAATT GCTATTAGCA TACCTTATTT 4 2 60 

TTTGTTTTCT TATATTTTTT ATACGGATAA TGGATTGAAA GAATTTAGTA TATATTAGAA 4 3 20 

ATGTTTATGT TCATTCAAAA GCATAATGAC TATAACTGAA AAAGAAAAGT GATATCTTTA 4 3 80 

AGGTTGTTCT TCTTGGTGGT GAGATTCCTG AGACAACCCA AGCTTTTGTC GGAAAGATTA 4440 

CCAATGCTTT GATGGATAGG ATGTACTTTA GCAACATGTT TTTAGTCGTA ACGCTATCGT 4 500 

CGATCGACGT CTAATAACCT CTTCTTTCGA GGAGTATTTT ACTAAAAAAC TAGCCTTGGA 4 5 60 

GCGTTCCCCA GAAACGGACT TACTCATTCA CTCTTCAAAG ATTTGGGGAG AAGATTTTGC 4 620 

TTCATCTGTT CCTTGAAAAA AGTCACAGCA GTCATCACAG ACGATAGTAC TGAACAAAAC 4 680 

TATGAAGAGT TAGAAATTTA TACCCAGCTG ATTCTATAAA GGATCTGGAA ATAGATAAGA 474C 

AGTTGATTAG TATTGACCTA GGTGGTACAA A7A7TAACA? TACTGTTCTT TCAAATGACG 4 800 

GTCAGATTGA AACTTTGTGG AGTATTACAA CAGATACAAG TGAGAAAGGT TCTCAAATTA 48 60 

TATCGGACAT CATCAGTTCT ATTAAAAATA AATTGACCGA ACGGAATATT CCTCATAGCC 4 920 

ACCTTCTTGG AATCGCTATG GGAAGTTGCT CATCATACTT TCCTTGTAAA 7CATAGGGGC 4 980 

TATAAACTCT CCGTCTACTT GTCCTGCAAC AATTGAAGTC TGCTCAAAAC GCCGTCCGCT 3040 

AATCTTTTCA TAGACTTTCT CCCTTTTAGG AGCCTAGCTT TCTAGTTTGT TCTTTGATTT 5100 

T7ATTGAGTA TACCACTATT TTACTCCCTC TGGCAAGGGA CTTTGTCTAT GTGGAGGGAT 5160 

TGGGCTCCTA TGTGGTGGAG CTTTTCTGTT CTTTCTGAAA TATGGTATAA TAGCACTAAT 5220 

CAATTTCTAG GAAAATAGAT ACAGAAAGGG CCTCAAACA? CTCTCATAT? ATTGAATTGC 5 280 

CAGACATGCT GGCAAACCAA ATCGCGGCTG GAGAGGTCAT TGAACGTCCT GCCAGTGTGG 534 0 

TCAAAGAGTT GGTAGAAAAT GCCATTGACG CGGGCTCTAC TCAGATTATC ATTGAGATTG 54C0 
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AGGAAGCTCC TCTCAAGAAG GTTCAAATCA CGGATAACGG TCATCGAAT7 GCCCACGATG 54 60 

ACGTGGAGTT GGCCCTGCGT CGCCATGCGA CCAGTAAGA7 AAAAAATCAA CCAGATCTCT 5520 

TTCGGATTCG GACGCTTGCT TTTCGTGGTG AAGCCTTCCC TTC7ATTCCG TCTGTTAGTG 5530 

TCTTGACTCT G7TAACGGCG GTGGATGGTG CTAGTCATGG AACCAAGTTA GTCGCGCGTG 564 0 

GGGGTGAAGT TGAGGAAGTC ATCCCAGCGA CTAGTCCTGT GGGAACCAAG GTTTCTGTGC 57 00 

AGGATCTCTT TTTCAACACG CCTGCCCGTC TCAAGTATAT GAAGAGCCAC CAAGCGGAGT 5*760 

TGTCTCATAT CATTGATATT GTCAACCGTC TGGGCTTGGC CCA7CC7GAC ATTTCTTTTA 5820 

GCTTGATTAG TGATGGCAAG GAAATGACGC GGACAGCAGG GACTGCTCAA TTCCGCCAAG 5880 

CAATCGCAGG GA7TTACGGT TTGG7CACTC CCAAGAAGAT GATTGAAATT CAGAAC7C7G 5940 

ACCTAGATTT CGAAATTTCA GGTTTTGTGT CCTTGCCTGA GTTGACTCGG GCTAACCGCA 6000 

ATTATATCAG CCTCTTCATC AATGGCCGTT ATATTAAGAA CTTCCTGCTC AATCGTGCTA 6060 

TTTTCGATCG TTTTGGAAGC AAGCTTATCG TTGGACGTTT TCCACTGGCT GTCATTCACA 6120 

TCCATATCGA CCCTTATCTA GCGGATGTCA ATG7GCATCC AACTAAGCAA GAGGTGCGGA 6180 

TT7CCAAGGA AAAAGAACTG ATGACTCTGG TTTCAGAAGC TATTCCAAAT ACTCTCAACG 6240 

AACAAACCT? GATTCCAGAT GCCTTGGAAA ATCTTGCCAA ATCGACCGTG CCCAATCGTG 6 300 " 

AGAAGGTGGA GCAAACTATT CTCCCACTCA AAGAAAATAC CCTC7AC7AT CAGAAAACTG 6 3 60 

ACCCGTCAAG ACCTAGTCAA AC7GAAGTAG CTCA7TA7CA GGTAGAATTG ACTGATGAAG 64 20^ 

CGCAGGATTT GACCCTGTTT GCCAAGGAAA CCTTGGACCC A7TCACCAAG CCAGCAAAAC 6480 - 

TGCATTTTGC AGAGAGAAAG CCTCCTAACT ACGACCAGCT AGACCA7CCA CACTTACATC 6540 

7TCCTAGCAT CCATAAGGCT TATGACAAAC TGGAGCGAGA AGAAGCATCC AGCTTCCCAG 6600 

AG7TGGAGTT 7TTCGCACAA ATGCACGGGA CTTA7CTCT7 TGCCCAAGGG CGAGATGGAC 6660 

TTTACATCAT AGATCAGCAC CCTGCTCAGG AACGCGTCAA CTACCACCAG TACCGTGAAA 67 20 

GCATTGGCAA TGTTGACCAA AGCCAGCAGC AACTCCTAGT GCCCTATATC TTTGAATTTC 67 80 

CTGCGGATGA TGCCCTGCGT CTCAAGGAAA GAATGCCTCT CTTAGACGAA GTGGGCGTCT 684 0 

TTCTAGCACA GTACGGAGAA AATCAATTTA TTCTACGTGA ACA7CCTA77 TGGATGGCAG 6900 

AAGAAGAGAT TGAATCAGGC ATCTATGAGA TGTGCGACAT GCTCCTTTTG ACCAAGGAAC 69 6C 

TTTCTATCAA GAAATACCGA GCAGAGCTGG CTATCATGAT GTCTTGCAAG CCATCTA7CA 7020 

AGGCCAATCA TCGTATTGAT GATCATTCAG CTAGACAAC? CCTCTATCAG C77TC7CAA7 7 0 80 

GTCACAATCC CTATAACTGT CCTCACGGAC GTCC7C77TT CGTGCA7777 ACCAAGTCGG 7 140 
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ATATGGAAAA GATC7TCCCA CGTATTCAGG AAAATCACAC CAGTCTCCGT GAGTTGGGGA 7 200 

AATATTAAAA GTATAAAAAA GTCTGGGAAA AATTTTCAAA ATCAAAAAAA CGCATAAAAT 7 260 

CAGGTCTTCA AAAACCTTGA TTTTA7GCGT TTTATCATGG AAATAGTTAC TTCATTTTTT 7320 

CCTAATTCTT TTCGAAACTC TTTTTAAACG ACGTCAGTTT TATCAGTAAT CTCAAAACAG 7 380 

T3TTTTGACC TAATTTTGCC AGTTTTGTCT GTAACATCGA AGTTCTGTTT TACCACTCTG 7440 

CGACTGGTTT CCTAGTTTGC TCTATGATT? TCACAGAGCA TTAAATTGCG ATTTTGCCAA 7 500 

C7TTCTTTAT TCGTCTAAAA GTAGAG7CTG TTCTATGCGT CTAATGTACG AATCAGGTTG 7 560 

ACCATTTCAA TAGCTCCTTC TGCACACTCA GAACCCTTAT TTCCTGCTTT AGTACCAGCT 7620 

CGTTCTATGG CTTGTTCAAT TGTATCTCTC GTTAGCACAC CAAACATAAC AGGAATTTCG 7 680 

CTATTTAAAC TGATTTGGGC GATTCCCTTA GATACCTCGC TACATACATA ATCATAATGA 7740 

CTTCTATTCC CTCTAATGAC AGCTCCCAAG CAGATAATTG CATCATATTT TTTACTTTTT 7 800 

GCCATTTTTG ATGCAATCAG TGGTATTTCA AAACCTCCTG GAACCCAGGC TACCTCTATA 7860 

TCTTTCTCCT TTACATTCTC TCTTTTCAGA TTATCTAGTG CTCCAGATAA TAATTTTGAA 7 920 

GTTATAAATT CATTAAATCT CGCTACAACA ATACCTATTT TAATATTGTT TGC TACT AAA 7980 

TTACCTTCAT AAGTGTTCAT TTATTTTTCC TCCATATTTA AAATGTGACC CATTCCATTT 8040 

TTCTTTGTTT CTAAATAAAA ACTATCGTAA GGATTGGCTT CTATTTCGAT TCATATTCTA 8100 

CTGGAAATGG TAATTCCATA TTTTTCTAAC TGTTCAACCT TGTCAGGATT ATTTGTCAGT 8160 

AAATGAACTG ACTGAAGTCC CAGATCTTTA AGCATTTTTG CTCCAATATC ATATTCTCTT 8220 

AAATCACCTT CAAAGCCTAA TGCAAGATTG GCATCAAGCG TATCCATGCC T7CATCTTGT 8280 

AAATGATAGG CTTTTAATTT ATTGATAAGT CCAATTCCTC GTCCCTCCTG TCGCAAGTAA 3 3 40 

AGTAAGACAC CCGAACCATT CTCAACAATC A7TTTCATAG CCTTATCGAA TTGCTGTCCA 8400 

CAATCGCAAC GTAAAGACCC TAAAACATCT CCTGTTAAAC ATTCGGAGTG GACCCCACAT 84 60 

AATACATTGG CTTCATCCTC TATATTTCCC ATAATAAGAG CAAGATGATC TTCCCCATTT 8520 

AGTTTATCTA TATAGCTAAT TCCTTTGAAA TTACCGTATC TAGTAGGCAT ATTGACAGTT 8580 

GAAACTCGTT CTACCAGCTG ATCATATAC? TTTCTATATT CTTCTAATTC TTTCATGCTA 8640 

ATTAGTGGAA TGTTGTGTTT TTTCGAGAAC TGAATTAAAT CATCTGTTCT CATCATTTTG 87 00 

CCATCATGAT TCATTATTTC ACAACATAGG CCACACTCTT TTAGTCCAGC TAATTTTAAT 8760 

AAATCAACAG TTGCTTCTGT GTGTCCATT7 CTTTCTACGA CACCACCTTT TTTTGCAATT 9320 

AAAGG AAAC A TGTGTCCTGG CCTGCGAAAA TCAGAGGGTG TTATATCTTC AGCTACACAC 38 8C 

ATACGTGCGG TCAGTCCTCT TTCCTCGGCA GAAATACCTG TGGTCGTTTC TTTATAATCA 8940 
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ATTGAAACTG 


TAAAAGCACT 


CTTATGATTA 


TCTGTATTGT 


TTTCAACCAT 


AGGTG AAAC C 


9000 


ATTAATTGAT 


TAGCTAAACT 


TTCGCTCATA 


GGCATACAAA 


TTAATCCTTT 


CG C AT AAGT A 




GCCATAAAAT 


TAACATTTTC 


7GTTGTAGCT 


GCTTCTGCAG 


AACAAATTAA 


GT f"* T f* P*T**TV A 


9 l :?n 

7 l^U 


TTTTCTCTAT 


CCTTGTCGTC 


TATAACAAGA 


ACAAGTCGTC 


CCTTCTGCAA 


TGCTTCTAAT 




GCTTCTTGTA 


TTTTTCGATA 


TTCCATTGAC 


TGATTATCCT 


TTC^GCTAAA 


*V i ^— /A i 4 i L \J 


9 240 


ATATAATAGT 


TCCTTAGATA 


TTT CTG ATTT 


TGGAGAGTTA 




* « i vjWaL.a* n 


Qinn 
j J uu 


TTTACCTAAG 


ATATCATTTT 


CAAGATTTAC 


TGTACTCCCG 


AC^TGTTTAC 


TfTTAAGAAT 


9 3 60 


GGTTTGTTCC 


AAGGTATGAG 


GGATAACAGA 


TACTCAAAAG 


TTTACTTTCC 

4 4 4 1 4 4 VjVj 


ACAPTTTAr.'* 


Q d 0 O 


GACAGTCAGA 


CTAATGCCGT 


CAATTGTAAT 


AGATCCTTTT 


TCAACTATTA 


AA^fTAAAAT 




TTCTTTTTGT 


GTGTTGATTT 


GATACCATAC 


AGCATTATCA 


7CTTTT T A 


TTGACGAGAT 




TTTTCCTGTA 


CCATCAATGT 


GTCCTGTAAC 


GACGTGACCC 


CCAAGTCCAC 


CGTTflAPAnA 


9600 


TAAGGCTCTT 


TCTAGATTCA 


CCTCACTTCC 


ATGTTTTAAT 


AGAGTAAGAC 




jQOU 


CCATGTTTCA 


TTCATTACAT 


CAACTGTAAA 


GGATTGATGA 


T^GAAATGAC 




Q 79 n 


ACAGATACCA 


TTTACTGCTA 


TACTATCGCC 


TAAATGGATA 


TCCGTTAATA 


* * 4 t ivjnuvc 




TTTAATTGAT 


AGTTTACAAT 


TACGAGAGTC 


TTTCTGTATT 


CTTTCAACTT 


TT CCCl A TTTC 


9840 


TTCAATTATT 


CCTGTGAACA 


TGGATAAATC 


ACTTCACTTT 


CTATGAGATA 


GTCATTTCCT 

4 ^ i» 4 4 1 X>W 4 


9900 


ATTTGAGAAA 


ATCCATAAGG 


TTTCAATCTA 


ATACCGTCAT 


TTGG C AAAGA 




9960 


CCTCCGACAG 


GAAACTTGGC 


ACTACCTCCA 


AAAACTTTTG 


GTGCAATA^A 


TA'^TTCACC 


10020 


TCATCAACAA 


TTTGTTGTTC 


CAAACCACTC 


CAATTCATTA 


GACTGCCCCC 


TTCTAGAACT 


looao 


AGGCTATCAA 


TCTGCATGTT 


TCCTAGATGT 


TGCATTAAAC 


TCGATAAGTC 


TAT ATC ATTG 


10 140 


CCTTTTTTCT 


TTATGGAAAG 


TATTTCACAG 


CCATGATTTT 


GATATAGCTT 


CATTTTATTT 


10200 


TTGTCTTCAG 


AGGAAGTGGC 


AATGTAAGTT 


TTAATATCAT 


TTGCTGTTTT 


TACGATTTTA 


10260 


GAGGTAAGAG 


GAGTTCGTAA 


ATGTGTATCG 


CATATGATAC 


GGATAGCATT 


TTTCCCTTCC 


10320 


TCCAATCTAC 


ATGTCAGCAA 


AGCATCGTCT 


TCAATAACAG 


TATTGACTCC 


CACCATAATT 


1039C 


GCACTAACAT 


GGTGTCGTAA 


CTGATGCACA 


TGCTTTCTTG 


CTT CTT CTT C 


AGTAATCCAT 


10440 


TTGGATTGAT 


TTGTTTTAGT 


GGCTATTTTT 


CCATCCATTG 


ACATTGCATA 


TTTCATAAAA 


10500 


ACATAGGGTA 


CATGCTGGGT 


AATATACTTT 


CTAAAACTTT 


TTATTAAGTT 


AACACACTCA 


10S60 


TTTTCTAAAA 


TTCCAACAGT 


AACTTGAAGA 


TTATTTTCCT 


CAAGTATCTT 


TACTCCTTTT 


10620 


CCAGATACAA 


TACGATTACA 


GTCTAGGCTT 


CCAATGACTA 


C7CTTGTAAT 


ACCACTATCG 


10630 
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ATT AT AG CAT 


CTATACAGGG 


AGGTGTTTTC 


CCGAAGTGAC 


AACAGCGTTC 


AAGTGTTACA 


10740 


TAAACCGTCG 


CTCCGACAGG 


GGATTCTCTA 


CAGTTTTTAA 


GAGCATTTCT 


CTCAGCATGT 


103C0 


GGGCCACCAA 


AAAACTCATG 


ATAACCTTGT 


CCGATAATG? 


GATTATCTTT 


TACAATAACT 


10860 


GCGCCGACCA 


TAGGATTGGG 


ATTGACGTAA 


CCAGCCCCTT 


TTTGTGCCAG 


TTTTATTGCT 


10920 


AATTTCATAT 


ATTTTGAATC 


GCTCATCTCG 


CTACCTCCAA 


AAAAATATAC 


CTTGAATAGG 


10980 


GGACTACTCA 


AGCCATACAA 


AAGAAAACTT 


ATGCGATTAA 


CAAAAATGCT 


CTGAAATGAC 


11040 


AAGTAATCAT 


TTCAGAGCAC 


GCAAAAACCA 


CAAATATAC7 


TTTATCTTCT 


TTCATCCAGA 


11100 


CTATACTCTC 


GGCTTTCGAA 


TTTCACCAAA 


TCATGCCTTT 


CGGCTCGTGG 


GCTATACCAC 


11160 


CGGTAGGGAA 


TTTCACCCTG 


CCCTGAAGAT 


AGTTATTCAA 


TTACAGATGA 


TTATAGTACT 


11220 


TAATTTTCAA 


TATGTCAACA 


GATAAATACC 


GATTCTTTTT 


CATATACTGT 


ATTTGTGATA 


11280 


ATCGATTCTC 


GCTCCTCGGA 


TAAAGAAAAT 


ATGATATACT 


AGATAAACGA 


AATAAGAGAG 


11340 


AAGGAATACT 


ATGTACGCAT 


ATTTAAAAGG 


AATCATTACC 


AAAATTACTG 


CCAAATACAT 


11400 


TGTTCTTGAA 


ACCAATGGTA 


TTGGTTATAT 


CCTGCATGTG 


GCCAATCCTT 


ATGCCTATTC 


11460 


AGGTCAGCTT 


AATCAGGAGG 


CTCAGATTTA 


TCTCCATCAC 


GTTGTGCGTG 


AGGACGCCCA 


U520 


TTTGCTTTAT 


GGATTTCGCT 


CAGAGGATGA 


CAAAAAGCTC 


TTTCTTAGTC 


TGATTTCGGT 


11580 


CTCTGGGATT 


GGTCCTGTAT 


CAGCTCTTGC 


TATTATCGCT 


GCTGATGACA 


ATGCTGGCTT 


11640 


GGTTCAAGCC 


ATTGAAACCA 


AGAACATCAC 


CTACTTGACC 


AAGTTCCCTA 


AAATTGGCAA 


11700 


GAAAACAGCC 


CAGCACATCG 


TGCTGGACTT 


GGAACGCAAG 


GTAGTAGTTG 


CAGGAGATGA 


11760 


CCTTCCTGCC 


AAGGTCGCAG 


TGCAAGCAAG 


TGCTGAAAAC 


CAACAATTCC 


AAGAAGCTAT 


11820 


GGAAGCCATG 


7TGGCTCTGG 


GCTACAAGGC 


AACACAGCTC 


AACAAAATCA 


AGAAATTCTT 


11880 


TGAAGGAACG 


AC AG AT AC AG 


CTGAGAACTA 


TATCAAGTCG 


GCCCTTAAAA 


TGTTGGTCAA 


11940 


ATAGGACCAG 


AGAATGACAA 


AACGTTGTTC 


GTGGGTCAAG 


ATGACCAACC 


CCCTCTACAT 


12000 


CGCCTATCAT 


GATGAGGAGT 


GGGGCCAGCC 


CCTCCATGAT 


GACCAAGTAT 


TGTTTGACTT 


12060 


GTTGTGTATG 


GAAACCTATC 


AGGCACGCCT 


GTCTTGGGAA 


ACGGTACTCA 


ACAAACGCCA 


.2120 


AGCTTTCCGA 


GAACTCTTTC 


ATAGCTATCA 


AATTCACTCA 


GTCGCAGAGA 


TGACTGACAC 


12130 


TGAATTGGAA 


GCCATGCTCG 


AGAATCCAGC 


TATCATTCGA 


AATAGAGCCA 


AGCTTTTTGC 


12240 


TACACGCGCT 


AACGCCCAAG 


CCTTTCTACA 


GTTACACGCA 


GAGTACGGCT 


CTTTTGATGC 


12300 


CTATCTTTGG 


TCTTTTCTTG 


AGGGGAAAAC 


TGTCGTTAAC 


GATGTTCCTG 


ATTATCGCCA 


12360 


AGCGCCAGCT 


AAAACACCCT 


TATCTGAGAA 


ATTAGCCAAA 


GATCTCAAAA 


AACGAGGCTT 


12420 


CAAGTTCACA 


GGCCCAGTCG 


CCGTATTGTC 


TTTTCTACAC 


GCTGCAGGGC 


7ACTTCATCA 


1248C 
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CCACGAGAAT GATTGTGAGT GGAAAGGTCT TAAATGATGT CTAACAAAAA TAAGGAAATT 12 540 
CTGATTTTTG CGATTCTCTA TACAGTCCTC TTTATGTTTG ATGCCC77AA ATTGCTGCCT 12600 
TCTTTAATGC CATCTCCCAT TGCAAATTAT CTTGTTTATG TAGTTTTAGC TCTATATGGC 12 660 
TCCTTCTTCT TCAAGGATAG ATTGATCCAA CAATGGAAGG AG ATT AG AAA GACTAAAAGA 12720 
AAATTCTTCT TTGGAGTCTT AACAGGATGG CTCTTTCTCA TTCTCATGAC TGTTGTCTTT 12 7 80 
GAATTTGTAT CAGAGATGTT GAAGCAGTTT CTCGCACTAC ATGCACAAGG 7CTAAATCAG 12840 
TCTAATATTC AAAGTACCTT TCAAGAACAA CCACTACTGA TAGCTCTTTT TGCT.TGTGTC 12900 

ATTCGACCTC TGGTAGAAGA ATTATTTTTC CGTCAGGTCT TATTCCATTA CTTCCAGGAA 129 60 

CGGTTGTCAG CTTTACTAAC CATTATTCTG GTAGGACTTG TTTTTGCTCT GACTCATATG 13020 

CACAGTTTGG CTCTATCAGA GTGGATTGCT GCAGTTGGTT ACTTAGGTGG ACCCCTTGCC 13080 

TTTTCTATTA TTTATGTGAA AGAAAAAGAC AATATCTACT ATCCCCTACT TGTTCACATG 13140 

TTAAGCAACA GCCTCTCCTT AATCATTTTA GCTATCAGTA TAGTAAAATG AAATGAGAAC 13 200 

AGGACAAATC GATTTCTAAC AATGTTTTAG AAGTAGAGGT GTACTATTCT AGTTTCAATA 13 260 

TACTGTAATA TGTGATGAAA ATGCCAGTAA TGATACCGAC AAAAAAGCTG ACAAACTTTT 13 320 

CCCAGCTTTA TTTGTTATAG TCAAAGAGAA TGACTTGTTC CTGTGCATCT ACATGAGCAT 13 380 

GGACCCCAAA GGGTACAATT GCTCTTGGAC TTGCGTGGCC GACATTCAGA TTATAGACAA 13 440 

TCGGGATATT GCTGTCAATG ATATCCAATA GTGCCTCTTT ATAC7CCTCA TGGAAAGTTT 13 50C 

CATCCATAGG TTTTCCGACC AAGAGTCCAT TGATGACCGC GAATATGCCA GTGTCCTTTA 13 56C 

AAGTTAGCAA CATCTTTTTG AAGTCTTCTG GCTTACCCTT TTCTTCGCTT GTTTCGACCA 13620 

AGAGGATTTT CCCTTCTCAC TCTCAC.VxG? CAOGGAAAAC A -~— T^OCACAGTT 13 530 

CCGTGCTATC TGCGTATCGA GACTTGTCAA AGATATCGTA GAGGGATTCG AGGCAACCAC 13 740 

CCACGATTTT CCCCTCGAAC TGGCCACTTC CTTGCAACAA GTCAAAACCT GTATTTGTAT 13 800 

GACTGACACG AGGTGTTCCC AGGGCCGTGG GACTAAAATC AGTTCGTTCC TCATACCAAA 13860 

CGTCACTAGG GCGGATTTCT GAAATTCTTC CCGTCTCAAT CAATTCTTTA AAGTAGTGAA 13 920 

GGCTATAGGC TACCATTTCT TTGTCTAATT CACAAATGTC TGCT AAAAAG GATTGACCAT 13 980 

AAAAACTCTT GATTCCTAAT TTATGCAACA TGAGGTCGTT CATGCTTGTA TCCGAGAAGC 1404 0 

CAAGAAAAAT TTTTTCCTTG ATAACCTTTT GGAGTTGGTC ATTTTCAAAA AGATAAGGTA 14100 

GCAAGCGATA GCTATCGTCT CCACCGATGG CACATAGGAT CATGTCGATG C7 AT CATC AC 14160 

AAAAGGCATG AATCAAATCC TCTGCACGAG CTTCAGCATC GTCCTTGATA AAGTCTAATC 14220 
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CTTTTAACGA ATGGGGCAAA AAGATCCGAT TCGTCCCAGA TCCTTCAGAC GTT 14 273 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GTGAAGTGCG GCAAAAGGTG CAAGTGATGA GCTCAGGTTC TTTAGCTCTT GACATTGCCC 60 

TTGGCTCAGG TGGTTATCCT AAGGGACGTA TCATCGAAAT CTATGGCCCA CAGTCATCTG 120 

GTAAGACAAC GGTTGCCCTT CATGCAGTTG CACAAGCGCA AAAAGAACGT GGGATTGCTG 130 

CCTTTATCGA TGCGGAACAT GCCCTTGATC CAGCTTATGC TGCGGCCCTT GCTGTCAATA 240 

TTGACGAATT GCTCTTGTCT CAACCACACT CAGGAGAGCA ACCTCTTCAG ATTGCGGGAA 300 

AATTGATTGA CTCAGGTGCA GTTGATCTTG TCGTAGTCGA CTCAGTTGCT GCCCTTCTTC 3 60 

CTCCTGCCCA AATTGATGGA GATATCGGAG ATAGCCATCT TGGTTTGCAG GCTCGTATGA 4 20 

TGAGCCAGGC CATGCCTAAA CTTGGCGCCT CTATCAATAA AACCAAAACA ATTCCCATTT 4 80 

TTATCAACCA ATTGCGTGAA AAAGTTGGAG TGATGTTTGG AAATCCAGAA ACAACACCGG 540 

GCGGACGTGC TTTGAAATTC TATGCTTCAG TCCGCTTGGA TGTTCGTGCT AATACACAAA 600 

TTAAGGGAAC TGGTGACCAA AAAGAAACCA ATGTCCGTAA AGAAACTAAG ATTAAGGTTG 660 

TAAAAAATAA GGTAGCTCCA CCGTTTAAGG AACCCCTACT TGAAATTATG TACCGACAAC 720 

CAATTTCTAA GACTOGTGAG CTTTTGAAGA TTGCAAGCGA TTTGGATATT ATCAAAAAAG 7 80 

CAGGGGCTTG GTATTCTTAC AAAGATGAAA AAA7TGCCCA AGGT7C7GAC AA7GCTAACA 840 
AATACTTGGC AGAGCACCCA GAAATCTTTG ATGAAATTGA TAAGCAAGTC CGTTCTAAAT 



900 



TTGGCTTGAT 7GATGGAGAA GAAGTTTCAC AACAAGATAC TGAAAACAAA AAAGATGAGC 9 60 

CAAAGAAAGA AGAAGCAGTG AATGAAGAAG TTCCGCTTGA CTTAGGCGAT GAACTTGAAA 1020 

TCGAAATTGA AGAATAAGCT GTTAAAGCAG TCGACAAATC CGCTACTTTT TCGATTTTTG 1080 

ATTCAAGTTT TT AG ATT AT A TATAGTAGCT TGAAATAAGA TATGAACAAC TCTATTAGGA 114 0 



1200 
1260 



AAGTCAAATT AATTTCTAGA AATCTTTTAG CAGCTACAGC GTACTATTCC AAACTCAACC 
AACTATAATA GATCGAAACT AGAATAGTAC ATATCTACT7 CTAAAACATT GTTAAAAATC 

CATTTCACTT 7CC7TA7TTC ATTCCGCTAT ATATAGTTTG CTGTTTCTTG TCGCTCCTCT 13 20 

GGAAAGCTGA TATAATACCT TTATGAATAA AAAACCAACA GTCGACCTGA TACATGGTCC 13 80 
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CATTCTTCCC TCGCTCTTAA GCTTCACCTT TCCAAT7TTC CTATCAAATA T7T77CAACA 14 40 

GCTCTATAAC ACTGC7CATG TCTTGATTG7 TGGACGATTT CTTGGTCAAG AATCCTTGGC 1500 

TGCAGTAGGA GCGACGACAG CGATTTTTCA CCTGATTGTA GGTTTTACAC TTGGTGTTGG 1560 

CAATGGCATG GGGATTGTCA TTGCTCGTTA TTATGGGGCT CGCAATTTCA CTAAAATCAA 16 20 

CGAAGCAGTA GC AGCCACCT GGATTTTAGG TGCTCTTTTG AGCA77CTAG TTATGTTGCT 1680 

GGGCTTTCTT GCC7TC7ATC CTCTCTTGCA ATACTTAGAT ACTCCTGCAG AAATTCTTCC 1740 

TCAATCTTAT CAATATATTT CTATGATTGT GACCTGTGTA GGTGTCACCT TTGCTTATAA 1300 

TCTTTTTGCA CGC7TG7TGC GGTCTATTGG TGACAGTCTA GCAGCCC7CC GATTTCTGAT 1960 

TTTCTCTGCC TTGGTTAATG TGGTTCTGGA TCTCTATTTT ATTACGCAAT TCCATCTGGG 1920 

AGTTCAATCC GCAGGACTTG CTACCATTA? TTCGCAAGGT TTATCAGCGG TTCTCTGCTT 1980 

TTATTATATT CGTAAAAGTG TGCCAGAACT CTTGCCACAG TTTAAACATT TCAAATGGGA 2040 

CAAAAGC7TG TACGCCGATC TCTTGGAGCA AGGTTTGGCT ATGGGCTTGA TCACTTCAAT 2100 

7CTATCTATC GGCAGTGTGA TTTTACAGTT TTCTGTTAAT ACATTTGCTG CAGTGATTAT 2160 

TAGTGCCCAG ACGGCAGCTC GACGCATTAT GACCTTTGCC CTTCTTCCTA TGACCCC7AT 2220 

TTCTGCATCA ATGACGACCT TTGCTTCTCA GAATCTAGGA GCTAAGCGAC CTGACCGTAT 22 80 

TGTTCAAGGT CTTCGAATCG GCAGTCGTTT AAGTATATCC TCCCCAGTTT TTGTTTCTAT 2340 

TTTCCTCTTT TTTGCCAGTC CAGCTTTGGT TTCCTTCTTG GCTAGTTCCA CAGATGGTTA 2400 

CTTGATAGAA AATGGAAGTC TCTATCTGCA AATCAGTTCA ACCTTTTATC CCATTTTGAG 24 60 

CCTC77G7TC AT77ATCGCA ATTGCTTGCA GGGCTTGGGG CAAAAGATCC TTCCTCTAGT 2520 

T777AGC7TT A7TGAAC7AA TCCGAAAAA7 CC7T7TTCTC 3~7' T ~.\ 77 A 77C777GGGC 2530 

AGGATATAAG GGTGTTATCC TTTGTGAACC TC7TATCTGG GTTGCCATGA CAGTTCAACT 264 0 

G7ACTTCTCA T7ATTCCGTC ATCCCTTGAT AAAAGAACGC AAGGCAATCT 7GCCAACCAA 27 00 

AC7CCAATCC TAGTTGGATT TACTGAATAA AATCCATTTC CTC7AGTGAA AA7CCAAAAA 2760 

ACTTGTGTTC TCTTCTTTAG TTTGG7GTTG AAAA7AG7TT AACAGAC7T7 TGAC7TC7T7 2920 

TA7ATGATA7 AA7AAAGTAT AGTATTTATG AAAAGGACAT ATAGAGACTG TAAAAATA7A 2380 

CTTTTGAAAA TCTTTTTAG7 CTGGGGTGTT ATTGTAGATA GAATGCAGAC CTTG7CAGTC 2940 

CTAT77ACAG 7G7CAAAA7A G7GCCTTTTC AAGTTCTATC TACAAGCC7A ATCCTCACTA 3000 

AGATTGTCTT C777G7AAGG 7AGAAATAAA GGAGTT7CTG GTTCTGGAT7 G7AAAAAATC 3060 

AGT7GTTT7A A7TGATAAGC AGTAGAATAT GGAAATTAAT GTGAGTAAAT 7AAGAACAGA 312C 
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TTTCCCTCAA GTCGCCCTGC AACCATATAG GCAACTACAC GC AC ACT CAA CTGGGAATCC 3 130 

GCATTCAACC GTACAGAATG AAGCGGATTA TCACTGGCGG AAAGACCCAC AATT AGG TT7 3 240 

TTTCTCGCAC ATTGTTGGGA ACGGTTGCAT CATGCAGCTA GGACCTGTTG ATAATGGTGC 330C 

CTCGGACGTT GGGGGCGGTT GGAA7GCTGA GACCTATGCA GCGGTTGAAC TGATTGAAAG 3 360 

CCATTCAACC AAAGAAGAGT TCATGACGGA CTACCGCCTT TATATCGAAC TCTTACGCAA 3420 

TC7AGCACAT GAACCAGC7T TGCCCAAAAC GCTTGATACA GGGAGTTTAG CTGGAATTAA 34 80 

AACGCACGAG TATTGCACGA ATAACCAACC AAACAACCAC TCAGACCACG TTGACCCTTA 3540 

TCCATATCTT GCTAAATGGG GCATTAGCCG TGAGCAGTTT AACCA7CATA TTGAGAACCG 3 500 

CTTGACGATT GAAACAGGCT GGCAGAAGAA TGACACTGCC TACTGGTACG TACATTCAGA 3 660 

CGGCTCTTAT CCAAAAGACA AGTTTGAGAA AATCAATGGC ACTTGG7ACT ACTTTGACAC 37 20 

TTCACCCTAT ATGCTTGCAG ACCCCTGGAG GAAGCACACA GACGGCAACT GGTACTGGTT 3780 

CGACAACTCA GGCGAAATGG CTACAGGCTG GAAGAAAATC CC7GATAAGT GGTACTATTT 3840 

CAACGAAGAA CCTCCCATGA AGACAGGCTG GGTCAAGTAC AACGACACTT CGTACTACTT 3 900 

AGACGCTAAA GAAGGCGCCA TCGTATCAAA TCCCTTTATC CAGTCAGCGG ACGGAACAGG 3 960 

CTGGTACTAC CTCAAACCAG ACGGAACACT GGCAGACAAG CCAGAATTCA CAGTAGAGCC 4020 

AGATCCCTTC ATTACAGTAA AATAATAATG GAATGTCTTT CAAATCAGAA CAGCGCATAT 4080 

7ATTAGGTCT TGAAAAAGCT TAATAGTATG CGTTTTCTTG TGGAGATATT TCCTTCAATT 4140 

TTGCTACTAT ATTAAACAAA AA7CAAAAAG CAAAC7AGAA ACTTATCCTC AAATAAAATC 4 200 

TAAA7TTGAC AATGTAAACC GAGTCGGATA CCTTTAACTA CTGTTTTGAG GTTGAAGATA 4 2 60 

CGA77T77CA TAGGAACTCA TCAATTTTAG ATTTTTAACC AGCATCAATA AATTGCTTCC 4 3 20 

TTGTTTTGTC ATAATTTTTT TATTTAAAAA ATTATCACma GAGTGTGCTA 7TCTT7TTAT 4 380 

GAGAGGTGTA TGAATATGAT AAATG7ATGT GATAAA7GTA 7GTCATCTTG GAAAAACAAT 4 440 

AAAAGAACTT AGAATATCTT CAAATC7TAC TCAAGATAAG ATTCCTGAGT ATTTCTCTVT 4 500 

GAATCAAAGC ATGA7TGCCA AAATGGAAAA AGGTGAAAGG AATATCACGA ATGGATTTAA 4 560 

G7AATAAAGC TTCAAATCTT AGAAAAAAGT 7GGGAGCTGA TGGTGAATCG CCGATAGATA 4 620 

TT7TTAAATT GCTACAAAAG ATAGAAAATT TGACGCTCG7 ATTTTATGCA C7CGGAAAGA 4680 

ATATTAGCGG AGTCTGTTAT AAAGGAACTC AGTTCAGTCT CATTGCAGTC AA7TCAGACA 4740 

7GCCATTAGG AAGGTAAAGA TTTTC7T7AG CACATGGACT G7ATCA7CT7 TA7TA7GATC 4800 

AGGTGAAGAA GAGTTCAGTC AG7CTTATCT TGA7TGG7CA AGGAGA7GAA ACTGAAAGAA 4 86G 

AAGCGGATCA GTTTGCTTCT TATTTTTTAA 7777CCCA7C TTCACTG7A7 AGGATGGT7G 4 92C 
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AGGAAATCAG AGAAAATGCC AATAGAACTC ATCTTCAACT AGAAGATATT ATAAAATTGG 4980 

GTCAGTTTTA TCGTATCAG7 CATAAAGCTA TGTTATATAG ATTGAGGAAT GATGGATACC 5040 

TTGATGCAGA AGAAATTAAA AATATGGATA TTAGTGTTAT ACAGACACCT TCAAGATTAG 5100 

GCTATGATAC AAGTTTATAT CGTCCTTTGT CAGAAAGTAA AAAAGAAATG GCATTAGGAT 5160 

AATATATTAA TTCAACTGAA CAACTTTTAG AAAATAACAG AATTTCGCAA GGCAAGTATG 5220 

AGGAACTGTT ACTACATCCT TTCAGATATC ATATTGTATA TGGGCTAGAT GAAGAGGGGG 52 90 

GACTTGTCGT TTGACTAGTC GTGTATTTAT TGATGCAGAT TGTATTTCAG TATTTTTATG 5340 

GGTTGGCACT GAACATCTTT TAGAAAAGCT CTATTTGGGT AAAATTGTTA TTCCACAAGA 5400 

GGTGTATGAT GAAATCAATA TACCTACAAT TCCCCATTTA AAATCTAGGA TAGATCAGTT 54 60 

GGTAGCTAAG GGTTCAGCTG AGATTGTGAG CATAGACATT GGAACTGAAG AATACGCATT 5520 

ATATAGAGAT TTAACAAGAA ATCATGATAG TAACAAGATT ATTGGTAAGG GAGAAGGGGC 5580 

ATCTATTTCC TTAGCGAAAA AGCATAATGG GATATTACGA AGTAATAACC TAAGAGATGT 564 0 

TAAA7CATAT GTAGAAGAAT TTTCTTTAGA ATATATGACA ACAGCAGATA TACTGATTGA 5700 

AGCGTT7AAA GCGTAATTTA TTACTGAATA AGAGGGCAAT CATATCTGGA ATAATATGCT 57 60 

TAAAAAGAGA AGGAAAATTG GTGCAAATTC ATTTTCAGAC TATCTTCGTG GAAGTATTCA 5920 

TCAAAATAGA CAAAAATAAA TTTGGATAAA TCGAACTCAC TATTCAGGAG GCATATGAGC 5980 

AATTCGAAAA AGAAAAGTGT CAAATTGAGC CTATAGGAGT AGAAGTGAAA TAGTAAGTCC 594 0 

TCCA7AGTCG ATGAGAGAAA AGTTCTCCTT GAACTTTTCC TGAACTATCA GTCGCATGTC 60C0 

AAACGATATG TACCG7AATG TGAGACGGGA TAGCGAGTAG TTTTTGGTTA TTTTA7CAAA 5060 

AAAC7TA7.\T TTTA7TA7AC CCAA7CA7AA AA7A7AA7AA AAA A 7, \; A A7AACGAAAA rT.:C 

AACATGAATG TCAAAAAGAT AATGTCAATT TTTCAATCC7 7TTATGTTCA TGTCAGTATT 6180 

GAGGAACTGA CTTTGACTTT ACCAATCAGT TTTGTAAAAA GGTTTGAGTA TACTCAAATG 62 40 

ACTTTTCATA AGGAATCATT TTTATTGATT AAAGAAAAGA GAAGGGGGAG TTTGAGTTCA 6300 

TTTGTTACTC AGGCTCGCAC TATGGGTGAA AAAGCCAATA TGGATGTTCT TTTGGTCTTT 6 3 60 

TCGAAGTTAT CAGACAGTGA AAAAAAGCAA TTACTTCAAG C7AGAGTTCC GTTTGTAGAC 64 2C 

7TTAAGGGAA ACCTCTTCTT CCCTCCATTG GGACTAGTAC TCAATGCGAA TGA7ACTGAA 64 80 

GTCCCTAAGG AATTAACACC TAGCGAACAA TTAACGTGGA TTCCCTTTTT ATTGACAAAA 6540 

GC7CAAAAAG TACTAGATGT TGATTTGCTT TCACAAGTCA CTCCAC7TCC AAACTCAACA 6600 

ATTTA7AGGT GTTTGAGGAC TTTTAAAGCT TTATAT7CCT 7AAACAAGCA AAATAAGCTT 6660 
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T AC AC AT AT A CGGTGTCAAA CAAAGAATTA TTCTTAAAAT CCGTGTCATG TTTATTTAAT 6"? 20 

CCCATCAAAA AACGGATTTT ATTGCCAGAT GGCGATATAA AGCAGATAAA ATCTGTTTCT 6780 

AACCTTCTAT ATGGTGGTGC TTATGCTTTG TCGCATTCAA CTTTTTTAGC TGAAACGCAT 6840 

GAAAATATTA GCTATGTCAT ATGGCAGACA AAATTCAATC AGTTATCCTT GCCACTTTCT 6900 

CAGCATGTTT TAAAATGAAA GATGCTAGAG ATATGGAAAT ATCGTCCTTT TGTATCTGAG 69 60 

TTTTGGAATG ATTTTAAAAA TAATCATGAT AAACAATTTG TAGATCCGAT TTCTCTTTAT 7020 

TTCACCTTAA AAGATGATGA TGACCCACGT ATAGAGCAAG ACAGTCAACC ACTAGAAAAT 7 080 

ATGATATTAC AGTATCTGGG AGAAGATGAT GCCAGCTAAT ACGAAAGTTA TTTTTCAAGA 7140 

AATGTTTGCG CATTTTCAGA ACTATTATGT TCTGATTGGG GGAACTGCTA CCTCTATCCT 7 200 

ATTGCATTCG CAAGGATTTA AAAGTCGCAC AACAAAAGAT TATGATA7GG TCATCATTGA 7260 

TGAAGTAAAA AATAAGGAAT TT TAT ACTA C CTTGAATCAT TTTTTAGAAT TGGGAGAGTA 7 320 

TCAAGGAAGT CAGAAACATG AGAAAGCGCA GCTTTTTCGA TTTACAACAA CTAATCCTGA 7 3 80 

GTTTCCTTCT ATGATTGAAC TATTTAGTAT CTTACCAGAA TATCCATTAA AGAAGGACGG 7 440 

TCGAGAAATT CCCTTACATT TTGACCAAGA TCCTACTTTA TCAGCCTTAT TATTGGATCA 7500 

AGATTATTAT AATATATTGG TGCATGAAAA AGAAACCATT CAGCGGTATT CGGTATTGAG 7 560 

TAATTGTGGT TTATACTCTT CGAAAATCTC TTCAAACCAC GTCAGCTTCC ATCTACAACC 7 620 

TCAAAACAGT GTTTTGAGCA GCCTGCAGCT AGCTTCCTAG TTTCCTCTTT GATTTTCATT 7680 

GAGTATTAAT TATTTTTAAG CCTAAAGCTT GGCTGGATAT GAGGGAGCGC TCTGCCACAC 77 40 

GTGCTCAAGG TTTAAGTAAG TCC ATT AAAA ACCATTTGAA TGACCTTACC CGTTTGACAG 7 80C 

CTTCCTTGCT AGGAGATGAA AAGTTATCCG CTATAACATC AAGTAGTGCG GTAAAAGCAG 7860 

ACATGCACCG CTTTGTGATA GAATTAGAGC CTGTGAAGTC AACTATTCTT CAAAATAATG 7 9 20 

ACATTTCATT GGATCAAAAT GAAATTTTTG AAATTCTGAA AAATTTTCTC GATGGTTAAA 7980 

ATAATTGTAG CGAGATGGCT ATATTGAATT CGTCTA7ATC TGGAAACTAG AAAAAACTTC 804C 

AATTTCAGGA GAAAATGAAC TCAATCTTCC CACAATCAAA CGTATAGTAT CAAGGTTTTT 8100 

CAAGACCTGA TATTATGCGT TTTTTGCTTT TCAAAACTTT TTGCCCAGTC TTCGTTTTTA 8160 

TCCTCTAGTC ACTTGATTTG TTTCAGGTGG TTTTTTAGTA TAGTAGAATG AAACGAGAAC 82 2C 

AGGACAAATT GATCAGGACA GTCAAATCGA TTTCTAACAA TGTTTTAGAA GCAGAAGTGT 3280 

ACT ATT CT AG TTTCAATCTA CTATAGTTAA ATCTCCGCTC AAGTCTACTG GTGAATCTAT 8 340 

GATTGTAATA CTCTTCCAAA ATCTCATCAA CCACGTCAGT CTTGCCTTGC AGTCTGTATC 8400 

TTACTGACCA AGCTAGTGAT GCATTTAGAA TAGGTGATTT GGAGCCTCCT ATTAGCTAGG 84 60 
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AAATCCTCCT 


CATAGTCCTT 


TGCTCAGGCT 


AGGCTCTTTC 


AACATTCAAC 


ACTCAACTGG 


3520 


TTGATCTACT 


TGATAGGAAG 


GGAGTTACTA 


TAAAATACTC 


AGGCTTCCAT 


CATATTTTTT 


3580 


GAAACGATTG 


TGTAATCAAA 


ATGTACCAAT 


ATTGTACTAT 


TGGTACAGAA 


GATGTTGTGA 


3640 


ATGGATAAAT 


ATATCATAAC 


TGCTATCTCA 


AAAAGATTTC 


ATATGTCTGT 


GCATATATAA 


3700 


TAGACTTCCT GCAAAACTAG 


AATCCTAGTT 


CATGATTGAT 


AATACCAGCA 


ATCAAATTCA 


8760 


TTCGTAATCC 


AAAGCCTTTA 


CGATGATTTC 


GATAGGTTGT 


TGAAAACATT 


TTAAACGTTT 


38 20 


CTACTTTGGC 


AAAGATGTTC 


TCAACCTTCC 


TTCTCTCC7T 


AGATAGCGCA 


TGGTTATAGG 


883C 


CTTTATCTTC 


AGC7GTTAGC 


GGCTTGAGTT 


TGCTGGATTT 


ACCTCGACT7 


TGTGCTTGAG 


8940 


GACATATCTT 


CATGAGCCCT 


TGATAACCAC 


7CTCACCCAA 


GATTTTACCA 


GCTTGTCCGA 


9000 


TATTTCTGCA 


ACTCATTTTG 


AACAACTTCA 


TATCATGACA 


ATAGTTCACA 


GTGATATCCA 


9060 


AAGAAACAAT 


TCTCCCTTGA 


CTTGTGACAA 


TCGCTTGAGC 


CTTCATAGCG 


TGAAATTTCT 


9120 


TTTTACCAGA 


ATCATTCCCT 


AATTCTTTTT 


TTAGGGCGAT 


TGATTTTTAC 


TTCCGTCGCA 


9180 


TCAATCATTA 


CCGTGTCCTC 


AGAACTAAGA 


GGAGTTCTTG 


AAATCGTAAC 


ACCACTTTGA 


9 24 0 


ACAAGACTTA 


CTTCAACCCA 


TTGGCTCCCA 


CCGATTAAGT 


TGCTTTCGTG 


AATACCAAAA 


9 300 


TCAGCCGCAA 


TTTCTTCATA 


AGTGCGGTAT 


TCTAGGCTTA 


ATTTAGGTTT 


TCGTCCACCT 


9 360 


TTTGCGTCTT 


TAAGTTGATA 


AGCTGTTTTT 


AATACACCTA 


ACATCTCTTT 


AAAAGTCGTG 


94 20 


CCCTGAACAC 


CAACAAGACG 


CT7AAATCCT 


GTATCACTTA 


ATTGTTTACT 


TGCTTCATAA 


9480 


TTTCGCACGG 


AGTCTATTGA 


CTCTTTGGTA 


GGTCTCAATG 




CTATCCCGAG 


9540 


AATTATTTTC 


CCGCCATTTG 


TATTTGCAAA 


TGCTGAGTAC 


GTTTCCCAGA 


AACACTCTGG 


9600 


AAGATTCTTT 


TTACCTTTTT 


TCTATTCTAA 


ATCAACCCCT . 


TCAAATTTTA 


ACT C CAT ATT 


9660 


TTTCCTTTAC 


ATCTGTTTTT 


TGTGGTTCTG 


CTATTTCTTC 


AAGTTGAGTG 


ATAATATAGC 


9720 


GAATTGAATT 


TCGAGAGTTT 


TTACTCAGTT 


AATTTCTTTT 


TTAACCCACT 


TTAATTGCTT 


9780 


TTTTAACACG 


GGTTAAAAAA 


GAAATTAAAG 


TGGGTTAA7T 


TTTCTTCA 




9823 


(2) INFORMATION FOR SEQ ID NO: 42 











<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3369 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(O) TOPOLOGY : linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
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CCGCGAAAGA TATTTTTCAA CAAGAGTTTG GACGTGAGCT CeeW-?TAT AAT AAAGT AG 60 
AAGTTGACGA GTTTTTACAC GATGTCATCA AGGACTATGA AACCTATGCT CCCTTGGTCA 120 
AGTCACTTCG TCAGGAAATT GCGGATTTGA AGGAAGAATT AACTCGTAAA CCCAAACC7T 180 

CACCAGTTCA AGCAGAACCC CTTGAAGCGG CAATTACAAG TTCTATGACG AATTTTGATA 2 40 
TTTTGAAACG CCTGAATAGA TTGGAAAAAG AAGTTTTTGG TAAACAAATT TTAGATAACT 300 

CAGATTTTTA AGTAGTTATT TGAGATCTCC AATTTTTGGA TAATCGCGTC AGGAGAATTG 360 

TTTCTCATGA GGAAAGTCCA TGCTACCACA CGCTGTCATG CCTGTAGTGT TTGTGCTAGG 420 

CGAAACCATA ACCCTAGCGA CGAGAAATCG TTACGGCAGT TGAAATGGCT AAGTCCTTGG 4 30 

ATAGGCCAGA GTAGGCTTGA AAGTGCCACA GTGACGGAGT CTTTCTGGAA ACAGAGAGAG 540 

TGGAACGCGG TAAACCCCTC AACCTACCAA CCCAAATTTT GGTCGGGGCA TGGAGTACGC 600 
GGAAACGAAC GTAGTATTCT GACTGCTATC AGCTAGAGCT GTTAGTGGTA GACAGATGAT 660 
TATCGAAGGA AGTGGTCCTA GTCACTTCTG GAACAAAACA TCGCTTATAG AAAATTCCAT 7 20 

ATAGGTTGGG CCTCACAAA? TTTCTCAACC TCATTTTTTA AAGTGGACAT ATAGAAAGGT 7 80 

CTTGCAAGAC TGTAACATGA AAAAAGAATT TAATTTAATT GCAACTGTGG CAGCAGGGCT 840 
TGAGGCTGTC GTTGGTCGTG AAGTCCGAGA GTTGGGCTAC GATTGTCAGG TTGAAAATGG 900 
ACGTGTTCGT TTTCAAGGAG ACGTGAGAGC TATTATCGAA ACCAACCTTT GGCTTCGGGC 960 

AGCAGATCGT ATCAAAATTA TCGTAGGAAC GTTCCCAGCT AACACTTTTC AAGAGCTATT 1020 

TCAGGGAGTT TTCGCTTTGG ATTGGGAAAA TTATTTACCA CTTGGAGCTC GGTTCCCGAT 1080 

TTCAAAAGCT AAATGTCTTA ACTCCAAACT TCACAATGAG CCCAGTGTTC AGGCTATTTC 114 0 

TAAGAAAGCT GTTGTCAAGA AAT7CCACAA ACACTATGCT CGCCCAGAAG GGGTTCCTC7 1200 

GATGGAGAAT GGCCCAGACT TTAAGATTCA CGTCTCTATT CTCAAAGATG TGGCAACTC? 12 60 

CATGATTGAT ACGACCGGGT CTAGCCTCTT TAAACGTCGT TATCGTACCG AAAAAGGTGG 132 J 

CGCTCCTATC AAGGAAAATA TGGCAGCAGC CATTTTACAA CTTTCTAACT GGTATCCACA 1380 

CAACCCTTTG ATTGATCCGA CCTGTGGTTC GGGGACTTTC TCTATTCAGG CAGTTATGAT 1440 

TGCTAGAAAG ATGGCGCCAG CTCTTCCTCG CTCTTTTGCA TTTCACGAAT CGAACTGCAT 1500 

CAGCGATCGC TTGATTCAAG AAGTGCGCAC AGAAGCGGCT AAAAAAGTAG ACCCTCACCT 1560 

TGAGCTGGAT ATCATGGGCT GTGATATTGA TGCTCCCATG GTGGAAATTG CTAAGGCCAA 162 0 

TGCTCAGGTA GCTGGTGTTG CAGGAGACAT TACTTTTAAG CAGATGCGCG TGCAGGATTT 1680 

ACCTTCCCA? AAAATCAATG GAGTAATCAT TTCCAATCCG CCTTATCCTG AACGTTTGTC 17 40 

AGATGATGCA GGGGTGACCA ACCTCTATGC TCAGATGGGG CAAGTATTTG CACCGCTGAA 1300 
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AACTTGGAGC AAATTTATCC TGACTAGTGA TGAAGCTTTT GAAAGCAAGT ATGGTAGCCA 1860 

AGCAGATAAG AAGCGTAAGT TATACAACGG AACCTTGAAA GTCCATCTAT ATCAATATTT 1920 

TGGTCAGCGT GTCAAACGGC AAGAGGTAAA ATAGAAAGGG ATACTCATGA GTAAAAAAAG 1980 

ACGAAATCGT CATAAAAAAG AAGGTCAAGA ACCGCAATTT GATTTTGATG AAGCAAAAGA 204 0 

GCTAACACTT GGTCAAGCTA TTCGTAAAAA TGAAGAAGTG GAATCAGGAG TCTTGCCTGA 2100 

GGATTCCATT TTGGACAAGT ATGTTAAGCA ACACAGAGAT GAAATTGACG CGGATAACTT 2160 

7GCCAC7CGT CAATACAAAA AAGAGGAGTT CGT7GAAACT CAGAGTCTCG ATGATTTAAT 2220 

7CAAGAGATG CGTGAGGCTG TAGAGAAGTC AGAACCTTCT TCGGAGGAAG TTCCATCTTC 2280 

TGAAGACATC TTACTACCCT TGCCTCTGGA CGATGAGGAG CAAGGCTTGG ATCCTCTATT 2 340 

GCTAGATGAT GAAAATCCAA CAGAAATGAC TGAAGAAGTG GAAGAGGAGC AAAACCTTTC 24 00 

TCGTCTGGAT CAAGAGGACT CAGAAAAGAA AAGTAAAAAA GGCTTTATTT TGACCGTTTT 2460 

GGCGCTTGTA TCAGTAATTA TTTGTGTCAG TGCTTATTAT GTCTACCGTC AAGTCCCTCG 2 520 

TTCGACTAAG GAAATTGAAA CTTCTCAATC AACTACAGCC AATCAATCGG ATGTGGATGA 2580 

TTTTAATACA CTTTATCACG CCTTTTACAC AGATAGCAAT AAAACGGCTT TGAAAAATAG 264 0 

CCAGTTTGAT AAACTGAGTC AACTCAAGAC TTTACTTGAT AAGCTGGAAG GTACTCGTGA 2700- 

ACATACCCTT GCCAAATCTA AATATGATAG TCTAGCAACG CAAATCAAGG CTATTCAAGA 2760 

TGTCAATGCT CAATTTGAGA AACCAGCTAT TGTGGATGGT GTGTTGCATA CCAATGCCAA 2820^ 

AGCCAAATCC GATCCTAAAT TTACGGATAT TAAAACTCGA AATACGGAGC TTGATAAAGT 2380* 

GCTACATAAG GCTATCAGTC TTGGTAAGAG CCAGCAAACA AGTACTTCTA GCTCAAGTTC 2940 

AAGTCAAACT AGCAGCTCAA GTTCAAGTCA AGCAACTTCA AATACGACTA GTGAGCCAAA 3000 

ACCAAGTAGT TCAAATGAGA CTAGAAGTAG TCGCAGTCAA GTCAATATGC CTCTCTCCAG 3060 

TCCAGGGGTT GCTGTTCAAA GAAGTGCCAG TCGTGTTGCC TATAATCAGT CTGCTATTGA 312 0 

TGATAGTAAT AACTCTGCCT CGGATTTTCC GGATGGTGTC TTGGAACAAA TTCTAGCGAC 3180 

TTCACGTTCA CGTGGCTATA TCACTGGAGA CCAATATATC CTTGAACGTG TCAATATCGT 324 0 

TAACGGCAAT GGTTATTACA ACCTCTACAA GCCAGATGGA ACCTATCTCT TTACCCTTAA 3 300 

CTGTAAGACA CGCTACTTTG TCGGAAATGG CCCTGCTCAT CCGGATGACT TAG ATT ACTA 3 3 60 

AGCAGTCGG 3 3 69 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9713 base pairs 
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(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
AAGTTTACAA TTTAAATGAA TTAACAATTT TCCCAACTAA AAGCACTCCA GTTACCGCAA 60 

CGTTTGTACT GAATGTACTA AATCGCATTC CATCAACTTC ATCTGTTTCG TCAACTTGAA 120 

CAGATACTAA TTGAAGATTT AATACTTCTG CTCCCATAGC TACCTCCTCC TATTTAAATT 130 

TTTGGGATTA AGTACTTTAT CCACCCTCAT ATACTCTCTC CACCAGTAAA ATGCAAGCAA 240 

TGATACAAAA TAGAT7TAAC TATTTTATAT ACCGAAAACT TACAAATTTT TAAGAAATAA 300 

TTTTTGCATT CTTAAAGATA AAATAGGAAC TTTTAGTAAT AAATATTAAA ATAAATAAAA 360 

TAATAGATAC TATAAAATTT GGAAGTATTA ACCCCAAAAG ATTCATATCA TCTATTAAAA 420 

TATCCTCTAA AGAGTAGTAT ATTAAAGCCA TAATTTTAAT GTTAAGTAAA AATGCAATTA 4 80 

ATGAAGTAAC AAATGTCAAA AATATAGCCT CACCAACTTT AATCTTAACC ATCTGGTAAT 540 

TAGAAGTTCC TAAAATTTCA AATTGCTGAA TCTCAATCCT TTCTTGATGC CATGACAAAA 600 

ATGCAATTCA AATAATATTT GCAAGTACTA TCAAAATTGG TGCTCCTACA TAGACAATAA 660 

ATGCTACTTT TAGCTCTAAA TCACTGTCAT CTTGAAATTG AGATAGTATA TTCTGAGAAA 720 

TCATTTGAAA ACTAGAAATT AGTAATATAG CTCCTGTAAT TGCAGCACTG ATAGATTTTA 7 90 

TATAAGACTT ACAATATAGT AAATTCCACT TCGAAACAAT GAACATAAAA TTATTTCTAA 840 

ATATAATTAT AGAAAGTAGT TTGATAAAAC ATGACTGTAT AAAAGGAGAT AATTCATAAA 900 

TAATCACAAT ATCTAAGATT ACAATATTGA ATATTATCTG GGCCTTCGCT AAAATTGTGC 960 

TATCTTGGAA AATTTGTTGC AAAGAAAGCA ACCAGATAAC ACTAAAACCA GCCAATAGCA 1020 

GTATTCTTTT TACTATTGAA AGAACATGCC TTATTTTAGA ACTCTTCCTA TTTCTAATCT 1080 

TCTTGAACGT ATAAAAGCAA CCACTTAGAA AGGCTAAAAA TGAAATCAAC ACTACTGTAA L140 

TGATACATCC AACAGCACTC GTTTGAAATT GGATATCAGG TAATATATTT TCCCCCAAAA 12 CO 

AGTATTGTAA AAAATAATAA TAATTTGACG TAACAAATAT AGAGCATAGA TA7GCAATAA 12 60 

AACTAATAAT CGAGGAAATG ATAAAAATCT GTCCCCCCAC AAGAAATGAT AGTTGAAGGC 1320 

CACTTCCTCC CAACACCTCC AGAAGTTCGT AATCATCTCT AAAAATTTCA ACCAACATAT 13 80 

TTATTATGTT AGAGAGCACA AAGAATAATG TTACTCCTCC GAATACTATC GGAAACATAA 14 40 

AAATTGGTTT AGGATCTGGA AGTCCGACAA ATACTTGCGA ATTATTCTCA ACATTAATTA 1500 

CCCCATTAAC AGCCAATCCC ATAACTAAAC TCGAAACAAA AATTACTGCT GAAACGCCTA 1560 



WO 98/18931 



PCT/US97/19588 



407 



ACCATTCTTT 


CTTATTATGT 


AAAAATTGAT 


AGTAAACTAA 


TCTGAGCATC 


7CTATTCCTC 


1620 


CGTAGTTGAT 


TC7ACC7C7A 


AGATTTTATA 


CAACTCTTCC 


CCGCTAGGTC 


7ATGAAGTTC 


1680 


TTTGAAAATT 


TTTCCATCTT 


TCAATATTAA 


TGCACGATCA 


GTTTTCGAGC 


CCAATTCTAT 


1740 


A7CGTGCG7T 


ACCATAATTA 


CACACTTACC 


CGCCCCTACT 


AACTCTCTCA 


ATAATTCAAA 


1300 


AATTACTTCA 


CGAGAAACGC 


TGTCTAAAGC 


CCCAGTTGGC 


TCATCAGCAA 


ATATTATATC 


1860 


ACTATCAGCA 


ATAACCGCTC 


TAGCTATAGC 


AACCTTCTGT 


TGTTCTCCAC 


CAGACAGAGT 


1920 


TCCAACAAAA 


TCGTTTAAGC 


CAGCATTAAA 


CTTCA7TC77 


TTGAG7AACT 


TT7CTACA7T 


1980 


TTTAATAGTT 


AATTTTTTTT 


G7GATAATCG 


CAAAGGAAGT 


GC7ATA7TTT 


CTAT7ACCGG 


2C40 


CAGGGAAGGT 


ATTAAATTGT 


ATGCTTGAAA 


7A7AAAAGAT 


ACTTCGTTAC 


GTCTTA7ACT 


2100 


TGACAA7T77 


CCA7T7CTCA 


7TTTATAGGC 


GTTGATTCCA 


TTTAAAATTA 


C77CCCCACT 


2160 


7CTTGGT7CA 


AG C AAA C TAG 


AAATACATTT 


TAATAAAGTT 


GACTTTCCAG 


AACCACTAA7 


2220 


TCCTAGAATA 


CTTATAAATT 


CTCCTCTCGA 


AGCAGAAAGA 


GAAACATTTT 


TCAGCACTTG 


2280 


CAACGTTTTA 


TTATTTCCTA 


GTAAAAATTG 


ATGATACAGC 


CC7TTCAC7T 


TTAA7ATATA 


2340 


ATCTTTATCC 


ATATTCTTGC 


CTCCAATCAC 


TTAATTTTGA 


AAAGTGTTCC 


A7TTTCCAAT 


2400 


TTATATATAT 


CAGTGTATCT 


CTTGTCATTT 


AAGTCATAAT 


CATG7GAAAC 


TTCAATAAAT 


2460 


GAAATACCTA 


AATTGAACAG 


AATATCATGT 


ATGGAATTTG 


AATTATCATT 


ATCTAAATTA 


2520 


GCTGATATTT 


CGTCAAATAA 


GTACACTTTA 


TTATTTCTAA 


TCAGACCTCT 


AGCTAAAGC7 


2580 


A7TT7TTG7T 


777CACC7CC 


AGACAAATTA 


CTACCA7TT7 


CACCACATTC 


ATAAT7TAG7 


2640 


ATATCTATCT 


TTTCTAATTC 


TTCATATAGA 


TTTACCTTTT 


TTAACACCTC 


AA7TATCTGA 


2700 


TCA7CTGAAA 


AA7A7TCAT7 


TTGAAATAAA 


GTTACCTTCT 


CACGAATAGT 


AG7G7CAAAA 


2760 


ATATATGGTG 


TCTGATCAAC 


TG7TGGTATT 


GAATCTGAAC 


TCTTTTTCCC 


ATGTGATAAC 


2920 


AAA 1 T * AC AT 


AACCTTTTTG 


TGGCTTTAAA 


GAACCATTAA 


TTAAATTTAA 


AATCGTTGTT 


2880 


TTCCCACTAC 


CAGAAGTTCC 


TCTTAATAAT 


ACCCTAAATG 


GTGACTTAAA 


TGAGAAGTCA 


2940 


ATACTTAATT 


TATTTTCTGG 


TGTAATAGAA 


TATACAACAT 


CTT7CATCTG 


7ATCTCATCT 


3000 


ATTGATGAAG 


TATACAGTCC 


GTTATTATCA 


TGTTCAGCGT 


CTATAAAAT7 


CT7CTC7CCA 


3060 


CTTAAG7AT7 


TTAAAAACGG 


TTTCCTTAAA 


TCTTTGGTTG 


7A7TTA7C7T 


ATTTAATGAA 


312C 


TAGGCAATTG 


ATTGTATCGG 


CCCTAAAACT 


TTATCGTTTG 


CTAAGAAAAT 


ACCTATCAGT 


3180 


TCACTAAAAG 


AAAGGCTTTT 


ATGATAAATT 


ACAAAATAAC 


ATCC7ACAAC 


CAAGGGAACT 


3240 


AGAAAGCAAA 


AACCTGAAAT 


TAGTACTGCA 


ACCAATTTTG 


AAAGAACCTC 


TGATCGTTTC 


3300 
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AAATTAAAAG TAGAATCTTC TAGTTTATCC AACTTTTTAT CCGACAAACT AATTATTTCT 3 360 

TTAGTAACAG AATAAGATTT TAATGTCTTA AAACCATTAA AAATTTCTTT TATTATGTGA 3420 

GTATACTCTG CATTGCTGTT AGACTACTCA TTAGCTGAAT TAG AC AAC AT CTTCTTCATA 3480 

AAGACAGGTA CTATAATCGG CAATGCTGAT AATACAATAA ATATTATTGA nACTAGGAAG 3 540 

TTTAAATAAA GCATAAAACT TAGAGAGACG ATGAACAACA ATATTGAAGA AATTATTTCA 3 600 

AAAATTTGTC TAAAATAGTT TTCTTCGATT AATCTCAAAT CATTTGACAA AACTGAAATA 3 660 

ATAGATGAGT AATCTTTAAC CATTTCAGAA GAAAGATACT GTTCTCTAAA ATATCCTTGT 3 7 20 

TTAATTTTTA CATTTATATC TTTAGTTATT GATGCTTCCG TTACTTCTAA ATAGTAATTT 3 7 80 

GATATATAGA TTGCTGACCA ACCCAGAATA CTTATAGCAC CAAATCTTAG AACGTCAGAA 3 840 

AATGAGGAAG TCTGATTTAA ACTACCTGCA TATACAATAA TTCCTGAGAG CAAGACACCA 3 900 

TTAAACGAAG ATAGAAATAT TAAAATCCCC ATTAATATAA GTTTAGTCT? TTTTATAAAT 3 960 

TTTAAATAAT TCATAAGTTA TTCCTTCCCA CTTCTTCAAA GAAATAATTT AAAGTATCAA 4020 

TCATTAACAG AACATCTGAT GGAGTAAAAC CTCCATCACC AGCTGCTTTG TTTAAATACA 4 080 

ACAAACTTTT AACTCCAATA GAATTTAATT TCTTTGACCA CTCTATCACT TCGTTATTAT 414 0 

TAATATA7GG GTCTTTCTCA CCCAAAATAT TAACTATAAC AGTATTTCAG TCTCGTGCCT 4 200 

TTTCAATATT TTGCATAGGC GAATATGACT TTATATAACC CTTTACTTCA GGGTCTCTAA 4 260 

TATCTCCCCA CTCTGCTATT TCGGTCTTAG AAAGAGGATC ATTTGGATTC TGAAGTGTAT 4 320 

CATAAGGATT TATAAATGGC GAAAATAAGA GAATGCTTTG CAATAAATTT TTTTCCTCGT 4 380 

TCAACACCGC ACCAGCAATT ATTCCACCTC CACTAGAACT TATTAAACCT AATCGCTTAC 4 44 0 

TGTCAATTAC ATCATTTTCC CTTAAATAAT TTACTCCCTC AATAAAATCT CTGATAGAAT 4 500 

TCCATTTCTT TAACCCCTT7 CCTCACCCAT ACCAT7CACC ACCCAAATAG CCTCCACCTC 4 560 

TTACATGAAC TATAGCATAA ATAAAACC7C CATCTATTAT AG AT AAC AT A ATTTCATCTA 4 620 

AATCAGAATT ATCA7TCTTA CCATAAGCCC CATAGACACT TAGAATACAT TTTTTTCT*I*C 4 680 

TTGGGACCTC ATCCGTATCT TCACTTTTCC AAAATAAAGA AATCGGTATG CTTACATCAT 4740 

AACTGTCTTT TTTAGTCCAA ATCACCTTAG AAAAATATTT ACTATTATTC GATTTTATCA 4 800 

TGGGTCTTTC AAATTCAGTT TTTAATGTAT TTTCTATTAA ATCAAAACTA AGTATTTTTT 4 860 

CGTAAAAAGT TCTCCTCTCT AAAAACAGAA G AAC AC GAT C AG AAAATG AA TTTTCATAAA 4 920 

GTGTTGTCTT TTCATCAAAT GTTATCTTAT TAACACTCAA CTCCCTCAAA CTATTATTTT 4 9 80 

TAAATGTAGC AAGATAAAAG ACGGAATTCG CTGCCTTTGA ACAGTCTAAA AGGATATAAC 5040 

GTCCTATACA GTGAACTCTT CTAGCCCTAT CTTGATATGC TATAGTAATA CAAACTCTGT 5100 
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CTCCCGAACA AGTTTCCCTT AGAATTAGTT GATCTTTCTT TTCTTCAGTT GAAGAGAGCC 5160 

CAAGAAAGTA CTGTGCTTTT TCTGTACTAA ATAGAGCGAT ATCTCTAGGT GTTGGGGCTA S22C 

CCGTTTCTGT GTAAGAGTGT CTAACAAAAC CCGTCCGGTC GAAACTGTAT AGAAAAATCC 52 80 

TGCCTTTCTG AAAGTCTACT GACTTTACAA AACAATTATT GCTATCAATG TGGACTATTT 534 0 

TTAATCGAAA AGAGCATTCG TTTTCTTCAA ACAGTTCCTC TTCTGTAAAG CTATCAAAAG 5400 

ATTTATAGAA TAACTTACTT GGCCTCCCGT ACTCTTTGCiT CCGAGTATAC ATAACACCGA 54 60 

ATTTACCCAA ATAGAACCAA CTTTCTACTG AAATATCTTC AATGATAAAT AACTCTTCCA 5520 

TAGTATATTT TTTTATTCCA ATTAAAT7AC TCGTACGCAG TGAGGATACA ACCAAAACTA 5580 

TATAACTCTC ATCAGATGAA ATCCTAACAT CCTGTAAGAT ACTATCATCT GGCAAAGTAT 5640 

ATTTTTCCAC ATCAAAGACA ATTTTAACTG AATTTGAATT GTCTAAACTG GAAGAACTAA 57 00 

CCTTAGGAAT CCAGTCA7TA TCTTCGACAT ACCATTCCTT TATTACACCA GTATTGGGTA 57 60 

TACTCCAATT ATCAAATTGG TACCAATATC GCCCTCTCCT AAATATCAAA GAATTCCATT 5 820 

TTTTTAATTC CTGAAATGAT CAAGAGATAG ACCTCTTATA CTGTGTTTTT TCCTCTATTC 5880 

TATTTAAAAA TATTTCATTA CTCTGATTCA CAAGTATGAC CCCTTAATAA TGGTATCTAA 5 940 

AT ATT AT AT? TGAGGAAGAA TCGTCAATTT ATTATCCATT ATTGATACCA ATCCAATTGC 6000 

AACACCCGCA AATCCCGAAG CAATATCTGT TGTTATCTTT AAACCATTAT CTCCCGCAAT 6060 

AACAAATCCT TCTTCAATTA CACACAAATA TCTATAAAGT TGTTCAATTA ATTTCTTTTG 6120 

7CCTGAAAAG TTATCATCGA TATCACTATA TAT AT TAT T A GCAACTTCAA GACCACAAAA 6130 

TCCCTTAAAT AAACCTGGTA ATACACAAAA AACTACATCA GTTGCCCTCT CTAAAGAAGT 6240 

TAAATATTTT AAGTATTTGC TTGACAAGAT TTCTTTATTT CTATTAATAA GTAAAAGCAG 6 300 

GCCAGCACTT CCAGTTGCTA CATATGGTAG TAATCTATGA CCTTGGCTGT ACTGCAATGA 5 360 

ATT ATT ACTA TCTACTTTAT AAGCAACTAA TTCTTTATCT ACAGCCAATT CTAGACCATT 6420 

TTTATAGATA CTTTCACCAG TTAATTTATA AGCTTCACCG AAGAGCCAAG CTACCCCTGC 64 80 

GTGACCATAT AGTAATCCAC CAAAATTCTC ATAAGGATCG TTACTCTGAA CATC ACT AG C 654 0 

GCCAACTTTA CAAAAAGTTT CTGGATTTTC TATATAATTT AAAGTATATT CTCTAAGCCT 6 600 

AATTAGTATT TCTTCTCCTA GTTTATTATC AATTCCCCCT TTACTAAGAA AATACAGTCC 6660 

AACCAGTAAA ATTCCAGCCT GCCCACTATA TAAATTTTTA TTTTGTGAAT TCTCAAATAT 67 20 

CTCTATAAAA TGAGTTGTAA AAAGTTCAAC TGCCCGATCT A7CTCCCCAA ATTCATAAAT 67 30 

GAGCCAGATT GTACCAATT? TACCATCAAA AAGACCAGAA AGGGACGATT TCTTAAAATT 634 0 
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ATTTACTGCC 


TCATTAATAA 


CCTGTGTTCG 


AATCTCATAA 


TAGTCATCAA 


ACTTGAAATT 


69C0 


TTTTACTTTC 


TTACCTAGTT 


GTTGATAACT 


CCAAAGGATA 


GCTAAATCTG 


AAAACGCAAT 


6960 


TCCTTGATTA 


AAATTCAGAC 


CATAATAATG 


AACTGGGAAG 


AATCTTGATT 


GAAATTCTTT 


7020 


ACGCCACTGT 


CCATAAGTTA 


GCGTAAACCC TCTCAATAAT 


TTTATAATAA 


AATCTTGTAT 


7080 


ATCTTGCTCA 


CTCTCGATAG 


TTCTAATCTC 


ATGCATGGGT 


TTTAAAACTT 


TTTTCCTGGA 


714C 


AATATTCTCA 


ATCTGTGGAC 


ATTTAGAATC 


TAGATATGAC 


AATAAACTTT 


CTACATAATC 


72C0 


TATATGTTCT 


CTTCTATAAC 


CCAAAGACTC 


AAATAGT7TT 


TTTCCTTCTA 


7CCTGGTTTG 


7250 


ACTTACATAG 


TTGTATGTCA 


AATCCGATGT 


AGTTACTAGT 


GGCATGTATA 


AATAATGAGC 


7320 


TATTTGTCTA 


ATACCATACC 


AATCTATCTC 


ACTGGGAAGT 


GTTTCTCGCC 


ATGCTCTAAA 


7380 


ACCAGGGGCT 


GCAACTTTAT 


GTACAACTTT 


TTCATCATTT 


GAAAAGACAG 


CCTGTTCCCA 


7440 


GTCTATTATA 


CTAATCTCAT 


CTTCATCCTT 


AACCAAGATA 


TTTCCTAAAT 


GTAAATCTTG 


7500 


ATGATATACA 


TTTTCAGAAT 


GAAACTTATT 


CGTTAAATCG 


ATGAGTTTTT 


CTACTATCTT 


7560 


TGAAACTC7C 


AATAGATAAT 


CTTTGGTCTT 


ATCAACAACT 


TCATATAAAG 


GAAAATTATT 


7620 


GGTAACCCAT 


CTATTTAGTG 


GAACGCCCTT 


CATATGTTCA 


ATTCCTAAGA 


AGGTGTGCTC 


7680 


CCAGATCTTA 


CCGTGCCAGT 


ATATTTTAGG 


CGTCTCACTC 


CATTCATTTA 


GAATTTTTAG 


7740 


TGCTTTGCAC 


TCCGAAGCTA 


ATTTCTCTGA 


AGAATAAGTA 


CCATCAAATC 


CTAGACCTGT 


7800 


ATACGCTCTA 


GCCTCTTTTA 


AAATTATTTT 


TTTCCCA7CT 


TCTTTTAGCC 


TAGCATTATA 


7860 


TATCCCACCA 


CTCTTTGAAA 


ATCTAATTGC 


ATTATCTATA 


ATAAAGGGAA 


AGTCTCCCTG 


7920 


TTTTTTATCT 


TTCTTCTCAA 


GCCA7T7ATT 


CAAAAAGTCA 


GGGGGCACTA 


TACCTTTTGG 


798C 


AATTTTAAAT 


ACTGGTAAAC 


GTTCATCTTT 


AACAACTTCA 


TCGCCAACAA 


TTAATTCATC 


9040 


AATAGCAACC 


TTCTTTTCA7 


CATCCCTTGA 


CCGCCTAAAC 


ACACCATACC 


TCAGATATAT 


8100 


7CCTGCTTCA 


TCCCAACCTT 


TATCGCTTAA 


AATATATGGC 


CC ATT AT ATT 


GCTTTAACCC 


9L60 


ACTT7CTAAC 


CTTTGCAAAA 


CCGACTCTAA 


TTCATTTTGA 


TTTGGATAAC 


ATGTAATAAA 


8220 


TTTACCAGAA 


AATCCTCGAC 


TAACCAATTT 


CCCGTTTCGC 


ATCATAAA7T 


TGTCTTCTGT 


8280 


ACTAAGATGT 


TTAAATGGAA 


TTCGCATTTC 


ATGGCAAATT 


TTTGCTACAT 


CTTGTAACAA 


8340 


TTCATCTGAA 


CTGTTATACT 


CTGAACTAAT 


GTGTATTTTC 


CACCCTTGTC 


TTTCAACAAA 


8400 


TTTTCCAATA 


GGGTATTGAT 


AAACCCACTC 


ATCATTATTC 


ATTACTTCCT 


GCCAATTAAA 


8460 


AGGCAGACTT 


ACTTGGTACT 


7TATGC7AG? 


ATCTGTACTA 


TAATCATTAT 


TAGTGAAAAA 


3520 


GAAAGGATGC 


TCCAAATTGA 


AATTATAATC 


CATAACAAAA 


TCTCCAAGAA 


ATTTTATCAA 


8580 


ACTTAATATA 


TCTATAGCTA 


GACAGACTTA 


TTTAAATAAA 


AAGGGAGAAT 


CCTTTGGATT 


864C 
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CTCCCCATAT 


AAGCACTAAC 


ATTCCAACGT 


GCACATATTG 


GAACGACATC 


CATAACTCCA 


9700 


CACAATCTCT 


AAAGTTTACA 


ATTTAAATGA 


ATTAACAATT 


7TCCCAACTA 


AAAGCACTCC 


8760 


AGTTACCGCA 


ACGATTTCTA 


CTGAATGTAC 


TAAATCGCAT 


TCCATCAACT 


TCATCTGTTT 


9820 


CGTCAACTTG 


AACAGATACT 


AATTGAAGAT 


TTAATACTTC 


TTCTGCCATA 


GCTAGCTCCT 


8830 


CCTAT7TAAA 


TTTTTGGGAT 


TAAGTACTTT 


ATCCACCCTC 


ATTATACTCT 


CTCCACCAGT 


8940 


AAAATGCAAG 


CAATTATACA 


ATGTTGTCAC 


atagaaaata" 


ATGTTTCCGT 


"AACTTTTCAA " 


9000 


AGTAACTTCC 


ATCTCTCTCC 


CAAAACTGGA 


AGTTAGTTTT 


AGAAGTTACC 


TAAAAATCAG 


9060 


GTCACC7ATT 


TTAAAAAAGC 


AGCAAACTAT 


AAACTAGTAG 


G7TCCACACC 


AAATGTAGTC 


9120 


CCATACTCCC 


CCATAAGTCA 


GATTTATAGC 


GCACCATACC 


TAAAAACATC 


CCAAGTGAAA 


9130 


CATACAAACA 


CCAAGCTAGA 


ATGGTTCCTG 


TATGATGTGC 


TAAGGCAAAT 


AAAACACTTC 


9240 


1 LAAALK-AAL. 


I UlViA 1AICI 






TAAAATTTCT 


CGATACAGAA 


9300 


ATTCTTCAAC 


CATACTCGCA 


TTGATTAAGA 


ACAATAAAAA 


TGAAAACCAA 


GGAATTTGAT 


9360 


GTTGAAGGCC 


AATTAAGTTT 


GCTTGATTCG 


TGCTTCCTTG 


AGCATGAATC 


AGACTAAAAC 


9420 


ATAGACTTAT 


AATCAGTAGG 


CTAACAAATT 


CAACACCAAG 


CCATTTCATC 


CTAGATTTCA 


9480 


TATTGACCTT 


ATCCGCTTGT 


TTGCGTTGGC 


CATACATCCA 


TAAAAAAGAA 


ATGAGTGACC 


9540 


AACCATAGAG 


AATCTGTAGT 


ATACTTmACT 


CACCGATACA 


AAGAAATTTC 


AATAAGTATA 


9600 


GAGrTACCAA 


TAsGACATTT 


ACTTGTTGGA 


ATATATAAAC 


TCGAATTATT 


CTTTTCATAG 


9660 


TTACCTCCGA 


AATAAATCTT 


CATAATCTAA 


ATCTAATACC 


TCCACAATCC 




9713 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS : 

<A> LENGTH: 8657 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNES5 : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



AAAGAAATTG 


TCAGAGAGTG 


GCTAGATGAA 


GTAGCAGAGC 


GGGCTAAGGA 


CTATCCAGAG 


60 


TGGCTGGATG 


TTTTCCAGCG 


TTGCTACACC 


CATACCTTGC 


ACAATACGGT 


TGAAATCTTA 


120 


GAAGATGGTT 


CAACTTTTCT 


CTTGACTGGG 


GATATTCCTC 


CCATGTGGCT 


TCGAGATTCG 


130 


ACAGCCCAAC 


TCAGACCCTA 


CCTTCATGTA 


GCTAAAAGAG 


ATGCCCTCCT 


GCGTCAGACC 


240 


ATTGCAGGTT 


TGGTCAAACG 


7CACATCACC 


TTGGTACTCA 


AGGATCCCTA 


TGCTAACTCC 


300 
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TTCAACATTG 


AGGAGAACTG 


GAAAGGGCAC 


CACGAGACTG 


ACCACACAGA 


CCTTAACGGC 


360 


TGGATCTCGG 


AGCGCAACTA 


TGAGGTGGAT 


TCGCTTTCCT 


ATCCTTTGCA 


GTTGCCTTAT 


420 


CTCCTCTGGA 


AAGAGACTGG 


CGAGACTAGT 


CAGTTTGATG 


AGATTTTTGT 


CGCAGCGACT 


480 


AAGGAAATTC 


TCCATCTGTG 


GACGGTGGAA 


CAAGACCACA 


AGAACTCTCC 


TTATCGTTTT 


540 


GTCCGAGATA 


CGGACCGTAA 


GGAAGACACC 


TTGGTAAATG 


ATGCCTTTGG 


ACCTGACTTT 


600 


GCAGTGACAG 


GTATGACTTG 


GTCAGCTTTT 


CGTCCGAGTG 


ATGACTGTTG 


CCAGTATAGT 


660 


TACTTGATTC 


CGTCAAATAT 


GTTTGCTGTA 


G7AGTCTTGG 


GTTATGTGCA 


AGAAATCTTC 


720 


GCAGCATTAA 


ACCTAGCTGA 


TAGCCAGAGT 


GTTATTGCTG 


ATGCCAAGCG 


TCTTCAGGAT 


780 


GAAA7CCAAG 


AAGGAATCAA 


AAACTACGCT 


TACACCACCA 


ACAGCAAGGG 


CGAAAAGATT 


340 


TACGCTTTTG 


AAGTGGATGG 


CCTAGGAAAT 


GCCAGCATCA 


TGGATGATCC 


AAATGTACCA 


900 


AGTCTACTAG 


CTGCGCCCTA 


TCTGGGCTAC 


TGTTCGGTCG 


ATGATCAACT 


GTATCAAGCT 


960 


ACTCGTCGTA 


CCATTTTGAG 


CTCTGAAAAT 


CCATACTTCT 


ACCAAGGAGA 


ATACGCAAGC 


L020 


GCTCTCGGCA 


GTTCTCATAC 


CTTCTATCGC 


TATATCTGGC 


CAATCCCCCT 


TTCTATCCAA 


1080 


GGCTTGACAA 


CAAGAGATAA 


GGCAGAGAAA 


AAATTCTTCC 


TCGATCAGCT 


GGTTGCCTGC 


1140 


GATGGTGGTA 


CAGGTGTCAT 


GCACGAAAGC 


TTTCATGTAG 


ATGATCCGAC 


CCTCTACTCT 


1200 


CGTGAATGGT 


TCTCCTGGGC 


TAACATGATG 


TTCTGTGAGT 


TGGTCTTGGA 


TTACTTGGAT 


1260 


ATTCGCTAAG 


GGGCTCGCTT 


TAGCTCAACC 


GATTCTTATC 


AGAATCACAA 


GTTTACATTT 


1320 


AAAACGTTAA 


AATTTAAATT 


TAGAATGAGG 


TTTTACTTCA 


TGGAAAATCT 


TGTTGTACA? 


1330 


ATTATCTCAC 


ATAG7CACTG 


GGATCGTGAG 


TGGTACTTGC 


CTTTTGAAAG 


CCATCGTATG 


1440 


CACTTGCTCG 


AATTGTTTGA 


CAATCTCTTT 


GATCTCTTTG 


AAAATGACCC 


TGAGTTCAAG 


1500 


AGTTTCCACT 


TGGATGGACA 


AACTATTGTC 


CTTGATGACT 


ACTTACAAAT 


TCGCCCTGAA 


1S60 


AATCGCGACA 


AGGTCCAACG 


CTACATTGAC 


GAGGGCAAAC 


TTAAAATTGG 


TCCCTTTTAC 


1620 


ATCTTGCAGG 


ATGACTACTT 


GATCTCCAGT 


GAAGCCAATG 


TCCGCAATAC 


CTTGATTGC? 


1680 


CAACAAGAAG 


CTGCCAAATG 


GGGTAAATCA 


ACCCAGATTG 


CCTACTTTCC 


ACATACCTTT 


:740 


GGAAATATGG 


GACAAGCGCC 


TCAAATTCTT 


CAAAAATCAG 


GCATTCACGT 


GGCGGCCTTT 


1800 


GGTCGTGGTG 


TGAAGCCGAT 


TGGATTTGAC 


AACCAAGTCC 


TTGAAGATGA 


GCAGTTTACG 


i860 


TCTCAGTTTT 


CAGAAATGTA 


CTCGCACGCT 


GTGGATGGTA 


GTCGTGTTTT 


AGGTATTCTC 


1920 


TTTGCCAACT 


GGTACAGTAA 


CGGGAATGAA 


A7TCCACTTC 


ACAAAGATGA 


GGCCTTGACC 


1980 


TTCTGGAAAC 


AAAAATTGTC 


AGATGTGCGT 


GCCTACGCTT 


CGACCAACCA 


ATGGTTGATG 


2040 


ATGAACGGCT 


GTGACCACCA 


GCCTGTACAG 


AAAAATCTGA 


CCGAAGCCAT 


TCCTCTCCCA 


2100 
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AATCAACTCT 


TCCCGGATGT 


AATCTTTGTT 


CATAGTTCT7 


TTGATCAATA 


TGTTCAAGCT 


2 150 


GTAGAAGGTG 


CGCTTCCTGA 


ACACTTATCA 


ACTGTTACAG 


GCGAGTTGAC 


CAGTCAGGAA 


222C 


ACAGATGGCT 


CG7ACACACT 


TGCCAACACT 


TCTTCATCCC 


GCATTTACCT 


AAAACAAGCC 


2280 


TTCCAAGAAA 


ATACCAACCT 


CCTAGAGCAA 


GTCGTAGAAC 


CCTTGACTAT 


TATCACTGGT 


2340 


GGACACAACC 


ACAAGGACCA 


GTTGACCTAT 


GCTTGGAAAA 


CACT7TTGCA 


GAATGCGCCA 


24C0 


CATGATAGTA 


TCTGTGGCTG 


TAGCGTGGAC 


GAAGTTCACC 


G CG AG ATGG A 


AACGCCT^TT 


2 4 60 


GCCAAGGTCA 


ACCAAGTAGG 


AAACTTTGTT 


AAAAGTAAC7 


7GC7CAACGA 


GTGCAAGGGT 


2 52 0 


AAAATTGCTA 


CGGATAAGGC 


TCAAAGTGAC 


TATCTCTTTA 


CTG7CATTAA 


CACAGGCTTG 


2530 


CATGATAACG 


TCGATACTGT 


CAGCACAGTG 


ATTGATGTGG 


CGACTTGTGA 


Tf^CAAGGAA 


2640 


TTGCACCCAA 


CAGAAGGCTA 


CAAAAAGATG 


GCTGCTCTTA 


TCTTGCCAAG 




2 7 00 


GAGGACTTGG 


ATGGTCGTCC 


TGTAGAGGCT 


ACAATCGAAG 


ACCTCGGAGC 


* 4 4 h 4 VJAU 


2 7 60 


TATAATTTAC 


CAAAAG AC AA 


GTTCCGCCAA 


GCTCGTATTG 


CTCGTCAAC7 




2 920 


ATTCCAGTTC 


ACCTAGCGCC 


GCTTTCTTGG 


ACAACCTTCC 


AAT7GCTGGA 


AGGAAAACAA 


2 830 


GAACACCGTG 


ACGGTATTTA 


CCAAAACGGA 


GTGATTGATA 


CACCATTCGT 


AACGG7GAGT 


2940 


G7CGATGACA 


ACATCACAGT 


CTATGACAAG 


ACAACTCACG 


AAGCCTA7GA 


AGACTTTATC 


3000 


CGCTTTGAAG 


ACCGTGGGGA 


CATCGGAAAC 


GAGTATATCT 


ATTTCCAACC 


AAAACG AAC A 


3060 


GAGCCAATCT 


TTGCAGAGCT 


TAAGGGCCAC 


GACG7CTTGG 


AAAACACAGC 


TTGC7ATGCT 


3120" 


AAAATCTTGC 


TCAAACATGA 


ATTGACCGTG 


CCTGTCAGTG 


CCGATGAAAA 


GCTAGAAGAA 


3180 


GAGCAACAAG 


G7ATCATCGA 


GTTTA7GAAG 


CGTGAGGCTG 


GACGGTCAGA 


AGAATTGACA 


3240 


AACATTCCTC 


TCCAAACTGA 


GTTGACTGTC 


77CCTTGACA 


ATCCAC.^V-.T 


CCCC77CAAG 


3 300 


ACTCGCTTTA 


CTAACACTGC 


CAAGGATCAC 


CGTATCCG7C 


TCTTGGTCAA 


GACTCA7AAC 


3360 


ACCCG7CCAA 


GCAATCATTC 


TGAAAGTATC 


TATGAGGTGG 


TGACACGACC 


AAACAAACCA 


342C 


GCTGCTTCAT 


GGGAAAACCC 


TGAAAATCCT 


CAACACCAAC 


AAGC7TTTCT 


CAGTCTGTAT 


3480 


GACGATGAAA 


AAGGGGTGAC 


TGTATCCAAC 


AAGGGATTGA 


ATGAATACGA 


AATCC7TGGG 


3540 


GATAACACCA 


TTGCCGTGAC 


CATTTTCCGT 


GCATCAGGTG 


AGCTAGGTGA 


C7GGGGCTAC 


3600 


TTCCCAACGC 


CAGAAGCACA 


ATGCTTGCGG 


GAGTTTGAAG 


7CGAGTTTGC 


ACT7GAATGC 


3660 


CACCAAGCCC 


AAGAACGCTT 


CTCAGCC7AT 


CGTCGTGCCA 


AAGCC7TGCA 


GACACCGT77 


3720 


ACCAGCCTTC 


AGCTTGCTAG 


ACAGGAAGGA 


AGCGTGGT7G 


CGACTGGTAG 


CCTCTTGAGC 


373C 


CATTCTGTTC 


TCACCA7ACC 


GCAAGTTTGT 


CCAACAGCCT 


T7AAGGTAGC 


7GAAAA7GAA 


384C 



WO 98/18931 



PCT/US97/19588 



414 

GAAGGCTATC TGCTTCGTTA CTACAATATG TGTAGTCAAA ATGTACGTGT GCCAGAAAGT 3 900 

CAACATCTCT TCCTTGACCT ACTTGAACGA CCATACCCAG TTCATTCAGG ACTATTGGCT 3960 

CCACAAGAGA TTCGTACAGA ATTCATCAAA AAAGAAGAAA TTTAATTTCA AAAAGTAAAC 4020 

ATCAAAAGAA AGGAGGGGCG AAAAAGTAAG AACTAACTGC TGATTCCCCC CTTTT ATGGT 4080 

AAAAACAATG ACCATTGCAA CGATTGATAT CGGAGGGACT GGGATTAAGT TTGCCAGTCT 4140 

GACTCCTGAT GGCAAAATAC TGGATAAGAC AAGTATTTCA ACGCCTGAAA ACTTGGAGGA 42 00 

TTTACTAGCC TGGCTAGATC AACGCTTGTC AGAACAGGAT TACAGTGGGA TTGCTATGAG 42 60 

CGTTCCAGGT GCAGTCAATC AAGAGACAGG TGTGATTGAT GGCTTCAGTG CGGTGCCCTA 4 3 20 

CATCCATGGC TTTTCTTGGT ATGAGGCGCT TAGCTCTTAT CAGCTACCTG TCCATTTAGA 43 80 

AAATGATGCC AACTGCGTTG GACTCAGTGA ACT ACT AG CT CATCCAGAGC TTGAAAATGC 4440 

AGCCTGTGTC GTGATTGGGA CAGGGATTGG CGGAGCCATG ATTATCAATG GTAGACTTCA 4500 

TCGAGGTCGC CACGGTCTGG GTGGAGAATT TGGCTACATG ACAACCCTTG CCCCTGCTGA 4560 

AAAACTTAAT AACTGGTCGC AACTAGCATC AACTGGGAAT ATGGT A CG AT ACGTGATTGA 4620 

AAAATCTGGT CATACTGATT GGGACGGTCG CAAGATTTAC CAAGAGCCCG CACCTGCTAA 4680 

TATCCTTTGT CAAGAAGCCA TTGAGCGCAT GAACCCCAAT CTCCCCCAAG GCTTGCTCAA 4740 

TATCCAGTAT CTGATCGATC CAGGTGTCAT CAGTCTCGGT GGCTCTATCA GTCAAAATCC 4800 

AGATTTTATC CAAGCTGTCA AGAAGGCTGT TGAAGACTTT GTCGATGCCT ACGAACAATA 4860 

CACGGTCGCA CCAGTTATCC ACGCCTGCAC CTATCACGCA GATGCCAATC TCTACGGTGC 4920 

TCTTCTCAAC TGGCTACAGG AGGAAAAGCA ATGGT AAGAT TTACAGCACT TAGTCTCAAA 4980 

CAAACGCAAG CTATTGAGGT TTTAAAAGGT CACATTTCTC TACCAGATCT GGAACTGGCT 5040 

GTCACTCAGT CTGACCAAGC ATCTATCTCT ATCGAGGGTG AGGAAGGTCA CTATCAATTG 5100 

ACCTACCGCA AACCTCACCA ACTTTATCGT GCCTTGTCCT TCTTGGTAAC AGTTCTAGCA 5160 

GAAGCTGATA AACTAGAGAT TGAGGAACAA GCAGCTTACG AAGATTTGGC TTACATGGTT 5220 

GACTGTTCTC GAAATGCGGT GCTGAATGTG GCTTCTGCCA AGCAGATCAT TGAGATATTG 52 80 

GCTCTCATGG GCTACTCAAC CTTTGAGC7T TACATGGAAG ACACTTACCA GATTGAAGGG 53 40 

CAGCCTTACT TTGGCTATTT CCGTGGAGCT TATTCAGCAG AGGAGTTCCA GGAAATCGAA 54 00 

GCCTATGCCC AACAGTTTGA CGTGACCTTT GTACCATGCA TCCAGACCTT GGCCCACTTG 54 60 

TCGGCCTTTG TCAAATGGGG TGTCAAGGAA GTGCAGGAGC TCCGTGATGT AGAGGACATT 5520 

CTTCTCATTG GCGAAGAAAA CCTTTATGAC TTGATTGATG GCATGTTTGC CACGTTCTCT 5 580 

AAACTGAAGA CTCGCAAGGT CAATATCGGG ATGGACGAAG CCCACTTGGT TCGTTTGGGA 5640 
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CGCTACCTGA 7TC7GAACGG TGTTGTGGA7 CGTAGTCTCC TCATGTGCCA ACAC7TCCAC 5700 

CGCGTGCTGG ATATTGCTGA CAAATATGGT TTCCACTCCC AGATCTCGAC TCATATC77C 57 60 

TTCAAACTCA TGTCAGCGGA TGGCCAGTAC GACCGTGATG TGGAAATTCC AGACGAAACT 5820 

CGTGTCTACC TAGACCG7CT CAAAGACCGT GTGACTCTGG TTTACTGGGA TTATTA7CAG 5880 

GATAGCGAGG AAAAATACAA CCGTAATTTC CGCAATCATC ACAAGATTAG CCATCACCT7 5940 

GCATTTGCAG CGGGAGCT7C GAAGTGGATT GGCTTTACAC C7CACAACCA TTTTAGCCGT 6000 

CTAGTGGCTA TCGAGGCTAA TAAAGCCTGC CGTGCCAATC AGATTAAAGA AGTCATCCTA 6060 

ACGCCTTCGG GAGACAATGG TGGTGAAACT GCCCAGTTCT CTATCCTACC AAGCTTGCAA 612 0 

ATCTGGGCAG AACTCAGCTA TCGCAATGAC CTAGATGGTT TGTCTGCGCA CTTCAAGACC 6180 

AATACTGCTC TAACGGTTGA GGATTTTATG CAGATTGACC TTCCCAACCT CTTACCAGAC 6240 

CTACCAGGCA ATCTCAGCGG TATCAATCCC AACCGCTATG TT7TTTATCA GGATATTCTT 6300 

TGTCCGATTC TTGATCAACA CATGACACCT GAACAGGACA AACCGCACTT CCCTCAGGCT 6 360 

GCTGAGACGC TTCCTAACAT TAAAGAAAAA GCTGGAAACT ATGCCTATCT CTTTGAAACT 6420 

CAGGCCCAGT TGAATGCTAT TTTAAGTAGC AAAGTAGATG TGGGACGACG CATTCGTCAG 64 80 

CCCTACCAAC CGGATGATAA ACAAAGTTTA CAACAAATCG CCAGACAAGA ATTACCAGAA 6540 

CTTAGAAGCC AAATTGAACA CTTCCATGCC CTC7TTACCC ACCAATGCC7 GAAAGAAAAC 6600 

AAGGTC7TTC CTTTGGATAC AGTTGACATC CGTATGGGCG GAC7C7TCCA ACGCATCAAA 6660 

CGAGCACAAA GCCG7ATCCA GGTTTATCTC GCTCC7CACC TTGACCGCAT CGACCAGCTG 6720 

GAAGTTGAAA TCCTACCATT TACTGACTTC TACGCAGACA AGGATTTCGC AGCAACTACA 67 80 

CCC/v\CCAGT GCCATACTAT TCCGACAGCG TCCACGATTT ATACGACTTA A7ATTCTTCG 6340 

AAAATCTCTT CAAACCACGT CAGCTTCCAT CTCCAACCTC AAAACACTCT TTTGAGCAAC 6900 

CTGCAGCTAG CTTCCTACTT TGCTCTTTGA TTTTCATTGA GTATAAAAAC AAGAACACCT 6 960 

7GCTTGCCCC AGGGTGTTTC GCGTGAAACA GAACAATTAT CTGGTTTCAA ATGCTACAGT 7 020 

TAGACAAACT TATGATAAAA TAGCAGAAAG TGAATGTTTC CTAAGAGCAA TTGGAGGTAT 7 080 

TATGCTACAC TTAAAATTAG TAAAACAAGA AATAGAAGCT GAAAAGCCAG CATCTGTAGA 7140 

AGCTTGGATC ATTTCCGTCA AATTTAAAAA AGGTTGCTAC CGACATATAT AGATTCCAAA 7 200 

AACAAAAACG TTAGCGGAAC TAGCAGATGT GATTTTATGG ACTTTTGATT TTGCAAATGA 7260 

TCATGCTCAC GCATTTTTCA TGGATAATGT TGAGTGGAG? CATGCAGATT CTTACTTTCG 7 3 20 

TAGCTTTGTT AGTGACGATG TTGAAGAACG TTACACAGAA AATG7CTA7C 7GGA7AGCC7 7 3 3C 
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AAGTGTCAAA CAAAAATTTA AGTTTATTTT CCACTTCCG7 GWPG^.TGCC CTTTTGAATG 
CCAAGTGCTG AGAGAAATCG AGACAGAGGA CGAAGAAGCT TATCTCGTAC GTTCGGTTGG 
AACGTCGCCA CAACAATATC CAGATTATGA TGGTTTTGAC TATGAAGAAT GCTAAAATTG 
AAATCAGTCT GTGTACGC7T AGTATTTCAA TAGACTTCCT GCAAAACTAG AATCCTAGTT 
CATGATTGAT AATACCAGCA ATCAAATTCA TTCGTAATCC GAAGCGTTTA CGATGATTTC 
GATAGGTTGT TGAAAACATT TTAAACCTTT TTACTTTGGC AAAGATGTTC TCAACCTTGC 
TTCTCTCCTT ACATAGCGCA TGGTTATAGG CTTTATCTTC AGCTGTTAGT GCCTTGACTT 
TGCTGGATTT ACGTGAAGTT TGTGCTTGAG GACATATCTT CATGACCCCT TGATAACCAC 
TGTCAGCCAA GATTTTACCA GCTTGTCCGA TATTTCTGCA ACTCATTTTG AACAACTTCA 
TATCATGACA A7AGTTCACA GTGATATCCA AAGAAACAAT TCTCCCTTGA CTTGTGACAA 
TCGCTTGAGC CTTCATACCG TGAAATTTCT TTTTACCAGA ATCATTCGCT AATTCTTTTT 
TTACCGCGAT TGATTTTTAC TTCCGTCGCA TCAATCATTA CCGTGTCCTC AGAACTAAGA 
GGACTTCTTG AAATCGTAAC ACCACTTTGA ACAAGAGTTA CTTCAACCCA TTGCCTCCGA 
CGGATTAAGT TGCTTTCGTG AATACCAAAA TCAGCCGCAA TTTCTTCATA AGTGCGGTAT 
TCTAGGCTTA ATTTAGGTTT TCGTCCACCT TTTGCGTGTT TAAGTTGATA AGCTCTTTT T 
AATACAGCTA ACATC7CTTT AAAAGTCGTG CGCTGAACAC CAACAAGACG CTTAAATCGT 
CTATCAGTTA ATTGTTTACT TGCTTCATAA TTTCGCAGGG AGTCTATTGA CTCTTTGCTA 
GGTGTCAATG TTTTTTTCAT CTATCCCGAG AATTATTTTC CCGCCATTTC TATTTGCAAA 
TGCTGAGTAG GTTTCCCAGA AAGACTCTGG AACATTC7TT TTAGCTTTTT TGTATTCTAA 
ATCAACCCCT TCAAATTTTA ACTCCATATT TTTCCTTTAC A7CTGTTTTT TGTGGTTCTC 
CTATTTCTTC AAGTTCACTG ATAATATAGC GAATTGAATT TCGAGAGTTT TTACTCAGTT 
AATTTCTTTT TTAACCC 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 11384 base pairs 
(3) TYPE: nucleic acid 
(C) STRAKDEDNESS : double 
(01 TOPOLOGY: linear 



7440 

7500 

7560 

7620 

7680 

7740 

7800 

7960 

792C 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

3520 

3580 

8640 

3657 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
TCTATTTTGG GTATAGACTT ACCTATAAAG AAAAATATCT ATACACTGCC TTACTAGCTA 60 
TACTGAACGA GTCAACAAAA ACGATATATA TTGATGATAT AAATACAGCA AGATTTTTTA 120 



WO 98/18931 



PCT/US97/19588 



417 



ACTTCTTTGG 


CAATCA' T, AT' r 


CCTAATTCGT 


CTTTAAAAAA 


AATTGACTAT 


ATCGCACCTT 


iao 


C AC A AATTGT 


* TWf, CA' T *TTAGT 


ACGTACGTTC 


GACAACGTTC 


TAAAGTAATT 


CCTAAAATTT 


240 


Try a a r* at AT 


AT^AAA ATfA 


A CTTT TTT AT 


TAGAGAATAT 


AGATGTTTCT 


GGTTACACTG 


300 


lAAA I A I 1 1 1 


AVjnAun l \. AA 


TTAACAAAAC 


ATAGAACAAT 

r\ 1 AUAA^iAA k 


CAAAATTAGT 


AAAAACTAAC 


360 


1 1 L unlv 1 


CATCTATAAA 


T ACCT AAC AA 


AACCACGCGC 


CTTGCCTGCT 


GATCGAAAGA 


420 


AAG GT AC AAA 


* AG A rGAA I A 


^CAAACAAAA 


AATAAAAAG 


AATGCCCAAA 


G AGTTT ATT A 


480 


I uL i AG i G t I 


Inttl nwwLy 


m * t *G A C C AA CT 


AACCGGCAAA 


AAAGCCCGTA 


CAACTGTTAC 


540 




/vWwuww^U 


T rn AAAGT AAA 


AGCGCGTGAT 


GCGATCAATA 


CTTTTGCTGC 


600 


1 AA1 GGC r A t 


AG AG 1 1 AAAVj 


ACAACCCGAC 


AATTACAACA 


TATAATGAGC 


TTGTAAAAGT 


660 


TTGGTGGGAT 


AGT TAG AAG A 


AT AfACTTAA 


GC C AAA T ACT 


CGCCAATCCA 


TGGAGGGATT 


720 






CTGT ATTT GG 


CG ATT AC AAG 


CTATCTAAAC 


TTACTACGCC 


780 


TATTCTTCAA 


v» AGCAAO 1 AA 


A C AAA TGGG C 


TGACAAGGCA 


AATAAAGGCG 


AAAAAGGGGC 


840 


atttgctaac 


lAGTCa I ; GC 


TCCATAACAT 


G A AT AAGC CT 


ATTTTGAAAT 


ATGGCGTAGC 


900 


I A I CCAGG I A 


A 1 ALnnl nLn 


ACCCAGCTAA 


TGATGTCATC 


GTTCCACGCA 


AACAGCAAAA 


960 


ft /"* ft ft ft ft ff^i" 'T* 

AG AAAAGGC 1 


GCTGTC AAA I 


ACTTAC acaa 


CAAAGAATTA 


AAA C AGTTT C 


TTGATTATTT 


1020 


AGATGCTCTG 


m rp/i ft ft ft ft 

GATCAA * G AA 


ATTATGAGAA 




GTTGTTCTGT 


ATAAGACTTT 


1080 


a r:\aGC gag I 


GG I IGCGvjIA 


t***agtgacgc 


TCTGGCTCTT 


GAATGGTCTG 


ATATTGACCT 


1140 


agaaagcgg * 


G i * A * GAGG A 


TC A AT A AG AC 
I w AA 1 AAurtL 


AC**AAACCCC 

AV, « nAAVjV.UV> 


TATCAGGAAA 


TAAACTCACC 


120C 


4 AAATCAAGC 


GCTGGTTATC 


CTGATATACC 


AATAGACAAA 


GCCACATTAC 


TTTTACTGAA 


1260 




aaccgtcaac 


AAATTCAGTC 


TTGGAAATTA 


GGCCGATCTG 


AAACAGTTGT 


1320 


ATT CT CTGT A 


TTTACCCACA 
111 AV«wwnwn 


AATATGCTTA 


TGCTTGTAAC 


TTACGCAAAC 


CCCTAAATAA 


1380 


GCATTTTGAT 


GCTGCTGGAG 


TAACTAACGT 


ATCATTTCAT 


GGTTTCCGCC 


ATACACATAC 


1440 


TACTATGATG 


CTCTATCCTC 


AGGTTAGCCC 


GAAAGATGTT 


C AGT AT AG AT 


TAGGCCACTC 


1S00 


TAATTTAATG 


ATCACTGAAA 


ATACTTACTG 


GC AT ACT AAC 


CAAGAGAATG 


CAAAAAAAGC 


1560 


CGTCTCAAAT 


TATGAAACAG 


CTATCAACAA 


TTTATAAAAA 


ATAAGGGTGA 


CCCATTTCCG 


1620 


GGCTACCCTC 


TTACTATACC 


AAAAATTAGT 


AGGGGTAGTA 


AAAAGGGTAT 


TAAATTATAA 


1680 


AAAGCACTAA 


GGGAAAGCGC 


CCCAAAGTGC 


TT ATTT C AAA 


GGCTTTATAG 


CCTATAATCA 


1740 


CATAAAGAGA 


TTATTTTTTA 


AGGTTGTAGA 


ATGATTTCAA 


TCCACGATAT 


TCAGCTACTT 


1300 


CACCAAGTTG 


GTCTTCCATA 


CGAAGCAATT 


GGTTCTATTT 


AGCGATGCGG 


TCTGTACGTG 


1860 
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AAAGTGAACC AGTCTTGATT TGTCCTGCGT TAGTTCCAAC TGCAATATCA GCGATTGTTG 1920 

AATCTTCAGT TTCACCTGAA CGGTGTGATA CAACAGCACT GTAACCAGCT TCTTTAGCCA 1980 

TTTCGATAGC TTCAAAAGTT TCAGTAAGAG TACCGATTTG GTTAACTTTG ATAAGGATTG 2040 

AGTTAGCAGC ACCTTCTTGG ATACCACGTG CAAGGTAGTC AGTGTTT G TT ACGAAGAAGT 2100 

CGTCACCAAC AAGTTGTACT TTCTTACCAA GACGTTCAGT AAGAGCTTTC CAACCATCCC 2160 

AGTCGTTTTC ATCCATACCA TCTTCAATAG TGATGATTGG GTATTTGTTA ACCAATTCTT 2220 

CAAGGTAGTC GATTTGTTCT GCAGATGTAC GAACAGCAGC ACCTTCACCT TCAAATTTAG 22 90 

TGTAGTCGTA AACTTTACGT TCTTTATCGT ACAATTCTGA TGAAGCACAG TCAAATCCCA 23 40 

TAAATACGTC TTTACCTGGT ACATATCCAG CAGCTTCAAT CGCAGCAAGG ATAGTTTCAA 2400 

CACCATCTTC AGTTCCTTCG AAACGAGGAG CGAATCCACC TTCGTCACCT ACGGCAGTTT 24 60 

CCAAACCACG TGATTTAAGG ATTTTCTTAA GAGCCTGGAA GATTTCAGCA CCGTAACGAA 2520 

GGGCTTCTTT AAATGTTGGC GCACCAACTG GCAAGATCAT GAACTCTTGG AAAGCGATTG 2 580 

GAGCGTCAGA GTGAGAACCA CCCTTGATCA TGTTCATCAT TGGAGTTGGA AGAACTTTAG 2 640 

TGTTGAATCC ACCAAGATAG CTGTAAAGTG GGATTTCAAG GTAGTCAGCA GCAGCACGAG 2700 

CTACAGCGAT AGACACACCG AGGATTGCAT TCGCACCCAA TTTACCTTTG TTAGGAGTAC 27 60 

CGTCAAGTGC GATCATAGCA CGGTCAATAG CTTGTTGATC ACGTACATCG TAGCCAATGA 2820 

TAGCTTCAGC AATGATGTTG TTTACGTTGT CAACAGCTTT TTGTGTACCA AGACCACCCT 2880 

AACGAGATTT GTCACCGTCG CGAAGTTCAA CTGCTTCGTG TTCACCAGTA GAAGCTCCTG 294 0 

ATGGAACCAT ACCACGTCCG AAAGCACCTG ATTCAGTGTA AACTTCTACT TCAACTGTTG 3000 

GGTTACCGCG TGAGTCTAGG ACTTCGCGAG CGTAAACATC AGTAATAATT GACATTTTTT 3060 

ACTCTCCTTA TCAGTTAAAT TTTTTACACC TCTATAATAC CTTAAAACCC CTCCTTTTTC 3120 

AAGAAAAAAC GTTATCTTTG TGCAACTTTT CCTTAACTTT ATAAAGTAAT CGCTTTCTTT 3180 

TCTCTGTTTT ATTCTAACTT TTATGATATA CTGTTTTCAT GACAGATTTA TCAAAACAAT 3 2 40 

TACTTGAAAA AGCTCATGGT GGGTTAAAAA TAAATCCCGA TGAGCAAAGA CGCTATCTTG 3 300 

GTACTTTTCA GGAAAGAGTT CTTGGATATG TAGATATTCA CACAGCAAAT AGCCCTCAGT 3 360 

TAGAAAAAGG CTTTTTATTT ATTTTAGAAA ACCTTCAGGA AAAAGCAGAG CCACTATTTG 3420 

TGAAGATTTC ACCAACTATC GAATTTGATA AGCAAGTTTT CTACTTAAAA GAAGCAAAAG 3480 

AAACTGATAG TCAAGCCACC ATAGTATCTG AACACCATA? TACTTCTCCT TTTGGCCTGC 3540 

TTATTCATAG CAATGCACCA GTTCAAGTAG AAGAAAAAGA CCTTCGACTT CCTTTTCCAA 3 600 

AACTT TGGG A AGTTAAAAAG GAAGAACCAG CCAAAACATC CTTATGCAAG AAATGGTTTA 3660 
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GCTAAATC7T 


GCACATATTT 


AATAAGTGCC 


CAATATTGGC 


AGCCGTGCGC 


TCCAGATAGA 


3720 


AACTCGCATT 


TTTCAAACTA 


TCTTCTAAAC 


GTTCACTTTT 


CTCCAAAATA 


GAAAAGACAG 


3780 


CTTGGATATT 


TTCAAATGGT 


AGGGGAGGTA 


AATCTTCAGC 


AAGACTACCG 


CAAATAGCAA 


3840 


TAACAGCAAC 


TCCAACAGGG 


GTTCTTTTTG 


CAACACCTAT 


AGGCGCTTTC 


CCAGCAAAGC 


3900 


TTTGACTATC 


AAGTCTTCCT 


TCTCCAACAA 


CAACCAAGTC 


AGCATCTGAA 


ACTTTCTTAT 


3960 


CAAAGTTGAT 


TAAGTCCAAG 


CAGGTATCAA 


TTCCAGACAC 


GATACTTGCC 


TGAGCAAAGG 


4C20 


CACACAAACC 


ACCAGCAAGG 


CCTCCACCTG 


CTCCTGCTCC 


TTTAATTTCT 


AATGTTGCAG 


4080 


GTGAGAATTT 


TTCATAAAAA 


TCTTGGATCG 


CCTGATCTAC 


GACTGCAAAC 


ATAGTCGGAT 


4140 


GTAGACCTTT 


TTGATTGCCA 


AAAGTGTAAG 


TCGCACCTTG 


ATGACCACAT 


AAGGGACTCA 


4200 


CGACATCTGC 


TAAAATATGA 


ATTTGAACAC 


CTTCAGGAAT 


TTTATAGCAA 


TTTTCTGTTG 


4260 


AAACAGAAGC 


TAAGTTTAAT 


AAGGATTGAC 


CGGAAGCAGG 


CAAGACATTT 


CCATCCCTAT 


4320 


CATAAAATTG 


ATAACCTAAA 


CCAGCAGCAA 


TCCCCAGTCC 


TCCATCATTA 


CTGGCCGTGC 


4380 


CACCAACACC 


CATATAAATA 


TCTTTAATCC 


CTTTACACAT 


GAGATCAAGA 


ATCAACTCTC 


4440 


CAATACCACA 


ACTTTGGATT 


TCAAGTGGAT 


TTCGTTTCTC 


TAGCGGAATT 


TTTCCAAGAC 


4500 


CAACCAAGTC 


AGCTACTTCA 


AATAGTGCCA 


GTTCCCCTTT 


TTGAAAATAG 


CGCATGGCTT 


4S60 


CTTTTTGTCC 


AAAAGGGTCT 


GTCACTTGGA 


TCCATTTTTC 


TTTTAGGTCA 


AGACAATGTC 


4620 


GGATAGCATC 


TACAGTACCT 


TCTCCCCCAT 


CACCAACAGG 


GCAGAGGAGA 


CATTCTACAT 


4680 


CTGCTATCCA 


TTGTTGGAAG 


CCTCTTTTTA 


TTGCTTCAGC 


TACCTGTTGA 


GCTGTCAAGC 


4740 


TTTCCT7AAA 


CGAATCCCCT 


GCAATTACAA 


TCTTCATATT 


TTCCCTCATT 


CTAAACAGTC 


4800 


AATCAAAGGG 


ACAACTTCTA 


AAAAATCCCT 


CTTGTCAACA 


TGATCTCGTA 




4860 


GAGCACTTCT 


TTGGCACAAA 


AGGCGATTCC 


TAACTTCGCC 


GACTTCAACA 


TTAATAGATT 


4 920 


ATTAACCCCA 


TCACCGATTG 


CCACCGTTCT 


TTCTTTAGAA 


AGTTTTAGTT 


TCTTTCTCCA 


4980 


TTTTTCCAGA 


GTCTCTTTTT 


TGACCTGGGG 


ACTTATAATT 


TGTCCAACTA 


ATTTTCCTGT 


5040 


TAAAAGACCT 


TCTTTGACTT 


CAAGCTAGTT 


GGCAGTGAAA 


TAGGCAATAC 


CAAGCGATTT 


5100 


TGCTAATCTC 


TCCAACTATT 


GGTGTAAATC 


CACCAGACAC 


CAGACCAACT 


AGGATGCCAT 


5160 


TCTTTTGGAG 


AATAGAGATG 


AACTCTCGGA 


CATTTACCGA 


TAGATGAATT 


GAGTTGAAGA 


5220 


CGTTATCAAA 


GACCAAAATA 


GGAAGACCTT 


CCAACAAGGA 


CACTCTTTTT 


CTTAAACTGC 


5280 


TTTCAAAGAC 


CAACTCTCCT 


CGCATTGCTC 


GACTTGTAAT 


CTGCGAAATT 


TCCGCCTCAT 


5340 


GACCTGCCTC 


TCTCCCTAAA 


AGATCAATCA 


CTTCTTCTAG 


GATTAAGGTT 


CCATCTACAT 


5400 
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CCAAAACACA CAACCCTTTT AC7TGAGACA TCAGTTCTCC TCTCTAAACA GCCTAAAAAT 5460 

CGTA7GAAGT CATCATACGA TTTTATCTAT TAATTAACTA AACTATCGTA CAAGTCAAGG 5520 

TATGACTTGC AGGCTGTATC CCATGAGAAG 7CACTCTCCA TAGCTTGTTT TTGTAGGTTT 5580 

CTCCAAATGT CTGGATGGTT TCTATACAAG TCCAATGCTG 7TTGGAAAG7 CCAATTTAAC 5640 

CAATAAGGAG ATAGATTGTC AAAGCTAAAG CCAGTACCGC TTCCTTCGAT TGGATTGAAA 5700 

GCGCGAACTG TATCTCGCAA GCCTCCAACT TCATGGACCA ATGGCAAGGT TCCATAACGC 5760 

ATAGCCATCA 77TCAGACAA GCCACACGGT TCAAAACGAC TTGGCATGAG GAAGAGGTCA 5820 

CAAGCACCGT AGATTTCCTG AGCAAGTTTG ACATCAAAAG TGATATTTGT TGATAGCTTG 5880 

T CTGGGT AAA TCTGAGCAAA CCATGAGAAA GCTCCTTCAA AGGCTGGATC GCCAGTTCCC 594 0 

AAAAGAACAA TCTGAACATC TTCTTGCAAG ATATGGTGAA GACTTTCGAC CACCACATCA 6000 

AAACCTTTTT GACGTGTCAA ACCAGAAACA ATTCCCACCA GTGGAACGTC TGCTCTAACA 6060 

GGCAAGCCAA CTCTTTCTTG CAATTTTGCC 77A7TTTTGG CTTTCCCAGA CAAATCTTCC 6120 

TGATTGAAAT GATAGTCTAA AAGAGCATCC CTCTCAGGAT TATAAAGATC AGCATCAATC 6180 

CCATTCACGA TACCAGATAC TTTACCAGAC TCCATTTTAA GAATCTGATC CAAATTACAT 6240 

CCAAACTGAC TAGTCATAAT TTCATGAGCA TAGCTAGGTG AAACGGTTGA AACACGGTTC 6300 

GCATAGAGAA TACCTGCCTT CATCCAGTTC AGACAGTTGT TCCATCGAAG GGTGCCATCA 6360 

GCGTAACGTT CAAAGCCAAC TCCAAACAAA TCACCCAACA TTCCTTCTGA AAATTGTCCT 6420 

7CGAAT7C7A AATTATGAAT GGTTAAAACT GTTTCAATGT CCTCATAGCC TTGAATCCAA 64 80 

CGGTA7TTTT CC7TCAACAA GAAAGGAATC ATAGCTGTAT GGTAGTCATG AACATGGAGA 6 5s0 

AGATCAGGAA TAAAGTCAAT CCTTTCCATA GCCTCAATGC CACCCAGTTG GAAAAACGCA 6600 

AAGCGT7CTC CGTCATCAAA A7CACCGTAA ACA7CACCAC GGAAGAAATA ATATTGATTG 6660 

7CAA7AAAGT AGAAGG7TAC ACCATTTAAT ACTG7TTTC7 TAA77CCACA A7ACTG7C7G 6720 

CGCCAACCAA CGCTCACC7C AAAA7GAAGC ACA7CTTCAA 7C7GA777CC AAA777AGCL" 67 80 

7CTACCATAT CATAGTAGGG 7AAAA7CAC7 GCAAC7TCG7 GCCCACCTT7 7ACCAGTGAT *.840 

777GGAAGAG CGCCAA7GAC G7CTCCCAAA CCACCTG777 77GAAAAGGG 7GCACCC7C7 6900 

GCTGCTACAA ATAAAATTTT CATGAATGAA 7A7CCTCTG7 TACTTTAGCA CCTT7CTTAA 6960 

CCACAAC7GG ATGTTCTGCA G77CC7CGAA 7CACAACACC ATGC7CAAC7 TCAACCCCTT 7020 

TGTCCAAGAT ACCATATTCG ACC7GAGCCC CTTC7CCAAT AACAACACGA GGGAAGAGCA 7080 

GGCTA7C7TT AACCAAGC7A 7CCTTATGGA CATGAATA7T ACG7GA7AGA ACAGAATTAG 714 0 

C7ACTTGACC TTCAA7AATA CTACCAGAGG CAAACTGAGA AG7GCT7ACC 7TAGATGTA7 72 CO 
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TAGCATAGTA AGTTGGCTCT TCGTTTTTGA CCTTTGTATA AATCTTTTGG TTTGGTGAGA 7 2 60 

AAACAGAATA GAATTTTTGT GATTCAAGCA TATCGATATT CGCTTGATAA TAAGATTTAA 7 320 

CAGAGTGAAT ATTGGCTAGA TAGCCCGTC7 ACTCGTAGGC GAAAGCTCCC TCTTTTACAG 7 3 80 

CCAAATCCCG TAAAACATAG CGCAATTTCT CTGGATGTTC TTTTTTAGCT TCTTCTTCCA 7 440 

AGTGTTCAAT CAACCAAGGT GTATCAACGA CAAAGATATC TGTAGACATA TTGAACGTTT 7 500 

CAGCTGTTGA CTTGCTATCA AAGAGTTTAT GAGAAACAAC ATGGTCTCTT TCATCTACAT 7560 

CCAAGATTGC ATTTACTTCT GAAATATCTT TCTTAGCTAG TTTTTTATAA ACTACAGTGA 7 620 

TAGGCTCTTT TGTTGTACTA TGTAGGTGGA AAACTTGGTT CAAATCAATG TTAATAAGAA 7 680 

CATCGCAGTT GAGGGCAACC GTTTGGTTTG AGCCAGAACG TTTCAAATAA GTAAGAAGCT 77 40 

GTTGGTAGTA TTCTTTTCCA ACTGTACTAC TTTCTACACG GGTATTGTAA ATTCCTAGAT 7800 

AGTAATGGCT AAGAAGGGTT GATAAGCCCC ACTCGCGTCC TGAACGAATA TGGTCAAATA 7 8 60 

CTGAGCTGAT ATTATCCTGC TGGAAAATAC CAAAGACACT ACGAACACCT GCATTAGCAA 7 92 0 

GGCTTCAAAG TGGGAAGTCA ATCAAACGAT ATTTCCCACC AAATGGCAAA CTTCCTACTC 7 9 30 

GACGGTGCTC CGTCAATGTC GACATATTGT GAAAACCAAC TGTATTTCCT AAAATGGCAG 8040 

AATATTTATC AATCTTCATC TGTTGCTACC CCCACTACTT CATTATATCC TACAACTTGT 8100 

ACTTCATCTG TTCCATCAAT TTCGACACCG TCAGAAATAA TCGCACCTTC ACCAATAATG 3160 

CCACGTT7AA TCTTAGCTCC TTGACCAATG ATAGCTCCAC TCATGATAAC TCAATCAAGC 8220 

ACTTCCCCTC CTTCGCGAAC 7TGCGCCCCT GTTGAAAGGA TAGAATCTTT AACAGTTCCA 82 80 

7CAACGAAAC ATCCCTCTAC AACTAATGAG TCTTCCACAT GAGCATTTGC CCCGAGGAAG 8 340 

TTTGGTCGTG AAATCAAGTT TCTTCAGTAA ATCTTCCATT CACCCTTACC ACTATCCAAG 8400 

GCATTTTCTG GAGAAATATA CTCCATGTTC GCTTCCCAAA GTGACTCAAT AGTACCAACA 84 60 

TCTTTCCAAT AACCACTAAA TTCGTAAGCA TAAACACTTT CACCTCACTC AAGGTAATTT 3520 

GGAA7GACAT TTTTACCAAA GTCTGACATG CCAACCTTGC TCTTTTCAGC AGCGACTAAC 8580 

ATATTACGAA GGCGTTGCCA ATCAAAAATG TAGATTCCCA TAGAAGCTTT TGTAGATTTA 3640 

GGTTGAGCTG GTTT7TCTTC AAATTCAACA ATACGATTGT TAGCATCTGT GTTCATGATA 8700 

CCAAAACGGC TTGCTTCTTT AAGAGGGACG TCTAAAACTG CTACTGTCAA GCTGGCATTA 87 60 

TTATCCTTAT GAGACTGGAG CATATCATCA TAGTCCA7TT TGTAGATGTG ATCCCCAGAC 8820 

AAAATCAAGA CATACTCAGG ATTGACACTG TCGATATAGT CGATATTTTG GTAAATAGCG 8 880 

TGACTAGTCC CCTCAAACCA ACGATTTCCT TCACTTGCAG AATAAGGTTG AAGAATAGAG 8940 



WO 98/18931 



PCT/US97/19588 



422 



ACACCTGAAT 


TAATACCGTC 


TAGTCCCCAG 


CTTGAACCAT 


TCCCAATATG 


GTTGTTGAGA 


9000 


GCAACTGGTT 


CATACTGTGT 


AACGACCCCA 


ACATTGTGAA 


TCCCTGAGTT 


GGCACACTTT 


9060 


GATAGGGCAA 


AGTCAATGAT 


ACGGTAGCGC 


CCACCAAATT 


GCACAGCTGG 


TTTTGCGATG 


9120 


CTTTGAGTGA 


GTTTACCGAG 


ACGAGTTCCT 


TGCCCACCAG 


CAAGAATCAA 


AGCTAACATT 


9130 


TCATTTTTCA 


TTTTCTACTC 


CTTTTTGGTT 


TTTATTTGTG 


ACGCTTTTAG 


TAGATTTCAA 


9240 


GCGACGTTTG 


ATTTTCCATA 


CACTTGCTCC 


CATAGCCGGT 


AGGGTAAAGG 


TTAAGGTCTG 


9300 


CTCATAATCT 


TTCCATAGTC 


CTTCTTGCGT 


TTCAACACTT 


TGATTATGTT 


CTTTCCAAAC 


9360 


GCCTCCCCAC 


TCTTCCAACT 


CAGTATTCCA 


TACTTCTTCG 


TAAATTCCTG 


CAACGGGTAG 


9420 


TCCGATTGTA 


AAATCTTTCC 


GCTCAACAGG 


TACCATATTA 


AAGATACAGA 


CTAACATTTC 


9480 


TCCCTTTTTA 


CCCTTACGAA 


TAAAGGAAAG 


AACACTCTGG 


TCTCGATTAT 


CCCCATCAAT 


9540 


GATTTCAATA 


CCATCATAGC 


TGGTATCAAT 


TTCCCACAGA 


CAGCGATGA? 


CTTTGTAAAA 


9600 


CTGGTTTAGC 


TGAGAAGCGA 


AATACTTCAT 


CTTAGCATTC 


ATTGGGTCTT 


CTAGGTTAGA 


9660 


CCATTCCAAC 


TGTTCTTCAG 


ATTTCCATTC 


TAGGAATTGA 


CCGTATTCGC 


TACCCATGAA 


9720 


GAGCAATTTC 


TTACCAGGGT 


GACAAATTTG 


GTACGTATAG 


AGATTGCGCA 


AGCCTGCGAA 


9780 


TTGATTGTAA 


CCATCTCCCC 


ACATCTTATG 


CATCATACTC 


TTCTTGCCAT 


GAACCACTTC 


9840 


ATCGTGCGAG 


AATGGCAAGA 


GATAATTCTC 


CTTGAAAACA 


TACATAAAGC 


TGAAAGTCAC 


9900 


CAGGTTAAAG 


TCATATTTAC 


GATAGATCGG 


ATCTTCTTCG 


TAGAAACGGA 


GGATATCATT 


9960 


CATCCAGCCC 


ATGTTCCATT 


TGTAGTCAAA 


TCCTAGACCA 


CCAATCTCT? 


TCATTCCCCT 


10020 


AATCTTGATC 


GCAGACGAAC 


TTTCTTCTGC 


AATCATCATC 


ACATCTGCA7 


ATTC7AACTT 


10080 


AATAACCTCA 


TTCAAGCGCT 


GAAGGAAATA 


ATAACCTTCA 


TAGTTGACAT 


TTCCGCCATC 


10140 


TTTATTAGGT 


GTCCATCGAG 


CATCATCATA 


GTCCAAATAG 


AGCATGTTGC 


TAACAGCATC 


10200 


CACACGAATA 


CCATCCAAAT 


G AT AG AC AT C 


AATCCAATGC 


TTAATGCAAG 


AAATTAAGAA 


10260 


GGACTGGACT 


TCATTTTTTC 


CAAGGTCAAA 


ATTAAGGGCA 


CCCCAACCAT 


GGTTATGAGC 


13320 


CTTATTATGG 


TCTTGGTATT 


CAAAAGTCGG 


TGTCCCATCA 


TAATAGGCTA 


AGGCATCATC 


10380 


GTTGATGGTA 


AAGTGACTGG 


TACCCAGTCC 


ACAATAACCC 


CAATATTATG 


GGTATGACAC 


10440 


TCCTCGACAA 


AATCTTGAAA 


CTCCTCTGGT 


CGCCCATAAG 


CATGCTCTAA 


AGCGAAGTAA 


10500 


CCCATAAGCT 


GATACCCCCA 


ACTCAAGCCC 


AAAGGATGGC 


ACATCAAGGG 


CATAAACTCA 


10560 


ATATGAGTAT 


AGTTCATTTC 


AACGAGATAA 


GGAATGAGTT 


CATCCTCCAG 


CTGGGCAAAA 


10620 


CTATAAGGAC 


TGCCATCAGA 


ATTTCTTTTC 


CATC AT CC AG 


CGTGAACTTC 


ATAAATATTG 


10680 


ACAGGACGCT 


CTTCAAAGCC 


CCAACGTTTT 


CTTCCTCCCA 


GCCAAAGTCC 


ATCCTTCCAT 


10740 
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TTCTTCTCAC GAAGCTCTGT TACGATTGCC CCTGTTCCTG GACGAGCCTC ATACCTGACA 10800 

GCAAAAGGGT CAATCTTCAT CAGTTGATGA CCATTTTGAC GTGTCACATG ATATTTGTAA 10860 

ATATGCCCTT CTTGACCCAT ATTGGTAAAG ACTTCCCACA CCCCAAAATC ATTTCTTACC 10920 

ATTGGAATCT GATTTTCAAT CCAGTTGGTA AAATCACCAA CCAAGTGAAC AGCCTGAGCA 10980 

TTAGGTGCCC AAACACGGAA GGTATAGCCA TGCTCTCCAT TTAGTTCTTC CCTATGTGCT 11040 

CCTAGATAAT GTTGGAGATA AAAATTTTCA CCCGTCATAA ACCTTTTTAA TGCTTCTCTA 11100 

TTATCCATAT ACTCCCCTTC TCCTGTAACC GTTTTCTATG TTTTTATTAT ACTACCTTTT 11160 

TAGAGAAGAT TCAAGTAAAT TACTATACTT CTTTAATTAT TTTGAAAATC TACAACAAGT 112 20 

TCACTTACTC GTTCAATTGT AAATCAATAT TTTTTCAAAA AATTGCGAAA ACGCCTTTCT 11280 

TTTTCTACTA TAGTGAAATG AAATAAAACA TGCGCAAATC GATTAAGGAA TTTAATCTAA 11340 

TTTCTAACAA TGTCTTAGAA ATCAAAGTGT ACTATTTTAA CTCC 11384 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7571 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND EDN ESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ 10 NO: 46: 

TCCTTCCCCT GGAAAAGGAA TTCTCCTTAG 60 

CACCATCTTC GTACCATTCA TTTTGGGAAT 120 

TGTTGGAACA CTCATCTACA TGATTGCTAC 180 

TTCATTTGCC TTTATCACAC CTATGTCACT 240 

TGCTGCCCAA ACAGGGGTTA TCTTGACTGG 300 

CCGATTTGTA GGAACAAAAT GGATTGATAA 3 60 

GATCATCGTT ATCGGTCTTG GACTTGCAGG 4 20 

AGACGGAAAT TGGAAAAATG CTCTGGTAGC 4 80 

CAATACAAAA GG AAAAGGCT TCCTACCAAT 540 

TTACCTTTTC GCACTAACTC TTCGCTTGGT 600 

GTTCGAAATT CCTGGTTTCT ACTTCCCATT 660 

TCTTTACTTT GGTCCAGAAG CCATCGCTAT 720 



TCTTGATTTG TTACTAGACG TTGACCAACG 
TTTCCAACAC GTTTTCGCCA TGTTTGGTGC 
GCCTGTATCT GTTGCCCTTT TTGCTTCAGG 
TGGTTTTAAA GTTCCAGTTT ATCTAGGTTC 
GGCTATGAAA GAAATGGGGG GGGATGTATC 
TTTGGTCTAT GTCCTTGTTG CTACCAGCAT 
ACTCTTGCCA CCAATCATTA TCGGTCCTAT 
TTCAGCTGTT ACCAATGCAG GTCTTGTAGC 
CGTTGTTACT TTCCTAATTG CTGCCTTTAT 
CATTCCATTC CTCTTTGCCA TTATCGGTGG 
TGACTTTACA CCAGTTCTTA AAGCCAACTG 
TAGCACAGGT GGTGCCTTTA AAGAGTACAA 
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CTTCCCAATC GCTATCGTAA CAATTTCTGA ACATATCGGA GACCATACTG TTTTGGGTCA 780 

AATCTCTCCT CGTCAATTCT TAAAACAACC AGGTCTTCAC CGTACTCTTC TTGGTGAC GG 840 

TATCGCAACT TCTGTTTCTG CCTTCCTTCG TGGACCAGCC AATACAACTT ACGGAGAAAA 900 

TACAGGCGTT ATCGGTATGA CTCGTATCGC TTCTGTCTCA GTTATCCGTA ACGCTGCCTT 960 

CATCGCGATT GCCCTCAGCT TCCTTGGTAA ATTCACTGCC TTGATTTCAA CTATTCCAAA 1020 

CGCTGTACTT GGTGGTATGT CAATCCTTCT CTATGGGGTT ATCGCCAGCA ATCC7TTGAA 1080 

AGTCTTGATT AAAGAACGTG TTGATTTCGC TCAAATGCGA AACCTCATCA TCGCAAGTGC 1140 

TATGTTGGTT C7TGGACTTG GAGGAGCTAT CCTTAAACTT GGTCCAGTTA CACTTTCAGG 1200 

7ACTCCCCTT TCAGCCATGA CAGGAATCAT CTTGAACTTG ATCTTGCCAT ACGAAAATAA 12 60 

AGACTAAGAG TCTAAATACA CCTAATCCAC TCAGACAGCT GAGTGGATTT TTCGTATACC 13 20 

ATAATAAAAG TGTCTTAACA AAATTATTAA AATCAAAAAA CGTATAATAT CAGATATTCT 1380 

AAAACCTTGA TACTGTACGT TTTATCATAG AAATTTTTAC TTTATTTTCT CATCAAATGA 144 0 

GATTTGCATC AATCTCTTGT CTTACTTCCG TTTCTTCTTC GCTTTCTTCA TTTTGTTAGC 1500 

CATACGTTTC ATGGACTGTT TCATGGCAAA TTCACCAATT TTACCTTTCA AACCGCCACC 1560 

AAACATCTCG CTCATATCTG GCATTCCTCC TCCTCCGAGA GCTGATAAGT CAGGCATACC 162 0 

GCCTTGTCCC ATCATTCCTT CAAGGGCAGA CATATCCATT CCTCCCATAT TTGGCATATT 1680 

TTTACGAAGG TTATTTGGAT TAATCCCCAT TTGCTTCATC ATTTTATTCA TATCCCCAGA 1740 

CATAACACCC TGCATGAGCT GTTTAGCCTG GTTAAAGTCC TTGATGAATT TATTGACTTC 18C0 

GACGAATGTA TTTCCAGAAC CACCAGCAAT ACGACGGCGA CGGCTTGCAT TTAACAAATC 1360 

TGGCTTTTCA CCCTCTTCAG CTGTCATCGA AGACACAATG GCACGTTTAC GAGCAATCTG 1920 

GCGTTCATCC ACCTTCATGT TTTGAAGCGC TGGATTGTTG GCCATACCTG CAATCATCT7 1980 

CAGCAAGTCT TCCATCGGCC CCATATTTTG CACCTGATCT AATTGATCGA TGAAATCATT 2040 

AAAATCAAAG GTGT7TTCGC GCATCTTCTC AGCCATTTCA AGGGCTTTTT GTTCATCGl'A 2100 

TTCCTGAGAA GCTTTCTCAA TCAAAGTGAG CATATCCCCC ATACCAAGCA 7ACCCCTAGA 2160 

CATGCCCTCT GCGTGGAAGG TTTCAATGTC CCTAATCTT7 TCACCTGTAC CAGTGAACTT 22 20 

GATTGGTTTT CCAGTAATGT GACGAACAGA CAGAGCAGCA CCACCACGAG TATCGCCATC 2280 

AATCTTGGTA AGGATGACCC CAGTCACTTC CAACTGAGCA TTAAACTCAC GCGCAACATT 2 3 40 

GGCTCCT7CC TGACCAATCA TAG CATC AAC GACAAGCAAG ATTTCATTTG CTTCACCCAA 2400 

7CCTT7CACA TCACGAAGCT CATTCATCAC GAGCTCATCA ATCTGCAAAC GACCCGCAGT 24 60 

ATCAATCAAG ACATAC7CGT TATGATTAGT TTGGGCTTCC TCCAAACCTT GACGTACAAT 2520 
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CTCAACAGCT GGTACTTCTG TTCCAAGTGC AAAGACACCC ACATCAATCT GTTGTCCCAA 2 590 

CCTCTTAAGC TGGTCAATGG CAGCTGGACG ATAAATATCC GCCGCAATCA TCAAACCACG 2 640 

ACCATTTTCT TCTTTCTTGA GTTTGTTGGC CAATTTACCA GCAAAGGTTG TTTTACCAGC 2700 
CCCTTGTAAA CCAACCATCA TGATGATGGT TGGAATCTTA GGTGACTTGA TAATTTCTGC 2760 
CGTATCAGAA CCTAAAACGG CTGTCAATTC CTCATCAACG ATTTTAATAA TCTGTTGCGC 2 820 

AGGATTAAGT CTATCAATGA CCTCATGCCC GACTGCACGC TCACGAACTT TCTTGATAAA 2880 

CTCCTTTACA ACAGGCAAGG CAACCTCGGC CTCGAGCAAG GCCAAGCGAA TTTCTTTCCT 2940 

TGCCTCTTGG ACATCAGATT CAGAGATTTT TCCTTT7TTA CG7AGAT777 TAAAGACCTT 3 300 

CTGCAAACGT TCTGTTAAAC TTTCAAATGC CATTTTTCTT CCTCTTATTC TCTATTATCA 3060 

ATGCTTGTTA AAATTTCTAT CTGCTCCTGC AGAAAGTCAT CCTTGGGATA GCGCTCCAAA 3L20 

ATCTGATCAA AAATCTGACT GCGGACAATA TAGTCCGAG7 ACATGTGCAA TTTCATCTCA 3180 

7AA7C77CCA GAATCTTTTC TGTTCGCTTC ATATTGTCAT AGACAGCCTG ACGACTGACA 3 240 

CCCAAC7CC7 CGGCAATTTC AGCAAGGCTG TAATCATCAG CGTAGTAGAG CTCGATATAA 3 300 

TTCATTTGCT 7ATC7G7CAA AAGCGCCGCA TAAAATTCAA AGAGCGCATT CATACGATTG 3 360 

GTTTTTTCGA TTTCCATAAC TTTTATTATA CCAAAAATTA GCCTAATCTA CCACACTAGG 3420 

AACCCGATCC AAGAACATAG ATAGCTAAAT TTGAAAAAGA CATGAGCCTA GCCCCAAGTA 3430 

A777CCAAT7 GATAGCTGGC AAAGGGATGT CCC7C77GA7 TTTGTAGTTG ATAATCTAGT 3 540 

7CAATC7777 GCCTATCAAC T7CA7AA7CG CTCGTTTGGA TGATAAACTC CTGCATGCCC 3 500 

ATAGGTGTAG GAATATAGGC TAAACTATCG C7A7CC777A GAAAGCGCAT AATCGTCTTG 3660 

CGA77AGAAA ATCCGCTCAT CACAAG7TC7 TCACCATCAA A7T7AATCAC TAC77777CC 3 720 

TTTTCCTCAT TATAGAAAAG CAGGTACCTA TAATCTCCTT TTTCATGCAC TTCCACATCA 3 780 

TAAAGCTGGT CAATCACTTC CAACTGCTCA 7CAAAC7GAA 7CG7A777CG CATCCGAATC 3 840 

TTCACATCAG GCCCTCTTTC TTGTCTCTTC TCCTACTATT TTACCAAAAA GAGCAGGATT 3 900 

TTGCTATAAT GGTCATATGA ACGAAAAAGT ATTCCGTGAC CCTGTTCACA ACTACATCCA 3 960 

TGTCAATAAT CAAATCATCT ATGACTTGAT TAATACAAAA GAATTTCAGC G7TTGCGCCG 4020 

GATCAAACAA CTGGCAACTT CCAGTTATAC CTTCCACGGT GGAGAACACA GTCCCTTCTC 4 080 

TCACTGTCTA GGAGTCTATG AAATTGCACG ACGCATCACA GAGATTTTCG AAGAAAAATA 414 0 

7CCTCAGGAA TGGAATCCTG CCGAGTCTCT CTTCACCATG ACCGCTGCTC TCCTACACGA 4 200 

CCTTGGGCAT GCTCCCTACT CCCATACTTT 7GAACATCTC 7TTGATACAG ACCATGAAGC 4 260 
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CATTACTCAG GAGATTATTC AAAATCCTGA GACAGAGATT CACCAAGTCC TGCTACAAGT 4 32 0 

GGCACCTGAT TTCCCAGAAA AGGTGGCCAG TGTCATTGAC CATACCTATC CTAATAAGCA 4 380 

GCTCCTCCAG CTCATTTCTA GTCACATTGA CCCAGATCGC ATGGACTATC TCTTGCGCGA 44 40 

CTCCTATTTT ACAGGAGCAT CCTATGGGGA ATTTGACCTG ACTCGAATCC 7CCGACTCAT 4 500 

TCGTCCTATC CAAAATGGTA TCGCCTTTCA CCGCAATGCC ATGCACGCCA 7CGAAGACTA 4 5 60 

CGTCCTCAGT CGCTACCAGA TGTACATCCA GGTTTATTTC CACCCCGCAA CACGCGCCAT 4 620 

GGAAGTTCTC CTACAGAATC TTCTCAAACG CGCCAAGGAA CTCTATCCTG AGGACAAGGA 4 680 

TTTCTTTGCC CGAACTTCTC CACACCTCCT GCCTTTCTTC GAAAAAAATG TGACCTTGAC 4 740 

TGACTATCTG GCTCTGGATG ATGGCGTGAT GAATACCTAC TTCCAGCTTT GCATGACCAG 4 800 

TCCTGACAAG ATTCTTGCAG ATTTATCGCA TCGCTTTGTC AACCGCAAGG TCTTTAAATC 48 60 

CATTACCTTT TCACAAGAGG ACCAAGATCA ACTTACTAGC ATGAGAAAAT TGGTTGAGGA 4920 

TATCGGCTTT GATCCCGACT ACTACACTGC CA7TCATAAG AACTTTGACC TCCCTTATGA 4980 

TATCTATCGT CCCGAATCTG AAAACCCACG GACAGAGATT GAGATTTTAC AAAAAAATGG 5040 

AGAACTGGCC GAACTCTCTA GCCTGTCTCC TATCGTCCAA TCCCTTCCTG GCACTCCCCA 5100 

CGGAGATAAT CGCTTTTATT TTCCAAAAGA AATGTTGGAC CAAAACAGCA TCTTTGCAAG 5160 

CATTACCCAG CAATTTTTAC ACTTCATTGA GAACGATCAT TTTACCCCAA ATAAAAACTA 5220 
GAAGAGGAAA TTTATCACTA TTAAACTAAT TCCCGTTGAT ATCGACGGAA CCCTTGTCAA . 52 80 

CAGCCAAAAG GAAATCACTC CTGAAGTTTT TTCTGCCATC CAAGATGCCA AAGAAGCTGG 53 4 0 

TGTCAAAGTC GTGATTCCAA CTGGCCCCCC TATCGCAGGC GTTGCCAAAC TTCTAGACGA 54 00 

CTTGCAGTTG AGAGACGAGG GGGAC7ATGT GGTAACCTTC AACGGTGCCC TTGTCCAAGA 5460 

AACTGCTACA GGACATGAGA TTATCAGCGA ATCCTTGACT TATCAGGATT ATCTAGATAT 5520 

GGAATTCCTC AGTCGCAAGC TCGGTGTCCA CATGCATGCC ATTACCAAGG ACGGTATCTA 5530 

TACTGCAAAT CGCAATATCG GAAAATACAC TGTACACGAA TCAACCCTCG TCAGCATGCC 564 0 

TATCTTCTAC CGTACCCCTG AAGAAATGGC TGGCAAAGAA ATTGTTAAAT GTATGTTTAT 5700 

CGATGAACCA GAAATTCTCG ATGCTGCGAT TGAAAAAATT CCAGCAGAAT TTTACGAGCG 57 60 

CTACTCCATC AACAAATCTG CTCCTTTCTA CCTCGAACTC CTTAAAAAGA ATGTAGACAA 5820 

GGGTTCAGCC ATTACTCACT TGGCTGAAAA ACTCGGATTG ACCAAAGATG AAACCATGGC 5 8 80 

AATCCGTCAT GAAGAAAATG ACCGTGCCAT GCTGGAAGTC GTTGGAAACC CCGTTGTCAT 5 940 

GGAAAATGGA AATCCAGAAA TCAAAAAAAT CGCCAAATAC ATCACCAAAA CAAATGACGA 6000 

ATCCGGCGTT GCCCATGCCA TCCGAACATG CCTACTCTAA AAGTATCATT TTTCAATAAG 6060 
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AATTGATTAG CAATAAAATC CAATGAATTT TTTTAGCAAA CTATTTAATT TAAAACAAAA 512Q 

TAATCATAAT AGAGACACAA ATTCTGATTG TAACAA7TTT TACCTAAACG AATTAGAATG 6130 

TGGCCTTACT CCTGGGCAAC TCATACTCAT AGATTGGACT CAAAAAACAG GGAGAAATTA 6 24 0 

TAATTTCCCA AGATATTTTA AATACTCTCT TCAAATTGAC CCTGAATCTA CACACAATCA 6300 

ATTATACAAA TTAGGATACT TCACTAAAAA TAAGACTTTA TCATATCTTA CAGTAGTAGA 63 60 

ATTAAAAACT ATATTATCTA AACATAATTT AGCTACTTCT GGAAAAAAAG CAGAATTAAT 6 420 

TACAAGAATA ATTAATAATG TTAACATTGA CAATTTAGAT ATTCCGTTCG AATTTAAACT 6480 

AACAAAAGAA GCACAAAATC TTATTATCGA ACATAGTGAC TATATCAAAG CATACTATGA 6 540 

TAAAGACATA ACTATCGAAG ATTATTGTAA AGAAAAAAAC AATATCTCTT TTAAAGCAAC 6600 

TTTTGGTGAT ATAAAATGGA GTCTCTTAAA TAAACAAGCT CATAGGAATA CTGTATCAGC 6660 

AGATTTTGGA TCCT7ATCTA ACACACGAAA GCCTCAGGGA AGACATTTGG AACAAGAAGG 6720 

TAATATTAAA CATGCTTTAA TATATTACAT ACAATCTTTG ATAATTACTA TTTCAGGATT 6780 

AGAAAACAAT TTTTCACCCA CTCATTATCC AGTATATTAT CCCGATTCGA TACCTCACTA 6840 

CTCACTAAAA CATATTCAAA CATTAATGGA ATCATTATCT GATGACGATT ATGATTTTGC 6 900 

TTTTGATGAA GCATTATTTC GCTTCTCAAT TTTGAATGCA AATCATTTTT TATCTAAGGA 6960 

AGATATTGAC TATTTAAGAG TTAATTTACC TCGTTCCACT GCTGAAGAAA TAAACAATTA 7020 

CTTAAAGAAA TATGAATGTT ATACTCCTTT AAATAATTTA GAACTTCACG ATTTTGAATA 7 080 

AATTCACTAT ACAAACATTT ATATACTCCA TATAGTCTCA ATTTTATCTC A7CATTCCCC 7 HO 

AAATTTTTCA ATAATAAAAC GCATAATATT ATGGAGACAA TCCCCTA7AT TATGCGTTCT 7 200 

TTTAATATCA AAGACTTTT7 CACAAACTTC TTTGATATCT AATTACATGC CCCCTCCACC 7 260 

AATCGAACCT GCAACTACTC CTTAGGAGGG AGTTGTTATA TCCATTGAAC TAAGGGACCT 7 3 20 

AGATAAAAAC TCTGCTAAAT GAGCAGAGTT TTTTAGTCGA ATTAACGACC GATTTCTTTG 7 3 80 

ATACGAGCTG CTTTACCTTG AAGAGCACGC AAGTAGTACA ATTTCGCACG ACGTACTTTA 744 0 

CCGTAACGAA CAACTTCGAT TTTTTCAACA CGTGGAGTGT CGATTGGGAA GATACGCTCA 7 500 

ACACCTACAC CGTTAGAGAT TTTACGAACT G7GTACTTTT CTGAGATTCC AGCAC CT TTA 7560 

CGTGCGATAA CAACACG 7 577 

(2) INFORMATION FOR SSQ 10 NO: 47: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4945 base pairs 
{ B ) TYPE: nucleic acid 
{CI STRANDEDNESS: double 
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( D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 47: 

CCTCGCTGAT GATTGGTGCT GTTTTATTTG CTGGTCCACC CTTGGCTGAA GAAACTGCAC 60 

TTCCTGAAAA TAGCGGAnCT AATACAGAGC TTCTTTCAGC AGAGAGTGAG CATTCGACCA 120 

ATGAAGCTGA TAAGCACAAT GAAGGGGAAC ATGCTAGAGA AAACAAGCTA GAAAAGGCAG 180 

AAGGAGTACC GATAGCATCT GAAACTGCTT CGCCAGCAAG CAATGAAGCT GCAACTACTC 2 40 

AAACTGCAGA AGCAGCTAGC GCAGCTAAAC CAGAGGAAAA AGCAAGTGAG GTGGTTGCAG 3 00 

AAACACCATC TGCAGAAGCA AAACCTAAGT CTGACAAGGA AACAGAAGCA AAGCCCGAAG 3 60 

CAACTAACCA AGGGGATGAG TCTAAACCAG CAGCAGAAGC TAATAAGACT GAAAAAGAAG 4 20 

TCCAGCCAGA TGTCCCTAAA AATACAGAAA AAACATTAAA ACCAAAGGAA ATCAAATTTA 480 

ATTCTTGGGA AGAATTCTTA AAATGGGAAC CAGGTGCTCG TGAACATGAT GCTATTAACC 540 

GCGGATCTGT TGTCCTCGCT TCACGTCGGA CAGGTCATTT AGTCAATGAA AAAGCTAGCA 600 

AGGAAGCAAA ACTTCAACCC TTATCAAACA CCAATTCTAA AGCAAAAGAC CATGCTTCTG 660 

TTGGTGGAGA AGAGTTCAAG GCCTATGCTT TTGACTATTG GCAATATCTA GATTCAATCG 7 20 

TCTTC7GGGA AGGTCTCGTA CCAACTCCTC ACGTTATTGA TGCAGGTCAC CGTAACGGGG 780 

TTCCTGTATA CGGTACACTC TTCTTCAACT GGTCTAATAC TATTGCAGAT CAAGAAAGAT 840 

TTCCTGAAGC TTTGAAGCAA GACGCAGATG GTAGCTTCCC AATTGCCCGT AAATTGGTAG 900 

ACATGGCCAA GTATTATGCC TATGATGGCT ATTTCATCAA CCAACAAACA ACTGGAGATT 960 

TGCTTAAACC TCTTGCACAA AAGATGCCCC AGTTTATCCT CTATAGCAAG GAATATGCTG 1C2C 

CTAAGGTAAA CCATCCAATC AAGTATTCTT GGTACGATGC CATGACCTAT AACTATGGAC 1080 

GTTATCATCA AGATGGTTTG GGAGAATACA ACTACCAATT CATCCAACCA GAAGGAGATA 1140 

AGGTTCCGGC AGATAACTTC TTTGCTAACT TTAACTGGGA TAAGGCTAAA AATGATTACA 1200 

CTATTGCAAC TGCCAACTGG ATTGGTCGTA ATCCTTATGA TGTATTTGCA GGTTTGGAAT 1260 

TGCAACAGGG TGGTTCCTAC AAGACAAAGG TTAAGTGGAA TGACAT T TTA GACGAAAATG 13 20 

GGAAATTGCG CCTTTCTCTT GGTTTATTTG CCCCAGATAC CATTACAAGT TTAGGAAAAA 13 80 

CTGGTGAAGA TTATCATAAA AATGAAGATA TCTTCTTTAC AGGTTATCAA GGAGACCCTA 1440 

CTGGCCAAAA ACCAGGTGAC AAAGATTGGT ATGGTATTGC TAACCTAGTT GCGGACCGTA 1500 

CGCCAGCGGT AGGTAATACT TTTACTACTT CTTTTAATAC AGGTCATGGT AAAAAATGGT 1560 

TCGTAGATGG TAAGGTTTCT AAGGATTCTG AGTGGAATTA TCGTTCAGTA TCAGGTGTTC 1620 
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TTCCAACATG GCGCTGGTCC CAGACTTCAA CAGGGGAAAA ACTTCGTGCA GAATATGATT 1680 

TTACAGATGC CTATAATGCC GGAAATTCCC TTAAAT7CTC TGGTGATGTA GCCGGTAAGA 174 0 

CAGATCAGGA TGTGAGACTT TATTCTACTA AGTTAGAAGT AACTGAGAAC ACCAAACTTC 1800 

GTGTTGCCCA CAAGGGAGGA AAAGGT7CTA AAGTTTATAT GGCATTCTCT ACAACTCCAG 18 60 

ACTACAAATT CGATGATGCA GATGCATGGA AACAGCTAAC CCTTTCTGAC AACTGGACAA 19 20 

ATGAAGAATT TGATCTTACC TCACTAGCGG GTAAAACCAT CTATGCAGTC AAACTATTTT 1980 

TCGAGCATGA AGGTGCTGTA AAAGATTATC AGTTTAACCT AGGACAATTA ACTATCTCGG 204 0 

ACAATCACCA AGAGCCACAA TCGCCGACAA GCTTTTCTCT AGTGAAACAA TCTCTTAAAA 2100 

ATGCCCAAGA AGCGGAAGCA GTTGTGCAAT TTAAAGGCAA CAAGGATGCA GATTTCTATG 2160 

AAGTTTATGA AAAACATGGA GACAGCTGGA AATTACTAAC TGGCTCATCT TCTACAACTA 22 20 

TTTATCTACC AAAAGTTAGC CGCTCAGCAA GTGCTCAGGG TACAACTCAA GAACTGAAGG 22 80 

TTGTAGCAGT CGGTAAAAAT GGAGTTCGTT CAGAAGCTGC AACCACAACC TTTGATTGGG 2 340 

GTATGACTGT AAAAGATACC AGCCTACCAA AACCACTAGC TGAAAATATC GTTCCAGGTG 2 4 00 

CAACAGTTAT TGATAGTACT TTCCCTAACA CTGAAGGTGG AGAAGGTATT GAAGGTATGT 24 60 

TGAACGGTAC CATTACTAGC TTGTCAGATA AATGGTCTTC AGCTCAGTTG AGTGGTAGTG 2520 

TGGATATTCG TTTGACCAAG CCACGTACCG TTGTTAGATG GGTCATGCAT CATGCAGGAG 2 580 

CTGGTGGTGA GTCTGTTAAC GATGGCTTGA TGAACACTAA AGACTTTGAC CTTTATTATA 2 640 

AAGATCCAGA TGGTGAGTCG AAGCTAGCTA AGGAAGTCCG 7GC7AACAAA GCACACGTGA 2700 

CAGA7ATCAC TCTTGATAAA CCAATCACTG CTCAAGACTG GCGCTTGAAT GT7GTCACTT 27 60 

CTCACAATGG AACTCCATCC AAGCCTAT7C GTATCTATAA CTGCAAAATG TATGAAAACC 2320 

TTGATACTGA GAC7GTCAAT ATTCCGATGG CCAAGGCTGC AGCCCCTTCT CTAGGCAATA 2 880 

ACAAGGTACA AGTTCGCTTT GCAGATGTAC CGCCTGGAGC AACTATTACC CTTTATGATA 2 940 

ATCCAAATTC TCAAACTCCG CTCGCAACCT TCAAGAGCGA AGTTGGAGGA GACCTAGCAA 3 COO 

GTGCACCATT GGATTTGACA AATCAATCTG GTC7TCTTTA TTATCGTACC CAGTTGCCAG 3 060 

GCAAGGAAAT TAGTAATG7C CTAGCAGT7T CCGTTCCAAA AGATGACAGA AGAATCAAGT 3120 

CAGTCAGCCT AGAAACAGGA CCTAAGAAAA CAAGCTACGC CGAAGGGGAG GATTTGGACC 3180 

TTAGAGGTGG TGTTCTTCGA GTTCAGTATG AAGGAGGAAC TGACGACGAA CTCATTCGCC 3 24 0 

TAACTCACGC AGGTGTATCA GTATCAGGTT TTGA7ACGCA TCATAAGGGA GAACAGAATC 3 3 00 

TTAC7CTCCA ATATTTGGGA CAACCGGTAA ATGCTAATTT GTCAGTGAC7 G7CACTGGCC 3 3 60 
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AAGACGAAGC AAGTCCGAAA ACTATTTTGG GAATTGAAGT AAS*P€^1GAA CCGAAAAAAG 34 20 

ATTACCTAGT TGGTGATAGC TTAGACTTGT CTGAAGGACG CTTTGCAGTG GCTTATAGCA 34 80 

ATGACACCAT GGAAGAACAT TCCTTTACTG ATGAGGGAGT TGAAATTTCT GGTTACGATG 3 540 

CTCAAAAGAC TGGTCGTCAA ACCTTGACGC TTCATTACCA AGGCCATGAA GTTAGCTTTC 3 600 

ATGTTTTGGT ATCTCCAAAA GCAGCATTGA ACGATGAGTA CCTCAAACAA AAATTAGCAG 3660 

AAGTTGAAGC TGCTAAGAAC AACGTGCTCT ATAACTTTGC TTCATCAGAA GTAAAAGAAG 3720 

CCTTCTTGAA AGCAATTGAA GCGGCCGAAC AAGTGTTGAA AGACCATGAA ACTAGCACCC 37 80 

AAGATCAAGT CAATGACCCA CTTAATAAAT TGACAGAAGC TCATAAAGCT CTGAATGGTC 3840 

AACAGAAATT TACGGAAGAA AAGACAGAGC TTCATCCCTT AACAGGTGAG CTTCAAGAAC 3 900 

TCTTGGCTGC CAAACCAAAC CATCCTTCAG GTTCTGCCCT AGCTCCGCTT CTTGAGAAAA 3960 

ACAAGGCCTT GGTTCAAAAA GTAGATTTGA GTCCAGAAGA GCTTACAACA GCGAAACAGA 4020 

GTCTAAAAGA TCTGGTTGCT TTATTGAAAG AAGACAAGCC AGCAGTCTTT TCTGATAGTA 4080 

AAACAGGTGT TGAAGTACAC TTCTCAAATA AAGAGAAGAC TGTCATCAAG GGTTTGAAAG 4140 

TAGAGCGTGT TCAAGCAAGT GCTGAAGAGA AGAAATACTT TGCTGGAGAA GATGCTCATG 4200 

TCTTTGAAAT AGAAGGTTTG GATGAAAAAG GTCAAGATCT TGATCTCTCT TATCCTTCTA 42 60 

TTGTGAAAAT CCCAATTGAA AAACATAAGA AAGTTAAGAA AGTATTTTTC TTACCTGAAG 43 2 0 

GCAAAGAGGC AGTAGAATTG GCTTTTGAAC AAACGGATAG TCATGTTATC TTTACAGCAC 4 3 80 

CTCACTTTAC TCATTATCCC TTTGTTTATG AATCTGCTGA AAAACCACAA CCTGCTAAAC 4440 

CAGCACCACA AAACACAGTC CTTCCAAAAC CTACTTATCA ACCGACTTCT GATCAACAAA 4S00 

AGGCTCCTAA ATTGGAAGTT CAAGAGGAAA AGGTTGCCTT TCATCGTCAA GAGCATGAAA 4560 

ATACTGAGAT GCTAGTTGGG GAACAACGAG TCATCATACA GGGACGAGAT GGACTGTTAA 4 620 

GACATGTCTT TGAAGTTGAT CAAAACGGTC AGCGTCGTCT TCGTTCAACA GAAGTCATCC 4 630 

AAGAAGCGAT TCCAGAAATT GTTGAAATTG GAACAAAAGT AAAAACAGTA CCAGCAGTAG 47 40 

TAGCTACACA GGAAAAACCA GCTCAAAATA CAGCAGTTAA ATCAGAAGAA GCAAGCAAAC 4 800 

AATTGCCAAA TACAGGAACA GCTCATGCTA ATGAAGCCCT AATAGCAGGC TTAGCCAGCC 4860 

TTGGTCTTGC TAGTTTAGCC TTGACCTTGA GACGGAAAAG AGAAGATAAA GATTAAATAT 4920 

CGAAAAATCT TGTGAAATCT TTCCG 494 5 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25002 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi» SEQUENCE DESCRIPTION : SEQ 10 NO: 48: 

GACAACTCAA GTAGCTTTTT CTTATTTTGA AAAAGGAGAT CAGAGTTTAA CTATGTCAGA 60 

AAAATCACAA TGGGGGTCGA AACTTGCTTT TATTCTAGCA TCTGCTGGCT GGCCATCGGG 120 

CTTGGTTCCG TTTGGAAGTT TCCCTACATG ACTGCTCCTA ATGGCGGTGG AGGCTTTTTA L30 

CTAATCTTTC TCATTTCCAC TATTTTAATC GGTTTCCCTC TCCTGCTGGC TGAGTTTGCC 24 0 

CTTGCCCGTA GTGCTGGCGT TTCCGCTATC AAAACCTTTG GAAAACTGGC CAAGAATAAC 3 00 

AAGTACAACT TTATCGGTTG GATTGGCGCC TTTGCCCTCT TTATCCTCTT ATCTrTTTAC 360 

AGTGTTATCG GAGGATGGAT TCTAGTCTAT CTAGGTATTG AGTTTGGGAA ATTCTTCCAA 4 20 

CTTGGTGGAA CGGGTGATTA TGCTCAGTTA TTTACTTCAA TCATTTCAAA TCCAGCCATT 4 80 

CCCCTAGGAG CTCAAGCGGC CTTTATCCTA TTGAATATCT TCATTGTATC ACGTGGGGTT 540 

CAAAAAGGGA TTGAAAGAGC TTCGAAAGTC ATGATGCCCC TGCTCTTTAT CGTCTTTGTT 600 

TTTATCATCG GTCGCTCTCT CACTTTCCCA AATGCCATGG AAGGCGTTCT TTACTTCCTC 660 

AAACCAGACT TTTCAAAACT GACTAGCACT GGTCTCCTCT ATGCTCTGGG ACAATCTTTC 720 

TTTGCCCTCT CACTAGGGGT TACAGTCATG TTGACCTATG CTTCTTACTT AGACAAGAAA 780 

ACCAATCTAG TCCAGTCAGG AATCTCCATC GTAGCCATGA ATATCTCGAT ATCCATCATG 840 

GCAGGTCTAG CCATTTTCCA AGCTCGATCC CCCTTCAATA TCCAGTCTCA AGGGCGACCC 900 

AGCCTGCTCT TTATCCTCTT GCCTCAACTC TTTGACAAGA TGCCTTTTGG AACCATTTTC 960 

TACCTCCTCT TCCTCTTGCT CTTCCTTTTT GCG AC ACT C A CTTTTTCTGT CGTCATGCTG 1020 

GAAATCAATG TAGACAATAT CACCAACCAG GATAACAGCA AACGTGCCAA ATGGAGTGTT L080 

ATTTTAGGAA TTTTGACCTT TGTCTTTGGC ATTCCTTCAC CCCTATCTTA CGGTGTCATG 1140 

GCGGATGTTC ACATTTTTGC TAAGACCTTC TTTGACGCTA TGGACTTCTT GGTTTCCAAT 1200 

CTCCTCATGC CATTTGGAGC TCTCTACCTT TCACTTTTTA CAGGCTATAT CTTTAAAAAG 12 60 

GCTCTTGCAA TGGAGGAACT CCATCTCGAT GAAAGAGCAT GGAAACAAGG ACTGTTCCAA 13 20 

GTCTGGCTCT TCCTTCTTCG TTTCTTCGTT TCGTCATTCC AATCATCATC ATTGTGGTCT 1380 

TCATTGCCCA ATTTATGTAA TCAAAAAGGA CTTGAGTAGT GAACTCAGGC CCTTTCTTTT 1440 

TATGGATGCC TAACAATCAA TTCCAAACCT TGCCCTTCCA CACTCCAACC TTCAACATCA 1500 

CTTGGTAGGA TAAAGTGGCT GCCTTTTTGA AT7GGATAAT 7TT7CCCCTC AACAGTTAGC 1563 
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TGACCTTGAC CAGCCAAGAC ACTCAA7AAG CTGTAGTCAC CTGTCTTTTC AAAGTCAACT 1620 

TTTCCAGTAA TTTCCCACTT GTAAACTGCG AAGAAATCAT TAGATACAAG CAGAGTGGAA 1680 

CGCAAATCAT CTGCTTTAAC AG77ACAGGA CGGCTATTTG CTGGCTCACC AATGTTCAAG 1740 

ACATCGATGG AT77TTCAAG ATGAAGTTCA CGCAAGTTGC CTTTGTCATC CTTGCGGTCA 1800 

AAGTCATAGA CGCCATAGGT GGTATCGCTA GACTGCTCGC TTTCAAGGAT 7AAGATACCC 1860 

GCCCCGATAG CCTGCATACT CCCGCTTGCT ACATAGAAGA AATCTCCAGC CTTAACAGGG 1920 

ACTTTGCTCA ACAAGTCATC CCACTTCTTG TCCTCGATTT GCTGGCGGAG 7TCTTCTTTT 1980 

GACTTGGCAT TGTGACCGTA GATAATCTCT GAACCTTCAT CCCCTGCGAT AATGTACCAG 2040 

CATTCTGTTT TTCCGAGTTC GCCTTCATGC TCGAGTCCAT AAGCATCGTC TGGGTGAACT 2100 

TGGACACTGA GCCAGTCGTT GGCATCGAGG ATCTTGGTCA AAAGTGGAAA TACAGGTTCT 2160 

GGACGATTGC CAAATAATTC ACGGTGTTCC GCATACAAAG 7AGCAAGATC TGTTCCCTCG 2220 

TAACGACCAT TGGCAACTTT AGAGACTCCA TTTGGATGGG CTGAGATGGC CCAATATTCT 2 280 

CCGATTTTTT CACTTGGGAT GTCCTAGCCA AACTCATCAC GTACCTTGGC TCCACCCCAG 2 340 

ATTTTTTCTT GCATAACTCA TTGTAAAAAT AATGGTTCTG ACATCTCGAT CTCCTGTCTG 2 400 

ATTTTTC7CC CCTCATTATA GCAAAAAAAG AGTTCGAATT GAACTCTTTT TTACATCTTA 24 60 

TAAAGCAGGG AGAAGATTTT ATAAAAATAG TAAACAAATG TGCTCTACCC GATCCTTGCA 2520 

CCATTGCTAT AAATGACATC CTTGTACCAA TAGAAGGACT TCTTCTTCCT ACGTTTGAGA 2 580 

GCTCCGTTTC CTACATTATC TCGATCTACA TAGATAAAGC CATAGCGCTT ATTCATTTCC 2 640 

CCTGTCCCAG CTGAAACCGG ATCGATACAG CCCCAAC7CC TATAACCAAG CAACTCAACC 270C 

CCGTCTTGG7 AAATGGCATC TCGCATGGCC TTGATCTCCG CCTCTAAG7A AGTAATCCGA 2 7 50 

7AGTCATCTG CTACATAACC A77CTCA7CC GGTGTATCCA TAGCACCGAG TCCATTTTCT 2 8 20 

ACGATAATAC TAAACTAAAA TCAAAAAGCA TTATATAATA GTGATATGAA ATCAACTAAA 2880 

GAAGAAATCC AAACCATCAA AACACTTTTA AAAGACTCTC G7ACAGC7AA ATATCATAAA 2 940 

CGCCTTCAAA TCGTTCTATA G7AAAATGAA ATAAGAACAG TACAAATCGA 7CAGGACAG7 3000 

CAAA7CGATT TC7AACAATG TTTTAGAAGT AGGGG7GTAC TATTCTAGTT TCAATCTAC7 3060 

A7ATTTCGTC TGATGGGCAA ATCTTATAAA GAGA7TATAG AAC7TTTATA GTAGTTTGAA 3120 

A7AAGA7G7G AACAACTC7A 7CAGGAAAG7 CAAATTAAT7 7ATAGAAATA TTTTAGCAGC 318C 

CAAGGTG7AC 7G77ATAGAT 7CAATACAC7 ATAGACTGTA A7CAAACAAC GATTTCGCGA 3 24 0 

AATG7AAAAA AATATGAGGA GTTCGGACTC GACTC7CTCC TTCAAGAAAC ACGTGGTGGT 3 300 

CGTAACCATG CATA7ATGAC AG7TGAGGAA GAGAAAGCCT T7C77GCCCC CCAT7TGAAG 3 3 60 
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CCTACAGAGG 


CACGAGAATT 


TGTTACAATT 


GATGCCTTAT 


TTCAGGCTTA 


TAAAAAGGAG 


3420 


TTAGGTCGTT 


CCTACACACG 


TGATGCCTTC 


TATCAACTGT 


TGAAGCGCCA 


TGGTTGGCGA 


3480 


AATATTACGC 


CACGTCCAGA 


ACATCCTAAG 


AAAGCAGACG 


CTCAAACCAT 


TGTTGCGTCT 


3540 


AAAAATAAAA 


TCTCAATCCA 


AGAAGGCAAG 


AAAGCGTTTT 


AAATATAGTA 


GACGTTTTCG 


3600 


TAAGGTTTGC 


TTGATGTACC 


AAGCTGAAGC 


TGGTTTCCGT 


AGAATCAGTA 


AACTGGGATC 


3660 


TTGTTGGGCT 


CCAATACCAG 


TAGGTCCACA 


TATCCATAGT" 


CACTATATAC 


GAGAATTTCG " 


3720 


CTATTGTTAT 


GGAGCTGTTG 


ATGCCTATAC 


AGGCGAATCA 


TTTTTCTTAA 


TAGCTGGTAG 


3730 


ATGTAATACT 


GAGTGGATGA 


ACGCCTTTTT 


AGAAGAGCTT 


TCACAAGCTT 


ATCCTTTTAC 


3840 


TCGTTATGGA 


CAATGCTATA 


TGGCATAAAT 


CAAGTACCTT 


AAAGATTCCG 


ACTAATATTG 


3900 


GTTTTGCATT 


TATTCCTCCA 


TACACACCAG 


AGATGAACCC 


CATTCAACAA 


GTGTGGAAAG 


3960 


AGATTCGTAA 


ACGTGGATTT 


AAGAATAAAG 


CCTTTCGAAT 


TTTGGAAGAT 


GTCATGAATC 


4020 


AACTCCAAGA 


TGTCATACAA 


GCATTGGAGA 


AGGAGGTCAT 


AAAGTCCATC 


GTTAATCGGA 


4080 


GATGGACTAG 


AATGCTTTTT 


GAAAGCAGAT 


GAGTATTATA 


TGCAATTTCT 


TTATATAAAA 


4140 


AGACCGGATT 


GCTCCCATCT 


TTCAATAGTT 


CATATTCTCA 


ATTTCTATTT 


TAAAAATAGC 


4200 


TAAGGTTAAC 


GTCAAATGAC 


TACGCGACCT 


ATTTCATACG 


ATAAAAATCA 


ACCACTAGAC 


4260 


CAGCAGGTCC 


TTGAACTAAT 


AAGGACTCTG 


TTCCCCAATC 


GGTTACAGTT 


GGTCCGTGTA 


4320 


AAACCTTTAT 


ACCAAGCTCG 


TTCAACCGTT 


TCTAGTTCTG 


CTCTACATCC 


TCAACCTCGA 


4380 


TATGAATAAT 


GATTCCTGAC 


TGAAAGTTTT 


CCAAAGGAAC 


CAAATCATTT 


TGTGACAACA 


4440 


TAAGGCACTG 


ACTACCAATC 


GTAAACTGAG 


CAAAACCATC 


ATTAGCATAA 


TC7GCCTTTT 


4500 


TATCCAAGAT 


ATCCTCCAAG 


TCAGCACAGA 


CTTGGGGAAC 


ATTTCAAACG 


ATAATATCTA 


4560 


ATTGATTTAA 


ATTCATTTAC 


TCTCCTCCA? 


AAAAAGACCC 


GATTGCTCCG 


ATCTTTTAAA 


4620 


GTTCTGCTCT 


ATGAAAATCA 


AAGAATAAAG 


TCTACAAGTT 


TCATATTTCA 


TTTTCGGCGA 


4680 


GAGGAATTAT 


TTAATTCCGC 


GTGATTGCAA 


TCCTTCTTCT 


TCCAAGAAGA 


GACGGAATGG 


4740 


TACGAGTTCT 


TCTGCTTCGT 


ATTTTTCCTT 


GAAGGCTTTG 


ATAGCTTCTT 


CTGAGTGAAG 


4800 


TT7TCGATCC 


AATTCAAGTA 


CTTCTACTGG 


AAGTGGACGG 


TGT7GAGTGA 


?GC GAG CATC 


4360 


GATGACAACA 


GTTTTACCTT 


CTTTGTTCAA 


TTTAACACCT 


TCTGCAACAA 


CTCCATCGAT 


4920 


GTCTTCGATA 


CGGTCAACTG 


TGAATCCAAC 


AGCTCCTTGA 


GCTTCCGCAA 


TTTTAGCGTA 


4980 


GTCAGCGTTT 


GTGAAGTCTA 


CACCAAACAA 


GTGTTTCTTT 


GTATCTTCGT 


ATTTGTTCTT 


5040 


GATGAACCCG 


TACTCAGCAT 


TTGAGAAGAC 


AAGGTTGATA 


ACTGGAAGGT 


CGTATTGAAC 


5100 
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CTTTGTGATA ACG7C7CCG7 ACCACATGTT GAATGCTCCG TCACCCATGA TGTTCCATAC 5160 

TTGGCGATCT GGATTGTCTT TCTTAGCAGC GATACCACCA GGAAGGGCAA TACCCATTCT 5220 

CGCAAAGAGT GGAGATGTAC GCCACATGTT CTTAGGTGTC ATGTGAAGGT GACGAGTAGA 5280 

TGTTTGAGTA GTGTTACCTA CGTCGATTGA G7AGATAGCG TCTTGATCAG CATCTTTGTT 53 40 

GATTGCATTG TAAACTTGAT ACAATTGCAA TTCACCCTCA GTTTTACCTT CGAGTTTGTT 5400 

CATGTAATCA CGCCACTTTT GGTTCTTCTT AACGTTTGCA CGCCACCATG GAGTTGATTC 54 60 

AACTGGGTTT ACTTTGTCAA GGATAGCTTT AGCTGCTTGA CCAGCATCAC CAAGGATTGA 5520 

AGCGTCAAGG GCATGACGTT TACCAAGTTT GTAAGGCTCG ATATCGACTT GGATGAATTT 5580 

TTCAGTGTTC TTCAATGCT7 CGTAAACTTC AGCAAATGGG AACT7TCAAC ' CAAGGAAAAG 5640 

AACTGTGTCT GCTTCAAAGA CCACTTCGTT GCCTGCTTTC CAACCAACAC GGTAAGCAGA 5700 

ACCTGTCAAA CCTTCATAGT TCCATTCGAA AGCTTCAAAG TTTTTACCAG TTGTGATGAT 5760 

TGGTGCTTTG A7TTTACGTG ACAATTCAGT AAf CACTTCA CCAGCTTTAA CACCACCAAA 5820 

TCCAGCATAG ATAACTGGGC GTTCAGCATT GTTCAAGATT TCAACAGCTT TGTCGATTTC 5880 

AACTTCGTTC AAAGCAGCAG CGATGAATGA GCGTTCGTAT GAACCTGAAC CGTAGTATGA 5940 

GTTTTCATCG ATTTCTTGGA AACCGAAGTT TACTGGAATT TCAACAACAG CTGGACCTTT 6000 

TTTAGAAACT GCAGCACGGC AGGCTTCGTC AATTACTTTT GGCAATTGCT CAGCGTAAGC 6060 

TACACGTTTG TTGTAAACAG CGATACCGTT GTACATTGGG TTTTGGTTAA GCTCTTGGAA 6120 

AGCATCCATG TTCAATTCGT TAACTCGACG TGATCCAAGG ATCGCTAGGA ATGGACTGTT 6180 

A7CCATAGC7 GCATCGTAAA CACCCTTAAT CAAC7CAC7C GCACCTGGAC CACC7CAACC 6240 

AACTGCAACC CCGAT7GACC CGCCGAATTT ACC7TCCATA ACCGCTGCAA GAGCACCTGT 6300 

C7C77CG7GG CGAACTTGTA AGAAACGGAT ATCTTTGTCT TCAGCCAAAC* CC7CCA7CAA 63 60 

TGAGCTGAGT GTTCCTGATG GGA7ACCGTA GATTGTATCT ACCCCCCATG TTTTCAATAC 64 20 

GTTAAGCATT GC7GCAGATG CAGTAATTTT CCC7TCACTC ATAA7GATAA CTCTCCTTCA 64 80 

ATTTTTTTAA ACTTGGAGAA TACGATTACA TAGAATTGGA AACGT7CTCC AAATTTTTAC 6540 

TATTCCACTG TATCATATTT ATGCTGACTT T7CTAAAAAT CTCCTCAAAA CTC7CTATTC 6 600 

TCTATTCTAA 7ACAG7T7TG AAAGTTCTCT CAT7TCTGTT TTATAACAAA GAAATCTAGT 6660 

CATTACTTTT AGTCTATT7T ACTAAAATTT AACAGAAGGG AACTGGTCAG AACAGATACA 6720 

GAACTAAAGG CCATGGCTAG ACCTGCCAAT TCTGGG7TGA GAGCCAG7CC AACACCTGAA 6780 

AAGACTCCTG CTCCAATCGG AATTCCGACA ACA7TGTAGA TAAAACCCCA GAAAACATTG 684 0 

AGTAGAATTC GATGAAAGGT TTTCTTAC7C A7ATCAAAGG CACCAACCAC TCCTAAAAGA 6900 
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TTATTGGTTG 


TCAACAGCAA 


, ATCTGCTGAC 


TCGATGCCGA 


TATCTGTTCC 


AGCTCCCATA 


6960 


GCAATCCCCA 


CATCTGCTAC 


ACTAAGGGCA 


CCAGCCTCAT 


TGATACCGTC 


CCCAACAAAG 


702C 


GCTACTTTCC 


CTCACTCTTC 


CACTTTATGG 


ATTTCATCGC 


CTTTTTCTTC 


TGGCAAGACG 


7080 


CCTGCAATGA 


CCTCTTCAAT 


TCCCATTTGA 


TCTGCAATAG 


CACGCGCCAC 


ACCAGCATTC 


7140 


TCTCCTGTCA 


GCATGACTGT 


TCGGAGACCA 


CCTTTTTTTA 


GCTGACTCAT 


GGCTAGCTTA 


7200 


GCATTTTCCT 


f AGGAATATC 


TTGCAAAGCA 


"AGCAAGCCTT 


T G ATTT C ATT 


CTCAACAGCT 


7260^ 


AAGAACACAA 


CTGTCTTAGC 


TTCTTTTTCT 


AGTTCTTCTA 


GTTTATCTTG 


ATAAGTATTA 


7 3 20 


GAAATATCCA 


TGCCATCCAG 


CATTTTAGCA 


TTTCCAAGTA 


AAACTTGTTT 


TCCATTGATT 


7380 


CGCCCTGAAA 


CACCTTTCCC 


GTGCAAGCAC 


TGAAAATTTT 


CAACAGTTTG 


AAACTCAAGT 


7440 


CCAGCTTCAC 


TCGCTCCCTT 


AACGATAGCC 


TCAGCCAGTG 


GCTGTTGAGA 


AGCATCTTCC 


7500 


AAGGAGGCTG 


CCAACCCAAA 


CACTTCTACT 


TCGTCGCCGA 


TGACATCTGT 


TACCACACGT 


7560 


TTCCCTTCCG 


TCAAAGTCCC 


GGTCTTATCA 


AACACAAGGG 


TTTGAACTTT 


CTGCATTTCC 


7620 


TGTAAGACAG 


TTCCATTTTT 


GAGGAGAACC 


CCCATCTTCC 


CACTACCTCC 


TGTCCCCACC 


7680 


ATAACCGCTC 


TCGGTGTTGC 


AAGTCCCAAG 


GCACAAGGAC 


AGCCGATAAT 


CAAAACCGCC 


7740 


ACTCCGTAGA 


GAAGAGAGGA 


CACAAAGCTA 


GCTCCAAGCA 


CAACCACACT 


ATCCCTGAGC 


7800 


AAGACGAACC 


AAACCCAAAA 


GGTCATGATT 


CCTAAAATGA 


CAACTACTGG 


GACAAAAATC 


7860 


CCTGAAATCT 


TATCCGTCAA 


GTCCTGAATC 


GGCGCACCAC 


TTCTCTGAGC 


TTTCTTCACA 


7920 


AAATCCACAA 


TCTGAGCCAA 


AACAGTCTCT 


GAGCCAACTT 


TTTCTCCTCT 


AAAGACAAGC 


7930 


CTTCCACTAT 


CATTCATCCT 


TGAGCCAATG 


ACAGTATCTC 


CAACTGTCTT 


GTCCACAGGC 


8040 


AGACTCTCAC 


CTGTCACCAT 


GGATTCGTCA 


ATACTAGAGA 


CACCTTCTAC 


TACGACACCA 


8100 


TCAACAGCAA 


TCTTTTCACC 


GGGACGCACT 


CGAATCAGGT 


CGCCTACCTT 


GACTTGTTCC 


8160 


AAAGGAACTT 


GCACATAACT 




a AC. a ctc. 

r\/Vj/M_ i » l_ 1 Vj 


l-Ulj 1.(1 Mbw 


. l oLAAO 1 Ut- 




AGTAATTTCT 


CCACAGCTTG 


GGACGTATTT 


TTTCTCATTT 


TTTCCTCAAA 


AACTGCTCCC 


828G 


AAAAGAACGA 


AAAAGAGGAT 


AAATCCAGCA 


CTTTCGAAGT 


AAACAGGGAG 


ACCAGCAAAG 


3340 


AGACCAACTA 


GGCTATAGAA 


ATAAGCCACT 


AGAGTTCCCA 


GCGCAACCAA 


GGTATCCATG 


8400 


TTGGCATTGT 


GCTTTTTAAA 


ACTGGCCCAA 


GCACTCTGGA 


TATATGGCTT 


ACCTGCAACT 


3460 


AACATAATAG 


GCCTTGTTGC 


TAGAAAGCTT 


CCCCAATGCA 


TGACTTGATG 


ACTAATGCTA 


8520 


CCTGTCAACA 


TCCCAATCAT 


GAGAATCACA 


AGACGCACAG 


TAAACATACT 


ACTAATCCAA 


8530 


AAACGTTGCA 


GGAGAGATAG 


ACATTTTCGA 


GTCTTCTCAA 


CGACTGTATA 


GCTTCCCTTT 


8640 
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TGCATCTTCA 


TGCCACAACA 


AAATTCATGT 


CGCCCTAATT 


CTTCAGGCGT 


AAAACGAATG 


8700 


ACTTTCTCCT 


CATCTACGCC 


GATTGGTTCC 


AAGATACCTT 


CTTCTTCAAA 


CAGAATTTCC 


8760 


TTATAACAGT 


TTGAAGGAGT 


AGCACGATGA 


AAGGTAATCT 


CAGCTGGAAT 


TCCCTTTTGA 


8820 


AGCTGCATAT 


GGGCTGGATG 


ATAGCCTTTT 


TCAGCTCGGA 


TACGGATTTT 


TTGAATGCCA 


8880 


TTTTCTAAGC 


TTGCTTTCAC 


AATTTCTGTC 


ATAGTCTCCA 


CCTACTCTAC 


AATCATCTTG 


8940 


CCGTGCATCA 


TGTTCATACC 


ACAAGCAAAG 


CCAAACTCTC 


CAGCCTGTTC 


AGGCGTGATT 


9000 


TCCACTACAT 


ACTCTTCCCC 


CATTGGCAGG 


TTCCCATGTA 


CACCAAAATC 


TGGAAAAACA 


9060 


ATTTGATCCA 


GACATGGTGA 


AGGATCCTTG 


CGGTCAAAGA 


CAATGCGTGC 


TGGCACTGAT 


9120 


TTCTTGAGGA 


CAATCAACTC 


AGGAGTATAG 


CCTCCCATCA 


CTTCCACTCG 


AATCTCTTCG 


9180 


TATCCGTTTT 


TTTGCTGGGC 


TTTTTGTCCA 


GATTTTTCAG 


GCTTTTTGAA 


AAACCAAAAC 


924C 


AAGATAAACG 


CGATAAGGGC 


AATACAAATA 


ATGGTTACAA 


TACTATTTAA 


CATGACGTCT 


9300 


CCTTTACATA 


CAATTACATC 


TTACTTCTCT 


TACAGCACTT 


GATTTCTTCT 


CTGAAATCAC 


9360 


AGCTTCCAAG 


TCTTCCAAGT 


CAGTCTGACT 


AAATTCACAT 


TCTACAATCA 


ACTCAGCCAA 


9420 


CAAATTCCTA 


ATCCTACGGC 


AACAAACCTT 


GTCTTTGATA 


TCTTGGACAA 


GTAAATCCCG 


9480 


ACTTTGGTCT 


AGAGTTAAAA 


CGGCTGAATA 


AACAAAGGAC 


TTGCCTTCTT 


TTTTCCGAGT 


9540 


CAAACACTCT 


TTATCAACCA 


CACGAGCCAA 


AAGTGTCTGA 


ACCGTGGACT 


TGGACCAGTC 


9600 


AAACCCCTCT 


GCCAAAACCC 


TAATCAAATC 


TCTACTGCTC 


TGCTCCCCCT 


GCATCCAAAT 


9660 


AATCTTCATG 


ACCTGCCATT 


CTCCATCTGA 


AATCTGCATT 


ACCATACCTC 


CAAAATCTAC 


9720 


ATTTGTCAAT 


TACACTCATC 


AG7ATACTCT 


TAAAATCTAC 


ATTTGTCAAT 


TATAGAAATA 


9*80 


ATATTTTCTT 


CGAAAAATAG 


AATTTTAATC 


ATTTGAAAAA 


CGATTTGCAG 


TCAAATATTA 


9840 


CTATATAAAC 


AATAAAAATA 


TCCTATACTA 


AAGAAAAAAG 


AAAACAACCA 


CTAGGGGTGC 


9900 


GTAAAGCTCA 


GATTAACGAC 


TGTTAGATCC 


CTCTGACTCA 


ATCTAGGTAA 


TGCTAGCTGA 


9960 


TGGAAGTGGA 


AATGATAATG 


GGGACTAGCA 


GTCTTCTATT 


GCCTTTCTAA 


AACAGACTAG 


10020 


CTTGTTCTTA 


AGAATACAAA 


CTTCAGTTGG 


TTGGGACCTT 


TTAGATGACT 


TATTTACCCG 


10080 


TTGCTTTGAC 


CATTGCAGGG 


ACTGACCCTA 


GTGGTGGTGC 


TGGCATTATG 


GCAGATTTAA 


10140 


AGTCATTCCA 


AGCGAGAGAT 


GTCTATGGAA 


TGGCTGTTGT 


AACCAGTCTT 


CTCCCTCAAA 


10200 


ATACCAGAGG 


TGTTCAGCTA 


ATCGAGCACG 


TTTCTCCTCA 


AATGTTGAAA GCCCAATTGG 


10260 


AGAGTGTCTT 


TTCTGATATT 


CCACCTCAGG 


CTGTAAAAAC 


TGGAATCTTC 


GCTACTACTG 


L0320 


AAATCATGGA 


AATCATCCAA 


CCCTATCTTA 


AAAAACTGGA 


TTGTCCCTAT 


GTCCTTGATC 


10380 


CTGTTATGGT 


TGCTACAAGT 


GGAGATGCCT 


TGATTGACTC 


AAATGCTAGA 


GACTATCTCA 


1044C 
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AAACAAACTT ACTACCTCTA GCAACTATTA TTACGCCAAA TCTTCCTGAA GCAGAAGAGA 10500 
TTGTTGGTTT TTCAATCCAT GACCCCGAAG ACATGCAGCG TGCTCGTCCC CTGATTTTAA 10560 
AAGAATTTGG TCCTCAGTCT CTGGTTATCA AAGGCCGACA TCTCAAAGGT GGTGCTAAAG 10620 
ATTTCCTCTT TACCAAGAAT GAACAATTTG TCTGGGAAAG CCCACGAATT CAAACCTGTC 10680 
ACACCCATGG TACTGGATGT ACCTTTCCTG CAGTCATTAC TGCTGAACTA GCCAAGGGCA 10740 
ACAGTCTTTA" CCAGGCAGTT GATAAGGCC A~ AGGCCTTT AT CACAAAAGCT ATTCAAGAT G 10800 
CCCCTCAACT CGGTCATGGT TCTGGTCCAG TCAACCATAC AACTTTTAAA CATTAACAAA 10860 
AAAAACTCTC TAGTTCCCAC TTTAAGGGAA TTAGACAGTT TTTATACTCT TCGAAAATCT 10920 
CTTCAAACTA CCTCACCTTC CATCTGCAGC CTCAAAACAC TGTTTTGAGC TGACTTCGTC 10980 
AGTCTTATCT AAAACCTCAA GGCAGTACTT TGAGCAACCT GCGACTAGCT TTC7AGTTTA U040 
CTCTTTGATT TTCATTGAGT ATTAATTAGG AAAGAATGTT ATGCAACTTT TTTAAAAAGG U100 
CTTGCCTTTT TGCCTCAATA TCTTCTGCTT GCATCAAATC ACGTACAACA CCTACACCAC 11160 

CTATGCCAGT GCCCATAAGC TGATCAATAT TCTCCGAAGT CAAGCCTCCA ATAGCAACTA U2 20 

CTGGAATGCC AACCGTTTGG CAAATTCTTT TCAAGGTCGA TATCAGAGTA ATGGGCGCAT 11280 

TTTCCTTGGT CGTCGTTGGG AAAATGGCTC CTGTACCCAA GTAATCTGCA CCTGATTTCT 11340 

CCGCTTCCAG AGCTCTTTTA ACCGTTTTAG CGGTGACACC GAGGATTTTT TCAGGACCCA 11400 

ACACTTTCCC AGCTACCCAA ACTGGTAATT CATCATCTCC GATATGCAGA CCTCCTCCAT 11460 

CAACCGCAAC ACAAACATCC AACCGATCAT CCATTATCAA GGGTACCTCA TAACCATCTG U520 

TTA7TTCCTT GACTTCTTTT GCCAGTTCAT AATATTGATT CGTTG7CAGA 7TTTTTTCTC 11580 

GCAATTGGAC TATGGTAACC CC7CAACCCC AGGCCGTCTC AACTTTTGCA AGAAAGCTTT 11540 

CCACGGAATC TTGATAGCGA TTGGTTACCA GATATAGTCT AAGTGCTTCT C7ATTCATAA 11700 

ACCTCTCCTT TGATGGTATC TAGCCAATTT TCATCTCTTC TTAGGAGCGA AAGCTGATTG 11760 

AGTACTTGGT AACGAAATTC TTCCAATCCC ATTCCT7GAA CAACTATTTT CTCAGCAGCG 11820 

ATATTGAGAT AAGAGACTGC TAAGCAAGAA GCTTCAAAAC CAGTCTTTCC TTGGCTGAGA 11880 

AAAACAGCTG TTAAGGCTCC AACCAAGTCT CCTGTCCCTG TTATCCAGTC TAATTCAGTA 11940 

CAGCCATTTC CCAGTACAGC CACCTGATTT TTCGAAACGA CGAGGTCCTT GGGACCTGTG 12000 

ACTAAGAAAG ACATACCAGG ATAGGTCTGA CACCAGTCTT TCAAGACTTG AAGCAAATCC 12060 

TCCCTTTCTT GATCTTTACC ACTCGCATCG ACCCCAACGC CCTGGTGCTT TAATCCAACA 12120 

AGACTTCGAA TTT CTG AC AT GTTTCCTTTA AGGACCGTAG GTCTATAGTC TAAAAGGTCT 12180 
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TTAACTAAGC TCTTACGAAT CGATGAAGTC GTTACGCCAA CCCCATCTAC TACCATCCGG 12240 

ACAGAAGATT GCTTTCCATA CGAAGCTGCC ATGCGGATTG CTTTTTCCTT CTCAGCTGAC L2 300 

AAATGCCCCA AATTGATCAA GAGAGCCTGA CTTTGCTTAG TAAAATCAAG AACTTCACGG 12 360 

GAATCATCTG CCATGACAGG TTTGCATCCC AGAGCCAAAA TCCCATTTGC CAGCATCTCA 12 420 

CAAGAAATCT CATTGGTAAT GCAGTGAATG AGGGAACTAG AGCCTATAGG AAAGGGATTT 12480 

GTAAATTCCT GCATCAGTCT ATCCTTTCAC TAAAGAAATA TCCCTGCAC? TTTTTAAAGA 12 540 

ATTCCTCCTT CATTAAAAAT CGAAAGGCAA TAAAGGAAAT CCCTCTACCA ATCAAGGTTG 12 600 

CTCCGAAAAA TCGAGGCGTG TAGATAAACC AGCTAAGCTT ACCAGCTCAT CCTGTAAAGA 12660 

GT AC C AT AAC AGGATAGGAA ACAATGGAAC CAATAATACC TGTTCCCAAA ATCTCTCCTA 12720 

GAGCAGAATA GTGAAATTTT CGACCGTACT TATAAAACAG ACCTGCTAGA AGGGCTCCAA 12780 

AAGTCGCTCC TCTGAGAGCT AAAGGCGGAA TCCCTTGAGT CGTCATACGG ATAAAGGCTG 12840 

TGACTCTAGC CATAGCCAAG GCATAAACAG GTCCCATCAT GATTCCTCCT AGAATATTGA 12900 

CTACACTCGA CATCGGTGCC ATTCCCTCAA TTCGAAAGAT AGGTGTAACG ACTACATCAA 12960 

GGGCAATCAT CATAGATAAA ATGGTTAATT TCTGAACTTC TAATTCCTCC TTTCTCATGC 13020 

TTCTATTCTT CTCCTTTTTC TAAAGACTGT AAATCGCTCT TCCATGTCTG GTGTTGGTAG 13080 

GCCATTTCCC AAAACTTGGC TTCCATATGA ACACTGATCT GGAACCCATC TAGCATTTTT 13140 

TGCTTGTCTG TCTCGTCACT TTCTCGATAG AGCTGATTGA CCAGTGCTCC CTCCTCTCTG 13 200 

ATCTCTTCCT CTAACTCATC CGTAATATAA GTTTCAATCC ATTGTTGATA GACACCATTT 13260 

GCTGATCCTT TAAGA7TAAG 7CAT7TCCCT ATATCA7CGT ATAACCAAGG ACAAGGAAGC 13 320 

AAGCTTGCAA AAGCCATCGC TAAGTTCGGT 7CTGCAAATT GCCTATAAA7 ATGAGAAATG 13 3 80 

TAATGATAAC AGGTTGGAGC GATTGGATGT TCCTCCATTT CCTGGTCGCT GATTTCCAAT 13 4 40 

TCCTTGAAAA ATTGTTGGCG AATAAATAAC TCACCCTCCA CTAAACCCTG AGCATTTTGT 13500 

TTCAAGAGTC TTTTCATCTC TTGGTTTGAA GTCTTATCAG CCAAAAGATG ATAGATTTCT 13560 

GAGAAACCCT TCAGATAGTA GGCATCCTGA ATCACGTAAT AGCGGAAAAT GCCACGTTCT 13 620 

AAATTCCCCT CTTGTAATTG TAAAATAAAG GGATGATGAA AGGAAGCCTG CCAAGCTTTC 13 630 

TTGGATAATT CCATCGCAAT ATCTGTAAAT TCCATAATAA CTCCTTTATA AAAATAGACT 13740 

GGTTTGAAGC AATAAAAAGA AAAGCAGGTA GATTAATTTT GTTTTTTTAG GAATATAAAA 13 800 

AGTCCGATAG CTATTCTTCA ACT GTG CATC TTCGTCATAT CCGTGAGCAG ATAGAGCTCT 13860 

CAGGTAAAGA TGGCGCCACC TAAAGACTGT CATCAGAACC TTACTGTAAA TCAAGGGCGA 1392C 

CCAAAAATGT AGTTCTTGAC CACGTAATAG GCAAGCTTCT TTGAGGGACT TGATTTCTTC 1398C 
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CTGAATGAGA GGAAAACAAT TGAATACCAC AATCAAGGCA TACCACCAAG AGCCTGATAG 14040 

CCCCTTTTGA GCCAAGTACA AGAGAAGCTC TTTTAGTCAA AC AC AG G AAA CAAAGACAAG 14100 

GCCGATACAA ACTGTCACAA AGCCCCTCGT TCCAAGCATG ACTGCCTGTG AACCATCTCC 14160 

GTCTAACTCA ACTGCCCAGT AGTTGGCAAA AGATGGTAAA ATGCCAACTA TCATCATCCA 142 20 

AGCTAACATT TTAAATCGAC GGTAATAGAG CATAAAGAGA ATACAAAATG CCACTACCCA 142 80 

AAGAGTCAGA GCAATCGAAG GAATGAAAGA TGTTTCCAAG GATAAAATCA GCAAGAAGAG 14 340 

ACTGATAATC GGTGTCTGGG TTGCTACTTT GACCATACTA TCTCACCTCC CCTTGGGTAT 14400 

TGCTACTCTG AGATG7AACT GGTTTGGTAA TGGTCACTTC 7TTCACA7CC CCAAGACCC7 144 60 

GACTAGTCAT CTCAATCCAA TAATCAACCA CAGAAATCAA AGGGTCTAAA CGATGACTAA 14520 

TGACCAGAAA ACTTCTTCCT TGATTCCTCT CCTCCACAAT CCACTTGCAA AAATAATGGC 14580 

AGGCTCTATC ATCCAAACCT GCAAAAGGTT CATCTAGCAA GATCACGGAA GCCTTACTGG 14 640 

TCAAGA7CC7 CAGGAGCTGA AGAATTTTTT CCTCACCACC ACTTAAT7CA TAGGCACTCT 14 700 

TATCGACTGC CTGCTCCAAA TCAAAATATC CTAAAGCTTC AAAAATCCGC TGATTTCTTT 14760 

CAGAATCAGG TCCATCTAAT TGAAGCTCCT CTCGCAGACT GACTCCGATA AACTCCTTCT 14 820 

CAGCTTCCTG AACAACACCA GTCAGATCAC GATACAAACT CTTTTTCTTT TTCAGGACCG 14880 

AACCCTTCCA AGTAATGCTC CCCTTATACT TTTGAAATTG AAGAATAGAC CGAAAGAGGG 1494 0 

TTGATTTCCC GACACCATTG TCACCCAGGA TACAGGAAAT CCCTTGATAG AATGTGAAAT 15000 

CAGCAATTGA AAAGAGGGGG CCATTACCAA CCTCACCAGT CACACCGT7C ATATCCAA7A 15J60 

GTTCCGCCC7 AGAAGCAACT TCCTTTGAAG CAACCTGTGT CATCTCATAG GAAGGCATTT 15120 

GAAACACTTC CCTTAGTTTT CCGTCTCTTA CCTCCACCAT ATCGTCGATA TAGGCTTTAT 15180 

AGTCAGATAA ATCATCCTCG CACAAAATAA CTGTCTTCCC ATCATAGACC AACTCTTTTA 15240 

GAATCTCCAA 7ATCTCGATT CTGCTCTTGC GGTCAATGGA ACCGAAGGGC 7CATCCAAGA 15300 

GATAGACCCT AGGATTCATG GCAAAGAGGA CAGCCAGCGC 7CCTTTTTGC TTTTCCCCAC 153 60 

CTGATAAGTG ATGGATGAGA CGGTGCAAGA TG7CCTTGCA ACGACATTGC TGGACAACC7 15420 

CTGCTATTTT AGAATCAATT TCCTGAAGGT GATAGCCGAT ATTTTCCATG GTAAAAACCA 15 480 

ACTCCTCAAA CAAGCTCTCC ATGGTAAATT GATGATTAGG ATTT7GCAAG AGAATACCAA 15540 

CCGTCTGGAC ACGTTCGACG ATAGAAAGCT GACTGACCTC GCTCCCATCT ATCAGGACTT 15600 

GACCGCTA7A GGGAAGAGAA C7AAC7TGGG CAA7CA7TTG AAAGAGGCTG GA7TT7CCAG 15660 

AC CC AC 7 ACT CCCAACTAAC AAGGTAAAGG CTTGCGCATG AAAAGTAAAA TCAAACGCC7 1572C 
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CAGAGAAGAT TGGGGACTGA ATCGCTCGTA GTTCCAGACC CATCTATGCT TTTCCTCCAG 15780 

TTGCAAACTG ATGATAGAGT TTGACAATGG CACGAACCAA GATGGTACAG AAGAAATAAA 15840 

CAGAAATAAA ACGTACCACA AGCAAGGAAA GGACAAACGG AAGGGAAAAG GCGTAGTAAC 15900 

CTAACTTAAT GT ATT CAT AG ACAAAGCTAA CAAGCGTAAT CCCAATACTA TTAGCAGTTA 15960 

GAGAGAGCCA ACTTTCATAG CGATTCTTAG TTACGATAAA ACCAAATTCA CTTCCCAAAC 16020 

CTTG AAC AAA GCCACACAAA AGAGCTCCTA GACCAAATTG GCTACCATAA AGGACTTCAG 16080 

CAAGCGCAGC TAGCACTTCT CCAATCGTTG CACTTCCGAC TCTCGGAACA AAGATGGCAG 15140 

CAATGGGCGC AGCCATACAC CAGAGACCGA AGAGGATTTC ATTGGCAAAC GCC7GCAAAC 16200 

CAAGAGGTGT TAAGAGTAGA CTGAGAATAT TATACACATA TCCTGAACCA ACGAAAACCC 162 60 

CACCAAAAAA GATAGACAAG AAAGCAAGCA AG AT AA CATC TTTTAACTGC CATTTTTTCA 16320 

ACATAAAAAA CTCCTTTTTT TAAAGAAAAG TGAGGCACTC AAGAAGACCG ACCTAAATAC 16380 

TTTGTATACC AGACTGAATT TAGAACAGTA CACAAGAACA CTAAAATATT TCTAGAAATT 164 40 

AATTTGAATT TTCTAATTGA TTTGTTCGCA TCTTATTTCA ATCTACTATA TCATCTTCAT 16500 

CCAGTTTCGT AAAAGAAAAA ACTCTAATTA CAGATACAAA TTAGAGTTCA GCTTACAAGA 16560 

TTAGACAGTT CTTTTCGACA TACGAAAAAA ACATTTCACA TTTCCCTTCG CCAGTCTTAA 16620 

CTGTATCAGG TTCAATGGGT ATCATCTCAG CCTAAAGCAC CCCAAATGTC TTTATTATTT 16680 

AATTATGTGA TT ATT AT AAC ACACATTTTA TACTAGTTCA ACAAATTCAA CTGGAAATAC 16740 

AGCCTTGCAC TCACAAAGAC AGCAGATCTT TCTTTTGCAA AAAACAAATC ACC7CTTTGA 15800 

TGAATTAGCC ATTCAAGCTG AATCTGGACA TAGCTTTTTA AAAAAGGAAA ATCCTACTTA 168 60 

CTTAGAATCC AAGGATAGAT ATCTATTGTT CACTCATTTC CCGAACAGTT TTTTCTATAT 16920 

TTTTTGCATA CGATATTGCC GAAATGATTG AAACGCCATC CATATTGGTC TTTATAATGT 16980 

CTTTAATATG TTTCGTCTGT ATCCCACCAA TTGCAACTAA AGGCATTTGT GGCAATAGTT 17040 

TTCTCATCAA TTCAAGACCT TCATAACCTA TAGTACCACC AGCATCATCC TTTGACTGCC 17100 
TACCAAATAC AGGCCCAACA CCTACATAAT CTACATATTC AACTTTTGAT TCTTCAAATT 17160 
CTTCTTCGTT TCTTATACAA AGACCAATTA TTTTATCTCC CATCAATTTT CTAATTTCAT 17 220 
CAACACCAAT ATCATCTTGA CCTACATGTA CGCCATCGGC GTCAATTTCC ATTGCTAAAT 17280 
CTATATCGTC ATTAACGATA AATGGAACAT TGTATTTTTT ACAAAGTTCT TTAATTTGGA 17 340 
TAGCTAGCTC AACTTTTTCT AAGCCTTCTA AAGCACCCTC ACCTTTTTCT CCAAATTGAA 174C3 
ATAAGGTTAT ACCACCTTTT AAGGCTTCCT CAACGACTGT ATATAGATTT TTTCCTTGGC 174 60 
AAGTAGTCGT TCCACAAATA AAATATAGTT TTAGTAATTC TTTATGAAAC ATCTTACTTC 17520 
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ACTCTTTTCA ATTCCTTTAC ATC7TCATCT GTAATCTCGT ATAAGGCATT TATAAATTCA 17 530 

ACTTTAAATG TCCCAGGAAG A7GTCCATTT GGACGTTTTT CTGCTATTTC TCCAGCCATA 17640 

TTG7AAACCA ACACTGCTGT TTTTAATGAT TTCAATTCTT GACCTTTTTC TACTCCCATA 177 00 

AACCTTGCTA CTACAGCTCC TAATAAGCAT CCTGTCCCAA TGACTTTCGG CATCATAGCA 17760 

CTACCATTAT GAATCATTAC CACTTCTCCA TTAACAGCAA TGGCATCCAC 7TCACCTCTT 17820 

ACTACTATTG GAATATTGAA CTTCTCATTT GCTGCTAGAG CAATTTCGTC AATATTATCT 17880 

ACGCCCGCAC TATCTACTCC TTTAGATGCC ACATCTATTC CTACTAAAGA GGCAATCTCG 17940 

CCAGCATTTC CTCTAATCGC TGCTAGTTTA TAATTCTTCA TT AG AT CATC TGCTACTTTT 13000 

TTTCTATATT CTCCTGCTCC ACAGGCTACA GGATCTAAAA CTGCTCGGAC ATTATATTTC 18060 

TCTGCAATTT TCAGAGCAGC TTGGTATAAT TTCCAATTTT CATCTGTCAA TGTTCCTATG 13120 

TTTATTAATA AACCACCAGC ATACTTTAAC AAATCCTCTA AATCTGCTGG AAACTCACTC 18180 

ATCCCTCCTG AGGCGCCCAG TGCTACTAAT CCATTTGCTG TGAAATTTTT TACTACATCA 18240 

TTCGTTATAC AAATGACCAA TCCTCCTTTT TCTTTTAATA ATTTTAAACT TGTCATATTG 13 300 

AAATCCTTCC 7TTTCACTTT ATACGATCTA CTAATTTCGA TTTATCTTTA GTTGAGAATT 18360 

TTTTTCATTT ACATTGAATG ATTTATACTC AATGAAAATC AAAGAGCAAA CTAGGAGGCT 18420 

AACCGCAGGT TGCTCAAAAC ACTCTTTTCA GGTTGTCGAT AGAACTGACG TGCTTTGAAG 18480 

AGAT7TTCGA AGAGTCTTAC CTCATCAAAT TTGTAAATAT CATCAGCCTT CTCTAGACAT 18540 

CGTAACCAAT ATCAAAAAAA GCTAATTCTA AAGCCACTCC TTCATTCCAG CGTTGCTGAA 1S600 

GTTCTG7CAA ATCTTCTCGA TTTTTACCCA CACGATTGAG 7TCG7CAACC ACAAATTCAA 13660 

CCCACTCTGC AAAGAAAGGA CCTCTGTGGA GATTGATCCA TTCCGAATGA ATATAGACTT 187 20 

CAGGTAAAGC CAAATCTTTA GAACCCCAGT CTAAATAGAG ACCTTCTGCA ATGACCAGCA 187 80 

TGACCAAAAG ATGGGCATAG TCTGATGAAG CCACCGCCGA ATACATTACA TCCTGAAAGC 1 83-10 

CTTTTGTTAC AGGGTGCAAA GTCACTTCTA GATAGTCATT CTCTGCTACT TTTAACTCTT 18900 

TAAAAGCCTT TTGGAAATAA CCATCTTCAT CTGCTTCAAG AAAGCCTAGT TGCTTGGCAA 18960 

AACGAAGCTT GGATTCAAGT TTATCTGCGT GACTACGCAG GCACCCAGCA TCGATAAGAA 19020 

CCCATCAAAG AAGTGATAAT CTTGAATCAG ATAGTCCTTT AAGACCTTAT TCTCAATTGT 19080 

CCCCGCAAAA AGTTCCTTAA CAAAACGATG ATTGATTCCA GCCTCCCAAT CCTTCTGACT 19140 

CCTTTT7AA7 AATTCTCCAA CAGTCAAACC TCCCTCAAA7 GCATAGTCTT GTGTTTCCAT 192G0 

A77TACTTCT CCTCTCTTTA CTTGTTAGTA ATTAATAAAA CACCAAGAAA TA7CAAGCAA 192 60 
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AATCGTAATT CCACTTGATC CTTTTAAAGC ACATCGAGAG CAW 1 " - TAG A GAGCTAACTA 19 3 20 

AACAAGCCTA TCCAGTTTAT ATAAACAAAA AACTCCAATT ACAATCAAGA ATTAGACTTG 19 3 80 

ACTTACAAGA TTAGACCCTT CATTTCACCA TACGAAAAAA CTGTTCACAT TTCCCTTCGC 19440 

CAGTCTTAAC TGTATCAGGT TCAATGGGTA TTATCTCAGC CTAAAGCACC CCAAATGTCT 19500 

CTA7TATTTA ACTACTGAAC CAGTATAGCA AAAAATGAAA GCCCTAGCAA GATATTTGAC 19560 

CGAAAAATAT CTTT AT AT AT AATATATTGA AACTAGAATA GTACACCTCT ACTTATAAAA 19620 

CATTGTTAGA AATCGATTTG ACTGTCCTGA TTGATTTGTC CTATTCTTAT TTCATTTTAC 19680 

TATAGTTTTC GATAGCAATT TATTCTTCCA ATACACGAAG AAAAACCTCC ACATTCAGTG 19740 

GAGGCAATCT GTTTTATCAA TACAATTTTA AGTCACGAGG GTCAACTGGG AAGGTTGGGT 19800 

TGTATGGATT GTGACGCAGC TTGAAGTGTT TGACATCTTC AATGGTCTGA GTTCCAGACA 19860 

ATTGCATAAC TGTCTTCAAT TCCGCATTCA AGTGTTCAAA GACTTGACGC ACACCGACAC 19920 

TACCACCGAG AGCCAAGCCA TAGATGACAG GGCGTCCAAT AGCAACCAAG TCTGCTCCTG 19980 

ATGCCAAGGC TTTAAAGACG TGTTGACCAC GACGAACACC AGAGTCAAAG ACAATCGGCA 20040 

CACGTCTATC AACTGCTTCT GCCACTTCTT GAAGCGAGTC AAAGGCAGCT GGTCCACCGT 20100 

CGATTTGACG ACCACCGTGG TTGGTTACCC AGATACCAGA AGCTCCTGCA GCAACCGAAC 20160 

GTTCAACGTC CTCACGGCAT TGTGGTCCCT TGACATACAC AGGAAGACCA GACTATTCAG 20220 

CGATAAATTC TACATCGCGT GGAGACAAGC GTTCTTTACC TGATTTGTAA ACAAAGTCCA 20280 

TTGATTTACC AGCACCTTCT GGCAGGTATT CTTCAACAAT CGGCATGCCA ACTGGGAAGA 2 0 340 

CAAAACCATT ACGCTTATCC ACTTCACGAT TCCCCCCTAC AGTACCATCT GCCGTCAAGA 20400 

CAATCGCTTT ATAACCTTCA GCCTTCACAC GGTCCATGAT GTGGCGGT7G ATACCGTCAT 20460 

CCTTACTAAA GTAAAATTGA AACCAATGAG GTGTCCCTTG GAGGGCTTCA GAAATCTCTG 20520 

GAAGGTCAAC AGTAGAGTAA GAACTGGTTG TATAAAGAGA ACCAAACTCA TGCACACCAC 20580 

GCGCAGTCGC CACTTCCCCC TGTTCATTTG CCAATTTATG AGCCGCAACA GGTCCCATAA 20640 

TGATTGGAGA AGATAGTTTT TCACCTGCAA ATTCAATCTC TGTACTTGGA TTTTCTACAT 20700 

TGCAAAGTGT ATGAGGAACG ATGAGCTTGT GGTTAAAGGC ACGGATATTC TCTCTTAAAG 20760 

TGAAAGTATC TTCCGCCCCA CTAGCGATAT ACCCAAATCC TGCTTTAGGA ATAACTTGTT 20820 

GCGCCATTGG CTCCAAATCA TAGGTATTGA TGAArTCTAC ATGACCTTCT GCATTGCTTG 20880 

TTTTGTATGA CATAAAATGT CCTCCTTAAT AAGTAAGCCT TTACTTTGTG 7ATTACAAAA 20940 

ATATCTTAAC TCTTTTTCAA AACTTTTAAA ATATTTTGTT TGGAAATTTC AGAAATTTTA 21000 

TGTCTATGAT AAAAATCCTT ATAACGGCAA TAAAAAATAG ATATTATCCA AAGAAGATTT 21060 
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TAAGTGCTAC 
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V_ V_ » Li AVjMA 1 l_ 


TTArACHftAT 


TGGGCGTTAC 


GATTGGGATA 


TCCATCCTGT 


21840 


CTGTCCTCTT 


A TCP ATP! A TP! 


T^CACAACAC 


TCATGGGAAT 


CATC ATG AC C 


TCCCATTCTA 


21900 


IjA/i I l»A 1 AVwVj 


nl 1 1 I 1 AAl— A 


CC ATTCT AT C 


TGGAATTTAT 


CCGTATCATG 


CCCCAGCTGG 


21960 


TGCT ACT CTT 


CAT CGTTTAC 


TTTfiGCTTGG 

ill VAJ*— 1 4 SJVJ 


CTCGAAACTT 


TAATATCAAT 


ATCTCAGGTG 


22020 




LAI Inl V»l*i 1 1 


TTT AC C CTCT 


GGGGAACAGC 


TGAAATGGGA 


GACTTGGTAC 


22080 




U At. 1 ILiV .L 




AGTTTGAAAG 


TGGACAGGCA 


CTCGGCTTGA 


22140 


V_ . AA lull V. A 


nL 111 AV. i A*w 


C AC A"*C A* 1 " C A 


TCCCACAAGT 


CTTAAGAAGA 


CTGCTACCGC 


222C0 


nwOv. 4 A 1 l_AA 


ILi l\» I V. AV» 1 


CGGATGATTA 


AAACCACTTC 


ATTAGTTGTT 


TTGATTGGGG 


22260 


TTf Tflf A A ("IT 




GGACAACAAA 


TCATCGATAG 


CAATCGCCTG 


ACCATCCCAA 


22320 


CTGCTTCATT 


TTGGATTTAT 


GGAACCATTC 


TAATCTTATA 


TTTCGCAGTT 


TGCTAQ.V.W TA 


& £. J 0 VJ 


TTTCCAAACT 


ATCCACTCAC 


TTAGAAAAAC 


ATTGGAGAAA 


CTAAATGTCT 


GAAACTATCT 


22440 


TAGAAATCAA 


GGAACTAAAA 


AAATC CTTCG 


GAGACAATCC 


CATCCTCCAA 


GGACTTTCTC 


22500 


TAGAAATCAA 


AAAAGGGGAA 


GTTGTTGTCA 


TCCTAGGGCC 


ATCTGGTTGT 


GGGAAAAGTA 


22560 


CCCTCCTTCG 


TTGCCTCAAC 


GG CTT AG AAA 


GTATTCAACG 


TGGAGATATT 


CTTCTGGATG 


22620 


GTCAGTCTAT 


CGTTGAAAAT 


AAAAAAGATT 


TTCACCTAGT 


TCGCCAAAAG 


ATTGGCATGG 


22680 


TCTTTCAAAG 


TTATGAACTC 


TTTCCCCATC 


TGGATGTCTT 


ACAAAACCTC 


ATCCTAGGCC 


22740 


CTATCAAAGC 


TCAAGGAAGG 


GACAAGAAAG 


AAGTAACGGA 


AGAACCTTTC 


CAATTACTAG 


22800 
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AGCGTGTCGC 


T7TGCTGGAT 


AAACAACATA 


GCTTTGCCCG 


TCAATTATCT 


GGTGGACAGA 


22960 


AGCAACGTCT 


TGCAATTCTC 


CGTCCCCTCC 


TAATGCATCC 


AGAAATCATC 


CTTTTTGACG 


22920 


AGGTGACTGC 


TTCCCTGGAT 


CCAGAAATGG 


TGCGTGAGGT 


GCTGGAACTT 


ATCAATGATT 


22980 


TGGCCCAAGA 


AGGCCGTACC 


ATGATTTTAG 


TAACCCACGA 


AATGCAGTTT 


GCCCAAGCCA 


23040 


TTACTGACCG 


GATTATCTTC 


CTCGACCAAG 


GGAAAATCGC 


TGAAGAAGGA 


ACAGCTCAAG 


23100 


CCTTCTTTAC 


CAATCCGCAA 


ACCAAACGAG 


CCCAGGAATT 


TTTAAACGTC 


TTTGACTTTA 


23160 


GCCAATTCGG 


CTCATATCTA 


TAAAGGAGAT 


TC7TATGAAA 


CTATTCAAAC 


CACTCTTAAC 


23220 


TGTTTTAGCA 


CTTGCCTTTG 


CCCTTATCTT 


TATCACTGCT 


TGTAGCTCAG 


GTGGAAACGC 


23290 


TGGTTCATCC 


TCTGGAAAAA 


CAACTGCCAA 


AGCTCGCACT 


ATCGATGAAA 


TCAAAAAAAG 


23340 


CGGTGAACTG 


CGAATCGCCG 


TGTTTGGAGA 


TAAAAAACCG 


TTTGGCTACG 


TTGACAATGA 


23400 


TGGTTCTTAC 


CAACCCTACG 


CTACGATATT 


GAACTAGGGA 


ACCAACTAGC 


TCAAGACCTT 


23460 


GCTGTCAACC 


TTAAATACAT 


TTCAGTCCAT 


GCTGCCAACC 


GTGCGGAATA 


CTTGATTTCA 


23520 


AACAAGGTAG 


AT ATT ACT CT 


TGCTAACTTT 


ACAGTAACTG 


ACGAACGTAA 


GAAACAAGTT 


23580 


GATTTTGCCC 


TTCCATATAT 


GAAAGTTTCT 


CTGGGTGTCG 


TATCACCTAA 


GACTGGTCTC 


23640 


ATTACAGACG 


TCAAACAACT 


TGAAGGTAAA 


ACCTTAATTG 


TCACAAAAGG 


AACGACTGC? 


23700 


GAGACTTATT 


TTGAAAACAA 


TCATCCAGAA 


ATCAAACTCC 


AAAAATACGA 


CCAATACAGT 


23760 


GAC7CTTACC 


AAGCTCTTCT 


TGACGGACGT 


GGAGATGCCT 


TTTCAACTGA 


CAATACGGAA 


23920 


G7TCTAGCTT 


GGGCCCTTGA 


AAATAAAGCA 


TTTGAAGTAG 


GAATTACTTC 


CCTCGGTGAT 


23880 


CCCGATACCA 


TTCCCCCAGC 


AGTTCAAAAA 


GCCAACCAAG 


AA7TCCTAGA 


C7TCATCAAT 


23940 


AAAGA7ATTG 


AAAAATTAGG 


CAAGGAAAAC 


TTCTTCCACA 


AGGCCTATGA 


AAACACACTT 


24000 


CACCCAACCT 


ACGGTGACGC 


TGCTAAAGCA 


GATGACCTGG 


TTGTTGAAGG 


TCGAAAAGTT 


24060 


GATTAGTCAT 


TAACTCTTAA 


AAGGAACTGG 


ATTTTAAGCT 


CCAATCCCTT 


TTTAAGATTT 


24120 


TACCTATAAC 


ATCCTGAGTC 


TATCTAAGAT 


GTTCAATCTG 


AACACAGTGT 


ACATACTTTA 


24180 


TCTTCTATTG 


CATATACTTT 


ATCACATAAG 


ATACCAATAT 


CCTCTTCACT 


ATGACTAGCA 


'.1240 


ATCAAAATTG 


TTGTCCCTTT 


TTCACTAGAG 


AGCTTTCTAA 


ACAATGTTCT 


CATATTTTCT 


24300 


ACACTTGATT 


TATCCAAGGC 


ATTCATAGGT 


TCATCTAGTA 


AAAGAATAGA 


GGGATTCTCC 


24360 


ATAATTGCTT 


GAGCAATCCC 


TAGCTTTTTC 


CTCATACC7A 


GCGAATAAGT 


TTTAACTTTC 


24420 


TGGTCTTTTT 


CCTCATATAG 


ACCAACTAT7 


TTCAGTGTA7 


CA7TCA7TTC 


CTGATTACCA 


24430 


ACTACTCCTC 


GTATGCTTGC 


CAAATATTGT 


AAATTCTTAA 


AGCCACTATA 


ATAATTTATA 


2454C 


AAACCAGGTT 


CTTCAATCAA 


AGCTCCCAAA 


TTAGCTGGAA 


7TTTTCTCTC 


AGGAACAATA 


2460C 
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TTTTCCCCAT TGATTAACAC TTCTCCATAA GACCGACTAT ATAAACCAGC TATTAATTTA 2 4660 

AACAATACAC TTTTCCCTGA GCCATTCGCA CCAGTAATTC CTATAATTTC CCCCTGTTTA 24 720 

CAACTAAAGT TAAGGTTTTG AAAAACACAT GTCTTTTTTA ATTTCAACTC AATATTTTTT 24 780 

AATGTAATTA TTTCATTCAT TCTATAAACC TCCTCTTTTG ACGAGTGAAA TAGAAAATGC 2 4 840 

TTTGAAAAAG AAAGACTAAA AATAGCAACT GAAGAAATAA ATCTCGTCCT ATATCTCCAT 24 900 
TCCCTCGATT CAAAAT AT AA AATAGATAAT TAGTTCCATT TCCTACAAAT AGACCACCAA " 24960 " 

ACACAATCAT GAGTAAAAAG AAACTAACGC AAGCAAAGTT CG 2S002 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : X1443 base pairs 
<B) TYPE: nucleic acid 
(CJ STRANDEDNESS : double 
(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO : 49: 

CAGGTACGGT GAGGCGCAAC TAAAATATAA TTTTCATCTT GATTAGGAAT TTTATCAGTA 60 

TTATGATAGT GAGCATTGCC ATTGATGGAC CATAAGAGCA ATACAACTAA TCCACGCAAA 120 ** 

TAAGTATAAA ACATGCCATC TCCTTCGATT GTTTTCTTGT TATTATTATA CCTTATCAAA 180 

GGAGGGCTGG CAAACTTTTC CCTTGACTAG ATACATATTT AGGATGAAAT TAGAATTCTG 240 

TTAAAAAAAA TGATATAATA GAATTTATGG ATAAAAATAA CATTATCCCA TTAACCCAAA 300 

GAGAAGTCAA GGAAAGACAG CCTGAGGGT? TCC7CAATCA CTTTACCCCA TCAGCCAGTA 3 60 

CCAGCACTTC GCAAATCGTT AAACGAAATG TCTTTACCCT 7TTTAACGCT TTGAACTTTC 420 

CCATTGCTTT GGCTCTTGCC TTTGTGCAGG CTTGGAGCAA TCTGGTCTTC TTTGCTGTTA 480 

TCTGCTTTAA CGCTTTTTCT GGGATTGTGA CCGAGCTACG ACCCAAACAC ATGGTGGACA 540 

ACCTCAATCT CATGACCAAG GAAAAGCTCA AAACCATCCC TGATCCTCAG GAAGTTGCTC 600 

TTAATCCTGA AGAATTAC7G CTAGGAGATG TCATTCGTTT GTCTGCAGGA GAGCAGATTC 660 

CTAGTGATGC CTTGGTTTTG GAAGGCTTTG CGGAAGTCAA TGAACCCATG TTAACGGGAG 720 

AAAGTGATTT GGTGCAAAAG GAAGTTGACG GCTTACTTTT GTCAGGAAGT TTCCTAGCCA 780 

GTGGGTCAGT TTTAT.CTCAA GTTCACCATG TCGGTGCAGA CAACTATGCT GCCAAACTCA 840 

TGCTTGAGGC TAAGACCGTT AAACCCATCA ACTCCCGTAT CATGAAATCG CTGGACAAGT 900 

TGGCTGGTTT TACTCCGAAG ATT AT C ATT C CCTTTGGTCT GGCTCTCTTG CTGGAAGCCT 960 
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TGCTTTTAAA AGGCCTGCCT CTCAAGTCAT CCGTTGTAAA CTCGTCGACA GCTCTTTTGG 1020 

GAATGTTGCC TAACGGAATT GCCCTTTTGA CCATTACTTC GCTCTTGACT GCAGTGATTA 1080 

AGTTGGGCTT GAAAAAGGTC TTGGTGCAGG AGATGTACTC TGTTGAGACC TTGGCGCGCG 1140 

TGGATATGCT CTGTCTGGAC AAGACGGGTA CCATCACCCA AGGAAAGATG CAGGTGGAGG 1200 

CTGTTCTTCC GTTGACGGAA ACGTATGGTG AAGAGGCTAT TGCCAGCATC TTGACTAGCT 1260 

ACATGGCCCA TAGTGAGGAT AAGAATCCAA CTGCCCAAGC CATTCGCCAG CCTTTTGTGG 1320 

GAGATGTTGC TTATCCTATG ATTTCCAATC TTCCCTTCTC GAGCGACCCC AAGTGGGGGG 13 80 

CTATGGAGTT AGAAGGCTTG GGGACAGTTT TCTTAGGGGC ACCTGAGATG TTGCTTGATT 1440 

CTCAAGTCCC AGAAGCTAGG CAGGCCTTGG AGACAGGATC ACGTGTCTTG GTCTTAGCTC 1500 

TCAGTCAGGA GAAATTAGAC CATCACAAAC CACAGAAACC ATCTGATATT CAGGCTCTAG 1560 

CCTTGCTGCA AATCTTGGAC CCCATTCGAG AGGGAGCAGC AGAGACGCTG GACTATCTCC 1620 

GTTCTCAGGA GGTGGGACTC AAGATTATCT CTCGTGACAA TCCAGTTACG GTGTCCAGCA 1680 

TTGCCCAGAA GGCTGGT7TT GCGGACTATC ACAGCTATGT AGATTGCTCA AAAATCACCG 1740 

ATGAGGAATT GATGGCCATG GCGGAGGAGA CAGCTATTTT CGGACGTGTT TCCCCTCATC 1800 

AAAAGAAACT CATCATCCAA ACGTTGAAAA AAGCGGGACA TACAACGGCT ATGACAGGGG 1860 

ACGGGGTTAA TGATATCTTG GCCCTTCGTG AGGCGGATTG TTCTATCGTG ATGGCGGAGC 1920 

GGGATCCAGC AACCCGTCAG ATTGCCAATC TCCTTCTCTT GAACTCAGAC TTTAATGATG 1980 

TTCCTGAGAT TCTCTTCGAG GGTCGTCGCG TGGTCAATAA CATTGCCCAC ATCGCCCCGA 2040 

TTTTCT7GAT AAAGACCATC TATTCCTTCC TGTTAGCAGT CATCTGTATT GCCAGTGCTT 2100 

TACTAGGTCG GTCAGAGTGG ATTTTGATTT TCCCCTTCAT TCCCATCCAG ATTACCATGA 2160 

TTGACCAGTT TGTGGAAGGT TTCCCACCAT TCGTTCTGAC TTTTGAGCGA AATATCAAAC 2220 

CTGTTGAGCA GAATTTCCTC AGAAAATCCA TGCTTCCTGC CCTACCAAGC GCTCTCATGG 2280 

TCGTCTTCAG CGTCCTCTTT GTGAAAATGT TTGGCGCGAG TCAAGGTTGG TCTGACTTAC 2 340 

AAATCTCAAC 7CTACTCTAT TATCTCTTGG GGTCAATTGG TTTCTTATCC GTATTTAGAG 2400 

CCTGCATGCC ATTTACCCTA TGGCGTGTCC TCTTGATTGT TTGGTCAGTA GGAGGTTTCC 24 60 

TAGCCACACC TCTCTTCCCA AGAATTCAAA AACTGCTTGA AATTTCAACC TTAACAGAAC 2520 

AAACCTTCCC TCTTTATGGT GTCATGATGT TGGTCTTTAC CGTGATTTTC ATCCTGACCA 2 580 

GTCGTTACCA AGCGAAAAAA TAAATCAAAA CCACCAGTGT GAACTGGTGG TTTGTTCTGC 2640 

GGCTATAAGC CGCTTCTACC GGCCAGGGCC AAAGGCCCAC CGAAATAGCT TCCTCGCGCA 2700 

CCACTTTCCC GAGCAGGTGC TAAAGCACCT 7ACTTACTTC CTCTTATTTA TTTCGCCAGT 27 60 
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AAACGGATCT 


ACTGACTCGA 


ATAACGTGAG 


CTGGTCTGCT 


ACTCTGTCTT 


CTTGTAATTG 


2920 


ATTCTGAATA 


TATTCAGCTA 


TCACTTTCTG 


ATTACGGCCT 


ACCGTATCTA 


CATAATAGCC 


2880 


TCTACACCAA 


AACTTGCGAT 


TGCCATATTT 


GTATTTTAAA 


TTCGCATGC7 


TATCAAAAAT 


2940 


CATCAAACTG 


CTCTTGCCCT 


TTAAATAGCC 


CATAAAGGAC 


GAAACACTAA 


GTTTCGGAGG 


3000 


AATACTGATA 


AGCATGTGAA 


TATGGTCTGA 


ACAAGCATTC 


GCTTCATGGA 


TTATTACACC 


3060 


CTTACGCTCA 


CATAACTCAC 


GTATGATTC7 


TCCGATACTA 


GCTTTGTATC 


TGCCATAAAT 


3120 


GATTTGACGA 


CGATATTTGG 


GTGCAAAAAC 


AATATGATAT 


TTACAATTCC 


ATGTGGTATG 


3130 


TGATAAACTT 


TGATTATCCT 


CTCTCATGAG 


GTACCTCCTG 


TATGATATGT 


TGTAGTGGCG 


3240 


GAGAAACCAC 


TTCTATCTTA 


TCATTTTAGG 


AGGTTCTTTT 


TGTTACCACG 


CTAAAAGCTC 


3300 


TATGGAAcCA 


CTAGCATAGC 


TAGTGGTTTT 


CGGGAGACAA 


CAAGAAAGAC 


TGCAATCTGT 


3360 


GGATTGCAGT 


TTTTTATACG 


ATCGATCTAT 


CCTAGATCTG 


ATGTCCAAGG 


CCTACGTGCC 


3420 


GATCATCTAT 


CGGTGAACCC 


AAGAGCGACC 


CTCAAGCCTG 


CTTGGATTGA 


GGTAATAGAT 


3480 


TCAAATATCT 


GTAGTTAGAC 


TATTTGAACT 


TTGATGTAAG 


AAAGAGAAAG 


CGACAGATTG 


3540 


AAGTAATTTT 


AACTCTCTTC 


TATTGCTAGA 


ACAAATGGTC 


GGATAGCTTG 


GTACTTTCAA 


3600 


AATCAAGATG 


CTATCTATTG 


TTAAATGGAA 


CATAGTGTTA 


TTTATTAGAA 


AATCGTTTGG 


3660 


TTTATTTCTT 


ATCAAATACG 


AAAAGCAACT 


TAAATATTTC 


AACTAAAATA 


GATGTTATCA 


3720 


AGAAAAGGTA 


AAATGATTTT 


GGCATAGTGA 


CGTTCTGTTC 


TATTTGATAT 


CATATTTTTG 


3780 


ATAAAAACAA 


AAATGTCCAT 


TGCAAACGAC 


AAAATGCGAA 


G7ATATTATT 


TTTTGAAAGC 


384C 


GATATAATGG 


ATTCATAAAG 


GAGGTGTATC 


GTGTCTAGAA 


AACAAGAACA 


AATGGAAACG 


3900 


TTGTTGCTCC 


TTTTGCGAGA 


TAG7AACGA? 


7ATATATCTC 


CTAAAGTATT 


CGGAGAAAAA 


3960 


TTAAATTGCT 


CTGATAAAAC 


GGTTTATCGC 


CTTGTCAAGG 


GAATCAACAA 


ACATTGTCCG 


4020 


L» I AO AAvjC A * 


1 V. A t I I 


T£AAAAAGGC 


AGAGGTTTCA 


AATTAAATCC 


AAGAAGTTCC 


4080 


CTCGTGGACG 


TTGATGGGAA 


TTTTACAGAG 


GCTTTTGATC 


CTGAAGTAAG 


GCGTGAAAAA 


4140 


TTACTAGAAC 


GTCTCTTGTT 


GACTGCTCCT 


AAGCCACATT 


CTATTTATCA 


TTTAGGAGAG 


4200 


GAATTCTACG 


TAAGCGAGTC 


AGTAGTACTA 


AAAGATCGTC 


AGATATTACA 


AGAGAGTCTA 


4260 


GCAATTTATG 


GGTTAGATTT 


AAAAATGAGA 


CAACGAAAGC 


TTTTTAT7GA 


TGGGGATGAG 


4320 


GCTCAAATTC 


GTTCAGCCAT 


TCTAAATCTA 


CTGCCAATCT 


TTAATCAGTT 


GGATTTAGAG 


4380 


CAAATTACAC 


AGAA7AAGCT 


TCAGCCTCTT 


GACGGAGAAC 


TTGCTCACTT 


TTGTTTGGGA 


4440 


TTACTGATTA 


CACTTGAGAG 


AGAATTGGGG 


GTAAACATTC 


CCTATCCATA 


TAATATAAAT 


4500 
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ATTTTCTCTC ACCTGTATAT TTTTATCAGT AGGAATCGTC GTAGTACTAG TATTCATGTT 4 560 

GTAGCACCTT CAAAACCTAC TATTCTTGAT GAGAAAATTT ACAGTGTCTC TCAAAAAATT 4620 

ATTCAAGAAA TTGAACAATA TTTTAGGATG AAGGTTGATG CAGTTGAGAT TGACTATCTT 4680 

TATCAATACG TTCTATCTTC GAGATTGCAA AAACCATTTT CTTCCGGGAA CCTTCCTTTT 4740 

TCTCAGCGAG TTTTAGATGT CACTCATTAC TATTTTAGCC GTATGTGTAT GGACAATAGA 4800 

GAGATTGAAA CGACAGATCC TGACTTTGTT GACTTGGCGA GTCATATCAG TCCCTTACTG 4860 

AGGAGATTAG ATAATAGACT ACAGATTAAG AATAGTCTTT TATCACAAAT TCTTTTAACC 4 920 

TATCCTAATC TGGTTAAAGA GTTAACAACT ATTTCTAAAG AAGTCACTCT AGTATTTGGT 4980 

TTTGCTTCCT TGAGTCTGGA CGAGATTGGT TTTCTACTCT TATATTTTGC ACGGTTTCAA 5040 

GAAAAGCGAG CACGTCCTCT AAAAACAGTA GTGATGTGTA CATCAGGTGT CGGAACTTCA 5100 

GAGCTTTTAC GAGCACGATT AGAAAAGCAA TTTTCTGAAT TGGATATTAT TGATGTAGTT 5160 

GCTTATCATC AATTAGATGA GCTGATAAAT CTATATCCAG ATTTAGATTT CATTGTGACG 5220 

ACGGTAGCTT TGCAGGAACC AGCAAGTGTC CCGTTTGTCC TAGTTAGTGT TTTTCTAACC 5280 

GAGGGTGATA AACAACGTCT TCAAGCAAAA ATTCAGGAGA TAAACTATGA ATAATCTTTC 5340 

CCTTGTCCTT ATCCATATAT CTGTTCAAAA TCCTCAAGAA GCCTACAAAG AATTACCAAA 54O0 

TCAAATCAGC CTTCTTCTTT CTGAAGATAC AGAAAAAATA GAAGACCTTC TATATTACCG 5460 

TGAGAGACAG GGAAGTATAG AGGTTGCTAA AGGTGTTCTT CTACCACATT GTGAAGGAAA 5520 

CTTTCAACAT CATGTCTTAG TGATTACTAG ATTAAAATCA CCTATCAGAG AATGGTCGAA 5580 

GGATATCCAG TGTGTTGACC TTATTATCGC TTTCCCCATT GCAGTATCAC AGGACAAGTC 5640 

ATGTATTAAA ACATTGATGA GAAGACTAGC AGATGAATCA TTCATAAATC AATT AAAAC A 5700 

GTTAACAAAA GAAGAATTAC CCCACATAAT ATATGGAAAT CAAAGATATT CTTAATGTGA 5760 

GTCTGATCCA GACGGATTTA CAGATCCAGA GCAAAGAAGA GGTTTTTGAG GCATTAGCTC 5820 

AACTATTCCT TGAGACGGGT TATGTGTCTG AT AG AG AC C A ATTTATCGAA GGTCTTTATC 5880 

AGAGAGAGGC AGAAGGACAG ACCGGTATTG GG AATT AT AT TGCTATTCCC CATAGCAAGA : i940 

GTTCTGCTGT GGAGAAGGCG GGGGTAGTCA TAGCTATAAA TCACAATGAG ATTCCTTGGG 6000 

AGACCATTGA TGGGAAAGGG GTCAAAGTAA TTGTACTCTT TGCAGTTGGT GATGATACAG 6060 

AAGCTCCTAG GGAGCATTTG AAGACCTTAT CACTCTTTGC TCGAAAACTT GGTAATGACG 6120 

AAGTTGTTGC CAAATTAGTT CGCGCTCAGA CATCTGATGA TGTGATTGCA GCTTTTTGTT 618C 

AATAAGAAAA AATTTTGGAG CGTATCCGTA TGAAAATTGT TGGTGTTGCA GCTTGTACTG 6240 

TGGGAATTGC CCACACTTAT ATTGCACAGG AAAAATTAGA GAATGCCGCA AAGGTAGCTG 6 300 
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GACATGTGAT TCATGTTGAG ACTCAGGGGA CAATAGCCCT AGAAAATGAA TTGACTCAAG 63 60 

AGCAGATTGA TGCAGCGGAT GTAGTTATTT TAGCAGTTGA TGTTAAGAT7 TCTGGTATGG 642 0 

AACCCTTTGA GGGTAAAAAG ATTATCAAGG TTCCAACAGA AGTGGCAGTC AAATCTCCCA 64 80 

ATAAACTGAT TGCTAAAGC? GTTGAGATTG TTACGAAATA ACTGAAAATA TTTAAGGACA 6540 

AAATATATGT TGAAACACTT AAACTTAAAA GGTCACTTAT TGACAGCCAT TTCCTATATG 6600 

ATTCCAATTG TTTGTCCTGC AGGATTCTTA GTTGCCATTG GTTTAGCAAT GGGGGGTGGT 6660 

GTTCC7GACG CTCTTGTAGC AGGAAAATTC ACTATCTGGG ATGCTTTAGC AACTATGGGT 6 720 

GGTAAAGCCC TTCGTCTCTT CCCACTTGTT ATTCCTACAG GTTTGTCTTA CTCGATTGCT 67 30 

GGTAAGCCAG GGATTGCACC AGGTTTTGTT GTTGGTCTAA TTGCCAATTC TGTTGGTTCA 684 0 

GGGTTTATCG GTGGTATCTT GGGAGGTTAT ATAGCTGGTT TCTTGGTTCA AGCGATTATT 6 900 

AAAAAGGTCA AAGTACCAAA CTGGATTAAA GGTTTAATGC CAACCTTGAT TATTCCTTTT 6960 

GTACCCTCTT TGCTAAGTAC TTTGATTATC ATTTATATTA TTGGAGCGCC TATCGCAGCC 7020 

TTTACCAACT GGTTGACGAG CTTATTACAA AGCTTGGGAA GTGCTTCAAA TGGTTTGATG 7 080 

GGGGCAGTTA TTGGAATTCT CAGTGCTCT7 GACTTTGGTG GCCCACTTAA TAAAACAGTC 7140 

TATGCGTTTG TGTTGACTTT ACAGGCTGAA GGTCTGAAAG AACCATTGAC TGCTTTACAA 7200 

TTGGTGAATA CTGCTACACC AGTTGGATTT GGATTGGCCT ATTTTATCGC GAAATTACTC 72 60 

AAAAAAAATA TCTATACTCA ACACGAAATC GAAACATTGA AATCCGCTCT TCCTATGGGG 7 3 20 

ATTGTCAATA TTGTTGAAGC TGTAATTCCG ATTGTTATGA ATAACTTGGT TCCAGGTCTC 7 3 80 

ATTGCAACAG GTATCGGTGG TCCTGTTCGT GGTGCTGTTT CTTTGACAAT GGGTGCTGAT 7440 

TCTGCTCTGC CAT7TCCTCC AGTCCTTATC TTACCAACCA TGACTCGTCC ACTAGCTGG7 7 500 

ATTTGTGCCT TGTTAGCTAA CATTGTACTC ACAGGACTTG TCTACGCGAT TTTGAAAAAA 7 560 

CCAATAAAAC ATGCAGAACC AGTTATGACT GTTGAAGAAG AGATTGATTT GTCAGATATT 7 620 

GAAATTTTGT AAGAGGGTAA CGATGTCAAG AATTGAATTT TCACCATCTT TGATGACCAT 7 680 

GGATTTGGAC AAATTCAAAG AGCAGATTAC TTTTTTGAAT GATAAAGTAG CATCTTATCA 774 0 

TATCGATATT ATGGATGGCC ATTTTGTTCC CAATATTACC TTGTCTCCTT CGTTCATTCA 7800 

AGAAGTTCAA AAAATTAGTG ACACACCTTT ATCAGTTCAT CTGATGGTCA CAGACCCAAC 7860 

CTTTTGGGTA GATCAAGTTC TCGATTTACA ATGTGAGTAT ATTTGTATTC ATGCTGAAGT 7 9 20 

7CTGAATGGT CTTCCTTTTC GTTTGATTCA TAAAATTCAT GATGCAGGTC TAAAGGCTGG 79 80 

TCTTCTCCTT AATCCTGAAA CACCTGTTTC TACAATCTTT CCCTACATTG ATTTACTTGA 8040 
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AGCGGATGCC 


CGTGAAACTC 


AAGCGGCTTG 


GCATCATGCC 


TTGACJACTA 


CCACCACTCC 


9900 


AACTGTCATT 


GTCTTAACCC 


GTCAAAACTT 


GGTAGTTGAA 


GAAGGGACAG 


ACTTTGGTAA 


9960 


GGTCGCTAAA 


GGAGCCTACG 


TCGTGTATGA 


TACCCCGGGA 


TTTGATACTA 


TTATCATTGC 


10020 


TAPAGGATPT 


CAGGTCAATP 


TAGCTATCAA 


AGCTGCTAAG 


GAATTGGTTT 


TACAAGGTCG 


10080 


T A A A P/F & PGT 
I n/vivj i r\v_L> l 


GTGGTATPTA 


TGCCCTCAAC 


CGAACTATTT 


GATCCTCAAG 


ATGCTACCTA 


10140 


p ^ ap/ia ap a p 


ATTTTAP^AT 
All* Intwttl 


r<^ AAG ACTGG 


TCGTCGTGTG 


GCCATTGAAA 


TGGCAGCGAC 


10200 


L. L. /v^nu 1 1 oLi 


m APAAGTATG 


TTGGT' T1 TGGA 


TGGCGCGGTC 


ATCCGTATTG 


ACATCTTCGG 


10260 




PPAGPTPAGA 


CTGTGATTGA 


TAATTATGGA 


TT7ACGCTAG 


AGAATATCGT 


10323 


tgptpaagtt 


AAGTPPPTAT 


AGAAACCAAT 


TACAATGAAG 


ATACACCTGT 


TGTCAGACTA 


10380 


rr AG ATCT AC: 


TGATAGAPAC 


TAATCAGATG 


ATTGGTTATT 


TAAAAACTGT 


AATGAAAATG 


10440 


TAATAATTf*A 


TCTACGAAAG 


TTATAG7AGA 


TAGTATACAC 


AATAGAGTAT 


ACCCTGAAAC 


10500 


GGTTGP G AAH 


TAPGPTAATC 


ACTTTGCTAC 


TCATCTAGAT 


AGTTTCTTTA 


ATCAATAAAC 


10560 


APAGPATCCA 


CAG AT TG ACT 


TAGGATATTG 


TAAGTTTTTT 


GAAAGCTAGA 


GAGAAGGTCT 


10620 


PTAAAATTAA 


AAAAPCCATA 


GTATAGGATG 


TTGAAATGAT 


GAACTGCACC 


CCAAAAGTTA 


10680 


GAP AG A A AAA 


AATPTAAPTT 


TTGGGGTGTT 


TTTATTATGA 


AATTAACTTA 


TGATGATAAA 


10740 


GTTPAGTTPT 


ATGAAPTTAG 


AAAACAAGGA 


TATATCTTAG 


AGAACCTTTC 


AAATAAATTT 


10800 


PPG AT A A ATA 


ATTPTAATP m 


TAGGTACATG 


ATTAAATTGA 


TTGATCGTTA 


CGGAATAGAG 


10860 


TTTfTriAAA 
1 » _ Li I L.rti\/V\ 


AAGGP.AAAAA 


TCGTTACTAT 


TCTCCTGATT 


TAAAACAAGA 


AATGATTCAT 


10920 


AAAGTCTGAC 


ATGAAGCCTC 


GACTAAAGAT 


AGAGTTTCTC 


TTGAATACGG 


TCTCCCAAGT 


10980 


CGTACGATAC 


TTCTTAACTG 


GCTAGCACAA 


TACAGGAAAA 


ACGGGTATAC 


TATTCTTGAG 


11040 


AAAACAAAAG 


GGAGAGTACC 


TGAGAGCGGA 


GAATGCCATC 


CTAAAAAAG? 


TAAGAGAACT 


111 00 


CCGATTGAAG 


GAGGAAAAAG 


AGAAATAAGA 


AAGACAGAAA 


TTGTTCAAGA 


A7TAATGACT 


11160 


GAGTTTTCGT 


TAGATCTTCT 


TCTAAAACCC 


ATTAAACTAG 


CTCCTTCCAC 


CTACTACTAT 


11220 


CACTTGAAAC 


AGCTAGATAA 


ACCAGATAAG 


GACCAAGACC 


TTAAAGCTGA 


AATTCAATCC 


11280 


ATCTTTATCG 


AACACAAGGG 


AGATTATGCT 


TATCGCCGGG 


TTCATTTAGA 


ACTAAGAAAT 


11340 


CGTGCTTATC 


TGGTAAATCA 


TAAAAGAGTT 


CAAGGCTTGA 


TGAAAGTACT 


CAATTTACAA 


11400 


GCTAGAATGC 


GACAGnAACG 


AAAATATTCT 


TCTCATAAAG 


GAG 




11443 


(21 INFORMATION FOR SEQ ID NO: 50: 









ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5338 base pairs 
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( B) TYPE: nucleic acid 

(C) 5TRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CCAATTACAT TATATTATCA AAATCGTCCA AACTGGCTCC ATGAATGAGC CAGCCAAGCA 60 

ACTCTTTATC ACTCAGCCAA GTCTCTCCAA 7CCAGTCCCA GATTTGGAAA ATGAAATGGG 120 

CATTGAGATC TTTATCCGCA ATCCCAAGGG AATCACCTTG ACCCGTGATG GCATGGAGTT 130 

TCTCTCTTAT GCCCGTCAGG TTGTCGAGCA GACCCAGCTT CTGGAGGAAC GCTATAAAAA 2 40 

TCCTGTCGCC CACCGCGAAC TCTTTAGCGT TTCGTCTCAA CACTATCCCT TTGTGGTCAA 300 

TGCCTTTGTC TCTTTGCTCA AGAAAAGCGA TATGGAGAAA TACGAACTCT TCCTTCGTGA 3 60 

AACTCGGACT TGGGACATTA TCGACGACGT CAAGAACTTC CGCAGTGAGG TCGGGGTCCT 4 20 

CTTCTTAAAC AGTTACAACC GTGATGTTTT AACCAAGATG CTCGATCACA ATCACCTGCT 4 80 

AGCCCACCAT CTCTTCACAC CGCAACCGCA TATCTTTCTC AGCAAGACCA ACCCTCTCGC 540 

AAAGAAAGAC AAGGTGAAAC TGTCTGATTT GGAGAATTTC CCTTACCTCA GCTATGACCA 600 

AGGGACGCAC AACTCCTTCT ACTTTTCAGA AGACATTCTT TC7CAAGAAC ACCACAAGAA 660 

ATCCATTGTG GTCAGTGACC GTGCCACCCT CTTTAATCTC TTGATTGGTT TCGATGGTTA 7 20 

TACCATTGCG ACAGGGATTT TGAACAGCAA CCTAAACGGA GACAATATCG TTTCTATCCC 7 30 

ACTGGATATT GATGACCCGA TCGAGCTGGT CTATATCCAG CATGAGAAAA CCAGCCTATC 340 

TAAGATGCGC CAACGCTTTA 7AGAC7ATC7 CCTAGAAGAA GTTCAGTTTG ATAGTTGAGA 900 

AATGATAAGA ACCAATATGT AGGCTAGCAA CAACCTGCAC ATTGGTTCTT TTTACTTATA 960 

ATTAAAAGTT TCCCCTGCCA ACTTATCAGC TAGCTTGGGA AAGAGAGTAT AAAACTTATG 1020 

GGCTAGGTTC AACAAAATCG GGAGATTGAG TTCTCGTTTC 7TTTTTCCTA TAATCTTCAC 1080 

AATCTTTTTA GCCACTGCAT CTGGTTCTAG CAGGAAGCGA TCAACCGATT TAAGATAACT 114 0 

TCCATCTGGG TCGGCTTGGT CGAAAAATCC TCTACGGATT GGTCCTGGAT 7GACTCTTCT 1200 

CACATAGACT CCATAGGGCA TAAGTTCGAG TCGCAGAGCA 7TTGAAAAAC CAATAGCCGC 1260 

AAACTTGGTC CCTGAGTAAA GACTAGACTT GCCAGTAGCT ATTAGACCTG CCATGCTGAC 13 20 

GATGTTGATG ATATGCCCTT TGCTGCTT7C CTTCATACGA CCCGCAAGGT GACGAGACAG 13 80 

ATTCATCAGG GCAAAGGTAT TGACCTCAAA CATCTGCTGA ATATCTTTA7 CAGCAATCTG 144 0 

GTCAAATCCC TCAAAAATCC CGTAACCAGC GTTGTTAATC AAGACATCAA TCTTGCCATA 1500 

GCGGAGATAA AGATCAGTTA CCACAGCTTC TAGGGCTGAA TCGTCGGTAA TATCAATTTC 156C 
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AATCAATTCT 


GCATGGGAAT 


AATTTCCGTA 


GAGTTGGGCT 


AATTTTTCCT 


TATTTC7ACC 

* ** 4 4 1 W 4 ** W W 


1620 


AAGCAAGATG 


AGTTGGTCAT 


TGGGCAGGAG 


TTTGACCATT 


TCTTCAGCTA 


GACCACCGCT 


1680 


AGCTCCGGTA 


ATGAGAATAG 


TAGGCATACT 


TATCCTTTCT 


GTGACTGCTA 




* 7 d n 


TCTTCCAAGT 


CTTTGACCAC 


ATGGACATTT 


TCAAAAATTG 


TGGCAGCG7C 


• I » ^ I i VJnu i 




TTGCTAATAT 


CTTTTGAGAG 


GAAACGGGCA 


CTGATATGGT 


TGAGTAGGAG 


ULy ill uuv-rt 


IDOU 


CCTGCTTCTA 


CCCCTACTTG 


TCCACCTTCC 


ATATTAG7TG 


AGTGACCATG 


GTTACGAGCA 


19 2 0 


ATTTTTTCAT 


CACCCTTGCC 


ATAAGTGGAC 


TCATCAACTA 


GGACATC7GC 






AGACGCACAC 


TGGCACCCGT 


TTTTCGAGTG 


TCTCCTAAAA 


TACTGATAA? 


CT7ACCTGGA 


2 040 


CGTGGCGCTG 


AG AT AT ACT C 


TGCTGCCTTG 


ATTTCACT7C 


CGTCTTCCAA 

WW ± W 4 4 V* w on 


AACAAflATCC 


z * u u 


TGGCCGTTTT 


TGATTTTACC 


AAAAAGCGGG 


CCGAACGGAA 


CACCAGCAGC 


PTTG A rtTTTT* 
L 1 1 \jf\\j 1 1 1 i 


< :oU 


TCAGCATCCA 


GCGTCCCTTC 


TAGATCCTTT 


TGCATGACAC 


GATAGCCAAC 


ACAGAAAA^A 


44«U 


GTGTGGTCCA 


GCTCCTCTGC 


ATACACAGTG 


AATTTATCGG 


TTTCAAGAAT 


AC CC AHA 


22 80 


GAATCTTGGT 


CAAACTCATG 


GAAATGAATG 


CGGTAGGGCA 


GACGAGAACC 






AGGCTGGTTA 


ACACAAA7CA 


CTTGATTCCT 


TGAGGTCCGT 


AGATTTCCAA 






TCTTCATTGG 


CCTGAAAGGC 


ACGGCTAGAA 


AGGAAACCTG 


GCAAACCAAA 






CCATGCAGAT 


GGGTAATAAA 


GATTTTGCTG 


ACCTTACGTG 


GTCGAATTGT 






A7CCCATTTT 


GCGTACCTTC 


TCCACAGTCA 


AAGAGCCAAA 


CTTCGTTAAT 


CTCATCCAAA 


25g o 


AGTTTCACGG 


CGAGACTTGA 


AACGTTGCCG 


GCTTTAGAGC 


GCTGACCAGC 


CCCCGTTCC7 


2 640 


AAAAATTGAA 


TATCCATTCG 


AT ACTTT CT A 


ATTAATCAAT 


ATATAACATG 


CC^GTCCCGT 

twl vJv. IJVI 1 


2 700 


TTTCCCATCG 


GAAATAGCGT 


TTGCCAGAAA 


.-iAGCACCACC 


77CT7GCAA? 


AAATCC^CT^* 


2760 


GGCTGTAGCC 


TTTCAGACCT 


TTTCGACCAT 


CAGCCAATCT 


7TCCAAATCA 


CTCAAAGCTC 


28 20 


TGAGACTTTC 


TAGGCTGATA 


ACTTCCTCGT 


CCTCGACAGC 


CTTCATGTAA 


ATCT7ACCAG 


2880 


ACTCTTCAAA 


GACTAATTGA 


TGGGGGAAAA 


TTTGCGCAAT 


TTCAAACAGC 


AAGTCA7CCG 


2940 


AGATTTTCTC 


CTCATTTTCA 


AAGAAAATCC 


GACCAAGGCC 


GTCACTCTCA 


7AACAAAAAC 


3000 


CAAAGGATTT 


ACCAGACAGA 


TTAAGCCGAA 


TAAAAGGCTT 


ATTTTC7AGC 


GTGAAAC7TG 


3060 


GCTCAGTATT 


GTAAAGATTC 


AGTTCCTGAC 


TGAGTTCTGC 


AAAATAATCC 


GTCGCAGCCT 


3120 


GAGGACTCTT 


TTTCTGATAG 


AGTTCTGCAA 


AGTAGGCATT 


AACAACAC77 


GGCGGAGGTG 


3190 


TAATAAGTGT 


TAACTGCTCC 


TGATCTGTTT 


TACCAGCTAG 


AAGC7GATCC 


AGATAGACC7 


3240 


TGTCCAGACT 


TGTATAACCT 


CCATACTTTA 


GAGCCAAAGT 


7TTAATATCA 


GTCATAAAAT 


3300 
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TCTTCTAACC TCCATTTATT TTTCTCGGAA ATGTAGCCTG TAA^C" .^TTC GCCGTCTTCC 3 360 

TGATAATCAC GTTCTTCCAG AATTGCAACA CTCTCTAAAT CATGAATCTT GTAGGACTTT 3420 

GAAAAAGGCA CTCGCAGGGT AAATGCTTCA AAAATTTCCT TAATCTTATC TAGCAATAAT 3480 

GCTTGCAAGT TTTCACGACT GTCCTCAGAC TTGGCACAAA TGAGGGTATA TGGCGTTTGG 3540 

GTAGGCGTGA AATCCTCCAC CAAATCCGCT TTATTA7AAA GCGTCAAGTG AGGAATATCT 3600 

TCCATGTCCA CGTCTTTCAT GATGGAGAGA ACCCTTTTTT CATGCTCCTC GTGGTAAGGA 3660 

TTGCTAGCAT CGATAACATG AACCAGAAGG TCCACATGCT TGCTTTCTTC CAAGGTTGAC 3720 

TTGAAACTGG ACACCAACTC TGTCGGCAAA TCTTGCATAA AGCCAACGGT ATCTGTCAAA 3780 

GTTACTTGGA GATTGCCTCC CAGATGAATA CTCTTGCTTG TCGCATCCAG AGTCGCAAAG 3840 

AGCTCATCTG CTTCATACTG GGTCTTACTG GTCAAGATGT TCATGATAGT TGATTTCCCA 3900 

GCATTAGTAT AACCAATCAA ACCAATCTTA AAAGTGCTAG ACTCCAAACG TTTTTCTCTG 3960 

ACAGTCGCAC GATTTTTCTC AACCACCTTG ACCTGGCGCT CGATATCCGT CATTTGATTG 4020 

CGAACGCTAC GACGCTTCAG CTCCAGCTCG CTTTCACCAG GACCACGGGA ACCAATTCCC 4080 

CCTgCCTGAC GGCTGAGCAT AATCCCCTGA CCAACCAAGC GAGGCAAAAG GTATTTGAGT 4140 

TGGGCTAGGT GGACTTGGAG CTTCCCTTCA TCGCTTCGAG CCCGCATGGC AAAGATATCC 4 200 

AAAATCAACT GCATACGGTC AATGACCTTA ACACCGAGAA CTTCCTCTAG ATTGACATTC 4 260 

TGCCTTGGGG TCAGACGATT GTTGACGATG ACAGTAGTGA TTTCTTCTGC ATCCACCATA 4 3 20 

AGCGCAATCT CTTCCAACTT ACCAGAGCCG ACGAAGGTCT TGGAATCATA TTTTTCACGT 4 380 

TTTTGTCTGT AGCTATCTAC AACGACTGCC CCTGCCCT7T TCGCTAAACT AGCCAATTCT 4440 

TCCATGGAGA GGTCAAAACT CTCCATACCC TGCAATTCCA CACCAATCAG CAGGACTCCC 4 500 

TCCTCTTTTT TCTCCGTTTC AATCATCTAA AAACTCCTCT ATCTGGCTTA AAA7GCGGTC 4 560 

TTGTACACCA GATTCTCCAA TCTGATAAAA GGTGACCTGC ATGCGATTAC GGAACCAGGT 46 20 

CAGCTGACGC TTGGCAAAAC GACGAGTCGC C7GTTTAAGA CTCTCACTAG CTTCCTCCAA 4 680 

CGTCTGCTCT CCACGGAAAT AAGGAAAGAG T7CCTTATAG CCAATTCCTT TAGCAGCCTG 4740 

TACATTAGGG GAATGGTCAA ACAGCCACTT GGCCTCATCC AAAAGCCCAG CCTCAAACAT 4 800 

CAAATCCACT CGGTGGTTGA TACGCTCATA AAGTTGACTA CGTTCATCAT CCAAGCAGAT 4 860 

AATCAGCGGT TCATACAACG 7CTCTTGATT TTCCAAATCC TGACCAAAAT GGGCAATTTC 4920 

TAAGGCACGC ATAGCACGAC G AC G ATT AAA CTGGGGAATC TCAAGCCCTG CTTGATCCAC 4980 

CAAATGGGCT AATTCCTCAT CTGAAT ATGG CTCCAAACTA CCTCCATAAG CTAAAATCTC 5040 

CTCATGAGGA GTCTCCCCAC CTAGCTGCTA ACCTTCTAGC AAGCTCTGGA TATAAAGTCC 5100 
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AGTCCCACCG GCGATAATGG CTAGCTTGCC ACCG7TGTCA ATACCCTCAA TAGTCATCTT 5150 

AGCTTCTGAA ACAAAATCAA AAGCCGAGTA AGACTCGGTT ATCTCTCTAA CATCGATTAA 522 0 

ATGATGAGGA ACAGCTGCCT GCTCTTCTGG ACTAGCCTTG GCCGTCCCAA TATCAAGTCC 5230 

TC GAT AG ACT TGCTGGCTAT CTCCACTAAC CACTTCGCCA TTAAAACGCT TTGCGGGG 53 38 
(2) INFORMATION FOR SEQ 10 NO: 51: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19446 base pairs 

(B) TYPE: nucleic acid 

(C) STRAW DEDNE S3 : double 
(0) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CGGAAACCCA TCTACTCTCC ATCGTTTGCC AGACCAAGCA ACACGAATCT TAGATGCTTC 60 

TCCCCAACAG ATTGCAGATT TAATCGGTAA GAAAAGCGAT GAAATCTTCT TTACCTCGGG 120 

TGGAACAGAA GGGGATAACT GGCTTATCAA GGGTGTGGCC TTTCAAAAAC CTCAGTTTGG 180 

CAAGCACATC ATTCTTTCAG CCATTGAACA TCCAGCAGTC AAACAGTCAG CCCTCTGGTT 2 40 

GAAAAGTCAA GGATTTGAAG TGGATTTTGC TCCAGTTGAT AAGAAAGGCT TCGTCGATGT 300 

TCAGGCGTTA CAGGTTTGAT ACGGCATGAT ACAATCCTCG TTTCCATCAT GGCTGTGAAC 3 60 

AATGAAATCG GCTCTATCCA ACCTATTGAG GCTATTTCAG AATTCTTGGC AGACAAGCCG 423 

ACTATTTCCT TCCACGTTGA TGCGCTTCAC GCCCTTCCCA AAATTCCGAC TGAAAAGTAT 4 80 

CTGACAGAAC GCGTGGATTG CGCGACTTTC TCTAGTCACA ACTTCCACCC GGTTCGAGCT 540 

GTTGGCTTTG TCTATATCAA ATCTGGCAAC AAGATTACAC CTCTTCTTAC AGGTGGTGCC 600 

CAGGAGCGAG ATTATCGTTC GACAACTGAA AATGTGGCAG GGATTGCAGC GACAGCCAAG 660 

GCCCTCCCTT TGTCTATGGA AAAGCTAGAT ATCTTTAGGA GCAAGACTGG GCAGATGAAG 72 0 

GCAGTGATTC GCCAAGCTCT TCTGAACTAT CCGCATATTT TTGTCTTTTC AGATGACGAA 780 

AACTTTGCAC CTCATATTCT GACTTTTGGA ATCAAAGGTG TTCGAGGTGA AGTCATCGTT 840 

CACGCCTTTG AAGACTATGA TATTTTCATC TCAACAACCT CAGCTTGTTC ATCTAAGGCA 900 

GGAAAACCAG CCGGTACCTT GATTGCCATG GGAGTGGACA AAGATAAGGC CAAGTCAGCT 960 

GTGCCTCTTA GCCTAGACTT GGAAAATGAT ATGAGTCAGG TCGAGCAGTT TTTGACCAAG 1020 

TTAAAATTGA TTTACAATCA AACTAGAAAA GTAAGATAGG AGCATTCATG CAGTATTCAG 1330 

AAATTATGAT TCGCTACGGA GAGTTGTCAA CCAAGCGTAA AAACCGTATG CGTTTCATCA 114 3 
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ATAAACTTCG TAATAATATT TCGGACGTTT TCTC7ATCTA TACCCAAGTT AAGGTAACAG 1200 

CAGATCGCGA CCCTGCCCAC GCTTACCTCA ATGCAGC7CA TTACACAGCA GTTGCACAAT 12 60 

CTCTCAAACA AGTTTTTGGA ATTCAAAACT TTTCTCCTGT TTATAAGGTT GAAAAATCTG 13 20 

TAGAAGTTTT GAAGTCTTCT GTCCAAGAGA TTATGCGGGA CATCTACAAG GAAGGTATGA 13 80 

CCTTTAAGAT TTCTAGCAAG CGTAGCGACC ACAACTTTGA ACTTGATAGT CGTGAACTCA 14 40 

ACCAAACACT TGGAGGGGCT GTATTCGAAG CCATTCCAAA TGTGCAAGTT CAAATGAAAA 1500 

GTCCTGACAT CAATCT7CAG GTGGAGATTC GTGAAGAAGC AGCCTATCTT TCTTATGAAA 1560 

CCATTCG7GG GGCTGGTGC7 TTGCCACTTG GAAC77CACG TAAAGGGATG CTCATGTTGT 1620 

CAGGAGGGAT TGACTCACCT GTACCAGGTT A7C7TGCTC7 TAAGCGTGGG GTGGATATCG 1680 

AGGCAGTTCA CTTTGCTAGT CCACCATATA CTACTCCTGC TGCCCTCAAG AAAGCGCAGG 17 40 

ACTTGACCCG TAAATTGACC AACTTTGGCG GAAATATCCA GTTTATAGAG GTGCCTTTCA 1800 

CACAGATTCA AGAGGAAATC AAAGCCAAAG CGCCAGAAGC TTATTTGATG ACTCTAACTC 18 60 

GTCGCTTTAT GATCCGCATT ACTGACCGTA TTCGTGAGGT ACCAAA7CCT TTGGTTATCA 1920 

TCAATGGGGA AAGTCTAGGT CAAGTAGCCA GCCAAACCCT TGAAAGTATG AACCCTATCA 1980 

ATGCTGTTAC CAACACTCCC ATCATTCGTC CTGTGGTTAC CA7GGACAAG TTGGAAATCA 204 0 

TTGACATCGC CCAGGAAATC GATACCTTTC ACATTTCAAT CCAACCGTTT GAAGACTGTT 2100 

GTACCATTTT TGCACCAGAT CGTCCAAAAA CAAATCCTAA AATTAAGAAT GCGGAGCAGT 2160 

ACGAAGCGCG TATGGATGTT GAAGGCTTGG TTGACCGACC AGTGGCTGGA ATCATGATTA 22 20 

CTGAAATCAC ACCTCAAGCC GAAAAAGATG AAG77GA7GA CTTGATTGAC AA7CTCCTC7 2280 

AATTCAGAAA ATCCAAAAGA ATAGCGAAAA TCAGTAAAAA AACTTACTTT TTTCTCTAAA 23 40 

AACAGGTAAA AAACTAACTT TTTTTATTTT TATGATATAA 7GATATAAAA TTTTGAATAT 2 400 

ACAGAG7TTT CTGACAATGA ATCAATCCTA CTTTTATCTA AAAATGAAAG AACACAAACT 2 4 60 

CAAGGTTCCT TATACAGGTA AGGAGCGCCG TG7ACGTATT CTTCTTCCTA AAGATTATGA 2 5 20 

GAAAGATACA CACCCTTCC7 ATCCTGTTGT ATACTTTCAT GACGGGCAAA ATGTTTTTAA 2 580 

TAGCAAAGAG TCTTTCATTG GACATTCATG GAAGATTATC CCACCTATCA AACGAAATCC 2 640 

GGATATCAGT CGCATGATTG TCGTTGCTAT TGACAATGAT GGTATGGGGC GGATGAATGA 27 00 

GTATGCGCCT TGGAAGTTCC AAGAATCTCC TATCCCAGGG CACCAGTTTG GTGGTAAGGG 2 7 60 

TGTGGAGTAT GCTCAGTTTG TCATGGAGGT GGTCAAGCCT TTTATCGATG AGACCTATCG 2820 

TACAAAAGCA GACTGCCAGC ATACGGCTAT GATTGGTTCC TCACTAGGAG GCAATATTAC 2 880 

CCAGTTTA7C GG7T7GGAA7 ACCAAGACCA AAT7GGT7GC TTCGGCG777 TTTCATCTCC 2 940 
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AAACTGGCTC CACCAAGAAG CCTTTAACCG CTATTTCCAG TGCCAGAAAC TATCGCCTGA 3000 

CCAGCGCATC TTCATCTATG TAGGAACAGA AGAAGCAGA? GATACAGACA AGACCTTGAT 3060 

GCATGCCAAT ATCAAACAAG CCTATATCGA CTCCTCGCTT TGCTATTACC ATGATTTGAT 3120 

AGCAGGGGGA GTACATCTCG ATAATCTTGT GCTAAAAGTT CACTCTGGTG CCATCCATAG 3180 

TGAAATCCCT TGGTCAGAAA ATCTACCAGA TTGTCTGAGA TTTTTTGCAG AAAAATGGTA 324C 

AGTTAAGAAA GGAAAAAACG AAATGCATAT TGAACATCTT AGCCACTGGA GTGGTCATCT 3 300 

TAACCGTGAA ATGTACCTTA ACCGTTATGG ACATGGTGCG ATTCCAGTTG TGGTCTTTGC 3 3 60 

TTCATCAGGT GGTAGTCACA ACGAATACTA TGATTTTGGC ATGATTGATG CCTGTGCTTC 342 0 

CTTTATCGAG GAAGGCCTTG TCCAGTTCTT 7ACCCTATCT AGTTTGGATA GTGAGAGCTG 3480 

GTTGGCTACT TGGAAAAATG CTCATGACCA AGCGGAAATC CACCCTGCCT ACGAACGTTA 3510 

TGTGATTGAG GAGGCCATTC TTTTATCAAG CACAAGACAG GTTGGTTTGA TGGCATGATC 3600 

ACCACAGGTT GCTCTATGGG AGCCTATCAT GCACTCAATT TCTTCCTCCA GCATCCAGAT 3660 

GTCTTTACCA AAGTGATTGC TCTCAGTCGT GTTTACCACG CACGTTTCTT TGTCGGTCAT 3720 

TACTACAACG ATGATGCTAT TTACCAAAAC TCGCCACTAG ATTATATTTG GAACCAAAAC 37 80 

GACCGCTCCT TTATTGACCG TTACCGTCAG GCAGAGATTG TGCTGTGTAC GGGGCTTGGA 3840 

GCCTGGGAAC AAGATGGTTT GCCATCCTTT TACAAGCTCA AAGAAGCCTT TGACAAGAAA 3900 

CAAATTCCAG CCTGGTTTGC TCAATGCGCA CATGATGTCG CCCATGACTG CCAATGGTGG 3960 

CGTAAACAAA TCCCTTATTT CCTCGGTAAT CTCTATTTAT AAAAGGACTT ACCTATGAAT 4C20 

TACCTTGTTA TTTCTCCCTA CTATCCACAA AACTTTCAAC AGTTTACCAT CCAACTACC? 4080 

AATAAAGGCA TCACAGTCTT GGGAATTGGT CAACAGTCTT ACGAGCAATT GGATGAGCCC 4140 

TTGCGCAATA GCTTGACCGA GTATTTTCGT GTTGATAATC TTGAGAACAT AGATGAAGTC 4 2 00 

AAACGTGCAG TTCCTTTTCT CTTTTATAAA CATGGTCCAA TTGGCCGCAT CGACTCTC\C 42 60 

AATGAA7ACT GGCTTGAGCT AGACGCAACA CTCAGAGAAC AATTCAATGT TTTTGGTGCC 4 3 20 

AAACCAGAGG ATCTCAAAAA GACGAAATAT AAGTCTGAAA TGAAGAAACT TTTCAAAAAA 4 380 

GCAGGTGTTC CTGTGGTACC TGGAGC7GTT ATCAAGACGG AAGCAGATGT TGATCAAGCA 4440 

GTGAAAGAAA TCCGTCTTCC AATGATTGCC AAACCTGATA ATGGAGTGGG AGCAGCCGCA 4500 

ACCTTTAAAC TTGAGACAGA AGACGATATC AATCACTTCA AGCAAGAATG GGACCATTCA 4 5 60 

ACCCTTTATT TCTTTGAAAA ATTTGTCACT TCCAGCGAAA TCTGTACCTT TGACGGGCTC 4 620 

GTGGACAACG ATGGAAAGAT TGTCTTCTCA ACAACCTTTG AC7ACGCCTA TACACCGCTT 4 630 
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GACCTCATGA TTTATAACAT GGACAATTCT TATTATGTGC TCAAGGATAT GGATCCTAAA 4 740 

CTGCGCAAGT ATGGGGAAGC AATTGTCAAA GAATTTGGTA TGAAAGAACG GTTTTTCCAT 4300 

ATTGAGTTCT TCCGTGAGGG GGACGATTAT ATTACCATCG AGTACAATAA CCGCCCTGCA 4860 

GGTGGTTTTA CCATTGATGT TTATAACTTT GCTCATTCCT TGGACCTTTA TCG7GGCTAT 4 920 

GCAGCTATTG TCGCAGGAGA GGAGTTCCCG GCGTCAGACT TTGAAACTCA GTATTGTTTG 4 980 

GCTACTTCTC GCCCTGCAAA TGCTCACTAT GTTTATTCAG AAGAGGATT7 GCTTGCCAAA 5040 

7ATAGCCAGC AGTTCAAGCT TAAAAAAGTC ATGCCAGCTG CCTTCGCGGA ACTTCAAGGA 5100 

GATTACCTGT ATATGCTGAC CACTCCGAGT CGACAAGAAA TGGAGCAGAT GATTGCAGAT 5160 

TTCGGACAAC GTCAAGAATA AGAACTATCG GATTAAGGAA ATTAACTCCC TTAATCCTTT 52 20 

TGTTTTGTCT GATAAAAAAT AAGAGCATCC CAACAAGGTA GCTATCATAA AACTTGTTCG 5280 

ATAACTATTT GAAGCAGGAT TAGGTGGTCA GAAATTAAAT TTTAATATTT CAATTGAG7C 5 340 

ATAGTATTGT GTTTGCGTAT CCTTAAATCA GCTAAAAGGA TCCATGACGA CACCTATACC 5400 

ATA7AC7TTT CAAGA7ACCA AACAACTC7A TTAATATTCA ATGAAAATCA AAGAGCAAAC 54 60 

TACGAAGCTA GCCGCAGGTT TCTCAAAACA CTGTTTTGAG GTTGTGGATA GAACTGACAC 5520 

AGTCAGTATC ATATACTACG GCAAGGTGAA GCTGACGTGC TTTGAAGAGA TTTTCCAAGA 5580 

GTATAAAATA 7TCAGGTCAC GCATAGATAT AGTTAATTGA AGC7TTGTTT GAAATCTGAT 5640 

AAAATAATGA TATTACTAAG TTTTAAAAAC TAAAGAAAAG GCAACATATG ATTACAGGCG 57 00 

AATTAAAAAA TAAAATCGAT CAGCTCTGGC AAATTCTTTC GACAGAAGGA AACGCAAATC 57 60 

CTTTAACAAA TATTGAACAG TTGACTTATC TCTTATTTAT GAAAGATT7G GATAGTGTCG 5820 

AGCTTGGACG TGAAAGTGAT GCTGAATTTC TAGGGATTCC TTATGAGGGA GTTTTTCCAA 5380 

AAGATAAACC TGAATACCGT TGGTCAACTT TTAAAAATAT AGGAGATGCT CAGGAAGTTT 5940 

ATCGTTTAAT GACTCAGGAG ATTTTTCCGT TTATTAAAAA TCTCAAGGGG GATACAGATG 6000 

A7ACAGCCTT 7TCACGATAT ATGCGAGAAG CTATTTTTCA AATAAATAAA CCTGCTACGC 6060 

TTCAAAAGGC AATTTCTATC TTAGATGTTT T7CCAACTAG GGGATTAGAT GTAGATTTTG 6120 

ATAATGACAA ACAAAGTATT ACTGATATCG GAGATATCTA TGAATATC7G TTATCAAAAT 6130 

TGTCGACCCC AGGTAAAAAT GGACACTTCC GTACACCTCG TC AC AT CATC GATATGATGG 6240 

TTGAGTTGAT GCAACCGACT ATCAAAGATA TCATCTCAGA 7CCCCCTATG GGTTC7GCTG 6 300 

GC7TCTTAGT A7CTGCTAGC CGTTAC77AA AGCGTAAGAA AGATGAATGG GAAACCAATA 6 3 60 

CAGATAATA7 CAATCATTTT CATAATCAGA TGTTTCA7CG AAATGA7ACG GATACGACTA 64 20 

TGTTGAGAC7 7GGGGCGATG AACATGA7GC 7ACATGGAGT AGAAAA7CCA CAAATCACTT 64 80 
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ACCTTGACTC GCTCTCTCAA GATAATGAAG AAGCCCATAA ATATACTTTG GT7TTAGCAA 6340 

ATCCTCCTTT TAAGGGCTCA CTTGACTACA ATTCAACCTC TAATCACCTT CTTGCAACCG 6600 

TAAAAACCAA AAAAACAGAA TTACTCTTTC TTTCTCTTTT C7TGCGAACT TTAAAACCAG 6660 

GTGGACGAGC AGCACTTATC GTACCTGATG GTGTCCTTTT TCGTTCCTCT AAAGCTCATA 672 0 

AAGGAATTCG TCAGGAAATT GTAGAGAATC ATAAGCTTCA TGCTGTAATC TCAATGCCTA 6780 

GTGGTGTGTT CAAGCCTTAT GCTGCAG7TT CAACTGCCA? TCTCA7C777 ACAAAAACTG 6840 

GTAATGGTGG TACTGACAAA GTCTGGTTTT ACGATATGAA AGCGCATCGT TTAAGTTTGG 6900 

A7CATAAGCG ACAACCGATT AGCGACAA7C ATATTCCAGA 7AT7ATCGAA CCC7TTCATC 6960 

ATCTTGAAAA AGAAGCAGAA CGTCAGAGAA CGGATCAATC 7TTC7TTGT7 CCACTTGCTG 7020 

AGATAAAGGA AAATGATTAT GATTTGTCTA TCAATAAATA TAAAGAGATT GAGTATGAAA 7080 

AAGTTGAGTA TGAACCAACA GAAGTCATAT TAAAGAAAAT CAATGATTTA GAAAAAGAAA 7140 

TTCAAGCTGG CT7CGCTGAA TTGGAAAAAT TAC7CAAG7A GGGAGGTGGC 7C7ATGAAAA 7 200 

AAGTGAAGTT GGGGG AAGTC TTATCTCTAA AAAAAGGCAA GAAAGCCACT GTACTTGCTG 72 60 

AACAAACAAC TCTAAGCCAA CCTTATATTC AAATACA7CA TTTAAGAAAT AATAATAATT 7 320 

TAAAATTCAC TGAAAGTTTA AATATGACTG AAGCACTCCC AGATGATATT CTGATAGCAT 7 3 80 

GGGATGGAGC TAATGCAGGA ACAGTTGCTT ATGGATTATC GGGAGCTGTT GGTAGTACAA 7440 

77ACCC7CTT AAAAAAGAAT GAGCGATACA AAGAAAAAAT TATATCAGAT TACTTGGGAG 7 500 

TC777T7GCA AAGTAAATCG CAGTATTTAC GAGATCATTC AACAGGTCCA ACAATTCCTC 7560 

ATTTAAACAA GAATATATTA CTTGA77TAC AATTAGAATT GCTAGGTATC GAAGAACAAG 7 620 

ACAACAT7A7 CTGTATTCTT AATACCA77A AAAGGC77A7 7 A CT AAAA G A AAA777CAGT ^5 3C 

TAGATGAACT AAACTTGCTC GTCAAATCCC GATT7AACGA GATGTTTGGG GAAAATAAAA 77 40 

TATTTGAAAG CATTGATAAC TTATTTGATA 7TATAGATGG TGATAGGGGC AAAAATTATC 7 800 

CTAAATCAGA TGAGTTCTTT AGTGAGGACT ACTGTT7ATT TTTAAATACA AAGAA7GTTA 7 860 

CTAAAAACGG ATTTTCATTC GATACAAAGC AA77TATCAC TAAAACAAAG GA7AAATTAC 7 9 20 

TTCGAAAAGG CAAACTTGAG CGTTATGATA TAG7CTTGAC AACAAGAGG7 ACTCTTGGAA 7 980 

ATGTAGCGTA CTACGATGAA TTAATAAAA7 ATAAACATT7 ACGTATAAAT 7CAGGTATGG 3040 

TAATATTACG TCCCAAGACA CCAAATC7AA ATCAGAAAT7 7ATTATCCA7 CTTTTAAGGA 8100 

ATAATAAT7A TAGTCGAGTG A7ATCAGGAA GTGCTCAGCC TCAGTTACCA AT7ACAAAAT 8160 

TAAAAAAAAT ACTTCTCCCC C7CCCCCCAC TAGCCCTCCA AAATGAG7TC CCACACTTTG 82 2 C 
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TAGTCCACCT CGACAAATCA CAATTTGCTT GTGAGATAGC TATAAAAGTG TGGAGAAATA 8 280 

CCTTGAAATT TAGTATAATA TAGCTAAACT ATTTGTTTAA AGTGAGAAAA AAATGGGAAA 3 3 40 

TTTTAGCTTT CTTTTAAAAA ATGACGAATA TGAATCTTTT TCAAAACCTT GCATTGAAGC 8400 

TGAGAATATG ATTGCTACAT CAACTGTGGC TACTGCCTTT ATGGCGCGTC GTCCTTTAGA 8460 

GCAGGCTCTC CATTGGATAT ATAGTCACGA TTCATATTTA GAAGCTCCCT ATCGTGCTAC 8520 

TCTATCTTCT TTAGTATGGG ATGATGATTT TAGGGATATC GTAGATTCTG AACTCCACAA 8580 

GCAGATAGTT CTGTTGATTC GGTGGGGAAA CCATGCTGCT CATGGTGGTC AAATTAAGGA 8640 

ACGAGAAGCG ATTTTAGCTT TCCATCATTT GTATCAGTTT GTTAATTTTA TCGATTATTG 87 00 

TTACAGCAAT GAGTTTCTCG AGCCTTATTT TGATGAGAAG TGCTTACCAC TTTCAGCAAA 8760 

CATCAAATAC CGAGAAACTC CACAATCTAT CATAAAGTTA CAAGACAGTT TACCAGAACT 8820 

GCCTGATTTT CATGAACAGA TGGCTGCTCA GTCCGTAGAA GTTCAAGAGA CTTATACTGA 3880 

AAAACCTGAC ACTGCAGCGC AACGGCAAGA TCTCCCTTTC CATATTGATC AATTATCTGA 8940 

GGCAGAGACA AGAAAGCTCT TTATTGATAT CGATCTCCGT TTAGCAGGAT GGATATTTGA 9000 

AGAAAACTGT CGTGTTGAGA TAGCCGTTGA TGGTCTCAAG CACGGTTCAG GAATTGGTTA 9060 

CTGTGACTA7 GTACTTTATG GTAAAAATGG GAAAATTTTA CCCATTCTCC AGGCTAAAAA 912 0 

AGCCTCTGTC AATCCAGAAG TAGGGGAAGT ACAGGTCAAA CAATATGCTG AAGCTTTGGA 9180 

GAAACATATC GGCTATCAGC CAATTTGCTT TATTACAAAT CGCTTCAACC ACT AT AT ACT 924 0 

TGATGGTCCG AACCGCCGCC AGATTGCAGG CTTTTACTCT CAAGAAGAAT TGCAATTAGT 9 300 

GATGGATAGA CGTCATCTTC AAAAACCGCT TGAGGATATT TCTACTAAAA TTAGGGACGA 9 360 

TATTTCCGGG CGTCACTACC AAAAACATGC CATTGCAACC GTTTGTGAAG CTTTCTCTCA 94 20 

TCATCGTAGA CAGGCACTTT TCGTTATGGC AACTGGGGCG GGGAAAACTC GTACAGCAGT 9480 

TTCTCTAGTT GATATCTTAT CACGTCATAA CTGGGTAAAA AACGTTCTCT TCTTAGCCGA 9540 

TAGAACTTCC TTGGTTAAGC AAGCATATGA TTCCTTTACA AAATTACTCC CAGATCTTTC 96C0 

CCTTTGTAAC TTCTTAGAAG ATAAAGAAGG AGCTCAATCA AGTCGCATGG TCTTTTCAAC 9660 

TTATCCCACC ATGATTGGAG CGATTAGTGG TCAAGAAGAA GTAAATCAAC GCCCTTTCAC 9720 

TGTTGGGCAT TTTGACCTTA TCATAATTGA CGAATCTCAC CGTTCTATTT ATCAGAAATA 97 80 

CAAGTCCATT TTTCATTATT TTGATGCAAG AATTGTAGGC TTAACAGCTA CTCCGCGTCA 9 840 

AGATTTAGAT AAAAACACCT ATCGATTCTT TAATTTGGAG AATGCGGTTC CAACATATGC 9 900 

ATATGATTTG GAAGAGGCTG TTAAAGACGG ATATTTAGTA GCCTATCATT CTATCGAAAC 9960 

CAAACTGAAA CTACCTACGC ATGGTCTACA 7TATGATGAT TTGTCCGAAC AAGAAAAGGA 10020 
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ACATTTTGAT 


AGCAAATTTG 


AAGACAATAG 


CTGTGAAAAA 


GATATTGATG 


GGAGTGTATT 


10080 


TAATTCCTTT 


ATTTTCAATA 


AAAGTACAGT 


AGAAATTGTT 


TTAAATGAAC 


TCATGACAAG 


10140 


AGGAATTCAG 


ACAGCCTCGG 


GTGATGAAAT 


TCGTAAAACT 


ATTATTTTTG 


CTAAAAATCA 


10200 


TGATCATGCG 


GAATATATCA 


GAGGTATTTT 


TAACAACCGC 


TATCC7GAAA 


AAGGGAGCGA 


10260 


CTATGCTCAG 


GTGATTGATT 


ATAGTATTAA 


GCATTATCAG 


ACCTTGATTG 


ATGATTTTAA 


10320 


AATTAAGGAG 


AAC7A7CCTC 


AAATTGCGAT 


TTCTGTCGAT 


ATGTTAGATA 


CAGGTATTGA 


10380 


TCTACCAGAG 


GTTCTTAATT 


TAGTCTTCTT 


CAAGAAAGTA 


CGCTCTAAAA 


CTAAGTTTTG 


10440 


GCAGATGATT 


GGTCGAGGAA 


CCCGTCTATG 


TAAAGA777A 


TTTGGACCTC 


AGCAGGA7AA 


10500 


GGAAAACTTC 


TTGGTATTTG 


ATTATGGCCA 


CAATTTTGAT 


TATTTTCGTG 


CAGATCCAAG 


10560 


AGATGGAGAG 


GGTCGTCACA 


T7GTTTCGCT 


GACTCAGCGT 


TTATTTAATA 


TCAAAGTGGA 


10620 


CTTGATTCGA 


GAACTTCAGG 


GACTCCAATA 


CCAAGAACAT 


CAGTTTGCGA 


GAGCATACCG 


10680 


7CAGCAGCTT 


GTCTCGGAAC 


TTCAAGGTCG 


TATAGAGAGC 


TTAAATGAGT 


TGGACTTCAG 


10740 


CG77CC7ATC 


GTTTTAGATA 


CAGTTTATAG 


CTATAGGAAA 


TTGGAAAGTT 


GGCAGAATCT 


10800 


AACTGCTGTT 


ACAAGTGAAA 


CCATTCAAAA 


AAATCTCTCT 


CCGCTTTTAT 


T7GATCAAGA 


10860 


TAAAGAAGAT 


GAGATGGCGA 


CCAGATTTCA 


TTTGTGGTTC 


CTTCATATTC 


AGTTGGGGCA 


10920 


ACTGACAGCT 


AAATCTTCCA 


CTGTTCATAT 


TTCCCAAGTG 


ATGAAGACGG 


CTAGAGCTCT 


10980 


TTCTCCTATT 


GCCAATA7CC 


CGCAGGTTTT 


TGAGCAGCCT 


GAAATTATCA 


GGAAAGTACA 


11040 


GGAGCCTGAA 


TTTTGGAAAG 


AAGTTAACTT 


GTCTGAT7TC 


GAAAAAATTC 


CTCTTGCTAT 


1UC0 


TCGAGATTTA 


TTACAGTTTT 


TGGATAAAAC 


AGACCGTAAA 


CCCTACTATG 


TTAACTTTGA 


11160 


AGATCGTATA 


C7C7CCACTC 


TTCACGAGAC 


CACACCA77T 


TTGCAGCTCA 


ACGATCTTCG 


11 = 20 


CTC7TACAAT 


GAAAAAGTTG 


ACCATTATTT 


GAAAAC7CA7 


CTGGATGAGG 


AGTCCATTTC 


11280 


TAAGCTATAr* 




ML» 1 i Li AL. AIL. 


1 L» A . L» A 1 A 1 L» 


CTTGCACTTG 


AAAAA I I Cj r .T 


1 1 i a n 
11340 


TTGCGAAAAA 


TTAGGTAGTA 


AAGCAGACTA 


CCAAAGTCAT 


TATGAAAATA 


AGCCAATTCC 


11400 


GAGATTGCTT 


CCTCAGATTA 


TTGGCTTAGA 


TAGAGAGTCT 


GCCAATCGTA 


T77T7TCTAA 


11460 


ATTTTTGTCG 


GATGAGAATC 


TTAATCCCAG 


GCAGAT77CA 


TTTGTAAAAT 


TGATTGTAGA 


11520 


CTACATTGTA 


GAAAATGCTT 


TTTTAGAGAC 


CAAAGTGTTA 


ACGCAAGAGC 


CCTTTAAATC 


11580 


T7ATGGTTCT 


CTTCAACTAC 


TCTTCCAACA 


CCAACTACCA 


GTACTTCGTA 


ATATTGTTCA 


11640 


AATCATTGAA 


CTTATCAATA 


ATCGAGCTGG 


ACAAGCGGCT 


TAAATTC7AA 


AC7CA7TCCC 


11700 


A7GCTGACAC 


TCATTTAAAA 


TTAAAAAGAG 


7AGAAAT7TA 


TCC7A7ATA7 


GAGAAG7TT7 


1176C 
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ATTAGCAAGA 


ATGTCATCGT 


TTTCCTAGAA 


TACAGTATCA 


GTTGTTAAGT GGTTGATAAA 


11820 


TTTCAAAGTA 


GATACTTGTA 


CCACGATGTT 


TGTTGATCGA 


■ .paj ■> % V++fS ft ft ft 

GTTA TTAACA 


AAAGAGCTAC 


11880 


TTTGATTTTA 


AAGAAATAGA 


AAACAAAAAG 


CCGAGCAAGA 


ATTCAATTGC 


AGGAGAAAAT 


11940 


GAAATAATAC 


TCAATGAAAA 


TCAAAGAGCA 


AACTAGGAAA 


C7AGCTGCAG 


GCTGCTCAAA 


12000 


ACACTGTTTT 


GAGGTTGCAG 


ATGGAAGCTG 


ACGCGGATTG 


AAGAGATTTT 


CGAAGAGTAT 


12060 


AAATCTTCCT 


AGGATAAAGC 


AAAACGCATA 


GTATCAAGGG 


TTTTCAACAC 


TTGATACTAT 


12120 


GCGTTTTCTG 


ATGTTAAAGA 


CTTTCTACCA 


GGTTTTTTAA 


AAGCATAATT 


GTTAGTTGTA 


12130 


GTCATTTATT 


ATTCTTCAAA 


GAAAAATGGT 


GGGGCGAATT 


TTTTCAGTTC 


ft ft ft fm ^ ft ^ 

TTCAAAGCAC 


12240 


TTTTGAGCAG 


TATCTGCATC 


TTCACAGATG 


ATAAGACAGA 


CATC ATT ACC 


« ft ft ft f% ■ \ ft 

A C AAAGGGT A 


12300 


GCGATAGCGT 


CAGGGAAGCT 


CAAAGTATCA 


ATGATAGAAC 


CAAAGGATTG 


AGv-CAGTCCA 


12360 


GGAAGGGTTT 


TTAGTAGGAC 


TTGGTGTTGA 


ACTGGGCGCA 


TCCAGACAAG 


GGCGTCTTCC 


12420 


ATGTAGAGTT 


CGAGACGTTT 


TTCCCATTTT 


GAGATGGAAC 


CATTGTTAAG 


AACATAATAA 


12480 


GCCCTATCTT 


CTTCGCCGAC 


TTTTGATAGG 


TTCATATTTT 


TGATGTCGCG 


TGAGAGGGTT 


12540 


GCCTGGGTTA 


CTTGAATGTC 


GTTCTCAGCA 


AGAAGGGCTT 


GCAACTCAGC 


CTGTGTATGA 


12600 


ATCTTGTTTT 


TTGTGATAAG 


AGCGCGTATA 


AGTTGGTGGC 


GGTGTTCTGA 


■ imw ft /wft/v ft ft 

TTTATTCATA 


12660 


ATAATCTAAC 


TCCTTTTAGC 


AAGGTAAGGT 


AAGCATGGAC 


TGAGCGAGGT 


CGACAGTCAA 


12720 


GTGGTAGTCT 


GTATTGTCAC 


GGATGGTGAT 


TTCAAACTCA 


GTAGTATAGA 


GG ACT AAACG 


12780 


GAGAGTGTCT 


CCTTCTTTTA 


CCTTGTAAA7 


AGTTGGCTGC 


AGTTCAAATT 


ft ft ^T*/** ft, ^ 

GAACGTCCAT 


12840 


CCATTCATCT 


GCAGTAATAT 


CCTCTACTAA 


CAGTAAATCA 




GTAAATT AAG 


12900 


GTAACCTTTT 


GTCACGACTC 


GTTGTGCCTC 


TGGTCTAAAT 


GGCAATTCAC 


AGAGATTTTC 


12960 


CAACA7GTGA 


TAGCGACCCT 


TGTCAATGGT 


7CTAGCACTT 


AAAATAGCTG 


-jA r aaggttg 


13020 


TAGGTATTTC 


TTTTGCCCAA 


ATTCTAGCAG 


TTGGGCAGAT 


AAGAGCCCCT 


TGTTTGTACT 


13080 


CGATTTGATA 


CGAAGATTGA 


GCTGAGCGCG 


ACCCTTTACC 


TGGAGATCTT 


TAGTCACAGG 


13140 


AAGGTTAATA 


GTAATCTGA7 


TGGCTTTCCC 


TTGATACAGC 


TCTGTATTGA 


AGGTTTGGTA 


13200 


TGTCTTACCA 


TAGCGCTCAA 


AATCCTTATC 


TGGGTACTCG 


TTTTGAATAG 


CTTGCTCTTC 


13260 


TTGACCAAGT 


GAGAAGCTTT 


CACAGTTTTC 


TTGCCCACCG 


AAGTTATCAA 


GTGATAACCA 


13320 


AGTCTGTGGA 


GCTGTATTGT 


CCTGCCAGAT 


AACAGTAGGA 


AGTTGAAAGT 


CTGTTTCCTG 


13 380 


TCCTAGTAAT 


TTCTTGGTCA 


ATAACGCATT 


TATGCACTCA 


■TGGAAGTCAA 


TTGATTGCCA 


13440 


ATTGTTCATG 


TAAACATGGG 


CACCATTATG 


GAAAAAGAGA 


TGCTTGTGTA 


TATGAGTAGG 


13500 


AAGAGCATGG 


AACATCTGGT 


AAACATGAAG 


TGGTTTCACA 


TTCCAATCCT 


GAGAACCATG 


13560 
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AGTAAAGACA ACCTCTCCCT TTACTTTATC GGCATTGAGC AGAT/VaTTGC GGTCATGCCA 13 620 

AAACTGATTG TAGTCCCCAG TTTTTCGGTC TAGCTGAGCT TTCACTTTTT CTAAGTCACC 136 80 

TTCGTGAGC7 TCATTGCCAC GGATATAGTC GCCAGCTAAG AGATTACGAG AATAGGTTAA 13740 

CTCACCAAGG GAGTCAAAGT CCTCACCTGG ATAACCACCT GGGCTAGTCA CCAGACCGTT 13 8CC 

TTCACGC7AG TAGTTGTACC ATGATGAAAT TCCTGCCTCG GCAATGATAA CTTCTAAACC 13 860 

~AT~ CGACTCCf " GTAGTCGCAA " GACCATT C^^C^CT^CT' AGATAGGAAA " "GTCCTOTTCf i~3920 ~ 

AGCAACTTTT CCGTTTGACC AATCAGCCTT GACTTGACGC TGGCGCGTGT GATCAGTAAA 13 930 

GGCACGGCAA CGACCGTTAA GCCAATCGAT GACATTTTTA TAAGCCTCGA TTTCCTCGTA 14040 

GTCTCCATTA GTCATGAAAC CTGTCGAGTC TTTGGTACCA ACACCTGAGA CAT AG AG ATT 14100 

GGCAAAGCCT CTCGGAAGGA AGTAGTCGTT TAGTGTATAG CTAGACTTGA TGTGAGTTAG 14160 

CTTTTCCTCA GCCTCTGCTA TAAGCTCAGC TTTACCTTGG GGTTGGACGA GATTTAGTTG 14 220 

AGGTTTCTCT AGCTCAATCT TGTGAGGAAG CTTAACCTCA AGCTCGCCCT CCATCTTGTA 14 280 

GAGAGCCTTG TCACTACCCT TGTCATTGCT TCCCTGATGA TAAGGGCTGG CTCTCATGA? 14 34 0 

GGCAGGGATT TTTCCATCAA AACGAGGGCG AATAATGCTA ACCTTTACTA GGTCTGATAG 14400 

CCCTTTTTGG TCACTATCGA CACGAGACTC AACGTAAACG ACTTCACGAA TGACATCCTG 14460 

GTTAGAAAAA GTAGCCAAAC TCTTGCCGTT AAAGTAGTGG TAGTCATTAT CCTCCGCAAT 14520 

AAGACCATCA CTAACAAGTT CGTCGATAAG AGTATTTCCT TTTTTGGTGC GAGTATTGAG 14 580 

TAACTGATAG AGATTTTCAA TCAAGTCACC ATATATAATG CGAAATCCAG TTTCTTTACC 14 640 

AAAAACGTCA CTATCTTCGA AGTCAACCAA ATAAGAAAAG CCTAAAACTT GAAAAGCAAC 14700 

AGTATAAAAA ATATCTGCTG 7CAGTTCATC T7CTGATTGA AAAAATGTCA GCAGGTCTGT 14 760 

TTTTTTATCA GCTCCTAGGA TAGAAAGTGG GTAGTTGGTG TCTTGATAAG TGAAAAAGAA 14820 

ACGACGTAAA AAGG7TTCAA CTGAGTCTTT GTGATTGGCT GTATTTTGTA AATCAAAGCC 14 830 

ACATTTTTTT AGTTCAGATA AGACATTTTC TTTTGGAAAA TTGATATAAC TATATTCATT 14 940 

AAAACGCATA GAACCTCCAT ATAGAATGAC AGTTAAGGTT ATTATATCAA AAAAAAAGCA 15C00 

GAAAGGGAAT TGTTAACTTC AAAAGGAAAT AATCCAATAA AAATGAATAA AGTACTAAAT 15060 

TCAATATAGA GAACAGAGTA ACAATAAGAA TAAATAGATA GGGTATAAAA CTTCTAGCAG 15120 

ATTTATATTA TATGCTTTCT ATTTTTATAT ACAATATAGT ATAAATATAA AAATCATGAC 15180 

AAAAATACAA ATGAA7AGAA AATAAATTAG TAAGCTGATG AAATTTTTCT CAAGAGAAGC 1S24C 

CATTTATAGG TGAAAATGGT ATAATATAGT GAGAAGGATA GAGGAGAAGT GTAAATTGAT 15300 
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CGCACAACTA CATACAAAAA CAGTCTATAG TTTTATGCAA ACCGTCATTT CGATCGAAAA L5360 

GTATGTGAGA GCAGCTAAAG AATACGCCTA CAC7CATTTG CCTATGATCG ATATTGACAA 15420 

TCTTTATGGC GCTTTCGACT TTCTAGAGAT TACAAAAAAA TACGGCATTC ATCCTTTGCT 15480 

AGGGCTTGAA ATGACAGTGT TTGTAGATGA TCAGGGAGTG AATTTCCCCT TTTTAGCTCT 15540 

ATCTAGTGTG GGCTATCAGC AGTTGATGAA GCTTTCGACA GCCAAGATGC AGGGGGAGAA 15600 

AACTTGGTCA GTCCTGTCCC AGTACCTGGA GCATATCGCC GTCATTGTGC C7TATTTTGA 15660 

TAGAGTTGAG TCGTTAGAAC TAGGCTGTGA TTACTATATA GGGGTTTATC CAGAAACACT 15720 

AGCAAGCGAA TTTCATCATC CTATCT7ACC TCTTTATCGG GTCAACGCTT TTGAAAGCAG 15730 

GGATAGAGAA GTTCTTCAAG TTTTAACAGC GATTAAAGAA AATCTACCCC TCAGAGAAGT 15840 

TCCCTTGCGT TCGAGACAAG ATGTCTTTAT ATCAGCAACT TCTTTAGACA AACTATTCCA 15900 

AGAGCGTTTT CCGCAAGCTT TGGACAATTT AGAAAAGCTT ATTTCAGGCA TTTCTTACGA 15960 

CTTGGATACT AGTCTGAAAC TGCCTCGTTT TAATCCAGCT AGACCAGCAG TAGAGGAGTT 16020 

GAGAGAGCGT GCTGAACTGG GGCTTGTTCA GAAGGGGTTG ACTAGTAAAG AATATCAAGA 16080 

TAG ACT AG AC CAAGAATTGT CTGTTATTCA TCATATGGCC TTTGATGATT ATTTCTTGGT 16140 

TGTTTGGGAT TTGTTGCCTT TTGGACAATC GAATGGCTAT TATATGGGAA TGGGAAGGGG 16200 

TTCTGCAGTA GGCAGTTTGG TTTCTTATGC CTTAGACATC ACGGGGATTG ACCCAGTAGA 16260 

GAAAAATCTG ATTTTTGAAC GCTTTCTTAA TCGTGAACGC TATACCATGC CTGATATTGA 16 320 

TATTGATATC CCAGATATTT ATCGTCCAGA TTTTATCAGA TATGTTGGTA ATAAATATGC 16 380 

7AGTAAACAT GCGGCACAAA TCGTTACTTT TTCAACCTTT GGAGCCAAGC AAGCTCTTCG 164 40 

AGATGTCTTG AAACGCTTTG GTGTGCCAGA CTATGAATTA TCTGCAATTA CTAAGAAAAT 16500 

CAGTTTTCCT GACAA7CTTA AGTCCGCCTA TGAGGGAAAT CTCCAGTTTC GTCAGCAAAT 15560 

CAATAGTAAG TTAGAATACC AAAAAGCTTT TGAGATTGCT TGCAAGATAG ACCGCTATCC 16620 

AAGCCAAACC TCTGTCCATG CGGCTGGTGT TGTAATTAGT GACCAAGATT 7AACCAACTA 16 680 

CATTCCTCTA AAGTATGGTG ATGAAATTCC ACTGACTCAC TATGATGCTC ATGGAGTTGA 16740 

GGCTAGCGGA CTTTTGAAGA TGGACTTTCT GGGACTACGA AATTTGACCT TTGTCCAGAA 16800 

GATGCAAGAG TTGCTTGCTG AAACAGAACG TATTCATCTG AAAATTGAAG AAATCGATTT 16860 

AGAAGACAAA GAAACGTTAG CTTTATTTGC CTCTGGTAAT ACAAAAGGTA TCTTTCAATT 16920 
TGAGCAACCA GGTGCCATTC GTCTGCTTAA GCGTGTGCAA CCAGTCTGT7 7TGAAGATGT 
CGTCGCGACT ACTTCTCTAA ATCGACCGGG .TGCTAGTGAC TATATCAATA ATTTTGTGGC 
AAG AAAGC AT GGGCAGGAAG AAGTGACTGT TCTGGATCCA GTACTGGAGG ATATTTTCCC 



16980 
17040 
17100. 
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TCCAACCTAC 


GGCATAATGC 


TCTATCAGGA 


GCAGG7TATG 


CAGGTTGCCC 


AGCGACTTGC 


17160 


CGGATTTAGT 


CTTCGGAAAC 


CCGATATTTT 


GCGTCGGGCT 


ATGGGGAAAA 


AGGATGCCTC 


17320 


TGC CATC CAT 


GAGATGAGGG 


CTTCCTTTAT 


TCAAGGTTCA 


TTAGAACCTC 


GTCATAC7GT 


17230 


GGAAAAAGCA 


GAGCAGGTCT 


TTGATGTTAT 


GGAGAAGTTT 


GCAGGTTATG 


GTTTTAACAG 


17340 


GTCACACGCC 


TATGCCTACT 


CAGCCTTGGC 


CTTCCAGTTG 


GCTTATTTCA 


AAACGCATTA 


17400 


7CCAGCCATT 


TTTTATCAGG 


T C ATGTT AAA 


TTCTTCCAAC 


"actgattact 


TAATAGATCC 


17460 


ACTTGAAGCA 


GGTTTTCAAG 


TAGCCTCTCT 


A7CCATCAAC 


ACCATTCCCT 


ATCACGATAA 


17520 


AATTGCCAAC 


AAGGCCATCT 


ATCTACCTTT 


GAAATCCAT7 


AAAGGAGTCA 


G7AATGATTT 


17530 


AGCTCTCTGG 


ATTATTGAAA 


ATAGACCTTA 


TTC7AACAT7 


GAAGATTTTA 


TAGCTAAA7T 


17640 


ACC7CACAAT 


TATCTGAAAC 


TTCCTCTGCT 


AGAACCTTTG 


GTAAAAGTTG 


GTCTTTTCGA 


17700 


TTCATTTGAA 


AAAAATCGTC 


AAAAAGTATT 


TAATAACTTA 


GCTAATCTAT 


7TCAATTTGT 


17760 


GAAAGAGTTG 


CGAAGTTTCT 


TTGGAGATGC 


7ATTTATAGT 


7GGCAGGAAT 


CGGAAGATTG 


17820 


GACGGAACAA 


GAAAAATTTT 


ATATGGAACA 


AGAGCTTTTA 


GGGATAGGTC 


TCAGCAAACA 


17880 


7CCACTACAA 


GCTATTGCAA 


GTAAGGCTAT 


TTACCCGATT 


ACCCCAATCC 


GAAATTTCTC 


17940 


AGAAAATAGC 


TATGCTATTA 


TCTTGGTTGA 


AGTTCAGAAA 


AT AAAAGTG A 


TTCGTACCAA 


18000 


AAAGGGTGAA 


AATATGGCCT 


TCTTACAGGC 


AGATGATAGT 


AAGAAAAAAT 


TGGATGTCAC 


18060 


TCTCTTTTCA 


GACTTATATC 


GTCAGGTTGG 


ACAGCAAATA 


AAAGAGGGAG 


CCTTCTACTA 


13120 


TGTAAAAGGA 


AAAATACAAT 


CACGTCATGG 


CCG7CTGCAA 


ATGATTGCAC 


AAGAAATAAG 


19180 


AGAAGCAGTT 


GCTGAACCCT 


TTTGGATACA 


GG7GAAAAA7 


CATGAATCCG 


ATCAAGAAAT 


18240 


TTCACGCATT 


TTAGAACAAT 


TTAAAGGCCC 


AA7CCCAGTC 


ATCATCCGG7 


ATGAAGAGGA 


18300 


ACAGAAAACC 


ATCGTTTCTC 


CCCATCATTT 


TGTAGC7AAA 


TCCAATGAAT 


7AGAGGAGAA 


13360 




ATCGTTATGA 


AAAC GAT TT A 


TCGCTAAAAA 


TACCGAAAAT 


AGAAGAATTT 


1 84 2C 


TCAACGTAAA 


TGTGGTATAA 


7CAGTAAGAA 


TGTTAAAAGA 


AAAAGGAGCA 


TAACCAATAT 


J 8430 


GAAACCTATT 


GC7GTTTTGA 


CTAGTCGTCC 


AGACGCCCCT 


GGTATGAACG 


CTGCCATCCG 


18540 


TGCAGTTGTT 


CGTCAAGCAA 


TTTCAGAAGG 


AATGGAAGTT 


TTTGGTATCT 


ATGACGGATA 


18600 


TGC7CG7ATG 


GTTGCCCGTG 


AAAT7CATCC 


CCTAGATGCA 


GCTTCAG7AG 


GGGACATCAT 


13660 


TTCTCGTGGT 


GGTACTTTCC 


TTCACTCAGC 


TCGTTACCCA 


GAGTTCGCTC 


AACTTGAAGC 


18720 


GCAACTTAAA 


GGGATTGAGC 


AATTGAAAAA 


ACACGCAA7T 


GAAGGTGTAG 


TTGTTATCCG 


19730 


TGGTGACGGA 


TCTTACCACG 


GCGCTATGCG 


TT7GACTGAA 


CATCGCTTCC 


CAGCTATTGG 


18840 
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TCTTCCAGGT ACAATCGATA ACGATATCGT TGGTACTGAC TTTAC", TCG GTTTTGACAC 18900 

AGCGGTTACT ACTGCCATGG ACGCTATCGA TAAGATTCGT GATACATCAT CAAGTCACCG 189 50 

TCGTACTTTT GTAATCGAAG TTATGGGACG TAACGCTGGT GATATCGCTC TTTGGGCTGG 19020 

TATTGCAACT GGTGCTGATG AAATCATCAT CCCTGAAGCA GGCTTCAAGA TGGAAGATAT 19080 

CGTAGCAAGC ATCAAAGCTG CTTATGAATG TGGTAAAAAA CACAATATTA TCCTCTTAGC 19140 

TGAAGGTGTG ATGTCAGCGG CTGAATTTGG TCAAAAACTT AAAGAAGCTG GAGATACAAG 19 2 00 

CGACCTTCGT GTAACAGAAC TTGGACATAT TCAACGTGGT GGTTCTCCAA CTGCGCGTGA 192 60 

CCGTGTTTTG GCGTCACGTA TCCCTGCACA TGCTGTTAAA CTTCTTAAAG AAGGTATCGG 193 20 

TGGTGTTGCG CTTCCTATTC GTAACGAAAA AATGGTTGAA AATCCAATTC TTGGTACTGC 19 3 80 

AGAAGAAGGG GCATTGTTTA GCCTTACTGC AGAAGGTAAG ATTGTGGTTA ACAACCCAGC 1944 0 

TACAAA 19446 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH : 16S93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 52: 

TCGTAAATAT CCTCTGTTTT TGGATTTTGT TTCTTAATCT CTTTCGCAAG TGCCTTCATC 60 

ATAGAAATAG GACCACACAT ATAGACGGTT GCATGTTCGG CCACTTCT7T TTGTTCAAAA 120 

TTAAGATAGC CGTCTTTCGT ACTGTCGATT AGATGGAGTT CAAAATTAGG ATTTTTCTGA 130 

GCATAGTTAC GCAGTAAA7C TAGGTAGACT CCATTTTCAT CTCCACGCAA GCTATAGTAG 24 0 

AAGTGAACCT GTTTATCTAA AATAGGATGT TCACGGATGT AAGAGATGAA GGGCGTGATC 300 

CCAATACCTC CAGCAATCCA AACCTGATTT TCTCGTCCTT CTTCTATGAT CATGTGTCCG 360 

TAAGCTCTGT CTAGGGTTAC TTTGCTCCCG GCTTGAAGAT TATCATAGAT ATTCTTGGTA 4 20 

TGCTCCCCTG AAGTTTTAAC AGTAAAGTAA AGAGTTTGAC CATCACCTCC TGAGATAGAA 4 80 

AAGGGATGCG GAGCACTTTC AAAGCCTTCT TGGAAAATCT TTAGAAAGGC AAATTGTCCT 54 0 

GATTGATAGT TGAAAGGTCT GCTAAGATGG ATTTGAATTT CTCTAGTATC GTGATTTAAG 600 

CGTTTGAGAT GGGTAATTTT CCCTAGATAG GGGAAGGAAA TCTTTTCATA TAG AAAAATG 660 

ATATAAAAAC CAGCTAGTAA GCCTAAAAGG GCATAGCTAC CAACAAGAAA ACTTAGAAGA *?20 

TTAAATGTAA GGAGACGATT GCCCATTATC ATGTAGATGT GAAAGAG7CC TAAAATATAG 78 0 
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GCTAGGTAAA CCAGGCGGTG AA7CCATCGC CAAGCTTCGT ATTGGATGTA TTTGCCTAAA 840 

TAGCCGACAA GGATGATGCT CGCAAACATA TAGATGGCAA GATTGCCAAA CTGAGCAGCT 900 

AAGCGAGAGC CCCACAAACC GCCCATACTA AAGTTATGAA AGA77AG7AG GA7GAT7GAG 960 

AGAAAGGCTG TGAAT77G7G GACGGTGTAG ACCTTCTCCA AACTGTGAAA CCAGCTTTCT 1020 

AG7AG7GGGA GACGAGTGGC TAGGATAAAA GTCAGAGATA GGCTTCTTAA AGCTAGTCCT 1080 

GGAATCATGA A7TGGGGAGA AG7GTTCATC CAAGTCAAAA GAGTCAAGAT AAAACTAGCT 1140 

ATGA7AAAGA GTAGTCCTTT GACTGATTTC A7ACAAAA77 CCATTTCATT TAGATTTCGA 12 00 

TTTGTTGTAA A7AAAT7TC7 TACATTTTAT CATAGAAAAT G7ATGG7GTC AAA7TGAGC7 12 60 

CTATAAATAT CTACTCTCAT CAAAAAACTC TCCAATTGAA CTGGAGAGTG GCTGTTTATA 13 20 

CTCAATGAAA ATCAAAGAGC AAACTAGGAA GCTAGCCGCA AGTTGCTCAA AACACTGTTT 13 80 

TGAGGTTGCA GATAGAGCTG ACGTGGTTTG AAGAGATTTT CGAAGAGTGT TATTCTGCAG 1440 

CTTGTTGCCA ACGTTTGGCT AGCATATGAG ACAGGCTAGA AATTGCTAGG TTAAAGCTGA 1500 

AGTAGATGAG GGCAATCAGG ATGTAAAGAC TGAACACCTG CTCTGGTTCG AAATAACGGC 1560 

CCATGAGAAT T7GGC7CGC7 CCAAAGAGTT CTTGTAGGGC GATAACAGAG TAGAGGAGAC 1620 

TCG7A7CC7T AATCACGGTA ACAAACTGAG AAATGATGGC TGGTAGCATT TTGCGGATGG 1680 

CTTGTGGGAG AATGATGTAG TAGAGGATTT GGGCTGAGGT GAAGCC7TG7 GACATTCCTG 1740 

CTTCGTACTG TCCCTTGTCT ACGGCATTGA GACCGCCTCC AATAATCTCA GCCAAGGCTG 1800 

CTGATGTAAA GAGAGTAAAG GCTGTAATAC C7GC7GG7C7 GCA7T7CA77 TTGAACACCA 18 60 

AAAACA7AGT AAAAATCCAG ACAACG77CG GAACGTTGCG CACAAACTCG ATATAAATAC 1920 

7GGAAA7AA7 GCG7AAGACA GCA7TTT7CC CA777C7CG7 GACAGC7AGC ACCCTACCGA 1980 

7GATAGTAGA GAGGATGATG GCAATCAGAG AAATATAGAG GG7CAAGCCA AATCC7T7AA 2040 

AGATAAAGAC 7AGGT7ATC7 GGGG7TAAAA C77C7AAAAT AGAT7CCA7A GTAACCTCC7 2100 

AAAC7CAA7A GGC7T7T77G 77GGC7TGC7 CCA7CTTGCG ACCAAAC7GG GCAACAGGGA 2160 

ACCATAGAGC AAAGTAGAGA AGAGCAGCAC C7AAAAAGGC 7GGTA7A7AG 7T7CCG77GA 222C 

GAGCCGACCA AGACTTAGTC ACAAACATCA AG7C7ACTCC AGAGATGA7A GCTACAGTAG 2 280 

AGG7C7TCTT GATGAGG7TA ACAA777GG7 7GG7CAATGG AGGGAGAATG ATCCGGAAGG 2 340 

CCTGAGGCAA GA7AA7CAAG CGCATGGCAC 7GA7A7AGG7 AAAACC77GC GACAAGGCGG 2 400 

CC7CCA7C7G ACCAC7AGGA ATAGACTGAA 7CCCTCAACG AATAACC7CA GCGATATAAG 2 4 60 

CGCCG7GA7A GAG7CCCACG CAGAGAACGG C7G7CCAATA AA77GGAA7C ATGATGATA7 2 5 20 
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GGTCACTGAT AAGAGC7AGG CCATAAAAAA CAATAACAAA CTGCACCAAG AGGGCAG7A7 2580 

TTTGGTAAAA TTCAACAAAG ATGCGAGCTA AAATGCGTAA AATTGGACCT TTACTGG7TG 264 0 

ACATGGCACC AAAGAAGATG CCCAAAACCA 7ACCGAGGAT AAAGGAACCA ACCGCTAGGG 2 700 

CAAGGGTGAA GAGGAAACCA TTGAAAAATT GTCCAAAATC CTGAAAATAG CCTCTCCAAG 2760 

ATGATAAATC TGTCATGGGG TGTCCTCCTT AATCTGCAGT ATGGCTAGAT GGTTTGACCT 2920 

TGTAACGGTC ATAAAGTTTC 7GCAAACTAC CATCCTTGCT CCATTTAGTA ACCAAGTTAT 2380 

CAAGATAGTC GTTGACCTCT GTA7TTGATT TCTTGGTAAC AATACCGTAG TCAGA7CGCT 2940 

TGAAACTATC ATCTAGTAGT GCTGTCCGTT 7AC7AGTGTA GCCAGATAGA A7AGAGCCGT 3000 

CAACGGAAAA GGTATCGATA CGATGAGCGT GC AGGGAAGT AA7CAATTC7 GGGTACCAAC 3060 

CAAGTTCCAC GAATTTAAAC TTCAGACCTT TCTTTTTACC CAGTTCAGTA ATCAGGCCTT 3120 

GGGTGATAGA ACCTTGGGCG ACTCCGATGG TTTTGCCGTT TAGGTCCTCA A7CTTTT7GA 3180 

TT7TGGCAGA TTTATTGACC AAAAATC C AG AAGCGTCTGT GTAGTAGCGA CTCGTAAAGT 3240 

TGTAGAGTTT TTTGCCTTCC TCCC7CATCG 7AAAGCTCCC GATATCCATA TCGACCTGTT 3 3 00 

CATTGTCTAG AAGGGGGCCG CGGGTTTGTG CTGTAACCGG CACATAGCGA ATCTTGACCT 3 3 60 

TGAGTTCATC AGCTACCATC TTGGCCAAGT CGGTTTCGAT ACCAGAATAA GTACCCG7CT 3420 

TCGGATCTTT GTAACCAAAA TTCGGAACGT CTTGTTTGAC ACCGACAACC AGTTCGCCTC 3480 

TTTTTTGAAT GTCTGCGATA CTTGTATCAG CCTGGACTGG TTTGGCAGCA GCAAGCCCGA 3 34C 

AAACGCTAA7 C AAT AATGCT GATAAAAAGA ATTTTTTTTC ATAGGCGCCT CCTTATTTGA 3600 

CTTTG7CACT TTCG7CG77C ATAATTTTCC TGAGGAATTG T7GCCCACCA GGT7CGCTTG 3 660 

GATTGTCAAA AAAG7TATCG ACATCTGTCG TATC7ACTAA AACTTCTCCG TCGGCCATAA 372 0 

AGA7AATGCG G7CCGCAACC TCTCGAGCAA AGCCCATT7C GTGGGTAACG ATCA7CA7GT 3 7 90 

TCATCCCATC ATGCGCCAGT TTCTGCATAA CTGCTAGAAC ATCTCCGATA GTCTCAGGAT 3840 

CAAGAGCAGA TGTTGGTTCA TCAAAGAGGA GGAGT7CCGG A7GCATAGCA AGACCACGAG 3900 

CGATGGCGAT CCGC7GTTTT TG7CCACCAG ATAGCATGGC GGGA7AGCAA 7CTTTC7TGT 3 9 60 

CCCACATATT TACAAATTCC AGA7ATTTTT GGGCGC7TT7 7TCAGCTTCT TTTTTA7CAA 4020 

T7CCTAGAAC TTCAATGGGT GCAAGCGT7A CGTTTTC7AA CACAGC7T7G TGTGGATAAA 4080 

GGTTAAAATG TTGAAAAACC ATGCCGACTT CCTTGCGAAG AGGTACCAAA TCTTTCTGGC 4140 

TGGCACCAGC AACTTGGTGC CCA7TGACTA GGAGAC7TCC TTTGTCAACA G7CTCTAAAC 4 20C 

CATTCA7CGT ACGGATAAGA GTGGACT7CC CAGAGCCAGA AGG7CCAAGC AGGACAACAA 4 260 

CTTCTCCTTT TTCAAAACGG AGATTGATG7 TGCGGAATGC GTGG7AG7CT CCG7AATATT 4 320 
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TTTCGACGTT TT7AAA7TC7 ACTAAAGCCA TGAGAGATC7 C7AT7GTCTT ATAT7TTATA 4 3 80 

ACACGGTTCT ACAATAAAAG AATGTTCTTG TCAAATCATA TCTGAAAAAA TTCACTATAG 4 440 

TGAAATAAGA ACAGGAAAAA TCGATCCCCA CAGTCAAATC GATT7CTAAC AATATT7TAC 4 500 

AAGTAGAGGT GTACTATTCT AGTTTCAATA TACTATAAAA TCTTATAAAA AAGCAATCTC 4 560 

GAT AG AG AAA ACGTCTAAAT CATGTTATAA TGAAGCAATA GAATTCTTAG AAAGAGTGGA 4 620 

TGTCT7T7TC ATAACACCTA C7TATGAATG GCAGTTTGCC CTGCAGCTAC AAGATGCGGA 4 630 

TT7TACAAAG ATAGCCAACA AGGCTGG ACT GGGTCCTGAG GTCCC7CCGT TATTGTTTGA 4 7 40 

GAGAGGCATT CAGAACCAAG AAAGTCTGAA CAAGTTTTTA GAACCTTCC7 TGGACGACT7 4 800 

ACATCATCCT TATCTGCTCC ATGATATGGA CAAGGCAC7G GAGCGGATTC GTCAGGCTAT 4860 

TGAAGAAGGG GAAAATATTC TTG7TTATGG AGACTATGAT GCGGATGGCA TGACTTCGGC 4 920 

TTCTATTGTG AAGCAAACTT TGGAACAACT TGGTGCTGAG TGCCGAGTTT ACCTGCCAAA 4 980 

TCGTTTTACC GATGCCTATG GCCCTAATGC TACTGTTTAT AAATACTTTA TCGAGCAAGA 504 0 

ACGGATTTCC TTGATTGTGA CGGTGGACAA TCGGGTTCCT GGTCATCAGG CTATTGCATT 5100 

GCC7CACTCT ATGCGAG7AC ATC7CATTGT GACACACCAT CATTCCATGC CTGAAACCCT 5160 

GCCAGATGCT TATGCTATTG TCCATCCTGA ACATCCAGAT GCCGATTATC CTTTTAAATA 52 20 

TTTGGCTGGT 7GTGGAGTTC CTTTCAAGTT GGCTTCTCCC CTGTTAGAAG AAGTGCAAGT 52 80 

CGAAT7GCTT GATTTGCTCG CTATTCGAAC TATTGCAGAT ATGGTGAGTC TGACGGATGA 53 40 

AAATCGTATC TTAGTTCAAT ATGGTCTGGA AATCT7CGGT CATACCCACC GCA77CC7CT 5400 

GCAAGAAATG CTGCACATGG CTGGCATTCC TGCCAACGAA GTAACAGAAG AAACGGTTGG 5460 

777CCACA77 CC7CCTCGT7 7CAATCCCT7 GGGTCGCTTG GATCATCCCA ATCCTGCCAT 5 52 0 

TGATTTGTTG ACTGGATTTG ATGATGAGGA AGCGCATGAG ATTGCCCT7A TCA7TCACCA 5 580 

GAAAAACGAA GAGCGCAAGG AAA7CCTTCA GTCTATCTAT GAAGAACCCA AGACCATCGT 5640 

GGA7CC7GAG AAGAAGG7TC AGGTCTTGCC CAAGGAAGCC TGGAATCCTG GGG77CTAGG 57 00 

AATCGTGGCT GCTCCTTTAT TGGAAGAATT GGGACAGACA G7CA7TG7TC TTAATATACA 57 60 

AGACGCTCCT GCCAAGGCCA GTGC7CGTAG 7GTGGAAGCG GTCGATATT7 TTGAAGCTCT 532 C 

GGATCCCCAT CGAGACCTCT TCATCGCCTT TGGAGGTCAT GCAGGTGCAG CGGGTATGAC 5880 

GCTCGAAGTT GAGCAACTCT CAGATTTATC TCAGGTTTTG CAAGAT7ATG 7TCGTGAAAA 5940 

ACGTGCAGAT GC7CGTGGCA AGAATAAGT7 AAACCTAGAT GAAGAG7TGG ATTTGGAGGC 6000 

ACTTAGCT7G GAAACGGTCA AAAGTT77GA ACGTTTAGCT CCT7TTGGAA TGGATAATCA .6060 
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GAAACCTATT 


TTTTATATCA 


AGAATTTTCA 


GGTCGAAAGT 


CCTCGTACTA 


TCGGGGCAGC 


6120 


TAATGCCCAT 


CTAAAGCTGA 


AAATTTCCAA 


GGGTGAGGCG 


AGTTTTGAAG 


TGGTAGCCTT 


6130 


TCGTCAAGCC 


AGATGGGCGA 


CAGAGTTTTC 


TCAAACCAAG 


AATCTAGAGT 


TAGCCGTTAA 


6240 


ATTGTCTGTC 


AACCAATGGA 


ATGGCCAAAC 


TGCCCTCCAG 


TTGATGATGG 


TGGATGCGCG 


6300 


AGTGGAAGGT 


GTTCAACTTT 


TTAACATTCG 


TGGAAAAAAT 


GCAGTCTTGC 


CAGAAGGTGT 


6360 


TCCAGTCTTG 


GATTTTCCTG 


GAGAACTGCC 


AAATCTTCCC 


GC7AGTCAAC 


CTGTTGTCGT 


6420 


AAAAAACATT 


CCAGAGGATA 


TTACTCAGCT 


GAAGACCATT 


TTTCAGGAAC 


AGCATTTCTC 


6480 


TGCTGTCTAT 


TTCAAAAATG 


ATATTGACAA 


GGCTTATTAT 


CTGACAGGTT 


ATGGGACTAG 


6540 


AGATCAGTTT 


GCCAAATTGT 


ACAAGACTAT 


TTACCAGTTC 


CCAGAGTTTG 


ATATTCGCTA 


6600 


CAAGCTGAAA 


GATTTGGCTG 


CATATCTTAA 


TATTCAACAA 


ATCTTGCTGG 


TCAAGATGAT 


6660 


TCAAGTATTT 


GAAGAACTAG 


GCTTTGTGAC 


GATAAAAGAT 


GGTGTGATGA 


CAGTCAATAA 


6720 


AGAGGCGCCA 


AAGCGGGAGA 


TAGGAGAAAG 


TCAAATTT AC 


CAAAATCTCA 


AACAAACCGT 


6780 


TAAAGACCAA 


GAAATCATCG 


CGCTGGGTAC 


CGTGCAACAA 


AT7TATCA77 


TTTTGATGGA 


6840 


AAAAGAG7AG 


AAGTTAGGAA 


AGACTTGGGA 


AATCAACTCT 


TTTTTGAAAA 


CAGACCTTCA 


6900 


TTTTGAAAAT 


CATCAAAAAA 


ATCGTATAAT 


GGTAGGAAAA 


GATTCGGCTG 


AAAGTATCAG 


6960 


AACTTTTAGA 


ATAAGAGGGT 


AGAATTCCCC 


TATAATCAAG 


ATAAACTAAG 


ATTTTGGAGC 


7020 


AAAAATGAGT 


AATATCAGTT 


TAACAACACT 


TGGTGGTGTG 


CGTGAGAATG 


GAAAAAATAT 


7080 


GTACATTGCT 


GAAATTGGAG 


ACTCCATTTT 


TGTTTTGAAT 


C7AGGG7TAA 


AA7ATCCTGA 


7140 


AAATGAACAA 


TTAGGGGTCG 


ATCTGGTGAT 


TCCAAACATG 


GA7TACC7TT 


TTGAAAATAG 


7200 


CGACCGTATT 


CCTGGGGTTT 


TCTTGACCCA 


CGGGCATGCG 


GATGCCATTG 


GTGCTCTACC 


7260 


GTATCTCTTG 


GCAGAGCCTA 


AAGTTCCTGT 


ATTTGGGTCT 


GAGTTGACCA 


TTGAGTTGGC 


7320 


AAAGCTCTTT 


GTCAAAGGAA 


ATGATGCCGT 


TAAGAAATTT 


AATGATTTCC 


ATGTCATTGA 


7380 


TGAGAATACG 


CAGATTGATT 


TTGGTGGGAC 


AGTGCTTTCC 


TTCTTCCCTA 


CGACTTACTC 


7440 


CGTTCCAGAG 


AGTCTGGGAA 


TTGTCTTGAA 


GACATCGGAA 


GGAAGCATCG 


TTTATACAGG 


7500 


TGACTTCAAA 


TTTGACCAAA 


CGGCTAGTGA 


ATCTTA7GCA 


ACTGATTTTG 


CTCGTTTGGC 


7560 


AGAGATTGGT 


CGTGACGGCG 


TCCTGGCTCT 


CCTCAGTGAT 


TCGGCCAATG 


CAGACAGCAA 


7620 


TATTCAGGTG 


CCTAG7CAAA 


GTGAAGTTAG 


GGA7GAAAT7 


ACCCAAACTA 


TTGCTGACTG 


7630 


GGAAGGTCGT 


ATCATCCTTG 


CAGCTGTTTC 


CAGTAATCTT 


TCTCG7ATTC 


AGCAGATT7T 


7740 


TGACGCTGCG 


GATAAAACAG 


GTCGACGTAT 


CC7CTTCACA 


GGATTTGATA 


TTGAAAATAT 


7800 


CGTCCGCACA 


GCGATTCG7C 


TTAAGAAGTT 


GTCTTTAGCC 


AACCAAATTC 


TTTTGATTAA 


7860 
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GCCTAAAGAT ATGTCTCCCT TTGAAGACCA TGAGTTGAT7 ATTCTTGAGA CAGGTCGTAT 7 920 

GCCTCACCCT ATCAATGGAC TTCGTAAGAT GTCGATTGGT CGCCATCGTT ATGTAGAAAT 7 980 

CAAGGATGGG CACCTAGTCT ATATTGCTAC GGCTCCCTCT ATTGCTAAAG AAGCCTTTGT 8040 

TGCGCGTGTG GAAAATATGA TTTATCAGGC AGCTCGCCTT GTCAAATTCA TTACCCAAAG 8100 

TTTACATGTA TCAGGGCACG GAAATGTGCG TGATTTGCAG CTGATGATCA ATCTTTTGCA 8160 

ACCTAAGTAC CTCTTCCCTG TCCAAGGGGA GTATCGTGAG TTGGATGCTC ACGCTAAGGC 82 20 

TGCCATGGCA GTTGGGATGT TGCCAGAACG CATCTTCAT7 CCTAAAAAGG GGACGACCAT 82 80 

GGCTTACGAG AATGGAGACT TTGTTCCAGC TGGATCGGTT TCAGCAGGAG ATATCTTGAT 3 340 

TGATGGGAAT GCCATTGGTG ATGTTGGAAA TGTTCTTCTT CGTGACCGTA AGGTCTTGTC 8400 

AGAGGATGGA ATTTTCATCG TGGCTATTAC AGTCAACCGT CGTGAGAAGA AAATTGTGGC 8 4 60 

TAGGGCTCGT GTTCACACCC GTGGATTTGT TTATCTCAAG AAGAGTCGCG ATATTCTCCG 8520 

TGAAAGTTCA GAATTGATTA ACCAAACGGT AGAAGAGTAT CTTCAAGGAC ATGACTTTGA 8530 

CTGGGCAGAT CTCAAAGCTA AGGTTCGTCA CAATCTGACC AAGTACCTCT TTGATCAAAC 8640 

CAAGCGTCGC CCAGCCATTT TACCAGTAGT CATGGAAGCA AAATAATCGT TGAAATAAAC 8700 

AGAGAGAAAG TCGAGTTTCG GCTTTTTCTT ATAGAAAAAT AGAAGGAGAA AATCATGGCA 8760 

GTGATGAAAA TCGAGTATTA CTCACAAGTA TTGGATATGG AGTGGGGGGT GAATGTCCTC 8820 

TACCCTGATG CCAATCGAGT GGAAGAACCA GAGTGTGAAG A7ATTCCCGT CTTCTACCT7 3880 
TTGCACGGGA TCTC7GGAAA TCATAATAGT TGGCTTAACC GGACCAATGT AGAACGCTTG 
CTTCGACCAA CTAATCTCAT CGTTGTTATG CCCAATACCA GCAATGGTTG GTACACCGAT 
ACCCAGTATG GTT7TCACTA CTACACCCC7 CTAGCAGAGG AATT^CCACA GCTTCTCAAA 
CGCTTCTTCC CTAATATGAC GAGCAAGCGT GAAAAGACC? TTATCCCTCG TCTTTCTATG 
GGAGGCTACG GCTGCTTCAA ACTGGCTCTT ACGACAAATC GTTTTTCTCA TGCAGCTAGT 

TTTTCAGGTG CCCTCAGCTT TCAAAACTT7 TCTCCTGAAA GTCAAAATCT GGGAAGTCCA 9240 

GCCTACTGGA GAGGTGT*TTT TGGAGACATT AGAGACTGCA CAACTAGTCC CTATTCTCTT 9 3CG 

GAAAGTCTGG CTAAAAAATC GGATAAAAAG ACCAAACTTT GGGCGTGGTG TGGCGAACAG 9 36C 

GATTTCTTGT ACGAAGCCAA TAATCTCGCA GTGAAAAATC TCAAAAAACT AGGTTTTGAT 942 C 

GTGACCTATA GCCATAGCGC TGGAACTCAC GAGTGGTACT ACTGGGAAAA ACAATTGGAA 9480 

GTTTTTTTAA CAACCCTACC AATTGATTTC AAATTAGAAG AGAGACTGAC TTAGTTTGAA 9540 

CTTCAGCATA GGGGGAGTAG AACTAAAATA AAATA7CTTT TCACTAGACT TTTCAAACGm 960C 



3940 
9000 
9060 
9120 
9180 
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AAGTACTACA ATAGTAATAA AATACTGGAG GAAAGAGAGT AGGAAATGTA CCGTTATCAA 9 660 

ATTGGCATTC CC AC ATT AG A ATATGATCAG TTTGTCAAAC AACATCAATT AGCCAATGTA 97 20 

TTACAAAGTA GTGCTTGGGA GGAAGTTAAG TCTAATTGGC AACATGAGAA GTTTGGTGTT 97 80 

TACAGGGAAG AAAAATTACT GGCGACAGCT AGTATTTTGA TTAGAACTCT TCCGCTAGGC 9840 

TATAAAATGT TTTACATCCC AAGAGGACCT ATATTGGATT ATGGGGATAA AGAACTCTTG 9900 

AATTTTGCCA TTCAGTCTAT TAAGTCCTAT GCTCGCAGTA AGAGAGCGGT TTTTGTGACT 9960 

TTTGACCCAA GTATTTGCCT ATCTCAAAGT TTAATCAATC AGGAAAAGAC AGAATTTCCT 13020 

GAAAATCTGG CTATTATTGA TAGTTTGCAA CAAATGGGAG TAAGGTGGTC AGGAAAAACG 10G80 

GAGGAAATGG GAGACACCAT TCAACCTCGT ATTCAGGCGA AAATATACAA GGAAAATTTT 101 40 

GAAGAAGATA AACTTTCCAA GTCAACAAAA CACGCTATTC GAACAGCACG AAACAAAGGG 10200 

CTTGAGATTC AATATCGTGG ACTGGAACTA TTAGATTCAT TTTCGGACTT GATGAAAAAA 102 60 

ACTGAGAAGC GAAAAGAGAT TCATTTGAGG AATGAAGCCT ATTATAAAAA ATTGTTAGAT 10320 

AATTTTAAGG ACAAGGCCTA TATCACCTTG GCCACCTTGG ATGTTTCTAA ACGTTCGCAA 103 80 

GAGTTAGAAG AACACTTAGC GAAAAATAGA GCCTTGGAAG AGACCTTTAC TGAGTCGACT 104 40 

CGAACTTCAA AAGTAGAAGC GCAGAAGAAG GAAAAAGAAC GTTTGTTAGA GGAATTCACC 10 SCO 

TTCTTGCAGG AATATATAGA TGTAGGTCAA GCGAGAGTTC CTTTACCCGC TACTTTGAGT 10560 

TTGGAATTTG GTACTACCTC TGTCAATATA TATGCTGGTA TGGATGATGA TTTTAAACCT 10620 

TACAATGCAC CAATTTTAAC TTGGTATGAA ACGGCTCGCT ATGCCTTTGA ACGAGGTATG 10630 

ATCTGGCAAA ATTTAGGTGG TGTTGAAAAC TCTCTCAATG CTCCAC7TTA TCATTTTAAC 10740 

GAAAAATTTA ATCCAACGAT TGAAGAATAC TTGGGTGAAT TTACAATGCC CACTCATCCT 10800 

CTC7ATCCTC TGTTAAGACT TGCTCTTGAT TTCCGTAAAA CATTAAGAAA AAAACATAGA 10860 

AAGTAAGTAT ATGGCACTAA CAACACTCAC GAAAGAAGAG TTTCAGACTT ATTCTGATCA 10920 

GGTTTCTTCT CCTTCCTTTA TGCAATCTGT CCAGATGGGG GATTTGCTAG AAAAAAGAGG 10980 

GGCTCGAATT GTTTATCTTG CTTTGAAACA AGAAGGAGAA ATTCAAGTTG CAGCTCTGGT 11040 

TTATAGCCTG CCCATGCTGG GTGGTCTGCA TATGGAACTC AATTCGGGGC CGATTTATAC 11100 

CCAACAAGAT GCTCTTCCAG TTTTTTATGC AGAGTTAAAA GAATATGCCA AGCAAAATGG 11160 

TGTATTAGAG TTGCTTGTAA AACCCTATGA AACTTATCAA ACTTTTGATA GCCAAGGTAA 1122G 

TCCAATAGAT GCTGAGAAAA AAAGTATTAT TCAAGATTTG ACTGATTTAG GTTATCAATT 11230 

TGATGGCTTA ACAACAGGTT ACCCAGCTGG AGAACCAGAT TGGTTATACT ATAAAGATTT 11340 

AACTGAATTA ACTGAAAAGA GTTTGCTTAA AAC7TTTACC AAAAAGGGTA AACCCTTGGT 11400 
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GAAAAAGGCT 


GAAACCTTTG 


CCATTCGGTT 


GAAAAAGTTA 


AAACG7GAAG 






TTTTAACAAT 


A7AACAAAAC 


AAACCTC7GA 


ACGTAGAGAA 


*^A7 ACTGATA 


AAACTTTAftA 
AAAO 111 ngn 


* 1 <i ■? n 


A7ATTA7CAC 


CATTTTTATG 


ATACTTTTGG 


AGAACAAGCG 


GAG7T Tr TrA 




1 1 SQft 


AAATTTTTCG 


GACTATATGA 


GCAAATTCCA 


AGG7GAACAA 


AG^AAACTAn 


A A P A A A h P»t»T 


* 1 &40 


GGACAACTTG 


CGACTTGATT 


TGAGTAAAAA 


TCCTC ATTCT 


GAHAAAAA Af* 


i a a a Tr & a /—t 

1 V. AAi. I 


117 00 


GAGAGAATAT 


TCTAGTCAAT 


T7GAAACGT7 


TGAAGTTCGA 




U\j<-OALiAt_ I I 


11760 


GATTGAAAAA 


TA7GGAGAAG 


AAGATATTGT 






I I I ATA i OCC 


1192 0 


7CAGGAAACG 


ACTTATCTCT 


TTAGTGGTTC 


C7ACACTGAC 


1 1 . rtrt i * 


* i a ruoccc 


11880 


TGCACTGCTT 


CAAAAATATG 


TTATGTTGGA 


AAGCATAAAA 


rGTC.flAATAf 


CT A A AT" iriA 
L AAA i A^» AA 




CTTCCTAGGC 


ATTCAAGGGA 


TTTTTGATGG 


AACTCATCGT 


VJi l I I vjv-\j i I 


t tAAAOAOAA 


12000 


T7TTAATCGC 


7A7AT7CTAC 


GCAAAGCAGG 


TACT7TCCGT 


7ACCATGCAT 


I 1 t AAA 


1 £ U OU 


ATACAAAGCT 


ATCCAGTTAC 


TCAAAAAAAT 


AGT AGG APfiT 




MU 1 v_ AO 1 A r 


12120 


77AGATTTCT 


77TAGCTTCT 


TTTAGTAAAA 


TAATTf^TAT 


TTfiPTAflAAA 


CfTf?* fiTAfA 


1 2 IdO 


CATGCGCTGG 


CTTTTTCGTT 


TGATAGGGGC 


TTITTTTTCT 

■ 1 » V. 1 1 i tv. 1 




Ul . lUi 1 1 




GCGTCTGGTT 


TGGATAGTTG 


TGCTCTTATG 


TGTC C TTG CT 






i t inn 


GAACGGAGAT 


TTTCAAGGAG 


CGCTAAAGCA 


AGCAGAAC^G 


^PAflTAAA A A 


TTfVTTf™ & Af~ A 


I J J bU 


AAGTATTGAC 


CAATGCGAGA 


AAACAGGGCA 




"*T A AC C C A C A 


^.AuM t AO 1 


12 420 


CCAGCATTCT 


GAAGGAAGGT 


GGGCACAGGC 


C7CTGC7CCT 


fti I i rtC\- l OO 


iTrrr.pArsT 
A I LLOLnvjn i 


i •) it art 

1 j. 1 d u 


GGATTCACCC 


77TCAAGAGG 


CTTATTTAGA 


AGCAATCCAG 


AAPTftnAATP 


AAAPTr*f;Tf*P 




TTTTAACTTT 


GAACTCGTGA 


CTCAGTCTAC 


"AAGCCGGAT 


ATTArr.nrTA 


V. jonUn • VjAA 


i -> 'nn 
1 «, o UU 


CGACGGAGGC 


AC7CCTG7GG 


CAGGAGAGGC 


GGAAAGTCAA 




TAAPin^rTA 
I rtALnuVjOL A 




ATTCTTGTCC 


GTAACGGTGC 


GGTTGAATCA 


TTATTATTTG 


7CCAA7CCAT 


AC7ATGCCTA 


12720 


CTCCTATGAA 


CGCCTTGTCC 


ATACGGCAGA 


ACATGAGTTA 


GG7CA7GCGA 


T7GGCT7GGA 


12780 


CCATACAGAT 


CAGAAGTCTG 


TCATGCAACC 


AGCACCTTCC 


T7TTA7GG7A 


7CCAGGAAGA 


12840 


GGATGTTGCA 


AACCTCCGAA 


AAATATATGA 


GACTAGTGAG 


7ACGG7AC7A 


TCTTTCCCTA 


12900 


CTTTTTTTGC 


7ATAATGGAA 


CTATGAACAA 


CTTGATTAAA 


7CAAAAC7AG 


AGCTC7TGCC 


12960 


GACCAGCCCT 


GGTTGCTACA 


TTCATAAGGA 


TAAAAATCGC 


ACCAT7A7C7 


ATCTAGGAAA 


13020 


CCC7AAAAAT 


C7CCGTAA7C 


GAGTACGGTC 


CTA77TTCGT 


CGAAG7CA7G 


A7ACCAAGAC 


13080 


AGAGCC7C7C 


GTGTCTGAAA 


7TC7CCA7T7 


TGAATTTATT 


GT7ACGGAG7 


C7AA7AT7GA 


13140 



WO 98/18931 



PCT/US97/19588 



474 

GGCACTTC7C CTAGAAATCA ACCTGATCAA GGAAAACAAG CCCAAGTACA ATATCATGCT 
CAAGGATGAC AAGTCCTATC CTTTCATCAA AATCACCAAT GAGCGCTATC CACGCTTGAT 
TATCACTCGT CAGGTCAAAA AGGACGGAGG TCTTTATTTT GGACCCTATC CCGATGTGGG 
GGCAGCCAAT GAAATCAACC GGTTGCTGGA TCGGATATTC CCTTTTCGTA AGTGTACCAA 
CCCGCCCTCT AACGTCTGT? TTTATTACCA TA7CCCCCAG TCTATGGCCC ACACCATCTG 
TAAGAAGGAT GAGCCTTATT TCAAGTCTAT GGCCCAGGAG GTGTCTGATT TTCTGAAAGG 
TCACGATGAC AAAATCATCC ATGATCTCAA CAGTAAAATG GCAG7ACCAG CACAAAGTAT 
GGAGTTTGAA CGTGCGGCGG AATACCGTGA CCTGATTCAG GCTATTGGAA CGCTTCGAAC 
CAAGCAACGG GTCATGGCGA AAGATTTGCA AAATCGCGAT GTCTTTGGCT ACTATGTGGA 
TAAGGGCI'GG ATG7GTGTCC AGGTTTTCTT TGTCCGTCAG GcAAGCTCAT CGAGCCCGAT 
GTCAATCTCT TCCCCTACTT CAATGATCCA GATGAGGATT TTTTGACCTA TGTAGGACAA 13800 
TTCTATCAAG AAAAATCTCA TCTAGTTCCC AATGAGGTAC TGATTCCGCA GATATTGACG 13860 
AAGAAGCTGT CAACGCTTTG GTGGATTCCA AGATTCTTAA GCCTCAACGT GGAGAGAAAA 
AACAACTGGT CAATCTACCC ATAAAAAATG CTCGTGTTAG TCTAGAGCAG AAGTTCAATC 
TGCTAGAAAA ATCTGTCCAA AAGACTCAAG GACCTATTGA AAATCTAGCG CGTTTGCTCC 
AAATCCCGAC CCCAGTACGT ATCCAGTCCT TCCATAACTC TAATATCATG GGAACTAGCC 
CTGTTTCGGC TATCGTGCTC TTTGTCAACG GTAAACCGAG TAAGAAGGAT TACCGTAAGT 
ACAAGATAAA AACGGTTGTT GGACCACACG ACTATGCCAG CATGAGAGAG GTCATTCGCA 
GACGCTATGG TCGACTACAG CGTGAGGCTT TGACTCCTCC AGATTTGATT GTGATTGATC 
GGGGGCAAGG TCAACTCAAT ATCGCTAAGC AGGTTATCCA AGAGGAACTG GGCTTGGATA 
TTCCAATTGC TGGGCTGCAA AAGAATGATA AGCACCAAAC CCATGAATTG CTCTTTGGAG 
ATCCGCTTGA GGTGGTGGAT TTGTCTCGCA ATTCTCAGGA ATTTTTCCTC CTCCAACGCA 
TCCAAGATGA GGTGCACCGC TTTGCTATCA CTTTCCACCG CCAACTGCGC TCCAAAAATT 
CTTTCTCATC TCAATTGCAT GGGATTGACG GTCTGGGACC TAAACGCAAG CAGAATCTTA 
TGAAGCATTT CAAGTCTTTG ACCAAAATCA ACGAAGCCAG TGTGGATGAG ATTGTCGAAG 
TTGGGGTACC TAGAGTCGTT GCAGAGGCTG TGCAAAGAAA GTTGAACCCG CAGCGAGAAG 
CCTTGCCTCA AGTAGCAGAA GAAAGAGTAG ATTACCAAAC CGAAGGAAAC CACAATGAAC 
CATAAAATCG CAATTTTATC ACATG7TCAT GGCAATGCGA CGGCGCTAGA ACCAGTGATT 
GCAGATGCTA AAAATCAAGG GGCCAGTGAA TATTGGCTTC TGGGAGATAT TTTTCTTCCT 
GCTCCAGCCG CAAATGACTT AGTCGCCCTG CTAAAGGACC TTCCTATCAC ACCAACTCTT 



13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 



13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14823 
14890 
149<;C 
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CGAGGCAATT GGGA7GATCG TGTCCTTGAG GCTTTAGATG GGCAATATCG CTTAGAAGAC 15000 

CCACAGGAAG TTCAGCTCTT GCGTATGACA CAGTATTTGA TGGAGCGAA? GGATCCTCCA 15060 

ACCATTCTCT GCCTACCAAG CTTCCCTTTG CTGGAAAAGA AAGAAATTGA CCGATTGCCC 15120 

TTTTCTATCT CTCATAA7TT ACCTGACAAA AACTATGGTG GTGACTTGCT AGTTGAGAAT 15180 

GATACACAGA AATTTGACCA ACTGCTAGAT GCGGAAACGG ACGTGGCAGT TTATGGTCAT 15240 

■-CTTCACAACC AGTTGCTTCC TTATGGAAGT CAAGGGCAAC AAATCATCAA TCCAGGGTCG " 15 300 

ATTGGCATGC CCTATTTTAA TTGGGAGGCG TTAAAAAATC ACCGTTCCCA GTATGCCGTG 15360 

ATAGAACTTG AAGATGGGGA ATTACTCAAT ATCCAATTTC GTAAAGTTCC TTA7CATTAC 154 20 

GAAGCTGACT TAGAATTGCC CAAG7CCAAC GGGCTTCCCT TTATCGAAAT GTATGAAGAA 15480 

CTGCGTCGTG ACGATAACTA TCAGGGCCAC AATCTGGAAT TATTAGCCAC CTTAATAGAA 15540 

AAGCATGGCT ATGTAGAGCA TGTGAAGAAT TTTTTTGATT TTTTGTAACA GTTTCCTAAA " 15600 

ATAGCCAATG CAAACTAAAA AAGCGATTTG CTGGTCCAAT CGCTTTTAGT ATATCTTATA 15 660 

CTCAATGAAA ATCAAAGAGC AAACTAGGAA GCTAGCCGTA CCTTGCTCAA ACCACAGCTT 15720 

TGAGGTTGCA GATAAAGCTG ACGTGGTTTG AAGAGATTTT CGAAGAGTGT TATTGTAACT 15780 

GAGATTGATC TGGCAGGTAA GAACCACCTA GATAGGTATT GCTGAGTTTT TCAAGGGTTC 15840 

CGTCTTGATA GAGTTCTTTG AGCGCTTTAT CAAATTGCTC TTTAAACTCT TTTTGGTCGC 15900 

TTGAGAAAAT GATATAATTG CTGGGGCTAT CTGCAGAAGG TAAATCAACG ACTGAGAGGT 15 960 

CTAAACCACG CTCCTTCATA ATCTTTTGAA CCGATACCT7 CTCAAAAAC? AGGAAATCAA 16020 

ACTCTCCGTT AGCAACGTC? AGGATTCGTT TACCAATATC CTCACCAGAA AAATTAATTG 16080 

TAGCGGGATT ATCAGTGTGT TTCTGATTCC AGTTATTGAT GAATTGACCG TTAGAAGT7C 16140 

CGCTATCCTC TTGTGTTCTT TTACCAGCGA TCTGCTCAAG AGAAGTCAAA GCATTTTTCT 16200 

TGTTGCTGAC AAGGACGAGG GGATTGTTGG AAATTGGAAG CGAGTAAAGG TATTTTTCAG 162 60 

CACGCTCTTT TGTCTAACTC AAGTTATTGG CCGCAGCCTG ATAGTGACCA CAATCAAGTC 16 3 20 

C7CCCAAGAT CCTCTCCCAC GCGCTTCTTT GGAATTGAAT CTCGTAGTCG CTGAGTTTTT 163 80 

CATCTACTGC CTTTAAAACT TCGATATCAA AGCCTGTCAG ATTGCCCTTG TCTTCGTAGT 16440 

CAAATGGTGG CACGTCGCCA GCTGTAGCAA GGACCAT7GT CTTTTGAGCG CTAGTCTCTT 165CC 

TGGGTGTACC TTGATTCTCA CAGGCAACCA AAAATGGTAG GATAGCTAGT AATAGGCTAA 16560 

ATTTTTTCAT ACTGTCTCCA TTCAAATGTA AAC :6593 
<2> INFORMATION FOR SEQ ID NO: 53: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 3510 base pairs 
(3) TYPE: nucleic acid 
(C) STRANCEDNESS: double 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



CGGATATCCT 


TATATCCTTG 


TTCCTGGAAC 


CATTGTGCGA 


ATTGCTCAAC 


ACTTTTTTCA 


60 


CCTTGAATTC 


CTGGTGCAAT 


GACAGTAAGA 


ATTTCGAAAT 


CACGATCTGG 


TTTCGCCGCT 


120 


AGTTCCATCA 


ACTCTGGCAT 


ACTTTTCTTG 


CATGGACCAC 


ACCATGAAGC 


CCAAAACTTC 


180 


AAGTAAACCT 


TTTTACCCTT 


AAAATCAGAT 


AACTTAACTT 


CTTTGCCATC 


CATGGATTGC 


240 


AATGTGAAGT 


CTGGAGCATC 


TTTTCCAACA 


GCAATTTGTT 


GTACAGTCGT 


TTGTTGTTTT 


300 


GGCTGTTGTG 


CTGCTTGAGT 


CTTTTTAGTT 


TCTTCCTCAC 


CACAGCCCAT 


CAATACAACT 


360 


AATGACAAGA 


GACTTAAGCC 


AGCAAACATT 


ACTTTTTTCA 


TTTGTCCTCC 


TTTATTCAAA 


420 


AATTCCAGCT 


AGAACATTTA 


CTTGTCCTAA 


TACT AAC AAA 


ATTCCCATTA 


AAACAATGAG 


480 


GAAACCACCA 


ATTTTCTTTA 


GT AG CATC AT 


ATGACGCTTC 


ATTTTACTAA 


AATATGCCAT 


540 


GACTAGACCT 


GAAGCTAGTG 


CCAATACCAA 


GAAAGGAAGG 


GCCATCCCaC 


AGTGTAAATG 


600 


AG ACT AT AAA 


TCGCTCCTTG 


CCAAGCGCCA 


TTGCCTCCAG 


AAGCCCCAAG 


TGCTAAAACA 


660 


GAACTTAAAA 


CTGGACCAAT 


ACAAGGTGTC 


CAACCAAAGC 


TAAAGGTAAT 


ACCAAGTAAA 


720 


AAAGCTGACC 


AATAACGATT 


AGAATCTGAT 


TTTTTAAAGG 


TAAAACTTTT 


TTGAACTTCT 


780 


AAT7TCTTCA 


AATGAAAAAT 


TTCCATCTGG 


TGAAGACCCA 


AAATGATAAT 


AATAGCTCCC 


840 


ATGCCATATC . GAAACCAATT 


TGCATAGAGA 


ATATGACCAA 


AGTAACCAGC 


ACCAAACCCT 


900 


AGAATAAAGA 


AAATGAGACA 


GATACCAGCG 


ATAAAGCAAA 


CTGTTCGAAT 


CAAGCCTGAC 


960 


CAGAGAACCT 


TTCTCCCAAA 


CAAAGAAAAG 


CTTTTTGCAC 


TTTCTTGATC 


ATCCAATAAA 


1020 


ATCCCAGCAT 


AGACTGGCAG 


AAGAGGAAAA 


ATACAAGGAG 


AAAAAAAGGA 


TAAAACACCT 


1080 


GCTAGAAAAA 


CAGAGATTAA 


AAATACTATC 


GTTTCCAATA 


AAGAACCAAC 


TTTCTTAATA 


1140 


ATTCTAATCC 


TATTTTACTA 


TATTCAATTT 


TATTTGTAAG 


CTTTCTGCTA 


CGCAAAATCG 


1200 


TATCGGGCAC 


TATTGGACCA 


ATCTTTTCTT 


TTGCTACTCA 


AGGCGGATCT 


TATCCCCCAA 


1260 


AATAGCCAAA 


AAGCAACGAC 


AAGGATTACT 


CATCGCTGCT 


TTTGTGAACG 


AAAATCTCTT 


1320 


TTAGGTCTGA 


CATTTCATAA 


ATCATGTTTT 


ACTTGAGTTT 


GTCAAGGATT 


GCTTTAAGCT 


1380 


CCTCTACTAC 


TTTAGTTTCT 


GTCTCTGCTG 


AGCCATTTTC 


TTCTTTCACG 


AAATCAAGGG 


144C 


TTTCTTGGAG 


AACCTTTTCG 


GCTTTGGCAA 


GGACTTTTTT 


ATCCCC7TTT 


TCTGCATCTA 


150C 
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rw\L V. 1 1 w/\ 1 1. 


AATTCCGTGC 


TTAATTGCTG 


GATT" ;GAC 


TCTTTCTTAC 


•560 


GGLGAA 1 O AO 


O C Ay AAOO*- A 


ATCACCCCTA GGAGGGCAAG 


TAGACTGACC 


ACAATCACTC 


1620 


CTGCCGGAAt- 


TGAGTTTGTT 


TCAGTCATCT 


TATCTGAATC 


CTTACTATCT 


TCCGTTCCTT 


1680 


GTTTTGCATC 


cttc r ro i 


TGTGCAGGCT 


TGCTGTCGCT 


AGCATTTGCT 


TTCACATCTT 


1740 


TGAGAGAGTC 


CAAGGOAOLL 


CACCCTTCAC 


AGACTCTACT 


GCAGTATGCA 


GACCTTACTC 


1300 


TGTCAAGGCA 


CT ATCTT(_ L O 


GAGCTTTTTG 


AGCATCTAGG 


AGGACAGCCT 


TGGTTGCATC 


"i860 


GATTTTCGGA 


TC AG AT AC TG 


TTCCCAAAGC 


TTTCAAGCGT 


TGGTCTAACT 


CTTGACTCAA 


1920 


GGCACGAAGT 


TCAGAC.TTO I 


CAACTTGCTC 


TTGAGCTTGT 


GTGCTCCTTG 


AGCTAGCCGA 


1980 


AGCGCTTGCT 


R oo 1\ OTV **p *AO 

ACLAO I v_ 1 AO 


GATCTTGAGT 


CGGAGCTGAC 


CTTGGAGCTG 


GGACAGGGCT 


2040 


TGCAGGTTGA 


CTAGGAAOAO 


TT ATGGT AT A 


TTGAAACTAG 


AATAGTACAT 


ATGGACTTCT 


2100 


AAAACATTGT 


T AGAA 1 1 \_OA 


TTTTACTCTC 


CTGATCGATT 


TGTCCTATTC 


TTATTTCATT 


2160 


TTAC TA TAA L 


AALtOA I OU I 


GTGGTTAATG 


TTGGTAAGAG 


AAACTTCTGA 


AACCAAGCTT 


2220 


CAAAAAAG tC 


OCTCG * LA L U 


CTCTCTTCGT 


AAGTCATTGG 


AGCGATTAAT 


TCACCATTTG 


2280 


TTAGACCTGC 


AACC AAAOAA 


ATCCTCTCAT 


ATCTTCTTCC 


AGATACTTTG 


CCTCTTATTA 


2340 


ACTGACCTTT 


TAATGAGCGA 


CCATATTCTC 


GATAAAAATA 


AGTATCGAAT 


CCTGTTTCGT 


2400 


CAATCTAAAC 


ft /"»0*t*0 n OO 

AGGTG CT AGG 


TGCTTTAAAC 


TATTAAAATT 


CTTAAGAAAT 


AAGGCTACTT 


2460 


TTTCTGGGTC 


TTGTTC AT AG 


TAGGTGTGGT 


7CTTTTTTTC 


GAGTGTAGCC 


CATACCTTTC 


2520 


AGCGCATAGT 


GGATGG i AO * 


TGGATGACAG 


CCAAAicTCAG 


AAGCTATTTC 


AGTCAAATAA 


2580 


GCrTCTGGAT 


TGTC- AO I AAO 


ATAGTTTTTA 


AGTCTATCTC 


7ATCAACTTT 




2640 


GTTCCTTTTA 


OTTOOTOOTT 

CTTGOTOO i I 


TAGCTCTCCT 




TTAGCTTTAA 


CCAGCCATAA 


2700 


ATGGT ATT AC 


OTGAOA 1 * * O 


GAAAACCTGT 


GATGCTTCTG 


TTATACTACC 


TATTCGCTCA 


2760 


CAATAAGAGA 


GAACTTTTTT 


ACGAAAATCT 


ATTGAATATG 


CCATAAGAAG 


A i i A * A*-L Hk, 


2 820 


ATTGTGTACT 


ATTTTTCCT7 


CATTTCACTA 


TAACACAAAA 


TAG ATT ATT A 


TTACATAACA 


2880 


AAAAAGAGGT 


CTAAACCTCT 


TAACTCAATT 


ACTCCGCCAG 


TAGGACTCGA 


ACCTACGACA 


2940 


TCATGATTAA 


CAGTCATGCG 


CTACTACCAA 


CTGAGCTATG 


GCGGATTAAA 


GCTAAGCGAC 


30C0 


TTCCCTATCT 


CACAGGGGGC 


AACCCCCAAC 


TACTTCCCGC 


GTTCTAGGGC 


TTAACTTCTG 


3060 


TGTTCGGCAT 


GGGTACAGGT 


GTATCTCCTA 


GGCTATCGTC 


ACTTAACTCT 


GAGTAATACC 


3120 


TACTCAAAAT 


TGAATATCTA 


TTCAATTTAA 


GAAAACCGTT 


CGCTTTCATA 


TTCTCAGTTA 


3180 


CTTTGGATAA 


GTCCTCGAGC 


TATTAGTATT 


AGTCCGCTAC 


ATGTGTCGCC 


ACACTTCCAC 


324C 
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TTCTAACCTA TCTACCTCAT CATCTCTCAG GGCTCTTACT GATATATAAT CATGGGAAAT 3300 

CTCATCTTGA CCTGCJtTtCA CACTTAGATG CTT7CAGCGT TTATCCCTTC CCTACATAGC 3360 

TACCCAGCGA TGCCTTTGGC AAGACAACTG GTACACCAGC GGTAAGTCCA CTCTGCTCCT 3420 

CTCGTACTAG GAGCAGATCC TCTCAAATTT CCTACGCCCG CGACGGATAG GGACCGAACT 3480 

GTCTCACGAC GTTCTGAACC CAGCTCGCGT 3510 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

I A) LENGTH : 20986 base pairs 
(3) TYPE : nucleic acid 
;C> STRAND EDNESS : double 
(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 54: 



CGGAGAAAAA 


CATGGCTAAG 


TCAAACTTTG 


AAAAAGTAGA 


ATCACTTGTT 


CCCTCCCTTC 


60 


GTGATAACAA 


AATCACAGGC 


TACCGTATCT 


CTAAAGAAAC 


GAATGCGCGT 


GAAATGTCTA 


120 


TCATTGCTCT 


GGCGCACGGT 


CGTGCAAAAG 


TAAAAAATAT 


TTCATTTCAA 


ACAGCCCTAG 


180 


GCCTAATTGA 


TTTCTATGAA 


AAAAATTATG 


AAAAATTTGA 


AGATTAATCT 


TTGGATAACG 


240 


GCGGATTCTT 


GACCTTCAAG 


TAGTAGAGAT 


AGAGAATCTG 


CCTTTTCATT 


TTGAGGACAG 


300 


CAAAAAGACT 


GCACGGTTGA 


TGCACCCTTT 


TCTTTTTATT 


TGACATACCG 


TTGAACGAAC 


360 


TCTTTTGTTC 


GGTCTTCTTT 


AGGATTGGTG 


AAGAGGTCTT 


CTGGTTTACC 


TTCTTCAGCG 


420 


ATCACGCCCT 


TATCCATAAA 


GATAACACGG 


TGAGAGACAT 


CACCCGCAAA 


TTCCATTTCA 


430 


TGGGTTACGA 


CAATCATGGT 


CAACCCTTCC 


TGAGCCAGGT 


CCTGCATGAT 


TTTGAGGACT 


540 


TCTCCAACCA 


TTTCTCGATC 


GAGAGCTGAT 


GTTCGTTCAT 


CAAAGAGAAT 


AGCCTCCGGA 


600 


TTCATGGAGA 


GGGCACGAGC 


GATGGCCACA 


CGTTGTTTTT 


GACCACCTGA 


CACTTGT7TT 


660 


GGTTTGCCTT 


GCCAGTAGCC 


TTCTCCCATG 


CCGACCTTTT 


CCAGCTTTTC 


TTTGGCAATC 


720 


TTTTCAGCTT 


CTGTGCGTTC 


GCGTTTTAGG 


ACAGTTGTCT 


GAGCGACGAT 


TGTGTTTTCA 


780 


ACAACATTCA 


GATTTTCAAA 


GAGGTTAAAG 


GATTGGAAAA 


CCATCCCCAA 


CTTTTCACGG 


840 


TATTGCGTGA GCTCATAGCC 


TTTTTCGAGG 


ACGTTTTGTC 


CATGATAAAG 


GATTTGTCCA 


900 


TCAGTTGGTG 


TTTCAAGTAG 


CTTAATCGAG 


CCTAGGAAGG 


TCGATTTTCC 


GCTTCCAGAG 


960 


CTTCCGATGA 


TAGAGATGAC 


CTCTCCCTTG 


TGGACAGTGA 


GTGAAATGTC 


TTTTAGCACT 


1020 


TCGTTTTGTC 


CATAGGATTT 


TTTGAGGTGT 


TTAATTTCAA 


GGATTGCTTG 


TGTCATTATT 


1080 


TCAAATCCTC 


CGTTTGCATT 


TGGTTAGCAC 


CTGTAGTGTA 


GGTATCCATG 


TCCATTCTGC 


1140 
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GCTCCATAAA 


GCGTAGGATA 


CGTGTTACGG 


TCAACCTCAC 


GACAAAGTAA 


ATCACGGCGA 


1200 


TGATTGTAAA 


TG7CTGGAAG 


TATTGATAGG 


TTTGTGTTGC 


CACGGTATTT 


CCTGAGAAAT 


1260 


AAAGTTCGAC 


AACAGAGATA 


ACGTTCAATA 


CAGATGTATC 


TTTGATATTG 


ATGACAAATT 


1320 




TGCACGTAGG 


ATGTTACGGA 


CTACCTGAGG 


TAGGACAATC 


TTACGCATGG 


1380 


TCTGGTTATG 


GGTCATACCA 


AGAGCAGTCG 


CAGCTTCAAA 


TTGTCCCTTG 


TCAACTGCTA 


1440 


GGATACCACC 


ACCGACGATT 


TCAGTCATGT 


AGGCACCGGT 


ATTGATTGAA 


ACGATGAAGA 


~~Tsod 


TAGCAGCCAG 


TG7ACGGTCA 


AGGTTGATCC 


CGAAACCTTG 


GGCAGTTCCA 


TAGTAGATAA 


1560 


CCATCGATTG 


AACAATCATT 


GGCGTACCAC 


GGAAAATTTC 


AA7GTAGACA 


TTGAGAACCC 


1620 


AGCCGACTAG 


TTTTTGTAGG 


CCGTAAATGA 


CTTTGTTTTC 


AGAGAGAGGA 


GCAGTACGGA 


1680 


AGACACCAAT 


GGCAAGTCCA 


ATAATGAGAC 


CTATCATGGT 


TCCGACGATA 


GACATTAAAA 


1740 


GAGTGATACC 


AGCACCACGC 


AAGAGTTGTT 


GCCAGTTTTC 


AGAAAGAATT 


TTAGCAACTT 


1800 


GGCTAAAGAA 


ACTACTGCTA 


GTCTCTTCAG 


TTGTTGTAGC 


TTCCGCAGGT 


TCTTCCTTGA 


1860 


TCATACGATC 


CATCAAGGCA 


ACTTGGTCAT 


CTTTTGAAAT 


GGTTTCAATG 


CTGGCATTGA 


1920 


T'^GGCTAAT 


ACGATTGTCA 


TTTTTACGAA 


GCCCGATAGC 


GA7AGCTGTA 


TCTTCTTCCC 


1990 


GACTT^TGAA 


ACCAGGTTCT 


ACTTGAATCA 


TCTTGAACTT 


AGAG7TCGCA 


GCTTCAGCAG 


2040 


TCAGTGCTTC 


TGGACGTTCA 


GAAACATAAG 


CATCAATGAC 


ACCAGCCTCA 


AGAGCTTGTC 


2100 


GC AT^TG AGC 


GAAGTCTCCC 


ATGCCTGTTT 


CTTTTTTAGC 


ACCTGGGATT 


TGTGCAATCA 


2160 


AC^TATAAAG 


GTAGACCCCT 


TGTTGAGAAG 




ACCGTTAAAG 


T CATC C AAA G 


2220 


ATTTAGCACT 


TGCGTAGGCA 


GAATCTTTTT 


TGACAAGCAA 


AACTGG7TCG 


CTAGTATAGT 


228C 


AACTGCTCGA 


AAAGGCAATT 


TCTTGTTTGC 


GTTCTGCAGT 


TGGACTCATA 


CCTGCGATAA 


2340 


TCATGTCAAT 


CTTACCAGAA 


GTAAGGGCAG 


GGACTAGACC 


TTCCCACTTG 


GTTTTAACAA 


2400 


CCAAAGGTTC 


TTTACCTAAG 


TCCTTAGCGA 


TTTTCTTGGC 




T CGT A^G CC m 


2460 


TGGCATACTG 


ATTGGTCCCA 


TCGATTTTGA 


CAGCTCCGTT 


GCTATCATCA 


TCCTGGGTCC 


2520 


AGTTAAAGGG 


AGCATATGCT 


GCTTCCATAC 


CCATCCGTAA 


ATATTCATCG 


GCTTGAGCAA 


2580 


CATTGACAAG 


TCCTAGCATC 


AGCAAGAGAC 


TTGTGAAAAT 


AGATAACTAy 


ATGTGGCTCA 


2640 


7GATTTCTCC 


TATTCTCATC 


TATTAAAAAA 


TAACTG7CTC 


CTATTTTATC 


GAAAAATGCG 


2700 


TAATTTTTCA 


ACATAAGTAA 


GTCTTTACTT 


ACCAAAAAAT 


GCTATAATGA 


TAAGAAAGAT 


2760 


AAAAAGGGGG 


CTTAGTTGAT 


GAAAAAAACT 


TTTTTCTTAC 


TGGTGTTAGG 


CTTGTTTTGC 


2820 


CTTCTTCCAC 


TCTCTGTT7T 


TGCCATTGAT 


TTCAAGATAA 


ACTCTTATCA 


AGGCCATTTG 


2880 
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TATATTCATG CAGACAATAC GGCAGACTTT AGACAGAAGA TAGTTTACCA GTTTGAGGAG 2940 

GACTTTAAGG GCCAAATCGT GGGACTTGGA CGTGCTGGTA AGATGCCTAG CGGGTTTGAC 3000 

ATTGACCCTC ATCCAAAGAT TCAGGCCGCG AAAAACCGTG CAGAACTAGC AGATGTGACT 3 060 

AGCGAAGTAA CAGAAGAAGC GGATGG7TAT ACTGTGAGAC TCTATAATCC AGCTCAGGAG 3120 

GGCGACATAG TTGAAG7TGA CC7CC7CTGG AACTTAAAAA ATTTACTTTT CCTTTATGAT 3180 

GATATCGCTG AATTAAAT7G CCAACCTCTG ACAGATAGTT CAGAGTCTAT TGAAAAGTTT 3240 

GAATTTCATC TAAGGGGAGA CAAGGGGGCT GAAAAACTCT TTTTCCATAC AGGGAAACTT 3 300 

TTTAGAGAGG GAACGATTGA AAAGAGTAAC CTTCATTATA CTATCCCTT7 AGACAATCT? 3 3 60 

CCGGCTAAGC GTGCAGTTGA GTTGCATGCC 7ATTGGCC7C GGACCGATTT TCCTACCGC? 3 4 20 

AGGCATCAGG GATTGAAAGG GAATCGTTTA GAAGAGTTTA ATAAGATAGA AGACTCGATT 3 4 80 

GT TAG AG AAA AAGATCAGAG TAAACAACTC GTTACTTGCG TCCTCCCT7C GATCCTT7CC 3 540 

ATCTCCTTGT TATTCAGTGT CTGCTTCTAT 7TTATTTATA GAAGAAAGAC CACTCCTTCA 3 600 

GTCAAATATG CCAAAAATCA TCGTCTCTAT GAACCACCAA TGCAATTAGA GCCTATGGTT 3 660 

TTATCAGAAG CAGTCTACTC GACCTCCTTG GAGGAAGTGA GTCCCTTGGT CAAGGGAGCT 3*720 

GGAAAATTCA CCTTTGATCA ACTTATTCAA GCTACCTTGC 7ACATGTGAT AGACCGTCGG 3? 80 

AATGTCTCTA TCATTTCAGA AGGAGATGCA GTTGGTTTGA GGCTAGTAAA AGAAGATGGT 3 840 

TTCTCAAGCT TTGAGAAAGA CTGCCTAAAT CTAGCTTTTT CAGGTAAAAA AGAAGAAACT 3 900 

C7TTCCAA7T TGTTTGCGGA TTACAAGGTA TCTCATAC7C TTTATCGTAG AGCCAAAGTT 3 9 60 

TCTGATGAAA AACGGATTCA AGCAAGAGGG CTTCAACTCA AATCTTCTTT TGAACAGGTA 4 0 20 

TTGAACCAGA TCCAAGAAGG AGTGAGAAAA CGAGTTTCCT TC7GGGGCC7 CCCAGATTAT 4 080 

TATCGTCCTT TAACTGGTGG GGAAAAGGCC TTGCAAGTGG GTATGGGTGC CTTGACTATC 414 0 

CTGCCCCTAT TTA7CGGATT TGGTTTGTTC 77CTACACTT TACACCT7CA TGGCTATCTT 4 200 

TACCTCCCTT TCCCAA7ACT TCGTTTTCTA GGGTTAGTT7 TGTC7GT7TT CTATTATTGG 4 2 60 

AAGCTTCGAC TAGATAATCG TGATGGTGTT CTAAATGAAG CCGCAGCTGA GGTCTACTAT 4 3 20 

CTCTGGACCA GTTTTGAAAA TATGTTGCGT GAGATTGCAC GATTGCATCA GCCTCAACTG 4 3 30 

GAAAGTATTG TGCTCTGCAA TCGCCTCTTG GTCTATGCGA CCTTATTTGG CTATGCGGAC 4 44C 

AACGTTAGTC ATTTGATGAA GGTTCATCAG ATTCAACTGG AAAATCCAGA TATCAATC7C 4 50C 

TA7C7AGCTT ATGCCTGGCA CAGTACGTTT TATCATTCAA CAGCACAAAT CAGCCA7TAT 4 5 50 

GCTAGTGTCG CAAATACAGC AAGCACCTAC TCTCTATCTT CTGGAAGTGG AAGTTCTGGT 4 62 0 

GGTGGCT7CT CTGGAGGCGG AGGTCGCGGC AG7ATCGGTG CCTT77AAAC AGAGCTACCA 4 680 
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TAGACTGAAA 


AAGTATGATA 


TAATGGAAGA 


TAGAAAAAAG 


ACAAACTATA 


AGAAAAGTCA 


4740 


ATAGTTTTAT 


CTAAACTATT 


TCTTATTTCA 


ATTTGATGAT 


TTGGCGATGA 


TTTTAGAGCA 


4800 


CGGCAAAAAG 


CCCTTGAAAA 


AGTCCATTTT 


TTCAAACGTA 


ATCCTGTGTT 


AATTTCAGAA 


4860 


ATT A CATC AC 


TTTT7GTTCG 


TCAAATGGCA 


GCTCTTTTTT 


AGGATATAAA 


ACAGGGTTCG 


4920 


H ATAAfITT m T 


TTTGCAAGGT 


GGATGATGCC 


TACATTGTAA 




ATTCTAACTT 


4980 


AHTCTT AAG A 


^ACGCCTTAG 


AAGCAGGTGA 


AAAGCGAGGG 


CATCCTTTGG 


CAGCTTGTAT 


5040 




CGCAGATGAG 


GGGAACCCCG 


TTTG AC C ATT 


CTTCCAGCTA 


AATCAATCTG 


5100 


ACCTGACTGA 


TAAATAGAAG 


AATCCAGTCC 


AGCGAAAGCT 


TGTAATTGAG 


CAGGATTATC 


5150 


AAAGGPATGA 


ATATTTCGAA 


TCTCGGCTAA 


AATGACCGCC 


CTAAACGATC 


CCCAATCCCA 


5220 


flTAACCGTCG 


TGATGACCGA 


GTTGAACTCA 


GCCATCGAGT 


CATTGATACA 


TCTTTCCCCC 


5280 


T TGTC AA TG A 


GCCTCTTGTA 


ATGCTTGATG 


ATTTCGAATT 


CACCAGCAGG 


AGATGTTGTT 


5340 




GAGGTGCGAC 


TGAGAGGATA 


TCCTGAATTT 


TAGAAGCGGT 


CAATCGCTTA 


5400 


ATTTCTATCA 


GCTTATCAAA 


TCCTGCCTCA 


ATCCTTTTCT 


GAGGATTAGG 


GTAGCGTGTC 


5460 


A A n AG* T TGCT 


AGGTATATTC 


TGAATGCTTT 


CCAACGATTT 


TATCCAACTC 


AGGAAAGATG 


5520 


ATATPAAflAC 


AACGAGTGTA 


TTGTACTTTC 


CAATCAGACT 


CTTTTTCTTG 


AGACGATGAA 


5580 


T ATftTr'T Aflf* 


ACT ATTTTT 


AGGTCTACTT 


GCCGATTATC 


CTGTTGAAAT 


TGTTCACGAT 


5640 




AAGAAGTTTA 


AGAGCGATGC 


CATGAGCGTC 


TTTCTTATCC 


GTTTTACTCT 


5700 




^AATGATTTG 


GCAAATTCCT 


TCATCAGCAA 


AGGATTGTAG 


CTGTAAACTT 


5760 


T A T A*'* C CTT G 


T^CATGCAGG 


AAGTTCAGTA 


CATTAAACCC 


ATAATGTCCA 


GTATCTTCAA 


5820 


HAHrnATGAG 


ACAGTCTTGG 


TTGATCTGTC 


GAATAGACAG 


ATCTAAGAGT 


TCAAAACCAC 


5380 


PTT T ATT ATT 


TGAAAAAGTG 


AGTGGTTTAA 


GAACAGTTTT 


TCCTCGAACA 


TTCAACGCTG 


5940 


TAACATCGTG 


TTTATTTTTA 


GCGA7ATCAA 


TG CCT AC AT A 


AAGC A i vjOU A 




6000 


ATATAGTATT 


TCAAGTCTAC 


TTGGTTATCC 


ACGAATTTTT 


TCCCTTGTTA 


CCTTAGACCA 


6060 


GATCAAACGT 


CTATGCGTTA 


TCAAACTCAT 


TACCAATTGA 


AACAAAAGCT 


GTGCTTAGAG 


6120 


CCTTTCGGAA 


ATCGTCAACC 


GATTGGAGGA 


AATGAACTAA 


TCCATAGTCG 


CTTATTCCAA 


6180 


GTATACCACT 


TGGCCTTTGG 


CAGTAGCTAA 


CTGCGCTAAA 


TATAATATAG 


GGAGTAATCT 


6240 


ATGTATCTTA 


TTGAAATTTT 


AAAATCTATC 


TTCTTCGGAA 


TTGTTGAAGG 


AATTACGGAA 


6300 


TCC7TGCCGA 


TTTCCAGTAC 


AGGTCACTTG 


ATTTTAGCAG 


AGGAATTCAT 


CCAATACCAA 


6360 


AATCAAAATG 


AAGCCTTTAT 


GTCCATCTTT 


AATCTCCTCA 


TTCAGCTTGG 


TGCTATTTTA 


6420 
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GCAGTTATGG TGA7TTATTT TAACAAGCTC AATCCTTTTA AACCGACCAA GGACAAACAG 6480 

GAAGTTCG7A ACACTTGGAG ACTATGGTTG AAGG7CTTGA TTGCTACTTT ACCTTTACTT 6540 

GGTGTCTTTA AATTTCATGA TTGGTTTCAT ACCCACTTCC ATAACATGGT TTCAGTTGCT 6600 

CTCATGTTGA TTATCTACGG GCTTCCCTTC ATCTATTTGG AAAAGCGCAA TAAAGCGCGT 6 6 60 

GCTATCGAGC CAAGTGTAAC AGAGTTGGAC AAGCTTCCTT ATACGACCGC TTTCTATATC 6720 

GGACTCTTCC AAGTTCTTGC TCTTTTACCA GGGACTAGCC GTTCAGGTGC AACGATTGTC 67 80 

GGTGGT7TCT TAAATGGAAC CAGTCGTTCA CTTCTGACAG AATTTACCTT CTATCTTGCC 6840 

ATTCCTGTTA TCTTTGCAGC TAGTGCCTTA AAGATTTTCA AATTTGTGAA AGCCGGAGAA 6900 

CTCTTGAGCT TTGGGCAATT GTTTTTGCTC TTGGTCGCCA TGGCACTAGC TTTTCCGGTC 6960 

AGCATGGTGG CTATTCGCTT CTTGACCAGC TATGTGAAAA AACACGACTT CACCCTTTTT 7020 

GGTAAATACC GTATCGTGCT TGGTAGTGTT TTGCTACTTT ACAGTTTTGT CCGTTTATTT 7080 

GTATAAGAAA AACCTTGAAG GGGCAACTCT TCAAGGTTTT ATACTCTTCG AAAATCTCTT 714 0 

CAAACCGCGT CAGCTTTATC TGCAACCTCA AAACACTGTT TTGAGCAGCn CTGCGGCTAG 7200 

CCTCCTAGTT TGCTCTTTGA TTTTCATTGA GCTTTAAAAT CCAGTCATGG TAATCCCCAA 72 60 

TAGGCGCACA CCTCTTTCTT TCTTGCTTAA TTCTTCATAG AGTTGCAGGG CTATTTGGCT 7 3 20 

TATCTGACTA GCATCTTGTG TTTTTTGAGC AAGACTTTTT CGTTTGGTAA GAGTTGAAAA 73 80 

GTCCTCGTAG CGCATTTTCA AAATGACAAT TTTTCCAGCT TTTTCTTGTT GATGTAGATT 74 40 

GAGAGCGACT TTTTCTGATA GAAGAGTCAC CTCTTTTTTG ATATCTTCCT CAGCAACGAG 7500 

AATCTTCCCG TAGGTTTTCT CCTTGCCGAT TGATTTACGG ATCCGATTGG ATTTGACTCG 7 560 

AGAGTTGTGA ATGCCACGAG CCTTTCGATA CAGATCATAG CCTAGTCTAC CAAAACGGTC 762 C 

TA7TACGGTT ACCTCAGGAA CTTCAACTAA ATCAGCACCA GTAAAAACGC CCATTTGATG 768 0 

AAGACGTTCT ACTGTCTTTT TTCCTACTCC ATGAAATTTG GAAATATCCA TTTGTTTGAG 77 4 0 

AAAATCCTCA GCCTGTTCAG GTAGAATCAC TGTCAAACCA TGTGGTTTTT GATAATCACT 7 800 

CGCCATTTTA GCTAAGAATT TGTTGTAAGA AACCCCTGCG GAAGCAGTTA GATGGAGTTC 7 860 

TTGCCAGATA TCTTTTTGAA TGAGGCGAGC AATTTTGACC GCTGACTTGA TACCGAGTTT 7920 

ATTTTCTGTC ACATCCAAAT AGGCTTCCTC AATGCTCATG GGTTCAATCA AATCTGTATA 7 980 

GCGCTTAAAA ATAGCTCGAA TCTGGAGTCC CACAGACTTG TATTTCTCAT AATTCCCTCA 8C4 0 

GATAAAGACA GCCTGGGGAC AACGTTCATA ACCTTCC7TC GAACTCATGG CAGAATGGAC 8100 

ACCAAAAGCT CTTGCCTCAT AACTACAGGT AGAAACGACT CCCCGTCCAC CTGTTTGCCG 8160 

AGGGTCCCTT CCAATAATGA CAGGTTTTCC TCTCACTTTA GGATTATCCC TGATTTCCAC 8 2 20 
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TGCAGCAAAA AAGGCATCCA TGTCAATATG GATGATTTTT CTTGACAAAT CATTTAACAA 
ACGAAAAATC AACATGCCTA GCACCTTTTT ATACTCTTCG AAAATCTCTT CAAACCACGT 
CAGCTCTAT* TGCAACCTCA AAACAGTGTT TTGAGCAATC TGCGGCTAGC TTCCTAGTTT 
GCTTTTCGAT TTCCATTGAG TGTTACTGCT TATTyTCTTT TATTATACCC TTTTTTCTGA 
AAAAAAGAAA AAAGGACTTT ATTTTTTCAA AAATATAATA CAGTTTGAAA TAAAATATAG 
ACTGTTTT AG AAAAGAAAGT GTAAAAATAG GGAATTTTCA CTTGTTGAAA TCGGTTACTA 
TATGGTATAC TTGTCTTATG AATGTAACAG ATGACTGTTA CTAGAAAAAA GAGGACATTA 
ATA7GCTTGT TAAGACAGTT GTTGAAGCAC AAGATATTTT TGACAAAGCT TGGGAAGGCT 
TCAAAGGCGT AGATTGGAAA GAAAAAGCAA GTGTATCACG ATTTGTACAA GCTAACTACA 
CACCTTATGA TGGAGACGAA ACCTTCCTTC CAGGACCAAC AGAGCGTTCA CTTCACATCA 
AGAAAATTGT AGAAGAAACT AAAGCACACT ACGAAGAAAC TCGTTTCCCA ATGGACACTC 
GTCCAACATC TATCGCTGAT ATCCCTGCTG GATTTATCCA CAAAGAAAAT GAAGTTATCT 
TCGGTATCCA AAACGATGAA CTCTTCAAAT TGAACTTCAT GCCAAAAGG? GGTATCCCTA 
TGGCTGAAAC TACTTTGAAA GAAAATGGAT ACGAACCAGA CCCAGCTGTT CACGAAATCT 
TCACTAAATA TGTAACAACA GTTAACGACG GTATTTTCCC TGCCTACACT TCAAATATTC 
GTCGCGCTCG TCACGCACAC ACTGTAACTG GTCTTCCAGA TGCATACTCA CGCGGACGTA 
TCATCGGTGT TTACGCACGT CTTGCTCTTT ACGGTGCAGA CTACTTGATG CAAGAAAAAG 
TAAATGACTC GAATGCAATC AAAGAAATCG ATGAAGAAAC AATCCGTCTT CGTCAAGAAC 
TAAACCTTCA ATACCAAGCA TTGC AAC AAG TTGTTCGCCT GGCTCACCTT TACGCCCTTC 
ATGTTCGCAA ACCAGCGATG AACGTGAAAC AAGCAATCCA ATGGGTTAAC ATTGCTTTCA 
TGGCTGTCTG CCGTGTGATT AACCGTGCTG CTACATCTCT AGGTCGTGTA CCAATCGTAT 
TGGACATCTT TGCAGAACGT GACCTTGCTC GTCGTACATT TACTGAATCA GAAATCCAAG 
AATTCGTTGA TGATTTCCTT ATGAAACTTC GTACAGTTAA ATTTGCTCGT ACAAAAGCTT 
ATGACCAATT GTACTCAGGT GACCCAACCT TTATCACAAC TTCTATGGCT GGTATCCCTA 
ACGACGGTCG TCACCGTGTT ACTAAGATGG ACTACCGTTT CTTGAACACT CTTGACAACA 
TCGGTAACTC ACCAGAACCA AACTTGACAG TTCTTTGGAC TGACAAATTG CCATACAACT 
TCCGTCGCTA CTGTATGCAC ATGAGCCACA AACACTCTTC TATCCAATAC GAAGGTGTAA 
CAACAATCCC TAAAGACGGA TATGGTGAAA TGAGCTCTAT CTCATGCTGT GTGTCTCCAC 
TTGATCCAGA AAATGAAGAA CAACGCCACA ACATCCACTA CTTCGGTGCT CGTGTAAACG 



3280 
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TTCTTAAAGC 
AACTATTTGA 
ACTTTGAAAA 
ACTACATGAC 
AACGTGCCAA 
CTATCAAATA 
AAACAATCGG 
AATCCTTGAT 
AAGCTACAGT 
ACTCACCAGT 
TTGAATTCTT 
ACTTGAACTC 
CACAAGTATC 
CAATTCTTCA 
TGAACGATGT 
ACTGTGTAAA 
TCCACGAAGT 
TAATAAAAAG 
A7TCCTTCGG 
AAAGCTAAGC 
AAATCCGTTT 
TGATGAGTTT 
TGATCTAGT7 
GAGCTAACGT 
A7AGCAC7AG 
TCTTCAGACA 
GAGAGAGTTT 
CCAGAGATTT 
TCATCTGTTG 
TAATGAGAGG 



CCTTCTTACT 
TATCGAACCA 
ATCTCTTCAC 
TCATAGGTAC 
CATGGGATTC 
CGCTACAGTT 
TGACTACCCA 
CGAACCTTAC 
ATCACTTTTG 
TCACAAAGGT 
CTCACCAGGT 
ACTTTCTAGC 
ACCTCGCCCT 
TGGTTACTTC 
TTACGAAAAA 
CACTAAATAC 
TCTTTCAATG 
GAACCCTCGG 
CCCAATATGC 
TTGACAAAGG 
TTTGAAGTTT 
GTTAGTGGCC 
TTTATACCAA 
GTCTTGAATT 
TTCATACAGG 
CTCCCTAGGA 
CCCACTATCT 
ATCATCAAAT 
GACAATGTGG 
GATATAACTT 



GGTTTCAATG 
ATCCCTGACG 
TGGTTGACTG 
AACTACGAAG 
GGTATCTGTG 
AAACCAATCC 
CGCTGCGGTG 
ACAACTCGTC 
ACAATCACAT 
GTATACCTCA 
GCTAACCCAT 
CTTGACTTTA 
CTTGGTAAGA 
GAAAACGCTG 
ATCATGTCAG 
CTCACTCCAG 
GATGACGCC7 
TCAAACGACT 
AATGTTTTTG 
ACAAATTTCG 
TCAAAGTTCC 
TCAAGTTTAG 
ATAAATGTGC 
AAGCCCCAAA 
TCATAGTAAT 
GTTAAGGTCT 
TTTAGGATAA 
TGCTTCATGA 
AAACGATCGA 
CC AG AC AT AT 
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GTGGTTACGA 

AAGTTCTTGA 

ACACTTACGT 

CTGTTCAAAT 

GATTTGCTAA 

GTGACGAAGA 

AAGATGACCC 

TACGTAGCCA 

CTAACCTTCC 

ACGAAGATGG 

CTAACAAAGC 

GTTATCCACC 

CTCGTCATGA 

GACAACACGT 

GCGAAGACGT 

AACAAAAAAC 

TGGATGCATT 

GAGGGTTTTG 

CGCTCTTTGT 

TCCTTTCTTT 

GAAAACCAAA 

CGTTAGAATA 

TCAAAGTGGT 

ACTGG7CACT 

CTT7AACTTC 

CTCTGAAAGT 

ATT7CCAGTA 

TGTTGATTCT 

GAACAATTTT 

CAACAGTGAC 



CCATGTTCAC 
ATTTCAATCA 
AGATGCCTTG 
GGCCTTCTTG 
CACTGTTGAT 
TGGCTACATC 
ACGTTCAAAC 
CAAACTATAC 
TTACTCTAAA 
TTCTGTGAAC 
TAAAGGTGGT 
TGACGGTATC 
ACAAGTTGAT 
TAACTTGAAC 
TATCGTACGT 
TGAATTGACA 
GAGCTAATCA 
TGCTTGGGAT 
CAACTGTAGT 
TTTCATGTTC 
GGCATTGCCC 
AGGCAATTCA 
TTTAAAGGTG 
ATTCTTCTCT 
AGGTACTAGA 
TCTAGCATAG 
ATATTTAAGA 
AGTCTGATTA 
AGCATTGGGA 
GACTT7AACT 



AAAGACTACA 
CTTAAACCGA 
AACATCATCC 
CCAACTAAAC 
ACATTGTCAG 
TACGATTACG 
GAATTGGCAG 
AAAGACGCAG 
CAAACTGGTA 
TTGTCTAAAC 
TGGTTGCAAA 
TCATTGACTA 
AACTTGGTAA 
GTTATGGACT 
ATCTCTGGAT 
CAACGTGTCT 
ACTTCTTCAA 
AGTATGAGCA 
GGGTTGAAAA 
AGGGCGATAA 
TTGATGTCTT 
ATGGCGTTAG 
CGGTTGAGAT 
TGTACATGAA 
GTAAAGATTT 
AAAGGCTTAA 
GCTCTGTATT 
AGAGCCCTGC 
AATAATTTCT 
TTTTTTCTAG 
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CTTCTTTCGA GTACTTGAAG AAATGATTTC GGATGGTTGT TTGACGTCTG TTATCAAGAA 11320 

TGCTCATCAT TTTCTTAGTG TTGAAATCCT GAGCAATGAA AGCCAATTTC CCCTTCTCGT 11880 

AGGAGAATTC ATCCCAGGAG AGGATTTCAG GCAAAGTGGT GTAATCCTCT TGGAAATCAA 11940 

ATTCCTTGAC CTTACGATAG ACGGTAGAGG TAGAGGTAGA GGTAGAGATG GCTAATTTAG 12000 

AAGCGATATG TGTAAGAGCC TCTCTGTTGA GTAGGAGTTG GGCAATTTTC TGTCTCACCA 12060 

TTTCCGAGAT TTGCCAATTT TTCTGAACGA GAGTTGTTTC AGCTACAGTG ACTTTCCGAC 1212 0 

ACCACTTCCA TTGAAA7CGT CTCTTTTTCA AA7GAATGAG GCTAGCGAAA CCACCAATCT 12180 

CGATAAAAGG GATTT7AGAA GGCTTTTGGA AGTCGTATTT GATTTGTTTT CCT77ACAGT 1224 0 

GTTTACATTT AGGTGGG7CA TAATCAAGTG TAGCGAAGAC TTCGATATCG GTATCGTGCT 12300 

GAATGGCTTT ATTTAAGGTG ATGTTTTTGT CTTTTATTCC GATGAGTAAT GTGGTATGAT 12 360 

TGATGTCTTC CATAAGATAC TTTCTAATGA GTTGTTTAGG CGCTTTTCAT TATAAGTCTT 12420 

ATGGGACTTT TTTCATAC7C AAAAAGCCCT ATAATCTCCA CAGTGGCATT TACCCACTAC 12480 

AGAAATTATA GAGCCAGAAA AAACACTTTT GTTCACTAGC AGAAACTAGA GAGCAGAAGT 12540 

GTTTTTCTCT TCAGATTTAC CCAAAAC7GC GAAATATGGC GATAAGAATA GAGATGGCTT 12 600 

AGGAAGCCCC TTTTTCTGTG TAGACAGTAC GATGAACTTA TAACAAATAG TGAGCCTTTT 12 660 

TAGCAATCAT TGCGACCCGT TTGTCAAAAG CCTCTTTTCG GATATCTACA ATTGTCTGAT 12720 

AGATGACACG CTGTTGGCTA ACATGCAAAT C7AACGCAA7 CG7CAAAAAC TGATGTTTCC 127 80 

CTTTGGGATA CTGCTTTTTA ACGTAAGGCA GGTATTCTTT CGTTGTAATA ATAATCAATG 12 340 

CC7CTC7CAA ATGCTCCTCT GAAGGAGGAG GAC7AA77AG AATATTCTAT CCTGTAACAC 12 900 

AGGCAACTTT GTCAGTAAAA TTCCG7AAAA TAATGGACTT TATTAACTTT ACATCTCCTT 12960 

GATTATTTAA AA7GATAAAA ATCGGGATAG CAGGTAGTGA GGAAAAGATG GTTTCTGTCA 13020 

AGTAGAGTGA GAAAAGGTAC ACCCGA7CC7 GGTCGATAAC TCCTTCAATC TTCTGCTCAG 13080 

TCATCCACTG TTGAACAATT GCTTTCGAAA TATGATACAG TGGCTTGTCG CTTTCAATCC 1314 0 

CATAATGTTC GTAATAATTA TAATAGGGAA CTAGATTTTG TAAACCAAAC AAAAACGTTC 13 200 

TTGTTAAGAA AGTCAGTGCT GTTAAAAAAG AAAGAGAATT CGAAATGTCA TTTCCTAAGA 13260 

TATTCTTGAA CTTGGATAGT AGATGCTTTC CTCTTGTATG CTGAAGAATC AGTTGAATAG 13 320 

TA7GAGTCTT TTTTTCTTGA TTCCATTTGT CCT7GGAAAA CGAACAAT7A GCAGAACAAT 13 380 

AAACCAAAAA GATATAATCC AGTTCTTCCT GAGTAAAAGT CATGTTGGCA TGTGGCTCTA 134 40 

AGTAAGTTTG GCAATGTTCC ATCAAAATCG GATACATAAA GAGGTTTTTT AATTTTTCAA 13 500 
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ACTCTTTCCA CTCAGGGAAC TCAAGTCGAA ATTCCCGACC TTTCCAAGTG AGTGCCACTA 13560 

GTATGCTAAA ATGAACATAC TCGTCAGGTG TGATTTCTAA CACTTCATGA CTGAGTTGAG 13620 

AATTAGACTG CACAATCATA TGTGTGACCC AATCCATACT TCCATCATTC AAATCATAAA 13630 

TCTCAATACC AAAATGAAAC TCCAGGAGTC CAATTAAAAA ACGAATCCGA TATTCAGGAC 13740 

CAACTACTTG ATTTTTCACA AGGTCCAAAC CTACTGAACG TXGTAACAAG CCACACTTTT 13800 

GTCGTACGCG GTAGCCTGTT GCGATGGAAA TATACTCTTT TTCTGTAAAT TCCTTAAAGC 138 60 

TTTGATTACC TTCTACTAGA AAGAAGCCGA GTATTTTTAA AATACTTGAT TGGTTATAAA 13 920 

GCTGATGGAA GTAATAATTC GTTTGATGAG AATGCTGTTC CATTAATTGA ACTTGTTGCG 13 980 

TATCTAAATT AAATCTCAAC TCTTCCTCGA ATGTTTCTTG TAATTCCTGC AAAATGCTTA 14040 

GGAGACTTTT AGATTGTAAT GAAGTTAAAG TACACAGTTC ATCTAGTTCA ATAGACCGAA 14100 

TATCCAATAA TATATT7AAA ATGGTAATTT TATCTCTAAT TCTTTTTTCA ATCTATTTGT 14150 

TTAGCATAGT TACCGAATCT TAGTTGCATA TAGATAATTT TAATTATTAT AATACAAAAG 14220 

AAACTAATTG TCTTGTCAAA AAGGTTGTGG AATTTCCGAC TTTATTGATA AAACAGCATG 14280 

TAATAAAACC CATTTTAAAG ATAGTAATGA GTATTGGTGG AGTTTTATGG CTTATTTTTT 14 34 0 

TTATTAGAAA ATATTTTTTT ATCAAATATT CTCGTTCTAT AAAAAAATAT GTGATAAAAA 14400 

TATCTATTGT GATGGAAGTT GTTTTAATTT ATACTAGGAT AGTTAATAGT AA TACT AT AC 14460 

TATACTATAT TGTATACAAC TGTGTCATTG CCAGGTTGAG AAGATAGCTA TAACGCACTT 14520 

TTATACCCTT TTGCTACGTT TGTTAGTGAA CGGATTAACT CAGTGAGATA AATTTTATCA 14 580 

CAACATAAC? AATCCGTTTC TTCCTCTATA CAGATTGAAA GTACC7ATCA ATCATAGAAG 14 640 

GATTAACTTC TTCTATGAAT AATGCTTAAC AGGGAGACAC ACATCAAAAA AGTAAGAAAG 14 7 00 

A7ATTTCAGA AGGCAGTTGC AGGACTGTCC TGTATATCTC AGTTGACAGC TTTTTCTTCG 147 60 

ATAG7TCCTT TAGCACAAAC GCCTGAAACC ACTCCAGCGA TAGGAAAAGT AGTGATTAAC 14 8 20 

GAGACAGGCG AAGGAGGAGC GCTTCTAGGA CATGCCCTCT TTGAGTTGAA AAACAATACG 14 880 

GATCGCACAA CTGTTTCGCA AACGACAGAG GCGCAAACAG GAGAAGCGAT ATTTTCAAAC 14940 

A7AAAACCTG GGACATACAC CTTGACAGAA GCCCAACCTC CAGTTGGTTA TAAACCCTCT 15000 

ACTAAACAAT GGACTGTTGA AGTTGAGAAG AATGGTCGGA CGACTGTCCA AGGTGAACAG 15060 

GTAGAAAATC GAGAAGAGGC TCTATCTGAC CAGTATCCAC AAACAGGGAC TTATCCAGAT 15120 

GTTCAAACAC CTTATCAGAT TATTAACGTA GATGGTTCGG AAAAAAACGG ACACCACAAG 15190 

GCGTTGAATC CGAATCCATA TGAACGTGTG ATTCCAGAAG GTACACTTTC AAAGAGAAT7 15240 

TATCAAGTGA ATAATTTGGA TGATAACCAA TATGGAATCG AATTCACCC7 TAGTGGGAAA 15300 
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ACAGTGTATG 


AACAAAAAGA 


TAAGTCTGTG 


CCGCTGGATG 


TCGTTATCTT 


GCTCGATAAC 


15360 


TCAAATAGTA 


TGAGTAACAT 


7CGAAACAAG 


AATGCTCGAC 


GTGCGGAAAG 


AGCTGGTGAG 


15420 


GCGACACGTT 


CTCTTATTGA 


TAAAATTACA 


TCTGAT7CAG 


AAAATAGGGT 


AGCGCTTGTG 


15480 


ACTTATGCTT 


CCACTATCTT 


TGATGGGACC 


GAGTTTACAG 


TAGAAAAAGG 


GGTAGCAGAT 


15540 


AAAAACGGAA 


AGCGATTGAA 


TGATTCTCTT 


TT7TGGAATT 


ATGATCAGAC 


GAGTTTTACA 


15600 


ACCAATACCA 


AAGATTATAG 


TTATTTAAAG 


CTGACTAATG 


ATAAGAATCA 


CATTGTAGAA 


~'iF66CT — 


TTAAAAAATA 


AGGTACCTAC 


CGAGGCAGAA 


GACCATGATG 


GAAATAGATT 


GATGTACCAA 


1572C 


TTCGCTGCCA 


CTTT7AC7CA 


GAAAGCTTTG 


ATGAAGGCAG 


ATGAGA7T77 


GACACAACAA 


15780 


GCGAGACAAA 


ATAGTCAAAA 


AGTCATTTTC 


CATATTACGG 


ATGGTGTCCC 


AAC7ATGTCG 


15840 


TATCCGATTA 


ATTTTAATCA 


TGCTACGTTT 


GCTCCATCAT 


ATCAAAATCA 


ACTAAATGCA 


15900 


TTTTTTAGTA 


AATCTCCTAA 


7AAAGATGGA 


ATACTATTAA 


GTGATTTTAT 


TACGCAAGCA 


15960 


ACTAGTGGAG 


AACATACAAT 


TGTACGCGGA 


GATGGGCAAA 


GTTACCACAT 


G7TTACAGAT 


16020 


AAGACAGTTT 


ATGAAAAAGG 


TGCTCCTCCA 


GCTTTCCCAG 


TTAAACCTGA 


AAAATATTCT 


16080 


GAAATGAAGG 


CCGCTGGTTA 


TGCAGTTATA 


GGCCATCCAA 


TTAATGGTGG 


ATATATTTGG 


16140 


CTTAATTGGA 


GAGAGAGTAT 


TCTGGCTTAT 


CCGTTTAATT 


CTAATACTGC 


TAAAATTACC 


16200 


AATCATGGTG 


ACCCTACAAG 


ATGGTACTAT 


AACGGGAATA 


7TGCTCCTGA 


TGGGTATGAT 


16260 




TAGGTATTGG 


TATTAACGGA 


GATCCTGGTA 


CGGA7GAAGC 


AACGGCTACT 


16320 


AGTTTTATGC 


AAAGTAT7TC 


TAG7AAACCT 


GAAAACTATA 


CCAATGTTAC 


TGACACGACA 


16380 


AAAATATTGG 


AACAGTTGAA 


TCGTTATTTC 


CACACCATCG 


TAACTGAAAA 


GAAATCAATT 


16440 


GACAATGC7A 


CG ATT AC AG A 


TCCGATGGGT 


GAGTTAATTG 


ATTTGCAATT 


GGGCACAGAT 


16500 


GG AAGATTTG 


ATCCAGCAGA 


TTACACTTTA 


ACTGCAAACG 


ATGGTAGTCG 


CTTGGAGAAT 


16560 


GGACAAGCTG 


TAGGTGGTCC 


AC AAAA 1 uA 1 


KjKj L v>*j 1 1 lo k 


7AAAAAATGC 


AAAAGTGCTC 


16620 


7ATGATACGA 


CTGAGAAAAC 


GA7TCGTGTA 


ACAGGTCTGT 


ACCTTGGAAC 


GGATGAAAAA 


1668C 


G7TACGT7GA 


CCTACAATGT 


TCCTTTGAAT 


GATGAGTTTG 


7AAGCAATAA 


ATTTTATGAT 


16740 


ACCAATGGTC 


CAACAACCTT 


ACATCCTAAG 


GAAGTAGAAC 


AGAACACAG7 


CCCCGACTTC 


16800 


CCGATTCC7A 


AGAT7CGTGA 


TGTGCGGAAG 


TATCCAGAAA 


TCACAATTTC 


AAAAGAGAAA 


16360 


AAACTTGGTC 


ACATTGAGTT 


TATTAAGGTC 


AATAAAAATG 


ATAAAAAACC 


ACTGAGAGGT 


16920 


GCGGTCTTTA 


GTCTTCAAAA 


ACAACATCCG 


GATTATCCAC 


A7ATTTATGG 


AGCTATTGAT 


16980 


CAAAATGCCA 


CT7ATCAAAA 


7GTGAGAACA 


GCTGAAGATG 


GTAAGTTGAC 


CTTTAAAAAT 


17040 
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CTGTCAGATG 
CAAAATAAGC 
ATCCTTCCAC 
AATGAACCTA 
TTCTATCTGA 
TAAAGTGTAG 
GAAGAGATTT 
GGGGAGATGT 
TACATTGCTC 
CAAATTTTTA 
AACAGTTTTT 
AGATGGGGAT 
AC7GGGTCTT 
TACTAATAAT 
AACATTTAAA 
AGGAGCTAAA 
CAGTTTATCA 
AATTGAAATT 
AGAAGCAAAC 
TGTAGATAAA 
TACAAAAATT 
AGGTTTGGCA 
AGGTGATTAT 
TTTAGCTAAA 
ATTGAATGAC 
TAATAATCCA 
GACATTCACC 
AACGTTCGAT 
AGACAAAAAT 
ACGTACTATA 



GGAAATATCG 
CTATCGTTGC 
AAGATATACC 
TTCCTCCAAA 
TACGTTGCAT 
AAATGATAAT 
AAGTTTACTT 
TTTCGAAAAC 
ACAGTTGATT 
ACAA7GCTTC 
GCGGCTGGGA 
ATGGATAAAA 
CTACCTGCAA 
GAAATTATTG 
CTCTCAGGGG 
TTTAACACGG 
ACTTATGTCG 
GAATTACCAT 
CCAAAAATTG 
GATACACCTG 
CCAGCACTTG 
TTCAACAAAC 
GCTCTAACAG 
GTGAATGACC 
AAAGCAATTG 
GATCACGGGA 
AAGACATGGG 
TTGGTTAATG 
ACAGTTACTG 
AAAGGGTATT 



ATTATTTGAA 
CTTCCAAATA 
ACCCGGTTAC 
GAGAGAATAT 
GATGATGGGA 
ATCTATGTTC 
GGTGAAACCT 
TTGCACAGAA 
TTAAGAGATA 
CTCCCTTATT 
CGACAACAAC 
TTGCAAATGA 
ATGCAAAACA 
ATGAAAATGG 
CAATCCCGGC 
CAAATTTACC 
GTGAAGATGG 
TCAACCATG? 
ATAAAGATTT 
TGAACCACCA 
CTAATTATCC 
GTACAGTGAA 
AACTACCAAC 
AAAACGCTGA 
TAGAAGTACC 
ATACTCCAAA 
TTGATGCTAC 
CTCAGACTGG 
TTAACGGATT 
CAGCAGATTA 



488 
AATTCTGAAC 

GTAAATGGAG 

GACTTTACGA 

CCTCGAACTG 

GGAGTTCTAT 

TGAACGATAC 

GTTTTATTCG 

AAAGGATTAT 

TGAATAAGGA 

ACTGACAGCG 

ATCTGTTACC 

GTTAGAAACA 

AATTGCCGGT 

CCAAACTCTA 

AACTGCAATG 

AGCTGCTAAG 

AGCAACCTTA 

TGTGGATCCG 

CAAAGGTAAA 

ACTTCCAGAT 

AACAGCAAAC 

AGTAACTGTT 

TGGTTTTGAT 

AAAAACTGTG 

AGAATCTAAT 

GCCGAATAAG 

AGCTGCACCA 

TAAAGTTGTA 

GGATAAAAAT 

TCAAGAAATC 



CAGCTGGTTA 
AAGTCAGAGA 
ATGATAAGCA 
G7GGTATCGG 
TATACACACG 
TTTTAAGAAG 
TAAGTAAACT 
TATTGTCATG 
GAAATCATGA 
AGTAGCCTGT 
GTTCATAAAC 
GGTAACTATG 
GTTATGTTCC 
GGAGTGAATA 
AAAAAATTAA 
TATAAAATTT 
ACAGGTTCTA 
CATGTCTATC 
GCAAATCCAG 
GTTGTAGAGT 
TGGAGCGATA 
CATGATCTTC 
TTGAAATTAA 
AAAATCACTT 
GATGTAACA7 
CCAAATGAAA 
ATTCCGCCTC 
CAAACTGTAA 
ACAGAATATA 
ACTACAGCTG 



TAAACCCGTT 
TCTGACTTCA 
CTATATTACC 
AATGTTGCCA 
GAAACATCCG 
TAGCACTCAA 
ATCATTGAAA 
TGTAATTCAT 
AATCAATCAA 
TTTCAGCTGC 
TATTGGCAAC 
CTGGTAATAA 
TTTGGACAAA 
TTGATCCACA 
CAGAAGCTGA 
ATGAAATTCA 
AAGCAGTTCC 
CAAAAAATAC 
ATACACCACG 
ACGAAATTGT 
CAATGACT3A 
CACTTGAAGC 
CAGATGCTGG 
ATTCGGCAAC 
TTAACTATGG 
ACGGCGATTT 
GAGCTGAAGC 
CTTTGACAAC 
AATTCGTTGA 
GAGAAATTGC 



17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
13060 
18120 
18130 
18240 
18300 
18360 
18420 
13480 
13540 
18600 
18660 
18720 
18780 
18840 
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TGTCAAGAAC TGGAAAGACG AAAATCCAAA ACCACTTCAT CCAACAGAGC CAAAAGTTGT 19900 

TACATATGGT AAAAAGTTTG TCAAAGTTAA TGATAAAGAT AATCCTTTAC CTGGGGCAGA 189 60 

ATTTGTAATT GCAAATGCTG ATAATGCTGG TCAATATTTA GCACGTAAAG CACATAAAGT 19020 

GAGTCAAGAA GAGAAGCACT TGGTTGTTAC AACAAAGGAT GCTTTAGATA GACCAGTTGC 19080 

TGCTTATAAC GCTCTTACTG CACAACAACA AACTCAGCAA GAAAAAGAGA AAGTTCACAA 19140 

AGCTCAAGCT GCTTATAATG CTGCTGTGAT TGCTGCCAAC AATGCATTTG AATCGGTGCC 192 00 

AGATAAGGAC AATCAAAATC TTGTGAAATT ACT7TCTGAT GCACAAGCTC GCTTTGAAAT 192 60 

TACAGGCCTT CTTGCAGGTA CATATTACTT AGAAGAAACA AAACAGCCTG CTCGTTATCC 193 20 

ATTACTAACT ACCCCTCAGA AATTTGAAGT CACTGCAACT TCTTATTCAG CGACTGGACA 19380 

ACGCATTCAG TATACTGCTG GTTCAGGTAA AGATGACGCT ACAAAAGTAG TCAACAAAAA 19440 

AATCACTATC CCACAAACGG GTGGTATTGG TACAATTATC TTTCCTGTAG CGGGGGCTGC 19500 

CATTATCCGT ATTGCAGTGT ACGCATATCT TAAAAACAAC AAAGATGAGG ATCAACTTGC 19560 

TTAAGTAAGA CAGAAAGGAG CCATTGATGA CAATGCACAA AATGCAGAAA ATGATTAGTC 19620 

GTATCTTCTT TGTTATCGCT CTGTGTTTTT CTCTTGTATG GGGTCCACAT CCAGTCCAAG 19680 

CGCAAGAAGA TCACACGTTG GTCTTGCAAT TGGAGAACTA TCAGCAGCTC GTTAGTCAAT 197 40 

TGCCATC7CG TGATGGTCAT CGGTTGCAAG TATGGAAGTT GGATCATTCG TATTCCTATG 19800 

ATGATCCGGT GCAAATTCTA AGAGACTTGC ATTCGTGGGA TCACAATAAA CTTTCTTCTT 19 860 

TCAAAAACAC TTCGTTTGAG ATGACCTTCC TTGAGAATCA GATTGAAGTA TCTCATATTC 19920 

CAAATGGTCT TTACTATGTT CGCTCTATTA TCCAGACGGA TCCGGTTTCT TATCCAGCTC 19980 

AATTTCTTTT TGAAA7CACA GATCAAACGG TACAGCCTTT GGTCAT7G7A GCGAAAAAAA 2 0040 

CAGATACAAT GACAACAAAG GTGAAGCTGA TAAAGGTGGA TCAAGACCAC AATCCCTTGG 20100 

AGGGTGTCGG C7TTAAATTG GTATCAGTAG CAAGAGATGT TTCTGAAAAA GAGCTTCCCT 20160 

TGATTGGAGA ATACCCTTAC AGTTCTTCTG C7CAAG7ACG GAGAACTCTC TATACTGATA 20220 

AAAATGGAGA GATTTTTGTG ACAAATCTTC CTCTTGGGAA CTATCGTTTC AAGGAGCTGG 20230 

AGCCACTGGC AGCCTATGCT GTTACGACGC TGGATACGGA TGTCCAGCTG GT AG AT CATC 2 0 340 

AGCTGGTGAC GATTACGGTT GTCAATCAGA AAT7ACCACG TGGCAATGTT GACTTTATGA 20400 

AGGTGGATGG 7CGGACCAAT ACCTCTCTTC AAGGGGCAAT GTTCAAAGTC ATCAAAGAAG 20460 

AAAGCGGACA C7ATACTCCT G77CTTCAAA ATGGTAAGGA AG7AGTTC7A ACATCAGGGA 20520 

AACATGC7CC TT7CCGACTG GAAGGTCTAG ACTATGGGAC ATACTATTTA TGGGAGCTCC 20580 
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AAGCTCCAAC TGCTTATCTT CAATTAACAT CGCCTGTTTC CTTTACAATC GGGAAAGATA 20640 

CTCGTAAGGA ACTCGTAACA GTGGTTAAAA ATAACAAGCG ACCACGGATT GATGTGCCAG 20700 

ATACAGGGGA AGAAACCCTT GTATATCTTG ATGCTTGTTG CCATTTTCTT GTTTCGTAGT 2 0760 

GGTTATTGTC TTACGAAAAA ACCAAATAAC TGATATTCAA TGTACATCAT TATGAATAGG 20820 

ATAGCAGGCT GAAGGGAAGA CCAGACTACT CTGAGGTGAT GTTAATCACG AATCATGGTG 208 80 

ATGTGGCATG AATCATCAAT AACGGATATC ACCCTCGCCA GATTGTGCCA GCCTCATTGT 20940 

GGGTTATTGT TTGTAAAACG ATAGGACTGG TCTGGTAATC ATTTTA 20986 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21040 base pairs 
O) TYPE: nucleic acid 
(CJ STRANDEDNESS: double 
(D| TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

CCCAGCAAAA AGCCATCCGA AGATGACTT7 TTTGCTATTT AATTTCTGTA TAACTTACTT 60 

CCAAGCCACG CTTAACAGCT GGACGATTGC CAATTTTTTC TCCCCATTTT ACTAGATTTT 120 

GATAACTTGA GGCATCCAAG AATTTTGCAG AACCTTGGTA AAGATTTCCT TGAACTAACT 180 

GTCCATACCA AGACCAGATA GCAATATCTG CAATCGTATA CTCATTGCCT GCAATATAAG 2 40 

G7TTCTGAGC CAATTCCTTA TCCAATAAAT CCAACTGGCG TTTCACTTCC ATCGTAAAAC 300 

GGTTAATAGG ATATTCCAAT TTTTCAGGAG CATAATTCAA GAAATGTCCA AATCCCCCAC 360 

CTAGAAAAGG TGCTCCACCT GCTTGCCAGA ATACCCAATT CAAAACTTCT ACCTTTTCCA 4 20 

CAGCATTACT TGGTAAAAAG GCTCCAAATT TCTCAGCAAG GTAAAGAAGA ATATGAGCAG 480 

ACTCAAAGAC TCTTACGTTT TCAGTACCTG ACTGGTCCAA TAAGGCTGGA ATCTTGGAAT 54 0 

TTGGATTGAG CTTCACAAAG TCTGATCCGA ATTGATCCCC ATCCATGATA GCAATCTTAT 600 

ACAAGTCGTA AGCCGCTTCC TTAAAACCAG CTTCTAGTAA TTCTTCCAAT AAGATAGTAA 660 

CCTTCACACC ATTTCGTGTT CCCAGTGAAT AAAGCTGAAA AGCTTGTTCT CCTTTTGGCA 720 

AGTTTTGTTC CAAACGGGCA CCTGCTGTTG GTCTGTTTAG CCCCGTAAAA CCTCCTTCAT 730 

TACTAGCTTC ATCCTGCCAT ACGGTCGGTA ATTGATATGC TGACATCCGA AACCTCCCTT 84 0 

AAATCGCATT CTTGTCAAAA CCCAGTTTGC GTTGAATAAA CTTAACGATT TCGACGATGA 900 

TAATCATTGA GAAGCTTCCA GCCATAACAA TTCCCCATTG TGACAAGTCT AGTTTGGTTA 960 

CGTGGAAGAT TCCTTCAAGC GGTTCTACAA CGAT7GTTCC CATGAGAAGG ATAAAGGATA 1020 
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CCAAGATGGA CCAGTTAAAG GTCTTAGACT TGAATGGGCC AACTGTCAAG ATCGATTGGT 1080 

ACACAGACTT CACATTGTAG GCATGGAAGA GCTGAATCAA ACCAACGGTT GCAAAGGCCA 1140 

TCGTTAGGGC ATCTGCATGA ATAGCATGAT TGTCACCCAC ATCAACTCGC TAAGCAATCG 1200 

CAAGGCCATA AACACTCATA ACAAGACCTC CTTGGAGTAC ACCTTGATAA ATGATAGAAC 12 60 

TCAAAACACC ACCTGAGAAG AAGCTTGCCT TGCGTCCACG TGGTTTATGA TTCATGACAC 13 20 
CAGGTTCCGC AGGTTCAACA CCAACAGCGA TAGCTGGGAA CGTATCCGTT ACCAAGTTGA ' 1380" 

TCCACAAAAG ATGAACCGGC TGTAAGACAT CCCAACCAAA CAAGGTTGAT AGGAAGATGG 1440 

TTAATACTTC AGCAGTATTA GCAGAAAGTA CCTACTGAAT AGTCTTTTGA ATCTTTGAGA 1500 

AGACCTTACG TCCTTCTTCC ACTGCGACGA TAATAGTCGC AAAGTTATCA TCTGCAAGAA 15 60 

T CAT AT C AG A AGCCCCCTTA GAAACCTCTG TACCAGTGAT TCCCATACCG ATACCGATAT 1620 

CGCCTGTTTT CAGAGCTGGC GCGTCATTGA CACCGTCACC TGTCATGGCA ACGACTTTAC 1680 

CTTGTTTTTG CCAAGCCTTG ACGATACGAA CCTTGTCTTC TGGAGACACA CGGGCATAAA 17 40 

CAGACTATTG ACCAACGAC? TTTTCAAATT CTTCATCTCA CAGTTCATTG AGTTCAGCAC 1800 

CAGTTAAAAC GTGACCTTCT GTATCGTTTG CGTCAATGAT TCCCAAACCT TTCCCAATGC 1860 

CTTCCGCTGT GTCTTGGTCG TCACCTGTAA TCATAATTGG ACGGATTCCC GCTTCCTTAG 1920 

CCACACGAAC AGCCTCAGCG GCTTCACGAC GTTCAGGGTC AATCATCCCA ATCAAACCAG 1980 

TAAAAATTAA ATCATTTTCA AGCTCTTCAG AAGTGAGATT TTCTGGAATA CTATCGATAA 204 0 

TCTTATAAGC ACCTCCAACG ACACGCAAGG CTTCATCAGC CATTTCAGAA TTGTTTGTAC 2100 

CAATGACATT TGTAACCT7C TCATCAATCG GAGCAA7ATC CCCAGCCTTA TCACGAAGAA 2160 

GACAACGTTT TAAGAGTTGG TCTGGCGCAC CCTTGACTGC TACAACGAAA CGACCATCTG 2 2 20 

GCAATGGGTG AACTGTTGAC ATGAGCTTAC GGTCAGAGTC AAATGGCAAT TCACCTACAC 2 2 80 

GAGGATATTT CTCTAAGAAA CCTTTCACAT CATAGCCCTT GTCCAAGGCA TATTGGATAA 2 3 40 

AGGCTGTTTC GGTTCGGTCA CCAATCAAGT TACCTTCCAC ATCGATTTTC GTATCATTGG 2 400 

CCAAGACAAC TCAACGAACT AGTGGCATTT CAAGACCTAG TTCAATATCA TCACCTGAGT 2 4 60 

CATGTACAAC CGC AT CG TAG AAGACTTTTT CGACTGTCAT CTTGTTCATA GTCAGCGTAC 252 0 

CAGTCTTATC AGAAGCGATG ATTTCAGTTC AACCAAGTGT TTCAACTGCT GGCAACTTAC 2 5 80 

GAACGATGGA ATGTCGTT7G GCCAAAACTT GAGTACCAAG AGAAAGAACG ATGGTAACGA 2 640 

TAGCACGAAG TCCTTCTGGA ATGGCTGCAA CGGCAACGGC AACAGAAGTC AACAACTCAC 2 7 00 

CAAGTGGATT TTTCCCTTGA ATGAAGACAC CCACTACAAA AGTAACAAGG GCAATGACCA 2 7 60 
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AGATAGCATA GGTCAAGACC TTAGAAAGGT TGTTCAAATT TTCTTTGACT GGTGTATCAC 2 820 

TCTCATCCCC ATCTTGAAGC ATACCAGCAA TATGACCAAC TTCAGTGTAC ATACCTGTAT 2 880 

TGACAACAAC ACCCATCCCA CGACCATAGG TTACGTTTGA GTTTTGGAAG GCCATGTTGA 2940 

CACGGTCACC AATACCAGCA TCTGTCGCAA GCTCGACTGA CAAGTCTTTT TCGACTGGTA 3 000 

CAGATTCACC TGTCAAGGCT GCTTCTTCAA TTTTAAGAGA GTTGGCTTCT ATCAAACGTA 3060 

GGTCCGCTGG TACCACGTCA CCTGCTTCAA GGGCAACGAT ATCGCCTGGT ACCAATTCTT 3120 

TAGAGTCAAT CTCTGCCATC TGTCCATCAC GAAGAACGCG GGCAACTGGA CTAGACATCG 3130 

ATTTGAGGGC TTCAATAGCT TCTTCACCTT TTCCTTCTTG GTAAACACCA AAGGCAGCGT 324 0 

TGATGATAAC CACACCTAGG ATGATAATGG CATCTGCGAT ATCTTCCCCA CCAGAAGTCA 3 3 00 

CGACTGACAA GATTGccGCC GCAACTAGGA TGATAATCAT CAAATCCTTA AATTGCTCGA 3 3 60 

TGAATTTGAC CAAGATTGAT C G TTTCTCGC CTTCTTCGAG TTCATTGTGC CCAAATTCGG 3420 

CAAGGCGCTT TTCCGCCTCA CTTGATGACA AACCTTGCTC GGTCGCATCC ACAGCCTGCA 34 80 

ACACCTCTTC AGGGCTCTGA GTATAAAACG CTTGGCGTTT TTGTTCTTTT GACATGTGTC 354 0 

TCCTCCTTCA CATTGTGTGC AAAACAGACT CTCTTTCTCT CATACCTTTT CACGACAAAC 3 600 

AAAAAGAAAC CTGTTAATCA TAACAAGTCT CGCTCTTTAA CATAGGGCCG GAAAGCATAC 3 6 60 

TTTTCAGCAT AAAATTCGCA ATGACGACAC TATCACAGGT TTCTGCCAGC TACTCCCTTG 37 20 

AGTAGTACCA TTATACCAAA TTTTGGGGAG TTTTCAAAGA GTAAAAACTG CCTTATTTGA 3 7 30 

ATTTTTCCTT GAAAACCAGT ATAATGGTAG AATGCTATGT GACTAGAAAG GAACTTGAAT 3 840 

GAAGCAATCT ATCTCAAATC TCAAGTTAGC TGAGCCTGGA GCCATTATCA GTATTTCGAC 3 9CO 

CTATTTGATC TTGTCTGCAG CCAAA7TAGC AGCTGGTCAT CTCCtTCATT CATCCAGTTT 3960 

CCTGGCCGAT CCTTT7AATA ACGTATCGGA CATCATTGCA AATGTGGCCC TCTTAATCCC 4020 

GATTCGGATG GCCCGCCACC TGCAGACCGT GACCACCGTT TTGGTCATTC GAAGATTGAA 4 080 

GATTTGGCAA GCTTGATCAC TTCTATCATC ATGTTCTATG TCGGTTTCGA TGTTCTAAGA 4140 

GATACCATTC AAAAGATTCT CAGTCGGGAA GAAACGGTCA TTGATCCTCT TGGTGCAACT 4 200 

CTAGGAATCA TTTCTGCAGC GATTA7GTTT GTGGTCTATC TCTACAATAC 7CGCCTCAGT 42 60 

AAGAAATCCA ACTCCAATGC GCTGAAGGCA GCTGCTAAGG ACAATCTTTC TGACGCTGTT 4 3 20 

ACCTCACTTG GAACCGCCAT TGCCATCCTA GCTAGTAGTT TCAATTATCC GATTGTGGAT 4 380 

AAACTGGTTG CT AT CATC AT CACTTTCTTT ATCTTGAAGA CTGCCTATGA TATCTTCATC 444 Z 

GAGTCTTCCT TTAGTCTTTC AGATGGCTTT GACGACCCCC TGCTCGAGGA CTACCAAAAC 4500 

GCTATCATGG AAATTCCCAA AATCAGCAAG GTCAAATCCC AAAGAGGTCG CACCTACGGT 4 56C 
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AGCAACATCT ACCTGGATAT TACACTAGAG A7GAATCC7C ACTTCTCTGT TTTTGAAAGC 4 620 

CATGAAATCG CGGATCAGGT CGACTCTATG C7CCAGGAGC GTTTTCCCCT CTTTGATACC 4 630 

GATGTCCATA TCGAACCAGC ACCTATCCCT GACGATCAAA TTTTAGACAA TGTCTATAAA 4740 

AAATTGCTTA TCCGTGAACA ATTGATTGAC CAAGGAAACC AACTAGAAGA ACTCTTGACT 4 800 

GATGATT7TG TCTATA7TCG CCAAGATGGA GAGCAGATGG ATAAAGAGGC TTATAAGACC 4860 

AAAAAAG AGT TAAA7TCTGC 7ATCAAGGAC ATTCAAATTA CTTCCATCAG TCAAAAAACC 4 9 20 

AAACTCATCT CCTA7GAGTT AGATGG7A7C ATCCATACCA GTATCTGGCG TCGCCACGAA 4 980 

ACCTGGCAAA ATATCT77CA TCAAGAAACC AAAAAAGAAT AGAGAAATCC TTTCATGAGA 504 0 

CGGCATTTTT CTATTCTTTT ATACTCAATA AAAA7CAAAG TGCAAATTAG GAAGCCGGTC 5 ICO 

ACAGGCTGTA CTTGAGTCCG CAATCTCAAG CCGACATAGT 7TGCACTTTC ATTTTCGAAT 5160 

AGTCTTAACT ATCAAATTCA CTGAGATACT CATAGCGTTC GTATTTTTCA AGGAGTGCTT 52 20 

CATTTTTCTC ATCCAATTCT TTTTGGAGAG TAGCCAGCTT ACCAAAGTCA GACCCCTTAC 5280 

CC7CCA7TTC C7CT7CAATA GCAGCGATAC CTTTTTCCAA GGTTTCAATA TCACCTTCAA 53 40 

TACTTCCCCA CTCCTGCTTT TCTTGGTAGG TCATGCGTTT CTTGTCTTCT CGAACCTTGA 5400 

CCACTTTTTC CTTTTCGGCC TTTTGCACTT GATTGGCCAT ATCTCTTTCA AAAGCTTTTT 54 60 

CATCAAGATA GTCGGTGTAA TGACCAAAGA AAGGACGAAT CTTGCCATCC TCAAAAGCGA 5520 

GAATCTTGCT CCCTACCTTA TCCAACAAAT AGCGGTCGTC ACTCACTCTT AAAACCCCAC 5530 

CTGCAAAACC TTGCAAGAAA TTCTCTAAGA CTG7CAAAG7 TGCAATATCT AGC7CAT7CC 56 40 

TTGGCTCGTC TAAAAGAAGA ACATTTCGTT TTTCCAAAAG CAGTTTCAGC AGA7AAAGAC 570C 

GTT7TTTCTC ACCCCC7GAC AATTTCTCAA TCAAACTCCC ATCCC7dAA CG7CCGAAGA 57 60 

GGAATTGCTC CAGCAACTCA GCGATGGAAG TCGTAGAACC ACCACTGGTC TTGACCTCCT 5820 

CTGCCACTTC CTGCACC7AA TTGATCACAC GCTTGCT77C ATCCAAACCC TCAATTTCTT 5880 

GAG AG AAA T A GGCGATGCGA ACAG7TTCCC CAATCACAAC TTGTCCTGCT CTCGGCTCAA 5940 

GACTTCCTCC AATCAGGTTA AGTAGGGTTG ATTTTCCAAC ACCATTGTCC CCAACAATTC 6000 

CAATACGGTC T77AGCC7GA ACTAAGAGAT 7AAAATT77G CAAAATGGGC T7ATTTTCAT 6060 

AGGCAAAGGA AACATCCTGA AACTCGATGA CTTTCTTCCC AATCCGAC7C GTTTCAAAGT 6120 

7CATAGTCAA GTCTGTCTCA GCACTACTGC CTGAAAC77C CTTTTTCAGA TCATGGAAAC 6180 

GATTGATACG ACCTTGTTCC TTGGTCGCAC GCGCCTGCGG TTGTCTGCGC A7CCAGGCCA 62 4 0 

AT7CTTGTTT GTAGAG77GT TCTTTTTTGT GAAGAAGAGC CGCGTCGCGC TCATCCTGTT 6 3 CO 
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CCGCCTTTAC GCGAACATAG TCCTCGTAAT TTCCCTGGTA CTCGGTCAAG CCTGCACGAT 6 3 60 

CCAACTCGAA AATCCCTG7T GACAAAGCGT CTAAGAAATA ACGATCGTGA GTGATAAAAA 64 20 

GGACGGTCTT CTTAGAATTT T7CAAAAAGA GGGTCAGCCA CTCAATAATC GCAATATCCA 6480 

GATGGTTGGT CGGCTCATCC AAAAGCAAGA GGTCGTGGTT GCCAACTAAG ACTTGTGCCA 6540 

ACTGTACCCG TCTTCTCAGA CCACCTGACA ATTCCCCAAC AGGAGTAGAT AAGTCTTGAA 6600 

TGCCCAATTT GCTAAGAACG GTCTTGACCT GACTTTCGAT TTCCCAAGCT TGCAGAGAGT 6660 

CCATCTCTGC CATGACACGT TCCAAACGCG CCTGCTTGTC CTCACTATAG TCGAGCATAA 6720 

7CAATTCATA CTCACGAATG AGCTGGATTT CCTTGAGTTC AC TAG AT AG A ACCCTATCCA 6 7 80 

AAACTGTCTT TCTATCATCA AAATCAGGAT CCTGAGTCAA GTAACCAATC TGGTAATCAT 6B40 

TTTTAGCTGA AAAAGGACTG ACATCCCCAT CAAATCCAGA AACACCAGAA AGGACGTCCA 6900 

AAAGGGTGCT CTTGCCAGTC CCATTGACAC CGATTAAACC AATTCTGTCT AAGTCATGGA 69 60 

TAATAAAGGA AATATCCCTA AAAACGGTCT TGTCACCAAC CGATTTACTT AGTTTTTCAA 70 20 

CGATAAAATC ACTCATTTTT TCTCCCTCAG GTAAGCATCG ATGGCTTCAC GATTATTCTC 7090 

CAATTCTCCA TCGACAATGC CAAACTCAAT CTCTGTTAAA ATCTCTCCCA ACTCTGCGCC 7140 

TGGCTGATAG CCATATTCCT TGATCAAAAT ACCGCCATTA ATCTGAATCT CTTTCTTGTC 7 200 

ATGGATAGTC AAGCTTTGGT ATTTTTCTGT GATGGCTTGT GGGTTGACTT CTTTTCCTTG 72 60 

ACCTTGACGA AGATTTTCAG CCTGTAAAAG CAAATCTATG TCAAAGCGAT AACAATCTCG 7 3 20 

CTTGCTCAAT TCTCCATTTT CACGCAGAGC CAAAATAATC AGCAAATCCT GAACTTGCTT 7 3 30 

CCCAAACTGC CCTGAGCTCT TCCAAGATTT CAAAAATGAC TGCGCATTTT CAATCTCCAA 74 4 0 

AGCCCATAGT AAAGCCGCCC AGGCTTGTTC ACAGGATTCA AAAGTAAAAT CAGTCTCCAA 7500 

ATCAAACAGT C7G7TGACCT TG7CC7GGCT AGATGCCATA TCAGGGAGAT ACTCATAACC 7 5 60 

TTGACTCTCA ATCATGGAAG CCAAGCCCCT TCTCCAAAAT GGAGCCAGCA AGAGTTTATC 7 620 

AAACTCCACG AAGGTACGCT CTACAGAAAT TTTCTCCAAA AGCGGCGTCA AGGTCTTCAT 7 630 

AGCTTTAAAT CTTTCTCCCT CAAGTGCAAA ACCAAGACTA GCCTGAAAAC GGAAACCACG 7 7 4 0 

CATAATCCGT AAACCATCTT CGTTGAAACG CTCACTAGCC ACTCCAACTG CTCGCAAGAC 7 800 

TTCCTTTTCC AAATCTTCTA AACCATGGAA CAAGTCAACG ATTTCTCCTG TCTCATCCAA 7860 

GGCAAAGGCG TTGACTGTGA AATCACGGCG TTTGAGGTCT TCTTCTACCG ATCGTACAAA 7920 

GGAAACCGCA CTGGGTCTCC GATAGTCCAC AT AG A CATC C TC7GTCCGAA AGGTTGTTAC 7 980 

CTCATACTCC TCATCCCCAT CTAAGACCAA CACGGTTCCA TCCTCGATTC CGATATCGGC 8C40 

TGTTCGCGGA AAAATCTGCT TGGTCTCTTC TGGATAAGAA GACGTCGCAA TATCCACATC 3 ICO 
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GGCTGACGTA 


9660 


CGGCTGATTT 


GAATGGCTAG 
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ATGCTGGCAA 
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CTGATCACGC 
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CAATTAAAGA 


CGCTT7CAAC 


CA7GATGCCT 


AGGCGAGCAC 


TTGTAATCAA 
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GAAATTCCAG 


AAGATAGGAA 


AAGCCCCAAG 


GCATGTGTAT 


AGCCTTCTTC 


TTTCACCGAA 


9900 


GTTAAGATCT 


CATCTAACTT 


GCCAATACTT 


GGTTGACTCG 


TCTTAGGCAA 


TTCAGAAGCC 


9960 


TGAGCCATTT 


TTTGGTAAAA 


TTCCTCAGCA 


GACAGATTGA 


TGCCTTCGAC 


ATATTCCTCA 


10020 


CCATCAATAT 


TGACAGGAAT 


ATCCAAGACA 


AACAAGTCTT 


CTCTTTGCAA 


GATCTCTGCA 


10080 


CTGAGATAAG 


CAGAGGAATC 


TGTGAAAACA 


GCTAATTTCA 


TATTAGAACT 


CCAAATTAAT 


10140 


TCCTGGTAAG 


TCTAATCCAA 


TTTCAGTCAC 


TTCGTAAGTC 


AAACGATTGA 


GCATGTTCAA 


10200 


ACATGGACGA 


GCCAACGTTT 


CCACCTCTTC 


TTGGTTCAAT 


TCACTTGGTT 


CATTCACAAT 


10260 


ACGGCCATCC 


ATATGGTTTA 


CTTGTGAGAT 


TGTTCCACTA 


ATGACAAACT 


TATCAAATAC 


10320 


AATCATAAAG 


CTCAAGATGA 


CAATCAAGGA 


AGTCACTTGA 


TTTTCTTGGT 


CATGTTGGAG 


10390 


CAATTGGAAA 
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CCTTGGTTTC 
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CTCTGATAAG 


CTCTTACCAG 
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TTTCCTTCAT 


GATTTTAGTC 


AATTGAACTG 
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ACCATCACCT 


GTGGTATTGT 


10920 


AATCCATCAA 


GATAACGTGA 


CCAGACTGTT 


CACCACCAAG 


GTTGTAGCCT 


GATTTTCTCA 
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AACGT AGCGG 
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CTTGTTAATA 
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GTGGAAACCA 
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7AACAGTTGT 
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TGATGTACAT 


AATCTTGTCA 


CCATCAACGA 
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TGTCACCATT 


CTCATCAACA 


GCAATCAAGC 


GGTCACTGTC 


TCCATCAAAG 


GCCAAACCAA 


11220 


TAGCTGACCC 


ACTTTCTTTG 


ACCACTTCTT 


GAACGGCTTC 


TGGATGTGTT 


GAACCAACAT 


11280 


TAACGTTGAT 


GTTAACACCG 


TCTGGTGTTT 
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CGTCAATTGG 
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CGCACTTGCT CCTTCAGTTT CAACCAAGTA ACCTAC7GCT GGTGTTGCAA GG AC AC CAAG L1700 

7TTGTATACC TCAATCCCTA CTGAAAgGAG ACCTGCCACC AACCCCCATT CCAACATTTC 11760 

CCCTGAAATA CGTG7GTCAC CTCCTACAAA GACT7TCCGC GCTTCCGTTT CATGTTGACT 11820 

AAGAACATAG CCTCCAAAAC GTCCTAGTTT AAAGGCTAAT TCTCGTGTTA G77C7AGG77 11880 

AGCTTCTCCA CCCAC7CCAT CAGTCCCAAA ATATTTACCC ATTCTTATAA AATCCTTTTC 11940 

7ATT77TAAT TCCTTTTTGA ACTAGTTGCT TTCGTTGACG AAGATGTCTC CCATCAACTC 12000 

CTTGTACTTG AATTTGATGT GCTTGAACTT GGTGCTACTG G7TT7G7AGT CACCTTCATT 12060 

ATTGTA7CAA ACGGAGTGAT AACTGCCGGT AAGACAACAC CAT7GCCC7C GATTGCCTGC 1212 0 

AAACGTACTG AACCACTGTA ATTACCTCTT ATACGTTCGC TAGT7GGCAA AACAGCCATA 12180 

ATCTTATCAA TTCTATCCAA TGTCTCTTGG TCACTCGTAA 7AGACAC77C TTTATCTGAC 122 40 

ACCATGACAT T7TCAATT7G TACCCGACTA TCAATTTCAC 7ACCCTCAAT CTCTCGTACA 12 300 

A7CTT7ACCT TA7CCTTC7G AGCCT7CTTA CCAATCTTGA CTG7AATTTT TTGCGGAGTC 12 3 60 

GCCACAGCGG TCAGCCCATT GGGTAAATCT TCAATCCTCA AAGGAACTTC AATCCTTCCA 124 20 

ACACCGGCAT CTGTTAGG7C AGCAGTAACC TTGAATTTAC GTG7ACTTTC 7TGCATT7CA 12480 

C7AGCTAGCG ATAGCCGATT TGCACCAGTC AAGACCACTC ATACTTCTGA AGCAAAACCG 12540 

C7AATAAAAT ACTTATCACT A7TATAGCGT ATGTCAATAG GGACATTTGT TACTCTATTA 12600 

GTATAGG7TT CCG7TTT7AC C7CCCTAGCA CTGCTACTGT TTTGAAAATT CGTCGCCCTA 12 660 

GCATAGACAA A7AAGACACA AGCAAAAAAG AG7GAGCATA TGATA7ATAA AC7AT7TTT7 12720 

T7CATG777C . CATCC7CCTA GCAATCG7TC TTTAAAACTA AGACCCAC77 CCTCTT7TGG 12780 

AAC7AAGATT 7CACG7AA77 CTCTTTCAAA TTCA7CAAGT G7TACGT7GT GCTTAAACCT 12 840 

7CCATTATAG GTTATCGAAA TTCC7CCCG7 77CC7C7GAT ACGACAAAAG TCAAGGCATC 12900 

7GACACTTC7 CATAAACCGA TAGCCGCCCG C7G7CTGGTC CCAAA77CC7 TGGAAATCCC 12960 

TG7GTT7TTT GTCAAGGGCA GATAGGCAGA CCTCACAGCG ATACG7TC7T CTTTCA7AAT 13020 

CACCGCACCA TCATGTAGGG GAGTG7TGGG AATAAAAATG TTAATGAGAA GTTCTCCACA 13 080 

AATCTTAGCA TCCAAGGGAA 7TCCTG7CGA AATATAC7CC TGCAAGGTAC GTACACGCTG 13140 

AATAGCAACC AAGGCCCCGA TT7TACGAGG ACTCATGTA7 7CAACAGAC7 7AACAAAGGC 13 200 

ACCAATCATC TGTTCCTCAG CAC7AATAGG GGCATTGGAA AAGAAATCTG TCGCTCTTCC 13 260 

CAAACGTTCC AAACCAGTCC GAATCTCTGG AGAGAAGATA ACAACCGCCG CAA7AACCCC 13320 

ATAAGTAATA AT77GATTGA 7TAACCAAGA AA7CCTAGTC AAACCAA7CA 7A777GCAAG 13 3 90 
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GATTTGAGCT AAAATAAACA CCAAAACTCC ACGTACCAAA ATCATAATCT TCGTTCCTGC 13440 

AATACCTTTT CTAAAATGCT ATAAAATATA AGCAACAATC AAAATATCAA TCAGATTGAT 13 500 

AGCTATCGTC CATGGACTTG CAAACAAACT GGTCCAATAT TGCAGATTGG ATAATTGTTG 13560 

AAAATTCATC CCTGATATCC TCCCTATCAA AACACTTTCG TCCTATTATA CCATTTTCTG 13620 

GCATTTTTTT CCCTATCCTA GTCCATTTTA CATTGAACAA AAftTATCATA AAATAAACTG 13 680 

ACTAAAAAAA ACAAAGGAGA AACTATGTCT CAACTCTATG ATATTACCAT TGTGGGTGGT 13740 

GGTCCTGTCG GG C TTTTTGC AGCCTTTTAT CCCCACC7AC GCCAAGCCAA GGTTCAAATC 13 800 

ATCGACTCTC TTCCCCAGCT AGGTGGACAA CCTGCTATTC TCTACCCTGA AAAGGAAATC 13 860 

CTAGACGTAC CAGGCTTCCC AAACCTGACT GGAGAAGAGT TGACTAACCG CTTGATTGAA 13 9 20 

CAGCTAAATG GATTTGATAC CCCTATTCAT CTCAATGAAA CGGTTCTTGA GAT7CACAAA 13 9 80 

CAAGAAGAAT TTGCCATCAC AACTTCTAAA GG AAGTCACC TGACTAAAAC AGTTATCATC 14 040 

CCTATGGGTC GCGGTGCCTT CAAACCACGT CCGCTGGAAC TTGAAGGGGT TGAGGGCTAT 14100 

GAAAATATCC ACTACCACCT TTCTAACATT CAGCAATACG CTGGTAAGAA AGTGACCATT 14160 

CTTCGTGCCC GAGACTCGGC TGTGGATTGG CC7TTCGCT7 TTGAAAAAAT CGCACCAACT 14220 

ACCCTTGTTC ACCCCAGAGA TAATTTCCGT GCCTTGGAAC ACAC7GTTCA AGCCTTGCAA 14 280 

GAATCATCTG TAACCATCAA GACACCATTC GCCCCTAGCC AACTCCTTGG AAATGGAAAA 14340 

ACACTTGATA AACTTGAAAT CACAAAAGTC AAATCTGATG AAACTGAAAC CATTGACCTA 14400 

GACCACCTCT TTGTCAACTA TGGTTTCAAA 7CT7CTG7CG GTAACCTTAA AAACTGGGGG 14460 

CTCCACCTCA ACCGTCACAA GA77A7CGTC AACAGCAAAC AGGAATCCAG CCAAGCAGCT 14S20 

ATCTATGCTA TCGGTGACTG CTGCTACTAT GACGGAAAAA TTGATCTGAT TGCGACAGGC 11580 

C7CCCAGAAG CTCCAAC7CC TGTCAACAAC GC7A7CAAC7 AC ATTG AC CC TGAACAAAAA 14 640 

GTACAACCAA AACACTCTAC TAGTTTATAA AAAAGAACCA CGACTCACA7 ACCATTCCTG 14700 

GTTTTATAAT TCATCCGCTA TCTTATTGAT TTTTCTGAGT CTGTGATTGA CACCACTTTT 147 60 

GCTCAGAGGG GTGCTGAGAC TATC7CCTAA CTGCTGGATA GAGTAGTCTG GGTCCTGAAT 14 820 

CCTCAGTTGC GCCACTTCCT GCAAATCTAC 7GGCAAATTT TCTAAGCCCA TGATATCTTT 14 880 

GATTTTACTG ATATTGTTAA TGGTCTTCAT GCTGGCAGAA ACTGTCCGAG CGATATTAGC 14 940 

TGTCTCGGCA TTATTAGCCC GATTGAGGTC GTTACGGGTT TCTCGCAAAA TCTTAACCCG 15000 

CTCAAAATCA TCACCTGCC7 GCATGGCTCC TATTACTATC AAGAACTCCA TAATGTCTTC 150 50 

TGCTCGCTGG AGATAGGTCA CAGCCCCCTT CTTGCGCTCA AGCACCTTGG CATCCAGTAA 15120 

AAACTGTTGG AGAAGGGAGG CAATTCCTTG CGCGTGGTCC AGATAAACAG AACTGATTTC 15190 
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TACTGTGAAA 


CTCATTTTTT 


CACCTGTATA 


ATGCGCATCA 


ACTCGTCCAC 


AATCAAATCT 


156S0 


CCATCGTGGA 


AGGCACCGCC 


ATTTTCCAGA 


CGAAGGAAGT 


TAGATGAAAT 


CACGCGCGAA 


15720 


ACTTGCTTAC 


AAAGACCTAC 


AAAATCGTGT 


TCCACTTGCA 


CTAACTATTC 


ATCAAAACGG 


15780 


TTGGAATTCA 


TGTATTCCTG 


AGGCACTTTT 


TCAATATTCA 


CCAAGACAGT 


GTCGATAAAA 


15840 


GGGCGACCAA 


GGTGACGATG 


CAAGACTTCC 


ACGTGGTCGC 


TATCTGTAAA 


GTGTTCCGTC 


159O0 


TCCCCACGTT 


GGGTCATCAT 


ATTGCACACA 


TACCCAATTT 


CTGCCTTGGT 


TTCCAAAAGA 


15960 


GCCCGCCCAA 


TTTCCTTAAT 


CACCATATTG 


GCCAAAATAG 


AGGTAAAGAG 


GGAACCTGGC 


16020 


CCTAGGACAA 


TCATGTCACT 


TTCAAGGATG 


GTCTGCACTA 


CTCCACGGCT 


GGCCAGAGGC 


16080 


GTATCATCGT 


TTAGGGCATT 


GGTCACATAG 


ACATTGTCAA 


TTATGCCTCG 


ATCGTCTACA 


16140 


ATATGACTCT 


CTCCAGCCAC 


TTCTGTCCCA 


TCCTGAAAGA 


CTGCATGAAG 


GGTCAAAGGA 


16200 


TGGTCACTGG 


AAGGATAAA7 


TTTCCCTGTT 


G7A7GCAAAA 


ATTTGCTCAA 


TAACTGCATG 


16250 


GCATTATAGG 


TTGAACCCTG 


CATTTCTGAC 


AAGCCAGCAA 


TGATGAGATT 


TCCCAATGGA 


1632C 


TGGCCAGCAA 


AGGCTCCGGC 


ATCCTCAGAG 


AACCGATACT 


GAAACACCTT 


CTCATAAAAC 


16390 


TTAGGCATAT 


CCGACATGGC 


CACAAGGACA 


TTACCAAGAT 


CACCTGGCGG 


TGTCAACTGT 


1644C 


TGCATATTTT 






CC AC CATC AT 


CTGCCACCGT 


CACGATAGCT 


16500 


GCGATTTCCA 


CATCTTTTTC 


CCGCAGACTT 


TTTAGAATGA 


CGGGACTTCC 


AGTCCCTCCA 


16S60 


CCAATCACCG 


TTATCTTTGG 


TTTTCTCATG 


AACGGTTTAC 


CGTTTCCTTT 


CTGCGGTCTT 


16620 


TGTCGCGATG 


CCCTTCATTA 


ACAGACCAAT 


TCTTGGATAA 


GTCCTGCGCC 


AACCGTTTAG 


16680 


CAAATGCCAC 


ACTACGGTGT 


TGTCCACCCG 


TACATCCCAT 


GGCAATGGTC 


AAAACGGACT 


16740 


TACCTTCCTT 


TTGGTAACTT 


GGCAGAATCG 


GCTCAATCAA 


GGCCAATAAA 


TGTTGATAAA 


16800 


AGTCTTCTGA 


CTCAGGATGG 


TTCATGACAT 


AATCATAAAC 


AGGTTCATCC 


ACACCCGTTT 


16860 


CGTTTCTCAG 


TTCTGGTAAA 


TAATAGGGAT 


TTGGCAAGAA 


ACGGACATCA 


AAGACCAAGT 


15920 
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CCGCATCAAT CGGGATTCCA TACTTAAATC CGAAAGACAT GACTTCGATA CGGAAAGACT 159 80 

GGGCTTGTTC TTGGTCTGAA AACTGCTCTG CAAGGC7TT7 GCGCAgCTCA CGTGGAGTGA 17040 

GTTCAGTCGT ATCCACCACA TTTTCCCTCA TATTTTTCAA ACGTGCCAAG AGTTCACGTT 17100 

CCAACTTGAT TCCATCTAAA ATACGACCGT CTGCTGCTAC TGGGTGACTC CGTCTGGTTT 17160 

CCTTGTAACG AGCGACCAAT TCCTTATCAG CCGCATCCAA AAAGAGGATT TTGAAATCCA 17220 

AACCATCTTG ATT7TCCAAC TCATCCAAAA CACCTTGAAT CTCTGAAAAG AAAGAACGGC 172 80 

TACGCA7ATC CAC7ACCAAG GCCAACTTAG GATTGTCTTC CTTAATTTCA ACCAGCTGCA 17 340 

AAAACTTAGG CAAGAGAGCT GGCGGCATA7 TATCAATGGT GAAATAACCT AGATCCTCGA 174 00 

AGGACTGAAT GGCTACAGTT TTCCCTGCGC CACTCATCCC TGTCACAATC ACCAAGTGAA 174 60 

GTTG7TTC7T TGTCATCTTT TTCTCCTTAT ATCAAAAGAA GTTTGGCAAC ACCAAACTTC 175 20 

AAC7ACC7TA TCCAA7C7CT GCGATGACTT CAATTTCGAC TTTTACATCA CGAGGAAGAC 17580 

GACCTACCTC CACAGCTGAA CGAGCTGGGA ATTCCTCT7T GAAGGCCGTT TGGTAAACCT 17 640 

CATTAAAAGG AACAAAC7CG TTCATATCCC TCAAGAAGCA AGTTGTTTTG ACAACATGGT 17700 

CAAAGTCTGT TCCTGCTTCT GCCAAAATAG CACCGATGTT TTTCAACACT TGCTCTGTCT 177 60 

GTTCTTGCAT ATTCTCTCCT ACAATTTCCC CACTTTCACG GGATAGGGGA ACTTGACCGC 17 920 

TAGCAAACAA AAGGTTGCCA ACGATTTTTC CTTGAACATA GGGTCCCATA GCCTTTCGGG 17880 

CCTTATCTGT ATGAATTCTT TTTGCCATTT TCTTTTCCTC ACAATTTTTC TAAGAT7GCA 17 940 

TCCCAAGCCT CATCCATCCC TGCCTTACTG ACAGATGAAA AGAGGATGAA ATCGTCACTC 18000 

GGCTCAAAGT T7AA7TTC7T TTTGATTCCT GATTCATGCT TG77CCAT77 ACCACGAGGA 18060 

ATCTTGTCCG CCTTGGTCGC CACAATGATG AC7GGAATCT CATAATACTT GAGAAATTCG 18120 

7ACATCTGCA CATCATCTGC TGACGGGTCA TGACGAAGGT CAACTAGAC7 GACAACCGCA 18180 

CGGAGATTT7 CCCGAG7CGT TAAG7ACTCC 7CAATCATGC ACCCCCACTT TTCACGTTCC 182 40 

7TTTTAGAAA CACGAGCATA GCCATAACCA GGCACATCCA CAAAGCGCA7 CTTGTCATCA 18 300 

A7GTTAAAAA AGTTCAGGAG CTGGGTTTTA CCAGGTTTTC CTGATGTACG GGCGAGATTC 18360 

7TACGGTTCA ACATAGTGTT CATAAAGCTG GATT7ACCAA CATT7GAACG CCCTCCTAGG 134 20 

GCAATCTCTG GCAGT7CATC CTGCGGATAG TGGGACTTAT TAGCTGCAC7 GAGCAAGATT 184 80 

7CACCATTGT GTGTATTAAG TTCCATAGTC ACCTCTAGGC TG7T7CTACC ATCGGTTTAT 18540 

CCGT7CCATC TACAG77TC7 TTAGTGATGC GAACCAA7TT CACAT7TTCC TGACTCGGCA 13 600 

CCTCAAACAT GACATCTAGC ATGGTTTCTT CGATGATGGA GCGAAGTCCA CGCGCCCC7G 18660 

TCTTCCGTTC GATTGCTT7A TTAGCAATCT C77GAAGGGC T7CGTCGTCA AATTCCAACT 18720 
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CAACATCATC ATAAGAAAGC AAGGTTTGGT ATTCTTTCAC CAACCCATTT CTTGGCTCTT 13780 

TCAAGATGCG AACCAAGTCA TCAACGGTCA ATTGCTCAAG AGCCGCAAAA ACAGGCAAGC 18840 

GTCCAATCAA CTCAGGCATA ATACCAAATT TTTGAATCTC TTCAGCGATG ATTTCTTGCA 18900 

TGTATGAGCT GTTTTCGTCA ATCGCCTTAT TATTTTGACC AAATCCGATG ACTTTTTCAC 18960 

CCAGACGTTG TTTGACAATT TCTTCAATAC CATCAAAAGC ACCACCCACG ATGAAGAGGA 19020 
TATTffTTGT AT CCACTTGA ATCATCTCTT GTTCTCGATG TTTGCGTCCA CCTTGAGCCG " "19080 

GTACGCTAGC AACAGTTCCC TCAATAATCT TGAGAAGGCC T7GTTGCACC CCTTCACCAG 19140 

AAACATCACG TGTGATAGAC ACATTCTCAC TCTTCTTCGC AA7CTTCTCA ATTTCATCCA 19200 

CATAGATAAT GCCACGCTCT GCACCTTCGA TGTTAAAGTC AGCAACCTCC AAGAGTTTGA 192 60 

GGAGGATATT TTCCACATCC TCACCCACAT AACCACCCTC CGTCAGAGCT GTCGCATCCG 19 3 20 

CAATAGCAAA AGGTACATTC AAGCTCTTAG CCAAGGTCTG GGCAAGGAAA CTTTTCCCTG 193 80 

AACCAGTTGG GCCAATCATC AAAATGTTTG ACTTCTGCAA ATCCACATCT TCTCACTCT7 194 40 

CGCCTGTATC GTGGAAATTG ATGCGTTTGT AGTGGTTATA AACCGCCACT GCCAAGGCAC 19500 

GCTTGGCACG ATCTTGACCA ATTACATAGT GGTTCAAGAT ATGGAGGACT TCAATTGGTT 19560 

TTGGCACCTC AGACAAGTCT GCCAAGACTT CCTCAACCAA TTCTTCTCGA ATGATTTCCT 19 620 

GAGCTAACTC CACGCATTCA TTACAAATAA AAGCATTGTT GCCAGCAATT ATTTTTTGTA 19 680 

CTTCTTCTTG GTTTTTGCCA CAAAATGAGC AATAAACCA7 CATATCATTT TTTCTATTTC 197 40 

TAGACATCAT TTCCTTCCA7 TCTATACTGT CATTCTATCT AAAATAAGGT CATGTAAAAA 19800 

GCATGAATAC TATTGACCAG ATTGGTAAAG GCATTTAACC AAAGGACGAT AGAAAGCCCG 19860 

TAACGCTTTT TACGAAAACC TTGTGCTCCT GCCAGAAACC AGATCAAACA CAGAAAAGCC 19920 

GTGAATAGAC CAAATAAACT CCGTTCCATT AGACTTCCTT TCTCTTGCCG TATTGGATGG 19980 

TAAAATCATA AGGATTCTTC TCATCTTTGG CGTAAAATTT GCTTGAAACT GTCTCAAAAA 2004C 

GAGACAAGTC AAGTTCTTCA GGGAAATAGG TATCTCCTTC CACCCGAGCA TGAATGTGAG 20100 

TGACAATCAC TTCATCAAGG TAAGGTTCAA AAGCCTGAAA AATTTGCTTC CCACCGATAA 20160 

TGTAGAGATT C TTTTCTTGA GCCTGATACC AGTCAAGAAC AGACTGGACG TCCTGAAAAC 202 20 

TAGCAACCCC ATCTATCTTT TCTTCCGGAT TACGCGTCAA AATCAAGGTT 7CCCGTTTTG 202 80 

GAAGCAAGCG ACGCCCCATC CCATCAAAGG TCACACGCCC CATCAAGATA GCATGATTCA 20340 

GAGTTGTTTC TTTAAACTCC TGCAATTCTG CTGGCAAATG CCAAGGCAGA CGATTTTCCT 20400 

TACCAATCAC ACCCTCTTCA TCCTGGGCCC AAATAGCTAC GATT7TCTTA GTCATGCTTC 204 60 
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CATCCTTTTC ACTGATAGTA CTATTTTATC AAAAAACTCA AAAAAAGACT GGTTTGGAAT 20520 

ACCTTACAAA ATAGAAAAAA TCTGTAAGAA ATTTCCTACA GATTTATCTA TGTTTCCTTA 20530 

TTTCTTACAA ACCAGGTCCT TGTCCAAGTT CGGCTGCAAG CATCCAAATT GTTTTATCTG 20640 

TTTCAGTTTT AGCGCCTGCA AAGATACCGT TTGTCACATC GTCACCTTCT TCATCAGTGA 20700 

CATCCAAACC TTTTTGGAAA AGTTCTGACA AGTAACGGTA GATAACAAGA ACACGTTCCA 20760 

AGCTTTCTTC AACATTACGG TATTCACCAG CTTCTTCTTC GATTTCACTA TTTTGAAGGA 20820 

ACTCTGTCAA TGTAGAGAAT GGGCTTCCAC CGAGTGTAAT CAAGCGTTCA CTCAT7TCAT 20880 

CCAATTGACC GTCAAGAGC7 TCCATGTACT CATCCATTTT 7GCATGCCAT ACAAGGAAAC 20940 

CACGACCATG C AT AT AC C AG TGCACTTCGT GCAAAGCAAC GTGAGCTACA TACAAATCAG 2 I COO 

CAACAGCTTG GTTCAAGACT TCCTTTGTTT TTGCCAATGC 21040 



J2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2387 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
( 0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

ATTCTTAATA CGATTAAAAG GCTTATTACT AAAAGAAAA? T7CAGTTACA TGAACTAAAC 60 

TTGCTCCTCA AATCCCGATT TAACGAGATG TTTGGGGAAA ATAAAATATT TGAAAGCATT 120 

GATAACTTAT TTGA7ATTA7 AGATCGTGAT AGGCCCAAAA ATTATCC7AA ATCAGATCAG 18C 

TTGTTTAGTG AGGAGTACTG TTTATTTTTA AATACAAAGA A7GTTACTAA AAACGGATTT 240 

TCATTCGATA CAAAGCAATT TATCACTAAA ACAAAGGATA AATTACTTCG AAAAGGCAAA 300 

CTTGAGCGTT ATGATATAGT CTTGACAACA AGAGGTACTG TTGGAAATGT AGCGTACTAC 3 60 

GATGAATTAA TAAAATATAA ACA7TTACCT ATAAATTCAG GTATGGTAAT ATTACGTCCC 4 20 

AAGACACCAA ATCTAAATCA GAAATTTATT ATCCATGTTT TAAGGAATAA TAATTATAGT 480 

CGAGTGATAT CAGGAAGTGC TCACCCTCAC TTACCAATTA CAAAATTAAA AAAAATACTT 540 

CTCCCCCTCC CCCCACTAGC CCTCCAAAAT GAGTTCGCAG ACTTTGTAGT CCAGGTCGAC 600 

AAATCACAAT TGGCAATCCA AAAATCTCTC GAAGAAC7TG AAACTTTGAA GAAATCTCTG 6 60 

ATGCAGGAGT ATTTTGGCTG ATATTCTGCC ATTGTAATTA CGGTAATGAT TTGTTATAAT 7 20 

ACTTCAAAGG AGGAAATCAG ATGGTAGTAA AAACAAGAAA ACAAGGAAAT TCAATCACCA 7 80 

TTACGATTCC AAG7GAA7TT AATATTCCAA GTCC7C77AA ATACCAACCG AAATTGTTAC 840 
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CAAGTGGTGA 


GATTATCTTT 


ACTCCTGAAG 


AATTGGGGCA 


GCAGGTTTCT 


TATGTATCTG 


9C0 


ATGATGCCTT 


TGACTTAAAT 


TTAGATAAAA 


TATTTCACGA 


ATACGACGAT 


GTTTTCAAAG 


950 


gtttg gtgg a 


AAAATGAGAA 


TCTATTTGAC 


AGAAAAGCAA 


ATTGAAAAAA 


TAAATGCTTT 


1020 


A. fir* A AT^PAA 


CGGTATTrTC 


CAAATGAGAA 


AATTCAAACA 


GTTAGTCCTT 


CTGCCTTAAA 


1030 


l A 1 yn 1 lulu 


AAf^TACCAG 


AACAATTTGT 


CTTTGGGAAG 


CCTCTTTATC 


CAACAATTTT 


1140 


L on i rt/v\o^.rt 


AGGATAGTAT 


TTGTCCAATT 


GATAAAGAAG 


CATGTTTTTG 


CTAATGCTAA 


1200 




GCTTTCTTCG 


TTTTGGTCAA 


ATTTTTACAA 


TTAAACGGCT 


ATCGTTTTTC 


1260 


T.T 1 A agggt A 


G AAG AAG C AG 


TAAAAATGTG 


TCTAACCATC 


GC ACT AG AAG 


CTTTAACTGA 


1320 


Tf, A A A A A A TG 


AC AAG CT ACT 


CCAAATGGAT 


TTCTGAACAT 


TCTGTTAGAG 


AAAAGGTCAA 


1390 


AAAGTAACPT 


ACTA m GCTCG 


ATTTGAATGA 


GCACAAGAAA 


ATAAATGAAC 


AGACAATATT 


1440 


AGAATTGTGT 


AA m GC AC AAA 


CTGATATTGT 


CTCTTTTTAT 


TCATGAATAA 


GAAAGTGAGA 


1500 


AATTATCftAA 




CAATTATCAT 


GCAAGAAATG 


TTACCTCTTT 


TAAATAATGA 


1560 


rtLArt 1 I f\\— L A 




AGAGTTTAGA 


ACATCATCTA 


GTAGACGGAA 


AAAAGCAGCA 


1620 


PAAflTATTfT. 


AATAATAACC 


TGTTGCAACT 


ATTTATTACC 


GCCAAGCAGG 


TAGAGCGCTG 


1630 


T Az-fT^a AAA 




A m TATCAGAG 


GACGATTGAA 


AACTTGTTTA 


ATGCTATTAA 


1740 




ArAPAAr TCA 


CAACAGATGA 


TTTAAGGAGT 


TATTTAGCAA 


ATTACCAGTC 


1800 


■ TYIAAA AflflAT 


TGTAGTAAGG 


CAAATTTAGA 


CAATATTACC 


CGTATATTGT 




1863 




C AGG A AGAGG 


ATATATCATT 


AAAATTCCCA 


TTCGACGGAT 


ACAGAAAATT 


1920 


AAG ACTG AG C 


AAAATGTGAA 


GGAAACTTAT 


ACTGATGAAC 


ATTTGGAAAT 


TATGCGTGAT 


1930 


AACTGTGAAA 


ATTTCACACA 


TTTGGCAATA 


ATAGACCTAC 


T AoCAT>- onv. 




2 OA C 


GTAGGGGAGC 


TTGTACAGTT 


GAATCGTTCA 


GATATTGATT 


TTGAAAACAG 


AGAGTGTGTT 


2100 


GTCTTTGGTA 


AAGGAAAGAA 


GGAGAGACCA 


GTATATTTTG 


ACGCTCGTAC 


GAAAATTCAT 


7160 


TTAAGAAATT 


ATCTTAACCA 


C A G AAAAG AT 


AGTCACCCTG 


CTCTTTTTGT 


AACCCTAGTT 


2220 


GGAAAAGTCC 


AGAGG CTTGG 


AATTGCTCGT 


GTAGAGATTC 


GCTTAAGAAA 


GTTAGGACAC 


2280 


AAACTCGGCA 


TACAAAAGGT 


TCACCCACAT 


AAGTTCAGAA 


GAACTTTAGC 


GACTAAGGCA 


2340 


ATTGATAAAG 


GTATGCCTAT 


CGAACAAGTC 


CAAAAACTGC 


TAGCTCA 




2337 


(2) INFORMATION FOR SEQ ID NO: 57: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 10669 base pairs 
(3) TYPE: nucleic acid 
< c > STRANDEDNESS : double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

ATATTAAAGC CACTTTCTGT GCGCTAGGGA AAAATCTTCC TGGGAATGAG GACTTGGTGA 60 

AGAGGATAAA ATCTGAAGGT CATGTTGTTG GAAACCATAG CTCCACCCAT CCGATTCTCT 120 

CGCAACTCTC TCTTGATGAA CCTAAAAAGC AGATTAC7GA 7ACTGAGGAT CTGCTAACTA 13C 

AAGTGCTCCG 7TCTAGTTCT AAACTCATGC CTCCACCTTA TGGTGCTATT ACAGATGATA 24 0 

TTCGCAATAG CTTGGATTTG AGCTTTATCA TGTGGGATGT GGATAGTCTG GACTGGAAGA 300 

GTAAAAATGA AGCATC7AT7 TTGACAGAAA TTCAGTATCA AGTACC7AAT GGC7CTA7CC 3 60 

TTTTGATGCA TGATATTCAC AGTCCGACAG TCAATGCCTT GCCAAGGGTC AT7GAGTATT 4 20 

TGAAAAATCA AGGTTATACC TTTGTGACCA TACCACAGAT GCTCAATACT CCCCTAAAAG 480 

CTCATGAGCT GTACTATACT CCTCATGAAT AAGCAAGAAA AAATAGCTCT CTTAGATA7T 540 

TGACAGACTT ATTTTTTACA GAATATAGTA CTACTTAAAA AATC7TTTAT GCTATAATTC 600 

ATGAATAAAA TAGAAGGAGA AGCATATGAA TACCTATCAA TTAAATAATG GAGTAGAAAT 660 

TCCAGTATTG GGATTTGGAA CTTTTAAGGC TAAGGATCGA GAAGAAGCCT ATCGTGCAGT 720 

GTTAGAAGCC TTGAAGGCTG GTTATCGTCA TATTGATACG GCGGCGATTT ATCAGAATGA 780 

AGAAAG7GTT GGTCAAGCAA TCAAAGATAG CGGAGTTCCA CGTGAAGAAA TCTTCGTAAC 84 0 

TACCAAGCTT TGGAATAGTC AGCAAACCTA TGAGCAAACT CGTCAAGCTT TGGAAAAATC 900 

TATAGAAAAA C7GGGCTTCC ATTATTTGGA 7TTGTATT7C A7TCATTGCC CCAACCCAAA 960 

ACCGCTCAGA GAAAATGACG CATGGAAAAC TCGCAATGCG GAAC7TTCGA GAGCGATGGA 1020 

AGACCTCTAT CAAGAACGGA AAATCCCTGC TA7CGGCG7T AGCAATTTTC TTCCCCATCA 1090 

TTTGGATGCC TTGCTTGAAA CTGCAACTAT CGTTCCTGCG GTCAA7CAAC TTCCCTTGCC 1140 

GCCAGGTCTG TATCAAGATC AAGTCGTAGC TTACTSTCCT GAAAACGGAA 7TTTATTGCA 1200 

AGCTTGGGGG CCTTTTCGAC AAGGAGAACT GTTTGATACC AAGCAAGTCC AAGAAATAGC 1260 

AGCAAATCAC GGAAAATCGG TTGCTCAGAT AGCCTTGGCC TGGAGCTTGG CAGAAGGATT 13 20 

TTTACCACTT CCAAAATCTG TCACAACCTC TCGTAT7CAA GCTAATCTTG A77CCTTTCG 13 8C 

AATTGAACTG AGTCATGAGG AGAGAGAAAC CTTAAAAACC A77CCTCTTC AATCGGGTCC 1440 

TCCACGAGTT GATGATGTGG ATTTCTAGAA AA7CATAAAA AGAATTCTAC AT7ATTCTAA 150C 

TTTTTGATA7 AATACTCACC AGGAAAGAAA GTCT7ATGGC G7TCT7CAAG CGAGCTTGGG 1560 

ATAG7GGGAG CCAAGTAGGG CAAAA7AAAG GGCTGGCGCT TTC7CTACTA 7T77CAAAAA 1520 
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CAATGAAGTA 


ATAAATTAGG 


GTGGAACCGC 


GTTTCTGACG 


; CCCCTAGGTT AAATCAACCT 


1630 


AGGATTCTCA 


GATGTGCTTC 


TTTTGCTTAT 


TCAGTCTATT 


GTGTGAAAGA 


AAGGAGAGCC 


1740 


GTGGACAACC 


TTTATCTTGT 


AAAAGACGAT 


AGTCAACTAG 


CTACATTTCG 


TGATTTTGTA 


L800 


GTAAGAAATA 


CTGAAAAGTT 


GAAAGATTAT 


CAATCTTTTT 


TAAAGAATGA 


ACTTGCAGTC 


I860 


TGTGATTTAC 


CGCAAGCTGT 


TAT7TCGTCA 


GATTTTAATG 


CTGCTACACA 


GATTATTAGG 


1920 


GAAAGTGCTG 


TTCCAACCTA 


TACAAATAAT 


AGACGAGTGG 


TTATGACCCC 


TGATTTAGCT 


1980 


GT7TGGAAAG 


AATTGTATTT 


GTATCAGTTG 


ATGGACTACG 


AGTGTTCTGA 


GCAAACTCAA 


20-40 


GCAATAGAAA 


CTCACTATCA 


TTCTTTATCT 


GAAAATT7CC 


TCTTACAGAT 


TGTAGGACAT 


2100 


GAGTTAGCTC 


ATTGGTCGGA 


CATTTTTTAG 


ATGATTTTGA 


TGGTTATCAC 


TCTTATATCT 


2160 


GGTTCGAAGA 


GGGGATGGTT 


GAATATATTA 


GTCGCAAGTA 


TTTCTTGACA 


GAAGAGGAAT 


2220 


7TCAAGCCGA 


AAAAATTTGT 


AATCAATCTC 


TCGTAGAACT 


TTTTCAGAAG 


AAGTATAGTT 


2280 


GGCATTCA7T 


GAATGATTTT 


GCTTCTTCGA 


CTTATGATAA 


GAACTATCCA 


AGTATTTTTT 


2340 


ATGAATACTG 


GCCCAGCTTT 


TTGACAGTAG 


ATAAGTTGGT 


AG AAAATTT A 


GGTAGTGTAC 


2400 


AAGCCGTCTT 


AGATTCTTAT 


CATTTATGGG 


CAAATACAGA 


AAAAACTTTT 


CCCTTGTTAG 


2460 


ATTGGTTTGT 


TCAGCAGAAA 


TTAATTGAAA 


AAGAAATATA 


AAAACTAAAG 


GAGTAAACAA 


2S20 


TGTCTAAGAA 


ATTAACATTT 


CACTGCATCA 


GTGGCAGAGA 


CCTCCTTACA 


GTCGGGCTGC 


2580 


TCCACGCTCA 


GCACTAGAGT 


GCCTGAGCTA 


GACGCAGTAC 


TAACTCGTCT 


TGCCTCGTAT 


2640 


GATCGACGAG 


GCAGACTCGT 


GTCGCAAGTA 


ATTATTTTTT 


ATTAAGGAGT 


ATTCAATGTC 


2700 


TAAGAAATTA 


ACATTTCACT 


GCGTCAGTGC 


CAGAAACCTC 


CTTACACTCG 


GACTGCCCTA 


2760 


CGCTCAGCAC 


TAGAGTGCCT 


GAGCTAGACG 


CAGTACTAAC 


TCCTCT7CCC 


7CGTATAATC 


2320 


GACGAGGCAG 


ACTCGTGTCG 


CAAGAAATTA 


TTTTTTATTA 


AGGAGTATTC 


AATGTCTAAG 


2880 




TTr a af a a a t* 


1 All I I vj Av- I 




TTTGGAATGA 


CCAAGATTGT 


2940 


ATGCTTATGC 


AGGCTTATGA 


TAATCAAAAA 


GGTGCGGGGA 


CAATGAGTCC 


TTACACTTTC 


3000 


CTTCGTGCTA 


TCGGACCTGA 


GCCATCGAAT 


GCAGCTTATG 


TAGAGCCATC 


ACGTCGTCCT 


3060 


GCTGACGGTC 


GTTATGGGGA 


AAACCCTAAC 


CGTCTCTACC 


AACACCACCA 


ATTCCAGGTG 


3120 


GTCATGAAGC 


CTTCTCCATC 


AAATATCCAA 


GAACTTTACC 


TTGAGTCTTT 


GGAAAAATTG 


3180 


GG AATCAATC 


CTTTGGAGCA 


CGATATTCGT 


TTTGTTGAGG 


ACAACTGGGA 


AAACCCATCA 


3240 


ACTGGTTCAG 


CTGGTCTTGG 


1TGCGAAGTT 


TGGCTTGACG 


GAATGGAAAT 


CACTCACTTC 


3300 


ACTTA7TTCC 


AACAAGTCGG 


TGGATTGGCA 


ACTGCCCCTC 


TGAC7GCGGA 


AGTTACCTAT 


3360 
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GGTTTGGAGC GCTTGGCTTC TTACATTCAA GAAGTAGACT CTGTCTATGA TATCGAGTGG 3 4 20 

GCTGATGGTG TAAAATACCG ACAAATCTTT ATCCAGCCTG AGTATGAGCA CTCAAAATAT 3 430 

TCATTTGAAA TTTCGCACCA AGAAATGTTG CTTGAAAACT TTGATAAGTT TGAAAAAGAA 3 540 

GCTGGTCGTG CATTAGAAGA AGGCTTGGTA CACCCTGCCT ATGACTATGT TCTCAAATGT 3 600 

TCACATACCT TTAATCTGCT TGACGCGCGT CGTGCCG7AT CTCTAACAGA GCGTGCAGGC 3 660 

TATATCGCTC GTATCCGTAA CTTGCCCCGT GTCGTAGCCA AAACCTT7GT CGCAGAACGC 37 20 

AAACGCCTAG GCTACCCACT TTTGGATGAA GAAACAAGAG CTAAACTCCT AGCAGAAGAC 37 30 

GCAGAATAAA GAGAGTGACA AATTACGAAA ATCGGCCAAC AGAGTCAGCC CTGAGCCAGT 3 840 

TGCCGCAGTG ATGAAGGTAT CCT7ACTGAA ACTAAGGATA CTAGGCAAAA TTGGAGACTT 3 900 

TTGGCTCCAA TTTTAGCAAT GAAACAACGA ACTTCCTTGC 7TGCGTGCCA ATCACATAAG 3950 

GCAAACTGCA AAATAAAAAG ATACTTTTCG GAGAAAAAAC ATGACAAAAA ACTTATTAGT 4020 

AGAACTCCCT CTTGAAGAAT TACCAGCCTA TGTTGTTACG CCAAGTGAAA AACAACTAGG 4080 

CGAAAAAATC GCACCCTTCC TCAAGGGAAA ACCCCTGTCT TTTGAACCCA TTCAAACTTT 4140 

CTCAACACCA CGTCGTTTGG CTGTTCGTGT AACTGGTCTT GCAGACAAAC ACTCTGATTT 4200 

AACACAAGAT TTCAAGGGTC CAGCAAAGAA AATTGCCTTA GATAGTGATG GAAACTTCAC 42 60 

CAAAGCAGCT CAAGGATTTG TCCGTGGGAA AGGTTTGACT GTTGAAGATA TCGAATTCCG 4 3 20 

TGAAATCAAG CGTGAAGAAT ATGTCTATGT CACTAAGGAA CAAATTGGTC AAGCAGTTGA 4 3 80 

AGCCATTGTT CCAGGCATTG TGGATGTCTT GAAGTCACTG ACTTTCCCTG TCAGCATGCA 4 4 40 

CTGGGCGGGA AATACCTTTC AATACATCCG CCCTGTTCAC AC7TTAACTG TTCTCTTGCA 4500 

TGAGCAAGAG TTTGACTTGG ATTTCCTTGA TATCAAGGGA AGTCGTGTGA GTCGTGGCCA 4 560 

TCCTTTTTTG GG AC AAG AAA CCAAGATTCA GTCAGCATTG AGCTATGAAC AAGACCTTCG 4620 

TAAGCAGTTT GTAATCGCAG ATCCATGTGA ACGTGACCAA ATGATTGTTG ACCAAATCAA 4 680 

GGAAATTGAG GCAAAACATG GTGTACCTAT CGAAATTCAT GCGGATTTGC TGAATGAAGT 4 7 40 

CTTGAATTTG GTTGAATACC CAACTGCCTT CATGGCAAGT TTTGATGCTA AATACCTTGA 4 8C0 

AGTTCCAGAA GAAGTCTTGG TGACTTCTAT GAAGGAACAC CAGCGTTACT TTGTTGTTCG 4 360 

TGATCAAGAT GGAAAACTCT TGCCAAACTT CATTTCTGTT CGTAACGGAA ACGCAGAGCG 4920 

TTTCAAAAAT GTCATCAAAG GAAATGAAAA AGTCTTGGTA CCCCGCTTGG AAGACGGAGA 4980 

ATTCTTCTGG CGTGAAGACC AAAAATTCGT GATTTCAGAT CTTGTTGAAA AATTAAACAA 5040 

TGTCACCTTC CATGAGAAGA TTGGTTCTCT TCGTGAACAC ATGATTCCTA CGGGTCAAAT 5 ICO 

CACTGTACTT TTGGCAGAAA AAGCTAGTTT GTCAGTGGAT GAAACAGTTG ACCTTGCTCG 5150 
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TGCAGCAGCC ATTTACAAGT TTGACTTCTT GACAGGTATG GTTGGTGAAT TTGACGAACT 52 20 

CCAAGGAATT ATGCCTGAAA AATACACCCT TCTTGCTGGT GAAACTCCAG CGGTGGCAGC 5280 

TGCTATTCGT GAACACTACA TGCCTACATC AGCTGAAGGA GAACTTCCAG AGAGCAAGGT 5 340 

CGGCGCAGTT CTAGCCATTG CAGACAAATT GGATACCATT TTGAGTTTCT TCTCAGTAGG 5400 

ATTGATTCCA TCAGGTTCTA ATGACCCTTA TGCCCTTCGT CGTGCAACTC AAGGTGTGGT 54 60 

TCGTATCTTG CATGCCTTTG GTTGGCACAT TGC7ATCGAT GAGCTGATTG ATAGCCTTTA 5520 

TCCA7TGAAA TTTCACAG7T TGACTTATGA AAATAAAGCA GAGGTTATGG ACTTTATCAA 5580 

CCCTCCTG7T GATAAGATGA TGGGCTCTAC TCCAAAAGAT ATCAAGGAAG CAGTTCTTGC 5640 

AGGTTCAAAC TTTGTTGTGG CAGATATGTT GGAAGCAGCA AGTGCTCTCG TAGAACTAAG 5700 

CAAGGAAGAA GATTTTAAAC CATCTCTTGA AT C ACTTTCT CGTGCCTTTA ACCTGGCCGA 5760 

GAAGGCAGAA GGGGTTGCTA CGGTTGATTC AGCACTATTT GAGAATGACC AAGAAAAAGC 5820 

TTTCGCACAA GCAGTAGAAA CACTCATTTT ATCAGGACCT GCAAGTCAGC AATTGAAACA 5880 

ACTTTTTGCG CTTAGCCCAG TCATTGATGC TTTCTTTGAA AATACTATGG TAATGGCTGA 594 0 

AGATCAGGCT GTCCCTCAAA ATCGTTTGGC AATCTTGTCA CAACTAACCA AGAAAGCAGC 6000 

TAAGTTTGCT TGTTTTAACC AAATTAACAC TAAATAAAAT TTGATAAACG GACTTTATCT 6060 

TATTACAAAG GAGAAGAAAT GGATCCGAAA AAAATTGCTC GTATCAATGA GCTTGCTAAA 6120 

AAGAAAAAAA CAGAAGGCTT AACACCAGAA GAAAAAGTGG AACAAGCCAA ACTACCTCAG 6180 

GAGTACATCG AAGGTTATCG CCCCGCTGTT CGTCACCACA TTGAAGGAAT CAAAATTGTG 62 40 

GACGAAGAAG GAAACCATGT TACACCAGAA AAACTACCCC AAGTACAACC TCAAAAAGGA 6300 

TTACATGCCC GTAGTCTTGA TCATCCAAAT TCATAATAA7 ACTCTTCGAA AATC^-AATTC 6 3 60 

AAACCACGTC AGCTTCACCT TGCCGTACTT AAGTACAGCC TGCGGCTACC TTCCTAGTTT 6420 

GCTC7TTGAT TTTCATTGAG TATATGTATT CTTTCTTTTA ACAAAGATAG ATGAAACGAT 64 80 
AACAAAGAGA CTAGCAGTTT GTGTTTGCTA GTC7TTTTTC GCTAAAAAAG GAACCATAAT 
CCTTCCTAAA AACTATCATT AGTAACTTGC ACCGGCTGTA GCGTCTGCGT CACCACCGTC 

GCCTCCAGCA TCCCCTGAAT CAGAAGCGCC AGAAGTAGCA TCGGCGTCTC CATGACCTCC 6660 

GGCAGCAGGA GCAAATGCTC CGCTACCACC CACCAAACGT TGACCAGTCT CTTTTAGGTA 6720 

CCAGTCAAGC CATGGTTGGA AGTTAAAGAC GATTTCATTG ATACCAGCGT ATGATCCATC 67 80 

AGGATAGTAC ATTGCTTGGT AGTTGTGAGT GTTGATAACA CCTGCAGGAG AACCTGGAAC 6 84 0 

GATCGTACGG ACCTATTCTT GGTTTCCGTT GCGAAGTGTT CCGATAACCC ACTCTACGTT 6900 
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C7TCATACGT GCTGGTGGAA GAGAACCATG AACAGTCGAC ATACGGCTAC CTGATTGAGG 6960 

TGGTACACGT 7TAGCCAACA TAGTGTCTGG ATCTTGGTGA GCGTTGTTGT AGTAGAGGAA 7020 

TTGGTTGTTG TCGTCAGCGT ATGTCAATTC AAATGGCATA CCTTTCAAGA ACATATCAAT 7080 

TTGGTTAACT GTTAGGATAC CGTGGTCCAA TTTGACATAG GTATCACCAG AAACAGCACC 7140 

AGTGAATGCT CCAACTTTTT CTACCCATTC TGGATCGTCA GGGTCAACTT CTGTGATGGT 7200 

TGTAGCGATT GCTT7TCCAC AATCCAAGTC TTCTGATTCG ATTGCT7T7C G77TT7TCAA 7 260 

TTTCGAAACG ACTCCTACGT ATTTAACAAA GTTATCTAAG CAACTTTCAA GGAATTTAAC 7 320 

AGTCCCTTCC TTGGTGATAT TTCCGTTGTT ATCAAAAGCT TCCTTACCT7 TACCAAGAAG 7 380 

GAATTCGTTA CCTGGAAGCG TGTAGGCATT AACACCTCGA GCATCAAGGA TTTTACGAAG 7440 

GTGAACTTGA GCACGTGATG TTCCTTGGTC ATAG7A7GA7 GCACCCACAA TCATAACAGG 7500 

CTTGTTTTCA AATGGATGAA CTTCGTATGA AAGCCATTCA AGTACAGATT TGAGTGAAGC 7560 

TGAGATACTG TGCTTATGCT CAGGAGTAGC AATGATAACA CCATCTGCAC GAGTAATTTT 7 620 

CTTA7A7AAA 7AACCTAATT GGAAACTTTC ATCCCATTTT TCATCTTGCT TAAACA7TGG 7 680 

AACTTCGTCA ATTTCAACAA CTTCTAATTC AAATTTGAGT TTGAAGTAGC GACGGATAAA 77 40 

TTCCAAGAGC TTACCGTTAT ATGATTGATC CTAGTTTGAT CCAACAAGTC CAACAAATTT 7800 

C ATTC TTTTT GGTCTCCTAT CTTACAAATT TTCCCAGTCA AAGTCTTCAG CATCTTTGCG 7860 

AAGTAATTCT TGTGCATTAC GTAATTTTTC TGTGATTTTT ACAAAGATAC GCAAGTCATC 7 92 0 

AAAGATGGCA TCCAATTTCT TGATAACATC AAGGTCAACC AAGTCGCCAC TTGGGTTAAA 7 9 30 

7GC7TCAAGA GAGTGTGAGA GCAAGAATTC ATCTGGAAGA ACATTTGCCT TGATTTCAGG 904 0 

AGCATTCAAG ATTTGACGAA GTTCCAATTG CCCACGAGA7 CAACCAACCG 7ACCS7AAGA 3100 

AGCACCTGTA ATCATGATTC GTTTGTTCAA AAGTGCG7AA ATACCA7AAG ACAACCAAGC 3160 

AAGAGCGCTC ATCAAAACAG CTGGAATAGA G7GATCA7AC 7CAGGAGTAC CGATAA7AAC 32 20 

GCCATCTGCC TCTTCGA77T 7AGCAGCAAT 77CCAATA7T 7CAGCAGCTA C7TGCTTCTC 3280 

AGCTGGTTTG 7TGAAGACAG GAATGGCCTT GATTTCAACA AGTTCAAT7T CAGCTTTGTC 3 34 3 

AGTAAAGTGT TTT7GCATGT ATTGAAGCAA 7TGACGCTT7 G7AGAACG7T TTGAA7TTG7 9400 

7CCAACAATA GCAA7AAGTT TTAACATGAC ATT7CC77TC TC7TTTTACA TAATACAA77 94 60 

7TAAAATTCC ATTGAAACAG TTGTCTCTAT AGAGTAGGAA 7TCCTGAAGA ACAGC77AGG 95 20 

TGGCCTTC7T TATCGATGAG GATGACTTCG A7GCCCTCCA AAC7TTCGAC TTGCCAGAGG 3530 

ATAGAAGCAG GTCTTTCTCC AAAGAGTCGA G7CGTCCAGA TTTCGCCA7C GAC7GATTTA 3640 

7CAGAGATGA TTGTTACAC7 CGC7AGTTCC GTT7CAACAG GA7A7CC7GT TTGACTG7CA 370C 
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AAAATGTGAT GGTAATCTTC TCCATCGACG GTCAGGTGAC CTTCATAAAT GCCTGAAGTC 8760 

ACGACAGATT TAT7CACAAC AGGGATGGTC ATTAAATGAT TTCCCCTAGG A7TCCCTGGG 8820 

TCTTGAATCC CGATTTGCCA TGGGTTATCC CCTCTTCCCT GATTTTTTCC AATGGTCAGG 8880 

ATATTCCCTC CCACATTGAT CAAGCCAGAA GTCACCCCCT CTTTCC7AAG AAATTGGGCA 894 0 

ACCTTATCCG CACTGTATCC TTTGGCTAAA CAACCTACAT CGATCTTCAT TCCTTTCTGT 9000 

TTTAAAAACA CAGTAGAAGT AGAAGAATCT AACTCCATAC CATGAGGATT GATTAGAGGC 9060 

ACCACCCAT7 CAATTTCTTG AGGCTGGGCG ACCTTGGCAT CTGAAAAACC GATACGCCAG 9120 

CTTTGAATTA ACCGACCAAT GCTGATATTG AGCTGCCTAG AGAGCGCTAG GCTATGCTCT 9180 

AACCCAAGTG AAATCAGCTC AAACAGGTCT GGATGAACCG TGACGGGGGC TATTCCTGCT 92 40 

TGATAATTGA TTTCCATCAA CTCAGATTCT TGACTATTGG CGTTCAAGCG GTATTCAAGT 9300 

TCTTTGAGCA AGTCAAAGGA TTTTTGGAGA AAGATATCGG CTTGCTCATC CACTAATGAA 93 60 

ATAGTGATAG TAGTCCCCAT TAGCCGTTCA GAATGTGAAC GAAGAGTCAA CCTACCAACT 9420 

CCTTTCTCTT ATAGAAAATA AGTTGTAATA TCAAATAATC ATCTAAATTG AAGCCCTTAC 94 80 

ATTTCATTTT CATGTTATTA TAATACCATA AAGTTAGAAT TTTCACAAAC AAAATTTGGA 9540 

AAAAGTCAAG AAATATGCTC ATAAAATTCA TCAGCCTTGA AAACAGGATA AATGGGGAAT 9600 

TATTTTTGAT AAAAAATGCT GAAATAATAG TACCCCCCTT GTAAACGCTA ACGGTAAATG 9660 

GTATACTAGT AACCTAAATT TAGAATGAAG GCACGAAATT TTTATCAGTA AAATCGTTGT 9720 

AGTCCCTCC7 AACCACGCTG CTACAGCATC TATCAATACC ATGTTGGATA ATTTTGGAAA 97 80 

TGAGAACGAA ATTCTTCTAT TTGACCAAAA CTCTAACATC TCTTTCCTAG GATGTGGAAT 9 940 

GCCTCTTTGG ATTGCTGAAC AAATTGACGG TGCTCAAGGC TTCTTCTATT CTGATAAAGA 9900 

AAAATTGGAA CCTAAAGCTC CTAAAGTTTA CATGAACTCA CCTCTTCTTT CAATCGACTA 9960 

TGATAACAAA GTACTTACAG CGGAAGTTGA AGGAAAAGAC CACAAACAAT CATACGAAP A 10020 

ATTGATTTTC GCTACAGGCT CTACACCAAT CTTGCCACCA ATCGAAGGTG TTGAAATTGT 10080 

TAAAGGAAAC CGCGAATTTA AAGCAACTCT TGAAAACGTA CAATTCGTGA AATTGTACCA 10140 

AAATGCTGAA GAAGTTATCA ATAAAC7TTC TGACAAGACC CAACACCTCG ACCGTATCGC 10200 

CGTTGTTGGT GGTGGTTACA TCGGTGTTGA ACTTGCTGAA GCCTTTGAAC GTCTTGGAAA 102 60 

AGAAGTTGTC CTTGTTGATA TCGTTGATAC TGTCTTGAAC CCTTAC7ATG ACAAAGACTT 10320 

CACACAAATC ATGGCGAAGA ACTTGGAAGA TCACAACATC CGCTTGGC7C TAGGTCAAAC 1C380 

7GTTAAAGCA ATCGAAGGTG ACGGTAAAGT TGAACGCTTG ATT AC TG AC A AAGAAAGCTT 10440 
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TGACGTGGAT ATGGTTATCC TTGCAGTTGG TTTCCGTCCA AACACAGCCC TTGCAGGTGG 10500 

TAAGATCGAA CTCTTCCGCA ACGGTGCCTT CCTTGTAGAC AAGAAACAAG AAACATCTAT 105 60 

CCCAGACGTT TACGCTGTTG GTGACTGTGC GACTGTTTAT GACAATGCTC GTAAACATAC 10620 

AAGCTATATC GCTCTTGCTT CAAATGCTGT GCGCACTGCT AACGTTGGT 1066 9 
(2) INFORMATION FOR SEQ ID NO: 53: 

{il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7542 base pairs . 

(B) TYPE: nucleic acid 

( C > STRANDEDNESS ; double 
(Di TOPOLOGY: linear 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

CGCGCTAATA GATACTTTAT GATAGAATAA AGAACAAGAT TGACAAGTAA GAGGAAACAT 60 

TATGCAAAAT CAAACACTCA TGCAATACTT TGAATCGTAT CTCCCCCACG ACGCTCAACA 12 0 

CTGGACGCGT CTGGCTGAAA ATGCTCCACA CCTAGCTCAT CTGGCGATCA GTCACGTCTG 180 

GATCCCACCA GCCTTCAAGG CAACCAACGA AAAAGATCTC GGCTATCGGG TCTATCACTT 2 40 

ATTTGACTTA GGAGAGTTCA ACCAAAAAGG GACTGTCCGC ACCAAGTATG GTTTCAAAGA 300 

AGACTATCTT CAAGCCATTC AAGCCCTTAA AGCACAGGGA ATTCAACCTA TGGCCGATGT 3 60 

AC7TCTCAAC CACAAGGCTG CTGCCGATCA CACGGAAGCC TTTCAGGTTA TCGAACTTGA 4 20 

TCCTGTAGAC CGTACAGTTG AACTTGGAGA ACCCTTCACC ATCAATGGCT GGACTAGTTT 4 30 

TACCTTCGAT GGTCGCCAAG ATACCTATAA TGCCTTCCAC TGGCATTGGT ACCACTTCAC S4 0 

CGGTACAGAC TACGATGCCA AACCCACTAA ATCTCGCATT TATCTGATCC AAGGGGACAA 600 

CAAGGGCTGG GCCAACGAGG AATTGGTCGA TAACGAAAAC GGAAACTACG ACTACCTCAT 6 50 

GTATGCCGAC CTAGACTTTA AACATCCTGA AGTCATCCAA AACATCTATG ACTGCGCTGA 7 20 

TTGGTTCATG GAAACGACTG GTGTACC7GG TTTCCGTTTG GATGCCGTTA ACCATATTCA 7fl0 

CTCTTTCTTT ATGCGCAACT TCATCCGCGA TATGAAGGAA AAATACCGTG ACGATTTCTA 840 

TGTTTTTGGT GAATTTTGGA ACCCAGACAA GGAAGCCAAT CTGGACTATC TCGAAAAAAC 900 

GGAAGAACAC TTTGACCTTG TCGATGTTCG TCTCCACCAG AATCTCTTTG AAGCCAGTCA 9 60 

AGCTGGCGCA AACTATGACC TTCGTGGCAT TTTCACAGAT AGCCTGGTTG AACTCAAGCC IC20 

TGACAAGGCT GTGACTTTTG TCGACAACCA CGATACCCAA CGAGGACAAG CCCTTCAGTC 1080 

TACCGTTGAA GAATGGTTCA AGCCAGCAGC CTATCCCCTC ATTTTGTTAC GCCAAGACGG 1140 

CCTTCCATGT G7CTTTTACG GAGACTACTA TGGGATTTCA GGGCAGTATG CTCAAGAAGA 120C 
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TTTCAAACAA ATCCTTGACC GCCTCCTAGC CATCCGAAAA CATTTGGCCT ATGGAGAACA 12 60 

AAATGACTAC TTTGACCATG CTAACTGTAT CGGTTGGCTA CGTTCAGCTG CTGAAAATCA 13 20 

ATCCCCAATC GCAGTCCTTA TCTCAAATGA CCAAGAAAAC AGCAAGTCAA TGTTTGTCGG 13 80 

TCAAGAATGG ACTAATCAAA CCTTTGTAGA TTTACTTGGT AACCACCAAG GTCAAGTTAC 1440 

AATTGATGAC GAAGGTTATG GACAATTCCC TGTCTCACCT AGATCCGTAA GTGTCTGGGC 1500 

ACTCAATACC ATCTAATAGC TCATAATAAC CAAGCTAGGT CCAAGCGGAT TTGGCTTTTT 1560 

TGTATTCACA AAAAGACCTA CCCAAATGGA TAGATCTTTA CTTGATTACA ATTTACCTGC 1620 

TACTCCATCC AACAATTCTT GGATCTTAGG TTGGTTGCTT CCTCCTGCCA TGGCCATATC 1680 

TGGTTTACCA CCACCACGTC CATCGATGAT TGGTGCTAAT TCTTTGACAA GGTTTCCTGC 1740 

ATGAAGGTCT TTTGTCTTGC TTGCTACAAG GACATTGACT TTGTCACCGA TAGCGGCAAC 1800 

TAGGACAAGA AGATCAGAGT AGTCTTTTTG TTTCCAGTTA TCTGCAAAAG TACGAAGGGC 1860 

ACCGGCATCC GAT AC AG AC A CTTCACTAGC AATGTAACGA TGACCGTTGA CTTCCTTAAC 19 20 

ATCTTTGAAG ATATCGCCTG CGGCTGCAGC TGCGGCTTTT TCTTTCAACT CAGCATTTTC 1980 

TTTTTGAAGT TGACGAACTT GTTCTTGAAG TCCTTCTACC TTGTGAGGTA CTTCCTTGAC 2040 

TTGAGGTGCT TTCAAGGTTG CTGCGATAGC TTTAAGAGCA TCCTCTTGTT CACGATAGGC 2100 

TTCAAAGGCT TCCTTACCAG TCACTGCCAA GATACGGCGA GTTCCTGAAC CGATTCCTTC 2160 

TTCTTTGACA ATTTTGAAGA GACCAATCTC AGAAGTGTTG TCAACATGAG TACCACCACA 2 220 

AAGTTCAATA GAGTAGTCAC CGATAGTCAC GACACGAACT TCCTTGCCGT ATT7CTCACC 2 230 

AAAGAGGGCC ATACCTCCCA TTTCTTTAGC AGTGTCAATA TCCGTTTCAA CTCTCTTCAC 2 340 

T7CAAGTGCT TCCCAAATTT TCTCGT7AAC TTGCTGTTCA ATCGCACGAA GTTCCTCAGC 2 4 00 

AGTTACTGCT TGGAACTGGG TAAAGTCAAA GCGAAGGAAT TCAACTTCGT TAAGAGATCC 2460 

TCCCTCTCTT GCGTGGTTTC CAAGGATATT GTGAAGGGCA GCGTCAAGCA AATGAGTCGC 2 520 

AGTGTGGTTT TTCATGACAC GGTGACGGCG ATTGCTATCA ATTGCCAAGG TATATTCTTG 2580 

GTTCAAGGCA AGCGGTGCAA GGACTTCAAC TGTATGAAGG GCTTGACCAT TTGGGGCTTT 2640 

CTGAACATTG GTCACAGTAG CCACAACCTT ACCTCACTCA TCCAAGATTT GTCCGTAGTC 27 00 

AGCTACCTGT CCACCCATTT CAGCATAAAA TGACGTTTCC GCAAAGATAA GAGAGGCAGT 27 60 

TCCTTCTGAA ACAGCTCCTA CTTCTGCATT GTCAGCAACC ATAGCTACCA ATTTAGAAGA 2 820 

CAATTGGCTA GCATTGTAGT TGAAGACACT TTCTACAGTG ATGTTTTCAA GAGTTTCATT 2 980 

TTCCATACCC ATTGAGCCAC CCTTGACAGC TGACGCACCC GCGCGTTCTT GCTGTTCTTT 2940 
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CATGGC7CCT TCAAAACCTT CACGGTCTAC AGTCATACCA GCTTCTTCAG CCATTTCTTC 3000 

AGTCAA77CA ACTGGGAACC CATAAGTATC ATAGAGTTTG AAGACATCTG AACCAGCGAT 3060 

AACAGATTGA CCTTTT7CTT TCAAGTCTGC TACAATGCCT TGGGCAAAGT GTTGACCTGA 3120 

GTCAAGGG7A CGGGCAAATG ATTCTTCTTC GCTCTTAACG ATTTTCTCAA TAAAGTCACG 3180 

TTTCTCAAGC AC7TCTGCCT AGTAGCTTTC CATGATTTTT CCAACAGTTG GAACCAATTT 3 240 

GTAAAGGAAA GGCTCGTTGA TACCCAATTT TTGACCATGC ATAGAAGCAC GACGCAGAAG 3 300 

ACGACGAAGA ACATAACCAC GACCTTCATT TCCTGGAAGG GCACCATCAC CGATAGCAAA 3 3 60 

TGAAAGAGAA CCAATGTCCT CTGCGATAAC CTTGAAGCTC ATG7TGTCGC CATCTTGGTC 3 4 20 

ATAAACCTTA CCAGACAATT TCTCCACTTC ACGGATAATC GGCATGAAGA GGTCCGTTTC 34 80 

AAACTTCCTC TTAGCCCCTT GGA7AACGGC CACCAAACGC TCCAAACCAG CGCCCGTATC 3 540 

AATGTTCTTA TGTGGCAATT CCTTGTATTC GCTACGAGGA ACAGCACGCT CTGCGTTAAA 3 600 

TTGTGACAAA ACGATGTTCC AGATTTCAAT ATAACGGTCG TTTTCAATAT CTTCTGCAAC 3660 

CAGGCGAAGA CCCATATTTT CTGGGTCAAA GGCTTCCCCA CGGTCAAAGA ACATT7C7CT 3 7 20 

ATCTGGTCCA GAAGGTCCCG CACCCATTTC CCAGAAGTTG TCCTCAATTG GAATCAAGTG 3 7 80 

ACTTGCA7CC ACTCCCACTT CAA7CCAGCG GTTCTAAGAA TCTTTATCCT CTGGATAGTA 3 840 

GGTCATGTAA AGTTTTTCAG CAGGGAAATC AAACCATTCA GGGCTTGTCA AAAGCTCATA 3 900 

AGCCCAAGTG ATAGCTTCGT CACGGAAGTA ATCCCCGATA GAGAAGTTCC CCAGCATTTC 3 960 

AAACATGGTA TGGTGACGCG CGGTCTTCCC 7ACGTTTTCG ATG7CGT7GG 7ACGGA7ACC 4 020 

CTTT7CCGCA 77GG7AA7AC GTGGAT7TTC ACGGATAATC GTCCCGTCAA AGTATTTC77 4 080 

AACGCTTGCT ACCCCAGAG7 TGA7CCACAA AAGAGT7GGC TCAT7TACAC GAACCAAACT 4140 

TACTGATGGT TCTAC7CAG7 GACC7T7GGT CGCCCAGAAA 7CAAGCCACA TTTGGCGTAC 4 2 00 

T7GTGCACTA GATAGTTGT7 TCATAT7GTC 7CCTTATTCA CTTGT7TAA7 G7CATTGGC7 4 2 60 

TTCCAGCAT7 7CCACATAGT CAATCGCGAC ACAGAGGGAA ATGACTAGG7 C7GCA7AAGC 4 3 20 

CTCTTCAAGA ACCGTTACGG TATAGCTAGA AG7CACATGG AAGAGTTCC7 TCTTAA777C 4 380 

CGCAATCAAC TGATCGCCAT CATCCAGCAA T77GAAA7TC AAATCCCAGA TATTGCCCTC 4 4 40 

GATACGAAGA CCTAGATTA7 CAAACTCATA C7TATC7CGC CACAAGG7CA ACTTC7TACG 4500 

AATGACAAAA CTCCACCCAT CCCGAAGC7G AATT7CAAAA CGAGGAAGCA ACCTCAAGAT 4 560 

TTCTT7AC7A ATC7CAC7CA CTTGTTCACC AGCCGCATCA TAGATGG7AA AGGTTT7AGG 4 620 

AATCT7AAAA AATGA7CCCT CCACCTGATA GGCAATTTCT CCCCTGTCAT CCTTGA7AGC 4 680 

GAAGCG77CG CCTCCAAGAC GAAACTTTTG TTTGACAACA AATG7TTTCA TCAACACC7C 4 740 
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CAAAAATCAA AAGACAAGCT CATATCACGA ACGSCCAAAA ACCGCGGTAC CACCTTCATT 4 800 

CAATGAACTT GTCATTCTCT TGTTCTTATG CAATTGTATG ATTGAGTAGC ATGACTTCCT 4860 

AGCTTAGATG GC7CGCAGCA CCGCCATTTC TCTGGACTAA GACAAGTGAA AATCAATTCT 492 0 

CAACTTTCTT ATTATAACGT TTTTTTAAGC TTGCGTCAAC TGCAAATGAT CTCCGTTGAA 498 0 

TTAGACCAAT TCCCTACATC TCTGATTACT TTTTCAGGAT ATATTTTTTC TTACTGCCAT 504 0 

TTTTCTTTTT ATCCCAAATT TTCATATTAC TAAACACAGC TACTAGAATA TTTCCAAATA 5100 

TAAAGGTGCC TATCACCCAA TATATGGACT CACTTGTTAG GTATTGTCGA TCCAAGCCAT 5160 

CCTTTAAATG GAATAGTATA GCACTTTCCT TAACAATCAT AAAGGTTGGC CACAAACTTT 52 20 

TTTTGAAAAA AG TAG AC ATT TTCATTATTT GTTGCCGCTT TCTGTAAGGT TAATACTCAA 52 80 

TAAAAATCAA AAAGCAAACT AGGAAGCTAG CCTCAAGCTG TACTTGAGTA CGGCAAGGCA 5340 

ACGCTGACGT GGTTTGAAGA GTATAGGCTT AGTATACTAC TAGGCAAGCA AATAAACAAA 5400 

TAAACAACTA GAATAGAAAA AGATAGGGCT CTAAAAACTG ACTTCTATTC CTTAAAAACG 54 60 

AACCACCTTG ACTGATTCGT CTTCTTACGT TTATCTCCTA CTTCCGATAC ATTTTAAACT 5520 

GTAGGAAGAG GTCGCTATAT TTCCCTGTCC ATTTATGGTC AAATTTCTCA TAAACTTCTA 5530 

GGTGTTTCAT GGTTTCAACA TCGGGATAGA AGGCCTTATC TTCCTTTGTT TCCTCTGGGA 564 0 

GCAATTCCTT CGCTGGTAGG TTTGGTGTTG AATAGCCGAC ATACTCCGCA TTTTGGAGAG 5700 

CATTTTCACC TTTCAACATA AAGTTGATAA AGGCATAGGC TGACTTTTCC TTTTTAACTG 57 60 

TTTTGGCAAT GACCATATTG TCAAACCAAA GATTGCTGGC CTCTGTCGGT ACCACATAAC 5320 

GTAGATTTTC ATTTTTTTCT AACATTTGGC TGGCTTCACC AGAGAAGGTC ACGCCCATTG 5880 

C AAC ATT ATT CTSAATCATA TACCCCTTCA TCTCGTCCGC AACGATAGCC TTGATATTTG 5 9 40 

GAGTCAGTTT GTAGAGCTTA TCCACTGTCT CTTCCAACTG CTGCAGATCC TTGGAGTTCA 6000 

GGCTGTAGCC GAGGGAATTG AGTCCTAGTC CCAGCACCTC ACGCGCCCCA TCAAAGAGCA 6060 

TGATAGAATT CTTATACTCC GGCTTCCAAA GGTCATCCCA ATGCTCAGCC GCTTCATCTA 612 0 

CCATGGTTTC GTTGTAGACA ATTCCTAAGG TTCCCCAGAA GTAAGGGATG CACAATTTAT 6180 

TACCTGCGTC AAAGGACTGG TTGAGAAACT CTGGTCCGAT ATTTTCGATT CCTTCAATTT 62 4 0 

TTGAATAATC AAGCCGAACC AAGAGGTCTT CGTCCTTCAT CTTGTTAATC ATGTATTCAC 6 300 

TTGGAATGGC AATATCGTAG GTCGTTCCAC CCTCCTTTAT CTTAGTGTAC ATGGCTTCGT 6 3 60 

TGGAGTCAAA AGTCTCGTAC TGAACTTGAA TTCCTGTTTC TTCTGTAAAC TGAGTCAAGA 64 2 C 

GTTCAGGATC GATATAGTCT CCCCAGTTAT AGATAACCAA TTTTTGACTA TCTCGACTAT 64 80 
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TGATTTTACT ATCTAAATGA C7CGCAATTC CCCACAAGAC AAGGATAATC GCTGCAATTC 6540 

CTGCTAAAAA TGAATAGATT TTTTTCATCC TTGCTCCTCC TTC7CACGAG AGATAAAGTA 6600 

ATAACCTACA ACTAGGATAA TACTAAAGAG AAAGACTAGA GC AG AC AGGG CATTGATTTC 6660 

TAAGGAAATC CCCTTGCGAG CACGAGAGTA AATCTCGACT GATAGGGTTG AAAAGCCATT 67 20 

TCCTGTTACA AAGAAGGTCA CGGCAAAGTC ATCTAACGAA TAGGTGAAGG CCATGAAATA 67 80 

ACCAGTAATG ATAGACGGAG TCAGGTAACG AAGCATCATT TCCTTGAACA TCTGAAATTC 6340 

ACTAGC7CCC AAGTCATAGG CCGCATGAAT CA7GTCGCCA TTCATTTCC? 7CAG7CGAGG 690C 

CAAGACCATC AAGACCACGA TAGGAATGGA GAAGGCCACG TGACTAGATA GAACGGTCAA 6960 
AAACCCAAGT CAAAACTTGA GTTGCGTAAA GAGAATCAAG AACCTACCAC CAATCATAAC 
GTCAGGCGCA ACCATGAGGA TATTATTGAG TGATAGAAAG GCTTCTTCGT ATT7C7TACC 

AGACTGGTAG A7GTAAATGC CACCAAAAGT CCCGATAATG GTCGCTATCA AGGCTGATAG 7140 

GAAGGCCAAG AAAAATGTCT GAGCCAAAAT CAGCATGAGT CTCCCATCTC CAAACATGGT 7200 

TTCAAAGTGA GTCCAGCTAA AACCTGTAAA GC7ATTCA7A 7CATCACCAG CATTAAAGGC 7 260 

ATAGCCAATC AAGTAAAAGA 7AGGCAGG7A GACGACCAGA AAGACCAGTC CCACATAAAG 7 3 20 

CTTGCCAAAT TT7TTCATCG TTCTCTCCTT TCCTTAG7CA CCCACATGG7 GATGAACA7G 73 80 

G7CAGGATGA GAATCACACC CATGCTTGAA CCCATACCAT AGTTGTCA7T GGTTAGAAAA 7440 

TTCTGCTCAA TAGCCGTCCC CAAGGTGATA ACCCCT7CCC ACCAATCAAA CGGGTCAGCA 7500 

TGAAGAGACT CAAAC7TGGG ATAAAGACCG ACTGAACCCC GG ? 542 
(2) INFORMATION FOR SZQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
{ D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
AAAACCAAAT TCCGGTATTT TAACCTATGC 7GTAAATACC ATGAAGTCTC TCATGACAGA 

TCAGGTCTAT AACATTAAGG TTGACACAGA AAATGGAAAT TATGTTGGTG AAGCTAGCCA 120 

TG7T7TGGTC CTTTTGACAA ATTACTTCGC TGATAAGAAA ATCTTTGAAG AAAACAAGGA 180 

CGGCTATCCC AACATTTTGA TTCTGAAAGA TCCCTC7ATA TTCTCCAAA7 TATCCGTCAT 24 0 

TCCTGATTTA TTAAAAGGGG ATG7TGTCGC AAA7GATAAT ATCGAGTATA TCAAAGCGCG 3 00 

TAATATTAAA ATCTCTTCAG ATAGTGAATT GGAGTCAGAT GT7GACGGAG ATAAA7CAGA 3 60 
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AGAAGGGACi 


TTAGACTGGG 


1080 


A T^^f A T*f 

nl > V.M 1 V. 1 O I 


r* a f a rr**v isr 
LnLALL 1 AAO 




TCTTTGGTGG 


TGATTTACAG 


GGGATTATTG 


1140 


M L I A l \J AA 


TTACTTGCAA 


oao r^roooTA 


rTACTGGACT 


ATATCTTTGT 


CCCATCTTTG 


1200 




r i» TP" jO&ir 


1 ALAA - AOO A 


L AO ATT ACTT 


«/» ft ft ft mn*/^ * /•» 

TGAAATTGAC 


CGTCATTTTG 


1260 


r&r Af a af f a 
OAOAOAAOOA 




OaAoTOOTOO 


ATCAAGCGCA 


TCATCGTGGC 


ft i-M/^ ft ft ft AM A ft. 

ATGAAAGTCA 


1320 




LiO 1 Al 1 ; AA . 


v_ a i a * roo . r 


v_Ot.AATv.TuT 


TCAATGGAAA 


AA TGTCGTCA 


13 80 


AAA ITCPT'*! 
MAAA 1 V«0 1 vj A 


a p" a fT*fT*P" P**** 
AL AOTv_TOO i 


T»IT1A PP « IT* 

r A i AAwA l i 


UG TTCCATAT 


TCAACAATTC 


CoAGTGACAA 


1 440 


v_ - unnAnui. 1 


nu i L AA I AAO 


nununL * 1 AL 




TTTTGGTTTC 


/*• ft /-"* y^ ft •» f-A ft 

OAOOACTATA 


1500 


TCC^T \AG 


AArf\ i AV_ AOV_ V- 


a aTp" \,"*ap"" 

AA i -_AVj t\\ iVJ 


► -AAvjAA I i. A 


TCTTTTAcXAG 


GTTOV.UAOT . 


1 5 60 


ATTflflATTfTA 


AP.AftTTTAA"* 


n i U UA X • l 


PPPPTTTPP A 

VjOV.t* I I IVjVjA 


TPTPPrr ft a t* 
1 0 1 OOO i AA i 


f up a t*»p/"' a f /■* 
OAOA iTviAH. 


icon 


ATCAGTTCTG 


GAAGGATTTT 


CCTAAGGCAC 


TTTTAGCTAA 


AAATCCTGAT 


CTTTATATCC 


1680 


TAGGAGAAGT 


CTGGCATACA 


TCTCAGCCTT 


GGC7AAATGG 


ACATCAGTTC 


CATGCCGTCA 


1740 


TGAATTATCC 


TTTATCTGAT 


AGTATCAAGG 


ACTATTTCTT 


ACGAGGAATT 


AAGAAGACAG 


1800 


ACCAGTTCAT 


CGATGAAATC 


AATGGAGAGT 


CTATGTATTA 


CAAGCAGCAG 


ATTTCAGAGG 


1860 


TCATGTTTAA 


7CTCTTGGAT 


TCACATGATA 


CAGAGCGAAT 


CCTGTGGACG 


GCCAATGAAG 


1920 


ATGTTCAACT 


GGTTAAATCA 


GCCTTAGCCT 


TTC7CTTTTT 


ACAAAAAGGA 


ACACCGTGCA 


1980 


TTTATTACGG 


AACCGAGCTA 


GCCTTGACTG 


GAGGACCAGA 


TCCAGATTGT 


CCTCGTTGTA 


2040 


TGCCTTGGGA 


ACGTGTATCA 


AGTGACAATG 


ATATGCTGAA 


CTTTATGAAG 


AGGCTGATTA 


2100 



WO 98/18931 



PCT/US97/19588 



516 



AAATTCGGAA 


ATACGCGTCA 


GTAATCATTT 


CGCATGGCAA 


GTATAGCCTT 


CAAGAAATCA 


216C 


ACTCTGATCT 


ACTACC7CTG 


GAATGGAAAT 


ACGAAGGACG 


GATCCTCAAA 


GCAATATTCA 


::2o 


ACCAATCAAC 


AGAAGATTAT 


CTTTTAGAGA 


AAGAAGCAGT 


AGCACTAGCA 


AGCAATTGCC 


2230 


AAGAATTGGA 


TAATCAGCTT 


GTCATCTCTC 


CAGATGGATT 


TATGATTTTC 


TAAAAACTAG 


2340 


TTGATGAAGA 


TTATGGTACA 


TTTCATACCT 


TATATAGTAT 


AATAAGGCTA 


GTTACTAAAC 


2400 


TTGTAAAGGA 


GAACTTAAAT 


GAATTGTAGA 


GGACATGAAA 


CAAGACAAAG 


AATTGTTAGA 


2460 


CATTTTGAAG 


TTCAGCCTAA 


AGCACATATT 


AAGCTGTTAG 


CAAATCAACA 


AAAACATAGT 


2520 


GATGCACGAC 


CAACTATTGA 


AGATGAATAT 


TATGTA7TTA 


TCGCTGAGAG 


TAAAATTGAT 


2580 


GGCAAGAAGG 


AAGTTATTCA 


GTGTTGCATG 


GGTGCGGCAA 


GGGATTTTTT 


AGAACTAATT 


2540 


AATCACAAAG 


GCCTACCTCT 


TTTTAATCCG 


CTTGTAGGTG 


ATTCTCATGT 


AAATAATAGA 


2700 


CAAGAATATG 


ACAATACAGG 


GAGTGGAAAT 


TTATAACCTG 


AAAAGTGGAA 


TCAAACTCCA 


2*760 


AAGCAGCTTT 


ATAATGCTAT 


AATGTGGTTC 


ATTATTTTAT 


GGAATGCTAA 


GCCGGATACA 


2820 


CCTTTATTTA 


ATTTTAAAGA 


CGAAGTAATT 


AAGTATAAAA 


CATATGAGCC 


TTTTGAAAGC 


2880 


AGTATAAAAA 


GAGTAAATAC 


TACTATAAAG 


AATGGTAGTA 


AAGGGAAAAC 


TCTGACTGAC 


2940 


ATGATTAATC 


GCTACAGAGC 


GGATAACGAT 


ATT AG A GAT G 


AAATTTGTAA 


CTTTAATATT 


3000 


CTGAAAAATA 


AAATTCCTGA 


TATGAAAAAC 


CAACAAGGAA 


ATACAATGGA 


ATCTTACTTT 


3060 


TAGTTATTGT 


TGAATTTTGG 


GTATTCTATA 


AAATATCCTA 


ATTGACATTT 


AAATAGTAGA 


3120 


CTATACAATA 


TAGTTAAAAT 


ATCAGTAAAA 


ACAACACTTT 


ATTGAGGTAT 


TGGATACGCT 


3130 


TTGCTAATAG 


CCTAATAATC 


ACATGTGCAG 


TGTTGCTACA 


AC G AAAAAGG 


TGATAATCCT 


3240 


TGATTTCAAG 


CTATTTTATA 


AGCATTTTGT 


CTTTGTAGAT 


AAAGGCAATT 


TTGACAATAA 


3300 


AAATCCTAAA 


AGGTGAATCG 


TTATAGATGT 


ATTTGTAGAT 


ATCGTTTGCG 


CATCGAAAAA 


3360 


ATTAATACAA 


GAA7AAATAT 


TTATAGCTCT 


TTAGGTGACT 


TTTATAGAAG 


TAAAGTTTAC 


3420 


GATAGAAAAA 


CAAGAAATAA 


CCCACCATTT 


TTGGTGCGTT 


ATCCTTTTTT 


ATGCTATAAT 


3480 


GCATTTATAA 


AAATAAAGGA 


GTTTGCTATG 


ATTGGAAAGA 


ACATAAAATC 


CTTGCGTAAA 


3540 


ACACATGACT 


TAACACAACT 


CGAATTTGCA 


CGGATTCTAG 


GTATTTCACG 


AAATAGTCTG 


3600 


AGTCGTTATG 


AAAATGGAAC 


GAGTTCAGTC 


TCTACCGAAT 


TAATAGACAT 


CATTTGTCAG 


3660 


AAGTTTAA7C 


TATCTTATCT 


CGATATTGTA 


GGAGAAGATA 


AAATGCTCAA 


TCCTGTTGAA 


372C 


GATTATGAAT 


TGACTTTAAA 


AATTGAAATT 


GTGAAAGAAA 


GAGGTGCTAA 


TCTATTATCT 


3730 


CGACTCTATC 


CTTATCAAGA 


TAGTCAGGGA 


. ATTAGCATTG 


ATGATGAGTC 


TAATCCTTGG 


3840 


ATTTTAATGA 


GTGATGATCT 


ATCTGATTTG 


1 ATTCATACGA 


ATATCTATCT 


AGTACAAACT 


3 900 
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TTTCATGAAA TAG AG AC AT A TAGTGGCTAT TTGGATGGAA TTGAACGTAT GTTAGAGATA 3 9 60 

TCTGAAAAAC GGATGGTGGC CTAATGGAAA TCCAAGATTA TACTGATAGT GAATTCAAAC 4020 

ATCCTTTAGC AAGGAATCTT CGTTCACTGA CAAGAGGAAA AAAGTCCAGT AAGCAACCTA 4080 

TAGCGATTT? GCTTGGAGGG CAAACTGCTG CCGCTAAGAC TACAATTCAT CGTATTAAAC 4140 

AGAAAGAATT TCAACGAAAT ATTGTTATCA TAGATGGTGA TAGTTTTCCT TCTCAGCATC 4 200 

CACACTATTT AGAACTGCAG CAAGAATATG GCAAAGACAG TCTACAATAT ACCAAAGATT 42 60 

TTGCAGGAAA AATGGTAGAG TCTTTAGTAA CAAAATTGAG TAGTTTGAGA TACAATCTTT 4 3 20 

TGATAGAGGG AACTTTACGA ACAGTTGATG TTCCAAAGAA AACAGCACAA CTCTTGAAAA 4 3 80 

ATAAGGGATA TGAAGTACAA TTGGCCTTAA TTGCGACAAA CCCTGAATTG TCGTATCTAA 4440 

GTACTCTTAT CCGTTATGAA GAACTGTACA TTATCAATCC AAATCAAGCA CGCGCAACTC 4 500 

CAAAAGAACA TCATGATTTC ATTGTAAATC ATCTAGTTGA TAACACACGA AAATTGGAAG 4 560 

AACTAGCTAT CTTTGAAAGA ATTCAAATTT ACCAACGAGA TAGAAGTTGT GTATATGATT 4620 

CAAAAGAAAA TACAACTTCA GCAGCAGATG TTCTTCAAGA GTTACTCTTT GGGGAGTGCA 4680 

GTCAGGTAGA CAAGGAGATG TTGCAGGTGG GGGAAAAGAG ACTTAATGAA TTACTTGAAA 47 40 

AATAAACAAT TGATATTTTT AGGAGAATAG AAATGAGAGG CTTTAATAAC AAGATAAAGT 4800 

CTGTTTATCA AGAACTAACA AATTCCAAAC AGAAATTCGG TAGCTTTCAC AAGACTTTAA 4 860 

TTCATTTGCA TACACCTGTT TCTTATGATT ACAAGCTATT TTCTAATTGG ACTGCAACGA 4 9 20 

AATATAGAAA AATTACTGAA GATGAACTAT ATGATATATT TTTTGAAAAT AAGAAAATAA 4 980 

AAGTTGATAA GACAATTTTT TTTAGTAATT TTCATAAGGT TGTTTTTTCT ACTTCAAAAG 5C40 

AATATATTAC TTTTCTTATG TTAGCAGAGG CAATCATAAA AAATCGAATA GAAATAGTTG 5100 

TAGTAACTGA TCATAATACT ACCAAAGGTA TTAAAAAGTT ACAAATGGCA GTCTCAATCA 5160 

TAATGAAAAA TTATCCGATT TATGATATAC ATCCTCATAT TT.TACATGGA GTAGAAATTA 52 20 

GTGCACCAGA TAAATTGCAT ATTGTATGTA TATATCATTA TCAACAAGAA TCATCCGTTA 5280 

ATCAATGGTT AAGTGAAAAT ATTATAAGTG AGAAAGATGG AAGTTATCAA CATTCACTGA 5340 

CTATAATGAA GGATTTCAAT AATCAAAAAA TAGTTAACTA TATTGCTCAT TTCAATAGTT 5400 

ATGACATTTT GAAAAAAGGT TCTCACTTAT CAGGTGCATA TAAACGAAAA ATTTTTTCTA 54 60 

AACAAAATAC ACGATTTTGG AGTTTAATAT TAACTCGAAA GAATCTTCGC AACAACTTGA 55 20 

TATTCTCTAT AAAGAAGTTC GTGTATTAAG TTTGGGACAA AAAGTTGTAG CCATCCTTCA 5580 

TTTTTTATTA GCATATAGTG ATTATTCTAA AGACTTCAGA CCATTGATTA TTGATCAGCC 5640 
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TGAAGACAAT 


CTAGACAATC 


GTTATATTTA 


CACGCATTTA 


GTTCAGCAGT 


TTAGAGATGT 


5700 


GAAAGCTCAA 


CGTCAAATTA 


TTTTAGCAAC 


ACATAATGCT 


ACAATTGTAA 


CAAATTCTAT 


5760 


GACAGATCAA 


GTTGTTATTA 


TGGAGTCAGA 


TGGAGTTAAC 


GGATGGAT7G 


AATCACAGGG 


5820 


ATATGTTAGT 


GAAAAATATA 


TAAAAAATCA 


TATCATCAAT 


CAATTAGAGG 


GAGGAAAAGA 


5880 


TTCCTTCAAG 


CATAAAATGT 


CTATATATGA 


GACGGCTTTA 


TCAGAGTAGA 


GTCAG AAAAA 


5940 


CTAGGTTAGA 


AATTTAGCCT 


ACTTTTTTCT 


TTGTCCGACA 


GGC AT AGTGT 


ACATCTGAGG 


6000 


TCCAAGTCCT 


CTGTGGATAT 


TTGCTGCAGA 


TGAAACCAAT 


AGCGACTCCT 


AAGCCTGAAT 


6060 


ATCGTGAGGT 


AGGGGGGATA 


GGAAGGAATT 


AGCGAAATCA 


AGGTTCTACA 


AACAGAATCG 


6120 


7CACTTGAAG 


C CAT AT AT AG 


CGGATGAGGA 


ACTCTAAAAT 


CCAAATAGGT 


GTCGTAACCT 


6180 


ATATACGTAA 


ATTACGAGAG 


TAAACTAGGA 


AAGATGTACG 


GCTTATTCCG 


TGAGCGTTTA 


6240 


GGACGTAGTA 


CAACGAATCA 


TGGGAGTCAG 


CTGAACACAT 


AGTATTGAAC 


AAATTTCTGT 


6300 


AATGGAAATG 


GAGCGAAGAA 


GTGAACAATT 


AAATGAATAC 


CTCTCTAATT 


AAATTTGTCA 


6360 


ATTCTAATTC 


CTGGTATGAA 


AAGACAGTGA 


CCTGAAAATG 


TAAACGATGG 


GACCTCATCA 


6420 


TAAATATAGG 


ACCCTACATG 


CAGTGGTGTT 


AGACATTACT 


CCTTACTTGA 


TTTGTGATAA 


6480 


CTTCCCCAAA 


TTTCTTCTGC 


TATACTTTTC 


TCAACTTTTA 


AAAATCCAAC 


TAAGAATTTT 


6540 


ACCTGGGGGT 


TTGGGGGCGG 


AGCACTAAGT 


TATCTTATCG 


TTAGCTGTCA 


AAACTGGTAG 


6600 


GTTTTGATAG 


CCTGGCGATA 


TGATTTTTGG 


GATATTGTGG 


ACACAATATC 


TGAGCTCGCA 


6660 


AAGCCTTACA 


AGAATGAAAA 


TCAGTTGTTG 


GAAAAGTGTA 


CTGACATTGT 


ATGGTAGCTC 


6720 


ACATTGTCAC 


TACAACTATT 


TTGGAAAGGA 


AGTAGCAGTA 


TGAAACGAGA 


TGTGCGTGAT 


6790 


ATTCGGAAAC 


AATTTCGTTT 


AACAGAAGCA 


GAAGAAAAGC 


AAATTCTAGC 


TTTGATGAGA 


6840 


G AG C G GGG AG 


AGACTAA7TT 


CTCTGATTTT 


CTTCGTAAAA 


GTTTACTTTC 


CTCTGATTTA 


6900 


CAAAAACAGA 


TGGAGACATG 


GTTTGCCCTC 


TGGCAATCCC 


AAAAACTAGA 


ACAAATCAGT 


6960 


CGTGACGTTC 


ATGAAGTTTT 


AATCTTGGCA 


CAGTCAGAAC 


GTCAAGTCAC 


CCAAGAGCAT 


7020 


CTATCTATTC 


TCTTAACGTG 


CGTGCAGGAA 


TTGATTCAAG 


AGGTTGCAAA 


CACCATACCC 


7080 


CTCAGTAAAG 


AATTTCGTGA 


GAAGTACATG 


AGGTAAGCAC 


ATGGAACATC 


GTTACCGAAC 


7140 


CAATCTCAAG 


AAAGTGTTTT 


TGTCTGATAG 


TGAGTTGAAC 


CAACTAAATA 


TAAATATCGA 


7200 


TCAAAGTGGT 


TGTAAATCCT 


TTTCTGAATA 


TGCGAGACGA 


ACTCTACTCG 


ATCCTGGTAT 


7260 


GAATTTTATC 


ACGATTGACA 


CAAACGGTTA 


CCAACATTTA 


GTGTTTGAGT 


TAAAGAGGAT 


7320 


TGGCAATAAT 


ATCAACCAGA 


TTGCTCGAAG 


TGTTAATCAA 


TCTCAGTTAA 


TTTCTGGTGA 


7380 


AGAATTGCAG 


GAGTTGAAAA 


AAGGAATTGG 


TGAATTGATA 


AAAGAAGTTG 


ATAAGGAATT 


7440 
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TAATCTGCAA GCGCAGAACC TAAAGCAGTT CCATGGTCAT CACTAAACAC TTTGCCATTC 7 500 

ACGGAAAGAG TTACCGCAGA AAGCTTATCA AGTACATTCT CAATCCTGAG AAAACCAATA 7 560 

ATCTTGCCTT GGTGTCGGAC TA7GGCATGA AGAATTTTCT GGACTTTCCT AGCTATGAGG 7520 

AAATGGTGCA GATGTATCAT GAAAATTTCA TCAGCAACGA 7ACGCTTTAC GATT7TCGCC 7 680 

ACGACAGGAT GGAAGAAAAT CAACGAAAAA TACACGCTCA CCACATCATT CAGTCTTTC7 7740 

CGCCAGAGGA TCATATCACT CCTGAACAAA TCAATCGGAT ACG7TATGAG ACTCTGAAGG 7300 

AATTAACTGG TCGCAAA777 CGTTTTATCG TTGCGACCCA TGTTGATAAA GACCACCTGC 78 60 

ACAATCACAT CATTATCAAT TCAGTAGATA GCAA7TC7GA CAAAAAGCTC AAGTGGGACT 7 920 

ACAAGGTGGA GCGAAATCTT CCCATGA7T7 CTGACCGTTT 7TCTAAAA7C GCAGGTGCTA 7980 

AAATCATTGA GAACCCCTAT TCTCACCAGC GG7ATCAAC7 CTATCGTAAG ACTAATCACA 8040 

AGTATGAACT CAAGCAGCGA CTCTATTTTT TGATGGAACA TTCTAGCGAC TTTGAGGATT 8100 

TCAAAAAGAA TGCTCCGCTA CTACATCTCG AGATGGATTT CCGTCACAAG CATGCCACCT 8160 

TTTTTATTAC GG ACTCAACT ATGAAACAGG TGGTGCGTGG CAAGCAACTC AATCGCAAGC 82 20 

AGCCTTACAC AGAAGAAT77 TTTAAGAACT ACTTTGCCAA AAGAGAAA7A GAAAGTCTCA 8280 

TGGAATTTTT ATTGCTGAAA GTTGAGAATA TGGATGATTT ACTTCAGAAA GCAAAACTTT 8340 

T7CGACTAAC TATCAATCCT AAACAAAAGC ATGTTTCTTT TCAATTTGCA GGAGTCGAGC 8400 

TAAAGGAGAC ACAGCTAGAC CAGAAAAATC TT7ATGATG7 AGAGTTTTTC CAAGATTATT 34 60 

TTAAAAATAG AAAAGA7TCC CAACCTCCAG AAACTGAGGA TTTCGTTCAA CTTTATCAAG 3520 

AAGAAAAGTT ATCCAAAGAA AAAGAACTTC CAAGCGATGA GAAG7TCTCC GAGTCCTATC 3580 

AAGAG77CAA GAC7AACAGA GATGCCCTTC ATGAA7TTGA GGTGGAGTTG TCACTCAATC 8640 

AAATTGAAAA AGTAGTGGAT GATGGAATTT ACGTCAAGGT CAAGTTTGGT ATTCGTCAGG 8700 

AGGGACTTAT CTTTGTCCCG AACATGCAGC TTGATATGGA AGAGGATAAG GTGAACCTTT 37 60 

TCATCAGGGA AACCAGCTCC TACTATGTCT ACCACAAAGA CGCTGCCGAG AAAAATTGTT 8820 

ATATGAAAGG TCGAACCTTA ATTAGACAGT TCAGCTATGA AAATCAAACC ATTCCATTAC 8880 

GCAGAAAAGC GAC AGTCGAT ATGATT AAAG AGAAGATTGC GG AAGTCGAT GCTTTGATTG 3940 

AACTGGAAGT AGAAAATCAA TCTTATGTCA CGATTAAAGA TGAGTTAGTG CATGAACTAG 9000 

CAGCGTCTGA ATTGAGAATC AATGAGTTGC AAGAACGAAT GTCAACCTTG AATCAAGTAG 90 60 

CACAATATCT ACTGGCTTCA GTTGAAAGTA AGCAAGAAAT GAAATTAAAT CTTTCAAAAC 9120 

TGAATATAAC TGAGAATATC AGTGCTAATA TTCT7CACAA AAAATTGAAG AGCCTGGGGA 9130 
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ATCAACTGGA ATTGCAAAGG GGCAGGTATG AAAAGATGGT AGT 922 3 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6827 base pairs 
(Bi TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
TCTGCTGGCT ACCATCATCT GACTTGGGCA AGACCAAAGT CTTAGTTACA ACTGTATTCT 60 

TCTCAGCATT TTCAATAACT GGCAATGCCG ACTGAACCGT A7CTTTTTCT GTTTTTGTAG 120 

CTGGTCCAGT TTCT TT TTTC TGTCCGCAAC CAACCAGGAC AAAAAGGAAA GCTAGACTAA 180 

CAAGAACTAT TTTTTTCATT TCTTTCTTCT TTCTTTTTGA AATTAAAATA GAATAAGACT 2 40 

GGGAAGTGCT CCCAGCCTTG ATGTTTATAG AGCTCCACGC AAACGTGCTT CTGCATTTTC 300 

TACATTACGG ACAGAGCGTG GTAGGAAGGC ACGAATATCC TCTTCCTTGT AGCCAACTTG 3 60 

CAGGCGTTTT TCATCTACAA GGATTGGGCT CTTTAAAATT CTCGGTGTTT CCATAATCAG 4 20 

ATTGAGAACT TCATTGACAC TCAAATCTTC AATA7CCACT CCAAGGGCTT TGGCATAGCG 4 80 

ATTTTTAGAC GAAACGATGC TGGCTATTCC GTTATCTGTT TTGGTTAGAA TATCCAGTAA 540 

TTCTTCTCTC GTAATTCCTT CTTTACCAAG GTTTTGTTCT TTATAACTTA ACTGGTGCGC 600 

ATTGAGCCAG GTTTTTGCTT TTTTACAGCT AGTACAACTT GAGACTGTAT AAATTTTAAT 660 

CATGTACCTA CCCCTTTCGC TACATGTTAC TATCAGTT7A GTCTATTATA CCATAAAAAA 720 

CATCCGACTT GCGACCTATT TTTAATTTTT TTTGACTTTT TTCCTCATTT TCGTACTTTT 780 

TTC7TGACAA ACAACTAAAT GACTATCAAC TCTTT7CGAC CTAGGGTCAA TAATTCACAA 940 

CCTGTCTCTG TAATCAGGAT ATCATCCTCG ATACGAACGC CATATTTGCC TTCGATATAG 900 

ATACCTGGTT CATCGCTCAA GGCCATACCT GTCTTAATAG TTTCTGTACA AGTCTGACTA 960 

AAGTAGGCTT CCTCATGGAT ATCCAGACCA ATACCGTGGC CAATGCCGTG AGTAAAGTAG 1020 

TCACCATAAC CTCCCTCAAT GATAATATCA CGAGGGATT: TGTCAAAGTC ACCGAAACCT 1080 

AAGCCTGCCT TAGCTTGGTC AATCAAGGCT TGGTTAGCTT TTAGAACCGT ATTGT AAATC 1140 

TCTGCCTGCT CATCGCTAAC ATGCCCTAGA TAGATAGTCC GGGTCATATC ACTGACATAG 1200 

TGGTCATAGA GACAGCCGAA GTCCATGGTG ATGGCTTCTC CCAACTCCAC TGGTTTGTCC 12 60 

ATTGGATGGG CATGGGGTTT AGAAGAATTG ATACCGCTAC CTAGGATCGT ATCAAAAGAT 1320 

AAGCCAGATG CTCCCAACTC ACGCATGCGG AAATCAAGGA AGTTGGCAAT CTCAATTTCA 13 80 
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GTTTTTCCTG GTTTGATAAA GTCAAGCGCA TCGCGGAAAG CTTCGTCTGA GATAGAACAA 14 40 

GCCTTGCGAA TCGCTGCAAT C7CTCCCTCA TCCTTAATCA TACGAAGACC TTCCACAAAC 1500 

TGAGTTTGTG GAAGCAAGTT CAAACCTGCA AAAGCTGCCT CCATACGGTG GTAATAAGAC 1560 

ACTGAAATCT CATCTTCAAA ACCGATACGA GTCAAGCCCA TGTCCTTAAC AATTCCTGCA 1520 

ATGACAGCCA ATTCATCACG ATCAGCCACA ATCTCAAAAC CACTGGTTTC TTGCTTAGCT 1630 

GCGATGATAT AGCGAGAGTC TGTCACTAAG ACCTGACGGT CACGACTGAT AAAGACTGTT 1740 

CCGTTTGAGC CCCAAAAACC AGTCAAATAA TAGACGTTTT TAAGATTGTT CATGATGATA 1800 

CCATCTAGTT CTTTTTCTTG CATTTTAGCT ACAAATGCTT GTACGCGTTT ATTCATGATG 1860 

TAACTTTCCT TTCAAATAGT GTCCTGTATA GCTGGCTTCG TTCGCAGCTA CTTCTTCTGG 1920 

AGTTCCTGTT ACGATGATGG TTCCACCACC GACACCGCCC TCAGGTCCCA AGTCAATGAT 1980 

ATGGTCTGCC GTCTTGATAA CATCCAGATT GTGCTCGATG ACGAGGACTG TATTGCCATC 2040 

GTCTACAAAG CGAGCTAAAA CCTTGAGCAG GCGAGCAATG TCCTCTGTAT CAAGCCCTGT 2 ICO 

CGTCGGCTCA TCCAGAATGT AGAAAGATTT TCCTGTCGAT CGTTTGTGGA GTTCGCTAGC 2160 

TAACTTCATA CCTTGGGCTT CTCCCCCAGA AAGGGTGGTA GCTGGCTGTC CCAAGGTCAC 2220" 

ATAGCCTAGC CCTACATCCT TGATGGTCTG GACTTTGCCT TGAATTTTCG GAATGTGTTG 2280 " r ~ 

GAAAAATTCT ACCGCATCGT TGACCGTCAT ATCCAACACC TGCGAAATAT TCTTTTCCTT 2340 

GTAGTGAACT TCTAGGGTTT CACTGTTATA GCGGGTTCCG TGGCAAACTT CACAAGCCAC 2 4 00~ 

ATAAACATC7 GGCAAGAAGT GCATCTCAAT CTTCATAATC CCGTCACCTG AGCAAGCTTC 24 60* 

ACAGCGACCT CCCTTGACGT TGAAACTGAA GCGCCCCTTC TTGTAGCCTC GAATCTTGGC 2 520 

T7CATTTCTC TGAGCAAAAA GGTCACGTA? A7CGTCAAAA ACTCCTGTAT AGGTAGCTGG 2530 

CTTAGACCTC GGCGTCCGTC CGATAGCGCT CTGGTCAATA TCAATCAAAC GGTCGACATG 2640 

CTCAATCCCT GTAATAGTCT TAAACTTACC AGGTTTCTCT GAATTACGGT TCACCTTCTG 2 7 00 

GGCAATGGCT TTTTTGAGAA TGCTGTTGAT TAGAC7CGAT TTCCCTGAAC CCGACACACC 27 60 

TCTCACTGCG ATAAATTTTC CTAGTGGAAA GCGAGCCGTG ACATTTTGCA AGTTGTTCTC 2820 

ACGCGCTCCT ATCACTTCAA TAAAACGACC ATTTCCCACA CGGCGCTCTT CTGGTACTGG 2880 

GATGACACG? TTGCCTGACA AGTACTGACC TGTGATAGAC TTGCTGTTGC GAGCCACTTG 2 9 40 

CTTACGTCTA CCTGCTGCAA CAATCTCACC ACCAAAAACA CCGCCACCAG GACCAACGTC 3000 

AATCAGATAA TCAGCCTCAC GCATGGTATC TTCGTCGTGT TCCACCACGA TAAGAGTATT 3060 

GCCCAAGTCA CGCATCTT7T TCAGACTGGC AATCAGGCGA TCATTGTCCC TCTGGTGAAC 312 0 
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ACCGATTCAC GGCTCGTCTA GCATATAGAG GACACCTGAT AGCTTGGAAC CAATCTGGGT 3180 

TGCCAAACGA ATGCGCTGAC TTTCCCCACC TGAAAGGGTT CCTGCTGAAC GTGACAGGGT 3 2 40 

TAGATAGTTA AGACCCACAT TATTAAGGAA GGTCAAACGA TCCTTGATTT CCTTGAGAAT 3 300 

GGGACGAGCA ATCATCGCTT CATTTTCAGA CAAAGTTAAC TGGCTCACCA AGTCCAAGTG 3 3 60 

GTCAGCGATA GACAGGTCTG AGATTTCTCC AATATGTCGC CCTTGCTGGC CGCCCACACG 3 4 20 

GACAGACAAG GCCTGGTCAT TGAGACGATA GCCTTGACAG CTTCCGCAGG TCAGCTCATT 3 4 80 

CATG7AGAGA CCCATCTGAG TGCGAG7GTA ATCGCTATTG CTTTCATGCT AACGACGTTT 3 540 

GATATTATTG ATAACTCCCT CAAACGGAAT GTCGATATCG CGCACGCCAC CAAATTCATT 3 600 

CTCATAGTGG AAATGGAATT CCTTACCATC TGACCCATAG AGAATCAAGT TCTTATCTTC 3 660 

TTCTGACACC TCCTCAAAAG GCTTATCCAT ACCCACTCCA AAGACTTTCA TGGCCTGCTC 3720 

TAACATGTTT GGATAGTACT TGGATGAGAT ACCA7TCCAA CGTCCTAGCG CTCCCTCACG 37 90 

TAAGGTTTTG CTAGCATCTG GCACTACCAA ATCAGTATCC ACCTCCACCT TGATGCCCAA 3 840 

GCCGTCACAC TCACTACAAG AGCCAAAAGG ACCATTCAAA GAAAAGAGAC GAGGCTCTAA 3 900 

CTCTGGGACA GTAAAACCAC AAACTGGACA CGCA7AATGC TCAGAGAACA ACAACTCCGA 3 960 

GTCGTCCATG GTGTCGATAA TGACATAACC TTCTGCAATA CGAAGGGCAC CCTCAATGGA 4020 

ATCAAAGAGA CCACTACGAA TGCCCTCCTT GATAACAATA CGGTCAACCA CGACATCGAT 4080 

ATTGTCTTGC TTGCTCTTAG ACAACTCTGG CACTTCGGTC ACATCATAGA CTTCCCCATC 4140 

CACACGGACA CGAACATACC CGTCTTTCTG AACCTTCTCG ATAACACTCT TATGTTGGCC 4 200 

TTTTT7CTTG CGGATGACAG GAGCCAAGAT CTCCAAGCGC TCGCCTTCAG GTAACTCCAA 4 260 

AACCTTATCA ACGATTTGCT CCACAGAAGA AGCATTGATA GCTCCATGTC CGTTGATACA 4 3 20 

GTAAGGCCTC CCCACACGTG CGTAGAGGAG ACGCAGATAG TCATTGATTT CAGTCGTCGT 4 3 80 

TCCCACCGTC GAGCGAGGAT TTTTACTAGT CGTTTTCTGG TCGATGGAAA TAGCTGGGCT 44 40 

GAGACCATCA ATGGCATCTA CATCTGGTTT TTCCATATTT CCCAAGAACT GACGAGCGTA 4 500 

GGCGGACAAA CTCTCTACAT AGCGACGTTG TCCCTCCGCA TAG AG ACT AT CAAAAGCCAG 4 560 

ACTGGACTTC CCTGAACCTG ACAAGCCAGT CACGACAACC AACTTGTCTC GCGGAATCTC 4 620 

CACATCAATA TTTTTTAAAT TATGGGCACG CGCCCCATGA ATGACAATTT TATCTTGCAT 4 680 

CTTTCTTCTT TCTAGTCCAT TATTGCTTAC C ATT AT AC C A AAAAAAGTGA GATTCTATTA 4740 

CCCAAAAGGC CGATTTTGTA GTATAATAGT ACAGTGTGAA AAAATCTGAA AAATGAGAAA 4 800 

GGATAAGGGA TATGAAACAA G TT T TT CTCT CTACAACAAC TGAATTTAAA GAGATCGATA 4 8 60 

CGCTTGAACC GGGTACTTGG ATCAATCTCG TCAATCCGAC TCAAAATGAA TCACTCGAAA 4 920 
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TCGCCAACAC CTTCGATATT CATATTCCTG ACCTTCGAGC ACCGCTCGAT GCGGAAGAAA 4 980 

TGTCTCGTAT TACCATTGAA G AC GAG TAT A CCCTGATTAT CGTAGACGTG CCGGTCACGG 5040 

AGGAAAGAAA TAACCGCACC TACTACGTAA CCATCCCCCT TGGTATTATC ATCACTGAGG 5100 

AAACCATTAT CACTACGTGT TTGGAACCAC TACCTGTCCT TGATGTCTTT ATCAACCGTC 5160 

GATTGCGTAA TTTCTATACC TTCATGCGTT CACGTTTTAT CTTTCAAAT? CTTTATCGCA 52 20 

ATGCAGAGCT TTACCTAACA GCCCTTCGTT CAATCGACCG CAAGAGTGAA CAAATCGAAA 52 80 

GTCAACTGCA TCAATCAACT CGTAATGAAG AATTGATTGA GCTCATGGAA TTGGAAAAAA 5 3 40 

CTATCGTCTA TTTCAACGCC TCCCTCAAAA CAAATGAGCG CGTGATTAAG AAATTGACCA 5 J 00 

GTTCAACCAG CAATATCAAG AAATACCTTG AGGACGAAGA CCTGCTTGAA GACACCCTGA 54 60 

TTGAAACCCA ACAGGCCATC GAGATGGCAG ATATTTATGG AAACGTCTTG CATTCTATGA 5 520 

CAGAGACCTT TGCCTCTATC ATTTCTAACA ACCAGAACAA CATCATGAAA ACCTTGGCCC 5580 

TTGTGACCAT CGTCATGTCC ATCCCAACCA TGGTCTTTTC TGCCTACGGG ATGAACTTTA 5640 

AGGATAATGA AATCCCCCTA AACGGAGAGC CAAATGCCTT CTGCTTAATC GTCTTTATCG 5700 

CCTTTGCTAT GAGTGTCTCG CTCACTCTCT ATCTCATCCA TAAAAAATGG TTCTAAGAGG 5760 

AGTTCCTATG TCTCAAATTG ATCTACAAAA ATtAACTAAG AAAAACCAAG AGTTTGTCCA 5 320 

CATTGCTACC CAACAATTCA TCAAAGATGG GAAAACAGAC GCTGAAATCC AGACTATTTT 5880 

TGACGAAGTC ATTCCCCAAA TCCTTGAGGA GCAATCTAAA GGTACAACTG CCCGTTCCCT 5940 

ATACGGCGCA CCAACTCATT GGGCTCATAC CTTCACTGTC AAAGAGCAGT ACCAAAAAGA 6000 

GC ATCCAAAA GAAAATGATG ACCCAAAACT GATGATTATG GACTCACCTC TTTTCATCAC 6060 

TAGCCTCTTT GCCCTTGTCA GCGCCCTCAC AACCTTCTTT GCGGCACAGC AACCTTTCGG 6120 

CTATGGATTG ATTACTCTTC TATTAGTTGG ACTGGTTGGT GGATTTGCCT TCTACTTGAT 6180 

GTACTACTTT GTTTACCAAT ACTATGGACC AGATATGGAT CGCAGTCAAC GTCCACCTTT 6240 

CTGGAAATCT GT ACT ACT? A TCCTAGCTTC TATGTTCCTT TGGTTGCTTG TCTTCTTTGC 6300 

AACAAGCTTC CTACCAGCTA GCCTTAACCC AGTACTGGAT CCATTGCCAC TAGCTATTAT 6 3 60 

TGGAGCAGCC CTCCTAGCCC TTCGCTTCTA TCTCAAGAAA CGC7TGAATA TCCGTAGTGC 6420 

AAGTGCAGGA CCAACACGCT ATCAAGAATA AGAAAACGAT AAAAGCAACT GCAGGTGCGG 6480 

TTGCTTTTTC ACTTACTTTT TTGAGTTATA TTCAATGAAA ATCAAAGAGC AAACTAGGAA 6540 

CCTACCTGCA GGTTGCTCAA AGCACACCTT TGAGGTTGCA GATAAAACTG ACGTGGTTTG 6600 

AAGAGATTTT CGAAGAGTAT TAAAAGTATT CTTCTGAAAT CCCACATAGC TTTCTCTTAT 6 6 6C 
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ATTTTCTCAT AAAATAGGCT CAATCTATTT CTAGGAGGAT GAGATATGCT TTCTACTATT 
GGTATTGTTA GTTTATCTAG TGGCATTATC GGAGAGGATT TTGTCAAACA CGAAGTGGAC 
TTGGCTATCC AACGTCTCAA GGATCTGGGA CTCAATCCCA TCTTTTT 
(2) INFORMATION FOR SEO ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(Di TOPOLOGY: linear 



(xii SEQUENCE DESCRIPTION: SEQ ID NO : 61: 



CTGGCTAGTT 


GCATAGACCA 


AAGTTGCTTC 


TTCATCAACA 


AAACCGTTCA 


TTTCAAAATA 


60 


GGAAAGCAGC 


TCATCAGGAC 


TCTCCAAACG 


AATCCCTTTG 


TAATCCAGCT 


CAACTGCCAC 


120 


CTCTTTCAAG 


GCTGCAAGAA 


GAAGTGTTCC 


CAGGCCCTGT 


CTCTGATGGT 


CAAACTCGAT 


1B0 


GACTAAAGAA 


TGTACTTTTA 


GACATTGCGG 


ATTGTCTGAC 


TGGGGACTTG 


ATAAAATATA 


240 


GCCTAAAAGT 


TGATTTTCAT 


CCCTAGCTAG 


AAGAAAGGTA 


TCCGCACACT 


TACGGATACT 


300 


TTCTTCTAAA 


ATATGGGAAA 


GTTGCTGCTT 


TTCAGCTGGA 


AAAGACGACG 


TCTGAAGTGC 


360 


CCCTATCTCA 


GGCAAATCAG 


ACTTGCTTGC 


CTCAATGATC 


TTAATTGCAA 


TTTCCATGGG 


420 


AACATCCTAT 


TGAACATTGC 


TTGTCAAGTT 


AGACAAGAGA 


CGCTCAAATG 


AGTATTCATA 


480 


GGTTTGGATG 


TCTCCTGCTC 


CCATAAAGAC 


GTAAACAGCA 


TTGTCATGGT 


CTACGAGTCG 


540 


AGAAACATTT 


TCAACAGTAA 


TCACTTGGTG 


TTTTTTGTTG 


ATTTTGTTGG 


CTAGGTCTTC 


600 


TACCTTAACG 


TCACCATGAT 


CTACTTCACG 


AGCCGAGCCA 


TAAATTTCCG 


CTAGATAAAC 


660 


ACCATCTGCT 


TGGTTTAAAG 


CATCGGCAAA 


G7CCTCCAAC 


AACC C AATGG 


TTCTTGTAAA 


720 


GGTATGCGGT 


TGAAAGACTG 


CTACAATTTC 


CTTCCTTGGG 


TATTTCTGAC 


GAGCCGCATC 


780 


CAAGGTCGCA 


ATAATTTCTG 


TTCGATGGTG 


GGCAAAGTCA 


TCCATAATCA 


CTGTATCATT 


840 


GACAATTTTC 


TCAGTGAAAC 


GACGTTTAAC 


ACCCGCAAAT 


GTTTTCAAGT 


GCTCACGCAC 


900 


CAAGTTCAAA 


TCAAATCCTG 


CTGTGTAAAG 


AAGACCAATA 


ACCGCTGTCG 


CATTCATGAT 


960 


ATTCTGACGA 


CCAAAGGTTG 


GAATGTGGAA 


TTGCCCCAAG 


TTTTGTCCAC 


GGAAATGAAC 


1020 


GGTGAAGGTT 


GAACCAGTTA 


TTGAACGAAG 


AAGATCACTA 


GCTACAAAGT 


CATTGCCTTC 


1080 


AGCTTCAAAA 


CCATAA7AAT 


AAATTGGTGC 


ATCACACGTA 


ATCTTACGCA 


ATTCAGCATC 


1140 


TTCACCATAG 


ACAAAAAGAC 


CCTTGGTGAT 


TTGTTTGGCA 


TAGTCGTTAA 


AGGCATTAAA 


1200 


AACATCCTCG 


AGACTTGTGA 


AATAATCTGG 


ATGGTCAAAG 


TCAATGTTGG 


TGATAATAGA 


126C 
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GTATTCTGGG 


TGGTAAGGCA 


TGAAGTGACG 


CTCATATTCG 


TCAGATTCAA 


AGACAAAATA 


1320 


TTTGGCATTG 


GCCGAACCAC 


GACCTGTCCC 


ATCTCCAATC 


AAGAAGCTGG 


TATCTGTAAT 


1380 


GTGAGACAAG 


ACATGAGACA 


ACATACCTGT 


CGTTGAAGTT 


TTTCCATCTC 


CTCCTGCTAC 


1440 


TCCCATGCTA 


ACAAAGTCAC 


GCATAAAGCT 


ACCTAGAAAC 


TCATGGTAAC 


G7TTGTAGCT 


1500 


GATACCATTT 


TGGTCCGCAT 


AGGCAATTTC 


GACGTTGTTA TCTGGACGAA 


AGGCATTTCC 


1560 


AGCGATAATT 


TCCATATCAC 


CGTCTAGATT 


TTTTTCATCA 


AAAGGAAGAA 


TGGTAATTCC 


1620 


TGCCTGCTCA 


AGACCGCGTT 


GGGTAAAGTA 


GTACTTTTCA 


ACATCTGATC 


CCTGAACCTT 


1680 


GTGCCCCATC 


TGCTCCAACA 


TCAAGGCCAA 


GGC ACT CATC 


CCTGATCCCT 


TAATTCCGAT 


1740 


AAAATGATAT 


GTCTTTGACA 


TGTTTTCTCC 


CCTATTCTGT 


CATTCTGGTC 


AGATTCAACT 


1800 


CTTGGGC AAC 


CCGACGTTCT 


TGTTCTGTTT 


GTTTACTTTT 


TTTATTGTAC 


ATTTGGCTCT 


1860 


TCTTTAGAAA 


ATCATAATTG 


TTTTTCTTTG 


GAGCAGGTGC 


TGACACTTCT 


TCATTCTTGG 


1920 


TAGGGATAGA 


ATGAACTTCT 


TCCGCCAAGA 


TATAATGAGA 


CTGGGTCAAT 


TTTTGGCTAT 


1980 


ATTTCACAAA 


TTCACCAGGA 


TTTTCCTTTT 


GGAAAGGAGC 


TGTCGGTTGA 


TTGCCCTGTC 


2040 


TAACTAGACT 


GGGCTGAGAA 


TGACGTCTCG 


CAAGGCTGAA 


ATCCTGAGTT 


AGGTAGTTAG 


2100 


CAGAGCGTTT 


CTTTTTCAAG 


TCCGCACGCG 


CTTCTTCACG 


CGCCACCTCC 


GCATAGCTCT 


2160 


TTCCTTCTTT 


TTTAACCCCT 


AAAGGAGCCT 


TTTTAGGTTT 


TTCGACTTGC 


TTTTCAATCG 


2220 


CTTTTACTGG 


TTTTTCTTCA 


GCAATAGGAG 


CCCATTCTAA 


ATAATTTTTA 


TCTCGATACT 


2280 


CACCCTTGAT 


ATTACTGATC 


AGATCAGACT 


CATCA7AGAG 


ATTCATGACT 


GGCATTTCAG 


2340 


TCAACATGAC 


CTCGTCATCT 


GACACCAATG 


GAAATCGTTC 


TTGTTTCATT 


TTCTATTTCC 


2400 


TTTCAACACT 


TCATTATAGC 


GTATTGTCTT 


GATTTTTCAA 


CTGCTGCCTT 


CAGAAATTCC 


2460 


CAAAATTTCT 


CTAATTTCTG 


CTAGGGTCAG 


ACTACCACGT 


GACTCTGTCC 


CGTCCAATAC 


2520 


TTGTGACACC 


AGATG7TTCT 


TTTGTTCTTG 


GAGTTCCTGA 


ATTTTTTCTT 


CAATGGTTCC 


2580 


CTTGGTCACC 


AAGCCATAGA 


CCTCAACCGT 


TTCTTCCTGA 


CCCATCCGAT 


GGGCACGGCC 


2640 


AATGGCTTGC 


GCTTCCACCG 


CAGGATTCCA 


CCAAAGGTCA 


ACCAAGATCA 


CTGTATCTGC 


2700 


ACCTGTCAGG 


TTCAGACCGA 


CCCCACCAGC 


CTTGAGGGAA 


ATCAGAAAGG 


CATCTCTTTC 


2760 


TCCTTGGTTA 


AAGGCCTTGG 


TCATGTCTTG 


TCTTTCCTTG 


GCTGGGGTTG 


AACCCGTAAT 


2820 


TTTAAAGGAA 


GTCAGGCCCA 


AGTCTGGCAG 


TTCTTGTTCA 


ATTTTTTCCA 


ACATTCCCTT 


2880 


GAACTGAGAG 


AAAATCAAGA 


CACGGTGTCC 


GCCGTCTGCC 


ACCTGTACCA 


GTAGGTCTCG 


2940 


GAGACTATCT 


ACTTTGCCGC 


TGGCTCCCTG 


ATAATCTTCC 


ATAAACAGGG 


CAGGAGTGTC 


3000 
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ACATATTTGA CGCAAGCGCA TCAAACCAGA TAAAATTTCC ACACGACTTC GCTGAAATTC 3060 

CTGTTCTGAC ACTTGAGCCA GATGGTCTCG CATCTGTTGT AACTGGGCAA GGTAAATAGC 3120 

CTTTTCCTCG TCTTCCAGTT CATTTTTATA AACCACCTCA ATCAAGTCTG GCAATTCAGT 3i80 

CACAACTTCT TCTTTCTTGC GTCGCATCAC CAAACGCTTC ATAAACTGAG CCACTCGCTC 3 240 

TCCTGGCAAT TTCATAAATT CTTTCTTGCT TGGCAAAAGT CCAGGCATGA CGATTTGGAA 3300 

AATAGACCAC AACTCACCCA GATGGTTTTC AATCCCAC7T CCTGACAAGG CAAAGACCGA 3 360 

CGGCACCACA AATTGTCTCA AGGTCTGGGC AATCTTGGTC TCGGCATTTT TCATGACCTG 3 42 0 

AGCCTCATCT AAGAAAAGGA AGTCAAAGGC CATCCCTTGA TAAAACTCAC TGTCCTGACG 3480 

GAAGCTGGCA TAGCTAGTCA CATAGATTTG ATGGCTCTCG GCAAGAATCT CCTCACGACT 3 540 

TGCTTTCAAA CCATGAACAA CAGTCACATC CAACTCTGGA GCAAATTTCT GAAACTCATC 3 600 

TCCCCACTTC TAAATCAAAC CCGACGGAGC GAGAATCAAA ACCCGACTTT CTTTTGTCAC 3660 

TTCACTAGTC AAAAAAGCAA TGGTCTCAAG GGTTTTCCCA AGTCCCATAT CATCAGCCAA 3 720 

AATCCCACCA AAACCATAAT GATGGAGCAT CTGCAACCAC CCAATTCCCT TTTCCTGATA 3 7 80 

ATCTCGCAAG TCACCCTTCA CCTGAGTTGC TTGCAAAGGA AACTCCTCTG GATGCGTCAA 3 840 

ATCCTGGGCC AGATTCTGGA AT7CTTGTCA AAAAGAAACA CGGTCTCGCC CTTCAAAGAG 3 900 

ATGAGCTAAA CTGTACCCCA AGCATTTCCG AGCCTGCAAG GTCCCATCTT TTAATTCAAA 3 9 60 

TTGCCCCAGT 7CCTGTACAT TTTGGCGAAT TTTCTTGCTT TCTTCATCGA AAAAGTAAAC 40 20 

TTGATTAGAC GAATCAATAT AAAAATCCTG ATTGGCAACC AACCCCTCCA TGGCTTGGTC 4 080 

GATTTCCTCC TGGACAATAT TTTGAAAATC AAACTGGATT TCCAAGAGAC CTCCCTTGGA 4:40 

GGCAATCTGC ACCTGAGGAC TCGCTACGCT ATAAACCTCT TCTAGTTTAT CTCATAGCTC 4 200 

AACATCCCCG AGTTTTTCAA AGACTGGAAT GATATCATGA AAAAAATGAT AGACAGACTC 4 260 

CGCTTTTAAG GCCTGACGCC AAGATTGAAA ATCGGCCTCA AAGCCCGCAG CCAAACAGAC 4 320 

TTGGAAAATT CTTTCTTCTA ACTCTCCGTC ACTTGAAAAG GGTAATTCTT CTAGCTCTTG 4 330 

TCCGCTACAT ACCTGTCTAT TTCCATAATC AAACTGAATT TCTAAACGAA TCCGATTATC 4 4 40 

TTCTTCCCTG TCAAAGTAAA AAGAGGGCGC AAAAGTTTTG ATTTGTAGAC GTTCTGGAGC 4S00 

TCAAACGGTG CCCATCTGGA TAAAAAGACT CAGACACGAG GCCAATTTGT CTCGATCACT 4 560 

CCTATCAAAT TGCAGGTATT TCTTTCCTTG TTGACCCACA GGTAACGCTT TAATTTCCTT 4 620 

GAGAAGACGC ATCTGCTGGT CTGTTAAAAA ATAAACCTGA CCTTTATGGA AAAGTACTCC 4 680 

TCCCTGATAA AAGACATTGA CCCTAGGACT CTCACTGATT TCCATTTCAA AATAATCCGA 4 7 40 

GTATTCTGTT ACTGTAAAGG CAAATAGATT GGCATCAGCA TGCATATCCT GAAAAAGCAG 4800 
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GCTTTGGTAG CTATCCACTT GATGGTCAAA TTGAAAATGG GGCAAGGCCA TCAGTAAATT 4860 

CACACCCTGC TCAAAAAAGC TCAGAGGGAA AAAGAGGTCC CGACCTTGGT TTTGGAAAAA 4920 

GAGGTCTGGA ACCAGCCCTT CCTCCGTTAG TCCGTGCAAG AAAGTCAAAA CTTCTTGGCT 4980 

GGCATCATCA AAGGCTTCCC AAGAAAGAGA CTCCTCATAA ATCTTGCCAA TCATATACGA 5040 

CTTTCTCTGC TCGACAATCC TTAAAAAAAG TGGAATATCC CGAATGACAT AGTATT7TTG 5100 

GCTATTGATT TGGCCGATTC TCAGAGTCCA CAAGATATGA TTGCTTCCTG CTTCCACCTG 5160 

ACCCACAGCT GATAACTCAT AGGCGCATTC TGATTTTCCA CATAAAATTC GATCCAAAAA 522 0 

CTTGCCACCC AAGGTCACCT TGGTTTCAAC AGCCTCTTTT TCTTCATCAC CTTCTTCCAG 5280 

ACTCCACAAG ATTTCCTGAC CACGCTCATC ATTTTTCAGA AAATGCTCTA GCGCTGCCAA 5340 

ATCCACACAG TAGCCCCTCT TTTGAAAAAA ATCACAGGCA CAAAAAACCA AATCATCCTC 5400 

TAAACTATAG CGCAGTTCTT CTTCTGCAAC GCGAGCGTAG AGCCGATTCT TCTTTTCCTT 5460 

GATGATATCA ACCTTACCAG TTTCATAAAG GGCAACACCT TCGATACCAA TTTTCCCCGG 5520 

AATCAATTTA GCCATATTTT CACCTTTACC TTATCTTTTT ATTATACCAT ATTTTCGCCT 5 580 

ATGAAAATAG CCTTCTAGGA AGACTTTTCT CCTAGAAGCC TGGATTTTTA ACGTTTGGCA 5640 

AAAGTAGCCA CAATCCGCTG ACAGACTTCT TGCAACAGAC ATTTGGGCAT ACCTATATTC 5700 

ATGCGGGCAT GGACACTTCC TTCCTCTCCA AAATCCAAAC CACGGTTGAG CATAACCTTG 5760 

GCTTCATTTC TCAACAACTC TTGCAATGTT TCATCAGTCA GGTCATAAGC TGAAAAGTCA 5820 

AGCCAAATCA AGTAGGTACC TTGCCGTTTC ATGACCTTGA TTTTAGTCTC T7TTCCAAAT 5880 

AGATCCATCA CATAATTGAT CTGGTCTTCA AAGACTTGCT TGAGTTCCTC TACCCAATCT 5940 

TTACCGTATC GATACGCAGC TTCTGTCGCC AAATAACCCA AGCCTGAAAT 7TCA7GCTGA 6000 

TTATTGGCCA ACAGGCCTTT CTGGAAAGCC AGTCTCAACT TAGGATTTTC AATGACTGCA 6060 

TAGCAATTTT TTGTTCCAGC AATATTAAAT GTTTTAGTGG CACT.GCTCAA GACCATAGCA 5120 

AAATTTTTGA AGGCAGGATT CATCGTATTC AAAGACTGGT GTTTGTGACC AAAGACGGTC 6180 

AAATCTTGGT GAATCTCATC CGAAACTAAC AAAACACCGT GTTTTTGGCA GAGT7GGCCA 62 4 C 

ATCTTCTCCA ACACTTCTTT TTCCCAAACA CGTCCACCAG GATTGTCACC GTTGCAAAGA 6 300 

ACATAGAGTT TAACCTCCTC TTCCACCAAA TCCTTTTCAA G7TGCTCAAA GTCAATCTCA 6 3 60 

AACAGACTAT CCTTTTCCAC TAAGGAATTA GTAATCAATC TACGATTATT CAACT7GACA 6420 

C7GCGAGCAA AGGGTGGGTA GACAGGCGTG TTAATTAAAA CCGCCTCCCC TTCTTTTGTA 6480 

AAGGTTTGAA TAGCTGTTGA GATGGCTGGT ACCACACCCT CGATAAAGAC AAGAGCCTC7 654 0 
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TTGTCAAACT 


TGTAACCGTA 


TTGTGTACCT 


TCCCACTTTT 


GAACTTCCTT 


AATTAAGTCT 


6600 


TCACTGGCAT 


ACGTATAACC 


ATAAACCAGT 


TCGTCTGCCT 


AAGTTTCCAC 


GGCTTGGCGG 


6660 


ATTTCACGCA 


AG AC C AC AAA 


GTCCATATCC 


GCTATCCAAG 


CTGGTAGAAC 


TTCACTATCC 


6720 


GTTTCTGTTT 


CTTTCCATTT 


ATAGGTATGG 


TCCCCTAAAC 


GGTTGGGCAG 


GCTTGTAAAA 


6780 


TCATATTTTC 


CCATCTTTGT 


CTTATCCTTC 


TATGGCTTGG 


CGCAAATCTG 


CAATCAAATC 


6840 


TCTAGCATCC 


TCAATCCCAA 


TAGACAAACG 


CAAGAGGTCA 


TCTGTCAAAC 


CATAAGAATG 


6900 


CCCTACCTCT 


GCTGGAATAT 


CAGCATGAGT 


TTGAGTCGTT 


GGATAAGTAA 


TAAGACTTTC 


6960 


CACTCCACCC 


AAACTTTCCG 


CAAAAGAGAA 


GACCTTGAGA 


CTGTTCAAAA 


TATGAGGAAT 


7020 


GCGTGTTTCA 


TCGGCTACTT 


TAAAGGAAAT 


CATGCCTCCA 


CGACCACTGT 


AGAGAACTTC 


7080 


C TT AACTGCT 


GGAGAATCCT 


TCAAAAAGGC 


AACCACTTCT 


TGGGCGTTAG 


CTGTTGAGCG 


7140 


CTCCATACGA 


AGAGACAAGG 


TCTTGAGACC 


ACGAACCAAC 


TGGTACCTCT 


CAAATGGAGA 


7200 


CAAGACTGCC 


CCTGTTGTAT 


TAAGATTGTA 


AAAAAGCTTC 


TCGTATAGTT 


CTAAACTATT 


7260 


GGTCACAACC 


ACTCCAGCCA 


AGACATCATT 


GTCCCC7CCT 


AGATACTTGG 


TTGCTGAATG 


7320 


GACAACGATA 


TCTGCTCCAT 


CTTCAATCGG 


ACGTTGGTAG 


ATAGGGCTAT 


AGAAGGTATT 


7380 


CTCCACCACC 


ACTTTGGCAC 


CCTTAGCATG 


ACCCAATTTT 


GCTAGTTTTT 


CGATATCAAA 


7440 


TTCCAACATC 


AAGGGATTGG 


TTGGGGTTTC 


GATATAGAGA 


ACATCCACAT 


CCTTTTCTAA 


7500 


CTCGGCAATC 


AACTCTTCTT 


CTGTATTCGC 


ATAGGTAAAA 


TGGAAATGAC 


CTTCCTGCTC 


7560 


CACTTGGTTA 


AACCAGCGAA 


AAGAACCACC 


GTAAAGATCA 


CCCACTGCCA 


AGACCTTACT 


7620 


TCCTACTGGA 


AAGACGCTAA 


AGGCCAGTAC 


AATAGCTGAC 


ATCCCTCACC 


TACTCCCTAC 


7630 


GCCATACTCT 


GCTGACTCAA 


TAGCCGCCAA 


GACTTCCTCA 


CCCTTACTAC 


GAGTTCGATT 


7740 


TTTAGTCCGC 


CTA7AG7CAA 


ACCCAGTAGA 


TCCACCAAAC 


7CTCGA7GC7 


GA7AGGTCGT 


7800 


TGAAAAATGA 


AGTGGTGTCA 


CCAAAGCACC 


TGTTGCCTCA 


TCACACTTGA 


TCCCTGCTTG 


7360 


TGCTAAAATT 


GTGTTAATGT 


GTAATTCCTT 


GCTCATACAA 


TTCCTCCAAA 


TCTATAGTAA 


7920 


CTATTGTACC 


ACTTATTTTG 


TATCCTTCGT 


TTTCTTGTTT 


TCAAGAGCTA 


GTTATAGTTT 


7980 


CAAACTATAT 


AAAAAGGGAG 


TTTTTCCTGC 


TCCCTTTAAT 


AG ACT AT AAA 


ATGGTGAATC 


3040 


TCAAAAGACA 


CCTTCACTCT 


ATCATTTGCT 


CCTGCACAAA 


ACGAGCATAA 


CGCTCATGA7 


8100 


TTTCCAGTAG 


TTCCTTATGA 


GTTCCTGAGC 


CAGTGATTTT 


CCCCTCCTCT 


AAGAAGAAAA 


8160 


TACAATCCAC 


ATCTTTTACC 


GTTGACAAAC 


GATGCGCTAT 


AA7CACAACC 


GTCTTCTCCT 


3220 


TT ACT AC AG A 


ATAGAGGCTA 


CTGATAATCG 


CATACTCAGA 


ATCCGCATCA 


ACATTAGCAG 


3280 


TGGCTTCATC 


AAATATAAGA 


ATTTCAGCAT 


CTTTTAAGTA 


CGCTCTACCT 


ATTTCAAGTC 


3340 
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T7TCCT7CGC 


CCCCCTGACA 


AGAGTCGTCC 


CCCT7CACCA 


AC 7 TC ACT AT 


CTAGTCCCTC 


3400 


TTTCATGGAG 


CGAATCTCAT 


CACCTAGTGA 


TACTAAG7CT 


AGCACTT7CA 


TCAATTCA7C 


3460 


A7CAG7TACT 


AAGCGATTCA 


AACCGAGACA AAGATTGTCA CGAATACTGC 


CAGA7AAGAC 


9520 


TGCATTATTT 


TG7GAAACCC 


AAGCGATTTT 


ACTTCTCCAT 


7CTTTTAAGT 


TAAAATCATA 


3580 


TATACTTGAT 


7CCTCCAT7A 


GAATATC7CC 


TGAAAGCGGT 


T7ATAAAACC 


GCTCTAACAA 


3640 


ACGCACAATC 


GTTGATTTTC 


CTGATCCAGA 


TGGTCCAACA 


AAAGCAATT7 


TTTGCCCC77 ~ 


"8700 


GAAAATTGAA 


CAAGTAA7AT 


CCTTTAAGAC 


AGGTCGATTT 


TCATCATAAC 


CAAAATAGAC 


8760 


ATCGTTAAAA 


TTCAACCCTC 


GTCCTGATAC 


CGATT7TCCT 


CCCTCAAATT 


TTTC7TTAGG 


9820 


AACTGCAAGC 


AAG7TCTCCA 


GTGCAACTGA 


AGATCCCTTG 


CTCCTAGAAT 


AAACAGTTAC 


8380 


AAAATTACC7 


ATATTACTAA 


TAGGATTAAG 


TAATTGAAAG 


AGGTAAA7CA 


AAAACGAAAC 


8940 


CAAGCTTCCC 


ACAGATATAT 


ATCCTGCCCT 


GACCCCATAA 


CCCCCATAGG 


TTAGCATCAC 


9000 


AGCTATAGTC 


GCAAAGATAA 


ATAAGAGAGC 


AAACCGGGTC 


TCAAAAGAAG 


TAACCCTATC 


9060 


TCATT7CAGT 


GAATTGTTTT 


GTACCCTTTC 


AATACAATTA 


TCCAAAACAT 


CCTG7ACACT 


9120 


TTTCTCTGCT 


TGCTTACTCT 


TAATTAATTC 


ATCTTCT7GA 


ATC7TTTCAG 


TCAATTGCCC 


9130 


TGTTAAATTT 


CCTCCTGTAA 


ACGACGACTA 


TACTTTTCAC 


TGATATTGGA 


AAGGGGCAAG 


9240 


ATAATAAACA 


TCATACAAGG 


AACAGTGATG 


AATAAAAGTA 


GAGAAAGATT 


CCAATCAACA 


9300 


CTAAATAAGA 


CTACAATGGA 


ACCAAGTACC 


ATAACTAAAC 


TCAGAATAAT 


ATT7GGGAAA 


9360 


GTCGTAATTA 


AAAACTCACG 


AATGACACTC 


GTGTCATTGA 


CAATGGCAGA 


AGTCAACTCC 


9420 


CCACTTTGGC 


TCTTA7CAAA 


GAAGGATTTC 


TCTACATAAA 


7CAACCCC7C 


TATCACTTTT 


9480 


77CC7GA777 


r * ,rT% GC m \' T """'7'** 


T77TTCACCC 


GATTGAC7AA 


ACAGATACTA 


ACCAATAGAA 


9540 


GAAAACAAGG 


CTTGACCAAT 


AAAAATCAAA 


AACGATTGAA 


ATACTTTGGA 


GCCTATATT7 


9600 


TCAATAGAAC 




TAAATCCTTT 


AAGATAAGGG 


GAAGCAACAA 


AGCAAG7ACA 


9660 


CTAGACAGAA 


CAAGTAAGAA 


ACTCCCCATA 


ATCACCT7AG 


TATCTACTC7 


TAATAATTTT 


9720 


AATTTCATAA 


ATACTCCTTA 


TAATATTTCA 


ACGGATAAAG 


TCGGGAATAA 


CTCAATTTGA 


9780 


GGATAAAATC 


TAATAAATCT 


TCCTATAACA 


AAACGCATAA 


CATCTAGGA7 


TTTATATACC 


984C 


TGATATTATG 


CGTTTTTAAG 


CACAAAGAC7 


TCTTACACAA 


ACTTATCTAC 


AATTAGA77T 


9900 


TAT77GACAT 


GTTTTGCCAA 


TTCTTCTTGG 


GCTTTT7TAT 


TGGATTCTTC 


T7TTTCTTTC 


9960 


AACCA77TTT 


1 CTCTGGCTTT 


TGCATATTCG 


TCTGT7G7GA 


, CAATCTTATC 


TTG7ACT7TG 


10C20 


AGGTAT77AT 


' ATGATTCAAC 


' CCCTTTTGTA 


CCGGTTAAAC 


: CATAGGCAGC 


AGCAAATGGT 


10080 
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ACGGTTCTTC TCAA7GA7GG TGTTCCCCCA 
7CAATCAACC AAGC7TGAAT A7CACCA7AT 
7TATTAGCTT CTTCCAACAT TTCAGTA7AC 
TTGGCCTCAC CAGGCTCTAG TCCAAGATTT 
ATATCGAGAT AGGTTGACGG GTCTTGATAA 
TAATCTTTCT GAGCAGCTGT TTGAGCAAAG 
AATTGCTGAA 7GTCAA7CAC TACATTATCA 
ATAGAACTAA CTCCTTGTAT GCCTACTTTA 
ATTGGGAAT? GAACACCCTT TGCTTCGAGT 
GCTTTCTCAG GATTGTAGTA AGGGTCTTGA 
TTACCATAGT TGACCA7CT7 AGAGGCTACA 
ACAAACTTTC GAGGAACCAC 7AGGTTACGC 
CACTGAGCCC CATAAGATGT TCTGTCAAAA 
TTGACAACTG CTTCCTGAGT CCATTTCTTT 
TAAGACTTCC TATCTAGGTT AAAATTAAAG 
ATGATATTGT TTTTGTATTT TTCTTTAATC 
CGAGCCGTAG TATAAGCACC AGCTGTAAAA 
TCATAGTAGG TCAATTTCAC ATCGTCTACA 
7TTTTCTTAT ATTCAATAGC AGATTTTGAG 
TACAAAATAC TAGATGGATC CGCCTTCCCA 
CCA77AACAG GAAAAAG7AT CC7TGCAAG7 
ACCAAAGTAT ATTGAACCGT TTGGTCATCA 
CTTGTTTTAC CAC7GA7ATA GTCATCCAAA 
GC7TC7GA77 7TTTA7CAGC TGCA7AT7GC 
GGCGCA7AT7 C7TC7CCC7C AGAAGTAAAC 
TAGGTCAAAC CGTCCTGAGA AACAGTCCAA 
TAT7GG7CAT TTTC7AATAA CCCG7CTACC 
CGG77TTCTG CTAGATAGTT CAAGCTAGA7 
TTTGACGCCG 7GCTAGAATT TCCACACGCC 
CCTGCCAAGG T7ACATAT7T GCTCT7AGAC 
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CGCGAAACAC TTGGAAGAAC TAAAGAACTA L0140 

TTCTCA7AAC GT77GGCCGG ATCT7GCTCT 10200 

ACATCCAGTC CAACTGCCT7 AGCCT7GTCA 10260 

TGCAGAAATC C7CCAC7ATT AGTATTAAAA 10320 

TCAGCTCCCC AACCGCCATG A7ATAAATCA 103 80 

7AGCCTGAAC 7G7CAAAC7C A7C7GATG77 10440 

GAACCTAAAA CAGAT7CAA7 TGA7TGTT7G IOSCO 

7CTC77AC7T CCACAG7C77 A7CCAAGTGG 10560 

7CTT7CT7AG C77CCGCAAA C7TAGCCTTG 10620 

CCATCCGCAA AC77GA7ACC 77CCCA7TCC 10680 

ACTTCACCAA AC7CTTT7CC CTTGA7ACTG 10 7 40 

AAAATCTTTC TTGCACC77C TTTCCCTTCA 10800 

GCAAAATTGA TAGCCTGACG GAAG7TTT7A 10860 

TCAA7GTCAC TTC77TTAGA ACTATAATTC 10920 

AAATATGAAG TTGAAT7TTG CATACTATAG 10980 

CCT7CATACC TGGACCTGTT AGGAAAAAGA 11040 

TTACCTTCCA G7GA7TC7TG G7CGC7ACCA 11 100 

AAGACATTCT TAGCA7CCCA CTAATTACCC 11150 

ACAAG7GCT7 TCATCAAGAA ACC7CCATTG 11220 

AAA7CATCCC C7TTTGATTT CAGGAAATCT 11280 

G7T7T7CAAT 7CCAG7AAAC 77C7GC7TTA 113-10 

AGTGCCTTGA CACCGACAG7 TGAAAAGTCG L1400 

CCAGCAACAC AG7CC7GCAC TAGA7ACAAC 11460 

AAACCTCTCA CAAAATCCTG GGCAG7TACA 11520 

CAC7TGGCA7 CC77ACGAAG T7TGTAGC7A 11580 

TCC7CTGCTA ATGATGGAA7 AATAT7CCCA 11640 

AAATT7GCAA CAATATCGCA TGTTGC7GCG 11700 

GGATCACTTC AA7AAACA7A C7TGTACCTT 11760 

C7CAATAAAA CTCCTGTACC CACGACAACA 1182C 

77TTTCA77T CCGG 11364 
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(2) INFORMATION FOR SEQ ID NO: 62: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

TAACTCCACT AAACATAATA TAAGGAGAGA AAATCTCTGC AATAGAACGT ATTACAAAAG 60 

CTGCTCACTT AATTCATATG AACGATATTA TCCGTGAAGG GAATCCTACT CTACGCGCGA 120 

TTGCTGAGGA AGTCACTTTC CCCCTATCTG ACCAGGAAAT CATCCTAGGC GAAAAGATGA 180 

TCCAATTCCT TAAACATTCC CAAGATCCTG TCATGGCTGA AAAAATGGGA CTCCGCGGTG 240 

GTCTTGGACT GGCTGCTCCC CAGTTAGATA TCTCAAAACG CATTATCGCT GTTTTGGTAC 3 00 

CTAATATTGT TGAAGAAGGC GAAACTCCAC AGGAAGCCTA CGATTTGGAA GCCATTATGT 3 60 

ACAATCCAAA AATCGTCTCT CACTCTGTTC AAGATGCTGC TCTTGGCGAA GGAGAAGGTT 4 20 

GCCTGTCTGT TGACCGTAAC GTGCCTGGCT ATGTTGTTCG CCATGCCCGC GTTACTGTTG 4 80 

ACTACTTTGA CAAAGATGGA GAAAAACACC GTATCAAACT CAAAGCCTAC AACTCCATTG 540 

TTCTTCAGCA TGAAATTGAC CACATTAACG GTATCATGTT TTACGATCGC ATCAATGAAA 600 

AAGACCCATT TGCAGTTAAA GATGGTTTAC TCATTCTTGA ATAAACAAAA TCCCCTTGCA 660 

AGACGGGGTT T7CTGTTA7A ATAGAGGCAT GAAAACAAAT GATATTCTCT ATGGTGTCCA 720 

CGCCGTTACC GAACCCCTCC TTGCAAATAC AGGAAACAAA CTCTACCTCC AAGAAGATCT 7 30 

CCGAGGTAAG AATGTTGAGA AAGTCAAGGA ACTAGCTACA GAAAAGAACG TGTCCATTTC 34 C 

TTGGACATCA AAAAAATCTC TCTCTGAGAT TACTGAAGGT CCTCTTCATC AAGGTTTTGT .900 

TCTACGAGTG TCTGAATTTG CCTATAGCGA GCTAGATTAC ATCCTTGCAA AAACACGCCA 960 

AGAAGAAAAT CCACTTCTAT TGATTCTAGA TGGTCTAACC GATCCCCATA ATCTGGGTTC 1020 

TATCTTGCGA ACAGCCGATG CGACCAATGT TTCAGGTCTC ATCATTCCCA AGCACCGTAC 1080 

TGTCGGAGTA ACTCCTGTCG TTGCCAAAAC AGCCACACCT CCTATTGAAC ACGTtCCAAT 1140 

TGCCCGAGTG ACCAACCTCA GTCAAACCTT AGGA7AAACT TAAGGATGAA GGTTTCTGGA 12 CO 

CCTTTGGAAC GGATATGAAC GGTACTCCTT GCCACAAGTG GAATACAAAA GGGAAAATCG 12 60 

CCCTCATCAT TGGAAATGAA GG AAAAGGT A TCTCTACCAA CATCAAAAAA CAGCTCCATC 13 20 

AAATGATTAC CATTCCGATG AATGGACATG TTCAAAGCCT TAATGCCAGT GTTGCTGCGG 13 80 
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CCATTCTCAT GTACGAAGTT TTCCCAAATA GACTATAAAA AAGTTTCCAG TCATCTGATT 144 0 

GCAAACTTTT TTATGATTAA CTATGTTCTG TAATGAATTT ATAGGCTTCT TGACCAGCGA 1500 

TACCTCCATC TCCAACCGCT GTTGTTACTT GGCGAAGGTC TTTCAAGCGA ACATCTCCAA 1560 

CTGCAAAGAT ACCGTCGACT GCACTTTTCA TGTGGTTATC TGTCACAATC CATCCTGCCT 162 0 

GATCTTGGAT ATTCAATTCT TTAACAAAAT CGCTAAGAGG GTCCAAACCA ACATAGATAA 1680 

AGACACCACC GAAGGCTTGT TCTCTCACTT GACCTGTTTT CACATTTTCA AATACGACTG 1740 

ATTCTACTCG GTTTTCACCC TTGATTTCCC TTACTACAGA ATCCCAGATA AAGCTGATTT 1800 

TTTCATTCGC AAAGGCGCGA TCTTCTAAAA CCTTTTGGGC ACGAAGTTGG TCACGACGGT 1860 

GAACAATGGT AACACTCTTA GCAAAACGAG TCAAGAAGAG GCCTTCTTCA ACAGCTGAAT L920 

CTCCACCACC AACTACCAAT AAATCTTGGT CACGGAAGAA AGCACCATCA CACACACCAC 1980 

AGTAAGAAAC ACCACGACTG TTCAGTTCTT CTTCTCCAGG CACTCCCAAA GGACGGTGTT 2040 

TAGAACCAGT TGCTACGATA ACTGTACGTG TTTCATATGT TTGGTCATCA GTCATCACTT 2100 

TCTTAAAATC ACCATCGCTT CGACATTTTC AACATAACCA TAAATGTGCT CAACACCAAG 2160 

ATTTTCAAGT GGTTCAAACA TCTTTTCAGC CAATTCAGCT CCACTAATA7 7AGCGTATCC 2220 

TGGGTAATTT TCGATATCAG ATGTATTATT CATCTCACCA CCTCCCACAC CACCTTCAAT 2280 

CAAAGCTACT TTTAGATTGC TTCGAGCAGC ATACAAGGCC GCAGTCATCC cTGCAGGTCC 2340 

AGCACCGATA ATAATACTAT CGTACATATA GATTCCTTCT TTCTTGGTGT AACTATCTTT 2 400 

ATTCTAACTC TG 2412 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS : 

(Ai LENGTH: 7760 base pairs 
( B) TYPE : nucleic acid 
tC) STRAND EDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

CCGATTTGGT GGAATTTTTG TCTCATCATT TAGAAGGTGT TGCAAGAGCA GAGTTTACCT 60 

TGGTGCTTCA TACCAAATTG GGAGAAGCCT CTGTTTTGGC AAATATTGTA GATGTAAACA 120 

AGGATGAATG GATTTTAGGA ACAGTTGCTG GTCCCAATAC CTTATTGGTT ATTTGTCGAG 180 

ATCAGCACGT TGCCAAACTC ATGGAAGATC GTTTGCTAGA TTTGATGAAA CATAACTAAC 240 

GTCTTGGGAG TTGCTCTCAA GACTTATTTT TGAAAACGAG AGACAGAAAA TGGCGATAGA 300 

AAAGTTATCA CCCGGCATGC AACAGTATGT GCATA77AAA AAGCAATATC CAGATGCTTT 3 60 
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T7TGCTCTTT CGGATGGGTC ATTTTTATGA ATTATTTTAT GAGGATGCGG TCAATGCTGC 42 0 

GCACATTCTG GAAATTTCCT TAACGAGTCG CAACAAGAAT GCCGACAATC CCATCCCTAT 4 80 

GGCGCGTGTT CCCTATCAT7 CTCCCCAACA STATATCCAT GTCTTGATTG AGCAGGGTTA 540 

TAAGGTGGCT ATCGCAGAGC AGATGGAAGA TCCTAAACAA GCAGTTGGGG TTGTTAAACG 600 

AGAGGTTGTT CAGGTCATTA CGCCAGGGAC AGTGGTCGAT AGCAGTAAGC CGGACAGTCA 660 

GAATAATTTT TTCCTTTCCA TAGACCCCCA AGGCAATCAA TTTGCCCTAG CTTATATGGA " 72 0 

TTTGGTGACG GGTGACTTTT ATGTGACAGG TCTTTTGGAT TTCACGCTGG TTTGTGGGGA 7 30 

AATCCGTAAC CTCAAGGCTC GAGAAGTGGT GTTGCCTTAT GACTTGTCTC AGGAAGAAGA 84 0 

ACAAATCCTC AGCCGCCAGA TGAATCTGGT ACTCTCTTAT GAAAAAGAAA GCTTTGAAGA 900 

CCTTCATTTA TTGGATTTGC GATTGGCAAC GGTGGAGCAA ACCCCATCTA GTAAGCTGCT 960 

CCAGTATGTT CATCGGACTC AGATGAGGGA ATTGAACCAC CTCAAACCTG TTATCCGCTA 1020 

CGAAATTAAG GATTTCTTGC AGA7GCATTA TGCGACCAAG GCTAGTCTGG ATTTGCTTGA 1080 

GAATGCTCGC TCAGGTAAGA AACAAGGCAG TCTTTTCTGG CTTTTGGATG AAACCAAAAC 1140 

CGCTATGGGG ATGCGTCTCT TCCGTTCTTG GATTCATCCC CCCTTGATTG ATAAGGAACG 1200 

AATCGTCCAA CGTCAAGAAG TAGTGCAGGT CTTTCTCGAC CATTTCTTTG AGCGTAGTGA 1260 ' 

CTTGACAGAC AGTCTCAAGG GTGTTTATGA CATTGAGCGC TTGGCTAGTC GTGTTTCTTT 1320 

TGGCAAAACC AATCCAAAGG ATCTCTTGCA GTTGGCGACT ACCTTGTCTA GTGTGCCACG 1380- 

GA7TCGTGCG ATTT7AGAAC GCATGGAGCA ACCTACTCTA GCCTATCTCA TCGCACAACT 1440 

GGATGCAATC CCTGAGTTGG AGAGTTTGAT TAGCGCAGCG ATTGCTCCTG AACC7CCTCA 1500 

TCTGATTACA GATGGGGGAA TTATCCGGAC 7GGA7T7GA7 GAGACTTTAG ACAAG7ATCC 1560 

77GCCTTC7C AGAGAAGGGA C7AGCTGGA7 TGCTGAGATT GAGGCTAAGG AGCGAGAAAA 1620 

CTCTGGTATC AGCACGCTCA AGATTGACTA CAATAAAAAG GATGGCTACT ATTTTCATGT 1680 

GACCAATTCG CAACTAGGAA ATGTGCCAGC TCACTTTTTC CGCAAGGCGA CGCTGAAAAA 174 0 

CTCAGAACGC TTTGGAACCG AAGAATTAGC CCGTATCGAG GGAGATATGC TTGAGCCGCC 1800 

7GACAAG7CA GCCAACCTCG AATACGAAAT ATTTATGCGC ATTCGTGAAG AGGTCGCCAA 1860 

GTACATCCAG CGTTTACAAG CTCTAGCCCA AGGAATTGCG ACGGTTGATG TCTTACAGAG 192 0 

TCTGGCGGrr GTGGCTGAAA CCCAGCATTT CA77CGACC7 GAGTTTGGTG ACGATTCACA 19 80 

AATTGATATC CGGAAAGGGC GCCATCC7GT CGTTGAAAAG G77ATGGGGG CTCAGACCTA 2040 

7A7TCCAAAT ACGATTCAGA TCGCAGAAGA TACCAGTATT CAACTGCTTA CAGGGCCAAA 2100 
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CATGACTCGG 


AAGTCTACCT 


ATATGCGTCA 


GTTAGCCATG 


ACGGCGGTTA 


TGCCCCAGCT 


2150 


CGCTTCCTAT 


GTTCCTGCTG 


AAAGCCCCCA 


TTTACCCATT 


TTTGATGCGA 


TTTTTACCCG 


2220 


TATCGGACCA 


GCAGATGACT 


TGCTTTCCGC 


TCAGTCAACC 


TTTATGGTGG 


AGATGATGGA 


2280 


GGCCAATAAT 


GCCATTTCGC 


ATGCGACCAA 


GAACTCTCTC 


ATTCTCTTTG 


ATGAATTGGG 


2340 


ACGTGGAACT 


GCAACTTATG 


ACGGGATGGC 


TCTTGCTCAG 


TCCATCATCG 


AATATATCCA 


2400 


TGAGCACATC 


GGAGCTAAGA 


CCCTCTTTGC 


GACCCACTAC 


CATGAGTTGA 


CTAGTCTGGA 


2460 


CTCTACTTTA 


CAACACTTCG 


TCAATGTCCA 


CGTGGCAACT 


TTGGAGCAGG 


ATGGGCAGGT 


2520 


CACCTTCCTT 


CACAAGATTG 


AACCGGGACC 


AGCTGATAAA 


TCCACGGTAT 


CCATGTTGCC 


2580 


AAGATTGCTG 


GCTTGCCACC 


AGACCTTTTA 


GCAAGGGCGG 


ATAAGATTTT 


GACTCAGCTA 


2640 


GAGAATCAAG 


GAACAGAGAG 


TCCTCCTCCC 


ATGAGACAAA 


CTAGTGCTGT 


CACTGAACAG 


2700 


ATTTCACTCT 


TTGATAGGGC 


AGAAGAGCAT 


CCTATCCTAG 


CAGAATTAGC 


TAAACTGGAT 


2760 


GTGTATAATA 


TGACACCTAT 


GCAGGTTATG 


AATGTCTTAG 


TAGAGTTAAA 


ACAGAAACTA 


2820 


TAAAACCAAG 


ACTCACTAGT 


TAATCTACCT 


GTATCAAGGA 


GACTTCTTTG 


ACAATTCTCC 


2880 


ACTTTTTTGC 


TAGAATAACA 


TCACACAAAC 


AGAATGAAAA 


GGACCTCACG 


CATTGTCCCT 


2940 


CCCTTTTGTC 


TATTTTTTAA 


GGAGAAAGTA 


TCCTCATTCA 


GAAAATAAAA 


ACCTACAAGT 


3000 


GGCAGCCCC? 


GGCTTCGCTC 


CTCATCACAC 


GCTTGATGGT 


TGCTAGTTCA 


CTTCTGCAAC 


3060 


CGCGTTATCT 


GCAGGAAGTC 


TTAGGCGCCC 


TCCTTACTGG 


GAAATATGAA 


GCTATTTATA 


3:20 


GTATCCGGGC 


TTCCTTCATT 


GCTCTGGCCG 


TAGTCGGTCT 


AGTTGCTCGT 


GGACTCAATG 


3180 


TTCTCCTCGC 


AGCCTATATT 


GCCCAAGGAG 


TTTCATCCGA 


CCTTCGGGAG 


GATGCCTTCC 


3240 


GTAAAATTCA 


AACCTTTTCT 


TATGCTGATA 


TTGAACAATT 


TAATGCGGGA 


AATCTACTCG 


3 300 


TTCCAATGAC 


AAATGATATC 


AAC C AG ATT C 


AGAACGTTC7 


CATGATCACC 


TTCTAAATTC 


3360 


TTTTCAGACT 


TCCCCTCTTC 


TTCATCGGTT 


CGTTTATCCT 


AGCGGTTCAA 


ACCTTACCTT 


3420 


CTCTCTGGTG 


GGTGATTGTT 


CTCATGGTAG 


TCTTGATTTT 


TGGTTTGACT 


GCTGTCATGA 


3480 


TGGGAATGAT 


GGGCCCTCGT 


TTTGCCAAGT 


TTCAAACCCT 


TCTTGAGCGC 


ATCAATGCCA 


3540 


TTGCCAAGGA 


AAATTTACGT 


GCCGTTCGTG 


TGGTCAAGTC 


CTTTGTCCAA 


GAAAAAGAGC 


3600 


AATTTGCTAA 


CTTTACAGAG 


GTCTCAGACG 


AGCTTCTTGG 


TCAAAACCTT 


TACATTGGTT 


3660 


ATGCCTTTTC 


AGTAGTGGAA 


CCCTTTATGA 


TGTTGGTTGG 


TTACGGGGCC 


GTCTTCCTCT 


3720 


CTATTTGGCT 


GGTCGCGGGA 


ATGGTTCAGT 


CGGATCCCTC 


TGTTGTTGGT 


TCCATCGCTT 


373C 


CTTTTGTTAA 


TTACCTAAGC 


CAGATTATCT 


TTACCATTGT 


TATGGTTGGA 


TTTTTGGGAA 


3340 


ATTCTGTCAG 


CCGTGCCATG 


ATTTCCATCC 


GTCGTATTCG 


AGAAATTCTT 


GACCCAGAGC 


3900 
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CAGCTATGAC CTTCAAGGAT ATCCCAGATG AAGAGTTGCT TGCAAGTCTT AGCTTTGAAA 3 960 

ATGTGACCTT TACCTATCCA ATGGACAAGG AACCGATCCT GAAAGATCTC AGCTTTACTA 4 020 

TTGAACCTGG TCAAATGGTT CGTCTACTTG GAGCGACTGG TCCACCAAAC TCAACCTTGG 4080 

CTCAATTGAT TCCACG7CTC TTTGATCCAC AGGACGGGCC CATTAAAATC GGTGCCAAGG 4140 

ATATTCGAGA AGTGAGTGAA GGAACCCTGC GTAAAACAGT TTCCATCCTT CTCCAACGTG 4 200 

CCATTCTTTT TAGTGGAACG ATTGCAGATA ACTTCAGACA GGGGAAGGGG AATGCTACTC 4 260 

TATTTCAAAT GGACCGCGCA GCCAATATTG CCCAGCCTAG TGAATTCATT CATCGTATCG 4 32 0 

AGAAAACCTT TGAAAGTCCA GTTCAACAAC GGGGAACCAA TTTCTCTGGT GGACAAAAAC 4 380 

AAAGGATGTC GATTGCGCGT GGGATTGTCA GCAATCCACG TATTCTGATT TTTGATGATT 4440 

CGACCTCAGC CTTGCATGCC AAATCAGAGC CCTTGGTGCA ACAACCTTTC AATAAGGACT 4 500 

TGAAGGGGAC GACAACCATT ATTATTGCTC AAAAAATTAG CTCGGTTCTC CATGCAGACA 4 560 

AGATCTTGGT TCTAAATCAA GGACGATTGA TTGGTCAAGG TACGCATGCA GACTTGGTTG 462 0 

CCAACAATCC CGTTTACCGT GAAATCTATG AAACACAGAA ATGAAAGACA AACTATAAGA 4 680 

AAAGTCAATA CTTTTATCTA AACTATTTCT TATTTCAATT TGATGATTTG GCGATGATTT 4 740 

TAGAGCACCG CAAAAAGCCC TTGAAAAAGT CCATTTTTTC AAAGGTAATC CTGTGTTAAT 4 800 

TTCAGAAATT ACATCACTTT TTGTTCGTCA AATGGCAGCT CTTTTTTTAG GATATAAAAC 4 860 

ACCCTTCGGA TAACTTTTTT TCCAAGGTGG ATGATGGCTA CATTCTAATG TTTTC CTTG T 1920 

TCTAATTTAG TCTTAACATA GGCCTTAAAA GCAGGCCAAA AGCGAGGGCA TCCTTTGCCA 4 980 

CCTTGTATGA G7ACCTACCC CAGATGAGGG GAACTCCGTT TGACCATTCT TCCTCCTAAA 504 0 

TCAA7CTGAT CTGACTGATA AATAGAAGAA 7CCACTCCAC CGAAAGCTTG 7AAT7GACCA 5100 

GGATTATCAA AGGCATGAAT ATTTCGAATC TCAGCTAAAA TGACCGCCCC 7AAACCA7CC 5160 

CCAATCCCAG TAACCGTCGT GATGACCGAG TTGAACTCAG CCATCAAGTC ATTGACACAT 52 20 

GTTTCCGCCT TGTCAATGAG CCTCTTGTAA TGTTTGATGT TTTCATTACA CGAGATAAAA 52 80 

CGTCTATGCG TTATCAAACT CATTACCAAT TAAAACAAAA AGCTGTGGTT AGATCCT77C 5340 

GGAAATTGTC AAGCGATTGG AGGAAATGAA CTAATCCACA GCGGCTTATT CCAAC7A7AC 5400 

CACTTGGGCT TTGCCAGTAG CTAACTGCGC TAAA7ATAAT ATAAGGAGGA GTAAAATGAA 5 4 60 

CACAGTTCAA TTTTTTTGGC ATTATTTTAA CGTCTACAAG TTCTCATTTC TAGTTGTCAT 55 20 

CCTGATCATT GTTCTGGCGA CTTTTGCCCA AGCCCTCTTT CCAGTCTTTT CTGGACAAGC 5 5 30 

GGTGACGCAG CTAGCCAATT TAGTTCAAGC TTATCAAAAT GGCAATCCAG AACTTGTATG 564 0 
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CCAAACCCTA 


TCAGGAATCA 


TGGTCAATCT 


TGGCCTGCTG 


GTTTTGGTTC 


TATTTATCTC 


5700 


TACTCTAATA 


TACATGTGTC 


TCATGACGCG 


CGTGATTGCA 


GAATCGACCA 


ACCAGATGCG 


5760 


CAAAGGCCTC 


TTTGGTAAGC 


TTGCTCAGTT 


GACGGTTTCT 


TTCTTTGACC 


GTCGACAAGA 


5820 


TGGCGATATC 


CTGTCTCATT 


TTACCAGTGA 


TTTCGATAAT 


ATCCTCCAAG 


CCTTTAACGA 


5880 


AAGCTTGATT 


CAGGTCATGA 


GCAATATTGT 


TTTATACATT 


GGTCTGATTC 


TTGTCATGTT 


5940 


TTCGAGAAAT 


GTCACGCTGG 


CTCTCATCAC 


CATTCCCAGC 


ACCCCATTGG 


CTTTCCTTAT 


6000 


GCTGATTTTC 


ATCGTGAAAA 


TGGCACGCAA 


ATACACCAAC 


CTCCAGCAGA 


AAGAGGTAGG 


6060 


GAAGCTCAAC 


GCCTATATGG 


ATGAGAGCAT 


CTCAGGCCAA 


AAAGCCGTGA 


TTCTCCAAGG 


6120 


AATTCAAGAG 


GATATGATGG 


CAGGATTTCT 


TGAACAAAAT 


GAGCGCGTGC 


GCAAGGCAAC 


6130 


CTTTAAAGGA 


AGAATGTTCT 


CAGGAATTCT 


TTTCCCTGTC 


ATGAATGGGA 


TGAGCCTGAT 


6240 


TAATACAGCC 


ATCGTCATCT 


TTGCTGGTTC 


GGCTGTACTT 


TTGAATGATA 


AGTCTATTGA 


6300 


AACAAGTACA 


GCCCTAGGTT 


TGATTGTTAT 


GTTTGCACAA 


TTTTCACAGC 


AGTACTACCA 


6360 


GCCTATTATC 


CAAGTTGCAG 


CGAGTTGGCG 


AAGCCTTCAG 


TTGGCC7TTA 


CTGGAGCTGA 


6420 


ACGAATTCAG 


GAAATGTTTG 


ATGCAGAGGA 


GGAAATCCGA 


CCTGAAAAGG 


CTCCAACCTT 


6480 


CACTAAGTTG 


CAAGAAAGTG 


TTCAAATCAG 


TCATATCCTT 


TTTTCATACT 


TGCCTGATAA 


6540 


ACCTATTTTG 


AAAGATCTCA 


GCATTTCTGC 


CCCTAAAGGC 


CAGATGACAG 


CAGTTGTTGG 


6600 


GCCGACAGGT 


TCAGGAAAAA 


CGACTATTAT 


GAACCTCATC 


AATCGCTTTT 


ATGATGTTGA 


6660 


TGCTGGTGGT 


ATTTATTTTG 


ATGGTAAAGA 


CATTCGTGGC 


TATGACTTAG 


ATAGTCTTAG 


6720 


AACCAAGGTG 


GGAATTGTAT 


TGCAAGATTC 


GGTCTTGTTT 


AGCGGAACGA 


TTAGAGACAA 


6780 


TATCCGATTT 


GGTGTGCCAG 


ATGCTAGTCA 


GGAAATGGTT 


GAGGTAGCAG 


CAAAAGCAAC 


6840 


CCACATTCAC 


GACTATATCG 


AAAGTTTGCC 


TGATAAGTAC 


GATACTCTTA 


TTGATGATGA 


69C0 


CCAGAGCATC 


TTTTCAACAG 


GGCAGAAGCA 


ATTGATTTCA 


ATCGCTCGAA 


CCCTGATGAC 


6960 


ACATCCAGAA 


GTTCTCATTC 


TCGATGAAGC 


AACTTCAAAC 


CTACATACCG 


TGACACAAAG 


7020 


CAACATTCAG 


CATGCCATGG 


AGGTGGTTGT 


AGCACGTAGA 


ACTAGTTTCG 


TCATTGCCCA 


7080 


CCGCTTGAAA 


ACCATTCTCA 


ATGCAGATCA 


GATTATTGTC 


CTTAAAGATG 


GAGAAGTCAT 


7140 


TGAACGTGGT 


AACCACCATG 


AACTTTTGAA 


GCTAGGTGGC 


TTTTATTCAG 


AACTCTATCA 


7200 


CAATCAATTT 


GTTT7CCAAT 


AAGAAAGAAG 


TTC7CCTATC 


TGGGCAGCTT 


TTTCTTGTCC 


7260 


ATAAAAAATG 


TT7ATCACAG 


CCTTAAAAAA 


AACATATTAG 


ACGAAAGTCA 


TTTTGAGTGA 


7320 


TATGATAGGA 


CTATCGTTAG 


CATTCGAAAG 


GAGAGGCATC 


ATGGCTAGAA 


CGGTTGTAGG 


7380 


AGTTGCTGCA 


AATCTATCTC 


CCCTAGACGC 


AGAAGGCAAA 


ATCATTCATT 


CATCTGTATC 


744C 
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TTGTAGATTC GCAGAGATCA TTCGTCAAGT CGCTGGTCTC CCTTTAGTCA TTCCTGTTGC 
TGATGAG7CA GTTGTACC7G ATTATGTGGA AATGATTGAC AAACTCATTT TGACAGGAGG 
CCAAAATCTT CATCCTCAGT TTTATGGAGA GAAAAAGACC GTCGAGAGCG ATGATTACAA 
TCTGGTCCGT GACGAATTTG AATTGGCACT CTTGAAGGAA CCCCTTCCTC AGAATAAACC 
AATTATGGCA ATCTGTCGCC G7GTCCAACT TCTCAATCTT GCCTTTGGTG GAACCCTCAA 
TCAAGAAATC GAAGGTCAGG 
(21 INFORMATION FOR SEQ ID NO: 64: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ 10 NO: 64: 

GAGGTTTTAA TTCACTTACC TCTsCCGTAT CTTTATTTAA AATGAATTCT TTTACGGTTG 60 

TATTTCTTGC AAAATCTTTT ACAACAATCT TAATGTTTAG TGTCTTGTCT ATTATTTGTT 120 

TAATATCATT AAATGATGTA TATTCTTTTC CATTTATATA AATATGTTGT TCTTGAATCT 180 

CACCATCGAA TCCATTATTT CTTTTATCAT TGATGTTAAA GACTACAGAT TTTCCATCAG 240 

CATATTCGAT ACTAGTATTT CCCTTAGGAT CAATGTTTAC TTCGGGTTTA ACATTATCAT 300 

ATAAAAACTG A7ACTCCACT CCAACTGCTT TAGCATTCAA ATCGCTATAG CCAGTTTGAA 3 60 

GATAAACATT TCCATCCATA TCTGTTACCT TATCTCGAAA TCCCTTTCCT TTATAGTCTT 42 0 

TCATTCCCCA GTCCATGATG TCACCGTCTT TAACATTCAG CTTAATATTA AAATCTCTAG 430 

TGTTATCAAT GTG7AAATCT CCGTAGATTA AATAATTATC TACAACCCAT TCATTAACTC 540 

TCAATTCCCA GTTAAAACCA CCCTTATCAG AAATCTTACC TCTTAAATAA AATTCTGGAT 600 

TTCGTACATA AATTTTATTA GATTTAGATG GATTAAAGTA GTTCTTATCC ATTGAAAGGT 660 

TTACTGGTTT GGTATCAATA AATAACATCG AGCCATCTTC TTTTATACCT TCTACATTGA 7 20 

ACTTATCCTC TCCAGTGTAT TCTTTATCAT CCTTACCAAA TAATACAAGT TTAGAAGAAT 780 

CTGTCACAAG ATTTCCCTCT TTATCGATAG CTTCCCCTTT ATCCTTCATT TTAAATGTAA 84 0 

ACACTTGATA CCTTATAATG TTAAAGCCGT CCAAAGCCGA CATTAATACA GATTGGGTAC 900 

TTCTTCCATC TTCAACAT7T CTACTATCAG CATAAATTGT TCTTTCTCAA AGGGCTCTTA 960 

GATTAGGATT GGCCTTTTGT ATTTTTGCTA TATCTTCCTT GCTATAGACT CCATTTCCTT 1020 



7500 
7560 
7620 
7680 
7740 
7760 
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CTAACATATC 


CGTTTTTCCA 


GGATTATAGG 


TAGTCACTTT 


TAGTCCATAG 


CCTTTTCTTA 


1080 


GAATCATATT 


ATCC7TTAAC 


AGATATTGTT 


GTTTTTCTGA 


ATCAGAATAG 


ATTTTACCAG 


1140 


ATTCCATTTT 


AGTTAAATTC 


TCTGGTTTGT 


TTTTTGAAAG 


ATCTCCTTCC 


CCTAATTCTA 


1200 


TGACATTCCC 


ATAACTTCAT 


ACATAGGGAT 


ATTCTGATTT 


AGTTTCCTTA 


ATTTTTTCAG 


1260 


GCATTCTAAT 


TTTAATTTCA 


GCTTTTTTCT 


CATCATTATC 


TTTAACAAAT 


AATCTCATAT 


1320 


CTCCTGCAAA 


AGCTAATCCA 


TCCACAATAT 


CATTAATATT 


AGCGTATAGA 


TCAAATGTCA 


1380 


TCGTTTTTGA 


GTGGAAATCA 


TACTTGGTCG 


CTTTGATTTC 


TATAGATTTA 


TAGTTATTCC 


1440 


CATAATATAC 


CTTGGCATTT 


TTAGAAACAT 


TACTTATCTT 


TCCAAGAATT 


TCAAAGTGTC 


1500 


CATCTTTAGA 


CGGACTTAGA 


ACACCATAAA 


TTTTTGATTT 


GATTTCGTCA 


ACTTTCTCAG 


1560 


TTTCATATTC 


TAGATCAGTC 


CCATCATCGT 


AGGCTATTAT 


ATTTCCTTTA 


TCATCGTATT 


1620 


TATAATCGTA 


TTCCTCCATT 


CTCTTACCAG 


TTTCACTTGT 


AAAATCATCA 


ACTTCTCTAA 


1680 


ATTTCTTTTT 


AATGAGTTTC 


TTTAAGTCTT 


TATTTTCAAA 


GTCTCTAATT 


GTTGAAATAT 


1740 


TTCTATCAAT 


AGTAAAACTA 


GATTTTTCTT 


TAATAGACTC 


TTCATTTTCT 


TCATGATGAT 


1800 


gttctacccc 


AGTTCTATCT 


TTTTTTAGAC 


TACCCTCTTT 


TCCATTTCCT 


AAATTTTTAA 


1960 


ATTTAGATTC 


TGCAATCTCG 


CCAAGCTTTT 


GATATTTAGA 


TGAATCTTGA 


TCAGCATCTA 


1920 


CTAGATAATA 


GGAAATCATC 


CCCTTTTCAT 


CAGCCTGATT 


AGCAAATTTA 


ATTCTATGAA 


1980 


TCTTTGTGAA 


ATTGCTAGAA 


CCATCTAATC 


CAATGACTTC 


AATGATTTTT 


CCCCTTAAAT 


2040 


CTCCCGCACC 


TTTAATTTCA 


TAAATGGTAT 


TTCCGTCTTT 


ATCAAGTTTT 


CTATTTCTTC 


2100 


CTTGACCCTC 


ACCTGCGTAA 


GTTACTTCAA 


GA7TTTTTTC 


AACCTCTCCA 


TCTTCATTAA 


2160 


CAAGAGCGGC 


GCCAGCATAC 


CAAACTTCGT 


TCCCAATCTC 


GTCAAATTTT 


TCAGGATGTT 


2220 


CTTTTTGATC 


TCTCGCAAAT 


AGCCTTTCAT 


TCTTATACTG 


ATCTTTTACC 


TTATGATAAG 


2280 


TATCCTTTGT 


AATCAACTTA 


ATTTTTTCAG 


GATTTGAAAA 


ATCAACCGAA 


ACAATCTTAC 


2340 


GGGCGGTCTT 


ATCAATTTTT 


ACAGGAATAT 


AGGAAACCTG 


CCATGGCTAA 


TCTTTAGTTA 


2400 


ATCTATATTT 


AAATTTATAG 


AAATATTGAC 


CTTCCGCAAT 


CGGTTCAAAT 


TGACCTCTTA 


2460 


7CTTAGTAGC 


AGGATCTTGA 


TTATCCTTAC 


TTTCTGGTGC 


ATTTTCTTCT 


CTACCTCTAG 


2520 


GATTATAGAT 


GAGTCCATCC 


CACTTCAAGT 


CACCCCAAAC 


TTTTACTTTA 


GATGATTTGA 


2580 


TTCCCTTTCC 


ATCATTGCTT 


TTAGAATTTA 


AAATTCCTCT 


AATAAAGTCT 


TCTCTCGAAA 


2640 


TGACTTTTAA 


GTCTCTTTGA 


TTTTCTCCCT 


CTTTATTTGT 


ATTTACTATT 


GAAATCAATC 


2700 


CTTCTTCTGC 


ACTTCTTAAT 


ACA 








2723 



(2) INFORMATION FOR SZQ ID NO: 65: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11831 base pairs 
(3) TYPE: nucleic acid 
( C> STRANDEDNESS: double 
(D| TOPOLOGY : linear 



4 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


65 : 






AAAAAAGTGG 


GAATGACTCA 


AATCTTCACT 


GAAGCTGGCG 


AATTCATCCC 


TGTAACACITT 


An 

0 v 


ATTGAAGCAA 


CTCCAAACGT 


TGTTCTTCAA 


CTTAAAACTG 


TTGAAACAGA 


CGGATACAAP 


i. — vj 


GCTATCCAAG 


TTGGTTTCGA 


TGACAAACGC 


GAAGTATTGA 


GCAACAAACC 




130 


CATGTAGCGA 


AAGCTAACAC 


GGCTCCTAAG 


CGCTT C ATTC 


GTGAATTCAA 


AAAfT.TTr.AA 


i d n 


GGCTTGGAAG 


TTGGTGCTGA 


AATTACAGTT 


GAAACATTCG 


CAGCTGGAGA 




inn 


GTAACGGGTA 


CTTCTAAAGG 


TAAAGCTTTC 


CAAGGTGTTA 


TCAAACGCCA 


CGGACAATCA 


i fin 


CGTGGACCAA 


TGGCTCACGG 


TTCTCGTTAC 


CACCGTCGTC 


CAGGTTCTAT 


GGGGCCTGTT 


4 ^ n 


GCACCTAACC 


GCGTATTCAA 


AGGTAAAAAC 


CTTGCAGGAC 


GTATGCGTGG 






ACAATTCAAA 


ACCTTCAACT 


TGTACAAGTT 


GTTCCAGAAA 


AGAACGTTAT 


CCTTATCAAA 


54 0 


GGTAACCTAC 


CACGTGCTAA 


GAAATCTCTT 


ATCACTATCA 


AATCAGCAGT 


TAAACCTCGT 




AAATAATAAA 


GAAACGGCAA 


ATCAGTCACA 


ATGGCAAACG 


TAACATTATT 


TGACCAAACT 


660 


GGTAAAGAAG 


CTGCCCAAGT 


TGTTCTTAGC 


GATGCAGTAT 


TTGGTATCGA 


ACCAAATGAA 


720 


TCAGTTGTGT 


TTGATGTAAT 


CATCAGCCAA 


CGCGCAACCC 


TTCCTCAAGG 


AACACACGCT 


730 


GTTAAAAACC 


GCTCTGCAGT 


ATCAGGTCGT 


GGACGCAAAC 


CATGGCGTCA 


AAAAGGAACT 


840 


GGACGTGCTC 


CTCAACCT7C 


TATCCCCTCA 


CCACAATGGC 


GTGGTGCTCG 


TGTTGTCTTC 


900 


CGACCAACTC 


CACGTTCATA 


CGGCTACAAA 


CTTCCACAAA 


AAGTTCGTCG 


CCTAGCTCTT 


960 


AAATCAGTTT 


ACTCTCAAAA 


AGTTGCTGAA 


AACAAATTCG 


TAGCTGTAGA 


CCCTCTTTCA 


I02C 


TTTACAGCTC 


CAAAAACTGC 


TGAATTTGCA 


AAAGTTCTTG 


CAGCATTCAG 


CATCGATTCT 


1080 


AAAGTTCTTG 


TTATCCTTGA 


AGAAGGAAAT 


GAATTCGCAG 


CTCTTTCACC 


TCGTAACCTT 


1140 


CCAAACGTGA 


AAGTTGCAAC 


TCCTACAACT 


CCAACTCTTC 


TTGACATCGC 


AAATAGCGAC 


1200 


AAACTTCTTG 


TCACACAAGC 


AGCTATCTCT 


AAAATCGACG 


AGGTTCTTGC 


ATAATGAATT 


1260 


TCTATCATCT 


TATCAAAAAA 


CCTGTCATCA 


CTGAAAGCTC 


AATGGCTCAA 


CTTGAAGCAG 


132C 


GAAAATATGT 


ATTTGAAGTT 


GACACTCGTG 


CACACAAACT 


TTTGATCAAG 


CAACCTGTTG 


1380 


AAGCTCCTTT 


CGAAGGTCTT 


AAAGTTGCCA 


ATGTTAACAC 


AATCAACGTA 


AAACCAAAAG 


1440 



WO 98/18931 PCT/US97/19588 



540 

CTAAACGTGT TGGACGTTAC ACTCGTTTTA CTAACAAAAC TAAAAAAGCT AT CATC AC AC 1500 

TTACAGCTGA TTCTAAAGCA ATCGAGTTGT TTGCTGCTGA AGCTGAATAA TCTAAGGAGG 1560 

AAATATCGTG GGAATTCGTG TTTATAAACC AACAACAAAC CGTCCCCGTA ATATGACTTC 1620 

TTTGGATTTC GCTGAAATCA CAACAAGCAC TCCTGAAAAA TCATTGCTTG TTGCATTGAA 1680 

GAGCAAGGCT GGTCGTAACA ACAACGGTCG TATCACAGTT CGTCACCAAG CTGCTGGACA 1740 

CAAACGTTTC TACCGTTTGC TTGACTTCAA ACGTAATAAA GACAACGTTG AAGCAGTTGT 1800 

TAAAACAATC GAGTACGATC CAAACCGTTC TGCAAACATC GCTCTTGTAC ACTACACTGA 1360 

CGGTGTGAAA GCATACATCA TCGCTCCAAA AGGTCTTGAA GTAGGTCAAC GTATCGTTTC 192 0 

AGGTCCAGAA GCAGATATCA AAGTCGGAAA CGCTCTTCCA CTTGCTAACA TCCCAGTTGG 1980 

TACTTTGATT CACAACATCG AGTTGAAACC AGGTCGTGGT GGTGAATTGG TACGTGCTGC 2040 

TGCTGCATCT CCTCAAGTAT TGGGTTCTGA AGGTAAATAT GTTCTTGTTC GTCTTCAATC 2100 

ACGTGAAGTT CGTATGATTC TTGGAACTTG CCGTGCTACA GTTGGTGTTC TCGGAAACGA 2160 

ACAACATGGA CTTGTAAACC TTGGTAAAGC AGGACGTAGC CGTTGGAAAG GTATCCGCCC 22 2 0 

AACACTTCCT CGTTCTCTAA TGAACCCTAA CGATCACCCA CACCGTGGTG GTGAAGGTAA 2280 

ACCACCACTT GCTCGTAAAG CACCATCTAC TCCATGGGCC AAACCTGCTC TTGGTCTTAA 2 340 

AACTCGTAAC AAGAAAGCGA AATCTGACAA ACTTATCGTT CGTCGTCCCA ACGAGAAATA 2400 

ATATTAAACT AGTCGCTTAA GCAACTAGTA AATCCGCCAG CTCCGTAGCC CTCCATAGGA 24 60 

GTGCAAGCCG CTCTGGTACA ACATTTAAAG GAGAAAATAT AAAAATGGGA CGCAGTCTTA 2 520 

AAAAAGGACC TTTCGTCGAT GAGCATTTGA TGAAAAAAGT TCAACCTCAA GCTAACGACG 2580 

AAAAGAAAAA AGTTATTAAA ACTTGGTCAC GTCCTTCAAC GATCTTCCCA ACTTTCATTG 2 640 

GTTACACTAT TGCAGTTTAT GACCGACGTA AACACCTACC TGTTTACATC CAAGAAGACA 2 7 00 

TGGTAGGCCA CAAACTTGGT GAATTTGCAC CAACTCCTAC TTACAAACGT CACGCTGCAC 27 60 

ACGACAAGAA AACACGTAGA AAATAAGGAG AACATAAATC GCAGAAATTA CTTCAGCTAA 2820 

AGCAATGGCT CGTACAGTAC GTGTTTCACC TCGTAAATCA CGTCTTGTTC TTGATAACAT 2880 

CCGTCGTAAA AGCGTAGCCG ATGCAATCGC AATCTTGACA TTCACTCCAA ACAAAGCTGC 2940 

TGAAATCATC TTGAAAGTTT TGAACTCAGC TGTAGCTAAC GCTGAAAACA ACTTTGGTTT 3000 

GGATAAAGCT AACTTGGTAG TATCTGAAGC ATTCGCAAAC GAAGGACCAA CTATGAAACG 3 060 

TTTCCGTCCA CGTGCGAAAG GTTCAGCTTC ACCAATCAAC AAACCTACAG CTCACATCAC 3120 

TGTAGCTGT7 GCAGAAAAAT AAGGAGGTAA AATCGTGGGT CAAAAAGTAC ATCCAATTCG 3180 

TATGCGTGTC GGCATCATCC GTGATTGGGA TGCCAAATGG TATGCTGAAA AAGAATACGC 3 2 40 
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GGATTACCTT CATGAAGATC 
AGCAGTTTCA ACT ATT G AAA 
TGCTAAACCA GGTATGGTTA 
ACTTAACAAA TTGACTGGAA 
TTTGGATGCT CACCTTGTAG 
CCGTCGTGCA CAAAAACAAG 
AACTCAAGTA TCAGGTCGTT 
AGGAACTGTT CCGCTTCACA 
TACTACATAC GGTAAACTTG 
TCGTAAAAAC ACTAAAGGAG 
TCGTGAGTTC CGTGGAAAAA 
TGAATACGGT CTTCAAGCTA 
TCGTATCGCC ATGACTCGTT 
ACACAAATCA TACACTGCTA 
TGAAGGTTGG GTAGCACCAG 
TGAAGAGATT GCACGTGAAG 
ATTCGTAAAA CGTGAAGCAG 
TGTTAAAGAA CTTCGTGGTC 
AAAAGAATTG TTTGAACTTC 
CTTGAAAGAA GTTAAAAAAC 
ATAGACTAGG GAAGGAGAAA 
TGTTGTATCT GACAAAATGG 
CCCAGTCTAT GGTAAACGTA 
TGTTGCCAAA GAAGGCGATA 
ACCTTTCCGT CTTGTAGAAG 
AGAAAACTGA AATGATTCAA 
GCGAAATC7T G A CT AT C AAA 
ATG7TATCGT GGCATCTGTA 
TTGTTAAAGC AGTTATCGTT 



541 

TTGCAATCCG TAAATTCGTT CAAAAAGAAC TTGCTGACGC 3 3 00 

TCGAACGCGC AGTAAACAAA GTTAACCTTT CACTTCACAC 3 3 50 

TCGGTAAAGG TGGTGCTAAC GTTGATGCaC TCCGTGCAAA 3 4 20 

AACAACTACA CATCAACATC ATCGAAATCA AACAACCTGA 34 90 

GTGAAGGAAT TGCTCGTCAA TTGGAGCAAC GTGTTGCT7T 3 540 

CAATCCAACG TGCAATGCGT GCTGGAGCTA AAGGAATCAA 3 600 

TGAACGGTGC AGATATCGCC CGTGCTGAAG GATACTCTGA 3 660 

CACTTCGTGC AGATATCGAT TACGCTTGGG AAGAAGCAGA 3 720 

GTGTTAAAGT ATGGATCTAC CGTGGTGAAG TTCTTCCAGC 3*? 30 

GTAAATAACC AATGTTAGTA CCTAAACGTC TTAAACACCG 3 840 

TGCGCGCTGA AGCAAAAGGT GGAAAAGAAG TAGCATTCGG 3 900 

CAACTAGCCA CTGGATCACT AACCGCCAAA TCCAACCTCC 3 960 

ACATGAAACG TGGTGGTAAA GTTTGGATTA AAATCTTCCC 4020 

AAGCTATCGG TGTGCGTATG CCATC7CGTA AAGGGGCACC 4080 

TTAAACGTGG TAAAGTGATG TTCGAAATCC CTGGTGTATC 4140 

CGCTTCGACT TGCTAGCCAC AAATTGCCAG TTAAATGTAA 4200 

AATAAGGAGA AGGCATGAAA CTTAATGAAG TAAAAGAATT 4260 

TTTCTCAAGA AGAACTCGCG AACCGCCAAA ACGAATTGAA 4 3 20 

GTTTCCAACC TGCTACTGGT CAATTGGAAC AAACAGCTCG 4 3 80 

AAATCGCTCG CATCAAAACA GTTCAATCTG AAGCGAAATA 4 440 

TTTCAATGGA ACGCAATAAT CGTAAAGTTC TTGTTGGACG 4 500 

ACAAGACAAT CACAGTTCTA GTTGAAACAA AACGTAACCA 4 560 

TTAACTACTC TAAAAAATAC AAAGCTCATG ATGAAAACAA 4 620 

TCGTACGTAT CATGGAAACT CGCCCCCTTT CAGCTACAAA 4 680 

TTGTTGAAGA AGCGGTCATC ATCTAATCAA ACCTCAAAGG 474 0 

ACACAAACTC GTTTGAAAGT CGCAGACAAC AGCGGTCCTC 4800 

GTTCTTGGTG GTTCAGGACG TAAATTTGCA AACATCGGTG 4360 

AAACAAGCTA CTCCTGGTGC TGCGGTTAAA AAAGGTGACG 4920 

CGTACTAAAT CACGTCCTCG TCGTCCTGAT GGTTCATACA 4 980 
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TCAAATTTGA 


CGAAAACGCA 


GCAGTTATCA 


TCCGTGAAGA 


CAAAACTCCT 


CCCGGAACAC 


5040 


GTATCTTT GG 


CCCAGTTGCA 


CGTGAATTCC 


GTGAAGCTGG 


CTTCATGAAG 


ATCGTGTCAC 


5100 


TTGCTCCAGA 


AGTACTTTAA 


TTTTTAGGAA 


CAAACTAGTC 


CCCTACCTTC 


AAGCTAGGGT 


5160 


GCCCTTATGG 


GCGTAAGAAA 


AATCAAGGAG 


AAACCTAATG 


TTTGTAAAAA 


AAGGCGACAA 


5220 


AGTTCGCGTA 


ATCGCTGGTA 


AAGATAAGGG 


AACAGAAGCT 


GTTGTCCTTA 


CTGCCCTTCC 


5280 


AAAAGTAAAC 


AAAGTTATCG 


TTGAAGGTGT 


TAACATTGTT 


AAGAAACACC 


AACGTCCAAC 


5340 


TAACGAGCTT 


CCTCAAGGTG 


GTATCATCGA 


GAAAGAAGCA 


CCTATCCACC 


TATCAAACGT 


5400 


TCAAGTTTTG 


GACAAAAATG 


GTGTAGCTGG 


TCGTGTTGGA 


TACAAATTTG 


TAGACGGTAA 


5460 


AAAACT7CCC 


TACAACAAAA 


AATCAGGCGA 


AGTGCTTGAT 


TAATCACGAA 


GGAAAGGAGA 


5520 


AGTATAATGG 


CAAATCGTTT 


AAAAGAAAAA 


TATCTTAATG 


AAGTAGTTCC 


TGCTTTGACA 


5580 


GAACAATTCA 


ACTACTCATC 


AGTGATGGCT 


GTGCCTAAAG 


TAGATAAGAT 


TCTTTTGAAC 


5640 


ATGGGTCTTG 


CTCAAGCTC? 


ATCAAACGCT 


AAAAGCCTTG 


AAAAACCTGC 


TGAAGAATTG 


5700 


CCACT7ATCT 


CAGGTCAAAA 


ACCACTTATC 


AC7AAAGCTA 


AAAAATCAAT 


CCCCGGCTTC 


5760 


CGTCTTCGTG 


AACGTGTTGC 


GATCGGTGCA 


AAAGTTACCC 


TTCCTCGTCA 


ACGTATGTAC 


5820 


GAATTCTTGG 


ATAAATTGGT 


ATCAGTTTCA 


CTTCCACGTG 


TACGTGACTT 


CCACGGTGTC 


5880 


CCAACAAAAT 


CATTTGATGG 


ACGCGGGAAC 


TACACACTTG 


GTGTGAAAGA 


ACAATTAATC 


5940 


TTCCCAGAAA 


TCAACTTCGA 


TGACGTTGAC 


AAAACTCGTG 


GTCTTGACAT 


CCTTATCGTA 


6000 


ACAACTGCTA 


ACACTGACGA 


AGACTCACGT 


GCATTGCTTA 


CAGGCCTTCG 


AATGCCTTTT 


6060 


GCAAAATAAT 


ATAGGAGGTA 


AATCTAATGG 


CTAAAAAATC 


AATGGTACCT 


AGAGAGGCTA 


6120 


AACGCCAAAA 


AATTGTTGAC 


CGTTATGCTG 


AAAAACGTGC 


TGCATTAAAG 


GCGGCAGGGG 


6180 


ACTACGAAGG 


TTTATCTAAA 


TTACCTCGCA 


ACGCCTCACC 


GACTCGTTTA 


CATAATCCTT 


6240 


GTAGGGTTAC 


GGGGCGCCCA 


CATTCACTT? 


ACCGCAAATT 


TGCTCTCACT 


CCTATCCCTT 


6300 


TTCGCGAACT 


TGCGCATAAA 


GGTCAAATTC 


CTCGTGTAAC 


AAAACCATC7 


TGGTAATTTA 


6360 


AGATATCAAG 


AGCGTCAAAA 


CTCCAAGTAA 


AAATAGGAAA 


CTTGACGAAG 


AAACTAAAGT 


6420 


TTCTAGGAAA 


GTTTATCTTT 


7TCACACAGA 


GTTTACCCCG 


GCTTCAATTG 


GCCTTGCCAA 


6480 


TTTGAACACG 


AGCTACAGCT 


TTGGCAAAAA 


AGACCAATTT 


GCTTTGGAGC 


ATTGCTTCTG 


6540 


CATTAAATTG 


TCTATTTTTG 


CTCCTGCTGT 


TACGCTCTTT 


GTATCATGTA 


T7AACTAGCA 


6600 


AGTGCAACTT 


GCAAACTACT 


AGTAAGAGGA 


GAAAAACAAA 


ATGGTTATGA 


CTCACCCAAT 


6660 


CGCAGACTTC 


CTAACTCGTA 


T7CGTAATGC 


TAACCAACC7 


AAACACGAAC 


TACTTGAAGT 


5720 


ACCTGCATCA 


AACATCAAAA 


AAGGGATTGC 


TCAAATCCTT 


AAACGCGAAG 


GTTTTGTAAA 


6730 
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AAACCTTGAA ATCA7TCAAG ATGACAAACA AGCCGTCA7C CCTGTATTTC TTAAATACGG 6840 

ACCAAATCGT GAGAAAGTTA TCACTAACTT GAAACGTGTT TCTAAACCAG CACTTCGTGT 6900 

CTACAAAAAA CGTGAAGACC TTCCAAAAGT TCTTAACGCA CTTGGAATTG CCATCC7TTC 69 60 

AACTTCTCAA GCTTTGCTTA CTGATAAAGA AGCACGCCAA AAGAATGTTG GTGGTGAGGT 7020 

TATCGCTTAC GTTTGGTAAA ATCAAGATAC AAAGCTCGTA AAGAACAAAG CAAAATTAGG 7080 

AAGTTGGAGA AGTTTGT^TA CAAACAAGCC AACTTATCTA TTTTGCACAG TTCTTAGACC 7140 

GTGTTCAGTT CAGC7CTTGA AC7AAATAAG TATCTGAACC CCGTGAAAAC TCCCCG7TCT 7 200 

GGCCTGACAA TTTAACAGGA GAAAATAAAC ATGTCACGTA T7CC7AATAA AGTTATCGTG 72 60 

TTGCCTGCTG GTCTTGAACT CGCTAACAAT GACAACGTTG TAACTGTAAA AGCA7CTAAA 7 320 

GGAGAACTTA CTCGTGAGTT CTCAAAAGAT ATTGAAATCC GTGTGGAAGG TACTGAAATA 7 3 80 

ACTCTTCACC GTCCAAACGA TTCAAAAGAA ATGAAAACTA TCCACGGAAC TACTCGTGCC 7 4 40 

CTTTTGAACA ACATGGTTGT TGGTGTATCA GAAGGATTCA AGAAAGAACT TGAAATGCGT 7 500 

GGGGTTGGTT ACCGTGCACA GCTTCAAGGA TCTAAACTTG TTTTGGCTGT TGGTAAATCT 7 560 

CATCCAGACG AAGTTCAAGC TCCAGAAGGA ATTACTTTTG AACTTCCAAA CCCAACAACA 7620 

ATCCTTGTTA GCGGAATTTC AAAAGAAGTA GTTGGTCAAA CAGCTGCTTA CCTACGTAGC 7 6 80 

CTTCGTTCAC CAGAACCATA TAAAGGTAAA GCTATCCCTT ACGTTGCTGA ATTCGTTCGC 7740 

CCTAAAGAAG GTAAAACAGG TAAATAATGT TGACTGGTTG ATCATCAACC ACCAACCTAT 7 800' 

TTTCCAACTT TG7GCA7AGC ACACGATTTA AAACTAAAGA GGTGAAAACT GTCATTTCAA 7 860 

AACCAGATAA AAACAAACTC CGCCAAAAAC GCCACCGTCG CCTTCCCCCA AAACTCTCTG 7 920 

GAACTCCTGA TCCCCCACGT TTGAACGTAT TCCGTTCTAA 7ACAGGCATC TACGCTCAAG 7 980 

TGATTGATGA CGTAGCGGGT GTAACGCTCG CAAGTCCTTC AACTCTTGAT AAAGAAGTTT 804 0 

CAAAAGGAAC TAAAACTGAA CAAGCCGTTG C7G7CGCTAA ACTCGTTGCA GAACGTGCP A 9100 

ACGCTAAAGG 7AT7TCAGAA GTGGTGTTCG ACCGCGGTGG ATATCTATAT CACGGACGTG 8160 

TGAAAGCTTT GGCTGATGCA GCTCGTGAAA ACGGATTGAA ATTCTAATAG GAGGACACTA 82 2 3 

GAAAA7GGCA TTTAAAGACA ATGCAGTTGA ATTACAAGAA CGCGTAG7TC CTGTCAACCG 82 8 0 

7GTTACAAAA GTTGTTAAAG GTGGACGTCG 7CTTCC777C GCACCTC7TG TTGTTGTTGC 33 40 

7GACCACAAT GGTCGCGTAG GATTTGGTAC 7GGTAAACC7 CAAGAAG7TC CAGAAGCAAT 84 00 

CCGTAAAGCA GTAGATGATG CTAAGAAAAA CTTGATCGAA G7TCC7ATCC 7TGGAACAAC 84 60 

AATCCCACAC GAACTTCTTT CAGAATTCGG TGCAGCTAAA GTA77GTTGA AACC7GCTG7 95 20 
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AGAAGGTTCT GGAGTTGCCG C7GGTCGTGC AGTTCGTGCC GTTGTGGAAT TGGCAGGTGT 8580 

GGCAGATATT ACATCTAAAT CACTTGCT7C TAACACTCCA ATCAACATTG TTCGTGCAAC 8640 

TGTTGAAGGT TTGAAACAAT TGAAACGCGC TGAAGAAATT GCTGCCCTTC GTGGTATTTC 8700 

AGTTTCTGAT TTGCCATAAG AAAGGGGATA AAATGGCTCA AATTAAAATT ACTTTGACTA 8760 

AGTCTCCAAT CGGACGCAT7 CCATCACAAC GTAAAACTGT TGTAGCACTT GGACTTGGCA 8320 

AATTGAACAG CTCTGTTATT AAAGAAGATA ACGCTGCTAT CCGTGGTATG ATCACAGCAG 8880 

TATCTCACTT AGTAACAG7T GAAGAAGTAA ACTAATCAaG TTTTAGGCGA TGTGCACTGT 3940 

ACCATCCCCT AAAACTAGAT ATAG7CATC7 ATGATCACAT CGTATAGGCG AGTTGATGGG 9000 

GGAGACAACC 7T77C7CCC7 TATCGGCCCT AGCATTTTAC AAAAGAGGAG AAAATAAAAA 9060 

TGAAACTTCA TGAATTGAAA CCTGCAGAAG GTTCTCCTAA AGTACGTAAC CGCGTTGGTC 9120 

GTGGTACTTC A7CAGCTAAC GGTAAAACAT CTGGTCGTCG TCAAAAAGGT CAAAAAGCTC 9130 

GTAGCGGTGG CGGACT7CGC CT7CGTTTTC AAGGTGGACA AACTCCATTG TTCCGTCCTC 9240 

7TCCAAAACC TGGATTCACT AACATCAACG CTAAAGAATA CGCAATTGTG AACCTTGACC 93 00 

AATTGAACGT CTTTGAAGAT GGTGCTGAAG TAACTCCAGT TGTTCTTATC GAACCAGGAA 9 3 60 

TTCTTAAACC TGAAAAGTCA GGTATTAAAA TTC7TCC7AA CGGTGAC7TG ACTAAGAAAT 94 20 

TGACTGTGAA AGCAGCTAAA TTCTCTAAAT CAGCTGAAGA AGCTATCACT GCTAAAGGTG 9480 

GTTCAGTAGA AGTCATCTAA CAGAGCTCAC CTATGTTTTT TAAATTATTA AGACAAGCTC 9540 

TTAAAGTCAA GCAGGTTCGA TCAAAAATTT TATTTACAAT TTTTATCGTT TTCGTCTTTC 9600 

GTATCGGAAC TAGCATTACA GTTCCTGGTG TGAATGCCAA TAGCTTGAAT GCTTTAAGTC 9660 

GATTATCCTT C77AAACA7C TTCAGCTTCC TGTCGGGGAA TGCCCTAAAA AACTTTTCGA 9720 
T7TTTGCCCT AGCAGT7AG7 CCC7ATA7CA CCCC77CTAT 7CTTC7CCAA C7C77GCAAA 9730 
TGGATATTTT ACCCAAC7TT GTAGAGTGGG GTAAACAACG GGAAGTAGGT CGAACAAAAT 984 0 

7CAATCAACC 7ACTCCTTA7 ATTGCTCTAG T7CTCCC7T7 TG7CCAA7C7 ATCGGGATTA 9900 
CAGCTGGTT7 TAATACCTTG GCTGGAGC7C AATTGATTAA AAC7GCTT7A ACTCCACAAG 9960 

TTTTTC7GAC GATTGG7ATC ATCTTAACAG CTGGTACTAT GATTG7CAC7 TGGTTGGGTG 10020 

AGCAAATTAC AGATAAGGGA TACGGAAACG GTGTTTCCAT CATTATC7TT GCCGGGATTG 10080 

TTTCCTCAAT TCCAGAGATG ATTCAGGGCA 7CTA7G7GGA CTACTTTGTG AACG7CCCAA 10140 

GTAGCCGTAT CACTTCA7CT ATCATTTTCG 7AATCATTTT GATTA7TAC7 GTATTG7TGA 10200 

TTATTTACT7 TACAACT7AT GTTCAACAAG CAGAATACAA AATTCCAATC CAATATACTA 10260 

AGGTTGCACA AGC7CCTCCA TC7AGC7C7T ACCTTCCG77 AAAAGTAAAC CC7GCTGGAG 103 2C 
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TTATCCCTGT 


1 TA7CTTTGCC 


AGTTCGATTA 


CTGCAGCcTG 


CGGPTATTPT 


i TCAGTTTTTG 


l\J J ou 


AGTGCCACAG 


CTCATGATTC 


GGCTTGGGTA 


ACGGT AP.P AP 

nUVIU L rt\J^./\V. 


AArSPlTTW 

nnUnUA I C I I 


GGCAAGTACT 


10*140 


TCTCCAACTG 


GTATTGCCAT 


GTATGCTTTG 


TTPiATTATTP 

* 1 VJ «V 4 4 ^ 4 4 ^ 


TPTTT Ufa TT 

I l AC A 1 I 


v_ i . L 4 A I At_G 


10 500 


TTTGTACAGA 


TTAATCCTGA 


AAAAGPAGPA 


gagajcPPtap 


lino j/wf/*/* 
AAAAGAGTGG 


TGCCTATATC 


10560 


CATGGAGTTC 


GTCCTGGTAA 


AGGTAPAflAA 


PA AT AT ATP*" 


C I AAACTTcT 


TCGTCGTCTT 


1062 0 


GCAACTGTTG 


GTTCCCTCTT 


w\, 1 L Uvt I V) I U 


ATTTCCATTT 


. AC CGAi 1 Gc 


AGCTAAAGAT 


1C680 


GTATTTGGTC 


TTTCTGATGT 


TtTT'TIfTTT"' 


P.P.TPP A APA A 


C 1 C I C * . G A r 


CATTATCTCT 


10740 


ACAGGTATCG 


AAGGAATCAA 


GC AAT^CCAA 


P.CT T APPTA*^ 


l unMLLi » AA 


GTATGTTGGT 


10300 


TTCATGGACA 


GAACAGAATA 


AAAGTATTTA 


PTP.AATPAPT 
V I un« 1 rtV_» 1 


AAA 4 AC I UAL) 


GGAGTGGAGG 


10860 


TTTAAACTCT 


GACATTTGTA 


AGAGTTGGAT 


CTCCCCTCTT 




TTTAAATCGG 


10920 


GGTGAAAAGA 


CTTTTTGCTT 


CTATTTAAAA 


ATAAAATAAP 


^ArATPi ART" 

O AC A I C AAA T 


CATGAATCTT 


10980 


TTCATTATGG 


GCTTACCTGG 


TCPACflTAAP 


<V! A A PTP Air* 


CACCAAAAAT 


CGTAGAACAA 


11040 


TTCCATGTTG 


CACATATCTC 


AACA^nrnAT 


ATCT m PPf~r > P 


ever* & aivpo 
C I GC AATGGC 


AAATCAAACT 


1110C 


GAAATGGGTG 


TTCTTGCTAA 


CVTP AT AT ATT 


CArAArr'^T 


AA L 1 GGTTCC 


TGACGAAGTT 


11160 


AC AAATGG AA 


TCGTAAAAGA 


ACGCCTTTCA 


PAACATPATA 


TTAAAriASI* 
1 I AArtCAAAC 


AGG ATTCTT A 


11220 


TTGGATGGTT 


ACCCACGTAC 


AATTfi AAP AA 


PPTPATPPPT 


1 CC AC AAAAC 


ATTGGCTGAA 


11280 


CTTGGCATTG 


AACTAGAAGG 


TOTTATPAAT 


ATTPA APTPA 


AC C LTGACAO 


CCTTTTGGAA 


11340 


CGTTTGAGTG 


GGCGT A^ 1 PAT 


p p a p c*fi ppt a 




LTT. CcAcAA 


GGTCTTTAAC 


1 1400 


CCACCAGTTG 


ACTATAAAfIA 


At"! A Ara*TTA(* 
rtVjrt/\Vj/\ i I MV. 


Turpi a r°r~*vr* 
1 ACC AACG 1 C 


AAG ATG AT AA 


GCCTGAGACA 


11460 


CTAAAACC7C 


CTTTCCATCT 


TAATATTGC7 


CAAGGAGAAC 


CAATCA77CC 


7CACTACCGT 


1152 0 


CCCAAAGCTT 


TGGTTCATGA 


CATCGAAGGT 


AATCAAGATA 


TCAATCATGT 


CTTCTCAGAT 


11580 


ATTGAAAAAG 


TATTCACAAA 


TTTGAAATAA 


AGCGTTTTTC 


ACACTTCCAA 


AAATCCGCTA 


11640 


CAAATGTTAT 


ACTGAGATAG 


TCTGACTTAT 


AATTGTTC7C 


TCTGTGTCTA 


GAGGCATCGA 


11700 


ATCGAAATTT 


ATGGAGGTGC 


TTTTGCGTGG 


CAAAAGACGA 


TGTGATTGAA 


GTTGAAGGCA 


11760 


AAGTAGTTGA 


TACAATGCCG 


AATGCAATGT 


TTACGGTTGA 


ACTTGAAAAT 


GGACATCAGA 


11320 


TTTTAGCAGG 


G 










11331 


(2) INFORMATION FOR SEQ ID NO: 66 











( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 66: 

CCCGGCATTT GAAAGCTATT CGTGAAGGAT TTATGATGCC AATGCCTTTG ATTTTAGTCG 60 

GCTCTTTATT TCTTATTCTA ATCAGTTGGC CTCAAGAGGC TTTTACAAAT TGGCTGAATA -120 

GTCTTGGATT GCTAAGTATC TTGACAACTA TGAATCAGTC AACAGTAGCG ATTATCTCCT 180 

TCGTCCCTTG 7TTCGGTATT GCCTACAGGT TGTCGGAAGG ATATGC7ACA GATGG7CCGT 24 0 

CGGCAGGGAT CATAGCCTTA TCCAGTTTTG TATTGATGGC ACCTCCTTTT TCGAGTATGG 3C0 

TTTATGATAA AAATGGGGAG CAGGTCAAGC AGTTATTTCC CGGCGCAATA CCA7TTTCTA 3 60 

GCCTGAATGC ATCTTCTTTG TTTATGGCGA TTACTATTGC ATTGCTTACA GCAGAGATTT 420 

ATCGTATCTT TATCCAGCGC GGAATTACGA TAAAAATGCC AAGTGGTGTC CCAGATGTAC 4 80 

TAAGTAAATC ATTTTCAGCT CTTTTATCTG GTTTTACTAC TTTTCTTTTG TGGGCTTTGG 540 

TCTTAAAAGG TCTTGAAGCG GCAGGACTTG CAGGAGGTCT CAACGGACTC CTAGGTGCAA 600 

TTGTTGGAAC ACCGCTTAAG TTAATTGCAG GAACGCTTCC AGGTATGATT CTATGTGTTA 660 

TTGTAAACTC ATTCTTTTGG TTCTGTGGAG TTAATGGGGG ACAAGTTTTA AATGCTTTTG 720 

TAGACCCAGT TTGCTTACAA TTTACTACAG AAAACCAAGA AGCTCTGGCT GCAGGACAAA 7 80 

CACTCCAACA CATTATTACA TTACCGTTTA AACATTTATT TGTATTTATT GGTGGCGGTC 840 

GAG CG ACT AT TGGTCTTGCG ATTTGTCTCT TCCTATTTAG TAAGACTCGT GCGAATAAAA 900 

CATTACGTAA GCTAGCTATT ATACCCTCTA TTTTTAATAT CAATACAGC7 ATTCTATTTA 950 

CGTTTCCAAC AGTTTTAAAT CCGATTATGC TGATTCCGTT TATTGCTACT CCTACAATCA 102 0 

ATCCC77CAT TACCTA7G7A TCAA7CCC7G TAGGAT7AG7 ACCCTA7ACA ACAGGTG7AA 1080 

TCCTTCCGTG GACAATGCCA CCGATTATAG GAGGCT7CC7 7GCAACAGGG GCTAGTTGGC 114 0 

GAGGAGC7CT ATTACAAGTT G7TTTGATTT 7GG7T7CTGT AGCAAT7TAT TATCCAT7CT 12 00 

TCAAAATTGC AGATAAACGC AATCTTCAAA AAGAAAAAGC TACTGTTGGA GGGAAATAAG 12 60 

ATGGTTATCA GAGTATTTGA TCAACAGAAA AATACTTA77 CTAGCTTTGC CTTAGAGGAA 13 20 

TTAAGTTACT ATATGAATCG GGTCTTTAAG ACTAACATAG AGCTTCTCGA GGAGAAGGAA 1380 

GCGGATATTT TTGTAGGATT AGTCAATAAA GAGGACAGAA AAGACCATGT TCTTA7CTCA 14 40 

TTAGACAAGG GTAAGGGGAG AATTGAGTCT AATACAATTG TAGGTTTAC7 TATTGGAA77 150C 

TACCGAATGT TTCATGAATT TGGGG7TGTG TATACTAGAC CAGGGCGCAG ACATCACTTT 1560 

G7TCCAGAGT TACGATTTGA AGATTT7TTA GATAAACAGC TATCTATAGA TGAAACAGCC 1620 
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AGT7ACTATC 


ATAGGGGAGT 


ATGTATAGAG 


GGAGCGGATT 


CATTTGAAAA 


TATACTAGAT 


1680 


TTCATTCATT 


GGCTACCTAA 


GATTGGGATG 


AACACTTTTT 


TCATCCAGTT 


TGAAAATCCT 


1740 


TACTCTTTTT 


TGAAACGTTG 


GTATGAACAT 


CAATTTAATC 


CATATCTAAA 


TAAAGAACAA 


1800 


TTTTCAAATG 


AATTAGTACA 


AGAATTGAGT 


GATAGGTTGG 


ATAAAGAATT 


GCAAAAAAGA 


1860 


GGTCTTATTC 


ATCATCGTGT 


TGGTCATGGA 


TGGACAGGTG 


AAGTTTTAGG 


TTACTCTTCA 


1920 


" A^TTTCGCT 


GGGAATCAGG 


TCTTAGTATT 


^AGAGGAGA 


AGAAACCCTA 


TGTCGCTGAA 


1980 


ATAAACGGGA 


AACGAGAATT 


GTTTAATACG 


GCTCCGATT7 


TAACCAGCCT 


GGATTTTTCA 


204C 


AATCCAGATG 


TAGCTGATAA 


GATGGTAGAA 


ATTATCAAGG 


ATTATGCCAA 


GAAAAGACCT 


210C 


GATCTTAACT 


ACTTACATGT 


ATGGTTGTCG 


GATGCTCGTA 


ATAATATTTG 


TGAATGCGAA 


2160 


AACTGTAGAC 


AAGAATTCGT 


TTCGGATCAG 


TATATTCGTA 


TTCTCAATCA 


ATTGGATAGG 


2220 


CCTTTAACCA 


GTGAGGGATT 


AGATACAAAG 


ATTTGTTTTC 


TGCTTTATCA 


TGAGTTGTTA 


2280 


TGGGCACCTC 


AGAAAGAAAA 


ATTAGATAAT 


CCTGAACCCT 


TTACCATGAT 


GTTTGCACCG 


2 34.0 


ATTACAAGAA 


CATTTGAAAT 


GAGTTATGCA 


GATGTAGATT 


TTGACAATTC 


CATACCTACC 


2400 


CCTAAACCTT 


ATATGCGTAA 


TAAAATTATA 


CTTCCGAATT 


CTCTTGAGGA 


AAATTTATCT 


2460 


TATCTTTTTG 


AGTGGCAAAA 


AGCATTTAAA 


GGAGATAGTT 


TCGTATATGA 


CTATCCTTTA 


2520 


GGGCGTGCTC 


ATTATGGCGA 


TTTAGGCTAT 


ATGAAAATTA 


GTCAAACTAT 


TT AC AG AG AT 


2580 


GTATCTTATC 


TTTCCAACCT 


ACATTTCAAC 


GGGTACATTT 


CGTGTCAACA 


ATTACGTGCC 


2640 


GGATTCCCTC 


ATAATTTTCC 


TAATTATCTC 


ATGGGGGAAA 


TGCTCTGGAA 


GAAGACAAGA 


2700 


AGTTATCAAC 


AATTCATTGA 


AGAATACTTT 


TCTGCTTTGT 


ATGGGGAAAA 


TTGGCAGTCT 


2760 


CTTCTTCAAT 


ATTTAGAAAA 


ATTATCCATT 


TATTCCTTTT 


G7GAT7ATT7 


TAATGCAATT 


2820 


GGCAGCCGTC 


AAAGTGATGT 


TTTACCGAAT 


CATTATTATA 


TAGCTTACAA 


TCTAGCTGAT 


2880 


aattttttac 


CAATTATTGA 


GGAAAATATT 




TAAATAHT^A 




2 940 


7GGAAACAGC 


TCAGTTATCA 


TCGTGAATAT 


GTTGTTAAGA 


TGGCGAAGGC 


TTTATATCTT 


3000 


CAAGCAACTG 


GAAAAACAAG 


GCAAGCTCAA 


GATGAATGGA 


GAAATGTGTT 


GAATTATATC 


3060 


CGTGGGCACG 


AATTGCTATT 


TCAATCTAAT 


TTGGATGTTT 


ATCGTGTAAT 


TGAAGTAGCA 


3120 


AAAAATTACG 


CTGCTTTCCA 


CTTATAAATC 


ATAAGTATAG 


AAAATGAAC7 


AAGGTATTCA 


3130 


GAGAAGATTG 


ATCCTAAATA 


TTATGAAATT 


TAAGCATTTT 


TAAGATATTT 


ACCGTCAACT 


3240 


TTCTATTTAT 


ATCGTAGCGA 


AGTCATTTTA 


ATAATGATCT 


GTAAAAGATG 


GATCAAGATT 


3300 


GAGGAGGAAG 


AAAGATGAAA 


TCAAAAGAAG 


AAATAAATAT 


GCTTCGTTTT 


ACAATTGTCG 


3 360 
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CTTACGCAGG 


AGATGCAAGG 


7CAGATTTGA 


TGGATGC7TT 


GGCGTTTGCG 


AGAGATGGAT 


3420 


A7TT7GAACA 


GGCAAGAGAA 


TTGGTTGAGT 


C7GCAAACGA 


CTCAATAGTG 


TCTGCCCATC 


3480 


GAGAACAGAC 


TAATTTAT7A 


GCGGAGGAGG 


CATA7GGAGA 


7AA77TTGAA 


GTGAGCTTTA 


3540 


TTATGATTCA 


TGGTCAACAT 


ACTTTGATGA 


CAACGATGCT 


ATTGTATGAT 


CAGGTAAAGT 


3600 


TTTTTA7TGA 


TCAA7ATGAA 


CGAATTCGAA 


AGATTGAAGA 


ACKTATTGGT 


TTGCAA7GAG 


366C 


GATTAGTCAT 


GGAAAATTTA 


CAGGTTAAAG 


CCT7ACCGAA 


CGACTTTTTA 


TTAGGAACTG 


3720 


CTACCGCTGC 


TTATCAAGTA 


GAGGGTGCAA 


CTAGGGTAGA 


TGGCAAAGGA 


ATAAATATGT 


3780 


GCCA7CT7TA 


TTTGCAAGAA 


AA7AG7CCG7 


7CTTACCAGA 


TCCAGCTAGT 


GA7TTTTATT 


3840 


A7CGT7ACGA 


AGAGGATATA 


GCTTTCCCGC 


CAGAACATGG 


TTTGCAGGCT 


7TGCGTTTAT 


3900 


CTATTTCTTG 


GG7TCC7ATA 


TTTCCTGATA 


TAGATGGGGA 


TGC7AATGTA 


T7AGCTGT7C 


3960 


ATTATTACCA 


TAGAGTTTTT 


CAGTCTTGCT 


TAAAACATAA 


7G7CAT7CCG 




4020 


TACATCATTT 


TGATTCGCCT 


CAGAAAA7GT 


TAGAAACAGG 


CCA77GG7TG 


AACAGAGAGA 


4080 


ATATTGATCG 


TTTCA7ACGA 


TATGC7CGC7 


TTTG77TCCA 


AGAATTTACA 


GAAG7CAAGC 


4140 


ATTGGTTTAC 


AATCAATGAA 


CTCATGTCTC 


TTGCTGCAGG 


TCAA7ATA7A 


CGACGTCAGT 


4200 


7TCC7CCAAA 


TCA7CATTTT 


CAATTATC7G 


AAGCAATTCA 


AGCGAATCAT 


AA7ATG7TGT 


4260 


TGGCGCATGC 


TCTTGCAGTC 


CTCGAATTTC 


ATCAATTAGG 


CATTGAGGGA 


AAGGTAGGTT 


4320 


G7ATTCATCC 


TTTAAAGCCA 


GGCTATCCTA 


T7GA7GGGCA 


AAAAGAAAA7 


A7TTTCGCAG 


4380 


CTAAACGGTA 


TGATGTTTAT 


AATAATAAAT 


TTC7ATTAGA 


7GGAAC7TT7 


T7GGGCTAC7 


4440 


ACAGTGAGGA 


CACCCT7777 


CACT7GAA7C 


AAATA7TGGA 


AGC7AATAA7 


TC7ACCTT7A 


4500 


TTATTGAAGA 


TGGTGATTTA 


GAAATTATGA 


AGAGACC7GC 


ACCTCTTAAT 


ACGATGTTTG 


4560 


3CATCAAT7A 


77A7CC77CA 


GAA777A77C 


GTCAAT AC AA 


ACG7GAAAA7 


AGACAAGAAT 


4620 


TTAATTCAAC 


AGGAA7AAAA 


GGACAGTC77 


CTTTTAAATT 


AAATGCTC7A 


GGTGAATTTG 


4680 


TAAAAAAACC 


TGGTATTCCG 


ACAACAGATT 


GGGATTGGAA 


TA7TTA7CC7 


CAAGGCT7AT 


4740 


77CATA7C77 


GC7TCG7A7C 


AAAGAAGAA7 


ATCC7CAACA 


TCCGGTCA7T 


TATTTAAC7G 


4800 


AAAATGGTAC 


AGCCC7TAAA 


GAAGTTAAGC 


CAGAGCGCGA 


GAA7GA7ATT 


A77GA7GACA 


4860 


GTAAGAGAAT 


CCGTTATAT7 


GAGCAACA7T 


TACACAAAGT 


TTTAGAGGCT 


CCACATAGAC 


4920 


GAGTCAATAT 


TCAAGGCTAT 


7TTATATGG7 


C7T7GCAAGA 


TCAA77TTCT 


7GGGCGAATG 


4980 


GCTACAATAA 


GCGATA7GG7 


C777TC7TTG 


TTGAT7ATGA 


AACACAGAAG 


AGA7ATA77A 


5040 


AGAAAAGTGC 


7CTTTGGG7A 


AAAGGGC7AA 


AACGGAAT7A 


AGG77AGCGA 


7T7GACTGA7 


5100 


G7TTAATATG 


TTT7AAATAT 


GAGGTTGAAT 


77TT7ATAGG 


AGGAG777TA 


7GGA7AAGC7 


5160 
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AGTCCCTGCC 


ATTGAAAAGC 


AACAAGGGAA 


ATTTGAAAAA 


ATTTCTACTA 


ATAACTATAT 


5220 


GATGCCTATT 


AAAGATGGAT 


TCATTGCTAC 


TATGCCTTTA 


ATTATGTTT? 


CAAGCTTTTT 


5230 


GATGATTATT 


ATTATGATTC 


CTAAAAATTT 


CGGAGTAGAG 


TTACCGAGTC 


CAGCTATTGT 


5340 


CTGGATGAGA 


AAAGTGTATA 


TGTTAACCAT 


GGGAGTTTTG 


GGTATTATTG 


TTTCAGGGAC 


5400 


TGTTGGAAAG 


TCATTAGTTG 


GAAATGTTAA 


CAGAAAAATG 


CCTCACGGAA 


AGGTAATAAA 


5460 


TGATAT TTC T 


GCAATGTTGG 


CAGCCATATG 


TAGTTATCTG 


GTATTAACTG 


TAACCCTTGT 


5520 


AGTTCATGAG 


AAGACGGGAT 


CTACAAGTTT 


GTCGACAAAC 


TATTTAGGAT 


CTCAAGGATT 


5580 


GATAACtTCC 


TTTGTCAG7G 


CCTTTATTAC 


TGTAAATGTT 


TACCGATTCT 


GTATTAAGCG 


5640 


AGACATTACT 


ATTCATTTAC 


CTAAGGAAGT 


TCCTGGGGCT 


ATATCACAAG 


CTTTTAGAGA 


5700 


TATTTTCCCT 


TTTTCTTTTG 


TTTTACTTAT 


TAGTGGTTTG 


TTAGATATTG 


TATCTCGGTT 


5760 


TAGTTTAGAT 


GTTCCTTttG 


CCCAAGTATT 


TCAACAACTA 


TTGACTCCTA 


TTTTTAAGCG 


5820 


GGCAGAATCA 


TATCCTGCTA 


TGATGTTGAT 


TTGGTTTATG 


TGTGCTTTGC 


TTTGGTTTGT 


5380 


TGCAATTCAT 


GGACCATCTA 


TTCTCTTACC 


TGCTGTTACA 


GCTTTGCAAC 


TGAGCAATAT 


5940 


GGAAGAGAAT 


GCTCAACTTC 


TTGCAAATGC 


GCAGTTCCCT 


TATCATTCTT 


TAACACCTAA 


6000 


TTTCGGGAAT 


TATATCGCTG 


CTATTGGAGG 


AACGGGGGCT 


ACCTTTGTTG 


TACCATTTAT 


6060 


TTTGATTTTC 


TTTATGCGGT 


CTAAACAATT 


AAAATCGGTA 


GGTAAAGCTA 


CAATTACTCC 


6120 


TGTTTTATTT 


GCGGTAAATG 


AACCTCTTCT 


ATTTGGTATG 


CCTGTTATTT 


TGAATCCCTA 


6180 


TCTTTTTGTC 


CCTTTTTTCA 


TGACTCCACC 


AGTGAATGTA 


TTTCTAGGAA 


AGGTCTTTAT 


6240 


TGATTTCTTT 


CGAATCAATG 


GATTTTATAT 


CCAGTTACCT 


TGGACCTTTC 


CTGGTCCCTT 


6300 


CCCATTCTTA 


AT7CGAACCA 


ATTTTCAACT 


TATCTCCTTT 


GTATTTT7AT 


CTTTCATTTT 


6360 


AGTTGTCGAC 


ATATTGATTT 


ATTTGCCATT 


CTGTAGAGCG 


TATGATAGAC 


AGTTACTCCT 


6420 




All Vjt. AAO<- I 


V- AAATUAT AT 


TATTTTAGAG 


GAGGATACAA G7CAAATAAT 


6480 


TCCTGGTGAG 


ATAGATGAAA 


TAAAAAGTAA 


GGAGTTGAAA 


CTACTGGTTC 


TTTGTGCAGG 


6540 


GTCTGCAACA 


AGTGCGCAAT 


TAGCCAATGC 


AATTAACCAG 


GGGGCTAACT 


TAACAGAGGT 


6600 


TAGAGTGATT 


GCGAATTCAG 


GACCGTACGG 


ACCTCATTAT 


CATAT7ATGG 


GTGTTTATGA 


6660 


TTTAATTATT 


CTGGCCCCAC 


AAGTTCGGAG 


TTATTATAGA 


GAGATGAACG 


TGGATGCAGA 


6720 


AAGATTAGGT 


AT7CACATAG 


TTGCTACCAG 


AGGAATGGAA 


TATATTCATT 


TAACAAAGAG 


6780 


TCCAAGTAAA 


GCCTTACAAT 


TTGTAtTGGA 


GCATTACCAA 


GCTGTGTAGT 


AAGTTTTTCC 


6840 


ATCTTTTATT 


TGAGTAAAGA 


TTTTGTTTAC 


AGATAGCCTT 


CGATTTAAAA 


ACCTTCCCCC 


6900 



WO 98/18931 



PCT/US97/I9588 



550 

TTTTTTAATA TAAGAATCCC TCTTTCACAA TTGTAAAAAG AGGCATTTTG TATTTTATCT 6960 

CTTAGACCAA GTTCTCTTCA TAAAGAGAAG GAGGATTGGG TAAATCTCCA AGCGCCCTGC 7020 

AATCATTGCA AAGGATAGGA CAATTTTTCA GATGGGACTA AAGATTGAGA AACTAGAAGT 7080 

CCTTCCTAGA ATAGGCCCGA TATTATTGAA ACAGCTAAAG ACAGCGCTGG TCACGACCAG 7140 

AAAATCATTG CTATCTAGGC TGACAATAAA GATAAGCGCT AGCAAAATCA TACCATACAT 7200 

GACAAAGTAC TTGAGAATCT TATGCTGGGT ATCTTTGTCA ATCACCGTTT TATTAACATG 7 260 

GAGGGTCAAA ACACGGTGGG GCGATAGGAT TGACAAAATT TGGTTTTTGG CAATTTTTGA 7320 

AAGGATGAGG CCTCGAATAA TCTTGAGTCC ACCTGCACTT GATCCAGCAG AGCCACCGAT 7 330 

TGCCATGAGG AAAAGGAGGA TAAACTGGGA GAAGAGGGGC CAGTTCGTAA TATCTCCATA 7 44 0 

TCCAAAACCA GTTGTTGTAA TGATGTTGGA AACCTGGAAG AAGGTCATTT CAAAGCTCTT 7500 

TCAAAACCCT GGGTAGAGGT AGAGGGTGTT GAGGCTAATC AAGCCTCTAG AAACCAGTAC 7560 

AATGACCAAG TAAGCCCTAA GCTCTTCATC TCCAAAGAAG GCCTTGATGC GACCGAGCAT 7 620 

GAGGTAGTAG TAGAGGTTGA AATTTACTCC AAAAACCAGA ACTCCCATAC TGACCAGATA 7630 

GGTAATCAGT GAGCTGCCAT ACTGGGCAAT TCCGTCGTTA TAGACGGTAA AGCCTCCAGT 7740 

TCCCGCTGTC CCCATAGCAA TAACAAAACT ATCGTAGACA GGCATACCGG CTAGATAATA 7800 

GATGATGACA AAGAGGGAGA AGAGAGCTAG ATAAAGCACA TAGAGAATCT GGGCACTGTT 7860 

TTTTAGTTTG GATACAACCT TGCCAAAAAC AGCACCTGGA ACCTCAGCCT TCATCACCTC 7 9 20 

TAGGTGGCTA TTTTTGGCAT TCTCCATAAT AGCAAGTGCA AAAACAACCA CTCCCATCCC 7930 

TCCAATCAAC TGGGTAAAAC TTCGCCAGAA GAGGAGGGAA CGCCTGAGAA CCGAAACCTC 9040 

GTTCAAAATA CTTGCTCCAG TACTTCTAAA TCCAGAACTA ATT7CAAAAA AGGCATCAAT 8100 

AACGCTGGCG ATTTCCCCAG AAAAGACAAA GGGCAGACCA CCAAACAAAG ACCAAAGCA7 SI 50 

CCAACAGAGG GCAACGATCA AGACTCCCTC CTTGGCATAA ATCCGTTGAT TTTTTCGCTT 8220 

CTGTAAACTC CCTGAACCGC CTAACAATAC GAGAATCCCT ATCGTCGAAA AGAGGGCTGT 9280 

AAACACTTCC CTCGATTCAC GGTAATACAC AGCAATCGCA ACAGGAACCA AAAGAAGAAC 8340 

AGCTTCAATC AAAAGTAATT TTGAAAGGAG GTAACGAATC ATACTTTTAT TCATTTCTTA 84 CO 

CCTCGCGATC AAGTCATAAA TCTTGGTGAT GTTTGGCAAC AAGGTTGTTA CTAGCAGCTT 8460 

GTCTCCAACT TCCAACATAT CCTCCCCAGT TGGGAAAATA GTCTTGCCCT TTCGAATAAT 8520 

GGCTGCAATA AGAACCCCTT TTTTCAATTT CAGTTGAGAA AGAGGTTTGG CAGTCATTTT 8580 

ATTGGCTTCC TTGATATGGA ATTGCAGGGT TTCGATTTGG CCATTCGCTA GATGGTGCAT 8640 

AGCTTGAAGG TCTGAATACT GGCCATTAAC TCGACCACGA A7AAAGTGCA TAATCGTATC 87 00 
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TACAGCGATG CTTTTACGTG TGATGATACT TGAAAAATCA GGCGCATTGA TAATCTCGAG 3760 

GAGACTGGTA CGATTGACCT TAGTAATATT TTTCTGTACA CCTACCCTCT CAAGGAACAT 3820 

AGATGTAATC AGATTTTCCT CATCGACTCC TGTTAGAGTC GCAACGGCAT CATAGTGTTG 3 880 

ACCACTTTCT TCCAGCAGGA TATCTTTTGC GGTTCCATCT CCTTGAACGA TGTAGAGATT 3940 

TGGGAATTTC TCGCTAAAGA AGCTGGCGAT TTCACGATTG ATTTCAATGA CTTTTGTATC 9000 

GATACGACTA TCTTTGAGAA TACCAAGTAG ATAATAGGCA ATTCTACCTC CCCCAACGAT 9060 

GAGAAGGCTC TTCACGGCGC GTGATTTAAA ATAATTATGG AAGAGTATCA TATCGACACG 9120 

GTTACCAGTG ACAAAGATTC TATCTTTATC CTGTACAGTC ATGTCACCGC TTGGAATGAT 9180 

AATTTGATGA TCCCTCTCTA TCGCACAGAC AATGACATTA CCAAATTTTT TACGAAAATC 92 40 

AGAAATGGGC ATTTGGCAAA GACCCCTGGT GGACTTGACG ACAAATTCCA TCAGGCTAAC 9300 

GCGTCCACCA GCAAAGCGTT CG AC AG AC AG GGCGTTGGGG AAGTCAATGA TATTCGCGAT 93 60 

AGCGCGGGCA GCCAAGAGCT CAGGATTAAC GATAAGAGAA AAACCGAGAA TATTCTTTTC 9420 

CTTGAAATAA GAGTTACAAT ATTCAGGGT7 CCGCACCCGA ACGATAGTTT CTTTAGCTCC 9480 

CATTTTCTTG GCTAGAACTG CTGCAATCAT GTTGACTTCA TCGTGCTCAG TCAGGGCGAT 9540 

AAAGATATCA CAATCTTGGA CGCTGGCTTG CTCAACAATC GCAAAATCGG CCCCGTTACC 9600 

AAGGATACCA ATGATATCAA AGCGACTGAC AATATGATTG AGAACAGCTT CCTCTTGCTC 9660 

AATCAGCAAA ACATCATGCT TTTCTCCAAC CAAGGACCGA CAGAGGGCAA AACCAACTTT 97 20 

TCCCCCTCCC ACAAGGATAA TTTTCATAAT AAAACCTACT TTTTCATCAT GTAACTATCA 97 80 

TACCCTTTTT CAAGAAAAAA TGCACCTACT AGCTAATAAC AAGAGTTTTT AGTCAAAATT 9840 

CGCTATAAGG TAAAACTATA CCCTAACCAA TTGAAATAGC TATTACCCAC TTTCTC7GAA 9900 

ATATGGTATG ATAAAGCATA TACAAGGAGA TAAAATGAAT AATAATTTAC TGGTATTACA 9 960 

ATCACACTTT GGTCTGGTTG ATGCTGCGGT ATCGGCTATG ATTGGAGTGG CTTTACAACA 10020 

C7CTCCAACC TTAAAAATAC ATCACTTGAC GCACGATATC ACCCCTTATA ATATTTTTGA 10080 

GGGGAGCTAT CGTCTCTTTC AGACGGTGGA TTACTGGCCT GAGGGAACGA CGTTTGTATC 10140 

GGTTGTCGAT CCAGGTGTCG GTTCGAAACG TAAGAGTGTA GTTGCCAAGA CTGCAAAAAA 10 2 CO 

TCAATACATT GTCACGCCAG ATAATGGGAC CCTTTCCT7T ATCAAGAAAC ACGTTCGCAT 102 60 

TGTAGCCATT CGTGAGATTT CTGAGGTGGC CAATAGGCGT CAAAACACAG AGCATTCTTA 10 320 

TACCTTCCAC GGTCGTGATG TCTATGCCTA TACTGGTCC7 AAACTGGCCA GTGGTCACAT 10380 

TACTTTTCAG GAAGTAGGGC CAGAGCTCAC TGTGGAACAG ATTGTAGAGC TTCCAGTCGT 10440 
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AGCGACCATC ATAGAAGATC ATCTGGTCAA GGGAGCCATT GATATTCTCG ATGTGCGTTT 10500 

CGGTTCGCTT TGGACCTCTA TCACACGGGA ACAATTTTAC AAGCTGGAAC CAGAATTTGG 10560 

TGATCGTTTT GAAGTGACCA TCTATCATGC TCATATGCTG GTCTATCAAA ATCAGGTTGT 10620 

CTATGGCAAA TCATTTGCAG ATGTGAGAAT TGGGCAACCs ATcTTTACrc TCAGCaTCTt 10680 

CGATTAGCTG GGCAATTCGT TCTAGTTGGA TTTCGTCAAT CAAGGT 10726 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7163 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TTATCTTTAA CGATATCAAT CAAGATCTGG TCAATAAAGC CATTCCGCCT TATCGTGAAG 60 

TTGGCATCCA AGCCCATGCA TATGTCTGTG ACGTGACAGA CGAGGACGGT ATCCAAGCCA 120 

TGGTCAAGCA AATCGAACAA GAGGTTCGTG TCATTGACAT CCTCGTTAAT AACGCTCGTA 180 

TTATCCGCCG AGTTCCAATG TGCGAAATGA GCGCCGCTGA TTTCCGTAAG GTCATCGATA 240 

TTGACTTAAA CGCACCATTT ATCCTTTCAA AGGCAGTTAT TCCTTCTATG ATAAAGAAAG 300 

GGCATGGAAA GATTATCAAT ATTTGTTCGA TGATGAGCGA ACTGGGACGT GAAACAGTTA 3 60 

GCGCTTATGC TGCTGCTAAA GGCGGCTTGA AAATGTTGAC CCGCAACATT GCGTCTGAAT 420 

ACGCTGCAGC CAATATCCAA TGTAACGGAA TTGGACCGGG TTATATTGCC ACTCCTCAAA 480 

CAGCACCTCT TCGTGAATTG CAAGAAGATG GTTCTCGCCA CCCATTTGAC CAGTTCATCA 540 

TTGCAAAAAC ACC7GCTCCA CGTTGGGGAA ATACTGAAGA TTTGATGCCC CC7GC7GTCT 600 

TTCTCGCTAG TGATGCCAGC AATTTTGTCA ATGGCCACAT CCTATATGTA GATGCCGGTA 660 

TCTTAGCCTA CATCGGAAAA CAACCTGAGT AAAAATAGAA AGAAGATCTT ATGAAAATCG 7 20 

CATTAATCAA TGAAAATAGT CAAGCTAGCA AGAATCACAT TATTTACGAT AGTCTAAAAG 780 

AAGCGACAGA TAAAAAAGGC TACCAATTAT TTAACTATGG TATGCGTGCA GAAGAAGGAG 840 

AAAGTCAATT AACTTATGTG CAGAACGGAC TAATGGCTGC CATCCTTTTA AATACAAAGG 900 

CAGTTGACTT TGTTGTTACC GGCTGTGGTA CGGGTGTACC GCCTATGCTT GCTTTAAACA 9 60 

GCTTCCCTGG TGTTGTCTGT GGTCTAGCAG TGCACCCAAC TCACGCTTAC CTTTATTCTC 1020 

AAATCAATGG TGCTAACCCC TTGTCTATCC CTTATGCCAA AGCATTTGCC TGGGGGGCAG 1080 

AACTGACCCT CAAATTGATG TTTGAACGCT TATTTCCTGA AGAAATGGGC GGTCGCTACC 114 0 
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CAAGAGAACG TGTAATCCCT GAACAACGCA ACCCTCGTAT CTTAAACGAG GTGAAACAAA 1200 

TCACCCACAA TGATTTGATG ACCATCCTTA AAATAATCGA CCAAGACTTC CTCAAAGACA 12 60 

CCATCTCTGG CAAATACTTC CAAGAATACT TCTTTGAAAA CTGCCAAGAT GATGAAGTTG 13 20 

CTGCTTATTT GAAAGAAGTA TTAGCCAAGT AAAGCTATTC TAAACCAGAA AGGAACTAAT 13 80 

GGATGACGAA AATATTACTG TTTGGCGAAC CATTAATTCG AATTTCACCA TTAGATGCCA 14 40 

CCAGTATCGG CGATCATCTT GCCAGTTCGA CTTATTTTGG CGGATCAGAA ATTAACATCG 15 00 

CTTGTAATTT GCAAGCCCTG GGTATCTCAA CGAAAGTTTT TACCGCACTC CCTGCCAACG 15 60 

AGATTGGAGA TCGTTTTC7C ACATTCTTGA AACAGCACCA AATCGATACC AGTTCAATCT 16 20 

GTCGGCTTGG CGATCGAATC GGCCTCTACT ATTTGGAGAA CGGCTTTGGT TGTCGTCAAA 16 80 

GTGAAGTTTT CTACGATCGT AAGCATACGA GTATCAGCCA GATTCGGCCA AACATGCTAG 1740 

ATATGGATTC TCTCTTTCAG GGGATTAGCC ATTTTCATTT TAGTGGAATC ACCGTAGCTA 1800 

TCGGTCAAGA GGTCCGTGCG ATCCTTCTCC TACTCTTGGA AGAAGCCAAC CGCCGAGGAA 1860 

TTCTCGTTTC AATCGATCTC AATCTGAGAA CAAAGATGAT TTCAGTCCTA GAAGCCAAGT 19 20 

ATGAATTTTC TAAGTTTGCA CGTTTTACTG ACTATTGCTT CCGTATTCAT CCTCTCATGA 1980 

TTGATGACCA AAATCTAGAG ATGTTTCCAA GAGACAGTGC TAGCCTAGAA GAGGTGGAAA 2040 

ATCGCATGCG ACTTTTAAAA GAAGCCTATG GTTTCAAGGC CATTTTCCAT ACCCTCCGCT 2100 

CTAGTGATGA GCAAGACAAA AATGTCTATC AAGCCTATGC TCTAGAAGAA CTATTTGAAG 2160 

AGTCTGTCCA ACTAAAAACT CCAGTCTATC AACCAATTCC TAGCGGGGAT CCCTTTA7AT 2 2 20 

CTGG7CCCCT TTACCAACTA CTCCATCATT CCTCCCTAAA AACTACCATT CACTTTCCAG 22 80 

TTGCGAGCCC AACTCTCAAA TGCACTCTTC CAGGAGACCA TCTCTCCACT TCCTCAACTA 2 3 40 

GTATTGAAAA TTTACTGGCA AATGCACAAG ATATCATTCG TTAGGAGAAT TACATGACCA 2 400 

AATCAGATAC GATTATTGAA CTAAAAAAAC AAAAAATTGT CCCTGTTATT CGAGGAAATA 2 4 60 

CAAAGGAAGA AGGACTACAA GCCTCGATTG CTTGTATCAA GGGCGGTATC AAAGCTATTG 25 20 

AAATCGCCTA TACCAA7CAG TATGCAGGAC AAATCATCAA GGAACTTGTA GACTTCTATC 2 580 

AGGACGATCA GAGTGTTTGT ATCGGTGCAG GTACTGTGCT TGATGCCGTA ACTGCTAGAG 2 640 

ATGCCATTCT AGCTGGAGCA AATTACGTTG TTTCTCCATC TTTCCATGCT GAAACTGCGA 2700 

AAATGTGCAA TCTCTACAGC ACACCGTACA TTCCAGGCTG TATTACCCTC ACAGAGATCA 27 60 

CGACTGCACT TCAAGCCGGT AGTGAAATCA TCAAACTCTT CCCACGTAGT ACTCTCAGTC 2 820 

CAGCATATAT CTCTGCAGTC AAGGCACCGA TCCCACAAGT TTCCGTAATG CTAACCGCAG 2880 
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GAGTCGGCCT 


AAACAACATC 


CCTCAATGGT 


TCGCTGCTGG 


TGCAGATGCC 


CTTGCAATTG 


2940 


GTGGCGAACT 


CAATAAACTC 


GCTTCCCAAG 


GCAACTTTGA 


CCGCATCAGC 


GAGATTGCCC 


3000 


AACAGTATAT 


TACACTCAGA 


TAAAATCATA 


ACTACCCGTC 


TAACGGGTGG 


TTTATCTCAG 


3060 


AGCTATAAGC 


CCAAATCATC 


AGCCAGCGCC 


TAAAGACGCT 


GGCTTTCACG 


TTGTTCAAGC 


3120 


CTTATTGCTC 


TTGACTCGTC 


ACTTGCCTCT 


TTAAGAGACT 


TTGGTATTAC 


TTACCACTAT 


3180 


CCCTAAAGGG 


ATCCTCATAT 


TCTTTTACAC 


TCAATTTATC 


TAGTGCTATA 


GTAGATTCAA 


3240 


ACTGGAATAG 


TACACCTCTG 


CTTCTAAAAC 


ATTGTTAAAA 


ATCGATTTGA 


CTGTCCTGAT 


3300 


CCATTTTGTC 


CTGTTCTTAT 


TTCATTTTAC 


TAT AT AT CAT 


ACTTTACTCG 


TTCTCAAATT 


3360 


TTCATACTCA 


TGAAGAAATC 


ATCCACTCGA 


TAATTTCTTT 


AATCTTGACT 


ATATTTCTTA 


3420 


ATTGTGGCTT 


CATTAAGCCC 


TACTGGACTT 


ACATAATAAC 


CTTCCTCCCA 


GAAATGCCGA 


3480 


TTCCCAAACT 


TGTACTTGAG 


ATTCGCGTCT 


TTGTCAAACA 


TCATGAGTGC 


ACTTTTGCCT 


3540 


TTTAAATACC 


CCATAAAACT 


TGAAACACTT 


AGCCTCGACG 


GAATACTGAC 


TAACATGTGT 


3600 


ACATGGTCTC 


GCATTAAGTG 


ACCCTCGATC 


ATTTCAACAC 


CTTTATAACT 


ACACAAGCGA 


3660 


TGAAATATT7 


CGTCTAAACT 


ACT? CT AT AT 


TG ATT AT AG A 


TGACTTTTCG 


TCTATACTTA 


3720 


GGGGTGAACA 


CAATATGATA 


GAACACCTCC 


ACTTTGTGTA 


TGATAAACTA 


TGAGTCTTTT 


3780 


GTGCCATATT 


TTTTCTCCTT 


TCGCTTTACA 


ATTGGATTGA 


ACACCTTTAT 


TGTATCCCGT 


3840 


TTGGAGTTTT 


TTTGGTATAA 


CCTTCGACGC 


GCACCCGTAT 


AGCGGCTGGT 


TGTTTTGTCT 


3900 


CGCACCTCAC 


GGAGCGAGAC 


GGACTAATAT 


AGTGGAGTGA 


AATAGGATAC 


GAACAAATTG 


3960 


ATTAGGAAAA 


TCAAATGAAT 


TTATAGAAAT 


CTTTTAGCAG 


TTATAACGTT 


CTATTCTAGT 


4020 


TTCAAAACGC 


TATAG7CACA 


TAATAATGAA 


GTAAAAAAGG 


ATAAGTATCA 


ACTTATCCTT 


4080 


77TTAAAACA 


AAAA7CC3AA 


GATAT7TGCC 


C77CTTCCCA 


T7T77TC7A7 


777" AC ACT 


4 14r, 


TTCATGTAAT 


TCATCTAGAT 


GATGAACAAA 


TTAGTTGTTC 


TTTCCTCTAC 


GGAATAGATA 


42CC 


AAATCCCCCA 


AGTAGCAAGA 


ACCCTAGACT 


TGCCAAGATT 


CACTGACCTT 


CTCCTGTCTG 


4260 


AGGGAGATTC 


TTTTGATCCG 


AATGG7TCTT 


TTCCTCTTCA 


GATTTTTCCT 


TTTCTTTTGA 


4320 


ATTCTGTACT 


TGTGGCTGAG 


CTGCTTCCTC 


TAGCTTTTTA 


AAGACTTCCT 


GATC7GGAGC 


4380 


TGATTCCTGG 


GTTTCAGGAT 


TATACTAGGC 


AATCTTATAT 


TCATCCCCTT 


CTTT7CGAAT 


4440 


GGTATAGACT 


CCACGTTTCA 


AAACTTCCAA 


TTGGTTGGAA 


ATAGTAGAGA 


CAGAATCATC 


4500 


ATATTTCACA 


ATGCCCCAAA 


CTCCTTGTTT 


AGCATCATAA 


ACAGACTGAA 


GGGTTTCGTT 


4560 


ATTTTCGATG 


AGGCTACTTT 


CTAACTCTTT 


TATCATTTGA 


TTGAAGGTGC 


CACGATCCAC 


4620 


GTTAGGAATG 


AG CAT AT AG C 


CATAAGAATC 


TCTATTTTGC 


TTATGAGCCT 


GACTAATCGT 


4680 
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AACAAATTCA TTTTCAACT7 CCTTGTCTGA CTOTCCTTCA T7CATA7CC7 TCCAGGCTCC 47 40 

CTTTTCCAAA GCCTTACTCA TACTGATTGA ACTCTTCTTA AACAAAAAGT AACCAATATT 4 800 

CTTTTTCCAA TCGAACGATT CTAAAAAGAC AC7T7GCGTT TCAGGATAAT CCTTTTCTTG 4 860 

TTCTGTAAGC GAGGCTTCTT TATCATTGAC ATAGACTTTA TATGGATTAC CTCATTCCAG 4 92 0 

TTTTCTCTGG TCAATTGTAG TTGCAGCAGT ATCTGTTGAA GTGTTTTGGA TATTGCTTCC 4980 
TAAAAAGGCG ATCTTATCCT TTAGCATAAA CCACCTCTTA TGAGCAGTCA ATGTTTGATT 5040 
CCAGTTCGTG AAA7CCA7GG TTGCTGTCGC ATTGGCATCA TCTAGTTTGC TCGTTCCAAC 5100 
GAAAGCAGAC GGTAAAACTT TACCTGTATC GCTATCCGCT C7CT7ACCAT CCC7C7C7C7 5160 
TGTACCAGGC ATCTTATATG GATTAACTGT TGGCCAGTAG CCATCCCTAT AGTGACTCAA 5 2 20 

ATCGCCATTG TAAAGATAGA ACATCCCATC ACTCGTATAC CAACCACGTT TATTTTCCTT 52 80 

G77CA7C7C7 TCGTAATTCA AGGTACGACT GGAAAAGAGT GACAACCCAA ATCCAAACCC 5 3 40 

TTTCTCTGCA TTGTACATGC CTGTTTTATC CATCTTGTTA AACGCACATA GGTAACTTCG 54 00 

TCTTGGAACA CTTGCGACTC CTGCATCACT TAACAACGAT TGCATCAAAC TGATATCCTT 5460 

ATAAGTCTTC AAATTCTTAA AGACATCATA ATAACTATCC GATTGAACAA TCGTCTTCAC 5 520 

AACACTCTGC AAACATTGTT TGGTTTCTCC TTCAGACATA TCCGCTATTC GGTGAATCCC 5580 

TCTTAGTACT TCTACTGCGG CCACGTGCCC CTCGCTATTT GCACCACTGA TCGAGCGTCC 5640 

ACCACTCATA TCCATCAACT CTCCATTCAC CAGCAAAGGA GCAAACGATT TATCAATCCA 5700 

CTGGTACATC G7T7CCATTT 7ATC777A7C GA7TGCA77C T7GG7CTT77 GAA7GAC7GG 57 50 

CAACACTTCA GACAGGCCA7 CAA7CAAAAC A77CCCA7AA GCACCCG7A7 AGGCAACAT7 5820 

GG7G7GG7CG A7A7AGGATC CATC7TGA7A AAAACCT7CA CCTTGC7C7A CCAAC77GAA 5 980 

CACT7GCTCA ATCGAGCGAA TGG7ACAAGA AAT77C7TCA TCA7CCT7AC GCAG7AAACC 5940 

AGCTAT7AC7 T77ACCC77C CCATA7CAAC TAACT77CCA CC7AGACCC7 7GAATGGGTT 6000 

ATCAG7CG7C 7TTCGGAAA7 GTTCGGGA7C 7GGTACAAA7 T777CAA7CA CA7CTGTATA 6060 

TT7777AA77 7CC7CATCAG AGAAC7A77C TT7CA7CAGA GACAAGGTA7 TGT7CA7GCC 5120 

ACGAGG7G7A CCGA7TTCA7 AA7CCCACCA G77CCCAACA A7GC7CT7T7 CAC7AT7G7A 6130 

GACATG777A 7GCA7CCAT7 CCA7GGAA7C CC7GAC7G77 CGAACGACAG T77CA7C7TG 624 0 

A7AA7AACGA GAAGAAGCA7 TCG7CAC7TC C77GGCCA7C 7CC7CCAATT 7CCGA7AAGT 6300 

GGCAGTCAGA 77TGCAGACG 7777ATAA77 7GAAAA7777 7CCCACAAAT AGG7GCGG7C 63 60 

CGCC7GACT7 GAAA7AC7GG ATAGGC7A7C AGC7ACC777 CC77CCAA77 CC7GG777AA 64 20 
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TTTGGCCATC TGTTCATTTT TAGAATCATA GTATTGATTC CCAGCGATGA TGCCATTCCA 64 80 

GTCATCCAAA CGGTCTCTG7 ATGCATCCTT AACAGAGGCC AGAATCTTCA AAGGAATCTT 6540 

TTTCACTTCC TTGCCATCTT TACTGACAAT GACATTGGTT GTCCCTTCCT TAAGAGGTTC 6600 

TAAAATTCCA TTTTTGACTG AAGCAACGTC AGGATTTTCT AC CTT ATAAG TATAGTCCGC 6660 

AAGAGAAAAA ACATGTTTTT TTCCAATTGG TAAATCAATC TTTTCCTCAA GCTGTTTATC 67 20 

TGTTTGAGAA TCCTCAGAAA GCTGGTCTGC TACCTCTACC AGCTCAATAT CCTTAAAGGA 6780 

AACAGTCCCA GTTCCTGTTT CATAGAATAA CTCCAGCTTG AT7TTATCAA CATCTAAAGT 684 3 

CGGGCTATAG TCTGCTTCAA TGGTCTGCCA CTCCTTTGTT CCTGACGTCG TTGCAGAATT 6900 

CCACAATCGC TTGTCCTTAC CACTTTCCTC AATGATACGA ACTTTGGCAA TCCCGATTTT 6960 

ATTATCTGTT TTAATCTTGA AACGCAGTTT ATACTTTTTC TTAGCTTCAA TAGGAACCAT 7020 

ACGGTGAAGC GCTGCCCTTA ATTTCTCATG GCTTGAGATA GTGATAGCCC CATCCTTAGC 7080 

CTCAATGACT CGAGTTGAGG CATCTGCACT ATTCTTCTGG TCTACCCAAG CTGACCACCC 7140 

CCTGAGCTTT CCTTCCTCTC CCG 716 3 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9244 base pairs 
< B I TYPE: nucleic acid 
(C) STRANDEDNESS: double 
{ D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

CGTTATAACA TACATGTAAG CGGTACCCAA AATGGTGCCA ACTCAAAATT TTTAAGGACG 60 

AAAATACATG TCTTCACATC CAATTCACGT CTTCTCAGAA ATTGGGAAAC TGAAAAAAGT 120 

TATGTTGCAC CGTCCAGGCA AGGAGTTAGA AAACTTGTTG CCGGACTATC TTGAAAGGCT 180 

TCTTTTTGAT CATATTCCTT TCTTCGAACA TGCTCAAAAA GAACATGATG CATTTGCCCA 2 40 

ACCTCTTCCC GATGAAGGAA TTGAGGTTCT CTACCTAGAA CAACTCGCTG CTGAATCATT 300 

GACCTCTCCA GAAATCCGCG ATCAATTTAT CGAGGAATAC TTAGACGAAG CCAACATCCG 3 60 

TGATCGTCAA ACCAAGGTTG CTATTCGTGA ATTGCTTCAC GGCATCAAGG ACAACCAAGA 4 20 

ATTGGTTGAA AAAACAATGC CTGGGATTCA AAAAGTTGAA TTCCCAGAAA TTCCTGACGA 480 

AGCTAAAGAT CTAACTGACT 7ACTTGAATC AGAGTATCCA TTTGCAATTG ACCCGATGCC 540 

AAACCTCTAT TTCACTCGCG ACCCATTTGC AACAATTGGA AACGCCGTAT CGCTTAACCA 600 

CATGTTTGCA GACACTCGTA ACCGTGAAAC ACTCTACGGT AACTATATCT TCAAATACCA 66C 
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CCCAATCTAT GGCGGAAAAG TGGATTTGGT CTACAACCGT GAAGAAGATA CGCGTATCCA 720 

AGGTGGAGAC GAGTTAGTTC TTTCTAAAGA CCTCCTTGCA GTAGGTATCT CTCAACGTAC 780 

AGACGCAGCT TCTATCGAAA AACTTTTGGT CAACATCTTC AAGAAAAATG TTGGCTTCAA 840 

CAAAGTTT7G GCCTTTGAAT TTGCTAACAA CCGTAAATTC ATGCACTTGG ATACTGTCTT 900 

CACTATGGTA GACTATGACA AGTTCACTAT TCACCCAGAA ATCGAAGGCG ACCTTCACGT 960 

T7ACTCACTT ACTTACGAAA ACGAAAAACT TAAAATCGTT GAAGAGAAAG GTGACTTAGC 1020 

TGAACTTCTT GCTCAAAACC TTGGTGTAGA AAAAGTTCAT TTGATTCGTT GCGGTGGTGG L080 

CAATATCGTA GCAGCTGCGC GTGAACAATG GAACGACGGT TCTAACACTT TGACCATCGC 1140 

ACCTGGTCTG GTAGTTGTTT ATGACCGCAA TACCGTGACC AATAAGATTT TGGAACAATA 1200 

CGGGCTTCGC TTGATTAAGA TTCCCGGAAG TGAATTCGTT CGGGGCCGTG GTGCACCTCG 12 60 

TTGTATGTCT ATGCCATTTG AACGTGAAGA AGTGTAATCG CTGTTCGATA TTCGTCAATA 13 20 

GAAAATGTAA AAAATAGAAA GAGGAAATAA TAAAATGACA AATTCAGTAT TCCAACGACG 1380 

CAGCTTCTTA GCAGAAAAAG ACTTTACCCG TGCAGAGTTA GAATACC7TA TTGGTCTTTC 144 0 

AGCTCACTTG AAAGATTTGA AAAAACGCAA TATTCAACAC CACTACCTTG CTGGCAAGAA 1500 

TATCGCTCTC CTATTTGAAA AAACATCTAC TCGTACTCGT GCAGCCTTTA CAACTGCGGC 1560 

TATCGACCTT GGTGCTCACC CAGAATACCT CGGAGCAAAT GATATTCAGT TGGGTAAAAA 1620 

AGAATCTACT GAAGATACTG CTAAAGTATT GGGACGTATG TTTGACGGCA TTCAATTCCC 1680 

CGGATTCAGC CAACGTATGC TTCAACAATT GGCAGAATTC TCAGGCGTTC CAGTATGGAA 174 0 

CGGTCTAACT GACGAATGGC ACCCAACTCA AATGCTCGCT GACTACTTGA CTCTTCAAGA 1800 

AAACTTCGGT CCCTTCGAAG GCTTGACATT GCTATACTCT CGTGATGGAC GTAACAACGT 1860 

TCCCAACAGC TTCCTCGTAA CAGGTGCTAT CCTTGGTGTC AATGTTCACA TCTTCTCACC 1920 

AAAAGAACTC TTCCCAGAAA AAGAAATCGT TCAATTCGCA GAAGGATTTG CTAAAGAAAG 1980 

TGGCGCACAT GTTCTCATCA CTGAAGATGC TGATGAAGCA GTTAAAGATG CAGACCTTCT 2040 

TTACACAGAC GTTTGGGTAT CAATGGGTGA AGAAGACAAA TTCGCAGAAC CTGTAGCTCT 2100 

TCTTAAACCT TACCAAGTCA ATATGGACTT AGTTAAAAAA GCAGGCAATG AAAACTTGAT 2160 

CTTCCTACAC TGCTTGCCAG CATTCCACGA TACTCACACT GTTTATGGTA AAGACGTTGC 2220 

TCAAAAATTT GGTGTAGAAG AAATGGAAGT AACAGACGAA GTCTTCCGCA GCAAGTACGC 2 280 

TCGCCACTTC GATCAAGCAG AAAACCGTAT GCACACTATC AAAGCTGTTA TGGCTGCTAC 2 34 0 

ACTTGGTAAC CTTTATATTC CTAAAGTATA ATTTTAGATA ATAAACCGTC TACCAACAGC 2400 
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TATGAGGCCT GCGACTAATA GCTTTAGTCC GG TCCTCTTT TATGTAATGG TAATCTATTA 2460 

TTTCTTA7AA AATATGTGAA AAATCATTAA ATTGAAATCT AAACGCATTC TATTGAGTGT 2 520 

GATAAAGGAG AATTTATGGC AAATCGTAAA ATTG7ACTAC CTTTGGGAGG AAATGCGATT 2530 

CTTTCTTCTG ACCCATCAGC AAAGGCTCAA CAAGAAGCTT TAGTTGAAAC AGCTAACCAT 2 640 

CTTGTAAAAT TGATTAAAAA TGGAGATGAT CTGATTATCA CTCACGGTAA TGGACCTCAA 2700 

GTTGGGAATC TCTTGCTCCA ACATTTGGCA TCAGACTCTG AAAAGAACCC TGCCTTCCCA 2760 

CTCGACTCAC TTGTCGCTAT GACAGAAGGT AGCATCCCTT TCTCCTTCAA AAATGCTTTG 2820 

CAAAATGCTC TCTTGGATGA AGGCATCGAA AAAAATGTTG CCTCTGTTGT AACGCAAGTT 2880 

GTCGTAGATA AAAATGATCC AGCTTTTGTT AACTTCAGTA AACCAATCGG TCCTTTCTAT 2940 

TCAGAAGAAG AAGCAAAAGC AGAAGCCGAA AAAAGCGGAG CGACTTTCAA GGAAGATGCT 3 000 

GCCCGTGGCT GGCGTAAGGT CGTTGCCTCA CCAAAACCTG TTGACATCAA AGAAATTGAA 3060 

ACCATCCGTA CTCTTTTAAA TAATGGTCAA GTCGTCGTAG CTGCACGTGG TGGCGGTATT 3120 

CCCGTCGTCA AAGAAAACAA TCGACATTTG ACTGGTGTCG AAGCCCTTAT TCA7AAAGAC 3180 

TTCGCTTCCC AACGTTTGGC ACAATTCGTT GATGCAGACC TCTTCATCGT TTTGACAGGT 3 2 40 

GTACATTATC TATTTGT7AA CTACAACAAG CCAAACCAGG AAAAATTGGA ACATGTCAAT 3 3 00 

GTTGCCCAGC TGGAAGAATA TATCAAACAA GATCAGTTTG CACCAGGTAG CATGCTTCCA 3 3 60 

AAACTAGAAG CAGCTATCGC TTTTGTCAAT GGTCCTCCAG AAGCAAAAGC AGTTATTACT 34 20 

TCCCTTCAAA ATCTAGGCGC CTTGATTGAA TCTCAAACCG GAACAATTAT TGAAAAAGCA 3 4 80 

TAAGTTGTTT TACTAATAAG ATGTATTCTA TTTCTAGTAT CTTTATATCA AATTAGAAAT 3 54 0 

TATTCTTGAA AACATGTACA ATATTTCAAA AG AT ACT ACT TTTACACTTT AATATCGTAA 3600 

AAC AAA TATA AATAGAAACC GTTTTCTTGA ATGTTTATTT AAGAAAGTAG 77CG7TTT7T 3 660 

ACACTTTCTT AGACATCAGG AGGAAAAACA AATGAGTGAA AAAGCTAAAA AAGGGTTTAA 3 720 

GATGCCTTCA TCTTACACCG TATTATTGAT AATCATTGCT ATTATGGCAG TGCTAACTTG 3780 

CTTTATCCCT GCGGGGCCCT TTATAGAAGG TATTTACGAC ACTCACCCTC AAAATCCACA 384 0 

AGGGATTTGG GATGTCCTCA TGGCACCGAT TCGGGCTATG CTAGGTACTC ATCCAGAGGA 39Q0 

AGGTTCGCTC ATTAAAGAAA CGAGCGCAGC GATTGATGTA GCCTTCTTCA TCCTT ATGGT 3960 

TCCTCCTTTC CTTGGCATTG TCAACAAAAC TGGTGCTCTT GACGTAGGGA TTGCCTCTAT 4 020 

CGTGAAGAAG TATAAGCGCC GCGAAAAAAT GTTAATTTTG GTACTGATGC CTTTGTTTCC 4080 

CCTCGGTGGT ACAACTTATG GTATGGCTGA AGAAACAATG GCCTTCTATC CACTCCTTGT 4140 

GCCAGTTATG ATGGCCGTTG GTTTTGATAG CCTGACTGGT GTTGCAATTA TTTTGCTCGG 4200 
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TTCTCAAATC 


GGCTGTTTGG 


G A i G * A»- . G I 




nrvarAGGTA 


TTGCTTCAGC 


4260 


GACTGCGGGA 


GTTGGTACAG 


GGG ACGGT A T 


GG 1 At* L I V_L» 1 


^» LVjA 1 \- 1 * ^- * 


GGGTTACCTT 


4320 


GACTGCTCTT 


AGTACTTGGT 


TTGTTTACCG 


I [ A I VjV.u\jA I 


AariTTPAAA 
nAun 1 1 ^./w^ 


AAGATCCGAC 


4380 


TAAGTCACTG 


GTTTATAGTA 


CTCGCAAAGA 


AGA1 * 1 Li AAA 


r , ArT* T **PAArG 

Ln^, 1*1 A/\^-\J 


TAGAAGAATC 


4440 


TTCATCTCTA 


GAATCTACAC 


TTAGCAGCAA 


A P* A A A A A Tf* A 

ALAAAAA I V- A 




TATT^GTGTT 


4500 


GACATTCATC 


TTGATGGTAT 


TGAGCTTCAT 


TCCATGGACA 


GAGG 1 1GG^-U 




4560 


TGATGACTTT 


AATACTTGGT 


TGACTGGTCT 


TCCAG i I A I 1 


GG i AA 1 A 1 l\J 




4620 


TACTTCTGCA 


CTAGGTACTT 


GGTACTTCCC 


AGAAGGGGGA 


A 1 Vj*. 1 \_ 1 » » U 




4680 


TATCCTGATT 


GGTGTTATTT 


ATGGTCTTAA 


AG AAG A r AAG 


Al * 




4 740 


TCGTGCTGCT 


GACTTGCTCA 


GTGTTGCCTT 


GATGs* rAGGG 


All Vj<_ i l 


GTATTCAAGT 


4300 


TATCATGAAC 


GACGGTATGA 


TTACCGATAC 


AATCGTGAAG 


1 GGGG I Wv^U 




4860 


CGGTCTATCT 


TCACAAGTCT 


TTATCGTTGT 


AACTTATATC 


TTCTATCTAC 


fTATHTCATT 


4920 


CTTGATCCCA 


TCTTCATCTG 


GTCTTGCCAG 


CGCAAG i A I G 


c f? a tc* a Tern 

Vjkj I A 1 wA 1 wVj 


C TCCACTTGG 


4980 


AGAATTTGTA 


AATGTCCCTC 


CTAGCTTGAT 


TA * GACTGGT 


1 ACGAA 1 \- L L» 


rTTCAGGTGT 


5040 


CTTGAACTTG 


ATTGCACCAA 


CATCTGGTAT 


TGTGATGGGA 


GG 1 G I I GGAC 




5100 


CAACATTGGT 


ACTTGGTGGA 


AATTCATGGG 


CAAACTCGTA 




TTCTAGTGAC 


5160 


CATCGCCCTT 


CTTCTCCTTG 


GAACCTTCCT 


TGGA rTGG 1 A 


T A A lATir.TT. 


AGTGAGGTGA 


5220 


TTCCATGAAA 


A TAG AT AT AA 


CAAATCAAGT 


T AAAG AT G AA 


TTTr*T m A T A 'T* 
1 I i \_ 1 .ftln* 


PATTAAAAAC 


5280 


CTTGATTTCC 


TATCCTTCAG 


TACTCAATGA 


AGGAGAAAA i 




T^GCACAAGC 


5340 


AATCCAAGAT 


GTCCTAGAAA 


AAACTTTAGA 


GAIT i G i GGA 


r* a a t a nriTT 


TCACTACCTA 


5400 


TCTTGACCCT 


AAACGTTATT 


ACGGATATGC 


AGAAA L GGG I 




AGCTTCTGGC 


5460 


CATTCTCTGT 


CATTTGGATG 


TTGTTCCATC 


AGCTGATCAA 


GCAGATTGGC 


AGACACCGCC 


5 52 0 


ATTTGAAGCA 


ACTATCAAAG 


ACGGCTGGGT 


ATTCGGACGT 


GGTGTCCAAG 


ATGATAAAGG 


5580 


CCCTTCGCTC 


GCAGCTCTCT 


ATGCACTAAA 


AAGCTTGCTG 


GACCAAGGTA 


TTCAGTTCAA 


5640 


AAAGCGCGTA 


CGCTTTATCT 


TTGGTACCGA 


TGAGGAAACC 


CTCTGGCGCT 


GCATGGCACG 


570C 


CTACAATACC 


ATCGAAGAAC 


AGGCCAGTAT 


GGGCTTTGCA 


CCTGACTCAT 


CTTTTCCTCT 


5760 


GACCTATCCT 


GAAAAAGGGC 


TTCTACAGGT 


CAAACTTCAT 


GGCCCTGGAT 


CGGATCAACT 


5820 


AGAGCTTGAA 


GTAGGAGGCG 


CCTTTAACGT 


TGTACCAGAC 


AAGGCCAACT 


ACCAAGGTCT 


5830 


CCTCTATGAA 


CAGGTTTGTA 


ACGGTCTCAA 


AGAAGCTGGT 


TATGATTACC 


AAACCACTGA 


5940 
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ACAAACCGTA ACGCTTCTCG GAGTGCCAAA GCATCCTAAC GATCCTAGTC AAGGTATCAA 6000 

TGCTGTCATC CGACTAGCTA CCATTCTTGC TCCTCTCCAA GAACACCCTG CTCTCAGTTT 6060 

TCTTGCAACA CAAGCAGGTC AAGACGGCAC AGGAAGACAA ATCTTTGGTG AT AT AG C AG A 6L20 

TGAACCTTCT GGTCACCTAT CCTTTAATGT CGCAGGTCTC ATGATCAATC ATGAACGTTC 6180 

TGAAATCCGT ATTGACATTC GGACTCCTGT CTTAGCTGAC ASGCAAGAAC TAGTAGAGTT 624 0 

GCTTACAAGA TGTGCACAAA ACTACCAACT CCGCTACGAA GACTTTGACT ATCTAGCGCC 6300 

TCTATACGTC GCAGAAGACA GTAAACTCGT TAGCACACTG ATCCAAATCT ACCAAGAAAA 6 360 

CACTGGCGAT AACAGTCCTG CTATTTCATC CGGTGGTGCC ACTTTTGCTC GCACCATGCC 64 20 

AAATTC7GTA GCCTTCCGCG CCTTATTCCC AGGAGCGAAG CAGACAGAAC ATCAGGCAAA 64 80 

TCAATGTGCC GTTCTAGAAG ATTTGTACCG TCCTATCGAT ATTTATGCCG AAGCCGTCTA 6540 

TCGACTTGCA ACTTAATCAG GCAACTGTTT CTACCAAAAA AAATCGACCG ATTAATGAAC 6600 

TGCACCCCAA AACTTACACA GAATAAATCT AACTTTTGGG CTCTTTTATT ATGAAATTGA 6660 

GTTATGAAGA TAAAGTTCAG ATCTATGAAG TAAGAAAGCA AGGACAAAGC TTCAAACAGC 6720 

TTTCAAAAAG ATTTGGTGTG GATGTTTCTG GTCTAAAGTC ATCTCAATCT TTGAGATGAG 67 80 

CTTTATAAAT CGCTTTTTTC ACTTTTTGCA CTCCTCTTTC GATAAACTCA AACTTTTTAG 6840 

CCGTGGTATT GCCTGATTTT ATAGTATATT GAAACTAGAA TAGTACACCT CTCCTTCTAA 6900 

AACATTTTTA GAAATCGATT TGACTGTCCT GATCGATTTC TCCTGTTCTT ATTTCATTTT 6960 

ACTATATTTG ACCCACTTCG TCTTTAACGC CTTTATTCAT AAGCTCTTGT AATTTTTCTT 7 020 

TACTATCAAT TACTTCTGAT TTTCCGTTGT AATTTATTGT AATAGGTTTT AACTTACCTA 7 080 

ATTTCTCGAC ACGCTCATTA ATTTGATCTT TTTTGAAGGC TGCTTATGTT TTTCCTAAGA 7140 

TTTTTTCAAA AATATArTTA TCACATAGCG GTTTCTCTTC TTCTTCAGCT TGGTTTTTGT 72 00 

ATTAATTTCA AACATAAGGA ACAAATCCTT CATAGTAACC TAATGCTCCC AT AAGTTC AA 7 2 60 

AAGCTTGTTT TCTAATTCAA ACCATTGCAA CTCAGATTTC AGCTTTTCAG ATAAATCCTG 7 3 20 

CTCATCCAAA TAATGACTTG AAATTAGTGC TGAACTCGTT TCTGTA7CCT GTACAGGCTG 7 380 

AGCACCCATA CCAGCAAAAA ATAAACTCGT TCCTAGCAAG ACCGAACAAG CTCCTATTGC 7440 

ATATGGCCTC AAAGAAAAAC GCTGCTTTCT CTCAAATTGA AATTCTTTCA TCCCATCTCC 7500 

CATCATTCAT TATTACTGTA TATTTTGTAT ATCAGAAATA GTTTGTATTC ACAAATCTTT 7560 

CTAGTTATTC CCTTATCATT CCTAATTAAC GGAGATAACA TACAATAATT TTTAGTTAAA 7 620 

TGTATATCGA TGTTTTTTGT TTTTCTTAAT AAACGCAATA C AAAAAG AG C CTG7TACCAA 7 680 

GCTCTTTGTA CTCAATGAAA ATCAAAGAGC AAATTAGGAA ACTAGCCACA GGTTGCTCAA 7740 
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AACACCGTTT 


TGAGGTTGCA 


GATAGAACTG 


ACGAAgTCAG 


CTCAAAACAC 


TGTTTTGAGG 


7800 


TTGCAGATAC 


AACTGACGAA 


GTCACTAACA 


TCTATACGGC 


AAGGCGACGC 


TGACGTGGTT 


7860 


TGAAGAGATT 


TTCGAAGAGT 


ATTAGTCTAT 


TATTTCTTCT 


CAGCCCGAAG 


GGCTGACAAG 


7920 


ATTTGTGTTC 


GGATATCATC 


CACACCATTT 


GGAGTATTTG 


GTAAAAAGAT 


ACTTTGATTT 


7980 


CCTTTAGAGG 


CAAAGGTATT 


CAAGGTATCC 


AAATACTGGT 


TGGTCAAGAG 


GATAGACATG 


8040 


ATTTGTTCTT 


CTGTCATGCC 


AACATTGGCT 


TCCTTGAGTT 


CGGTGATAGA 


CTCTGCCAAT 


8100 


CCATCCACAA 


TCGCCTTACC 


TTGTTGGGCA 


ATCCCCACAC 


CATGAAGGCG 


GTCTTTTTCT 


3 1 60 


GCTTCTGCTT 


CAGCTGCAGT 


GACAATTTTA 


ATCTTGTCAG 


CTTCCGCCAA 


TTCTTGTGCT 


3220 


GCGACCCGCT 


TACGTTGCGC 


CGCATTGATT 


TCATTCATGG 


ATTGCTTAAC 


TTCTGCATCT 


8280 


GGTTCGACCT 


TGGTAATCAA 


GGTTTTCACG 


ATAATGTAGC 


CGTAAGTGGT 


CATTTCTTCT 


8340 


GCTACTTGGT 


GTTGAACTTC 


AAGGGCAATC 


TCATCTTTTT 


TCTCAAACAA 


TTCATCCAAG 


8400 


GTTAATTTTC 


GAACAGAAGA 


GCGAAGAGCA 


TCTTCGATAT 


AAGATTTAAT 


CTGACATTCT 


8460 


CGACCTA7CA 


GTTTATAGTA 


AGCATCTGTC 


ACGCTCTGCT 


CGTTGACACG 


GTACTCAGTC 


8520 


GCTACATTCA 


TCATAACGAA 


CACATTGTCC 


TTCCTCTTAC 


7CTCAACCAC 


AATATCACTT 


8580 


TGCAACAAGC 


GCAACTCAAT 


CCGTCCTGCA 


ATCGACTCAA 


TCCCAAAAGG 


CAAGCGAATA 


8640 


TGAATACCCC 


TATTACCAAC 


CTTTTGGTAT 


TTCCCAAAGC 


GTTCAATAAT 


CGCCACCGAC 


8700 


TGCTGACGAA 


CCACATAAAC 


TCTACTCAGT 


GTGACTATCA 


CCAATAGGAG 


CACACAAACA 


8760 


AT C AG AAAAA 


TCATGAAAAA 


TATTGCCATA 


ATGGAACCTC 


CACAAGTArr 


TTTCTAG7AT 


8820 


TATAGCACAT 


TTAAAGAAGG 


CTGTGCCGTT 


TTTACTCCGA 


TTTTTCCTGA 


AATGTCAATA 


8880 


ATTAGAGCTG 


AATTGTCCTA 


TTGTCGTCCA 


ATCTCTTCCT 


AAAATAACTC 


TTTATAAAAG 


3940 


GCAATCGTTT 


CTTCTAAGGT 


TGGCATAAAT 


GGATTTCCTG 


GTGCGCACGC 


ATCAATCAAG 


9000 


GCATTCTTAG 


AAAGGTATTC 


AAAGTCGAAA 




CAATACCAAG 


TTCAGTCAGT 


9060 


TTCTTAGGAA 


TACCTACTGT 


CTCAGAAAGC 


TTCTCAATCT 


CACCAATCCC 


ATAATCGGCA 


9120 


CATTCTTGAT 


CTGATTTACC 


TTCTACATGA 


AGTCCCAAGG 


CTTTGGCAAC 


ATTGCGGAAA 


9180 


GCTTCTGGTA 


CACGTTTAGC 


ATTTTCACGT 


TCTATAACTC 


GTAGCAACAT 


GGCACAGCAC 


92-10 


ACGG 












9244 



(2) INFORMATION FOR SEQ ID NO: 69: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8898 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

CATCTGAACT TTATCATCAT AACTTAATTT CATAATAAAA ACACCCCAAA AGTTAGATTT 60 

TTTCTGTCTA ACTTTTGGGG TGTAGTTCAG TCATTGGACT GACGTTTTTT TGTATGCTTA 12 0 

TTTTGATTTG ATGTAGTTGA TACCATCTGC TTTTGGTGCG ACTGCTTTTC CAAAGAAGGC 130 

TCC7AAGACA AGAATTCTCA AAACATAAGG TGCAATTTGA AGATAAACCG CTGGCACTCC 240 

TTGTAGGAAC GGCAATTGAG AACCGATAAC AGCCAAACTT TGTGAAAGTC CAAAGAAGAG 300 

A CT AG AAAGC ATAGCACCGA TTGGATTCCA TTTCCCAAAG ATCATCCCAG CAAGGGCGAT 3 60 

AAA7CCAGGT CCAACAATAG TTCTCACTGA GAAGTTAACT CAGATTGAT? GCGCATAAAT 420 

CGCTCCGCCA ATTCCACCTA GAAAACCTGA AATAATAACC CCTAAATATC TCATCTTGTA 4 80 

GACGTTGATT CCCAAGGTAT CCGCTGCTTG AGGATGTTCA CCCACAGAGC GGAGACGAAG S40 

ACCAAATTGA GTCTTAAAGA GAATAAACCA AGCAAGGAAT GAGAAGGCAA TCCCCAGATA 600 

ACCAACTAGA CTAGTTGACT TGAAGAAGAT ATCACCAATC ACTGGGATAT TTCCCAAGAC 660 

TGGGAAATCA AAGCGTCCAA AAGTTTGACT TACGTTCTCC CTTTCTCCTT TGTTATAAAG 720 

AACTTTAACT AAGAAAACAG CCAAGGCAGG CGCCATCAAG TTCAATACCG TACCGCTGAC 780 

AACATGGTCT CCACGGAAA7 GAACCGTCGC TGCTGCGTGG ATGATAGAGA AAACACTACC 840 

AACCAATCCT GCTACAAGCA AGGATAGCCA TGGAGTTGCT GCTCCAAATT GTTCTGCAAA 900 

TTCAACGTTA AAGACAACTC CAGAAAAGGC ACCCATAACC ATAATTCCTT CAAGGCCAAC 960 

GTTTACCACA CCACCACCTT CAGAGAAAAC ACCACCGATA CTTGTAAAGA TGAGAGGTGC 1020 

TCAGTAAATC AGCATAGAAG ACACCAAGAG GGGGAGCAAG GTTATAATAC ACATCTTTAC 1080 

TTACCTCCTT TAACTTGTTT TTTCGGTTTG ACAAAGCGTT CGATAAGGTA ATGAACACTG 1140 

ACAAAGAAGA TAATAGACGC TGTTACAATG CTGACAAGCT CAGATGGTAC CTGCGCCGCA 1200 

TTCATACCAG GAGCCCCAAC TTGGAGAACG CCAAATAGGA AGGCTGCAAA G ACT AT AC C A L260 

ATTCGTGAGT TGGCCGCAAG CAAACTAACC GCCATTCCG? TAAATCCGAT AGCTAATGAC 13 20 

GAACCTTCAA CATAGACGTT CTGGAAGGTT CCCAAACCTT CAACAGCTCC ACCAAGACCT 13 80 

GCCAAGGCAC CTGAAATAAT CATAGATAGG ATAATAGTCC GCTTGGCAGA AATACCAGCA 14 40 

TATTCTGAAG CATGTGGATT AAGACCAACT GCACGGATTT CAAAACCAAG AGTTGTTTTC 1500 

TTGAGCATGA ACCAAATAAC TCCAACGCCA ATGATGGCAA AGAAAATACC AATATTCATC 1560 

CGTGAGTTAC CAGTCAACTC AGCCAACCAA GGTG7CTGA? AGGTTGCATT AGCCCCAACA 1520 
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CGAATGGTCG AATCTGTACT 7TGCATGAAG TCTTTAGGGA AAGCATGGAT AAAGGCATTC 1680 

CCTACATACA AGACAATGTA GTTCATCATG ATGGTTACAA TAACCTCTGA CCTCCCTACA 1740 

TAGGCCCTAA GAATACCTGG AATCGCTCCG ACAATCCCAC CAGCAATCAA GGCAATCACG 1300 

ATGGTTGCTA GAATCATCAA GGGACGGGGC ATATCTGGAT GCG AC AG GGC AAACCAACCA 1860 

CTGAGAATCC AACCTGCCAA AGCCTGACCA GGAAGTCCGA CGTTAAAGAA ACCAGCTCGA 1920 

CTGGCAACCG CAAAACCAAG ACCAATCAAG ACCAGAGGAC CCATAGCACG GAAGATTTCT 198C 

CCAATCCCAC GCAGACTGCC AAAGGCTGTA TACAACAATT CTTCGTAGCC CCAAATAGCA 204 0 

TCATAACCGA AGATCCACAT GACAATGGCT CCGAGTAAAA TTCCTAGGAA 7ACACAAATC 2100 

AAGGGAACCG AAATTTGTTC TAATTTTTTA GACATCACTC TTCTCCT7TC CCAAGTTTCC 2160 

ACCAGCCATC AAGACACCAA GTTCTTGTTT ATTGGTTGTT TCTGGTGATA CAATACCTTG 22 20 

AATCTTACCA TCGTGGATAA CGGCAATACG GTCTGAGACG TTTAAAATCT CATCCAATTC 22 80 

AAACCTGACA ACAAGGACAG CCTTGCCATT ATCACGCTCT TCAA7CAACC GTTTGTGGAT 2 340 

ATACTCAATG GCACCGACAT CCAACCCACG AGTTGGCTGG CTAACGATAA GGAGATCAGG 2 4 00 

ATCTCGATCA ATTTCACGAG CAATAATTGC TTTTTGTTGA TTTCCTCCTG AGAGTGCAGC 2 460 

TGCAGGAACT AATTCACTGG CAGCGCGAAC ATCAAACTCT TCCATCAGCT TTTTAGCATA 252 0 ' 

AGAAGT AAT A TTTGAATAAT TCAAAATTCC ATTTTTACTA TGTGGTTCTT TATAGTAGGT 2580 

TTGAAGGGCA ATATTTTCAG ATATCATCAT TTCCAAAATC AAGCCATCAC GGTGACGGTC 2 640' 

TTCTGGAACG TCCCCAACAC T7AG77CTCT AATCTCACCT GGGTGCAAGC C7ACAAT7GA 2 700 

ATCTCCTTTT ACCTCAATGC TACCACATTC AACCTTACGA AGACCTGTAA TGGCTTGAAT 27 60 

CAG7TCACAC TGACCATTTC CATCAATCCC CCCAA7ACCA ACAATCTCTC CAGCACGAAC 2 8 20 

ATCCAAGGAC AGATTTTTAA CAGCTGGAAC ACCACCCTTT TCATTGACCA CCAAATC7TT 2 380 

GATAGACAAA ACCACTTCTT TTGGTTTAGA GCCT7CC77C TCTCTTT7AA AGGAAACAGA 29 40 

ACGTCC7ACC ATCATTTCCG CCAAATCAGC ATTGGTAGCC CCTGCAATTT CAACGGTTTC 3000 

AATTGATTTC CCACGACGGA TAACTGTAAC ACGGTCAGAA ACTGCTCGAA TTTCATCCAA 306C 

TTTGTGGGTA ATCAAGATAA TTGATTTTCC TTCTTTGACA AGATTTTTCA TAATACCCAT 3120 

CAACTCATCA ATTTCTGATG GAGTCAAAAC AGCCGTTGG7 TCGTCAAAGA TAAGGATATC 3180 

AGCCCCCCGA TAAAGTGTTT TTAAAATTTC TACACGTTGT TGGGCTCCAA CTGAGATATC 3 2 40 

TGCTACCTTG GCAGAAGGGT CAACAGCTAA GCCATAACG7 TCAGAAAGAC CCTTGATTTC 3 3 00 

TTTGCTAGCT CCAGCGATAT CTACCACACC ATTTTTAGTC AATTCACTAC CTAAAATGAT 3 3 60 
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CTTTTCAGCC ACTCTCAAGC CTTCAACCAA CATAAAGTGC TGGTGAACCA TCCCCATTCC 3 420 

CAAGCTAGCT GCTTTAGATG GGGAGTCGAG ATTGACAACT TGACCGTTGA CCGCGATTTC 3480 

ACCACTAGTT GGTTCAACAA GCCCTGCTAA CATGTTCATT ACCGTGGACT TACCAGCCCC 3 540 

ATTTTCTCCT AAAAGTGCAT GAATTTCACC TTTTCGTAGG TGCAAGTTGA TTTTGTCGTT 3600 

GGCAACAAAT CCACCAAACA CCTTGGTAAT ATCACGCATC TCAATGACAT TTTCGTCTGC 3 660 

CATGTGCTCT TCCTTTCAGA GTCTTATTTT ATTTCAATAA AACTTGCTAG TTTGTCTAGT 3720 

AGCAAGCTTT ACTTAGACAA AATGACTTTG TCTCAACTCT TAAAAAAGCG CCCCTTGGCC 3780 

GCTTCCTAAG AAATGACTTC CATCCATTAT TTTTCAGGAA CTTTTACGCT TCCATCAAGG 3 840 

ATTTTAGCTT TTGCATCT7C GACAGCTTTT TTACCTTCTT CTGAAACGTT TGTTACTGCC 3 9 CO 

AAGTCAACCC CTTTATCCTT CAATGAGTAA ACGATCACTT GACCGCCAGG GAATTCTCCT 3960 

CTTTCTGCCT TGTTAGAAAT ATCTTTTACA GTTGTACCAA CTTGTTTCAA AGTAGATACA 4020 

AGAACAAACT TTGATTCTTT GCCATCTTTA GAAGTGTATT TACCTTCTGC TTCTTCCTCA 4080 

CCATCAACAC CGATAACCCA AACTTTTTCA TTTTCAGGAC GGCTTTCGTT GAGAGATTTT 4140 

CCCTCTGCAA AGACACCTGC ACCTGTACCA CCAGCTACTT GCTAAACAAT ATCTCCACCG 4200 

GCTGCGTATT GTGCGGCTGC AATTGTTTTA CCTTTACCCG CATCACCAAA TGAACCAGCG 4 260 

TAGTCAACTT GGACTTTGAT AGATGGGTCT ACTGACGCAA CACCAGCCTT GAATCCTCCT 4 320 

TCAAAACGAG AGATAACTTC AGATTCGATA CCACCTACAA AACCAACTTG TTTTCTCTTA 43 80 

GTTGTTTTTG CTGCAGCCAC ACCTGCAAGg TAACCTGACT CATTATCAGC GAAAGTTACG 4440 

CTCGCAACAT TCTTTTCGTC TTTAATCACA TCATCAATCA ACACATAGTT CAACTCAGTC 4 500 

TCTTCTTTTC CTCCATCTTT AACTGCATTA TTAACCCCAA AACCAACACC GAAGATTAGG 4 560 
TTGTAAC7TC CAGCCGCTTC TTGCAACT7G T7ACC37AC? CACC77CAC7 TGTTGATTGG 4 62 0 

AAGTAAGTGA AACCGTTATC TTTTGAAAGA TTGTGTTCTT TACCCCAAGC CTGCAAACCT 4 680 

TCCCAACC7G ATTGGTTGAA TGATTTC7CA TCAACACCAC CAGTATCACT GACGATTGCT 4740 
GCTTTTG7CT TCACATCAGA AGATGAAGCT GCGTTACGAG AAGAGCGGTT ACCACATGCA 4 900 

GCAAGTCCAA CTGCTGCCAC TGCAACTAGG CCAACACCTA CCCAT7C7TT CTTGTTCATT 4 8 60 

ACTGAACCTC CTAAATAAGA TGTGCAACGA TGTTGCAAGT ATCGATTGCT 7CGCCACAAG 4920 
GACCGTGCCA CTCAGAGAGC GACTCAGACT ACTTTAAG7C TGTAAAACAG TATGGAAGTA 4 980 

ATTCCCCGAC CGTCATCTCG ACCGTCGATT TATCT7T7GC CACTAAGGTC ACTTTTAGAT 5040 
CTTGTTCAAA AAATTCAGCC ATCACTTGGC GACAAGCACC ACATGGCGAG ATCGGT7TTT 5100 
CAGTTTGACC A7ACACAATC AATTC7GAAA ATTC7CTTTG GCCTTCAGA7 ATAGCCTTAA 5160 
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AAATACCTGT 


TCTCTCACCG 


CAATTCCTCA 


AACGATACCT 


ACCATTTTCA 


ATATTCACTC 


5220 


CCGTGTAAAC 


ACTTCCGTCT 


TTAGCTACTA 


AAACTCCTCC 


GATAGGAAAG 


TGAGAATAGG 


5290 


GGACATAGGC 


ATGTTTCCTG 


GTTTCAATTG 


CCAGTTCAAT 


CAACTCAGTA 


CTCGCCATCT 


5340 


GCCAATTCTC 


CTTTTAAAAT 


AGCTACCCCA 


GCTGACGTTC 


CGATACGGGT 


CGCACCTGCT 


5400 


TCCACAAAGG 


CAAGAGCATC 


TGCATAAGAA 


CGAGCTCCAC 


CGGCCCCCTT 


GACACCCATA 


5460 


TCAGATCCAA 


CTGTTTCACG 


CATTAATGTA 


ACATCTGCTA 


TCGTAGCACC 


ACCAGTTGAA 


552C 


AAGCCAGTAG 


ATGTTTTGAC 


AAAGTCAGCC 


CCAGCTTTTT 


CGCCCAAT7C 


GCAAACAACA 


5530 


ACTTTTTCTT 


GGTCTCTCAG 


AAGGCAAGCT 


TCAATAATGA 


CTTTCACTAA 


CTTATCACCA 


5640 


CTTGCTTCCA 


CTACTGCGCG 


AATATCTGAC 


TCAACCAAGG 


CTAAATTACC 


TGATTTGAGA 


5700 


GCTCCAACAT 


TGATCACCAT 


ATCAATCTCA 


TCTGCACCAT 


TTTGGATAGC 


TTCTTTTGTC 


5760 


TCAAATGCTT 


TCACGGCTGA 


AGTTGTTGCT 


CCCAAAGGGA 


AACCTACTAC 


TGTGCAAACC 


5820 


TTAACATCTG 


TGCCTTCAAG 


TCCTTTTTTA 


CCATGTTCAA 


CCCAGGTCGG 


ATTAACGCAA 


5880 


ACACTGGCAA 


AGTCATACTC 


TCTAGCCTCA 


GACAACAAAC 


TATCAATTTG 




5940 


GCATCTTGTT 


TTAAAAGCGT 


ATGATCTATA 


TATTTATTTA 


ATTTCATTTC 


CGTTTTCCCT 


6000 


CCATTTAGGA 


GATGATTTCT 


ACAATTTCAC 


GGATTTTTTT 


CACTTCATCA 


CTTATTTTAA 


6C60 


CACATTTTTG 


GAAATCTGTA 


ACTAGTTGAG 


GTGGAATTTT 


TTCATTTGTG 


TATACTTTTG 


6120 


CAACAATTTC 


ACCCTTTTGA 


ACGGAGTCTC 


CAATCTTCTT 


TTCAAAAACA 


ATTCCTGTTT 


6180 


CATAGTCCAA 


GGCATCAGAC 


TTAACTGCAC 


GACCAGCACC 


CAGCCTCATG 


GCATAAAGAC 


6240 


CAAAGTCCAT 


ACCTGGAAGA 


GCTGAAATGA 


CACCCGTTTC 


CTGACCAGCG 


A77TCCACCA 


6300 


CATGAGCTAC 


ATTTACAGCA 


CCATAGAGGT 


CTTCCAAGTC 


TCCACCTTGG 


GCTTGCACCA 


6360 


TTTCCTCAAA 


CTTAGCCACT 


GCTTGACCAT 


TCTCAAGATG 


TTGGTGAACT 


TCTTCAACAG 


6420 


* I I I \j i i /v\L 


All LIj\.LA/v\ 


LLAAOCA AA 


TTTGAGCCAA 


TTCACAAATA 


AAGTGGGTAA 


6480 


TATCCTCACG 


TCCTTGACCT 


TGCAAAATCT 


CCAATGCTTC 


AAGGATTTCC 


AGACGATTTC 


654C 


CAATCGCTCG 


TCCCAAAGGC 


TGGCTCATAT 


CCGTAATCAC 


TGCTACTGTC 


TTCCGTCCAA 


6600 


CAACCTTACC 


AAGATCTACC 


ATACTTTGAG 


CCAACTCACG 


CGCCTCATCA 


ACCGTCTTCA 


6660 


TGAAGGCACC 


CTCACCGACA 


GTCACGTC7A 


GCAAAATAGC 


ATCCGCCCCT 


GCCGCAATTT 


6720 


TCTTGCTCAT 


CACCGAACTC 


GCAATCAAAG 


GAATCGTGTC 


GACAGTTGCG 


GTCACATCAC 


6780 


GAAGGGCATA 


GAGAAGCTTA 


TCTGCTTTGA 


CCAGCTGGTC 


TGATTGCCCA 


ATGACAGATA 


6840 


CTCCAATATC 


CTGAACCTGA 


CGAATAAAAT 


CCTCTTGACT 


ACGTTCTACT 


TGATAGCCC7 


6900 
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TAATCGACTC CAATTTATCA ATTCTTCCCC 
TTGCTACAGG CACACCCAAG CTAGCAACAA 
CGACACCACC AGTACAATGC TTGTCAACTT 
CTTGCCCAGT CTTAACCATA TTCATCGTTA 
TAAAATAAAC AGCCATAGCA AAGGCAGACA 
AGCCTTCTAT CAGCCATTCA ATTTCACTTG 
GGATTAAATC AACTCCTCTC ATTCTTTCAC 
7TTAAGGATT TCACAATTGC CAAACACATC 
TTGTTTTTTC TGGATGACGA TGGTCAAATC 
CTCGATGATT TCATGAACGA CTTGCTTGCC 
GTCAAATCGC CCTTGAACTC TTGCATAAAT 
ATTTTTTTCA GCATTTCTCT GAGCTAAATC 
CGCCTGAACT CCGTAAACCT TGACCAAGGA 
ATCTAGGACT GTCTCTCCTT GGTTGACATC 
GTCAACCATT TTCTTCCTAA AAACACCCGC 
CAAGTCCACT CTCAACTCAT GAATGTCCTG 
TTTACTCATG ACACTATTTT ACCATAATTT 
TAATAAAACG AAAAAGACCG AAGAAAGCAA 
AACACTTATA AATAATAAAC CATTTAGAAC 
AAGTTTATCA TCTATAATCA CCCAGAT7AT 
ATTATCAACA AAATTCCTAA CAAAATGTTT 
AGGATTTCCA AACTTGTCCA AAGTCGTATC 
CCCTCTGTCG CAATCCGTAG GACTAAAAAG 
CATCGTTTTT TACT AAG AAA GCAATTAAGA 
CATGTTCCAT CAAAAAAGTA AAACCGTAAT 
CATTGGTTCC TTTTATAAAA CGTAAAGCAA 
AACGTTTTAA GATTTTCATA GTACACCTCC 
TAAGCTTATA AATCACTAGA ATCTATCTCC 
ATATACCATT GACTTACCAC ATTCAAGAAA 
GATAGACAAA ATATCTAGGG GAAAACAAAT 



566 

CTGTATGGCC AAGACCACGA CCACTCATTT 
GAGGAGCTAA AATCAAGGTT ACCTTATCGC 
TCACACCA7C AATGGCTGAC AGGTCAAACT 
AATCAGAGAT TTCTCGAGTC GTCATTCCTT 
TCTGATAATC AGGAACAGTT CCTGATACAT 
AAGTCAGTTC TTGACCGTCT CGTTTTTTTT 
ACTTCTAAGG ATATAGTATC CCTTGTCTTT 
TTCCATCTTA GACTTGGCAC TTGGAGCTCC 
TCCACCAATT TCCAAGAAAT CTTTACTTTT 
CGCACGGATA GCAGGATTGG AAATGACATG 
ATTAGATTGA AATATCGTCG CTTTTGCATT 
CACGCCACGA CTCTTAATA? CAACCATGGT 
CAAACCTAAT GGACCATAAC CACAGCCTAC 
CAGACACTTG AGCAAGAGTT GACTTCCAAA 
ATCTGTCAAA AAAGTCATTT TTTCTCCCAA 
AGCAGCGTCA GGATTTTCTG CATAGTACAT 
GACTCAAATT GTAAATCCTT TACAAATTGA 
GTCACGAAGC CATTTTCTTC AATCTCTTTC 
TATAAATATC ACAGTCCAGA TAAAAACAAA 
TATTTCTATT GCTTAACCTT AAAATACTTT 
ACATAAAACC CCAACTGATA CGTTTATGTC 
AAATCTTCTA GTGACATGTG GAAGAAATAA 
CAATAAC7AC CCGCAGCAAT CCATTTCGTC 
ACCAACAAAT AAAGACAGCT GTTACAATAG 
AGGTTTCCAC AAAGCATCTA CCATTA7CTG 
AAC7TAAAAT AAAACAGAGT TCCAATATCT 
TATAAG7TGT GAACTAAAAA GCCCCCTTTA 
TATTTCATCA ATAAATTGAT CACTTATACT 
CCGCTTTATT TTTTTAGCT7 TTTATGGTAT 
GACCAACGAA TTTTTACATT TTGAAAAAAT 



6960 
7020 
70B0 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
3040 
8100 
8160 
8220 
8280 
834C 
8400 
8460 
8520 
8580 
8640 
8700 
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CAGCCGCCAG ACTTGGCAAT CTTTACATCG AAAGACAACA CCTCCTTTGA CAGAAGAAGA 37 60 

ATTGGAATCT ATCAAGAGTT TTAATGACCA AATCAGTCTC CAAGACGTTA CAGATATCTA 8820 

TCTCCCCTTG GCTCATTTGA TTCAGATTTA CAACCGAAC7 AAGGAAGATT TAGCCTTTTC 8 880 

AAAAGGAATT TTCCTCCA 889 8 
(2) INFORMATION FOR SEQ ID NO: 70: 



<i> SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 13188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(DJ TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 70: 

TATCTTAACG aGGATTGGGT 7TATCGTCAG TCTTATTGCC CTAATTCTCG GAACAATCCC 60 

TTAAATCATT TTGAAAATAA TCGGCCTGTA GCAGATTTTT ACTGTAATCA TTGTAGTGAG 120 

GAGTTTGAAC TAAAGAGCAA AAAAGGAAAT TTTTCATCAA CAATCAATGA TGGTGCTTAT 180 

GCAACGATGA TGAACCGTGT GCAGGCAGAT AATAATCCTA ATTTC TT TTT TTTAACTTAC 240 

ACAAAAAATT TTGAGGTAAA TAACTTTCTT GTCCTTCCGA AGCAATTTCT TACACCGAAA 300 

TCGATTATTC AAAGAAAACC ACTTGCACCA ACTGCTAGAC GAGCAGGTTG GATTGGTTGT 360 

AACATTGATT TATCACAAGT ACCTTCTAAA GGAAGGATAT TTCTTGTGCA ACATGGACAA 4 20 

GTTAGAGATC CAGAAAAACT TACAAAAGAA TTTAAGCAAG CTTTATTTTT AAGGAAGAGC 4 80 

TCTCTGTCAT CAAGAGGTTG GACAATAGAA ATTCTAAATT CTATAGATAA GATAGAGGGT 54 0 

TCAGAATTTA CCCTTGAAGA TATGTATCGT TTTGAAAGTG ACCTAAAAAA TATCTTTGTT 6 00 

AAGAACAATC ATATCAAAGA AAAGATTAGG CAACAGCTTC AAATATTAAG AGACAAAGAA 660 

ATAATAGAAT TTAAACGTAG AGGAAAGTAT CGGAAATTAT GAAAACGAAA CAACTTGT^G 7 20 

CATCAGAAGA GGTGTATGAT TTCTTAAAAG TCATCTGCCC TGATTATGAA ACTGAAAGCC 7 80 

GTTACGATAA CCTAAGTTTA ATCGTCTGTA CCTTATCAGA TCCCGATTGT GTGAGATGGT 840 

TATCTGAAAA TATGAAATTT GGTGACGAAA AACAACTAGC TTTGATGAAG GAAAAATATG 900 

CGTGGCAAGT AGGAGATAAA TTGCCAGACT GGCTACATAG CTCCTATCAT AGATTATTGT 960. 

TAATAGGTGA ATTATTGGAA AGCAATCTAA AACTGAAAAA GTATACAGTA GAAATTACAG 102 0 

AAACTTTATC ACGTTTAGTA AGTATAGAGG CTCAAAA7CC AGATGAAGCC GAACGACTTG 1C80 

TAAGAGAAAA GTATAAGAGT TGTGAAATTG TTCTTGATGC AGATGATTTT CAGGACTATG 1140 
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ACACTAGCAT ATATGAATAG GTAGATGTTT TTATTTTGTC AACAAAAAAG AGGCTCGCAC 1200 

CTCTTTTTCT TATTTCTTTT TATGATTTAA TACGGCATTG ACGACAATAG CGAGTAGGCT 1260 

GGCTACGACG ATTCCCTTTG AGAAGAACAT TTGGAAGGCT GTCGGCATGC TGACAAAGAG 1320 

ATTACTCTTC TTGAGACCGA CACCTGCAGC GATTGAAACA GCTGCGATAA GGAAGTTGTG 1380 

TTCATTGTTA GCAAAGTCAA CACGGGCGAG GATTTGCATC CCTTGAATTC ATACAAAACC 14 40 

AAACATTACC AGCATGGCAC CACCGAGGAC GGAGCTTGGA ATGATTTGGG CAAGGGCGCC 1500 

AAACTTAGGA AGCAGTCCAA GGAGAACCAG GAAACCAGCT GCGTAGTAGA TTGGCAGCCG 1560 

TTTTTTGATG CCTCACAATT TAACCAAACC AACGTTTTGT GAAAATCCGG TGTAAGGGAA 1620 

GGTGTTAAAC ATTCCTCCGA GAAGTACGGC CAAACCTTCT GCGCGGTATC CGTTGCGAAG 16 80 

GCGCGTGCTG TCGATTGGAT CCTTTGTGAT ATCAGACAAG GCCAGATAAA CACCAGTTGA 1740 

CTCAACCATA GACACCGTTG CGATGATACA CATCATGACA ATAGATGAGA TTTCAAACGT 1300 

TGGCATCCCA AAGTAGAGTG GAGTTGGGAC ATGGACAACT GGAGCTACCG CAACAGGACA 1860 

GAAGTCCACC AAGCCCATAG TAGCAGCAAT GGCAGTTCCA ACAACCAGAC CAATCAAAAT 1920 

AG AG AT AG AC TTCATAAATC CTTTGCTAAA GATGTTGATC AAGAGGATAA TCACAACACT 1980 

AATAGCTGCA AGCAAGAGAC TTTGACCAGT TCGCTCTGGA ACGTTATTTC CCATATTTCC 204 0 

AATAGCGACA GGGATCAAGG TTAAACCAAT CGTGGTAATA ACAGATCCTG TTACGATAGA 2100 

TGGGAAGAGA TTGGCTACTT TTGAGAAGAT CCCTCAAACA AGAACCACCT AAATCCCAGA 2160 

TGCGATAACG GCACCAAACA TAGCGCCACT ACCATGGCTT TGCCCAATCA TAATCAAGGG 22 20 

AGCGACCGAC TGGAATGCAA CTCCAAGAAC GACTGGGAGT CCAATCCCAA AG7ATTTCTT 2280 

GAGTTGGAGT TGGAGGAAGG TTGCCACCCC ACACATGAAG ATATCTCTAG AAATCAGGTA 2 340 

GGTCAACTGC 7CAGCTGAA7 AGCCAAGGGC TGTCGCAATC ATGATGGGAA CCACGATAGA 2400 

TCCTGAGTAC ATGGCTAGTA AGTGCTGCAA CCCAAGAACG GCTGCTTGCG AGTGTTTTTC 24 60 

TTGAGTTTGC ATTAGAGATC TGCCTCCTTA AATACGACTT GACCATTTTC AAAACAATCC 2520 

AAACGACCAA GTGATAGGAC AGGGTAGCCT GCTTTTTCAA GCAAATCACG ACCATCTTGG 2580 

AAGGATTTCT CAATCACGAT ACCGATACCT TGCACTGTGG CACCGGCCTG TTCGATGATT 2 64 0 

TGAATCAAGC CTTTAGCAGC TTGGCCATTA GCAAGGAAAT CGTCGATAAT CAAAACCTTG 2700 

TCCTCTGGTG AGACGAATTT TTCAGCGATA GAAACGGTGC TGGTCACCTC CTTGGTAAAC 27 60 

GAGTACACTT GAGCAGTTAA GATCCCTTCG TTCATGCTGA TCTTCTTAGC TTTTTTGGCG 2 92 0 

AAAATCATGG GAACGTTTAA GGCTTCAGCT GTAAAAACGG CTGGGGCAAT ACCCCACCC? 2 880 

TCAATGGTTA CCACCTTCC7 AATGCCAGTA GTACCAAATT TTTCCGCAAA AACCTTACCA 2940 



WO 98/18931 



PCT/US97/19S88 



569 

ATCTCTCCCA TCAAGCTAAA GTCAACTTGC TGGGTTAAAA AGGAATCTAC CTTGACGATG 3000 

TTATCACCCA AGATATCCCC ATCCTTGAGG ATGCCCTCTT CTAATAATTT CATAAGACCT 3 060 

CCTAAAGTCT AAAAGTTAAT TTACTTGTTG TTTAAATATT TCTATAGTGA TCCCTTTTGC 3120 

TAATACTATA TATTTGATAA AACTATTACG AGCGAAGCGA GTCTTATCAA ATATTTCCCG 3180 

TTGTAGTGGT AT CAT AG AC A ATAATCTTGT TATTGTCTAT GACGGGATTT TTGAGAGTAA 3 240 

AATAGTTCGG GGAACTATTT TAGCCTAAGC CTAGAAATGA AAGAGCTAGG GGCTCAAAAA 3 300 

TTAGGGATGA AATTCCCTGG ATTCCTGAAA TTATTCACAG GATAATTTCA CCTCCCGTCC 3 3 60 

GCACTAATTA AGGGAAATAT TAAAAAAAGA CCTACTTAAT CTCTAAGTAA GTCCCCTAAA 34 20 

TAGACATGGC AAAAACCGCC ATATCTCACT GCTGACTTAC TTATTGTTAG GTGTTCCGGC 34 80 

ACCTTGTAGA AACGTCGTGC CAATTCACGA CATAAACAAG TAAAACGATA TTCAATTTTA 3 54 0 

AATAGGCTTG AGCCAATGTT TTTATTTTAC ACTAAATAAC TTTAGAAATC AACTATTTTC 3 600 

TTAGTGTTTT GGTTTAAAAA ACGAACAAAA AGAAGAGAGG GTGAACAAAA ACTCCATTGT 3 660 

AAGCTAACAG T TAT ACT AAA TGAAAATCAA AGAGCAAACT AGGAAGCTAT CCACAACCTC 3720 

AAAACACTGT TTTGAGGTTG TGGATAGAAT TGACAGAGCC AGTATCATAT ACCTACGGTA 3780 

ACGCGACCTT GACGTGCCTT GAAGAGATTT TCGAAGAGTA TTAGAAGATT TTTCCATCAT 3840 

AAAAGGCATA CTATCAAGCT TTTAGACACC TGACAATATG CCTTTTTCTA ACTTTAAAGA 3900 

CTTTTCCCAA TTTTTATTAT TCTACTCGCT AAATCTTAAA AAATAGCCAT CTGGATCCAA 3960 

AACTGCAAAT TTATGAGGAT AGATATAGGG ATCACTGACA CGAAACTTTC TTTTGGTCAA 4020 

GGGACGATAA ATAGGATAGT TTGCCTTCAT CACTCTTTAA TAGAGTTTTG AAACATCCTT 4 080 

TATGCCAAAG GACACATTGA CTCCACGACC AAAGGGATAG GTCAGTTCAG CTAGTTGATC 414 0 

CTTTGTTCCC TCCTCTAACA TTAGTTGACA CTCTTCAAGA GAAAGAGAAA GTTTTCTTCT 4200 

GGACGTTGGT ATTCAATCCT AAAACCCAGT AAACCACAGT AGAACCACCG GGACTGTTCG 4260 

ATATTCGATA CAAGCAACTC GGGAATGACC GCATTGTAGT CCATATAGAA AATCCTTACA 4 3 20 

AGTCAATTTC CAAGACAATC GGTGTATGGT CTTGGCGACC ACCTGAGTCA ATCATATCAG 4380 

ATTTAGTGAC CTTGTCAGCG ATACGGTTAC TTGTGAGCCA GTAGTCGATT CTCCAGCCTG 4440 

TATTGTTGAT TTTAGAAGTT TTGCTGCGTT GTGCCCACCA AGTGTAGCGT TCAGGAACAT 4 500 

CGCCATGAAC ATGGCGGAAG GTGTCTGTAA ATCCAGTTGC CAAAAGGTTG GTAAATCCAG 4 560 

CACGTTCCTC GTCAGTAAAT CCAGGTGAAC CGCCCTTGCT AGCAGGATTT CCAAGCTCGA 4 620 

TTTCATTGTG GGCTACGTTG TAGTCACCGG TCGCAAGGAC TGGTTTTTCT TTGTCTAGTT 4 68C 
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CAGCCAAATA CTCACCATAT TTGGCATCCC AGAC77GGCC TTCTTCCAAG CGTTTGAGAC 4 740 

CGTCACCAGC GTT7GCAGTC TAAACTTGGG TTACGAAAAA TGCATCAAAT TCTACAGTGA 4800 

TCATACGACC TTCCAAGTCC ATGG7AGAAG GGGCACCGAT TTCTCCGAAC CTGATAGTAG 4860 

GTGTAAGTTC TTTCTTATAA AGGAACATGG TTCCAGCATA GCCTTTACGG GCAGGCTCTT 4 920 

GGGAAGAGCG CCACGTGTTT TCGTAGCCTG GGAAGAGTTC TTCTAAAATT TCCACGTGTT 4 980 

TCTTTGTAGG TCCTTTGGCA GAAAGCTTGG TTTCTTGGAT AGCAATGATA TCAGCATTTT 5040 

CAGCGACCAA GG7TTG7AGC ACTTCTTGGG ACAATTTGGC ACGAGCTGAG TCACTAGTTA 5100 

GGGCAGCGTT TAGGGAATCA ATATTCCATG AGATAAGTTT CATAAAGTTA CCTTTTTCAT 5150 

TCAGATTATA GATTTTATTA TACCAAAAAA AGATCTATTT CCCCAACCTA TGGTTTCAAA 52 20 

AATTACTCTC TTTCGTTTAT AATTAAGAAT CA7777A7GA AAGGGAGTGA AAATACATGA 52 80 

AATTCTACTC TTATGACTAT GTACTCAGCC AAATCCC7CA GCAAAA7CG7 ATCATGGTTG 5 340 

CCTTTCCGAT TGTTCTA77A GCTGTGACAG TTTTTTTTGC TTTCAAGGCA TACCATAATA 5400 

AAAAGGGAAG CGAATTTCGT GAG77GG7CA TGATTTCAGA TCTGCCCTTA T7TAGCTCTG 5460 

CTTTTGCTCA GCATCACGAC TTATCAAAAC AATCAAGTTT CTAACAATAA ATTTCAAACT 5520 

TCACTTCATT TCATCGAGGT 7CTTTCCAAA GATTTCTGAG TAGACAAGTC AGAAGTCTAT 5590 

GTTAATACTT CCACAAACAC AGATGGCGCA CTTATCAACC TCGGAGATCG CTATTATCGT 5640 

GCCCTAAATG GAAGTGAGCC AGACAAGTAC C7GT7AGAGA AAG7CCAA77 G7ATAAGACA 5700 

CACGCAA7TC AAC7GG7GGA 7G7GAACAAA 7GACACT7AA 7TA7ATCGAA A7TT7AA7CA 57 60 

AAC7CG7CT7 GAC7C7CAAA 7AGC7CAACA ACAATGTTCA CTT7GTGAAA CG7T7GAT7G 5320 

ATGG7AAGCC AACTC7CC77 A7CAAAAA7G GGAATAT7GA CCCAGAAGCC TG7CG7TCAG 5880 

77GG7T7GTC TGCA7CGGAT GTA7CCC7CA AAC7TCG7AG CCAAGGGAT7 77CCACATCA 5940 

AGCAAG7CAA ACGAGC7G7G CAAGAGCAAA A7GGGCAAC7 CA7CG77C7G CAAATGGGAG 6000 

A7GAAAATCC 7AAGTA7CCA G7TG7GAC7G ACGG7G7GA7 7CAAC7ACA7 G7C77GCAAT 6060 

CGA77GG7CG TAGCGAAGAG TGGT7GC77G A7AACC7CAG 7AAACAACCG CA7GACAA7G 612C 

7ACCCAA7A7 CTTTAT7GC7 GAA7A7GACA AGGGTGC7C7 7ACAGTCG7A ACT7A7GAAT 619C 

AAGAAAAACC TGGGG7C7TG 7ACTC77CGA AAA7C7C77C AAACCGCG7C AACG7CGCC7 6240 

7GCCGTATG7 AGGT7ACTGA CTTCG7CAGT TC7A7C7ACA ACC7CAAAGC AG7GC777GA 6 3 00 

GCAGCC7GCG GCTACT77CC 7AG777GC7C 7T7GATTT7C A77GAG7A7T CCCC7CAGG7 6360 

77CCA777GC AATCAGAAAG GGATTT7A7C 7CCAT7A77C AAAAACTT7G G7GG7TT77C 64 20 

AAG7TAGAAA AACGCCG77A 7C7AG7CGGA AT7G7GGCCC 7GA7C7TGG7 77CCGTCC7C 64 80 
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AATCTCATTC 


CTCCTATGGT 


1 TATGGGGCGG 


1 GTCATTGATG 


; GCATCACATC 


' GGGGCAATTA 


6540 


ACCCAGCAGG 


ACCTCCTTCT 


TAGCCTATTT 


TACTTGCTAC 


TTCCACCCTT 


' TGGTATGTAC 


6600 


TATTTCCGCT 


ATGTGTGGCG 


TATGTATATC 


CTTGGGACCT 


CTTATTGCTT 


GGGACAGATC 


6660 


ATCCGCTCTC 


GCTTCTTTAA 


GCATTTCACA 


AAAATGTCGT 


CAGCCTTTTA 


TCAAACCTAT 


6720 


CGGACGGGTG 


ATCTGATGGC 


ACACGCAACC 


AATGATATCA 


ATGCCTTGAC 


TCGTTTAGCA 


6780 


GCTGGCGGTG 


TCATGTCTGC 


GGTGGATGCC 


TCTATCACGG 


CTCTGGTGAC 


TTTGTTGACC 


6840 


ATGCTCTTTA 


GCATCTCATG 


GCAGATGACT 


CTTGTTGCCA 


TTCTCCCCCT 


ACCTTTCATG 


6900 


GCCTATACCA 


CTACTCCCCT 


AGGGAGAAAG 


ACTCATAAGG 


CCTTTGGCGA 


ATCCCAAGCT 


6960 


GCTTTTTCTG 


AACTCAATAA 


CAAGGTACAG 


GAGTCCGTAT 


CAGGTATCAA 


AGTGACCAAG 


7020 


TCTTTCGGTT 


ATCAGGCAGA 


CGAGTTGAAG 


TCTTTTCAGG 


CAGTCAATGA 


ATTAACCTTC 


7080 


CAAAAGAACC 


TGCAAACCAT 


GAAATATGAT 


AGTCTCTTTG 


ACCCTATGGT 


TCTCTTCTTT 


7140 


GTTGGTTCGT 


CCTATGTTTT 


AACGCTTTTG 


GTTGGCTCCT 


TCATGGTTCA 


GGAAGGGCAG 


7200 


ATTACAGTTG 


GGAATCTAGT 


CACCTTTATC 


AGCTATTTGG 


ATATGCTGGT 


cTCGccTcrr 


7260 


CTGCCCATCG 


GTTTCCTCTT 


TAATACTACT 


CAGCGAGGGA 


ACGTTTCTTA 


CCAGCGGATT 


7320 


GAAAATCTTT 


TGTCTCAGGA 


ATCTCCTGTA 


CAAGACCCTG 


AGTTTCCTCT 


GGATGGTATT 


7380 


GAAAATGGGC 


GTTTGGAGTA 


TGCCATTGAC 


AGCTTTCCTT 


7TGAAAATCA 


GGAAACACTG 


7440 


ACGGATATTC 


ACTTTAGTTT 


GGCAAAAGGG 


CAAACACTGG 


CCTTGGTTGG 


GCAGACAGGC 


7500- 


TCTCCGAAAA 


CCTCCTTAAT 


CAAGCTCCTC 


TTCCGTGAAT 


ACGATGTGCA 


TAAGGG7GCC 


7560 


ATT7ATCTAA 


ACGGTCACGA 


tat:- :gggac 


TATCGTCTGA 


CAGACCTTCG 


CAGTCTGATG 


7620 


GGCTATGTTC 


CTCACCACCA 


GTT7CTTTT7 


GCCACTTCAA 


TCCTAGACAA 


TATCCGCTTT 


7680 


GGCAATCCTA 


ACTTGCCCCT 


TTCAGCGGTC 


GAGGAACCTA 


CTAAGCTAGC 


CCGCCTTTAC 


7740 


CAAGATATTG 


TAGACATCCC 


TCAAGGATTT 


GATACGCTCA 


TTGGTGAAAA 


AGGAGTCACT 


7800 


CTTTCTGGTG 


GTCAAAAGCA 


ACGGTTGGCT 


ATGAGTCGGG 


CTATGATTTT 


AGACCCTGAT 


7860 


ATCTTGATTT 


TGGATGATTC 


CTTATCCGCC 


GTAGATGCCA 


AGACAGAGTA 


TGCGATTATC 


7920 


GACAACCTCA 


AGGAGATGCG 


AAAGGACAAG 


ACAACCATTA 


TCACTGCCCA 


TCGCCTCAGT 


7980 


GCTGTTCTCC 


ATCCAGATTT 


TATTTTAGTT 


CTACAAAATG 


GTCAAATTAT 


CGAACGACGC 


8040 


ACGCACGAAG 


ACTTGCTAGC 


TTTGGATGGC 


TGGTATCCCC 


AAACCTACCA 


GTCTCACCAG 


810C 


TTGGAAATGA 


AAGGAGAAGA 


AGATGCAGAA 


TAAACAAGAA 


CAATGGACTG 


TATTGAAGCG 


8160 


CTTGATCTC7 


7ATCTCAAGC 


CTTATGGACT 


CCTGACCTTT 


TTGGCACTCA 


GTTTTCTCCT 


3220 
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AGCGACGACG CTCATTAAAA GTGTCATACC CCTCGTGGCT TCCCACTTTA TCGACCAGTA 8280 

TCTCAGCAAT CTTAACCAAC TAGCCGTTAC CGTTTTGCTG GTCTACTA7C GTCTCTACAT 8 340 

CCTACAAACT GTAGTTCAGT ATGTCGGCAA TCTTCTCTTT GCGCGCGTGT CTTACAGTAT 8400 

TGTTAGGGAT ATTCGTCGGG ATGCCTTTGC CAATATGGAG AAACTGGCCA TCTCTTACTT 8460 

TGACAAGACG CCAGCACGTT CTATCGTTTC TCGTTTGACC AACGATACCG AGACGATTAG 8520 

TGATATGTTT TCTGGGATTT TATCCAGCTT TATCTCAGCA GTTTTTATC? TTCTGACAAC 8580 

CCTTTATACC ATGTTCGTGC TGGATTTTCG TTTGACGGCT TTACTCTTCC TCTTTCTTCC 8640 

TTTGATTTTC CTTTTGGTCA ATCTCTATCG AAAAAAGTCA CTCAAAATCA TCGAGAAAAC 8700 

CAGAAGTCTC TTGTCAGATA TCAATAGTAA GCTGCCACAG AATATCGACG GAATCAGGAT 87 60 

TATTCAGGCC TTTAATCAAG AGAAGCGCCT GCAGCCACAA TTTGATGAAA TCAACCAAGA 8820 

ACACTTGGTC TACGCCAACC GTTCTGTAGC CTTCGATGCC CTCTT7TTGA GACCTGCCAT 8880 

GACTTTCCTG AAACTTCTAG GCTATGCAGT CTTGATGGCC TACTTTGCCT ACCGTGGTTT 8940 

TTCTATCGCG ATAACGGTCG GGACCATGTA TCCCTTTATC CAGTACATCA ACCGCCTTTT 9000 

TGACCCCTTG ATTGAGGTGA CGCAAAACTT TTCAACTCTG CAAACGGCTA TCCTTTCTCC 9060 

AGGTCCTGTC TT TGCCCTGA TAGACGAGAG GACCTATGAA CCTCTTCAAG AAAATGGGCA 9120 

AGCCAAAGTC CAAGAAGGCA ATATCCGTTT TGAACATCTG TGTTTCTCAT ATCACGCTAA 9180 

ACATCCGATT CTGGATGACA TTTCTTTCTC TGTTAATAAG GGTGAAACCA TTGCCTTTGT 9240 

ACGTCATACA GGTTCAGGGA AATCCTCTAT TATCAATGTC CTCATGCGCT TTTATGAATT 9 300 

CCAGTCAGGG AGAGTTCTCT TGGATGATCT CGATATCAGG GATTTCAGTC AAGAAGACCT 9 360 

GAGAAAAAAC ATCGGTTTGG TCTTGCAGGA ACCCTTCCTC TATCATCGAA CTATTAAGTC 9 420 

CAATATCGCC A7C7ACCAAG AAACCAGTGA TGAGCACCT? CAGGCTCC'JG CAGCCTTTGT 94 80 

GGATGCAGAT TCCTTTATTC AAGAACTTCC TCAGGGGTAC GACTCCCCTG TTTCCGACCG 9540 

TGGTTCGAGC TTCTCTACTG GGCAACGCCA GCTTCTTGCC 7TTGCTAGAA CAGTCGCCAG 9 600 

CCAGCCTAAA ATCCTGATTT TCGATGAAGC GACAGCCAAT ATTGACTCTG AAACAGAAAC 9 660 

CTTGGTTCAA GCTTCTCTGG CGAAGATGAG ACAGGGCCGA ACAACTATTC CTATCGCTCA 97 20 

CCGCCTTTCT ACTATTCAAG ATGCCAACTG CATCTATGTC TTGGATAAGG GACGCATTAT 97 80 

CGAGACTGGA ACCCATCAGG AACTCTTGGC TCTCGGACCA ACCTATCACA AGATGTATAG 9 840 

TTTGCAGGCA GGGGCCATGG CCGATACTCT TTGAAAATCT CTTTAAACCA TGTCAGCTTT 9 900 

ATCTGCAATC TCAAACCTGT ACTTTGATTT TCATTGAGTA CTAGAAGGAA ATCCTTCAAA 99 60 

TTACAGATTT CTTTCACCGC CTTTTCCATT TTCTCGTATA ATCAAAAATG TTGACAAATA 10020 
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GTATAATAAA AACAAAGGAG AACAGCATCC TGAAATGGGA AGACTTCCCT GTGGAAATGA 10030 

AATCAAGCGA GGTTGACTCT TACTACCAGC TTGTCTCTAA AAGGAACGGT 7CGCTGATTT 10140 

TCAAGCGTTG CTTGGACTGG CTTTTCGCCT TGGTCTTACT GG TTCTGACC TCTCCCATCT 10200 

TTCTCATCTT GAGCATTTGG ATCAAGTTGG ATAGCAAAGG GCCAGTGATT TACAAGCAAG 10260 

ACCGTGTCAC CCAGTACAAC CGTCGGTTCA AGATTTGGAA GTTTCGTACC ATGGTGACGG 10320 

ATGCGGATAA AAAAGCAAG? CTGGTGACTT CTGCTAACGA TAGCCGCATT ACCAAGGTTG 10380 

GAAATTTCAT CCGACGTGTC CCTTTGGACG AACTGCCTCA GTTGGTCAAT GTCCTTAAAG 10440 

CTGAGATGTC CTTTGTCGGT ACACGACCTG AAGTGCCACG TT AT AC AG AG CAGTATAGCC 10500 

CTGAAATGAT GGCAACCTTG CTCTTGCAAC CAGGGATTAC CTCTCCAGCC AGCATCAACT 105 6C 

ACAAGGATGA GGACACAATT ATCAGTCAAA TGACGGAGAA AGGTCTGTCA GTTGATCAGG 10620 

CCTATGTGGA GCATGTTCTT CCTGAAAAGA TGCGCTATAA CCTCGCCTAT CTCCGAGAGT 10680 

TTAGTTTCTT TGGGGACATC AAAATCATGT TTCAAACCGT GTTTGAGGTA CTAAAATAAA 10740 

GTAGTCATAA GAAAATGAGT ACAGATAAAA GGAGCAAATC AATGCCAAAT TACAATATTC 10800 

CATTTTCACC GCCTGATATC ACAGAAGCAG AAATTACTCA AGTAGTGGAT ACCCTGCGTT 10860 

CTGGTTGGAT CACAACAGGT CCTAAAACAA AAGAACTGGA GCGCCGCTTG TCTCTTTACA 109 20^ 

CACAGACACC TAAGACTGTT TGTCTCAACT CTGCGACAGC CGCTCTGGAG TTGATTTTAC 10980 

GCGTTTTGGA AGTGGGACCT GGTGATGAAG TCATCGTTCC ACCCATCACC TATACGGCTT 11040 

CATGTAGTGT CA77ACCCAC GTGGCAGCAA CCCCTGTCAT CGTCCATATC CAAGCAGATA 11100 

CGTTTGAGAT GGACTATGAC CTGCTTGACC AACCTATCAC TGAGAAAACT AAGGTGATTA 11160 

TTCCAGTAGA CCTCGCACGC ATTGTTTGCC ATTATGACCG TTTC7TCCAA GTCGTGGAGA 112 20 

AAAAACGTGA CTTCTTTACC GCTTCAAGCA AGTCCCAAAA GGCCTTTAAC CGTATTCTCA 11280 

TTGTCTCTGA TAGTGCCCAC GCTTTGGGAT CTATTTATAA ACGACAACCT TCTGGTTCTA 11340 

TCCCTGACTT TACTTCCTTC TCATTCCATG CAGTTAAGAA CTTTACAACC GCACAAGCTG 114 00 

GAAGTGCGAC TTGGAAAGCC AATCCAGTGA TTGATGACGA AGAGATGTAC AAGGAATTCC 114 60 

AAATCCTTTC CCTTCACGGG CAAACTAAGG ATGCTCTTGC CAAGATGCAA CTGGGGTCAT 11520 

GGGAATACGA TATCGTTACA CCAGCCTATA AGTGCAACAT GACCGATATC ATGGCTTCAC 11580 

TTGGTTTGGT ACAATTGGAC CGCTATCCAA GTTTGTTGCA ACGCCGTAAG GACATTGTCG 11640 

ACCGCTATGA TAGTGGTTTT GCAGGTTCTC GCATCCATCC TTTGGCACAC AAGACTGAAA 11700 

CTGTCGAATC TTCACGCCAC CTCTACATCA CCCGTGTAGA AGGAGCAAGC CTAGAAGAAC 11760 
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GCAACCTCAT 


CATCCAAGAA 


TTCGCTAAAC 


CAGCAATTGC 


AACTAATCTT 


CACTACAAAC 


11320 


CGCTTCCTCT 


CTTGACAGCC 


TATAAGAATC 


TTGGATTTCA 


TATGACGAAC 


TATCCTAAGG 


11330 


CCTATGCCTT 


CTTTGAGAAT 


GAAATTACCC 


TCCCTCTTCA 


TACTAAATTA 


AGCGATGAAG 


11940 


AAGTAGACTA 


TATCATTGAG 


ACTTTCAAAA 


CAGTTTCTGA 


AAAAGTGCTA 


ACTTTATCAA 


12000 


AAAAATGACA 


AACTACAGTC 


AAGCGAAAGT 


GATCCTGCCC 


CTXAAAAGTC 


TAATTGAGTG 


12G60 


TAAAAACTGT 


TGTTTTCAAT 


TGATAATAGT 


TTACACCTGT 


ACTTGAGGCC 


CCTTTCTCCT 


12120 


C AG AG AG AG A 


ATTTTTATAG 


GATTTTCCTT 


TCTTGTGGGA 


GTCCCGTGGT 


TTGAAATAAG 


12130 


ATGTGAGCAA 


TTTAGTCTAG 


CATTTAGAAT 


CCTTACTAGA 


CATCATTTAG 


AAAATCTAGT 


12240 


GTCTTCTTC? 


AGTTTTCAAT 


TCACCCTATT 


TTTTGAAAGA 


CGTGAGTTTC 


CATGAGTCAG 


12 300 


ATTGTGGAAA 


CTCGCGTCTT 


TTTTTGTTTT 


CAGAATATTG 


TTCAAAATTT 


TGTGCCTGTC 


12360 


TTTCATGTTC 


TACTCATTC7 


TTTGCATGAT 


AGAATTTATA 


GCATGTTGAT 


ATTATAATAA 


12420 


TACAAATATT 


CTATATGTTT 


AGTGATGCTT 


GCTATACATT 


ATTACATCTC 


CTGCGAGACA 


12480 


ATCTATAAAA 


CACTTGTCTA 


CGATTACCTA 


TATGCCCTAT 


TCCAGTATTT 


TAGAAGCACT 


12540 


GCATCTATTT 


TTATCGACGT 


TAAATCTAGC 


TTTT ATAGAA 


CGTCTATTTA 


AGAAATATAT 


12600 


TGTAGTGTTT 


TAGTT7CAAT 


CCGCCATATG 


AGCGATATTC 


AGGTAAATAT 


CCCTGGCGAA 


12660 


TGCTTGTATG 


ACAAGGTATT 


TGTTCTTTCA 


TTTATAATTT 


ACAACATATC 


AACAAATTTA 


12720 


AATATAGTAA 


ATGGGATATT 


TTATATTCAA 


GCTAAGAAAG 


ATAGCATCAC 


TTTTGAATGG 


12730 


AAGGCTAAAG 


AGCAAACTAG 


GAAGTTGGCC 


ATAGATAGCT 


CAAAACCCTG 


CTTTGAGGTT 


12340 


GT AG AT AT AG 


TAAAATGAAA 


TGAGAATAGG 


ACAAATTGAT 


CGCGACAGTC 


AAATCGATTT 


12900 


CTAACAATCT 


TTTACAAGTA 


GAGGTGTACT 


ATTTTAGTTT 


CAGTCTACTA 


TAGAACTGAC 


12960 


CAAGTCAGTA 


ACCTAGACTT 


AGGGCAAGGC 


GGCACTGACC 


TAGTTTGAAG 


ACATTTCCGA 


13020 


AGAGTATAAA 


TTTTAATATT 


TTCTTGTGTT 


ATTCCTTGAC 


AATTCAATTT 


GGAAAATATA 


13030 


TGATAAAGAT 


AATGACAGCG 


GTGTCATTCT 


ATCTATTT7A 


AGAAAAGTAA 


TAATCAATTG 


13140 


TTAAAAATAG 


TAAAAAAATT 


GGAGGTTCTG 


ATGAAATATT 


TTGTTCCG 




13183 


(2) INFORMATION FOR S 


HQ ID NO: 71: 









ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
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AACCAGTGCA TCAGTCTCAC CAAGCACCAG TGCGTCGGCC TCAGCAAGCA CCAGCGCGTC 60 

TGAATCCGCA TCAACCAGTG CCTCAGCTTC AGCAAGTACC TCAGCATCTG AATCAGCATC 120 

AACAAGTGCA TCGGCTTCAG CAAGCACAAG TGCTTCACCC TCAGCAAGTA TCTCAGCGTC 180 

TGAATCCGCA TCAACGAGTG CGTCCGCTTC ACCAAGTACT AGCGCCTCAG CATCAGCGTC 240 

AACAAGTGCT TCGGCTTCAG CGTCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTACGTC 300 

AGCCTCAGCA AGCACATCAG CTTCTGAATC TGCATCAACC ACTGCGTCAG CCTCAGCATC 3 60 

GACAAGCGCC TCAGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCAAGTA CCAGTGCTTC 4 20 

AGCCTCAGCG TCGACAACTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG 4 80 

TGCGTCAGCC TCAGCAAGTA CTAGCGCCTC AGCCTCAGCA TCAACGAGTG CGTCCGCTTC 540 

AGCAAGTACT AGTGCATCAG CATCAGCATC AACGAGTGCA TCGGCTTCAG CAAGTACCAG 600 

CGCCTCAGCT TCAGCAAGCA CCAGTGCGTC AGCCTCAGCA AGTACCAGCG CCTCAGCCTC 660 

AGCAAGCACC AGTGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CGTCGACAAG 720 

TGCGTCGGC? TCAGCAAGTA CCTCAGCGTC TGAATCAGCA TCAACGAGTG CATCAGCTTC 780 

AGCATCAACA AGTCCTTCAG CTTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG 840 

TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC AGCATCAGCG TCAACAAGTG CTTCCGCTTC 900 

AGCCTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTACG TCAGCCTCAG CAAGCACATC 960 

ACCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCCTCAGCA TCGACAAGCG CCTCAGCTTC 1020 

AGCAAGTACC AGTGCGTCAG CCTCAGCAAG TACCAGTCCT TCAGCCTCAG CGTCGACAAG 1080 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAg CCTCAGCAAG 1140 

TACTAGCGCC TCAGCCTCAG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGTGCATC 1200 

AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGCCTCAG CTTCAGCAAG 12 60 

CACCAGTGCG TCAGnCTCAC CAAGTACCAG CGCCTCACCC TCAGCAAGCA CCAGTGCCTC 1320 

AGCTTCAGCA AGTACCAGTG CGTCAgCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG 1380 

TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT TCAGCATCAA CAAGTGCTTC 1440 

AGCTTCAGCA AGTACCAGTG CGTCCGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCGTC 1500 

AACCAGTGCC TCTGAATCAG CATCAACAAG 7GCCTCGGCT TCAGCAAGCA CCAGTGCGTC 1560 

GGCTTCAGCA AGTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 1620 

AACGAGTCCT TCGGCTTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 1680 

TGAATCTGCA TCAACCAGTG CGTCCGCTTC AGCGTCAACC AGTGCGTCGG CTTCAGCGTC 174 0 
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GACAAGTGCT TCCCCTTCAC CATCAACGAG TGCGTCGGCC TCAGCAAGCG CAAGTACCTC 1800 

AGCGTCAGct TCCGCCTCAA CCAGTGCGTC GGCTTCAGCA AGCACAAGTG CGTCAGCCTC I860 

AGCAAGTATC TCAGCGTCTG AATCGGCATC AACGAGTGCG TCTCACTCAG CATCAACGAG 1920 

TACGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA TCAACCAGTG CGTCAGCCTC 19 80 

AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTCCT TCAGCCTCAG CGTCGACAAG 2040 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAG CCTCAGCAAG 2100 

TACTAGTGCA TCACCTTCAG CATCAACGAG TGCATCGGCT TCAGCATCAA CCAGTGCCTC 2160 

GGCTTCAGCG TCAACCAGTG CGTCAGCTTC AGCAAGTACC AGTGCTTCAG TCTCAGCATC 2220 

AACAAGTGCT TCACCCTCAG CATCGACAAG TGCCTCCGCT TCAGCAAGCA CATCAGCATC 223C 

TGAATCAGCG TCAACCAGTG CTTCGGCTTC AGCAAGTACC AGTGCTTCAG CTTCAGCATC 2340 

AACCAGCGCC TCGGCCTCAG CAAGCACCTC AGCTTCTGAA TCCGCCTCAA CCAGCGCCTC 2400 

GCCCTCAGCA AGCACCTCAG CTTCTGAATC GGCCTCAACC AGCGCCTCAG CCTCAGCATC 2460 

AACGAGTGCT TCGGCTTCAG CAAGCACAAG CGCCTCGGGT TCAGCATCAA CCAGTACGTC 2520 

AGCTTCAGCG TCAACCAGTG CTTCAGCCTC AGCATCAACA AGTGCGTCAG CCTCAGCAAG 2580 

TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTACGTC 264 0 

AGCCTCAGCA AGCACCTCAG CTTCTGAATC GGCCTCAACC AGTGCGTCAG CCTCAGCATC 2700 

GACAAGCGCC TCACCTTCAG CAAGTACCAG TGCTTCAGCC TCAGCGTCGA CAAGTGCGTC 27 60 

GGCCTCAACC AGTGCATCTG AATCGGCATC AACCAGTGCG TCAGCCTCAG CAAGTACTAG 2 820 

TGCATCGGCT TCAGCATCAA CCAGTCCCTC GGCTTCAGCG TCAACCAGTG CGTCAGCTTC 2 830 

AGCAAGTACC AGTGCTTCAG TCTCAGCATC AACAAGTGCT TCAGCCTCAG CATCGACAAG 2940 

TGCCTCGGCT TCAGCAAGCA CATCAGCATC TGAATCAGCG TCGACAAGCG CCTCAGCTTC 3000 

AGCAAGTACC AGTGCGTCAG CCTCAGCGTC CACAACTGCG TCAGCCTCAG CAAGTACTAG 3060 

TGCATCAGCT TCAGCATCAA CGAGTGCATC GGCTTCGGCG TCAACCAGTG CATCAGAGTC 3120 

AGCAAGTACC AGTCCGTCAg CTTCCCCATC AACAAGTGCC TCGGCTTCAG CAAGCACCAG 3L3C 

TGCGTCGGCT TCAGCAAGTA CTAGCGCCTC AGCCTCAGCC TCAACCAGTG CGTCAGCCTC 3 24 0 

AGCAAGTATC TCAGCCTCTG AATCGGCATC AACGAGTGCG TCCGCTTCAG CAAGTACTAG 3 300 

CGCCTCAGCC TCAGCGTCAA CAAGTGCATC GGCTTCAGCG TCAACGAGTG CGTCTGAATC 3 3 6C 

GGCATCAACC AGTGCGTCCG CTTCAGCAAG TACTAGCCCC TCAGCCTCAG CGTCAACAAG 3 42C 

TGCATCGGCT TCAGCATCAA CCAGTGCCTC CCCTTCACCA AGTACTAGCG CCTCAGCCTC 3480 

AGCGTCAACA AGTGCATCGG CTTCAGCGTC AACGAGTGCG TCTGAGTCAG CATCAACGAG 354 C 
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TGCGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA TCAACCACTG CGTCAGCCTC 3 600 

AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAC CGTCGACAAG 3 660 

TCCGTCGCCT TCAGCAAGTA CCAGTGCGTC AGCCTCAGCA AGTACCAGTG CGTCAGCCTC 3720 

AGCGTCGACA AGTGCGTCGG CCTCAACCAG TGCATCTGAA TCGGCATCAA CCAGTGCGTC 3780 

AGCCTCAGCA AGTACTAGTG CATCAGCTTC AGCATCAACG AGTGCATCGG CTTCAGCATC 3840 

AACCAGTGCA TCAGAGTCAG CAAGTACCAG TGCGTCACCT TCCGCATCAA CAAGTGCCTC 3 900 

GCCTTCAGCA AGTACTAGCG CCTCAGCCTC AGCGTCAACA AGTCCTTCAG CTTCCCCGTC 3960 

AACCAGCGCC TCGGCCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CAAGTGCCTC 4 020 

GGCTTCAGCA TCAACGAGTG CATCAGTCTC AGCAAGCACC AGTGCGTCGG CCTCAGCAAG 4080 

CACCAGCCCG TCTGAATCCG CATCAACCAG TGCCTCAGCT TCAGCAAGTA CCTCAGCATC 4140 

TGAATCAGCA TCAACAAGTG CCTCGGCTTC AGCAAGCACA AGTCCTTCAG CCTCAGCAAC 4200 

TATCTCAGCG TCTGAATCCG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC 4 260 

AGCATCAGCG TCAACAAGTG CTTCGGCTTC AGCGTCAACC AGTCCGTCTG AGTCAGCATC 4 320 

AACCAGTACG TCACCCTCAG CAAGCACATC ACCTTCTCAA TCTGCATCAA CCAGTGCGTC 4 380 

AGCCTCAGCA TCGACAAGCG CCTCAGCTTC AGCAAGTACC AGTGCGTCAG CCTCAGCAAG 4440 

TACCAGTGCT TCAGCCTCAC CGTCGACAAG TGCGTCGGCC TCAACCACTG CATCTGAATC 4500 

GGCATCAACC AGTGCGTCAG CCTCACCAAG TACTAGCGCC TCAGCCTCAC CATCAACGAG 4 560 

TGCGTCCGCT TCAGCAAGTA CTAGTGCATC AGCTTCACCA AGTACTAGCG CCTCAGCCTC 4 520 

AGCGTCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG TCACCCTCAG CGTCGACAAG 4680 

TGCGTCCGCT TCAGCAAGTA CCTCACCGTC TGAATCAGCA TCAACAAGTG CCTCCGCTTC 4740 
AGCATCAACG AGTGCATCAC CTTCAGCATC AACAAGTGCT TCAGCTTCAG CAAGTACCAG 
TGCGTCCGCT TCAGCATCAA CGAGTGCTTC AGTCTCAGCG TCAACCACTG CCTCTCAATC 
CGCATCAACA AGTGCCTCGG CTTCAGCAAG CACCAGTGCT TCGGCTTCAG CGTCAACGAG 
TGCGTCTGAC TCAGCATCAA CCAGTGCGTC AGCCTCAGCA AGCACATCAG CTTCTGAATC 
TGCATCAACC AGTGCGTCAG CTTCCGCATC AACAAGCGCC TCGGCCTCAG CAAGTACAAG 
TGCTTCAGCC TCAGCATCAA CCAGTGCATC AGCTTCAGCC TCAACAAGTG CTTCAGCCTC 5100 
AGCGTCAACC AGTGCCTCGG CTTCAGCAAG TACCAGTGCG TCAGCTTCAG CAAGCACAAG 5160 
TGCCTCAGCT TCAGCATCAA CCAGTGCTTC GGCTTCGGCA TCAACAAGTG CCTCAGCATC 5220 
AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTCCA TCAGCATCAG CATCAACCAC 5280 



4800 
4860 
4920 
4980 
5040 
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TGCATCAGCC TCAGCAAGTA TCTCAGCCTC TGAATCCCCA TCAACGAGTG CATCAGCATC 5 340 

AGCATCAACG AGTGCATCGG CTTCAGCGTC AACCAGTGCA TCAGTCTCAG CAAGCACCAG 5400 

TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC 54 60 

GGCATCAACG AG7GCGTCAG CCTCAGCAAG TACTAGTGCA TCGGCTTCAG CAAGCACCAG 5520 

TGCGTCGGCT TCAGCATCAA CCAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC 5580 

GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA TCACCmTCAC CATCAACGAG 5640 

TGCATCGGCT TCAGCAAGTA CCAGCGCCTC AGCTTCAGCA AGCACCAGTG CGTCAGCCTC 57 00 

AGCAAGTACC AGCGCCTCAG CCTCAGCAAG CACCAGTGCC TCAGCTTCAG CAAGTACCAG 5760 

TGCGTCAGcT CAGCATCAAC AAGTGCTTCA GCTTCGCCCT CAACAAGTGC CTCACCTTCA 5820 

GCATCAACGA GTGCGTCGCC TTCAGCAAGC ACCAGTGCCT CGGCCTCAGC AAGCACCAGT 5880 

GCTTCAGCTT CAGCATCAAC AAGTGCGTCA GCTTCAGCAA GTACATCAGT TTCAAATTCA 5940 

GCAAACCATT CGAACTCACA AGTTGGAAAT ACTTCTGGAT CGACAGGTAA ATCCCAAAAA 6000 

GAATTGCCTA ATACAGGTAC TGAGTCGTCA ATTGGATCTG TCTTACTTGG AGTTCTAGCA 6060 

GCTGTTACAG GTATTCGATT GGTTGCGAAA CGCCGTAAAC GTGATGAAGA ACAGTAAGAC 6120 

AACCTGTAAA GTTAGGCTAA ACTAACTCGC GCACATAAAT CAAGCAGAAA ATTGCTAGTC 6180 

GATGATAAAA TAACAGTCAT TGTACCAGTA TACAATGTCG AAAACTATCT GAGGAAGTCC 6240 

CTAGATAGTA TTATTACTCA AAC AT AT AAA AATATTGACA TTGTTGTCC7 TAATGATGGT 6300 

TCTACGGATG CTTCAGGTGA AATTTGTAAA GAATTTTCAG AAATGGATCA CCGAATTCTC 6360 

TATATAGAAC AAGAAAATGC TGGTCTTTCT GCCGCACGAA ACACCGGTCT GAATAATATG 6420 

TCCGGAAATT ATGTGACCTT TGTGGACTCG GATGATTGGA TTGAGCAACA TTATGTAGAA 64 80 

ACTCTATATA AAAAAATAC7 ACAGTATCAG GCTCATATTG CACTTGGTAA TTATTATTCT 6540 

TTCAACGAAA GTGAAGGAAT GTTCTACTTT CATATATTGG GAGACTCCTA TTATGAGAAA 6600 

CTATATGATA ATGTTTCTAT CTTTGAGAAC TTGTATGAAA CTCAAGAAAT GAAGAGTTTT 6660 

GCTTTGATAT CTGCTTGGGG TAAACTCTAT AAGGCAAGAT TCTTTGACCA GTTGCGCTTT 6720 

CACA7AGCTA AATTACGAGA AGATGGTTAC CTCAATCAAA AGGTATATTT ATTATCAGAA 6730 

AAGGTAATTT ATTTAAATAA AAGTCTTTAT GCTTATCGGA TTAGAAAAGG TACTTTATCA 6840 

AGAGTTTGGA CAGAAAAGTG GATGCACGCT TTAGTTGATG CTATGTCTGA ACGTATTACG 6900 

CTACTAGCTA ATATGGGTTA TCCTCTAGAG AAACACTTGG CAGTTTATCG TCAGATGTTG 6960 

GAAGTCAGTC TCGCCAACGG TCAAGCTAGT GGTTTATCTG ACACAGCAAC GTATAAAGAG 7020 

TTTGAAATGA AACAAAGGCT TTTAAATCAG CTATCGAGAC AAGAGGAAAG TGAAAAGAAA 7 080 
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GCCATTGTCC TCCCAGCAAA CTATGGCTAT GTAGACCAAG TTTTAACGAC AATCAAGTCT 714 0 

ATTTCTTATC ATAATCGTTC GATTCGTTTT TATCTGATTC ATAGCGATTT TCCAAATCAA 7200 

TGGATTAAGC AATTAAATAA GCGCTTACAG AACTTTCACT CACAAATTAT TAATTGTCCC 7 2 60 

GTAACTTCTG AGCAAATTTC ATGTTATAAA TCGGATATTA GTTACACAGT CTTTTTACCC 7 320 

TATTTCATAG CTGATTTCGT GCAAGAAGAC AAGGCCCTCT ACTTGGACTG TGATCTAGTT 7 3 30 

GTAACGAAAA ATCTGGATGA CTTGTTTGCT ACAGACTTAC AAGATTATCC TTTGGCTGCT 7 4 40 

G7TAGAGA7T TTGGGGGCAG AGCTTATTTT GCTCAAGAAA TCTTTAATGC CGGTGTTCTC 7 500 

TTGGTAAACA ATGCTTTTTG GAAAAAAGAG AATATGACCC AAAAATTAAT TGATGTAACC 7 56C 

AATGAATGGC ATGATAAGGT GGATCAGGCA GATCAGACCA TCTTGAATAT GCTTTTTGAA 7 6 20 

CATAAATGGT TGGAATTGGA CTTTGATTAT AATCATATTG TCATTCATAA ACAGTTTGCT 7 680 

GATTATCAAT TGCCTGAGGG TCAGGATTAT CCTGCTATTA TTCACTATCT TTCTCATCGG 77 4 0 

AAACCGTGGA AAGATTTCGC GGCCCAAACC TATCCTGAAG TTTGGTGCTA CTATCATCGG 7800 

CTTCAATGGA CAGAATTCGG ACAAAACCAT CATTTACATC CATTACAAAG ATCTCACATC 7 8 60 

TATCCAATAA ACGAACCTTT CACTTGTCTA ATCTATACTG CCTCACACCA TATTGAACAA 7 920 

ATTGAGACAT TGGTTCAATC CTTGCCTGAT ATTCAGTTTA AGATAGCAGC TAGAGTAATA 7 980 

GTTAGTGATC GATTGCCTCA GATGACAATT TATCCAAACG TGACTATATT TAACGGAATT 804 0 

CACTATTTGG TAGATGTCGA TAATGAATTG GTAGAAACCA CTCAACTACT TTTAGATATT 8100 

AATCATGGCG AAAAGACAGA AGAAATTCTC GATCAATTTG CTAATCTTGG CAAGCCTATC 8160 

TTATCCTTTC AAAATACTAA AACCTATGAA GTAGGTCAGG AGGCATATCC TGTTGACCAA 8220 

GTTCAAGCAA TOATTCAAAA ATTGACACAA ATAACCAAA? GAACAAAAAT CATTTACTAG 3:30 

GAGATGCTCT GATTTTGACG GTTAGTGATC AGATTGAAGA GTTGCATTAT TTTTTATAAA 3 340 

ATTTCTCCCT TCATCATATA TGAAAGTTGT TCAAACATCA CAQTCCTTTA TAAAATATAA 8 400 

ATAGACCTAA AGATATTTAA TATGAACTGC ACCCCAAAAG TTAGACACAA AAAATCTAAC 84 60 

TTTTTGGsGT CAGTACAATA TTAGGGTCTG ATTAATTATC TTTTTAGGTG AAAATGATTC 35 20 

TATATTATAG CTGTTTGATA CGAAATTTAT TATAAGGAAA TTATGTTAAT GAATACAAAA 85 80 

TCTATACTTT TTAATGCAGA TAATGATTAT GTAGATAAAT TAGAAACTGC AATTAAATCT 8640 

ATTTCTTCTT ATAATAATTG TTTAAAATTT TATGTATTTA ATGATGATAT TGCGTCAGAG 87 00 

TGGTTTTTGA TGATGAATAA GCGATTCAAC ACTATACAAT CTGAAATCGT TAATGTAAAG 37 6 0 

ATTGTAGATC ATGTTCTTAA AAAGTTTCAT TTACCGTTAA AGAATTTAAG TTATGCCACT 382 0 
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TTCTTTCGTT ATTTTA7ACC TAATTTTCTC AAAGAAAGTC GTGCTTTATA CCTACATTCT 8830 

GACATCATTG TTACAGGAAG TTTAGACTAT TTAT7TGATA TAGAACTAGA TGGTTATGCC 9940 

TTCGCACCAC TAGAAGATTC TTTTGGTGAT GTTCCTTCTA CCAATTTTAA CTCCGGAATG 9000 

T7ATTACTTA ATGTAGATAC TTGGAGAGAT GAAGATGCTT C7TCGAAACT G7TAGAACTC 9060 

ACCAATCAAT ATCATGAAAC AGCATATGGA GATCAAGGAA TTTTAAATAT GTTATTCCAT 9120 

GATAGATGGA AAAGATTAGA CCGAAATTTT AATTTTATCC 7GGGGATGGA TAGCGTCGCA 9130 

CACATAGAAG GAAATCATAA ATGGTATGAG ATT7CTGACT TGAAAAATGG AGATTTACCT 9 240 

AGTGTTATAC ATTATACTGG GGTAAAACCT TGGGAAATAA TTTCCAATAA TCCCTTTAGA 9300 

GAAGTTTGCT GGTTTTATAA TCTGTTAGAA TGGTCTGATA 7TTTATTGAG AAAAGACATT 9 360 

A7TAGTCC7A GTTTCGAAGA ACTTGTATAC AGTCCTAAAG CTCATACAGC AATTTTTACA 9420 

GCTAGTTGTG AGATGGAGCA TCTAGAATAT TTGATAGAAA ATTTACCAGA GGTACATTTT 94 80 

TCTATACTAG C AC AT AC A? A TTTTGCGTCT AGTGTCGTTG CTTTATTAAG ATATAGCAAT 9540 

GTTACGATTT ATCCTTGTTT TTCTCCATTT GATTATCGAA AAATTTTCCA TAATTTAGAT 9600 

TTTTATTTAC ATATTAATCA TTATAAAGAA GTGGATAATA TTGTATCCGT TGTTCAACAA 9660 

CTATCTAAAC CAATTTTTAC CTTTGAAAAT ACTACTCATG ATATAGGCAA TCAAACTAAT 97 20 

ATATTTTCTT CAACCGAACC AAACAAAATG GTAGAGGCTA TTAGACAATT TATAGGAGAA 97 80 

TAACTTTATG GCAGACGAAC TAATTAGTAT TCTACTTCCA ATCTACAACC TTGAGAATTA 9840 

TTTGCGAATG TGTTTGGATA GCATTCAGAA TCAGACGTAT CAAAATTTTG AGTGTTTATT 9900 

AA7CAATGAT GGCTCTCCAG ATCATTCATC CAAAA7ATCT GAAGAATTTG TAGAGAAACA 99 60 

TTCTCCTTTC AAATATTTTG AGAAAGCAAA CGGCCGTCTT TCATCAGCTC GTAACCTAGG 10020 

7A7TGAATGT TCGGGGCGGG GCGTACATTA C77TTGTAGA C7CTCA7GAT TGCTTCGAAC ICG 30 

ATGATGCTTT AGACCGATTA TATGGTGCTT TGAAAAAGGA AAACG C AG AT ATTAGTATCG 10140 

GGCGTTATAA TTCTTATGAT GAAACACGCT ATC7CTATAT GACTTATGTT ACGGATCCAG 10200 

ATGATTCTCT AGAAGTGATA GAAGGTAAAG CAATTATCGA TAGGGAAGGT GTCGAAGAAG 102 60 

TCAGAAATGG GAACTGGACT GTAGCTGTCT TGAAGTTATT CAAGACAGAG TTACTACAAC 1032 0 

ATTTACCATT TCCTATAGGA AAAATTGCAG AGCATACTTA CTGGACATGG AAGGTACTTC 10380 

TAAGAGCTTC GAGGATAGTC TATTTGAATC GTTGTGTTTA CTGGTACCGT GTTGGTTTAT 10440 

CTGATACTTT ATCGAATACA TCGAGTGAAA AGCGTATGTA TGATGAAAT7 GCGGCTAGGG 1C500 

AAGAAAAGAT AGCTATTTTA GCAAGTTCAG ACTATCAC7T GACCAATCAT ATTTTGATTT 10560 

ATAAAAA7AG ATTACAAAGA G7GATAGCAA AATTAGAAGA ACAAAATATG CACT7CACAG 10620 
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ATTTAAAATA AATGATAAGG ATTTAAAATG AGAAATACCA 
GGTGATTACG CTTA7ATTCC ACAAATCGAA ACGGCGATGA 
AGTCATTTGA AAATTTATCT GCTAAATCAG CACATTCCTC 
AGAATATATT TACAAGAGAT GGGGGGCGAC TTGATTGACT 
TTTCAAATGA ATTGGTCTAA TAAATTACCT CATATCAATC 
TT7ATTCCAG ATTTTGT AAC AGAAGATAAA GTTCTCTATC 
ACTGGTGATT TGACCGATTT GTTTGAATTA GACTTAGGTG 
CGTTCTTGCT TTGGAGCAGG AGTCGGCTTC AATGCTGGTC 
AAATGGGGAT CTGAAACTAT TCGACAAAAA TTGATTGACT 
AATGTGGAAG AAGGAGACCA GTCAATTTTG AATATCTTGT 
CTTCAAGATC AATATAATTT TCAAATAGGA TATGATTATG 
CAATTCATTT TTGATATTCC GCTCGAACCA CTGCCACTAA 
CATAAGCCTT GGAATCAATT TTCTGTTCGA CGTCTAAGAG 
TTGATGGATT CGTCTGTTAT TTTAAATGAA TGCTTTTCAA 
AAATCACAAA TATTTAAGTT GC AATGTGTT AATTTAACGA 
ATCGATTATT TGGCGGACCA ATTGCCAGAA GTTCATTTTC 
ATGGCAAATG AACTACTAGC TTTAACGCGT TTTCCTAATG 
TTACCAATGT TATTGCAACA AATAGTAATA GCTTCACATT 
GATCGAAAAT TAGAAGATGC ATATGAGTTT GTGCTTAAGT 
TTCGACAATA CTTGCTCTGA AAATCTTTCT GAGATTTCAT 
7CCATTCCGA AAAAAATGGT TGCAGCAATC AGATCTTACA 
GAAAATCAAT AGTATTAGCG GCAGATAATG CCTATCTTAT 
AGTCTCTATT GTATCACAAT AGAGATGTTG ATTTTTATAT 
CTGAATGGTT TAAATTATTG GGGAGAAAAA TGGAACTTGT 
TACACATTGA TAAAGAACTT TTTGAAAGCT ATAAAACAGG 
CTTACTTTAG ATTTTTTGCG ACAGAAGTGG TTGAATCTGA 
CCGATATCAT TGTAACTGGG CAACTACCTA CTTTGTTTGA 
CAATTGGTGC TGTTGATGAT GTCTATCCCT ATGAAGGACG 
CTATGTTACT AATCGATGTT GCAAAGTGGA AAGAACATTC 



AACGCGCTGT AGTATTTGCA 14 220 

AGTCACTCTG TAGACACAAT 14280 

AGGAATCGTT TACTCAAATA 14 340 

GCAAGTTAAT TGGCTCACAG 14 400 

ATATGACATT TGCACGCTAT 144 60 

TAGATAGTGA TTTGATTGTG 14520 

AAAATTATTT GGCAGCAGCT 14580 

TTCTCTTGAT T AAC AAC AAA 14 64 0 

TAACAGAAAA AGAACATGAG 14 700 

TTAAAGATCA ATATAGTTCC 14 7 60 

GGGCGGCAAC CTTTAAACAT 14820 

TTTTACACTA TATTTCTCAG 14 880 

AAGTTTGGTG GGAATACTCT 14 940 

AGAGTGTGAA GTACCCTAGT 15000 

ATTCTTGGTG TGTCGAGAAA :5060 

ATATTGTTCC TTATACAAAT 15120 

TTACCCTATA TCCAAATTCC 15180 

7GTATTTGCA TTTGAATCAT 152 40 

ACAAAAAACC AATGATAGCT 15300 

ATGAAGGTAT CTATCCAACC 153 60 

TGAGGTAGAG AACAGTATGA 15420 

TCCTTTACAG ACG ACT AT \A 154 80 

TCTCAACAGT CATATAGCTC 155 4 C 

GAATTCTACA ATTCGCAGTG 15600 

ACCTCATATA AATTATGCTT 15660 

TAGGCTATTG TATCTGGATT 15720 

GATAGATCTC AAAGGATATT 15780 

AAAATCTGGA TTTAATACTG 15840 

TATTCTCAAT AGTTTATTGG 15900 
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AATTAGCCCC CGAGCAGAAT CAAGTTGTTC ATCTTCGGGA TCAGAGTATT TTAAATATTT 15960 

ATTTTGACGA TAATTCGCTA GCCTTAGATA AAACATATAA TTATATGGTG GGTATTGATA 160 20 

TTTATCACCT TGCTCAAGAA TGTGAACGTC TAGATGACAA TCCACCTACA ATTGTTCACT 16080 

ATCCTAC7CA TGATAAACCT TGGAATACAT AT ACT AT AT C TAGACTACCT GAATTATGGT 16140 

GGGTTTATAG ACATTTGGAT TGGTCAGACA TTGCTTTTCA ACCTTCCGAT TTAAATTATT 16200 

TTGAAAGAAG CAATCACTCT AAAAAACAAG TGATGCTTCT GACATGGAGT CCAGATA7AA 16 2 60 

AACATTTAGA GTATTTAC7A CAACGGTTAC CTGATTGGCA TTTTCATT7C GCTGCACCGT 163 2C 

GTGATTGTTC TGAGGAGCTG ACC7CTCTAT CACAGTATAC GAATCTAACA GTATATCAAA 163 80 

ATGTATTACA TAGTAGAATT GATTGGCTAT 7GGACGATTC TATAGT7TAT TTAGATATTA 164 40 

ATACAGGTCG ACAGCTTT77 AATGTAGTTA CAAGGGCACA AGAAAGTGGC AAGAAAATCT 16500 

TCCCTTTTGA TATCACACGT AAAAGTATGG ATGATGGACT CTATGACGGT ATTTTTTCTG 16 560 

TGGAGAGACC AGATGATTTA GTGGATAGAA TGAAGAATAT AGAGATAGAG TAATGAGTGA 16620 

ATTAATTAGT GTTGTGCTAC CGATATACAA TACGGGAAAA TATTTACTGC AGTGTGTCGA 16680 

GCATATTCTG AAGCAAACCT ATCAAAATAT AGAAATTATT TTAGTTGATG ACCCTTCTAC 16740 

GGATAATTCT GCGGAAATTT GTGATGCTTT TATGATGCAA GATAATCGTG TGCGAGTATT 16800 

GCATCAAGAA AATAAGGGGG CCCCACCACA AGCTAAAAAT ATGGGGATTA GTGTAGCTAA 16 860 

GGGAGAGTAC ATCACGATTG TTGATTCAGA TGATATCGTA AAAGAAAATA TCATTGAAAC 16920 

TCTTTATCAG CAAGTCCAAG AAAAGGATGC AGATCT7CTT ATACGGAATT ACTATAATTA 169 80 

TGACGAAAGT GACGGGAATT TTTATTTTTA TGTAACAGGG CAAGAT7777 GCGTCGAACA 17040 

ATTACCTATA CAAGAAATTA TGAACCGTCA ACCAGGAGAT TGGAAATTCA ATAGCTCGGC 17100 

CTTTATATTG CCGACAT77A AC77GATTAA AAAACAAT7A TTCAATCAAG T7CACTTTTC 17160 

AAATGGTCGC CGCTTTGATG ATGAAGCAAC TATGCATCGC TTTTATCTTT TAGCCTCTAA 17 2 20 

AATCGTC7TT ATAAACGATA ATCTC7ATCT GTATAGAAGA CGTTCAGGAA CCATCATGAC 17 2 80 

AACGGAATTT GATCTTTCCT GGGCAAGAGA TATTGTTGAA CTGTTT7C7A AGAAAATATC 17 3 40 

GGATTGTGTC TTGGCTGGTT TGGATGTCTC CGTTCTGCGT ATTCGATTTG TCAATCTTTT 174 00 

AAAAGATTAT AAGCAAACTT TAGAATACCA TCAATTAACA GATACTGAGG AATATAAAGA 17 460 

TATTTCTTTC ACATTAAACT TGTTTTTTGA TGCAGAACAA AGAAATGGTA AAAGTTGAAA 17520 

TAAAAGAAT7 GTTATTTACC ATATCACAAA CAATCAAGG7 GACGCGAC7C TTTTATCACT 17 530 

AAGATTTATT CGTCAATAGC AGTAAAAAAA GGACTATTTA CCTCAT7TC7 ACTGTTTA7C 17640 

TATCTATTCG GAAGTCGTAT TATTCTCCCT TTTCTTCACC TAAATACTAA AGATTTTTTA 17700 
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AGTGGGATGA 


TGACTGTGAA 


TTTATTTATA 


GTCCCTTTAC 


GATTATCGTT 


19140 


CAAAAAAATG 


GGAAGAAATT 


TGCTAAGGTT 


GAACATGGAG 


TTGAAGGCTT 


CATCAGTGAT 


19200 


ATACACTATT 


TTGAACCAAA 


TGGTCAAATA 


CATATGCACC 


ATATCGTCCA 


TGATCGTGGG 


19260 


TTTGTATCGA 


GCATTA7CTT 


TTTTGAAGAT 


GGGCAAGCAG 


CCTATCAAGA 


ATATCTGAAC 


19320 


CTCAAGCGAG 


AGTGGCAATT 


CAGAGAGCGT 


TTAAAAGAAG 


GAGGACAGGT 


AGAAGTCAAT 


19390 


CCAATTTTGG 


GTTATCGCTT 


TAAAATGCrr 


ACCTATCAAA 


ATATGGGAGA 


TCTGGTGGCA 


19440 
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GAATTTTTTC 


AGAATTATCT 


GCAAACGTAT 


GTGAAGGATC 


AGGATATTTT 


TATGCTTCCT 


195C0 


TCTCATTCTC 


ATCATGACCA 


GTTGGTACTA 


GATCGTTTAC 


CTAGTACTAA 


TCCTAAACTG 


X9560 


TTGAGTCTGT 


TCATTGGACG 


TAATCCTCAA 


GATACCTTTA 


GGGATTTAGA 


TGTAACTTTT 


19620 


GAAAAATCGG 


ATTTGATTTT 


GGTGGATAGA 


GAGGATAGTT 


TACGATTGTT 


CCAGGAGTTG 


19680 


TATCCTGAAC 


GAATGCATCA 


ATGTTATCAT 


TTATCATCT? 


TTGACACCCG 


ATTACGATTG 


19740 


GGACGAAGCC 


AAACTAAGAA 


AGAATCCATC 


ATTTATTTTC 


AACTGGATTT 


TGAGCAGGGG 


19800 


ATTGATAATC 


AAGCTCTGCT 


TCAAGTCTTG 


TCCTTTGTCG 


CTGAAAATAA 


GGATACTGAG 


19860 


GTGA7TTTTG 


GAGCCTTTGC 


TGCTAGTCAG 


GAGCAAATGA 


ATGAGGTTGA 


AGGGATTGTT 


19920 


CACTCTTTCA 


TCCAAGAAAA 


CATTCAATCC 


GAAAATCTGG 


GAAAGGCGAT 


TGATTATGGT 


19980 


GATGCAGAAA 


ATCCTCTGGA 


AGAAAATCAA 


CACCAGGACT 


TACGCTTACA 


GTTTGTTAAC 


20040 


TTGAATGATG 


AGTTAGATTT 


GATAAAAACA 


CTAGAATTTG 


TCCGTTTGAT 


TCTGGATTTA 


20100 


AATAGACATC 


CTCATCTCTA 


CACACAGATT 


GCTGGGATTA 


GTGCAGCAAT 


TCCTCAAATC 


20160 


AACCTAGTTG 


AAACCGTCTA 


TGTTGAACAT 


TTAAAAAATG 


GTTATTTGTT 


AGCAGATGTT 


20220 


ACAGAATTTT 


CTAAGGCTGC 


AC ATT ATT AC 


ACAGATAGGT 


TGAACGAGTG 


GAATGAGTCC 


20280 


TTGATATATT 


CAATTGATAA 


GATTAAGGAG 


CACACAGGAC 


AACAATTTCT 


TGGAAAATTA 


20340 


GAGAAATGGA 


TACAGGAGGT 


TAAAAATGTC 


AAAGGAACTT 


AATATTTTAC 


AGATAGGACT 


20400 


TGCCAATTGG 


GAAAATCACT 


ATGACATACC 


TGAAAATATG 


ACTTGGTATT 


ATTTTTACCC 


20460 


AAACTCATCA 


AAAGCCCTTC 


GTGAAATAAT 


TGAAAAAGAG 


GATATTAACC 


GTTTTCATGC 


20520 


AGTTTTAATA 


GAAGATGGTC 


AGTATTCCAG 


AGACTTATTT 


TCCTATCTAA 


AATATTTTGA 


2C53G 


ACCTTATACT 


TTATTTTATA 


ACCAGAATCT 


ACAAATAAAT 


CATAGAGAGG 


TTGTGGATTT 


2064C 


TCTAAAAAAA 


CGATGTGCAC 


AAGCAATTCA 


TTTTTTAAGT 


CCCCAACAAC 


7AATCAATGA 


20700 


TTTAAGTAAA 


TCTCTTTTTG 


GCGGTGGGTA 


TGGTGATAAA 


CTCTTTCCTC 


CGACGATACA 


20760 


AGTCAATCCA 


AATTTTACAG 


GAGCTATTTC 


TTATCAAGGA 


TTGGATTATG 


TCAGTTTGGA 


20820 


ACGTGAGTTT 


GGCCAACATT 


TTGCCCAGCT 


TGCCTATTGG 


GCTTATAATA 


TTATCGTGCA 


20880 


AAAAACACTC 


CCTATTGAGT 


TGTGGCTTGA 


ATATGAGAAG 


GAACGCAATT 


GTGACTTTCG 


20940 


TTTAGTAATC 


CGTAAAATGT 


GGAGTGGGTC 


TGTTGATGAT 


TTCTTTGAAG 


AAGTAATAGT 


21000 


ATCTGAAAAA 


GACTTGGAGC 


AAGCACTTTT 


TATGGATAGT 


CGAGACGGAG 


ACTACTTTCT 


21060 


CTCGATATCT 


GTTGAAGCAA 


GAGCTCGTGG 


AACTATCAAA 


CTAGGTAATC 


TTCACCAACG 


21120 


ATGGAGTCGA 


AAACAATTTG 


GTAAGTTTGT 


ACTTGGTGGA 


AATATCCTAC 


ATGATTCCAA 


21180 


GCGTGATCAA 


ATAAACTATT 


TCTTCCATCC 


ACGTCATTT7 


AAACCGCCT? 


TGACTGTCTA 


21240 
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TTTTGCAGCT TATCGACCTG CAGAAGGATT CGAGGGTTAC TTTATGATGA AAACTCTTGG 21300 

ATGTCCCTTC ATTTTATTTT CTCA7CCACG TTTAGAGGGG GCAGCTTTTT ATCTCGGAAC 21360 

GGATGAGCTA GAGGGAAAAG TAAAGGATAC GATCACTCAC TATCTTGATT ATTTAGGCTT 214 20 

TGATCATAAG GATTTGATTT TATCAGGTCT TTCTATGCGA ACGTTTCCGG CTCTCTATTA 214 80 

TGGTGCTTCT TTTGAACCCC ATGCCATCAT AGTTGGTAAG CCCTTGGCTA ATTTAGGAAC 21540 

TAT AG CT ACT CGTGGACGTT TGGACGCACC GGGTGTCTCT AACTTACCTT TTGATTGTTT 21600 

AATTCATCAT ACAGGTGGGA CAACTTCTCA AGATATGACG GAGTTGGATC AGCCTTTTTG 21660 

GAAAATTTTT AAACAAGCAA ATTTTTCAAA GACAACCTTT GGTTTATCCT ATATGAAAGA 21720 

TGAAGAAATG GATCCACAAG CCTATGAACA ATTAGTGTCT TATCTGTGTA ATACAGGTGC 21780 

GAAGATTTTA TCTAAAGGAA CTGCTGGACG ACACAATGAT GATACAGATA CCAATATTTC 21840 

TTGCTTTTTG CACTTTTATA GAATGGTCTT AGAGACTGGT TTTGGAAGGG AGAAAAGATG 21900 

ATTATTACAC AGAGACAGTC TATTCATTGG GGAGAAGTTG CTCGGACTTA TATGTATGGA 21960 

ACAACTGTAT CTTATTACCC TGACAAAAGT GTTCGTCTGT ATAATCCTCT ATTGCCATCT 22020 

GGTGAGATTC TAAAGACTTG GTTTTCTAGT GTCAATTACC AGGCTGCACG AACCCAACCT 22080 

CAGCTTCCCT TATTAAAAAG AAAGCAGGAG TATCAACTAT CACTGGTTTT TGACTGTCAG 22140 

CCTGAAAATG GAGTTTATAC CAAGATAACT TTTTTTGACC GCTATGGTGA TATTTTAGAA 22200 

AAAAAGGTAG AAAAAGTGAA AGATTTCATA TTTACTTATC CAGAAGATAG TTATACTTAT 22260 

CGAGTTTCTC TTTTAAGTGC TGGATTTGAG TCCTTAACTT TTTATCATTT TTCTATCAAG 223 20 

GAGATCAGAA GTGTTTAGAC GTTTAGGTCA AGATTTCCAG CTTAGGAAAG 7GAAAAACAT 22 380 

TTTAAAGCAG ATTAATGCCC TGAAAGGCAA GATGTCCTCT CTTTCGGATC AAGAATTAGT 22440 

AGCTAAAACA GTACACTTTC GTCAGCGTCT TTCCGAGGGA GAAAGTCTAC ACGATATTTT 22500 

GGTTGAAGCT TTTCCTGTGC TGCGTGAAGC AGATAAGCGG AT7TTAGGCA TCTTTCCT^A 2 2560 

TGATGTTCAA GTCATGGGAG CTATTGTCAT GCACTATGGA AATGTTCCTG AGA7GAATAC 22620 

GGGGGAAGGT AAGACCTTGA CAGCTACCAT GCCTGTCTAT TTGAACGCTT TTTCAGGAGA 22 680 

AGGAGTGATG GTTGTGACTC CTAATGAGTA TTTATCAAAG CGTGATGCCG AGGAAATGGG 22740 

TCAAGTTTAT CGTTTTCTAG GATTGACCAT TGGTGTACCA TTTACGGAAG ATCCAAAGAA 22 800 

GGAGATGAAA GCTGAAGAAA AGAAGCTTAT CTATGCTTCG CATATCATCT ACACAACCAA 22 860 

7AGTAAT7TA GGTTTTGATT ATCTAAATGA TAACCTAGCC TCGAATGAAG AAGGTAAGTT 22 9 20 

TTTACCACCG TTTAACTATG TGATTA7TCA TGAAATTGAT GATATCTTGC TTCATACTGC 229 80 
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ACAAACTCCT CTGATTATTC CGGGTTCTCC TCGTGTTCAG TCTAATTACT ATGCGATCAT 2 3 040 

TGATACACTT GTAACAACCT TGGTCGAAGG AGAGGATTAT ATCTTTAAAC AGGAGAAAGA 2 3100 

GGAGGTTTGG CTCACTACTA AGGGGGCCAA GTCTGCTCAG AATTTCCTAG GGATTGATAA 2 3160 

TTTATACAAG GAAGAGCATG CGTCTTTTGC TCGTCATTTG GTTTATGCGA TTCGAGCTCA 2 3220 

TAAGCTCTTT ACTAAAGATA AGGACTATAT CATTCGTGGA AATGAGATGG TACTGGTTGA 2 3 230 

TAAGGGAACA GGGCGTCTAA TGGAAATGAC TAAACTTCAA GGAGGTCTCC ATCAGGC7AT 2 3 340 

TGAAGCCAAG GAACATGTCA AATTATCTCC TGAGACGCGG GCTATGGCCT CGATCACCTA 2 3*100 

TCAGAGTCT? TTTAAGATGT TTAATAACAT ATCTGGTATG ACAGGGACAG GTAAGGTCGC 2 3 460 

GGAAAAAGAG TTTATTGAAA CTTACAATAT GTCTGTACTA CGCATTCCAA CCAATCGTCC 2 3520 

GAGACAACGG ATTGACTATC CAGATAATCT ATATATCACT TTACCTGAAA AAGTGTATGC 23 580 

ATCCTTGGAG TACATCAAGC AATACCATGC TAAGGGAAAT CCTTTACTCG TTTTTGTAGG 2 3 640 

CTCAGTTCAA ATGTCTCAAC TCTATTCGTC TCTCTTGTTT CGTGAAGGGA TTCCCCATAA 2 3700 

TGTCCTAAAT GCTAATAATG CGGCGCGTGA CCCTCAGATT ATCTCCGAGT CAGGTCAGAT 2 3 7 60 

CCGCGCTCTG ACAGTCCCTA CCTCTATGGC AGGACGTGGT ACGGATATCA ACCTTGCTAA 2 3820 

AGGAGTCCCA GAGCTTCGGG GCTTGATTGT TATTGGGACT GAGCGGATCG AAAGTCAGCG 23 880 

GATCGACCTA CAAATTCGTG GCCGTTCTGG TCGTCAGGGA GATCCTGGTA TGAGTAAATT 2 3 940 

TTTTGTATCC TTAGAGGATG ATGTTATCAA GAAATTTGGT CCATCTTGGG TGCATAAAAA 2 4000 

GTACAAAGAC TATCAGGTTC AAGATATGAC TCAACCGGAA GTATTGAAAC GTCGTAAATA 24060 

CCGGAAACTA GTCGAAAACG CTCAGCATGC CAGTGATAGT GCTGGACGTT CAGCACCTCG 2 4120 

TCAGACTCTG GAGTATGCTG AAAGTATGAA TATACAACGG GATATACTCT ATAAAGAGAG 2 4180 

AAATCCTC7A ATAGA7CGTT CTCCTCACTT AuAGGATCTT GTTGTCGATA TCAITGAOAG 242 40 

ATATACACAA GAGGTAGCGG CTGATCACTA TGCTAGTCGT GAATTATTCT TTCACTTTAT 24300 

TGTGACCAAT ATTACTTTTC ATGTTAAAGA CGTTCCACAT TATATAGATG TAACTGACAA 243 60 

AACTGCAGTT CGTACCTTTA TGAAGCACCT GATTGATAAA GAACTTTCTC AAAAGAAAGA 244 20 

ATT AC TTAAT CAACATGACT TATATGAACA GTTTTTACGA CTTTCACTGC TTAAAGCCAT 244 80 

TGATGACAAC TCGGTAGAGC AGGTAGACTA TCTACAACAG CTATCCATGG CTATCGGTGG 24540 

TCAATCTGCT AGTCAGAAAA ATCCAATCGT AGAGTACTAT CAAGAAGCCT ACGCGGGCTT 24 600 

TGAAGCTATG AAAGAACAGA TTCATCCCCA TATGGTGCGT AATCTCCTGA 7CGCCCTGCT 24 660 

TGAGGTCACT CCAAAAGGTG AAATCGTGAC TCATTTTCCA TAAAAGGAGA AAATATGACA 24 7 20 

ATTTACAATA TAAATTTAGG AATTGGTTGG GCTAGTAGCG GTCTTGAATA CGCTCAACCC 24 7 80 
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TATCGTGCTG GTCTTTTTCG 
ATTTTAGCCG ATAATATTCA 
ATCTGGCTTT ATAATCATTT 
GATCTCTTGG CTTACTTTGG 
CGTGTATTCT TTTTTGACCA 
GACTTCGTTC AACATGCCGA 
7CTTATACGC GTTATTGTAG 
CAACGAACTT TTTATAATGA 
AAGGAAGAAG TTTATCATTT 
GCCTTTATGA AATCTTTGAA 
GGTATTGGAC AGGTTGTGTT 
GCGGAGCATT ATAGTGAAAA 
GACTATCAGT TTACCAATGC 
AATGAAGTTC TACAAGAGCA 
ATTCCTGTAG GCAGTATTGA 
TTGATTACGG CTTCACGTCT 
ATTGAAGCTC ATAAGCAGTT 
GATTCTCTGC TTAGAGAAAT 
GGGCATGCGG AACTTTCGCA 
AGCCAACGA? TTCGTCTGAC 
TTTGATGTGC CTTATGGTAA 
CCAAGTTCAT CTGACCATGT 
CAATTGTATC AAGAAAATCG 
GGCTTCTTGA CCAAAGAAAT 
GA7TGAACT7 TATGATAGTT 
TACTGG7CT7 TC7CAACTTG 
CCTTTC7CCT TTTACCTATT 
AGTTCCCGTT TCAGATTTTT 
7G7GACGCAG GAGAGGGCTG 



589 

GAAATTAAAT CTGTCCTCTA AGT7TATC7T TACAGATATC 24840 

GCACTTAACA CCCAA7AT7G GTTT7GATCA 7AA7CAGGT7 24 900 

CACAGATA7C AAAATTGCAC CTACTAGCGT GACAGTGGAT 24960 

TGGTGAAGAA AGTCACAGAG AAAAAAATGG CAAGGTTTTA 25020 

AGATAAGTTT GTAACCTGTT ATTTGGTTGA TGAGAACAAG 2 5080 

GTATGTTTTT AAGGGAAACC TGATTCGGAA GGATTACTTT 25140 

CGAGTATTTT GCTCCCAAGG ACAATGTTGC AGTCTTATAC 25 200 

AGACGGGACT CCAGTCTATG ATATC7TCAT GAATCAAGGG 25260 

CAAGGATAAG A7TTTC7ATG GAAAGCAAGC TTTTGTGCGT 253 20 

TTTGAATAAG TCTGATTTGG TCATTCTCGA 7AGGGAGACA 253 80 

TGAGGAAGCA CAGACAGCAC ATCTACCGGT AGTTGTTCAT 25440 

TGCTACAAAT GAGGACTATA TCCTTTGGAA TAACTATTAT 25500 

AGATAAGGTT GACTTCTTTA TCGTGTCTAC TGATAGACAA 25560 

ATTTGCCAAA TATACTCAGC ATCAGCCAAA GATTGTTACC 2 5620 

TTCCTTGACA GATTCAAGTC AAGGGCGCAA ACCATTTTCA 25680 

TGCCAAAGAA AAGCACATTG ATTGGCTTGT GAAACCTGTG 25740 

ACCGGAACTA ACCTTTGATA TCTATGGTAG TGGTGGAGAA 2 5800 

TATTCCAAAT CATCAGGCAG AGGAC7A7AT CCAAC7CAAC 2 5860 

GATTTATAGC CAGTATGACG TCTACTTAAC GGCTTCTACC 25920 

CTTGATGGAA GC7A77CGT7 CAGGTCTACC TCT.^ATTGGT 25930 

TCAGACCTTT ATAGAGGATG GGCAAAATGG TTATTTGATT 2 6040 

AGAAGACCAA ATCAAGCAAG CTTATGCCGC TAAGATTTGT 26130 

TTTGGAAGCT ATGCGTGCCT ATTCTTACCA AATTGCAGAA 26160 

TTTAGAAAAC TGGAAGAAAA CAGTAGAGGA CGTCCTCCAT 2 62 20 

ACAGTCAAGA AAGTCGAGAT TTACATGAAA GTCTAGGCGC 2 52 80 

GAGTGGTCAT CGATGCAGAT GGTTTTCTGC CTGATGGTCT 2 6 340 

ATCTAGGTTA CGAGGATGGA AAACCTCTCT ATTTTAATCA 2 64 00 

GGGAAATTTT AGGAGATAAT CAGTCTGCTT GTATTGAAGA 2 64 60 

TCATTCATTA TGCTGATGGA ATGCAGGCTC CCTTGGTTAA 2 65 20 
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4 



ACAGGTAGAC 


TGGAAAGACC 


TAGAAGGTCG 


AGTACGTCAG 


CTTCACCACT 


ACAATCGCTT 


26580 


CGGAGCTTCT 


TTTGCTACAA 


CCACTTATAC 


CGCAGATAGC 


GAGCCGATTA 


TCACAGTTTA 


26640 


CCAAGATGTC 


AATGGTCAAC 


AAGTTTTACT 


GGAAAACCAT 


GTGACGGGTG 


ATATCTTATT 


26700 


GACTTTGCCA 


GGTCAGTCCA 


TGCGTTACTT 


TGCAAATAAA 


GTTGAATTTA 


TCACCTTCTT 


26760 


TTTGCAAGAT 


TTGGAAATAG 


ATACCAGTCA 


GCTTATCTTT 


AATACTCTAC 


CGACTCCTTT 


26820 


CTTGGTTTCC 


TTCCATCATC 


CAGATAAATC 


TGGCTCGGAT 


GTCTTGGTAT 


GGCAGGAACC 


25890 


TCTCTATGAT 


GCCATTCCAG 


GTAATATGCA 


CTTCATTTTC 


GAAAGTGATA 


ATGTGCGTAC 


26940 


TAAGAAGATC 


ATCATTCCAA 


ATAAGGCGAC 


TTATGAGCGC 


GCTTTAGAGT 


TAACTGACGA 


27000 


GAAATACCAT 


GATCAGTTTG 


TCCACTTCCC 


TTA7CATTAC 


CAGTTCAAAC 


CTGATAATTT 


2706C 


CCTAAGACGA 


GATGCCTTAA 


TCTTGACCAA 


TTCAGATCAG 


ATTGAGCAAG 


TAGAAGCAAT 


27120 


CGCAGGAGCC 


TTGCCTGATG 


TCACTTTCCG 


TATTGCAGCG 


GTGACAGAGA 


TGTCTTCTAA 


27130 


CCTCTTACAC 


ATGCTTTGCT 


ATCCTAATGT 


GGCCCTTTAC 


CAGAACGCTA 


GTCCACAGAA 


27240 


GATTCAGGAG 


CTGTATCAAC 


TGTCGGATAT 


TTACTTGGAT 


ATAAACCACA 


GTAATGAGTT 


27300 


GCTACAGGCA 


GTCCGTCAGG 


CCTTTGAGCA 


CAATCTCTTG 


ATTCTTCGCT 


TTAATCAGAC 


27360 


GG7GCACAAT 


AGACTTTATA 


TCGCTCCAGA 


CCATCTATTT 


GAAAGTAGTG 


AAGTTGCTGC 


27420 


TTTGCTTGAG 


ACCATTAAAT 


TGGCCCTTTC 


ACATGTTGAT 


CAAATGCGTC 


AGGCACTTCG 


27480 


CAAACAAGGC 


CAACATGCAA 


ATTATGTTGA 


CTTGGTGAGA 


TATCAGGAAA 


CCATGCAAAC 


27540 


TGTTTTAGGA 


GGCTAACATG 


TCAGAGGAAG 


ATTTATTTTA 


CAAAGACGTT 


GAAGGCCGCA 


2750C 


TGGAAGAGTT 


GAAACAAAAA 


CCCATCAAGA 


AGGAAAAACA 


AACCCCAGGG 


GAAAAGATTA 


27660 


GTAAGACTTT 


TTCACTTTTA 


CTGGCTTTGA 


TGATTCTGAT 


TGGTTTGCTC 


TTTACTTTGC 


27720 


TGGGAATTTT 


GAGGTAGATC 


TATCATTGAA 


ATAC7AATTC 


TTT7ACCTAT 


TATCCTATCT 


27730 


CTTGCTTTGA 


TTGTATTGGT 


AACTATACAA 


CCCCGTCAAA 


ATCAACTATT 


TTCCATGGAT 


27840 


GCCACTAGTA 


ATATTGGTAA 


ACCAAGCTAC 


TGGCAGAGCA 


ACACCTTGGT 


CAAGGTGCTC 


27900 


ACTTTATTGG 


TGAGTTTGGC 


TTTATTTATT 


CTACTATTAA 


CCTTTATGCT 


GATTACTTAT 


27950 


AAATAAAAGA 


AAACTTCAGA 


TATTCACCTT 


TTGTGGATTG 


GTCTGAAGTT 


TTCTTTTTTA 


28020 


TACTCAATGA 


AAATCAAAGA 


GCAAACTAGG 


AAGCTAGCCG 


CAGGCTgCTC 


AAAACACCGT 


28080 


TTTGAGGTTG 


TAGATATAAC 


TGACGAAGTC 


AGCTCAAAAC 


ACCGTTTTGA 


GGTTGTAGAT 


28140 


A7AACTGACG 


AAGTCAGCTC 


AAAACACCCT 


TTTGAGGTTG 


TGGATAGAAC 


TGACGAAGTC 


28200 


AGCTCAAAAC 


ACCGTTTTGA 


GGTTGTGGAT 


AGAACTGACG 


AAGTCAGCTC 


AAAACACCGT 


28260 


TTTGAGGTTG 


TGGATAGAAC 


TGACGAAGTC 


AGCTCAAAAC 


ACCGTTTTGA 


GGTTGTGGAT 


23320 
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AGAACTGACG AAGctCAGTA AC AT AT AT AC AGCAAGGCGA CGCTGACGTG GTTTGAAGAG 28380 

TATTACTGTC TATATTTTTG GTAAAAATCA ACTTTTACTT GCATGAACCT TTTGGCTTCA 28440 

CGTAGGAGTT GAAGAAGGGT GGCGCGGGTT TCAAATTCTT CTCTTGTCTT GGGCAGACTG 28500 

CGGTTCCGGA AGACTTCCAG ATAACGTTCA ATTTCATCTA GCAAATCAGA AGCACGATTG 28560 

GTCTGGCTCA GTTGACCTGC AATTTTTGAA AAGAGTTGCG CTAAGATCAG GCTTTCACTG 28620 

GCGGCAAGGT GACAAGTGT? AATCTGTTGG GCCATGTTTC TCAGGATACG ACTTTGTCGC 28680 

TGTCTCATCT CAAAGTAGTG GATATGGTAG TCTGTCTGGT GAAAGAGGTG GTCAGAGTGA 287 40 

TCCAAATAGA CCAGTCTGAG GGCTTCTTTC AAAAGCGTGT CTAATTCTGC TACCAGCTGT 28800 

GCTCGGTTGC GTCCGTCTCC TCTGGATAAA TAGTATTTGA AGCGCTGGAG CATATCTTTT 28860 

AACTTTTCTT CCACCAGCGT GTGGTAGTGC TGGATTTCCT CTTCTCGTGA AGGCATATAG 28920 

AGATTAACAA GCAAGGCAAA TCCTGTACCA ATAGCAAAGA GAAGGAATTC ATTGACTAGA 2 8980 

AGGTCTCGAG AGGTTGACTC TTGAACCAAG AGATGGCTAA CCAAAACAGT GCTTGGTGTG 29040 

ATGCCAATTT CCCAGCCCAT CTTGTAGGCT AAAGGAACGT AGAAGGCCAG ATAGAGGCCG 29100 

AGACTCCAGA TATGAAATCC GCTCAAGTGA AAAGCTAGAA CACCGATAGC CACACCTACA 29160 

AGCATAGAAA AAAGACGATT GCGAGCCAGT TTTAAACTAC TTCTACGCCT ATCAGATAGG 29220 

CTCAAGAGAG CGATAATTCC AGCCGAAACT GCTGACGAAA GATTGAGAAA ATAAGCAAGC 29280 

AGGCAGGCAA GACAGGTAGC TAAGATGAGC TTGGTCGTAC GTTCGCTAAT AGACATAAGA 29340 

ATTTCCTAAT AAGTTAGAAT AAAACCCTAA AAGACAAGAC ATGACCAGCC TTGCCTTGAT 29400 

GAGTTATTTT TTACGGGTTG CTGCGTATTC GGCAACGGCG GTAAAGAGGA CATCTGTAGA 29460 

AGAGTTAAGG GCTGTTTCAC ATGAGTCTTG GATGACACCA ATCACAAAAC CAACCCCAAC 2952 0 

AATTTGTATG GCAATATCGT TAGAAATACC GAAAAGGCTA CAAGCAACTG GGATAAGAAG 2 9580 

GAGGGAACCT CCGGCAATAC CTGAAGCATC ACAGGATGAG ATAGCTGCTA CCACACTCAG 2 9640 

GACAAAGGCT GTGGCAAAGT CAACAGGAAT TCCAAGACTG TTAACTGCAG CAAGGGTCAA 2 9700 

AAGGTTAATG GTAATCGCTA CTCCAGCCAT ATTGATAGTA GAACCGAG7G GGATAGAAAC 297 60 

AGAATAGGTA TCTGGGTTGA GTCCAAGGTC ATGGCAGAGT TTCATGTTGA CAGGAATGTT 29820 

AGTCGCAGAA CTACGAGTGA AAAAGGCTGT CACACCGCTG ACACGGAGGC AGTTCCAAAC 29880 

TAGAGGGTAA GGATTGCGTC T C AT AAAGAA GAAGGCAATC AAAGGGTTGA CCACAGGGGC 29940 

AACAAAAAGC ATAGTCGTTA CTAATAGAAC CAATAAAATA CCGTAGTTCC CAAGGCTTCC 3 0000 

GACTCCCTTG TCAGAAATGG TTTTAAAAAC AAGACCAAGG ATTCCAAATG GAGCCAGATT 3 0060 
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GATGATCCAT 


TCGACAATTT 


TAGAAGTCAC 


GTCAGCCATA GTTTTTAGCA 


ATTCTTGACT 


30120 


ATTTTTACTG 


GCTTCTCTCA 


TAGCGATTCC 


AAAAATGACT 


CCCCAAGATA 


AGATTCTAAT 


30180 


ATAGTTAGCA 


GTAAGCAGGG 


CGTTGACTGG 


GTTGTCAACC 


AGTTTGAGCA 


AGAGGTTGCT 


30240 


GAGAACCTGC 


CCAATCCCAT 


CTGGTGGTGC 


AATTTCAGTA 


TTGGCACTAT 


TTGCGCTAAT 


30300 


TTCAATAGGG 


ACGATGAAAT 


TTGCTAGTAC 


AGCTACAAGA 


GC AG CGGCGA 


AAGTCCCTAT 


30360 


CATAGGATAT 


ACAAGAAAAC 


AACAGTTTTC 


ATATTCCTAT 


CTTGTCCCTT 


TTGATGTTGG 


30420 


GAAAGGGCAT 


TGGCAACGAG 


AGCAAAGACT 


AGGATAGGAG 


CAACAGCTTT 


TACACCTCCA 


30460 


ACGAATAAAT 


CCTCGAGTAG 


CCCAATCCCT 


GAG AG AT TAG 


GAAGGGTCAG 


TCCTAGGATT 


30540 


CCCCACAAAG 


CATACCAATC 


AAGATACGCT 


TCACAACCCT 


TCCCTTATTC 


CAACCATGAA 


30600 


TGATTCTTTT 


CATAATAATC 


TCCTTTTTGT 


GTACTGATTA 


TGATTATAGT 


ATAAATGATA 


30660 


GACAAAATCA 


AGAATTTTCT 


GTCTATTTTT 


TGAATATT7A 


TGGAGAATGA 


CACTCATGAA 


30720 


AATATGGTAT 


AATGAAATAA 


ACCAGTTTTA 


TATGCAAAAA 


TTTATTCAGG 


CTTATATTGA 


30780 


AAAGCTAGAT 


GTGACAACCA 


TTATCGAGAA 


TATTCTAACC 


AAGGTCATTT 


CTCTTTTACT 


30840 


GCTTTTAATT 


GTATTTTATA 


TTGCTAAAAA 


AATGCTTCAT 


ACCATGGTCC 


AGAGAATTGT 


30900 


CAAACCTTCT 


CTAAAAATGT 


CTCGTCATGA 


TGTTGGACGC 


CAAAAAACCA 


TCTCACCTTT 


30960 


ACTAGAAAAT 


GTGTTTAATT 


ATACGCTATA 


TTTCTTTTTA 


CTCTACTGCA 


TTTTGTCGAT 


31020 


TTTACGTTTG 


CCAGTTTCTA 


GTTTGCTGCC 


TGGAGCTGGT 


ATTCCTGGGG 


TAGCGATTCC 


31080 


TATGGGAGCC 


CAAGGCTTTC 


TGTCTGATGT 


CATCAATGGC 


TTTTTCATCC 


TCTTTGAACG 


31140 


TCAACTGGAT 


GTGGGAGATG 


AGGTCGTTCT 


GACAAATGGA 


CCGATTACTG 


TATCGGGTAA 


31200 


GGTTGTCAGT 


GTGGGAATTC 


GTACGACACA 


GCTTCGTAGC 


GAGGAGCAAG 


cccttcactt 


31260 


TGTCCCTAAC 


CGAAATATCA 


CAGTTGT7AG 


CAATTTCTCA 


CGCACAGAC7 


AGACCTGTTA 


31320 


TTTTAAGTAA 


TTTGTGGTAC 


AATAGAGGGA 


GTTTAATAAG 


GAGAAAAGAT 


ccttttagaa 


31380 


AAGCAGTTGG 


GCAATCGTTG 


TACCTGGATA 


GACCTAGACC 


TAGGAAAGTT 


GAATAAACTA 


31440 


GAAGACCTTT 


CTCAAATTTA 


CGGTTTGGAC 


AAGGAAACCA 


TTGAATACGC 


ACTGGATAGA 


31500 


AACGAGCGCG 


CCCACATGCA 


CTACCACCGT 


GAAAGTGAGA 


CGGTTACCTT 


tatctataat 


31560 


GTCTTAGACG 


TAAAAAAGGA 


CAAGGCCTAC 


TATGAGACTT 


TTCCCATGAC 


ctttattgtc 


31620 


GAGCATCGTC 


GCCTGATTAC 


CATTAGTAA? 


ACCAAGAACG 


CCTATGTCAT 


tgaacagatc 


31680 


ACTCGTTATC 


TGGAGAACCA 


TGACACGCTT 


TCGATTTATA 


AGTTTCTCTT 


tcccagtctc 


31740 


GAAATCATCA 


GCAATGCCTA 


CTATCCTGTC 


ATTGAGCAGA 


TGGACAAGAG 


tagggatgag 


31800 


GTCAATGACC 


TCTTGCGCCA 


GCGAACTACC 


AAGAAAAACC 


TCTTTGTO^T 


gtctgatttg 


31860 
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CAGACTCCTA TCGTTTATCT GACGGCAGCT GCCAAACAAA ATCGCATTTT GTTAGAGCAT 3 19 20 

ATTCAAGGTC ATGCCTTGTA TCGTAGTTTT GATGAGATTG AGAGAGAACA GTTTGATGAT 31930 

GCCATGATTG AGGCTCATCA GCTGGTATCC ATGACAGACC TAATCTCTCA GATTTTACAG 32040 

CAGCTTTCAG CCTCTTACAA CAATATTCTA AACAATAATC TGAATGACAA TTTGACAACC 32100 

TTGACTATCA TTTCAGTCTT GCTAGCTGTT TTGGCAGTCG TGACAGGCTT TTTCGGAATG 3 2163 



32768 



AATGTTCCCT TACCTTTAAC AGATGAGCCC CATGCTTGGC TCTATATCAG TTTGGCTAGT 3 222C 

GCAGGTTTGT GGATTGTTTT ATCCTT G TTA CTAAGGAAAA TTGCGAAAAA AAGTTAAGAA 3 2230 

AAGGAGCCAG AATCGCGATT GAAAATTATA TACCAGATTT TGCTGTGGAA GCAGTCTATG 3 2 340 

ATCTGACAGT CCCAAGCCTG CAGGCGCAGG GAATAAAGGC TGTTTTCGTC GATTTGGATA 3 2400 

ATACCCTCAT TGCTTGGAAC AACCCTCATG GAACGCCAGA GATGAAGCAA TGGCTACATG 3 24 60 

ACCTTCGGGA CGCGGGTATT CGCATTATCG TAGTGTCAAA TAACACCAAA AAACCCGTTC 32520 

AACGAGCAGT TCAGAAATTT GGGATTGATT ACCTTTACTG GGCCTTGAAG CCCTTCACAT 3 2580 

TTGGTATTGA CCGTGCTATG AAGGAATTCC ACTATGACAA AAAGGAAGTG GTCATGGTTG 3 2640 

GTGACCAACT CATGACAGAT ATACGAGCAG CCCACCGTCC ACGGATTCGC TCAATTTTAG 3 27 00 

TCAAACCCTT GGTCCAACAT GACTCAATCA AAACGCAGAT TAACCGAACT CGTGAGCGTC 3 2 760 
GTGTTATG 

(2i INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14872 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
( D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CCAGTCACAA AGAAATTGAG CGCGTTCAGc TGAGGATGCA CTATGATGCA AGCTACATTT 
CATTTGATGG GATATTAAGA AAGGAGATTT TCATGACACT TTTAGATGTA AAACACGTTC 
AAAAAATTTA TAAAACACGT TTTCAGGGCA ACCAAGTAGA AGCCCTCAAG GATATTCACT 180 
TTACCGTAGA AAAGGGTGAC TACGTTGCCA TCATGCCTGA GTCTGGTTCT GGTAAATCAA 240 
CTCTTCTCAA TATTCTAGCT ATGTTGGATA AACCAAGTCG TGGTCAGGTT TACTTGAATG 
GAACTCACAC CGCAACTATT AAAAATTCAC AGCCTTCTAG TTTCCGGCGT GAAAAGCTAG 
GATTTGTCTT CCAAGACTTT AACTTGCTAG ATACTCTGTC TCTTAACCAC AATATCTTGC 



60 
120 



30C 
360 
420 



WO 98/18931 



PCT/US97/19588 



594 

TTCCGCTTGT CTTGTCAAGA ACACCTATAA CCGAGATGAT GAAGAAATTG GTGGTGACAG 4 90 

CTGAGAATCT GGGTATTAAC CAATTGCAAG AGAAGTACCC TTACGACATT TCTGCTGGTC S4 0 

AGAAACAGCG TGTAGCAGTA GCCCGCGCCA TCATCACAGA ACCTGAAATT CTCCTTGCGG 600 

ACGAGCCAAC AGGAGCCCTT GATTCCAAGT CATCTGCAGC CTTACTTGAT GTCTTTAATG 660 

AAATCAATGA GCGTGGGCAA ACCATCCTCA TGGTAACCCA CTCAACAGCA GCTGCTAGCA 72 3 

GGGCCAAGCG TGTTCTCTTT ATCAAAGACG GCATTCTTTA CAACCAAATC TACCGTGGAG 730 

AGAAGACAGA GCGTCACATG TTCCAAGAAA TCTCTGATAC CTTGACTGTC ATGGCAAGCG 840 

AGGTGAATTA GTATGTTTCG ATTAACCAAT AAGTTAGCGG TA7CCAACTT GATTAAAAAC 900 

CGCAAACTCT ACTATCCCTT TGCACTGGCT GTTCTCTTGG CAGTCACCAT CACCTATCTC 960 

TTTTACTCCC TAACCTTCAA TCCAAAGATT GCGGAAATCC GTGGAGGAAC CACCATTCAA 1020 

GCAACACTTG CATTTGCTAT GTTTGTCGTT ACCCTTGCGT CAcCATTATC GTCCTCTATG 1080 

CCAATACTTT TGTCATGAAA AACCGTTCCA AGGAACTGGG TATATATGGC ATGTTAGGCT 1140 

TGGAGAAGCG CCATCTAATC ACTATCACCT . TTAACCAGTT AGTGGTATTT GGGATTCTAA 1200 

CTGTTGGAGC GGGTATCGGT ATTGGAGCCT TGTTTGACAA GTTAATTTTC GCTTTCCTGC 1260 

TCAAACTAAT GAAACTGAAG CTTGAGCTCG TTGCTACCTT CCAAATGAAT GTTGTCATTG 13 20 

CAGTACTTGT TGTCTTTGGA TTGATTTTCC TAGGCCTCAT GTTCCTGAAT GCTCTTCGAA 1380 

TCGCCCGTAT GAATGCCCTC CAGCTCTCGC GTGAGAAACC AAGCGGAGAG AAAACAGGTC 1440 

GCTTCCTACC TCTCCAAACG ATTCTTGGTT CCATAAGTTT AGGGATTGGC TATTATCTTG 1500 

CCCTTACGGT AACCGATCCT CTTACAGCCC TAACAACTTT CTTCCTACCT GTTTTGCTGG 1560 

TTATCTTTGG TACTTATCTA TTCTTTAATG CAGGGATTAC ACTCTTCCTA CAAATCTTAA 1620 

AGAAAAACAA GAAATACTAT TACCAACCTA ATAACCTCAT ATCTGTTTCC AACTTGATTT 1680 

TCCGTATGAA GAAAAATGCG GTTGGACTAC CAACCATCGC TATTTTGTCA ACAATGGTTT 174 0 

TGCTAACCAT GTCAGCAGCG ACAAGCATTT TCAATTCCGC AGAAAGCTTT AAAAAAGTTC 13CO 

TAAATCCTCA TGATTTTGGG GTTTCAGGGC AAAATGTTGA AAAAGAAGAT TTGGACAAAC 18 60 

TCTTGAGCCA GTTTGCAAGT GACAAAGGTT ATAGTGTCAA AGAGAAAGAA GTACTTCGTT 1920 

ACAGTAACTT TGGTATTGCA AATCAAGAAG GAACCAAGTT AACTATTTTT GAAAAAGGAC 1980 

AAAACCGTGT CCAACCCACA ACAGTTTTCA TGGTATTTGA CCAAAAAGAT TATGAAAATA 2040 

TGACTGGTCA AAAACTGTCT CTATCAGGAA ATGAGGTCGG TCTCTTTGCC AAAAA7GACG 2100 

GACTGAAAGG ACAGAAACCT CTAACTCTAA ATGATCATCA ATTTTCTGTC AAAGAAGAAT 2160 

TTAATAAAGA TTTCATTGTG AACCATGTTC CAAATAAGTT TAATATCTTG ACTACTGATT 2 22C 
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ACAATTACCT TGTTGTTCCT GATTTACAAG CCTTTTTGGA TCAATTCCCA GATTCGGCTA 2230 

TCTATAATCA GTTTTACGGT GGTATGAATG TAAATGTCAG TGAAGAAGAA CAACTCAAGG 2 3 40 

TCGCTGAGGA GTATGAAAAC TACCTCAATC AATTTAATGC TCAATTAGAC ACAGAAGGTA 2 400 

GCTATGTTTA TGGTAGCAAT CTAGCAGATG CTAGTTCTCA GATGAGTGCC CTCTTTGGTC 24 60 

GTGTCTTCTT TATCGGTATT TTCCTATCCA TTATCTTTAT GG7CGGAACT GTTCTGGTCA 2520 

TCTACTACAA ACAAATT7CT GAAGGCTACG AAGACCCTGA ACGCTTTATT ATCTTGCAGA 2580 

AACTCCGTTT GGACCAAAAG CAAATCAAGC AAACCATCAA CAAACAGGTT TTAACTCTTT 2 640 

TCTTCCTTCC TTTGCTCTTT GCCTTCATAC ATC7CCCCTT TGCCTACCAT ATGCTTAGCC 2700 

TGATTTTAAA AGTGATTGGT GTACTGGATA CGACTATGAT GTTGATTGTG ACCTTGTCTA 2760 

7CTCCGCTAT CTTCCTCATC GCCTATGTGC TGATTTTCAT GATTACTTCA AGAAGTTATC 2 820 

GCAAGATTGT GCAAATGTAA AAAAGATACC TCGACTTCAA AATC GAGGT A TTTCTTGTAT 2880 

TCTAAATGCT GAAAAGTTGT CCGAGCAGGA AGGTAACTCC CATCGTCAAG AGACCAATAG 2940 

CAAGGTTCCG AATCATAGCT GTTTTGGTTG GGCCTTTTCC AAGTCTAGCA CTTGTGTAAC 3000 

CAGTGAGAAG AAGGGCCACA CCGACAATAA GGACGGTAGC AGGGATGCGG TAATCACTTG 3060 

GAAAAATGGT CACTGACAGC ATTGGAGGCA AACTTCTAAG GAAAAAGGCA ACGAAGCTAG 3120 

AAATGGCAGC GTGCCAAGGA TTGGTAAATT CTTCATACTC AATC CC AT AT TTTTCCTCTA 3180 

CCAGAGCCTT GAGTGGATTT TTAAGAAAGA TCTTATTGGT CAAGAGTTGG GCAGAAGTTT 3240 

TGAATTCTCC ATTTTGGATA TAAGCAGCAT AGAGCGATTT TTTGGCTAGT TCCCTATCT? 3 303 

GGTCTAGCAA GAGTTTTTCT CGCGAAACGG CAGCTTCCTC GGTATCTTTT CGAGTTGAAA 3 3 60 

CCGA7ACATA TTCTCCACCA GCCATTGAAA AGGCACCAGC TAACA7AGCC GTAAAACCTG 3420 

ATAAAAAGAT AATCCAGATA TTCGTCGTGG CACTGGCAAC TCCGATAACC ACACCAGCAA 3480 

TGGAAATAAT TCCATCGTTA GCATCAAGAA CACCCGCACG CACGATATTT AAACGACCTG 3540 

CAAAATTTGA ATCAATTTCG TGATTTGTTT CTGACGCTAA ATTTCAAGTT CAAGTTAGCC 3600 

ATCAAGAAG7 CTTCTCTGGG TGACTTGTAG TCCAAGCATT TTTTACGATA GTTGTTAATC 3 660 

CACTTTTCGA TGAATGCCAC TTCTTTGGGA GTCATTTTCT TCGTTCCCTT AGGTAACCAT 3 7 20 

CTACGAATGA GCCTGTTGTG ATTCTCATTA GTTCCCCTTT CCCAAGAGGC ATAGGGATGT 37 80 

GCATAATAAA TGTGCTCCTC AGAAAATACA 7TAGACAAGC GATTGAATTC CGTTCCATTA 3840 

TCTGCCGTGA TGGAAAGAAT CTTGTGTTGT TTTAAGATGA GTTTTAGAGC CTGATTGACC 39CC 

ACATCAGCAC TTTTATTTGG AATCAATCGG ATGATCTGAT GTC7ACTTTT TCGATCCG7C 3960 
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AAGACAAGCA 


AGCAGTAGTT 


TTTCGCTCTC 


GTAAGTAGAA 


CTGTATCAAT 


CTCATAATGC 


4020 


CCATTCTCCA 


AGCGAAAATT 


GATAGCTTCA 


AGCCGCTGTT 


CGATGGATTG 


ACCAGCAGGT 


4080 


TTAAACTTGG 


TGCTGCCCTG 


TTTCTTAAGC 


GCTTTTCCTT 


TTCTAGGGTA 


AAGCAGATCC 


4140 


TGTTTCCTTA 


ACCCCAATTT 


TCCATGATGA 


ATCCAATAGT 


AAATGGTTGA 


AATTCCCACG 


4200 


TTAACCCCTT 


TAGCCATCAC 


CATCATTTCA 


GGCGAAAATT 


TTTGGTTATG 


ATAGTGGAGA 


4260 


ATCTTTTCCT 


TTAGTTCCTT 


GGTCAAGCTT 


GATTTCTTGA 


CCGAGCGCTT 


CCGATTGTT? 


4320 


TCATAAGACT 


GTTGAGCATA 


GTCGGCAGAA 


TAAACCTCTT 


TGAAGCGCCC 


TTTTCCAAGA 


43SO 


CATTGTCGGA 


CTCTCCCACG 


CTTGATTTCA 


GTGTGGATAG 


TTTGAGGAAC 


TTTTCCAAGC 


4440 


AGAGAGGCAA 


TTTCTCTATT 


TGATTTCCC7 


TCTTTTTTCC 


ATCTTTCGAT 


TAAGCGACGG 


4 500 


CTATCGATTG 


TCAAATGTTC 


GCCTTTTGTA 


GTATAATGGT 


TTTGCATCTC 


TGTGCCTTTC 


4560 


TTGTGTTTGT 


GGTTGAACAA 


CAAGTATAAC 


ACAGAGGTGT 


TTTCTTATGC 


CTACAAGAGC 


4620 


TATCGGCTAG 


TTGAACCATC 


TAATTTTTAG 


GAGGGCTGGG 


TGGCTAACTT 


CATTATAGAA 


4680 


CTTTCATTTA 


CGAACATATA 


GTAAAATGAA 


ACAAGAACAG 


AACAAATCGA 


TCAGGACAGT 


4740 


AAAATCTATT 


TCTAACAATC 


TTTTAGAAGC 


AGAGGTGTAC 


TATTCTAGTT 


TCAATCTATT 


4800 


ATATTTTTGT 


TTTTTATCAA 


AAAATACTTT 


ACAAGTTCTT 


AAAAACATGA 


TATAGTAATA 


4860 


AAGCTTAGAA 


AATGAGATGA 


TGTTTTCTAC 


CAAATATAAA 


CCCGAGTAAA 


AAATGCCTAC 


4920 


GGACAGGCAG 


GGTTGAATGC 


CGAAGCGTGG 


TTGAAAAGCC 


ACATTATTGA 


TAGGGTTAAA 


4980 


AGCCTACTTT 


TATAAGTTGA 


TGTTAGGACA 


CTTGTCCTAA 


TTCATAAATT 


TTTAGTGTGG 


5040 


TGAAAGCACA 


CGTCATCTTG 


TGAAACGATC 


AATAAAGTAC 


GTAATATTTG 


CT AC TAG AG A 


5100 


GTTAGGAAAC 


ATCGGGAACA 


GACATACTCA 


ACAGAAACCA 


AAATAAACAC 


GTCAGAAGAT 


5160 


TGCAGAGCAG 


GTGAAAACC7 


GCTCTTTTTT 


CATGAGTCAA 


CCTTTAGTTC 


CTTAGTTTTC 


5220 


ATAAGGTCCT 


AAAAATATTG 


AAAGGAGTAT 


GTTTTGAAAG 


AGTTAGATCA 


AAACCAAGCC 


5280 


CCAATTTATG 


AGCCCTTGGT 


GAAGTTACGC 


AAGAAAAGGA 


TTGTTCCCTT 


TGATGTTCCA 


5340 


GGTCACAACC 


GTGGACGGGG 


AAATCCAGAA 


CTTGTCGAAC 


TCTTACGAGA 


AAAATGTGTA 


5400 


GGCATTGATG 


TCAATTCGAT 


GAAACCTTTG 


GATAATTTAG 


GCCATCCTAT 


TTCGATTATT 


5460 


CGTGATGCAG 


AGGAGCTGGC 


TGCAGATGCT 


TTTGGAGCTA 


GCCATGCC7T 


TCTAATGATT 


5520 


GGTGGAACAA 


CTTCATCGGT 


GCAGACTATG 


ATTCTGGCAA 


CCTGCAAGGC 


AGGAGATAAG 


5580 


ATTATTCTGC 


CACGAAATGT 


CCATAAATCT 


GCTATCAATG 


CGTTGCTTCT 


ATCTCGTGCC 


5640 


ATTCCCATCT 


ATATCGAGAT 


GAGTGTAGAT 


CCTAAGATTG 


GTATCGCTTT 


AGG7CTTGAA 


5700 


AATGACCGAG 


TAGCACAGGC 


CATAAAGGAC 


CATCCAGATG 


CTAAGGCTAT 


CCTAATCAAC 


5760 
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AATCCTACTT ACTACGGCAT CTGTTCAGAC CTAAAGGGGT TGACAGAAAT GGCTCATGAA 5820 

GCTGGCATGA TGGTTTTAGT AGATGAAGCC CACGGAGCGC ATTTGCATTT CACTGATAAA 5880 

CTTCCAATTT CTGCTATGGA TGCAGGGGCT GATATGGCAG CAGTTTCCAT GCATAAGTCT 594 C 

GGTGGGAGTT TCACCCAAAG CTCCATTTTA CTTATCGGGG AGCAGATGAA TTCTGAATAC 6000 

GTTCGTCAGA 7AATTAACCT GACCCAGTCT ACATCTGCCT CTTACTTGTT GATGGC7AGT 6060 

TTGGATATTT CACGTCGCAA CTTGGCCCTT CGTGGTAAAG AGTCGTTTGA GAAAGTCATT 6120 

GAGCTATCTG AGTATGCCCG CC GTGAAATC AATGCTATCG GTGGCTACTA TGCCTACTCA 6130 

AAAGAGTTAA TAGACGGTGT TTCGGTTTGC GATTTTGACG TAACTAAGCT GTCAGTTTAC 6240 

ACTCAGGGTA TTGGCTTAAC AGGTATCGAG GTTTATGACC TCTTGCGAGA CGAATACGAC 6300 

ATTCAGATCG AGTTTGGTGA TATCGGCAAT ATCTTGGCCT ATATTTCCAT CGGCCACCGC 6 360 

ATCCAAGACA TCGAGCGCTT GGTTGGTGCT CTGGCTGATA TTAAGAGACT CTATTCAAGA 6420 

GATGGAAAAG ATTTGATAGC AGGAGAATAT ATTCAGCCCG AGTTAGTGCT GTCTCCGCAA 6480 

GAAGCCTTCT ATTCAGAAAG AAAAAGTTTA ACTTTGGATG ATTCTCTTGG ACACCTCTCT 6540 

GGAGAATTTG TTATGTGTTA CCCTCCAGGT ATTCCTATCT TCCCTCCTGG TGAACGCATT 6600 

ACACGAGAAA TTGTCGACTA TATCCAATTC GCCAACGAAC GTGGTTGCTC CCTCCAAGGG 6 660 

ACGGAAGATC CAGAGGTCAA TCATATCAAC GTTATTAAGA GAAAGACAAA CTATAAGAAA 6720 

AGTCAATAGT TTTATCTAAA CTATTTCTTA TTTCAATTTG ATGATTTGCC GATGATTTTA 67 80 

GAGCACGGCA AAAAGCCCTT GAATTAGAAG CGGTCAATCG CTTAATTTCT ATCAGCTTAT 6 840 

CAAATCCTGC CTCAAGCCTT TTCTGAGGAT TAGGGTAGCG TGTCAAGAGT TGGTAGGTAT 6900 

ATTCTGAATG CTTTCCAACG ATTTTATCCA ACTCACGAAA GATGATATCA AGACAACGAC 6960 

TGTATTGTAC TTTCCAATCA GACTGTTTTT TCTTGAGACG ATGAATATGT CTAGCCAGTA 7020 

TTTTTACTTC TACTTGCCGA TTATCGTGTT GAAATTGTTC ACGATTGGGG TCAGAAAGAA 7080 

GTTTAAGAGC GATGCCATGA GCGTCTTTCT TATCCGTTTT AGTTTTGCGA AGTGATAATG 7140 

ATTTGGCAAA TTTCTTGATG AGCAAAGGAT TGTAGGTCTA AACTTTATAT CCTTGTTCAT 7 200 

GCAGGAAGTT CAGTAGATTA AAGGCATAAT GTCCGGTATT TTCAAGAGCG ATGAGACAGT 72 60 

CTTGGTTGAG CTGTCGAAGA GACAGATCTA AGAGTTCAAA ACCAGCTTTA TTATTTGAAA 7 320 

AAGTGAGTGG TTTAAGAACA GTTTTTCCTG GAACATTCAA GGCTGTAACA TCGTGTTTAT 7380 

TTTTAGCGAC ATCAATGCCC ACATAAAGCA TGGGAGTATC TCCAGATATA GTATTTCAAG 7 440 

TCTACTGGGT TATCCACGAA CTTTTTGCCT TGTTACCTTA GACGAGATAA AACGTCTATG 7 500 
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CGTTATCAAA CTCATTACCA ATTGAAACAA AAAACTGTGC TTAGAGCCTT TCGGAAATCG 7560 

TCAAGCGATT GGAGGAAATG AACTAATCCA CAGTGCCTTA TTCCAAGTAT ACCACTTGGG 7620 

CTTTGGCAGT AGCTAACTGC GCTAAATATA ATATAAGGAG AAATAGATGG ATTTATGGTT 7680 

TTCTGAAGTT CATACTCCAG ATGTCAAATT GTCTCTGAGA ACAGCCAAGC AACTTTACGC 7740 

TGGAAAAAGT GAATGGCAGG ATATCGAAGT CTTGGATACG CCACCTTTTG GCAAAATACT 7800 

GATTTTAAAT GGCCATGTCT TGTTCTCAGA TGCGGATGAT TTCGTCTACA ATGAAATGAC 78 60 

CGTTCACGTT CCCATGCCTG TCCACCCAAA TCCAAAGAAA GTATTGGTTA TTGGGGGTGG 792 0 

TGACGGCGGT GTTGCCCAAG TATTAACCCT CTATCCTGAA CTGGAGCAAA TTGATATTGT 7 980 

GGAACCGGAT CAGATGTTGG TCGAGGTCTG TCGTGACTAT TTCCCAGACT TTGCTGCAGG 8040 

GCTAGATGAT CCTCGTGTTA CCATTTACTA CCAAAATGGG CTACGCTTTT TGCGAAACTG 8100 

CGAAGATGAT TACCATATTA TCATCAACGA TGCGACAGAT CCATTTGGCC ATACGGAAGG 8160 

ACTCTTTACC AAGGAATTCT ACGGCAATAG TTATCGAGCT CTGAAGGAAG ACGGCATCAT 8220 

CATTTACCAG CATGGGAG7C CCTTCTTTGA CGAGGATGAG TCGGCCTGCC GAACCATGCA 8280 

CCCCAAGGTC AATCAAGCCT TTCCAATCAG TCGGGTTTAT CAGGCCCATA TTCCAACTAG 8340 

CCCAGCTGGC TATTGGTTGT TTGGATTTGC ATCGAAAAAA TACCACCCTG TCAAAGATTT 3400 

TGACAACGAA GGCTGCAAAA AACGCCAGCT TTTCACAGAA TACTACACTG CAAACTTACA 84 60 

CGTGGGAGCC TTTATGTTGC CCAAGTATGT TGAGGACATT TTACAAGAAG AGGAAGGAAA 8520 

AAAATGAGTC GTTTACTAGT TATTGGTTGT GCGGGCCTTG CCCAAGTTCC TATTTCAAAG 8580 

ATTTGTCAAG ATAGCGAAAC ATTTACAGAG ATTATCATTG CTAGCCGTAC CAAGTCAAAA 864 0 

TGCGATGACT TGAAAGCGAA GCTAGAAGGC AAAACAAGTA CTAAAATTGA AACTGCACCA 8700 

CTTGATGCTG ACAAGGTTGA AGAAGTGATT GCCCTGATTG AAAGCTACAA ACCAGAAGCT 87 60 

CTTTTGAATG TAGCTCTGCC TTATCAAGAT TTAACCATTA TGGATGCTTG TTTGGCAACA 8820 

GGTGTTCACT ATATCGATAC AGCCAACTAC GAAGCAGAAG ACACAGAAGA CCCTGAGTGG 8880 

CGTGCTATCT ACGAAAAACG TTGTAAGGAA CTTGGTTTTA CAGCCTACTT TGACTACTCA 8940 

TGGCAGTGCG CTTATCAAGA GAAATTCAAA GAAGCAGGCT TGACTGCTCT TCTTGGTTCT 900C 

GGTTTTGACC CAGGTCTAAC TAGTGTCTTT TCAGCTTATG CCCTCAAACA CTATTTTGAT 9060 

GAAATCCATT ATATCGACAT TTTAGACTGT AATGGCGGTC ACCACGGTTA TCCATTTGCA 9120 

ACCAACTTTA ATCC AGAAAT TAATCTCCGT GAGGTTTCTG CGCCAGGTTC TTACTGGCAA 918C 

GATGGGAAAT GGGTCGAAGT CGAAGCTATG TCTATCAAGC GTGAGTATGA TTTCCCTCAA 9240 

GTTGGACAAA AAGATATGTA TCTCCTTCAC CATGAAGAAA TCGAATCATT CGCCAAGAAC 9 300 
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ATTCCAGGTG TCAAACGCA7 TCCTTTCTTT ATGACTTTTC GTCAATCTTA CTTGACGCAC 9 360 

ATGAAATGTC TTGAAAATGT TGGACTCCTT CGTACGGATA CCATTAACT7 TAACGGCCAA 94 20 

GAAATTGTTC CAATTCAATT TTTGAAAGCC TTGCTTCCAG ATCCTGCCAG 7CTTCGGCCA 9480 

CCTACAGTCG GAAAAACCAA TATTGGATGT ATCTTTACAG GTGTCAAAGA CGGTGTCAAA 9540 

AAGACTATCT ATATCTACAA TGTCTGCGAC CATCAGGAAT CTTACGCAGA GGTTGGTTCG 9600 

CAAGCTATTT CTTATACCAC AGGAGTTCCA GCCATGATTG GGACAAAAT? ACTCA7CAAC 9660 

GGAACTTGGA AACAAGCTGG AGTGTATAAC CTTGAGGAC7 TAGATCCAGA TCCATTCATG 97 20 

GAAGCTTTGA -ATGAGTATGC TTTGCCATGG GTTGTGGTTG AAAATCCACA AATGGTCGAC 9 7 90 

TAATGAAGTT AGAACAAGTA CCAACACCAG CCTATGTTAT TGACTTCGCC AAGTTAGAAG 9840 

CTAATTGCCG CATTCTACAA TATGTACAAG AAGAGCCCGG TTCCAAGG7C TTGCTTGCCC 9900 

AGAAGGCATA TTCCCTCTAC AAAACTTATC CCTTGATTAG CCAGTATCTA TCAGGTACGA 9960 

CAGCTAGTGG ACTCTATGAG GCCAAATTGG CAAGGGAAGA ATTTCCTGGT GAAGTCCATG 10020 

TATTTGCGCC TGCTTTCAAG CATCCAGACT TGGAGGAATT GCTAGAGATA ATGGACCATA 10080 

TAGTCTTTAA CTCAGAGACA CAGTTGCGTA AACACGGTCC GCGTTGTCGA GAGGCTGGTG 10140 

TCAGTGTTGG TTTGCGCCTC AACCCTCAGT GTTCAACTCA AGGcAGATCA CGCGCTCTAT 10200 

GACCCTTGTG CACCAGGTTC TCGCTTTGCA GTTACTATAG ACAAGATTCC GAGTGATTTG 102 60 

CTAGATTTGG TTGACGGACT TCATTTTCAT ACCCTTTCCG AGCAGGGAGC AGATGATTTA 103 20 

CAAACAACTT TGAAAGCAGT AGAAGAACAG TTTGCTCCCT ACTTACATGA GGTAAAATGG 10380 

CTCAATATGG GTGGTGGTCA TCATATTACA AGAGAAGGTT ACGATGTGGA TTTGCTGATT 104 40 

TCAGAAATCA AGCGTATCCG AAAAACTTAC AATCTTGAAA TCTATATCGA GCCTCCTCAA 10500 

GCCATTGCGC TTAATCCGCG TTATTTAGCA ACTGACGTAT TAGATATTGT AGAAAACGGT 105 60 

ATGGAAATCT TGGTTTTAGA CGCC7CTGCG ACCTGCCATA TGCCTGATGT ACTTGAGATG 10620 

CCC7ATCGTC CACCTTTGAG AAATGGCTTT GAGTCACAGG AAAAAGCCCA TACCTACAGA 3 0680 

CTTTCTTCTA ATACC7GTCT GACGGGCGAT GTGATTGGTG ATTATAGTTT TCAAAATCCA 10740 

GTCCAAATCG GAG AC AG ACT TTATTTTCAA GACATGCCCA TTTATTCTTT TGTCAAAAAT 10800 

AATACCTTTA ATGGTATTGG ATTGCCAAGT CTCTATCTCA TGGACCAACA GGGAGACTGT 10860 

AGCTTACTCA AACCTTTTCG CTATCAAGAC TTTAAACGGA GATTATCATG ATGGACAGTC 109 20 

CAAAAAAATT ACGCTATCAC ATGCCAGCAG ACTACGAACC CCATCATGGT ACCCTCATGA 10980 

TATGGCCGAC TCGACCAGGA TCATGGCC7T TTCAAGGAAA GGCTGCTAAA AGAGCATTTA . 11040 
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CTCAGATTAT 


CGAGACCATA 


GCAGAAGGGG 


AAAGAGTCTA 


TCTTTTGGTG 


GAGCAGGCCT 


mco 


ATCTATCTGA 


AGCCCAATCC 


TATCTTGGAG 


ACAAGGTTGT 


7TATTTAGAC 


ATTCCCACCA 


UX60 


ATGATGCCTG 


GGCGCCTGAT 


ACTGGCCCAA 


CCATTCTCGT 


CAATGATAAA 


GGTAAGAAAT 


11220 


TAGCCGTGGA 


TTGGGCCTTC 


AATGCTTGGG 


GAGGCACCTA 


TGATCGTCTT 


TATCAAGATT 


11280 


ATGAAGAGGA 


TGACCAAGTA 


GCCAGTCGTT 


TTGCTGAGGC 


CTTGGAAAGG 


CCTGTCTATG 


11340 


ATGCTAAACC 


TTTTGTACTG 


GAAGGAGGCG 


CAATCCATAG 


CGATGGTCAA 


GGAACTATTC 


11400 


TCGTAACTGA 


AAGTTGCTTG 


CTTAGTCCTC 


GTCGCAATCC 


TAACTTGACT 


AAAGAGGAGA 


11460 


TTGAAAACAC 


ATTATTAGAA 


AGTCTTGGTG 


CTGAAAAAGT 


TATTTGGCTT 


CCTTATGGTA 


11520 


TTTATCAGGA 


TGAAACCAAT 


GAACACGTCG 


ATAATGTTGC 


TGCCTTTGTT 


GGTCCTCCTG 


11580 


AGCTTGTTTT 


CCCTTGGACA 


G ATG ACG AAA 


ATGATCCCCA 


GTATGCCATG 


TCAAAAGCAG 


11640 


ATCTCGAACT 


CTTAGAACAG 


GAAACAGATG 


CAAAAGGTTG 


TCACTTCACC 


ATTCATAAAT 


11700 


TCCCTATCCC 


TGCAGTTCGA 


CAAGTTGTGA 


CAGAAGAAGA 


TTTCCCACCC 


TACATCTATG 


11760 


AAGAAGGAGA 


AGAAAAGCGA 


TACGCAGGTG 


AACGACTACC 


AGCTTCCTAC 


GTAAACTTTT 


11820 


ATATCGCCAA 


CAAGGCTCTC 


TTCGTTCCAC 


AGTTTGAGGA 


TGTAAACGAC 


CAAGTGGCCT 


11880 


TAGATATCCT 


CAGCAAGTGT 


TTCCCAGACC 


GTAAAGTTGT 


CGGAATACCA 


GCCAGAGATA 


11940 


TTCTCTTAGG 


TGGTGGCAAT 


ATCCACTGTA 


TCACCCAACA 


AATTCCAGAA 


TAGGAGAAAA 


12000 


AGATGAGAAA 


TGTAAGAGTT 


GCAACCATTC 


AGATGCAATC 


CGCTAAGGAT 


GTGGCAACAA 


12060 


ATATCCAAAC 


CCCAGAGCCT 


TTAGTACGTC 


AGGCTCCTGA 


CCAAGGAGCC 


CAAATTATTC 


12120 


TCTTGCCCGA 


CTTCTTTGAA 


CATCCCTATT 


TCTGTCAGGA 


ACGTCAGTAT 


GACTACTACC 


12180 


AGTATGCCCA 


ATCTGTAGCG 


GAAAATACTG 


CCATTCAGCA 


TTTTAAGGTG 


ATTGCTAAGG 


12240 


AACTACAACT 


TGTTTTACCA 


ATCACTTTCT 


ATGAAAAAGA 


TGGTAATGTC 


TTGTATAACT 


12300 


CTATTGCCGT 


CATTGATGCA 


GATGGGGAAG 


TGCTGGGCGT 


TTATCGAAAG 


AC CC AT AT AC 


12360 


CAGATGACCA 


TTATTATCAA 


GAAAAATTCT 


ATTTCACGCC 


TGGTAACACT 


GGTTTCAAGG 


12420 


TCTGGAATAC 


TCCCTATCCT 


AAGATTGGTA 


TCGGTATCTG 


TTGGGATCAA 


TGGTTCCCTG 


12480 


AAACAGCGCG 


CTGTCTTGCA 


TTGAATGGTG 


CTGAATTGCT 


CTTTTATCCT 


ACAGCTATCG 


12540 


GTTCAGAGCC 


AATTTTGGAT 


ACAGATAGTT 


GTGG7CACTG 


GCAACGTACT 


ATGCAAGGGC 


12600 


ACGCAGCAGC 


GAA7ATTGTT 


CCACTCATCC 


CACCCAATCC 


TTATGGTTTA 


GAGGAGGTTA 


12660 


CTCCTAGTGA 


GGAAAATGGC 


GGACAGAGCT 


CCAGTCTTGA 


CTTCTACGGT 


TCCTCCTTTA 


12720 


TGACGGATGA 


AACAGGAGCT 


ATTCTAGAAC 


GAGCTGAAAG 


ACAAGAAGAA 


GCTGTTCTGT 


12780 


TACC7ACTTA 


TGACCTAGAC 


AAGGGAGCAA 


, GTGAACGCCT 


AAACTGGGGC 


TTGTTTCGAG 


12840 
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ATAGAAGACC AGAAATGTAT AGACAAATTA CAGATTAGTG TCGCAGAAAT GAGAGATTCA 12 900 

TTCTGCTAGA CTAACTTCTT ATTAGTAACT ATAACATACT ATGGCATCTA GTAAATCGAT 12 960 

TTTTATGATT CGCTATTCTT GTCTATTGAT TAGTCCGTAT TTTAAAATAT TAGCAAAAAA 13020 

GCAAATAGCA GTAACTTCTG TCTATTTGCT TTTCTTTTTT ATAGAATATA TTTCTCAATA 13080 

GCACGCCCAA CGCCGTCTTC TTCGTTGCTT GAGCTAACGG CATCCGCAAC AGATTTGATA 13140 

TAATCGCTGG CATTTCCCAT TGCAATCCCA AGCCCTGCAA ACTGGAGCAT TTCGATATCG 13 200 

TTATTAGCAT CGCCCATGGC CATAATCTCT GAGGAATCAA TCTTCAAAAT CTCAGCTAGT 13 2 60 

CGTGAAAGAG CAGTAGCCTT TGTCGTTCCA ACCGGCATTC CTTCATAAAT GACAGGCTGC 13 320 

GAACGAACTC CACTGAATCG TTGGCAAAGC TCTTCAGCAA AACGCTGCTC AAAATCGTCT 13 380 

GTTTGTTCTT TTGTTCCTAA ACACATACCT TGGAACATCC GGAACTTTCC ACTACTCGCT 13440 

TCTTCAAGAG AAATTTCACT CAGGTCTGAA AATACTAGTT TAGCATCATT TTCAATAACT 13500 

TGATTGGGCT TCTCACCCAG AACAAAATAA TGTGACTCCT CAAAAACTGT CAACTGAACA 13 560 

TCACTCTTTT CAGCAAGGTC ATACACCTAT TCGATGTCAG CTGGACTCAG TTCTTTCCAG 13 620 

TCAACTAGAC TCCAATCACT CGTCTCGTCA GTTGAACAAC CGTTCTTAAC AATAATATAT 13 680 

TCGTTCTGGA GGTCAAGCTC CAGTTTTTTG TAGTAGGGGA GGACACCCAA AAGGGGGCGA 13740 

CCCGTACAGA CAACCACTTT GACACCTTTT TCAATGGCTT TGTGAATAGC AGTAATGTGT 13800 

GCTTGTGGGA TTTCCTTGGC TTCATTGAGG AGGGTGCCGT CCATATCCAA GGCTAGTAGT 13 860 

TTAATCATAG CTCTTCCTCT TTATCTTTGC TATTATTATA GCATATTTTG GAGAAGAAAT 1 3920 

TGATAGAAAG CTTGACACTA ATTGATTTTA TAGTTTAAGA TGTTTTGATG ACAATTCATG 13 980 
ATTTGAAGAC GA7ATTTCGC AAAGATATGC TATACTATGT TTGTCAATCT TGCAACTAGA 14 040 
CAAATTAAAA AACCAACTTA ATATAATAGT TTTTTTGTAA GTACCTATCA GTAGCAGATT 14 ICO 
ACTCAACTAA TCTGAAGAAT AATGGAGGAA ATATATCATG ATTTTAATGA CAAAAAATAT 14160 
AAATCTAACA AATCAACAAT TAGAGCTGAT ACAAGGTGGA GCAGATCCAT ATGGTAAAAA 14220 
TCCTAATGGT AGGTACGATT GGGAAATAGA ACCAGTAT7A ACTCTGCTGG TTCATGGATT 14230 
TTGTCCCAGA GGCACCTATG ATTCAGGATA TATTGGAGGA GGTAATCATC TTTGCAAAGC 14 340 
AAGTGCTGCG AGATTTTAAC TAAAATTTAT TAGGAATATG AAGAAACAAG GGGAGAAAAC 144 00 
AGAGGATTTA ATATGAAAAA ACGAGCTATT CAAATTTTAC TAGCATTGTC CTTAATTTTT 14460 
TACAAATCAA CTTGGTTTTG GAGGCTTTTC AATTATCTCG CAAAGCCCTA TCTACCAGCA 14520 
AGTCGTGAAT TTTTTCAGAT TCTGCTTTTG ATGGAGAGCG CACTTCTTTT CTTAGCGGTC 14580 
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ATCTATCTAC TGGTTTTTGC AGGAAAGAAA ATTTTTCATT TCAAGTGGCA GCTGAGGTAC 14 640 

TTCATCTACC TTTTACTGGG CTACATCATT TCATATATGT CTGACTTCCT CTTTTCGTAT 14700 

TTCATATCCC TGTCTTCAAA TCAGATTTCT TTGAATGAAA CGGTAGAAAT GATGGGGAGA 147 60 

CAGGAGTTCC CTTATGTCTT GCTCATCGTT TGCTTCATCG CCCCTATTGC TGAGGAATTG 14 820 

ATTTATCGAG GtGTGCTTAT GACAACCTGT TGCAAAAACT CACCTTGGTA CG 14 872 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

I A) LENGTH: 10223 base pairs 
(3) TYPE: nucleic acid 
IC) STRANDEDNESS: double 
(O) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

CGTGCTATCG GTCTCAAAAC CAATCTCGTC GCTATGGTCA AATCCAGTTG GAAAATCCAT 60 

TCTTCTTGGA GCCATCTGCT GGATTGCCAT CATCCTCACC ACTCTTGGTA TGCAGACCCT 120 

TATCGGCATT TTCTAATACT CTTCGAAAAT CTCTTCAAAC CACGTCAACG TCGCCTTGCC 180 

GTAGGTATAT GTTACTGACT TCGTCAGTTC TATCTGCAAC CTCAAAACGG TGTTTGAGCT 240 

GACTTCGTCA GTTCTATCTG CAACCTCAAA ACGGTGTTTT GAGCTGACTT CGTCAGTCGT 300 

ATCTACAACC TCAAAACAGT GTTTTGAGCT GACTTCGTCA GTTCTATCTG CAACCTCAAA 3 60 

ACAGTGTTTT GAGCAGCCCG TGGCTAGTTT CCTAGTTTGC TCTTTGATTT TCATTGAGTA 420 

TAACACAAAA GGTAGCCCAT CAGCTACCTT TTTCTTATGC TTCCTCAATC AAGCGAGTAT 480 

GTTCTCTCTT GATACAGCGA TTCATCACGA TATCATCACA TCCACCATCA CGCAAAATCT 540 

CTTTCGCTTC TAAACTTTCA AGTCCTAGCT GTGCCCAAAA AATCTTGCCA TCAGCTTTGA 600 

GAAAATCACG CGCCACATCG GGCAGAAATT CACTGCGACG ATAAACATTG ACAATATCTA 6 60 

CAGGAAAAGG AATTTCAGCG AGGCTAGCAT AAGCCTTTTC ACCCAAGATT TCCCCACCTG "7 20 

CCGCCTTGGG ATTGACTGGG ATGATTTTAT AGCCCCGACC CTGCATTTCC TTTGTTACTC 780 

GATTGCTGGT TCTTTCTTCA CGGTCAGACA AACCCACCAC AGCAAGGGTT TTACTCGTTG 84 0 

CGAGATACTG ACGAATCACG CCATCACTTG GATTGATAAA TTCTTGACTC ATAGAAATCC 900 

TCCTTTTTCA TCAGTATAGC ACATTTTGAA AAGGTTTGCA GAATTATACT ACAAAAAAGG 960 

AGCACTACCC CCCTTTTTAT TTAGCCTCCT ACCACGTTGC CCCTTCATTC TCATCTGCGA 102 0 

TAAGAGGAAC ACTGAGTTGA ATGGCTTCTT CCATGGTTTG TTTCACCAAT TTTTTCATCT 1080 

CTACCAATTC AGATTTAGGC ACTTCAACGA CGATTTCATC GTGCACTTGT AACAGCATCT 1140 
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TAGTCTGATA ACCACCTGCA ACCAAGGCT? TATCCAGCTG AATCATGGCA ATCTTGAGAA 1200 
TATCTGCTGC CGAACCCTGG ATAGGTGAGT TGATAGCACT TCGCTCCGCA AAACCACGAA 126C 
TATTGAAGTT GCGCGAATTG ATATCTGGCA ACTCACGGCG ACGCTTAAAG AGGGTCTCTA 
CATAGCCCTT ATCACGCGCC TCCCGCACCA CTTCATCCAT GTAGTTTTTA ATACCTGGAA 
AACGTTCAAA GTAGGTATCA ATGTAGGCTT TGGCTTCCTT ACGACTAATT CCCAAATTAT 
TAGACAAGCC AAAGTCTGAA ATCCCATAAA CCACTCCAAA GTTAACTGCC TTGGCATTGC 
GACGGTCCTT TGCAGTCACA TCATCAGGAC GCTCAATGCC AAAGACCCGC ATGGCTGTCG 
AAGTATGGAT ATCTGCCCCC TCTTGGAAGG CCTTAATCAA GTGCTCATCC T7AGAAATAT 
GCCCCAAAAC GCGCAATTCA ATCTGTGAAT AGTCAGAGCT GAGTAGCACA CTATCCTCCC 1680 
ACTCTGGCAC AAAAGCCTTC CGAATCAAGC GCCCCTGTTC CAATCGGGCA GGAATATTTT 
GCAAGTTTGG ATCCACACTA GACAAACGCC CGGTCTGGGT CAAATCCTGC ACATAGCGAG 
TATGAATCTT TCCATCAGCC AAAATCCAGT CCTGCAACCC AATTACATAA GTAGATTGAA 1860 
TCTTAGCAAT TTGACGGTAA TCCAGGATTT TCTTAACAAT CGCAGCAATA GGAGCGAGAC 
GCTCTAAAAC ATCCACTGCT GTCGAATAAC CTGTCTTGCT TTTCTTAGTG TATTCTAGAG 
GAAGTCCCAA TTTCTCAAAG AGAAGCACGC CCAACTGCTT AGGCGAGTTG ACATTAAACT 
CCTCACCAGC CAGCTCGTAA ATCTCTTGAG TCAGTTTTTC AATGACAAGC TCATTTTCAG 
CCTGCATCTC AAGCAAGGTC TCTTTCTTGA CCATAATCCC AGCAATTTCC ATCTTGGCAA 
GGACAAAAGC CAGAGGTTGC TCCATATCAT AAACAAGCTC TAA7TGCCCA TTTTCGCTGA 
GTTTTTCAAG 7AAAATACGC TCTGTTTCTA CCAAAACAGC AAGTTTACAA GCTAAGTGTT 
CCAAGAATTT CTCACGTTCA CGAATGGCCT TTTTAACACC CTTACCGTAC AAAGTTTCA? 
CATCAACCAA GTAAGTC7CA CCATAAAGAC TAGCGATGCT CGCAATTTCA TTGTCCTCCA 
CAGTCGAAAG GAGGTATTTA GCCAAACGGA TGTCAAAAGC AGGCCCCTGC AAATCCACAC 
CAAAACGTTC CAAAAGAACT TTAACCTTCT TAAAGTCATA AACTCTCAGA GATGTTTTTT 
CTAAGAAATC CTTGAAAATC GCGTCTTGCA ACAGCTCAAG CTTGTCTGTG GCATAGAGCT 
TATCCCCACA AGACCAGACA AATCCAACCA AATTATCCCT ATGGTAATTC TCACCAAAAA 
GCTCAAAGTG GAAGATAGAC TCTTCACTCA GCATATCTTG ACTGATTTGG TCAACAATAG 
TAAAATCCAA ACTC7CAGAC ACATCAGCTG ACGACACATT TAAAGCCTGC TTTAGCTGTT 
TGAAGCCCAT CTCATCGTAG AATTTCCCAA GATTTTCAAC ATCTGGACCA CTATAGACCA 
AGTCCTCTAA ACCAATCGCA ATCGGTGCCT TGGTATCAA7 GGTCGCTAGT GTTTTAGACA 



1320 
1380 
1440 
1500" 
1560 
1S20 



1740 
1800 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2323 
2880 
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AAAAGGCCTC TTCCTTGTCA TTGATGAGAT TTTCCTTCAT CTTAGAAGTC TTCATTCCAT 2 940 

CAATATTTTC ATAAATCCCC TCAAGCGAAC CATGCTCCAG CAAGAGCTTA ATACCCGTCT 3000 

TTTCACCGAC TTTGGTCACC CCAGGGATAT TATCCGACTT ATCACCCATG AGCGCCTTGA 3060 

GATCGATAAA CTGAGCTCGT GTGAGGCCCA TTTCTTCCAT GAGGTAATCT GGCCTAAACC 3120 

CCTCAAACTC AGCCACACCT TTCTTGGAAA TTTCAACCAC CCTATCCTCA TCCGTCAGCT 3130 

GAATCAAATC CTTGTCCCCA CTGACAATAG TAATATCAAA ACCATCCTGC TCTGCTAGCT 3 24 0 

TATCCAGCGT CCCAATCATG TCATCCGCCT CATACTGAGC C AG AT CAT AG TGACGAATCC 3 300 

CCATATGATC CAGCAACTCA CGAATGAAAG GAAATTGCTC ACGAAACTCA TCAGGAGTCT 3 3 60 

TGGCCCGACC ACCCTTATAG TCCGCATACA TCTCTGTCCG GAAGGTCGTC TTTCCCCCAT 3 420 

CAAAAGCCAC CAAAATATGA CTCGGCTCAA CCCGCTCCAA TAAATGACTC AACATCAACT 3480 

GAAAACCATA AATCGCATTG CTATGCAAAC CAGCCACATT CTTAAAACGG TCCAACTGCT 3 54 0 

GATACAGCGC AAAAAACGCC CGAAAAGCTA CAGAAGACCC ATCAATCAAT AATAATTTTT 3600 

TCTTATCCAT ACACCCATTA TAAACGAAAG AATCAAAAAA TACCATTGGG AAGAGCTAGA 3 660 

GCAACTATTT TTCAAACTTT TTCCGAATAA ATACATAGAG CCAGAGAATT TAGTAAACCT 3720 

AGATTTAAAA ATGTGCTATA ATATAGTATA TTGAATCTAT AATAGTACAC CTTGACTGCT 3780 

AAAATATTTC TATAAATTAA TTTGACTTTC CTGATAGAGT TATTCACATC TTATTTCAAC 3840 

TCACTATAGA AGGAGGAATA GGAGGATTCT CAGACATCCG GGCATCAGCC CAACTAATCA 3900 

TTTGATTGCT AAGAAAATAT TCAGCAATCC AGAAATCACT TGTCAATTTA TTCGCGATAT 39 60 

GCTGGACTTC CCAGCAAAAA ATGTGACCAT TTTGGACGGA AGCGATATTC ACGTATTACT 4020 

CTCCATGCCT TACTCGGTGC AGGATTTTTA TACCAGTATA GACCTCTTCC CGGAGTTGGA 4080 

TAACGGTACT CAAGTAATTA TTGAGATTCA AGTCCATCAT CAGAATTTTT TCATCAATCA 4140 

CTTGTGGGCT TACCTGTGCA GTCAGGTTAA TCAAAATCTT GAAAAAATTC GTCAGCGAGA 4200 

AGGTGATACT CACTAGAGCT ACAAACACAT CGCTCCTCTT TACGCCATTG CTATCGTGGA 4260 

TACTAATTAT TTCTCAGATG ACCTGGCTTT TCATAGCTTT AGTATGCGCG AAGACACAAC 43 20 

AGGTGAGGTA TTGGCGATTA CCAACAATGG ACACGAAAAC CATCTCGTTA AGATGGCATT 4 3 80 

CTTGGAATTA AAAAATACAG AGAAACCAGC AAAGACAAGG TTCGCAAGCC ATGGTTGGAG 4440 

TTTTTCGGCA ACAAGCCCTT TACCCAGCAA CCGCAACGAG CCATTACCCA AGCAAATCAA 4500 

CTGCTGGACT ACAAGAGCTG GTCCGAGGAG GACAGGAAAA TCTTTACTCA ACTACATATG 4 560 

CGAGAAGAAC AAGTCTTGTT AGCACAGGAC TATGCCTTGG AAACTGCTAG GGCTGAAGGC 4 620 

CTTGAACAAC GACTAGAGCG TGGGAAAGTT GAAGGAACGG CAGAAAGGAA ACTTTTTGCC 4 680 
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TTCCTAGACA TAGTACGCCA AGCTCTTCTG ACTTCTCAGG TTGCCAGCCA GCAATTAGGT 4740 

ATGTCAGTAT CTGAATTTGA GGCACTGTTG TAAAATGGCT CCATAATATC CATAGTGGGT 4 800 

AAATCCCCTA TGCATATTAT GGACCCTATT TTGTGTAGAA AAAAACTCCC ATATGACCTA 4 86C 

TAATGAAAAG CGACAAAACA AC ?C ATT AC A AAGAATCATA TGGAACAATT ACATTTTATC 4 920 

AC AAAATT A C TAGACATTAA AGACCCTAAT CTCCAGATTT TAAACATCAT CAATAAGGAT 4 980 

ACACACAAGG AAATCATCGC CAAACTGGAC TACGACCCCC CATCTTGCCC TGAGTGCCGA 5040 

AACCAATTGA AGAAATATGA CTTTCAAAAA CCTTCTAAAA TTCCTTATCT TGAAACGACT 5100 

GGTATGCCTA CAAGAATTCT CCTTAGAAAG CC7CCATTCA AGTGCTATCA CTGTTCAAAA 5160 

ATGATGGTCG CTGAAACTTC TGATGACGTA CAGTCATATT TCTTCTCTTT TTATTATATC 5220 

ACAGTTTTAA ATCTAGCTTT ACTAGATTCA CCGCTACTAT CTATTTATTC GGAAAAAAGA 5280 

CGAAAAAACC TGAGAATCAT CTCAGGCTTG GTCATTAAAT TTTTTTCTCA ATATCGAAAA 5340 

GTGGAGAAAG TGCTCCTTTT TCATGAATAC GTACGATACC ATCCCCTAGG AGATGAGCGA 5400 

TTGAAATCTG CTCAATCTTA TCAATCAAAC GCTCTTCTCC CAGATAGATG GTATCCAAAA 54 60 

CAACCAATTT CTTAATAGCT GATTTTTGCA TATTGTCCGT AGCAGGACCA GAAAGAACTG 5520 

GGTGCGTACA GCTTGCATAG ACTTCAACAG CACCACCTTC CGCAACAGCA TCTGCCGCAT 5580 

GACAAATCGT TCCAGCGGTA TCAATCATAT CATCAATCAA GATACAAGTC TTGCCTTCAA 564 0 

CCTTACCGAT GATATTCATA ACTTCACTAG TATTCATCTT ATCAACGCTA CGACGTTTAT 5700 

CAATAATAGC GATAGATGTT TTCAAAAATT CTCCCAACTT ACGAGCACCA GTCACCCCTC 57 60 

CATGGTCCGG GCTGACAACC ACATAGTCAG AACCAACCAT ACCACGACCC TCAAAATAAT 5820 

CTGCAATCAG AGGAGCACCC ATCAAATGAT CCACAGGAAT ATCAAAGAAT CCTTGAATTT 5 880 

GCCCAGCATG CAAGTCGATG GTCAATAAAC GATCCACTCC AGCTACTTCA AGCATATTTC 5940 

CGACAAGTTT TGAAGTGATT GGCTCACGCG CTCTCGCCTT TCTATCCTCA CGTGCATACC 6000 

CATAGTAAGG CATCACAACA TTGACAGATT CTGCACTCCC ACGCTTCAAA GCATCTACCA 6060 

TAATCAAAAT TTCAAGCAGA TTGTCATTTA CAGGCGAACT AGTTCATTGT AAGATAAAGA 6120 

CCTCTTTCCC ACGGATTGAT TCTTCAATGT TGACCTGAAT CTCTCCATCT GAAAATTGCC 6180 

GAACACTTCA TTTCCCCAAC TCTATCCCAA TCTCCTGCGC CACACGTTCT GCCAATTCTT 62 40 

TATTAGAAGA AAGGGCAAAC AGCTTTAAAT CAGAAAAAGA CATGATTTCC TCCGGTATAT 6300 

ATGTATAACT TGTGCTTTTC ACAAGATTTT GCATCTACCA TTGTAGCGCT TTTTGCACTA 6 3 60 

TTTTTCAATC AAAAATAAAA GAAGGGCACC ATATTTGTAC CCTTGCATCA TTCTTTTGAA 64 20 
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AAATATTCTA 


GGTCATCAAC 


TCATTGTGTT 


TCTCAACAAA 


GCAATAAGCA 


TGATAAAAAC 


6480 


CATACAGAGC 


AATAGCCGTA 


ACCACTCCAA 


TCGCTAAAGG 


CAACTCTCTT 


TCCAACTCCA 


6540 


CAAAAGGAGA 


GTTAAACAAG 


AAGTGAGTTC 


CCAAGGCTAA 


ACCTAGAAAA 


ATAACGCCCT 


6600 


GTTTCTTGCC 


AACCTTCTGT 


CCTTTATAGG 


CTCTGTAAAG 


CAAGTAAACA 


CCTACTACAG 


6660 


C7ACACCTCA 


AAAAGTCCAG 


TGAGAGGCAA 


TTCCTGAGAT 


GATACGCTCT 


AAAATTCGCG 


6720 


AAATAGTAAA 


GTCAAAGCCC 


TCTGGCAAAT 


CCGTACGAAT 


ATAACCAATA 


TCCTTAATCA 


6780 


TTTGGAATCC 


CAAACCGGAA 


GCAATTCCAA 


GTAAAAACAA 


AGATTTTAAT 


TTTCGCACAG 


6840 


GAATCAAACC 


CAAAACAAAA 


ACAAGTGACA 


ATAATTTCAA 


GGGTTCTTCT 


ACCAAAGGAG 


6900 


CCCCAA7AGC 


ACTTTCAAAG 


GCATTTAAAA 


ATGGACTATC 


TGGCAAAAGA 


ACCCCCAGTA 


6960 


AATCATGGAT 


ATAAGTATTA 


GCAAAACTAG 


ACAACCAGCC 


TGAAAGGAAC 


ATCCCTCCCA 


7020 


ATAAAGACAG 


AATCAAAACC 


TTCTTTGGCA 


ATTCCCATTT 


TTCCCAATAC 


GGAAGAGAAA 


7080 


ATAAAGAGCC 


GGAATCATGT 


AAAAGAGAGC 


TAGAAAGATA 


GAAACTCCCA 


TTAGTCCATA 


7L40 


TTCCGCACCT 


GACCTCGAAC 


CGTCCGTATA 


GTAGATGGTT 


TCATACTGTA 


AACCAATACA 


7200 


TAGCAATAAA 


ATAAAAATAA 


ATAAAATATT 


GCTTTTCTTC 


ATACACTTTC 


TTTCTAAATG 


7260 


AAGTATTTAT 


AATTCTACGA 


CTGTCATACT 


TCCTGTATCA 


ACATTGTAAA 


TGCCACCAGA 


7320 


GATAATGACA 


TCGTCTGGTA 


TTAGGGGAGA 


CTCCATAAGC 


AGTTGCATAT 


CCTCGCGTAC 


7380 


ACTCTCTTCT 


ATATCTTGGA 


AGGGCAAGAA 


GTCCTGGTCT 


GACACATCGA 


CACCCAATTC 


7440 


TTCCTTCAAA 


TACTCCTGAA 


AAGGTTCATT 


TTCAAAGGTC 


TGACCACCAC 


AGTCTGTATG 


7500 


ATGCAATACC 


ACAATTTCTC 


TTGTCCCCAT 


TTGTTGCTCG 


GAAATAACTA 


GAGAACGAAT 


7560 


CATATCCTCA 


GTCACTCGAC 


CACCTGCATT 


CCGCAAAATA 


TGAGCATCCC 


CAAGTGCCAA 


7620 


ACCTAGAGCT 


TGCGCAACGT 


GCAAACGTGA 


GTCCATACAG 


GTCACAATGC 


CTACTCTGGT 


7680 


TTTAGGTTTA 


AGTGGCAGAT 


TTAACTGCCC 


ATGTAGGGCA 


ACATAAGCCT 


CATTCCCTTG 


7740 


CA7AAACTGT 


TCAAAATACG 


ACACGATTCC 


CTCCTTGAAA 


ATTTGATAGT 


CAAAT ATTIC 


7800 


TCCTATCTTA 


TCATTTTTAA 


GAGAATTTGT 


CACGGATTA7 


GCAAAGACCT 


TTTTCAAGAC 


7860 


TTCCTGAATC 


GTTGTCACGC 


CAATGACCTG 


AATTTCCTTA 


GGCAGAGTGA 


TTCCTGTCAA 


7920 


GGAATTCTTA 


GGTACATAAA 


TCTTAGTAAA 


GCCCAGTTTA 


GCAGCTTCGT 


TGATGCGTTG 


7980 


CTCAA7ACGA 


TTCACGCGCC 


GAATCTCTCC 


TGTCAAGCCC 


ACTTCTCCGA 


CAAAACATTC 


8040 


CTGAGGATTA 


GTTGGCTTGT 


CTTTGTAGCT 


CGAAGCAATA 


GCAACTGCAA 


CAGCCAAGTC 


9100 


AATCGCAGGT 


TCATCCAATT 


TAACACCACC 


AGCAGATTTG 


AGATAGGCAT 


CCTGATTTTG 


3160 


CAAGAGAAGC 


CCTCCCCGTT 


TTTCCAAAAC 


AGCCATAATC 


AAGCTAGCAC 


GGTTAAAATC 


8220 
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AAGTCCTCTC GTAGTACGCT TGCCATTTCC AAACATCCTC GGTGTTACCA AAGCCTGAAC 8280 

CTCCGCCAAA ATCGGACGCG TCCCTTCCAT GGTTACAACG ATGGAGGAAC CAGTCGCCCC 8 340 

ATCCAAACGC TCTTCTAGGA AAACTTGACT CGGATTGAGT ACCTCAACCA AGCCGCCCGA 8400 

CTGCATCTCA AAAATCCCAA TCTCATTAGT GGAACCAAAA CGATTTTTGA CCGCTCTCAA 84 60 

AATACGAAAG GTGTGGTGAC GCTCCCCTTC AAAGTAAAGC ACCGTATCCA CCATATGCTC 8520 

CAACATACGA GGCCCAGCCA AGGTTCCTTC TTTGGTCACA TGACCTACGA TAAAGATGGC 8580 

AATGTTATTG GTCTTGGCCA ACTGCATGAG TTCAGCGGTC ACTTCACGCA CCTGAGAAAC 364 0 

AGACCCCTGC ACCCCTGAAA TCTCAGCAGA CATGATGGTC TGGATGGAAT CAATAATGAG 37 00 

AAAGTCTGGC TGGATACGCT CCACTTCTGC ACGAACACTC TGCATATTCG TCTCTGCATA 87 60 

GAGATAAAAC TCACTATCAA TATCACCTAA GCGCTCTGCA CCTAGTTTAA TCTGCTGCGC 8820 

AGACTCCTCC CCACTGACAT AGAGAACTGT CCCCACTTGC GACAACTGGG TTGAGACTTG 8880 

TAGGAGAAGA CTTGATTTCC CAATCCCAGG ATCCCCACCG A7AAGGACGA GACTTCCTGG 8940 

TACCACTCCG CCTCCAAGCA CACGGTTGAA TTCCTCCATC TCCGTCTTGG TTCGATTGAC 9000 

ATTGATGGAA GTCACCTCAG CTAGTTTCAT GGCCTTGGTT TTCTCACCTG 7CAAGCACAC 9060 

ACGCGCATTC TTAACTTCGG CAACCTCAAC CTCTTCCACA AAAGAAGACC AAGACCCACA 9120 

GTTGGGGCAA CCTCCCAGAT ATTTAGGGGA ATTATACCCA CAATTTTGAC ATACAAATGT 9180 

CGCTTTTTTC TTTGCGATGA CAAACCTCTT TCTATATCTC TAACTCACAC TCAATCACTT 9240 

GGCAAAAATC AATCTTCTCA TTTGGCACAA ACTCGCCCAT GAGCATTCGA TGAGCAACAA 9 300 

CTACCACAGT CTGATGTTCT CGATACTTAG AC ATA C ATT C TAGAAACCGA GACTTCATTT 9 3 60 

CCGTAGCTGT CTCATATTGA ATAGGACTAT TAGGAAGCAA CTCCCCCTTG TTTTCTAAAA 94 20 

ACAGTCTTCT ACCTGTTTCA AAGTTTTCTA TTCCTGTTTT ATAGACCTGC CATTCATGTA 94 80 

ATAAACCCTC TACTCTTAAA GGAAGACCCG TAGCACAGAC CACATACCAA GCCGTTTCTA 9540 

AAGCTCTTGT GACTGCAGAA GATACGATTA TTTCAGCTGA CGAGAGTAAA GGATTTTTGC 9600 

TCAATTTCTG GACTTGCTGC CGTCCCATCT CAGACAAGGG TGCCAAATCT ATCCCAAATC 9 660 

CTATATAAGA ACGCTCCTCT AACTCACGGT AATCTGGCTC CCCATGACGT ACAAAGATAA 9720 

TCTTCATTCT AGTGCCCTGT CGATCCAAAT CCACCAGTTC GAACCCCATC AGCTGCATCT 97 80 

CCATCTGCAA TTAAGAAAGT AGCAAAAACA GCCTGGACAA TACGCTCCCC AACTTCAAGA 9840 

ACAACCTCTT GGTCTGTGAT ATTCTTCATC TGCGCAAAAA TATGCCCTTC ATTTCCAGGA 9900 

TTTCCATAAT AATCCCCATC AATGACTCCA ACTGAGTTAA 7TAAAACCAA GCCCTTCTTA 9960 
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CGAGGATTTG AAGAACGATC ATAGAGGTAG AGAACCTCAG TCGGCTGCAT ATAAGCCTTA IOC 2 0 

ACCCCTCTCG GAACCAAGAC AATCTCTCCT GGCGCAACAA CTGTACGCAC AGCAACCTTT 10080 

AAGTCGTAAC CAGTCGCATG CCCTCTCTCA CGCTTGGGCA ATAAATTTTC ATCTGTAAAA 10140 

CTCGAAACCA ATTCAAAACC ACGAATTTTC ATAATTTTCT CTTTTCTATT ATCATTTATT 10200 

CT AG ATT ATT CTATACTTAT TTA 10223 

(2) INFORMATION FOR SEQ ZD NO: 74: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16535 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(Di TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TGGTTCTGTC CTTATCGGCG CCTTGTCTTG CTTGCCATCG CTACACCAAC TATCTCATCC 60 

CACGAAACTA CACCAACCAC TAACGAACCC AACAACAGAA ATACAACCAC CCTTGCCCAA 120 

CCTCTTACTG ATACAGCAGC TCGCTCTGGT AAGAACGAAA GTGATATTTC TTCACCTGGA 180 

AATGCAAACG CTTCCCTAGA GAAAACAGAA CAAAAACCTG CTCCAAGCCC AGCCCATCCA 240 

GCACCACAAA CTGGACAAGA TCCTTCAAGT GAGCCAACTA CTTCTACTAG TCCAGTAACA 300 

ACTGAAACTA AGCCACAAGA GCCCATCGAA GATAACTACT TCCCTATCCA TGTCAAAAAA 360 

CTTCCTCAAG AAAACAAGGA TGCTCAAGGA CTATGCACTT CCCACGATGT TGAAAAACCA 4 20 

TCTGAAAACT GGCCAAACCG AGCTTTGTCC TTCAACGATG CCAAGAAAGA TGACTACGGC 480 

TATTACCTAG ATGTCAAATT AAAGGGAGAA CAAGCCAAGA AAATTAGCTT CCTCATCAAC 540 

AATACAGCTG GAAAAAATCT AACCGGCGAT AAATCTGTAG AAAAACTAGT TCCAAAAATG 600 

AACGAAGCTT GGTTAGACCA AGATTACAAG GTTTTCTCTT ACGAGCCACA GCCTCCAGGA 660 

ACTGTTCGCG TCAACTACTA CCGCACAGAT GCCAACTATC ACAAGAAATC TCTCTGGTAC 7 20 

TGGGGAGATG TGAAAAATCC AAGTAGCGCT CAATGGCCTG ACGGAACAGA CTTTACGGCT 7 80 

ACAGCCAAAT ATGGCCGCTA TATCGACATT CCTCTTAATG AAGCCGCAAG AGAATTTGGA 840 

TTTTTATTAC TACATGAGAG CAAACAAGGA GACGACGTGA AAATCCGTAA AGAAAATTAT 900 

AAGTTCACAG ATTTGAAAAA TCATAGCCAA ATTTTCCTAA AAGACGATGA TGAATCGATT 9 60 

TACACAAATC CATACTATGT CCATGATATC CGTATGACAG GAGCCCAACA CGTAGGCACT 102 3 

TCTAGCATTG AAAGTAGCTT TTCAACACTT GTCGGTGCTA AAAAAGAAGA TATCCTCAAA 108 C 

CACTCCAACA TCACTAATCA CCTAGGAAAC AAGGTAACTA TTACCGATGT TGCAATCGAT 1140 
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GAAGCTGGTA AGAAAGTGAC CTACAGCGGA GATTTCTCTG ACACAAAACA TCCTTATACT 1200 

GTTAGCTACA ATTCCGACCA ATTCACTACC AAAACAAGCT GGCGCCTGAA AGATGAGACA 12 60 

TACAGCTATG ATGGCAAACT GGCAGCTGAC CTAAAAGAAG AAGGAAAACA AGTTGATTTG 1320 

ACCCTTTGGT CACCAAGTGC TGATAAGGTT TCTCTTGTTG TCTACGACAA GAATGACCCT 1380 

GACAAAGTAG TTGGAACTGT CGCTCTTGAA AAAGGGGAAA GAGGAACTTG GAAACAAACT 1440 

CTAGACAGCA CAAACAAACT CGGAATCACA CATTTCACTG GCTACTATTA TCAATACCAA 1500 

ATCGAGCGTC AAGGTAAAAC TGTTCTTGCA CTCGATCCTT ACGCTAAATC TCTTGCTGCT 1560 

TGGAATAGCG ACGATTCCAA CATTCACGAT GCCCATAAAG TGCCTAAAGC CGCCTTTGTA 1620 

GATCCAGCTA AACTCGCACC TCAAGACTTG ACTTATGGTA AGATTCACAA TTTCAACACT 1680 

CGTGAAGACG CCCTTATCTA CGAAGCTCAT GTGCGTGATT TCACTTCAGA TCCTGCCATT 1740 

GCAAAAGACT TGACCAAACC ATTTGGGACT TTTGAAGCCT TCATTGAAAA ACT AG ACT AT 1800 

CTCAAAGAC? TGGGTGTAAC CCATATCCAG CTCCTTCCAG TCTTGTCTTA CTACTTTGTC 1360 

AATGAATTGA AAAACCATGA ACGCTTGTCT GACTACGCTT CAAGCAACAG CAACTACAAC 1920 

TGCGCATATC ACCCTCAAAA CTACTTCTCC TTGACTGCTA TGTACTCAAG CGATCCTAAG 1980 

AATC C AG AAA AA CG AATCGC AGAATTTAAA AACCTCATCA ACGAAATCCA CAAACGTGGT 2040 " 

ATGGGAGCTA TCCTAGATGT CGTTTATAAC CACACAGCCA AAGTCGATCT CTTTGAAGAT 2100 

TTGGAACCAA ACTACTACCA CTTTATGGAT GCCGATGGCA CACCTCGAAC TAGCTTTGGT 2160 

CGTGGACCCT TGGGGACAAC CCACCATATG ACCAAACGGC 7CCTAATTGA CTCTATCAAA 2 22 0 

TACCTAGTTG ATACCTACAA AGTGGATGGC TTCCGTTTCG ATATGATGGG AGACCATGAC 2280 

GCCGCTTCTA TCGAAGAAGC TTACAAGGCT GCACGCGCCC TCAATCCAAA CCTCATCATG 2 340 

CTTGGTGAAG GTTGGAGAAC CTATGCCGGT GATGAAAACA TGCCTACTAA AGCTGCTGAC 2400 

CAAGATTGGA TGAAACATAC CGATACTGTC GCTGTCTTTT CAGATGACAT CCGTAACAAC 24 60 

CTCAAATCTG GTTATCCAAA CGAAGGTCAA CCTGCCTTTA TCACAGGTGG CAAGCGTGAT 2 52 0 

GTCAACACCA TCTTTAAAAA TCTCATTGCT CAACCAACTA ACTTTCAAGC TGACAGCCCT 2 580 

GGAGATGTCA TCCAATACAT CGCAGCCCAT GATAACTTGA CCCTCTTTGA CATCATTGCC 2 640 

CAGTCTATCA AAAAAGACCC AAGCAAGGCT GAGAACTATG CTGAAATCCA CCGTCGTTTA 2700 

CGACTTGGAA ATCTCATGGT CTTGACAGCT CAAGGAACTC CATTTATCCA CTCCGGTCAG 27 60 

GAATATGGAC GTACTAAACA ATTCCGTGAC CCAGCCTACA AGACTCCAGT AGCAGAGGAT 2820 

AAGGTTCCAA ACAAATCTCA CTTGTTGCGT GATAAGGACG GCAACCCATT TGACTATCCT 2 8 80 
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TACTTCATCC ATGACTCTTA CG ATT CT ACT GATGCAGTCA ACAACTTTCA CTGCACTAAG 2 940 

GCTACAGATG GTAAAGCTTA TCCTGAAAAT GTCAACAGCC GTGACTATAT GAAAGGTTTG 3000 

ATTGCCCTTC GTCAATCTAC AGATGGCTTC CGACTTAAGA GTCTTCAAGA TATCAAAGAC 3060 

CGTGTCCACC TCATCACTGT CCCAGGCCAA AATGGTGTGG AAAAAGAGGA TCTAGTGATT 3120 

GGCTACCAAA TCACTGCTCC AAACGGCGAT ATCTACGCAG TCTTTGTCAA TGCGGATGAA 3180 

AAAGCTCGCG AATTTAATTT GGGAACTGCC TTTGCACATC TAAGAAATGC GGAAGTTrTG 3240 

GCAGATGAAA ACCAAGCAGG ACCAGTCGGA ATTGCCAACC CGAAAGGACT TGAATGGACT 3 3 00 

GAAAAAGGCT TGAAATTGAA TGCCCT7ACA GCTACTGTTC TTCGAGTCTC TCAAAATGGA 3 360 

ACTAGCCATG AGTCAACTGC AGAAGAGAAA CCACACTCAA CCCCTTCCAA GCCTGAACAT 34 20 

CAAAATGAAG CTTCTCACCC TGCACATCAA GACCCAGCTC CAGAAGCTAG ACCTGATTCT 34 80 

ACTAAACCAG ATGCCAAAGT AGCTGATGCG GAAAATAAAC CTAGCCAAGC TACACCTGAT 3540 

TCACAAGCTG AACAACCAGC ACAAGAAGCA CAAGCATCAT CTGTAAAAGA AGCCGTTCGA 3600 

AACGAATCGG TAGAAAACTC TAGCAAGGAA AATATACCTG CAACCCCAGA TAAACAAGCT 3660 

GAACTTCCAA ATACAGGAAT CAAAAACGAA AACAAACTCC TATTTGCAGG AATCAGCCTC 37 20 

CTTGCCCTCC TTGGTCTCGG TTTCTTACTA AAAAATAAAA AAGAGAACTA AACTAGCCCT 3780 

CCTATAGAAA AATCCCCCAA GCATTATAGC TCGGGGGATT AATTTTTGTA CAATATTTGT 3840 

TGTCCTAATA AACTTGATTA GGATTTTTTA TTAAGCCTCT TTCATAGCAA AATAAGCTCG 3900 

TACTTTGGGT GCAACTTGTG TTCCGAAGAG TTCAATAGCT CTCAGAACCT GGTCATGAGG 3 960 

CATAGAACCA AGCGGTAGAT GAAGCATCAA GCGGTCCAAT CCTAAATCCT CTATCATGCG 40 20 

AATCAATTTT TCGGCCACCT GATCTCGATT GCCAACAAAC ATGGCGCCAT TTGGCCCTAC 4080 

CTGCTCCAAA TATTGCTCAT AACGCAATTC CTGCCAGTGC GGACGGTCTT TGGAAATAGC 414 0 

ATCCACCACT TGCTTAGTCG GATGGAAATA A7CTTTCACC GCCTGCTCAC CATCTTCCCC 4 200 

AATCCACCCC CAAGAATGGG CTCCCACTTT CAAGTCTTTG TCAGCATGGC CCCTTCGCT7 4 2 60 

CCAATCTCAC GATAAGCCTG AATCAACTTT TTAAAATAAC GTGGATTACC ACCAATAATA -1320 

G C AT AT AC AA TCGGTAGACC AGCCTGAGCA ATCTTCACTG TTGATTCGAC ATGACCACCT 4 380 

GTAGCTATCC ACAAGGGCAA TTTGTCCTGA ACTGGACGAG GATAAACTTC TTTACCAGCA 4440 

ATCGTTTGAG TCAATCGACC TTGCCAGTCT AACTTGGTCT TTTCATTGAC TAACTGAAGC 4500 

AAGTCTAATT TCTCATCAAA AAGAGAGTCG TAGTCTTTCA AGTCATAACC AAACAGAGGG 4560 

AAAGATTCCG TGAAAGAGCC CCTTCCAGCC ATAATCTCCG ATCGTCCATT TGACAAAGCA 4620 

TCGATAGTGG CATACTGTTG GAACAAACGA ATCGGGTCCA TGCTTGACAG AATGCTGACT 4 680 
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GCACTGGTCA AACGGATTTT CTTGGTATTG ACTGCCCCAG CGGCCAGAAC AATCTCTGGG 4740 

GCTGATACTG CAAAATCCGC CCGATGGTGC TCACCAATCC CATATACATC CAAACCAACC 4 800 

TTGTCAGCCA GCTCAATCTC TGCCACCAAC TGGCGAATCC GTTCAGCATG ACTGTAAGTT 48 50 

TGTCCAGTCC CTTCAACCTC CGTTATTTCC CCAAATGTTG AAATTCCCAA TTCTACCATT 4 920 

GTGATTCTCC TTATCTATCT CTGTACTTCA ATTTGAAAAA TTATTCTAAC ACGAATCTTG 4 980 

AGTACAAGCA ACCGATTTGC TC ATT AG AAA AAGCCTAGAT AACTAGACTT TTTTAGCTTA 5040 

TTCTACCCTT ACTGACTTGG CAAGGTTACG TGGTTTCTCC ACATCGAGGC CACGGTGGAG 5100 

GGTTGCAAAG TAAGCGACTA ATTCCGTTGG TACCACCATT GAAATTGGTG AGAGGTATGG 5160 

ATGTACGGTC GTAAGGACGA TATCGTCGGT ATC7TTCGCT ACATTCTCTT CTGCGATAGT 5220 

GAGGACTTTG GCACCACGGG CTGCGACCTC TTGGATATTT CCACGAGTAT GATTGGCAAG 5280 

AACTGGATCT GACAAGAGAG CCAAAACAGG CCTTCCTTCT TCAATCAAGG CAATGGTTCC 5340 

CTCCTTGAGT TC TCCTCCAG CAAAGCCTTC ACACTGGATA TAAGAAATCT CTTTCACTTT 5400 

GAGACTTGCT TCCATGGCTA CGTAGTAATC TTGACCACGT CCGATGTAAA AGGCGTTACG 5460 

ACTTGTTTCA AGAAGTTCAC GAACCTTGAC TTCAATGGTT TCTTTCTCTG AAAGAGTTGA 5S20 

TTCGATAGAC TGAGCTACGA TTGACAATTC ATCAACCAGC TCAAAGGCTT CCGCTTTACC 5580 

ATTACCATTT GCTTCTCCGA CTGCTTTTGC AAGGAAGGCA AGGGCTGCGA TTTGCGCTGT 5640 

ATAGCCTTTA GTTGATGCCA CGGCAATTTC AGGACCTGCG TGAAGGAGCA TGGTATAGTT 5700 

GGCTTCACCT GACAGGGTTG AACCTGGAAC GTTTGTCACT GTTAAGCTTG GAATTCCCAT 57 60 

TTCATTAGCC TTGACCAAAA CTTGACGACT ATCCGCTGTT TCACCACATT GGCTGATAAA 5820 

CATCAAGAGT GGTTTCTTGC TGAGAAGTGG CATACCGTAG CCCCACTCAG ATGAGATTCC 5880 

AAGTTCAACT CGTCTATCTG TCAATTCTTC CAACATTTTC TTAGAAGCAA ATCCTGCATG 5940 

GTAAGATGTT CCAGCTGCAA GGATGTAGAT GCGGTCTGCG TCTTGAACAG CCTTAATGAT 6000 

ATCTGGCTCT ACGACAACTT GACCAGCCTC ATCTGTGTAG GCTTGGATCA GTTTCCGCAT 6060 

AACAGTTGGT TGCTCGTCAA TTTCCTTGAG CATGTAGTAA GGGTAAGTTC CCTTACCGAT 6120 

ATCTGACAAG TCAAGTTCAG CAGTGTAGCT AGCACGCTCA CGACGATTTC CATCATAGTC 6180 

TTGAACTTCC ACACTATCAG CCTTGACGAT TACCAACTCT TGGTCATGGA TTTCCATGTA 6240 

TTGCTTAGT7 TCACGAATCA TAGCCATGGC GTCTGACCAG ACCATGTTAT AGCCTTCTCC 6 300 

AACACCAATC AAAACTGGTG ATTTATTTTT AGCTACGTAG ATGACTTCAG GATCTTGTGA 6 3 60 

GTCAACCAAG GCAAAGGCAT AACAACCACG GATGATCTOA AGGGCTTTTT TCAAGGCTTC 64 20 
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AAGAACTGAG AGCCCTTCTT CTTCCGCAAA TTTTCCAATC AAATGAACGG CTATTTCAGT 6480 

ATCTGTCTGC CCCTTGAAGT GGTGACCTGC AAGGTATTCT TCCTTGATTT CAAGATAGTT 6540 

CTCAATCACC CCATTATGCA CCAAGACAAA ACGTTCCGTC TCAGAGCGGT GTGGGTGAGC 6600 

ATTGTCCTCA GTTGGTTTTC CGTGAGTAGC CCAACGAGTA TCTCCGATAC CAGTTGTTCC 6660 

CTCAACACCA GCTGTCTTGG CAGACAATTC TGCAATACGA CCAACCGCCT TCACCAAATG 67 20 

GTTATCAGCA CCATCTACGA CAAAAATTCC CGCAGAATCA TAGCCACGGT ATTCAAGCTT 6780 

TTCAAGCCCT TGAATCAAAA 7ATCAGTTGC ATTTGTGTTT CCAACAACAC CAACAATTCC 6840 

ACACATAGTA T AT AC G AC AC AGGCAAGCTG TGCTTTCTCC TTAAAATTGG TATAGTCTAA 6900 

TTCATCTTTT ATAGAATCAG CAAAAACAGT ATATACTTGT TTCTTTCACT TGTCAAGAGT 6960 

AAAAATTGGT ATAGTTCAAA TTAAGCTCCT GTAAGCATAA AAACTCTGAC CGATTGGGAT 7020 

AATCAGTCAG AGTCCTTTTT AAAATCCATT ATTATCGCTT AATTCTTTGA ACCAGTGGCC 7080 

TGATTTCTTC AGACGACGTT CTTGCGTTTC CAAGTCTAAT TCGACCAAAC CATAGCGATT 7140 

TTTATAGCTG TTGAGCCATG ACCAGCAGTC AATAAAGGTC CAAATCAAGT AGCCCTTACA 72 00 

GTTGGCACCA TCTTCAATGG CACGGTGAAG TTCACGAAGA TGACCTTTTA CAAAGTCAAT 7260 

ACGGTAATCA TCTTGAATCA TTCCATCTTG ACCCAATTTT TCTTCCCCTT CAACACCCAT 7 3 20 

ACCATTCTCA CTCAACATCC ACTCAATATT GCCATAATTT TCCTTGATAT TTTGGGCGAT 73 80 

GTCATAAATC CCTTGCTCAT AAATCTCCCA ACCACGCTGA GAATTGATTT TACGTCCAGG 7440 

CATCACATAA CGCTCCTAAA AATCTTCTGG TAAGAGTGGA CTCTCTGGAT GCTTAGCAAA 7500 

TCGAGGAGCC ATAACACCCA AAGGTTGATA GTAGTTCACA CCAAGGAAGT CCACCGTATT 7S60 

ATCACGAATG AGTTCCAACT CTTCCTCTGT AGCATCAGGT AAAAGACCGT GTTCATGCAA 7620 

GATTTCTACC AACTCCTGTG GATAAGTCCC CAAGACAGAT GGATCTAAGA AAGATTGGGC 7680 

CTGAAAAACC GCCGCAATAC GAGCTGCCTT GACATCACCA GGATGCTGGC TACGTGGATA 7740 

AGCCGCTGTC AAGTTGAGGA CAATCCCAAT CTTGGAATCA GGCAAAAGTT CATGGCAAGC 7800 

CTTAACAGCC CGGCTGCTGG CCAATTGTGT ATGATAGGCT ACCTTAACAG CTGCCTCTGC 7 860 

ATCCACCTTA TGTGGATAAT GGGCATCATA AAAATAACCA AATTCTACAG GAACGATGGG 7 920 

CTCGTTAAAG GTAATCCATT GATCCACTAA ATCTCCATAA GTCTCAAAAC AAAAACGAGC 7980 

ATAGTCTTCA TAGGCTGAGA CTGTCGCCTT ATTTTCCCAA CCATCACCAT CCTCTTGAAG 8040 

GGCAAAAGGT AAATCAAAAT GATAGAGATT GACTAACAGA CCAATTCCTT TAGCCTTAAT 8100 

AGCCTCAAAG ACCTTACGAT AAAAATCCAC ACCTTGAGTG TTGACTTTTC CACAGCCTTC 3160 

TGGAAAAATC CGTGACCACT GAATAGAAGT CCGAAAGCCT G7GTGACCAG TCTCTAACAA 82 20 
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AAGCTCAATA 


TCCCGCTCCC 


AATTTTCATA 


AAAAGTCGAT 


GTCTTATCTG 


AACCAATCCC 


8280 


ATTATAGTAA 


CGATTTGGCT 


CCACTTGGAA 


CCAGTAATCC 


CAGAGATTGT 


CTCCCTTACC 


8340 


GTCACCAGCT 


ACACGTCCTT 


CTGTCTGCGG 


TCCAGAAGTA 


GAGGATCCCC 


AGACAAAATC 


8400 


TT'^GCAAAT 


CTTAGCATAC 


ATTTACCTCT 


TTATCTACTC 


ATTTCTCCCA 


TTATACAGAA 


846C 


/UVU\l>nnVJVJ i 


AAAAJVC^ACT 


T AC ATTTTTT 


CCTTGTTTTT 


CTTCTGATTA 


TAGTTTTTAT 


8520 


L 1 L 1 L kj*— 1 * n 


GG ATTTPAAG 


CGTTTCAAGC 


ACGTTATCTG 


CATGAACCTC 


AATGGTGTCA 


8580 




TGATCTTAAC 


TTCTACAATG 


CCATCGGCCG 


CTTTTTTACC 


AACAGTGATA 


8640 


pggattcgaa 


GACCAATCAA 


GTCACTATCG 


CTAAATTTAA 


CACCGACACG 


TTCGTTACGC 


8700 


TPA'T'PTG'^CA 


AG AC TTC AT A 


ACCAGCTCCC 


ATCAAGCTTG 


CTTCAAGTTT 


TTCTGTCAAG 


8760 




CTTCATCCT fT * 


GACATTGACA 


GTAATCAAAT 


GCACATCAAA 


TGGTGCCAAT 


8820 


TfTTTACCCA 


AATTGATTCC 


CCAAGCGTAA 


CGGTATTCAC 


CTTTTGGCGT 


TTTGTTAACA 


8880 


AAGAGGCCAC 


CCTGTTGCTC 


CATCACTGCT 


GAAAGAAGAC 


GGCTGACACC 


GATACCGTAA 


8940 




TGATTGCCAC 


AGCACGACCA 


TTTTCATCCA 


AGACATCTGC 


TCCCATCCTT 


9000 


GPTGAATAGC 


GAGTTCCGAG 


TTTGAAAATA 


TGACCGATCT 


CAATACCACG 


CGCAAAGTTA 


9060 


& ftfl AC A£*PTT 


CTCCATCTGG 

VJ 4 Vw W 4 \* 4 WM 


GGAAATTTCA 


CCCTCACGAA 


CTTCACGGAT 


ATCCACATAT 


9120 




AATCACGGCC 


TGGGTTCACA 


CCAGTCAAGT 


GGTAGTCATC 


TTCGTTAGCA 


9180 




CA' TWT *CCGAAC 


ATCTTGTACC 


TTACGATCTG 


CAATAATTTT 


AATATTCTCT 


924C 




'"TGH'^CCAAG 

*w 4 vj vj * v#\#rmw 


TGAACCAAAT 


CCTGCTTGAA 


CAACATTCCC 


CACTTCTTCT 


9300 




CGTCAAACAA 


ATCTGCTCCC 


AAGTGATTTT 


TCAACTTGAC 


TTCGTTGAGT 


9360 


TGGTCATTTC 


CAACTAGAAG 


CGCTGCAACA 


AGCTCACCAT 


CTGCAATGTA 


GAAGAGGGTT 


9420 


'^T'AATrf^TTT 
• inAiV.\Jl 1 L 


GTTCTTCTGG 


AACATTGAGG 


AAGCCTGCAA 


CTTCATCAAT 


TGATTTAACA 


9430 


TCTCGCGTTG 


CAACACGAGT 


AACTTCTTCT 


TCAGCUAWAA 




TCCTTTGTAC 

1 VjVJ 4 4 4 VJ 4 A\« 


9540 


TCGTTTGTTG 


CCATTTCTAA 


GTTAGCTGCA 


TAG C TAG ACT 


CACTTGAGTA 


ACCAATGGTA 


9600 


TCTTCACCAG 


AGACTATCCA 


TTTGAGCAAT 


TCTGCCTTGA 


TTTCTTCTTG 


CACTTCTGCA 


9660 


CGAATTTCCT 


CAAATGAGGC 


AACTGACTTG 


TCCAAGACAA 


CCCAGCGGTC 


AAGGTCTGTA 


9720 


CGAGCAGATG 


TAATGGCCAT 


AAATTCTTGG 


CTATCCTTAC 


CACCCATGGC 


TCCACCGTCA 


9780 


CCAATAATAG 


CCTTCAACTC 


TAAACCACTA 


CGAGTGAAAA 


TACGCTCATA 


GGCTCCTTTG 


9840 


TACTCATCAT 


AAACACTATC 


CAAACTATCA 


TAGTTAGCGT 


GGAAACTATA 


AGCATCCTTC 


9900 


ATGATAAACT 


CACGTGTACG 


AAGAAGTCCA 


TTACGCCGCC 


CTTTTTCATC 


ACGATACTTG 


9960 
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GGCTGAATTT GATAAACCTT GAGTGGCAAT TCCTTGTAAG ATTTAACAGA ATCACGGACA 10020 

ATACCTCTAA AGGTTTC7TC GTGAGTTGGA CCTAAGATAA AGTCTGATTT TTCACGGTTT 10080 

TTTAGTTTGT AAAGGTCTTC ACCATAGGTT TCGTAACGAC CTGATTCACG CCACAATTCT 101*10 

GCACTAAGAA GGGCTGCAGC CAACATCTCA ACAGCACCAA TCTTTTCGAA TTCTTGGCGC 10200 

ATGATGTTTT TAGCTTTTTC AATCACACGG TTGGCAAGTG GTAGATAAGA ATAAACACCT 10260 

GCTGAAACTT GGCGAACATA ACCAGCACGC AACATAAGAG CATGGCTGAT AACTTGAGCA 103 20 

TCGCTTGGCA TTTCGCGAAG CGTTGGGATA GGCATTTTAC TTTGTTTCAT AATATTCCTC 10390 

GATTATCTAA AAAAGAGTCG CATAATGTCA TTCCAAGTCA CAGCAATCAT CAAGACAACC 10440 

ATGATGACCA CTCCGGCCAA GGTGACATAG GTTTCAATTT CTTGTTTCAA TGCTTTGCCC 10500 

CGGATGGCTT CTAGGATATT GAGCACAATC TTACCACCAT CCAAGGCTGG AATCGGAATA 10560 

AGATTAAAAA TCCCAATATT GATGGAAATC ATTGCCAAGA AGTACAAGAT ATTTTCAATT 10620 

CCATTTTTAG C AG CATC ACT ACTTGCCTTA AAGATAGCAA CAGGTCCACC CAACTTGTTC 10680 

AAATCTGGTT GGAAAATCAG ATTTTTCAGA GCTGAGAGAA TTCGGAGAGC TGAGTCAGCA 10740 

CCAGTTCTAA AACCACCTAC AAACATGGAT AGAAAATCTG ACTTAACCCC CCCTTGAACA 10800 

CCTAGAAGCT AACGACCTTG ACTATCTTTG GGTGTAACAC TCACTTGTTT GTCACTCCCC 10860 

TTTTCAGAAA TAGTCACATC CAAAGTCGGT GCCGTCTTAT CTTTGGTTTC TGTTTCCACA 10920 

CCTTGCATCA AGCTTTCCCA CTTGCTAACC TCATGTGAGC CAATCTTGGT AATTTCTCCC 10980 

ATTTCTGGTA CTCCTACCTT GCCCAAGGCA CCTTGGGGCA TGATATGGAA CTGATTCGTA 11040 

TCAACATCTC TGACACCACC CTGCATAAAG ATTAAAACCC AAAAAACAAC CACACCTAAC 11100 

ATAAAATTGT TCATAGGACC TGCAAAATTG GTAATCAGTT TGCCCCAGAT AGTCGCATTT 11160 

TGATATTGAA CATCTAAAGG TGCAATCCGA ACCTCAGTAC CATCTGCTTC CACAACCGTT 11220 

GCATCGTGAT CCACTGCAAA TGTTTTTTCT TCTTCCAGAA CCAATCCTTT GATAAAGAGC 11280 

TTGTCTTCAA AATCAAACTG GCTCACCTGC ATAGGGAGGG CTGTTTGATC CAATTTTTi A 11340 

CCTGAGAGAT TGATGCGTTT AACCTTACCA TCATCAGCAA GTGTCAAACT AACAGGCGT? -.1400 

CCTCTCTTGA TTTCAGTTGT ATCATCACCC CAACCGGCCA TGCGGACATA GCCACCCAGA 11460 

GGCAAGATTC CAATCGTATA GGCCGTTCCA TCCTTGCCAA TGTGAGCAAA AATTTTAGGT 11520 

CCCATACCGA TGGCAAATTC ACGTACTAAA ATCCCTGATT TCTTGGCAAA GTACAAGTGA 11580 

CCGAACTCGT GCACCACTAC AATAATCCCG AAAACCAGAA TAAAGGTTAA AATTCCGAGC 11640 

ATAGCGTTTC CTCCGTCTTT TGATTAAAAG AGTCCAAATA AGTGCATGAT TCCAAATACA 11700 

AGCAACATAC TATCGAAACG ATCCAAAACA CCACCATGTC CAGGGATAAA TTTCCCAGAA 11760 
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TCCTTAACAC CAAAATGACG TTTGATCGAA CTTTCTAGTA AATCACCAAA TTGTCCAGCA 11820 

ATGCTAAAGA AAATAGCAAA GACTGACATC 7TGTAAAT7C CATATGGAAG AGCAACTGTA 11880 

CTCTCAACTA 7CATAACGA7 AATGGTTACT AAAATTGCTC CTAAAATACC ACCCAAGGCA 11940 

CCCTCAAGGG TTTTATTAGG CGATACCCTT GGTGCTAACT TTCGTTTCCC ATAGTTCATC 12000 

CCAACAAGAT AGGCACCACT GTCTGTCGCC CAGACGATAC ACAAGGCTAA GAGAGCCTTG 12060 

7CCAAACC7G CAACACGAGC ATCTAGTAAA GCATTAAATC CAAAGCCCAC GTAGAAGCTC 12i20 

ATAGCAAGAG GGAAAACCGC ATCCTCAATC GTATAAGACT TGCTAAAAAC GGTCG7TCCT 12130 

AACATGATTG AAATCAAAAC ACTATAGGCA ACCACATTCC CATCAACTGG CAAAAAAGTC 1224 0 

AGGTAATTCT CCAAGGGAAT GGTCAATGCA AAGGTTGCAA AGAGGGTCAA GACGCCCTCC 12 300 

ATCGTCATGG TCTCTAGACC TCTCATCTTC AAAAGTTCAT GCATGGCTAG CATGGCTATG 12360 

ATTCCGATTG CTATCTGAAG CAAGAGGCCC CCAATCATTA AAATTGGTAG GAAAATAGCC 12420 

AGGGCAATCC CTGCAAACAA GGTTCTTTTC TGTAAATCCT GGGTCATATT TCCTCCTAAA 12480 

CTCCTCCAAA TCGGCGATGA CGACGATTAT AGGCAAGAAT AGCTTCCTGC AAGGCCGCTT 1254 0 

CGTCAAAATC AGCCCA7AAG GTGTCCGTAA AATAAAGCTC ACTATAGGCT CCCTGCCATG 12600 

GAAGGAAATT GCTCAAACGT AATTCTCCAC TAGTACGGAT AATCAAGTCT CCGTCTCGTA 12660 

AGTCCTTAGG CAAATGCTGA GTAAAGAGAT AGTTACCAAT CAATTCCTCT GTGATGTCAC 12720 

CTGGGTTGAT TTTGGCATCT AAAACATCCT GGGAAATCAA CTTAAGCGCC TGTGTAATCT 12 780 

CAGCACGTCC ACCATAGTTA AGAGCAAAAT TAACAATCAA TCCTGTGTTG TTCTTAGTCA 12 840 

ATTCCTCAGC CTTGGTTAAA GCTTCAAAGG TTTGC7TAGG CAGGCCG7C7 GTCTCCCCAA 12900 

TCATTTGAAT CTTAACATTA TTCGCATGTA GTTCCGGGAC ATAATTATCA TAAAACTCTA 12960 

CTGGCAACTT CA7GATAAAC TTGACTTCCT GATCTGGACG CGTCCAGTTT TCCGTAGAAA 13 020 

AAGCATAGAC CGTAATAACC TTGACGCCCA GTTTGTTGGC TGCCTTGGTC ACGGTTTGCA 13080 

ATGCTTCCAT GCCCGCCTTA TGTCCAAAAA CTCGCGGTTG CATACGTTTT TTAGCCCAAC 13140 

GCCCATTGCC ATCCATGATG ATGCCGATAT GAGCAGGAAC CTGTGTCGGA ACCTCTACTT 13200 

CCACAGCCTT ATCTTTCTTA AAAAATCCAA ACATGATCTT ATTCCTATTC AAAAATCTAT 13 260 

CGTTTCATTA TACCATATTT CCCCATTTTC TTC7A7CAC7 AAGCTATTTA TTCTCAGGCA 13 320 

CCAAGCCCAT TTT7CAAAAA AATAAGCCGC C7GA77GCGC GAC7TTA77T TTA7AGGGAG 13 3 80 

AT7A7TA7GA AAAAGTTTTA GGAGTTTAAG 77AAGG7CT7 C7TAACT7A7 GAAC7TAG7G 13440 

7ACACTCCC7 AGCTTAAAG7 TTCCTTAAG7 A7TT77AAAA A7CAAAT77T 7CCA77TCTC 13 500 
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CTGCCAATTT 


TTCTTGGATA 


AACGTGTTTG 


ATAGAGTTCC 


ATTCGGTCTT 


CATTTTCTAA 


13S60 


GAAATGAGCA 


GTTGCACGAA 


CTTGAAAATT 


CAAAATATCC 


TCCAAACCAT 


AAGGTACATA 


13620 


GAGTTCAAAA 


TCTAATTCTT 


CATTCAAGCG 


CACTCCAACT 


GCCGTACACC 


GTTCTGGATA 


13680 


CTTACTCATA 


GCATCACGAG 


AACTGGTATA 


GGAAGCAGTG 


TGAGGACTGT 


GCTGATGCAT 


13740 


ATAGACCTGA 


TTTTTCAATT 


CCCACTGGTA 


CTGAGGAAAA 


TCCTCTCTCA 


GCTTTTTCTC 


13800 


CAGTAATAAG 


GTTTCCTCAT 


AAGAAAAATC 


7GGATCAAAG 


AAAATCACAT 


CTATATCTGT 


13860 


TTCATGATCA 


AAAGGGCAT? 


TCTCTGACAA 


AAGATTCCAG 


ATGAAATTTC 


TGACAGAACC 


•3920 


TGCTGCCAAC 


CACGAGTCTT 


TCAAACCAAG 


GTCTCGGATC 


ATCGTCAGAA 


TGGCCATCAT 


13980 


ATCTGGACTT 


TCTCTAAAAG 


CCTCTAAGAT 


TTCTTGCTTA 


TTTTTCACTG 


TATTCATAAC 


14040 


CTAAGTGCTC 


ATATGCCTTA 


GCAGTCGCCA 


CCCGTCCAGA 


CCGTGTCCGC 


ATGATAAAAC 


14100 


CTTTTTGAAT 


CAAGTAAGGC 


TCATACATGT 


CTTCAACTGT 


CTCACGCTCT 


TCGGCCATAT 


14160 


TCACAGAAAG 


AGTTCCTAGA 


CCAACAGGTC 


CTCCACTCTA 


CATCTCAATC 


ATGGTGCGAA 


14220 


GGATTTTTTG 


ATCCACATAG 


TCCAAACCTT 


CATGGTCAAC 


ATCCAGCATA 


GTCAAACCCT 


14280 


TATCCCTAAT 


AACATCATCG 


ATAACCCCAT 


TCCCCATTAT 


CTGGGCAAAA 


TCGCGCACGC 


14340 


GCTTGAGGAG 


ACGATTGGCA 


ATACGAGGGG 


TTCCACGACT 


ACGTAGGGCC 


AACTCAGATG 


14400 


CTGCCTCATG 


GGTGATTTCC 


ATCTCAAAAA 


TATCTGCCGT 


CCGCTCGACA 


ATTTCTGTCA 


14460 


AGTCAGCATG 


AGCATAATAC 


TCCATATGAC 


CTGTAATCCC 


AAAACGTGCC 


CCTAGTGGAT 


14520 


TTGAGACCAT 


ACCAGCCCGA 


GTCGTCCCAC 


CAATCAAGGT 


AAAAGGAGGC 


AACTCCAAAT 


14580 


GAACACTCCG 


ACTGCCTTCA 


CCAGCCCCAA 


TCATAATATC 


GATGTAGAAG 


TCCTCCATGG 


14640 


CACTATAAAG 


CACTTCTTCC 


ACTGACATGG 


GTAAGCGATC 


AATCTCGTCA 


ATAAAGAGGA 


14700 


CATCTCCAGG 


CTCTAAATCA 


TTCAAAATCG 


CTACCAAATC 


ACCCGCTTTT 


TCGATAACAG 


14760 


GACCAGACGT 


TTGCTTCAGA 


TTGACTCCCA 


GTTCATTGGC 


AATGACAAAA 


GCCATGGTTG 


14820 


TTTTCCCAAG 


CCCTGGAGGG 


CCAAATAAGA 


GCACATGATC 


CAGCGCTTCA 


TCCCGCATTT 


14880 


TAGCGGCTTC 


GATAAAGATC 


TGAAGTTGAT 


CCTTAACCTT 


ATCCTCACCA 


ATATATTCAC 


14940 


GTAAATACTG 


AGGACGGAGC 


GTGCGTTCTA 


CTAACTCCTC 


ATCACCCATC 


ATCTCATTAT 


15000 


CTAAAATTCT 


ACTCATGGCT 


CTATTATATC 


AAAAAAAACA 


AGCCACAAAC 


AAAAAAGCCA 


15060 


CCTGATTGGG 


TGACTCCTAA 


G7TTAGCACT 


TATGTCGTAT 


AATATTATAC 


GGCACTTCTA 


15120 


CACCGCCTAC 


GAAAGGAGGT 


GAGATAGCCC 


ATGATGGAAT 


TAGTACTCAA 


AACTATTATC 


15180 


GGACCAATTG 


TGGTCGCTGT 


CGTTCTTCGT 


ATAGTCGATA 


AATGGCTAAA 


CAAGGACAAA 


15240 


TAGTGTCAAA 


AAAGACCTCA 


AGCTTATTTG 


GTCGTGAGCT 


TGCGGTCTTT 


TCTAGCCTAT 


15300 
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GATATAGAAC 


TAGTACTCAA 


TT CCTTTTT A 


TTATCCCATA 


GTTCACGAAT 


TTTGTCAAAA 


15360 


CTTT ACATTT 


TCTTCAACCG 


CTGTACGACA 


AGACGGTTAA 


GATTAAGAGA 


ACGTTAGCGA 


15420 


TTCTATCAAT 


TTCATAGAAA 


TTTTGATTTC 


GTAAACGAAG 


AGACAATCTT 


ACATGTCACT 


15480 


T( "T 1 '"* A'PT'T A A 

4 \_ i i 1 i rW 


TACGCCACTA 


CTAGACAAGC 


AAAATCATTA 


TTACAGTAGT 


TCCAGTCCTT 


15540 


TAATTAACAC 


TCACTTACAA 


TCAAATTGAG 


TTTCAACTAG 


CTGAAGCGAC 


CACAGACCTA 


15600 


TTTPTT A<"«T f" 

l a 1 v_ 1 1 rV\J 4 ^ 


ATATTCGCTA 


AAAAAATCCC 


CGCCAAAATC 


TCAAAAAGTC 


CCCGCCAATT 


15660 




AATPrt^AAAA 


ATACCGAAAA 


ATATCGAAAA 


ATTATTTTTA 


GAATAGTCCC 


15720 




AAATAGAGCT 


AAAAAACTCC 


ACCTGATTCG 


GTGGAGTTAA 


GGGAGATTAT 


15780 


T ATd AAAAAG 


AAAAGTTTAC 


GATTTTATTA 


AATAAACTTA 


GGAGGTCTTT 


ATTTAATAAC 


15840 


TACATGATAC 


AAGACGAAAC 


TTAAAACTAG 


CTTAACTTTT 


C T AAAATTTT 


ACTATTTTGC 


15900 


AAAAAATTTC 


TATCACCAGC 


ACCTCACCAA 


TCGAGTAGGG 


GATAATCTCT 


AGCCCCTCTC 


15960 


ACACCACCCT 


ACGTGCCGTT 


TGGCATACGG 


CGGTTCAACT 


AACTTTTAAC 


GCATGTCGTT 


16020 


CAAGGTAATA 


ATCCAAACAC 


GAAACCAGTC 


CACGTTTTTC 


CAGGACTGGT 


TTTGATATAG 


16080 


CACGTTTAAG 


TACCGACTTC 


TGAGCTACTA 


ATTGATAATG 


GTCGCCCCAG 


CCAGATACCT 


16140 


TATCTGCTAT 


CCATTTAGGA 


ACTCCTAACT 


TAAGCAATCC 


CCATAATCGT 


CTCGATTTCT 


16200 


TCTTCCATTG 


CTTCCAGATA 


ATCACTCGTA 


GGCGAGTACG 


CAAGCGCTCA 


TCTATGCTGC 


16260 


CC ACT AT ACT 




CCCAATCACC 


AATACTTTAT 


CCATCCTCGA 


AT AG AC AAA T 


16320 ' 


7CAGTTCCTC 


AATACGTCTT 


GT7AGGTCTA 


TACTCCATTT 


CCTCTCTGTT 


AGTTTCTTCA 


16380 


ATTTAAACTT 


AAATCTCCGA 


ACACTATCTT 


GATG7GGACG 


GCTTTTCCAA 


CCATCTGATA 


16440 


ATTTCCAGAA 


CCCAAAACCT 


AGATATTTCA 


ACTCTCTTGG 


TCATGTTTAC 


TTTCAAACCT 


16500 


AGCCCTTTCT 


CAATAAACGA 


CTGACTGAAT 


ACATC 






16535 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8136 base paxrs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : double 
(0) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CCAGAGCGTT CCCTCCGAAA GTCTATCCAG ACACGGCTCT TTAAAAACAA AAGGAGAAAT 60 
GATGCATACT TATTTCCAAA AGAAAATTGA AAA T AT C AAA ACAACCCTAG GTGAAATGTC 120 



WO 98/18931 



PCT/US97/19588 



618 

ACCTGGTTAC CGTCGTATGG TTGCGGCTAT GGCTGATTTA GGATTTTCAG GAACTATGAA 180 

GGCTATCTGG GATGACCTCT TTGCCCATCG TAGTTTTGCC CAGTGGATTT ATTTGCTGGT 240 

TTTAGGAAGT TTTCCTCTCT GGCTGGAGTT GGTTTACGAA CATCGTATTG TTGACTGGAT 300 

TGGGATGATT TGTAGCTTGA CAGGGATTAT CTGTGTAATC TTTGTATCGG AAGGTCGAGC 3 60 

AAGTAATTAT CTTTTTGGCT TGATTAACTC TGTTATTTAC CTTATTTTGG CCCTACAGAA 4 20 

AGCCTTTTAT GGTGAGGTGC TGACGACACT TTACTTCACA CTCATGCAGC CAATTGGACT 4 80 

TCTAG7TTGG ATTTATCAGG CACAGTTTAA GAAGGAAAAG CAGGAGTTTG TCGCCCGTAA 5 40 

ACTGGACGGC AAGGGCTGGA CAAAGTATCT TTCCATTAGT GTGCTTTGGT GGTTGGCCTT 600 

TCGCTTCATT TATCAGTCTA TTGGTGCCAA TCGTCCCTAT CGTGATTCAA TCACAGATGC 660 

AACCAATGGG GTAGGGCAAA TCCTCATGAC AGCTGTTTAC CGTGAACAGT GGATATTCTG 720 

GGCGGCTACC AATGTCTTTT CAATCTATCT CTGGTGGGGA GAAAGCCTGC AAATTCAAGG 780 

GAAATATCTA ATTTATCTCA TTAACACTCT AGTTGCTTGG TATCAATGGA GCAAGGCAGC 840 

TAAGCAGAAT ACTGATTTAC TTAACTACCA AAAGATGTTT GAAAGTGCTG TTTTGAGATT 900 

TCGATTAAAA C AC AT AT ACT TGATAATCAA GGATTTATAG TATGAAAAAG AGCATCGGCC 960 

GGTCCTCTTT TGTTGTTGAA AACATAAAAA ACTCAGTAAC CTAGAAATAA CACAACTGAA 1020 

GCTTTACTCT ATATTCAATT TTTAGGAATG AGAAGGTCTA GATAAAATTG GACAACTTCC 1080 

TCGTCTGTGA AATCTTGACC TTTTTTGAGC CACCAGGTCA ATGTCTCGAT AAAGTTGGAC 1140 

ATGACCAAGT GTTGGAGGTA ACAAGTACCC AGATTAGGGT GGGCTTCTTT TAAATTATCA 1200 

GCTAGCACGG AATAGACATG CTG7TCTACC TCTTTATGGA GTTGACGGAG GAAGTAGTCA 12 60 

TTTTTGGAAA ATAGCAGACT GGTGATATGG TCTTGGTTTT TATGAAAATG GAGAAAGAGG 13 20 

TGGGCGAGGT AGTCCTCGCT TGAAATGGCT TCCTCTCTTT CAAAAAGATG ATGGAAGAGC 1380 

TAGCGGCAGA GCTGGTCCAG AAGAAGCTCC TTACTCTCAT AGTGACAGTA AAAGGTGGAT 14 40 

CGTCCCACAT CTGCGAGATC AATGATATCC TGAACAGTAG TGGCCTCGTA GCCCTTAGCA 1500 

TTCAAAAGTT GTATAAAAGC TTGATAGATG GCTTTTTTGG TTTTGCTGAT ACGGCGGTCA 1560 

ATGTTAGTCA TATGGACACT TAACGCAAAT TCTTCACAAC TGAATAAAGC TGACGTTTTG 1620 

CTTCTATCCT TTCTTTGAGT TTTAGTGGAT AATGATAATG AACAAGGTGT TCATAAATCT 1680 

ATTATAACAA AGGAATGAGA AATATGAAGG CAAAATATGC TGTTTGGGTG GCTTTTTTCT 17 40 

TAAATTTGAC TTATGCCATT GTTGAGTTTA TTGCAGG7GC AGTATTTGGT TCTAGCGCTG 1800 

TTCTTGCTGA CTCTGTCCAT GACTTGGGAG ATGCGATTGC AATTGGAATA TCACCTTTTC 18 60 

TAGAAACAAT CTCCAATCGT GAAGAAGACA ATCAGTACAC CTTGGGCTAT AAGCGGTTTA 19 20 
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4 



GCCTGCTAGG 


AGv.(- in vjo I A 


A*» Alii. 1 \J I w A 


TTCTCGTAAC 


GGCCTCTGTT 


CTAGTCATTT 


1980 


TGGAAAATCT 


CAGGAAGAi. . 




AACCAGTCAA 


TGATGAGGGG 


ATTCTCTGGT 


2040 


TAGGAATTAT 


TGCGATTAL. . 


ft Tf* ft ft TrTP.T 




GGTGGTTGGT 


AAGGGAAAGA 


2100 


CAAAGAATGA 


GTCTATTCTG 


Au 1 1 IjtH i 4 




TACGCTAGGG 


TGGGTAGCTG 


2160 


TTATCCTGAT 


GGCGATTGTT 


Li I Lun I 1 in 


rrV.ACTGGTA 


TATCCTAGAT 


CCTCTTTTGT 


2220 


CCCTTGTCAT 


TTCTTTCTTT 


ATTCTTTCAA 


AAuLLL I ILL 


ACGTTTTTGG 


TCTACACTCA 


2280 


AGATTTTCTT 


GGATGCTGTG 


CC AGAAGG i v_ 


1 a DA i A i U/vl 


GCAAGTAAAG 


AGTGGCCTGG 


2340 


AGCGATTGGA 


CAATGTGGCC 


1 1 Aft 1 u 


AflCTT AAT CT 
nuv. i i (v> i ^ * 


CTGGACTATG 


GATCCTTTGG 


2-400 


AAAAAAATGC 


C ATTGTC CAT 


G I I lulL; An 


AAftAAATGCA 


ACATATGGAA 


ACTTGTAAAG 


2460 


AGTCTATTCG 


AATTTTCCTA 


ft ft ft r* & TTT?m 
aaaua 1 lwiu 


nTTTTCAAAA 


T ATT AC CAT? 


GAAATTGATG 


2520 


CTGACCTAGA 


AACTCACCAA 




GAAAGGTGTG 


TGACTTGGAA 


CGGAGTTATG 


2580 


AGCATCAACA 


TTAGAAAAAA 


r'TT' ft ft ft ft ATA 


PTTHT.GT ACT 


ATCTTATTTG 


CAATAGAGTA 


2640 


ATTTCTTTAT 


TATTTAAATA 


(nmrvif* ft ft ft & ft T 

TTTCAAAAA 1 


1 LrL# 1 nvwnun 


AGAGCATTGT 


ATAAACTCCA 


2700 


GATATATGAT 


TGTTAATGAT 


r ft ft ft iTT'WT 
AAnnnl till 


PftATTAGATA 


CAAAATGCTT 


GACTTGGACT 


2760 


CAACTCAAAG 


TTATATAATA 


ft^ ft ft /"*Tf ft 

AGATAAG I Li A 


Vi i l Aunn i #%w 


CGTGAATTCA 


GTG AATGAAA 


2820 


TGAGAGGAGG 


TTAGCGTGTG 


ft ft 1* & ft ft IT 

AATA I I AAA i 


rrnpf* Afzre A 


TTTGTTGGGA 


ATTTCAGCGG 


2880 


ATACGATTCG 


GTATTATGAA 


CGGGTTGGTG 


TTCTC.CC \CC 

1 L Vj i LjLV-AL L. 


GATTACTCGT 


ACTGCTACTC 


2940 


GGATTCGTGA 


TTTTCAAGAT 


^ ft iT' ft T ft TrT" 

CAGGATA I GG 


ft AnPftPTCClA 


ATTTATTAAG 


TGTTTTCGTT 


3003 


CGGCGGGTGT 


CTCTGTAGAT 


AG *T. 1 nu 1 lu 


AfTATATGTC 


GCTCTACCAA 


AACGGAGATG 


3060 


AAACGAGAGA 


GGAGAGGCTT 


GGT A i 1 1 I AL» 


AAr; ac:c AAAA 


GCAAAAATTA 


GAGGAGCCCT 


3120 


TGTCTCAGCT 


AC AG AC AC CT 


1 * AAA 1 L,\j i * 


TAAATCTCAA 


AATTAAACTT 


TATAAGGAAG 


3130 


GAAAATTTTA 


AATGAAATCA 


GCAGTATATA 


CAAAGGCAGG 


TCAGGTTGGA 


PTTrPT iff* A 


3 2 40 


TTGAACGTCC 


GCAAATAATA 


GAAGCGGATG 


ATGTGATTAT 


TCGTGTGGTT 


CGTGCGTGCG 


33CO 


TTTGTGGTTC 


AGATTTATGG 


AGGTACCGTA 


ATCCAGAAAC 


GAAAGCTGGA 


CACAAAAATA 


3 360 


GTGGACACGA 


AGCGATTGGG 


ATTGTTGAAG 


AAGCTGGGGA 


AGCCATTACG 


ACGGTGAAAC 


3420 


CAGGTGATTT 


TGTGATTGTC 


CCTTTTACAC 


ATGGATCTGG 


TGAGTGTGAT 


GCCTGTCTTG 


3480 


CTGGATTTGA 


CGGTTCTTGC 


GACAATCATA 


TTGGCAATAA 


TTTGGGGCGT 


GA'ITTTCAGG 


3540 


CAGAATATAT 


TCGCTTCCAC 


TATGCAAACT 


GGGCGCTGCT 


TAAAATCCCT 


' CG7CAACCTT 


3600 


CTGACTATAC 


: AGAAGGGATG 


; CTCAAGTCCC 


TTTTGACTCT 


TGCAGA7GTC 


* ATGCCGACAG 


3650 
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GCTATCATGC GGCGCGTGTT GCAAATGTTC AAAAAGGGGA CAAGGTTGTT GTTATCGGTG 37 20 

ATGGGGCTGT TGGTCAATGT CCTGTCATCC CCCCTAACAT GCGTGGAGCA TCACAAATTA 3 7 80 

TCCTTATGAG CCGTCATGAA GACCGTCAAA AGATGGCTAT GGAGTCAGGT GCGACAgcTG 3 840 

TTCTTGCAGA ACGTGGTCAA GAAGGAATTA CCAAGGTGCG TGAAATCCTC GGTGGAGGAG 3900 

CAGATGCAGC ACTTGAATGT GTTGGTACGG AGGCTGCTAT AGAACAGGCG CTAGGTGTTC 3 960 

TTCATAATGG AGGGCCTATG GGCTTTGTAG GAGTCCCACA CTATAATAAT CGTGCTCTTG 4 020 

GTTCCACATT TATGCAAAAT ATCTCTGTAG CAGGTGGGGC AGCTTCTCCT ACAACATACG 4080 

ATAAGCAATT TTTACTAAAA GCCGTCCTTG ATGGTGATAT CAATCCAGGT CGCGTCTTTA 4140 

CTTCAACTTA TAAACTGGAA GATATCGACC AAGCCTATAA AGATATGGAT GAACGTAAGA 42C0 

CAATTAAGTC TATGATTGTA ATCGAATAAA AAACGAATAG GAGTTTTAGA ACTCTATTCG 4260 

TTTTTTATGT TATCCTATTC TTGATTTAGG GTACTTTCTC TTAATGTCAG TCTGGTTCCC 4 3 20 

AGCATGGTCA GGCTAGGGAT TTTCCGACCG TGGAGGACTT CCTT G TTAAG AATATCCATA 4380 

CCTGCTCCGC CCATTTCTTC ACTATAAACT GTAATACTAG AGAGGGGAGG ATAGACCTGT 4 440 

TTGGTCACAC TAGTGTCGTT AAAGCAAATG AGGCTGACCC CATCTGGCAG GCTGATTCCA 4500 

GCTTCTTGGA GGGCACGGAG GGCACCGATA CCTAAACTAT CGCTGGCTCC GAAAAATGCT 4560 

GGCGGAACTT GGTCTCCCAA GCTCTGAATG GCCTCCTTCA TTAAGTCATA GCCACACTGG 4620 

GCAGTAAATC TTCCTTGAAA GACCAGTTCA TCATGATAGA TTCCCCTCCC TTGACTATAG 4680 

TTT7TGAAG? TTTCTACACG CTTGTCCTGA ATGATTTCTT CTTCCTCTGT TGTTTCTTCA 4 7 40 

AGGCCTGTTA GAATCCCGAT ACGGTCCATT CCTTGACTGA GGAAATAATC GACAACCTGT 4 800 

TTCATAGCAG TGTAAAAATC CCTGATAA7A CAGCTATGTC CCAGGGAAAG TGTATCGCTG 4860 

TCTAGAAATA CAAGAGGCTT TTGCTATTCT TCAAAGGCAG AAATCTGAGC TCGACTAAAC 4 920 

TTTCCCATGC AGAGAATCCC AATCACTTCC TCGCTTAGGG TAAAACGCTG GTCATTAAAA 4 980 

TAGCGCAAGA TATCATAGTC CAACTCTTGG GCTCTTTTTT CTATTCCTAG GCGAATCTGG 5040 

TAGTAGTAGA GGTCGTCCAG CTCCCCTTGT TCGCTGACCC ATTGCATAAT GGCAATCTTT 5100 

TCCTTGGGTT TGTGGGACTC GCCTGTCTTG AGGTGCTTGG TGTAGCCCAG CTCTTCAGCA 5160 

ACGGTTAAAA TACGGTGTCT GGTTTCTTCT GTAACAGATA GGCTCTGGTC GCGGTTGAGG 5220 

ACGCGGGATA CCGTCGCGA7 AGAGACAGAG GCTAGCTGTG CAATGTCTTT TAAGGTAGCC 5280 

ATAAATCCTC CTTGATT AGG TTAGTATATC ATG7TTTTCT TCTTTTTACT GATATTTTAC 5 340 

TAAAATTTTA GTAAAAAGGA TTGACCTTGG AAAATTCCTT GGATATAATA GAAAGAAAAC 5400 

CATTACACCT TAAGATGGCT TAACGGACAG TCAAAGGAGA ATTCATATGG CACAACATCT 54 6C 
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TACTACTGAA GCCCTTCCCA AAGACTTTCT TCCTGTTTTT GGTCAAGAAG CAGATCAAAC 5520 

CTTCTTTTCA CCAGGCCGCA TTAATTTGAT TGGTGAACAC ACAGACTACA ACGGTGGGCA 5 580 

CGTTTTTCCT GCTGC7ATTT CCTTGGGAAC TTACGC7GCA GCTCGTAAGC GTGACGACCA 5640 

AGTCTTGCGT TTCTACTCAG CTAACTTTGA GGACAAGGGC ATTATCGAAG TGCCTCTCGC 5700 

TGACCTCAAG TTTGAAAAAG AGCACAACTG GACCAATTAT CCAAAAGGTG TCCTTCATTT 5760 

CTTCCAAGAA GCTGGGCACG TGATTGACAA AGGTTTTGAT TTTTATCTTT ATGCAAATAT 5820 

TCCAAATCGT GCTGGCTTGT CTTCTTCTGC ATCCTTGGAA CTCTTGACAG GAGTCGTGGC 5880 

TGAGCATCTC TTTGATT7AA AATTAGAGCG TCTCGATTTC CTTAAAATCG GCAAACAAAC 5940 

AGAAAACAAC TTTATCGGAG TAAACTCTGG CATTATGGAC CAGTTTGCTA TTCGTATGGC 6000 

GGCAGACCAA CGTGCTATTT ACCTAGATAC TAATACTTTA GAATACCACT TGGTGCCACT 6060 

TGATTTGAAG GACAATGTCG TTGTTATCAT GAACACCAAC AAACGCCGTG AATTGGCGGA 6120 

CTCTAAATAC AATGAACGTC GTGCTGAGTG TGAAAAAGCA GTGGAAGAAT TGCAAGTTTC 6180 

CTTGGATATT CAGACTCTGG GTGAATTGGA CGACTGGGCC GTTGACCAAT ATAGCTATCT 6240 

GATTAAAGAT GAAAATCCTT TGAAACGTGC TCGCCATGCT GTGCTTGAAA ACCAACGTAC 6300 

CCTCAAAGCT CAAGTAGCAC TCCAAGCAGG AGATTTGGAA ACATTTGGAC GCTTGATGAA 63 60 

TGCGTCACAC GTTTCTCTGG AG CAT G ATT A TGAAGTAACT CGTTTGGAAT TGGATACCCT 6420 

TGTTCACACA GCTTGGCCAC AAGAAGGAGT TCTCGGTGCT CGTATGACAG GCGCTCGTTT 64 80~ 

TCGTCGCTCT GCcATTGCCT TCCTTCAAAA AGATACTGTT GAGGCCTTTA AGGAAGCTGT 6540~ 

AGGCAAACAC TACGAGGAAG TAGTTCCATA CGCTCCAAGC TTCTATATCG CTGAAGTTGC 6 600 

AGGTGGCACT CGCGTCCTTG ACTAGTCAAA ACCAGCCTCT ATAGTGACCT TAGTAAATAA 6660* 

ATTTGTAACA CATGTCATTT CTGAAAGCTC ATTTGACGAA ATGGATCGAA TCTATCTGAC 6720 

CAATCGTGTT TTGGCACGAG TGGGAGAAGG TGTTTTGGAA GTTGAGACCA ATCTGGATAA 6780 

ATTGATTGAC CTCAAGGACC AGCTGGTTGA AGAAGCCGTT CGATTAGAGA CGATTGAGGA 6840 

TAGTCAGACT GCGCGTGAAA TCCTTGGTGC TGAACTGATG GATTTGGTGA CTCCTTGTCC 6900 

AAGTCAGGTC AATCGTGATT TTTGGGCAAC CTACGCCCAC TCTCCAGAAC AAGCGATAGA 6960 

GGATTTTTAC CAACTCAGTC AGAAAAATGA CTACATCAAA CTCAAGGCCA TTGCTAGAAA 7020 

TATCGCTTAT CGTGTTCCAT CTGACTACGG AGAACTTGAA ATTACCATCA ATCTCTCTAA 7 080 

GCCTGAAAAA GATCCCAAAG AGATTGTGGC AGCCAAGTTG GTGCAAGCTA GTAATTATCC 7140 

TCAGTGTCAG CTTTGTCTAG AGAATGAGGG CTACCATGGT CGAGTTAACC ACCCACCTCG 7200 
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TAGCAATCAC CGTATTATCC GTTTTGAAAT GGTTCCTCAG GAATCGCGTT TCCAGTATTC 7 260 

CCCCTATCCT TACTTTAA7G AGCATTGTAT CTTTTTAGAT GGCCAGCATC CTCCCATGGC 7 320 

CATTAGTCCT CAGAGTTTTG AACGTCTCTT GGCTATCGTA GACCAGTTTC CAGGATATTT 7 380 

TGCTGGATCT AATGCCGACC TGCCGATTGT GGGGGGCTCT ATTCTAACTC ATGATCATTA 7 440 

TCAGGGAGGC CGTCACGTAT TTCCTATGGA ATTGGCTCCC TTGCAAAAGG CCTTCCGATT 7500 

TGCTGGTTTT GAGCAGGTCA ACGCTGGAAT TGTCAAGTGG CCCATGTCTG TCCTACCTT7 7560 

GACTTCGGAT TCCAAAGAGG ATTTGATCAA TTTGGCTGAT AAGATTTTCC AGGAATGGCG 7 62 0 

CCAGTATTCA GATCCTGCAG TGCAGATTTT GGCAGAGACA GACAGGACAC CGCATCACAC 7 680 

TATCACACCC ATTGCCCGCA AACGCGATGG ACAGTTTGAG TTGGACTTGG TCTTGCGAGA 774 0 

CAATCAGACT TCAGCAGAGT ATCCTGATGG TATCTATCAT CCCCACAAGG ATGTCCAACA 7800 

TATCAAGAAG GAAAATATCG GCTTGATTGA GCTCATGGGC TTGGCAATCT TGCCACCACC 7 860 

TCTGAAAGAA GAAGTGGAGC AAGTCGCTAG CTATCTTGTA GGAGAAGCTG TTACAGTTGC 7920 

CGATTATCAT CAGGAGTGGG CAGACCAACT CAAATCCCAA CATCCACACT AACGGATAAA 7980 

GAAAAAGCCC TTGCAATCGT CAAGGACTCT GTCGGTGCTA TCTTTGCGCG 7GTACTTCAC 8040 

GATGCAGGAG TCTACAAGCA GACAGAACAA GGGCAGACAG CCTTTA7CCC CTTTGTGGAA 8100 

CAGGTCGGAA TTTTACTAGA CTAGGAGCTT TCTCGC 0136 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

CCCATAGTGA AGAGTGGCCA TAAGAAGGTC TTCTAGGCTT AATTTAGGTT TTCGTCCACC 60 

TTTTGCGTGT TTAAGTTGAT AAGCTGTTTP TAACACAGCT GAACATCTCT TCAAAAGTCG 120 

TGCGCTGAAC ACCAACAAGA CATTTAAATC GTGTATCAGT TAGTTGTTTA CTTGCTTCAT 180 

CATTCATAGA ACTACTATAC CATGTTTTGT TTCGCAGGAA GTCTAATATT GTCAAATACT 240 

GGAACGCTCA TTGCTGGGAT ACGGAATAAG ATTGGCCCAG CTTCCATAAC TGGGATACCT 3 00 

GGTTCAAAAC CAAGCTCTGT TGCAGCGATT GGTGTAAAGA TATCGTAACC TTTCATAAGG 3 60 

TCTTCGTTTA CATCTTTCAC CATAACTGCA TCACAGTGAA CATCGTAACC ACGGTTTGAA 420 

AGTTCTTCTT CTAGAGCACT TTTAATTTGG TGACTTGAGT TAACACCTGC ACCGCAGGCA 4 80 
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CCAAGAATTT 


TAATCATTTG 


GATTTCCTCC 


GATTTTATTT 


TTTAATACAC 


AAGATTAAGC 


540 


GCTTGCTTCA 


GCAATGTAAG 


CATAAAGGGC 


TTCTGCTTCA 


GAAATTTTTG 


ATAGGTCTTC 


600 


AAGATGACCA 


TTTCCTGTGA 


AGAAGTCCAT 


TAACTGAGCA 


AGAATGTTCG 


TTTGACTTGA 


660 


ACTTGAATTA 


TTGATGATAA 


ACAACAGCAA 


GGATACTTCT 


ACTTCCTTAC 


CTGGCGCAAT 


720 


CATATTATGG 


AAAGTCACCC 


GTTTCTCTAA 


TCGAACAACC 


ACCACTTTCT 


CAGCTAGATT 


780 


ATGAACAATA 


TCTGTGTGAG 


GAATCATTAC 


ATTTGCAAGT 


CCTTTCCTAC 


AAATTCCATA 


340 


TATAAACCAG 


TTGGAAATGA 


CTTTTCACGC 


GTGATCAAGG 


CTTCACGATA 


AGTTGGAGTG 


900 


ACAATTTCTC 


GTTCTTCCAA 


CAAGCTTGCT 


ACCTGATCAA 


AAAGTTATTC 


T7CATTATCC 


960 


GCTTCTAAGC 


AAAACACAAG 


GTTTTTGTCA 


AAGAAATAAT 


CTAATACCAT 


AAGGTTTTCC 


1020 


CTTCTTTCCA 


TTAACTTTAT 


GCTATAAGTA 


TAACACTATA 


TGAAATCGTT 


GTTAATTACT 


1080 


TTCTATTCTT 


TTTTGTCTCT 


TTTTTTATAT 


TTTTGTTTTG 


TTTATAGTTT 


CTTATATAAA 


1140 


AATAAACACA 


CAAACAAATA 


CTCCAACCAT 


TTTTCTGTTC 


TAATACTCAA 


TGAAAATCAA 


1200 


AGAGCAAACT 


AGGAAGCTAG 


CCGCAGTTGT 


TCAAAACACA 


GTTTTGAGGT 


TGTAGATGAA 


1260 


ACTGACGAAG 


TCACTCAAAA 


CATGGTTTTG 


AGGTTGTAGA 


TGAAACTGAC GAAGCAACAg 


1320 


CCATACATAC 


GGTAAGGCGA 


CGCTGACGTG 


GTTTGAAGAG 


ATTTTCGAAG 


AGTATAAAAA 


1380 


CTAAAAAAGC 


AGACCATCTA 


AGCCTGCTTT 


ACTATTGATT 


CTTATATAAA 


TTTCCTGTGA 


1440 


ACAAGGAAAG 


GCATTTCTCA 


TAACTTATTC 


TTCATCCATA 


CTCAAGACGC 


TGAGGAAGGC 


1500 


TTCTTGCGGA 


ACTTCAACTC 


ATCCGATGGA 


TTTCATGCGT 


TTCTTACCAG 


CTTTTTGTTT 


1560 


TTCAAGGAGT 


TTACGCTTAC 


GAGAAACGTC 


ACCACCATAA 


CATTTAGCAA 


GTACGTTCTT - 


1620 


ACGAAGGGCC 


TTGATATCAG 


TACGAGCGAC 


AATCTTGTGT 


CCAATAGCCG 


CTTGGATTGG 


1680 


AACTTCAAAT 


TCTTCGCCAG 


GGATGATTTT 


CTTGACTTTA 


TCAACGATGA 


GTTTCCCACC 


1740 


TTCGTAGGCA 


AAGTCCTTGT 


GAACGATAAA 


GCTGAGGGCA 


TCCACCTTAT 


CTCCATTGAG 


1300 


AAGAATATCC 


ATTTTCACCA 


GCTTAGATGG 


GCGATATTCT 


GACAATTCGT 


AGTCAAAGCT 


1360 


TGCATAACCA 


CGTGTCGAAG 


ACTTAAGTTT 


ATCAAAGAAG 


TCAAAGACAA 


TTTCAGCAAG 


1920 


ACGAATTTCA 


TAGATAACAT 


TGACACGGTT 


ATCATCAATA 


TAGTCCA7AC 


TCACAAAGTC 


1980 


CCCACGCTTA 


CGCTGAGCTA 


GCTCCATTAC 


TGCTCCCACG 


AACTCCTGTG 


GTACCATGAT 


2040 


TTCCGCCTTG 


ACATAAGGCT 


CT7CAATGCT 


CGCAATCTTA 


GTTGGGTCTG 


GAAACTCAGA 


2100 


TGGGTTAGAC 


ACATCCATAG 


ACTCACCGTC 


GGTCAAATTA 


ACTTTGTAAA 


TAACAGACGG 


2160 


AGCTGTCATG 


ATGAGGTCAA 


TATTGAACTC 


ACGCTCTAAA 


CCTTCCTGGA 


TAACATCCAT 


2220 



WO 98/18931 PCT/US97/19588 



624 

ATGGAGAAGT CCAAGAAATC CACAACGGAA ACCAAATCCA AGTGCCTGAG ATCTTTCTCG 2 280 

TTCAAACTCA AGACTAGCAT CATTCACTTC CAATTTTTCA AGcGCTTCAC GCAGGTCATT 2 340 

GTACTTG7TT GATTCGATTG GGTAGAGACC CGCAAAGACC ATAGGATTCA TCTGCTTATA 2400 

ACCATGTAAT GGTTCTGCCG CAGGATTGGT TCCCAACCTA ACGGTATCAC CCACACGAGT 2460 

ATCCTGAACC GTCTTGATAG ACCCCGCAAT GTAACCAACA TCACCAGTCG CAAGGAAATC 2520 

ACGACCAACC GCTTTTGGTG TAAAAATACC GACTTCGGCC ACATCAAAGG TCTTACTATT 2 580 

GCTCATGAGC TGAATCTTAT CACCAGGTTT GACCACTCCG TCCATGACAC GCACTTGGAG 2 640 

GATAACCCCA CGGTAAGCAT CGTAAACAGA GTCGAAAATC AAGGCCTTAA GTGGCGCCGT 2700 

CACATCACCC GTTGGTGCTG GTACTTTTTC TACAATTTGC TCGAGGATTT CTTCAATCCC 2750 

AATACCAGCC TTCGCAGAAG CCAAAACTGC TTCACTCGCA TCCAAACCAA TCACATCTTC 2820 

AATCTCTGTA CCCACCCCCT CCGGATCTGC AGCCGGCACG TCAATTTTAT TAATGATACG 2 880 

CATGATTTCC AAATCATTAT CCAAAGCCAG ATAAACGTTG CCAACAGTTT GAGCCTCAAT 2940 

TCCTTGACCC GCATCCACCA CCAAAATAGC ACCCTCACAG GCAGCTAGCG AACGTGAAAC 3000 

TTCATAGGTA AAGTCAACGT GCCCTGGTGT GTCAATCAAG TGGAAAATAT AACTTTCCCC 3060 

ATCTTTTGCA GTGTAATTCA ACTCGATGGC ATTCAACTTA ATAGTAATTC CACGTTCCCG 3120 

CTCTAGCTCC ATCCTATCCA AAAGCTGGGC CTGCATTTCA CGACTTGAAA CCGTCTCTGT 3180 

TTTTTCCAAA ATGCGGTCTG CTAGAGTTGA TTTTCCGTGG TCAATATGGG CGATAATAGA 32 40 

GAAGTTACGG ATCTTCTCCT GTCGTTTTTT CAATTCTTCT AAGTTCATGA TTCTCTTCCT 3 300 

TTCAGGGTAT CTATTTATTA TAAATTGTTT TTCATATTTT GACAACACCA TACCCTGCTA 3 360 

GGAGTACTAA TCTTCAGCGA CAAAGCCGTC ATTTTCGATA AAGTGGTGTT CTGTCATTCC 3420 

TTGGTCTGTA AAGACAATCC CGTGAAGGAC ACCACCATAA ACAGCTCCTC CATCCATTCC 3480 

AATCTTGCCA TCTTCTGTAG TCCAAAGCTC AGATGTACCG CCTTCTTGCT GTAACAAACC 35 40 

ATAGACCGGT GTATGACCGA AGACAATGGT TTTTCCAGTA TGATTTTCAG CTCCGTGGAA 3 600 

TGGTTTTCTA AGCCATACTT TTTTATAATC TGTTGTTTCA TGCCAGTCGT CCAAGGTCAA 3650 

ATCAATACCT GCGTGAACAA AGATATACTT GTCTGTCTCT ACTACAAATG GCATTTGACG 37 20 

AATGAATTCG ACCAAGTCTG CCGCTTCAGC GgCAACCCCC TTCGCATCTT CTACTCCATC 3 7 80 

AACTGGTGCA TCCAAGGGAC GACCTAGGAT AGACTTAATG GTTGTATCTC CACCATTGCG 3 840 

ACTATAATGG TCATAACTTT CTTCTGGGTC ATCTAGCCAA GTCAAAAACA TATACTCCTC 3 900 

GTTTCCCGAC AAACAGATAG CCCCTTGATT GTCCACCAAG TCCTTGACCA TTTCAAGAAC 3 9 60 

ACGGTGACTA TCCTCACCTC TGTCAATCAA ATCACCTAGA AAGAGCAACT GGGGCTGACC 4020 
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ATCCCAGGTT TTGAGAAGCT 
ATAATAATCT GTCATCTTAT 
CTGCTTCTGT CACATCATCA 
CCGTCAAGAG ACGAACAGTC 
ATTGATAATC TGCAATCCCA 
GCTGACCAAT TTTATGAATT 
TATCCACCTC ATCAAAGACA 
TGGCTAACAT GAGACGAGAT 
CTTCTCCAGG GTTGGTTGAA 
TTCCCTTACT AAAACGAACC 
GTTTAATCTC AGCTTCGAGT 
CCAAATTGAC AAGATTGACT 
GATTATTGCC TCTCAAGACA 
CATC AAC ACT CCCACCATAC 
CCTCCATCAG GCGATTGCCA 
TAATGTCTTC TAAAACATAG 
GATCATACTC TTCGACACTT 
AAAAATCTTC ATTGTCCAAC 
GGTTGAGGAG TTTATCTCGC 
CTGCCTCAAT CTCTGCCATT 
GGTTTTTCTT GACTTCCAGA 
GATAGCTTTC TTTCAAGTCC 
GCAGTTGGGG ACGCATTAAC 
GCCCAATAGC TCGCAAAACA 
GACCATTTTG CAAGATTTCC 
CATCAAAAAT TTCCTGTAAA 
CCTTTGGTGC ACCATGACGA 
TGGCATCAAT GATAATCGAC 
TTTCAAAATT GAGGGAAATA 



CTTCCAGCAT 
TTCTCCCTGT 
CCTGCCAACA 
GAAACCGTTG 
ATTACTTGTG 
TTCTGAGCAA™ 
ATGCTAGTCT 
AATTCCCCTC 
ATATAAAACT 
TGAAACTGGG 
TGCTGAGCCA 
TCCAACTTCT 
TTGTATTCTT 
TTACGAGTAA 
TCAAAATCAA 
TAGGTCTCAG 
TCCATGTCAT 
ATACTGTAGG 
TCTTGATTGA 
TGAAATTCCA 
ACCTGCTTGC 
CAAAAAGCGG 
TCCTCATCCT 
GACAGATTAA 
CGACGGATGA 
AGACGACTAT 
ATAACATCTG 
TTCCCTGCAC 
GCCTCAATAA 
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CCCAGCTTTT 
TTCTCAACAA 
TCTTGGCAAC 
AATGGTCATT 
GCAAATGGGA 
T AGCTTG AG C 
TGCCTTCTTT 
CAGAAGCAAC 
CAACCATTTT 
CTTTTTCCAT 
AATTATGACG 
TAAGCTCTGC 
CCGTAATCTT 
TAGTATGAAG 
GGTCCTCAAT 
ACAGATAGCT 
TCATACCTGA 
CATTGGTCAG 
GAGCCAAGTC 
ACATTTCGAT 
GCATTTTCCG 
CATCACCAAA 
CATGCTGACC 
CCATCTGACC 
TAATTTCATC 
TCTCAACTGA 
TCGTCGCACG 
CCGT7TCACC 
TGGCAAAGTT 



CCGTGAACAT CTCCAATTAC 4080 

TTCTCTTGCT TGCGTCAGGG 4 HO 

TTCCTCCACT CGCTCTTCGA 4 200 

ACTAATCTTC TCAATAAAGA 4 2 60 

GATAGCCAAA ACCTGACCAT 4 320 

AACACGACCT GAAACTCCCG 4 380 

ACGTGAAAAG GCAGACTTAA 4 4 40 

CTTAACCAAG GGTTTAAAGT 4 500 

ATTTCCCTCA CGACTGAATT 4 560 

ATAAAGATCT TGCAGTTCTT 4 620 

AGCAGAAGCA AGTTGACCTG 4 680 

TTCCATGTCC TCAGACGAAA 4740 

GGCAAAATAA AGCAAAACAT 4800 

GACGTCCAAA CGATTCTCAA 4 860 

GATAGCTTCC AAACCTTTGC 4920 

TGAAATTTCA CGGTATTCAG 4980 

ACGAACATTG GCCAGACTTC 5040 

TGTATCCGCA ATATTTTTGT 5100 

TTCTCCAGCC TGCAAGTTTG 5150 

ACGTGCCTTG TGTTCCTGTT 5220 

ATAGGCATCA AAACTCCTTT 5280 

TTCATCCAAC ATCTGGATAT 5340 

ATGAATATCT ACAAGATGTT 54 00 

ATTTACACGG CTGATACTAC 54 60 

ACCTAATTCT AAACCTTGCT 5520 

GAAAAGCCCC TCAATCTCTG 5580 

AGCTCCCAAC ATCATATTCA 564 0 

AGTCAGGACA CTCATCCCCT 57 00 
TTTTATCGAA ATTTCAAGTA 57 60 
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ACATATAGAC CTACCAATTT TTTACTTGTT CAAAGATTTC 
TGGCAATGAC TAAAATCGAG CTATCATCAG TCAAACACCT 
TCTCGATTAA CTGAGCTTTT ACAAAAGCCG TATTTCCTGC 
TCTTATCCAT CAATTCAGCC GATTCGATAT TGTCTTCAGC 
TTGATTTTGG CAATTCGTAG ACATAGGTGT TGTCTCTCAA 
ACTCTTTGAT ATCTCGGGAT ACCGTCGCCT GAGTGGCAGT 
GTTCTACAAT TTCT7CTTGC GTGCCGATTT GATAATCTGT 
CAAGTCTCTC TTTTTTATTC ATTTTTAAAT TGACTATGCG 
ATCTCACCAA GAATCTGATT GCTTGCTGAC TTTTCTTTTT 
ATATTTCCAT GTCCACCTTG GATGGGAGAA AAGTCCAAGC 
TCTACTGCCA TAGCTGTTAC AGATTCAAGG ACATTCTGAT 
ATTCCATTTT TCCCAATCTG CTCACGTCCT CCCTCAAACT 
ACCTGACCTT GATCAGCCAA GACACGGTGC AAGGCTCCCA 
AAACTCACAT CAATACTGGC AAAG CTCGGC TCCTGCTCCA 
CGGAAATTGA ACTGCTCCAT GCTGACAACT CGTGGGTCTT 
TGATTGGTAC CAACATCGAC TGCAAACACC AACTTGGCAC 
GTAAAACCTC CAGTAGAGGC CCCGATATCA ATCGTAGTCG 
AAGACCTGCA AGGCCTTTTC CAGTTTCAAA CCACCACGGC 
CCCTTGACTT TTAATTCGGT CTCATCTGGA ATTTTCTCTC 
CCATTAAGGA CTGCTACGAC TAGGCCAGCC ATCACACCTC 
TCAAACAACC CCTGTTTATA AGCTAGTACA TCCACTCTTT 
ACTTTCTACT ACACTTACAA TCGATTC7GT TTCAAAGGGA 
TAATTTTTCA TTAGCTTGAT CCAGGGTTTG GTTACAAAAG 
CAACAGGGCA GGATAGGTTG ATTTTTCTGC CTGCACATCC 
TTCCTCAAAA CTAGCTGTCA CATCCAGTAC ATCATCTCTG 
CAATTCACCC ACAGTTTTCA GCTTCACCTG CATTTCAGGT 
TGCCGCTTGG AAGGGATAGG CTAGTAACTT CCCAGTCTTA 
TTCTTCCAAA GACAAGTGCT CGTGTTCGCC CTCCATATCC 
ACCCAGACTA CCTGAAGCAA GGGATAAGTT GGCAATCAAG 
CAAATCTGCC TGCGCAATCA AGGCATATGA GTCTAAGAAT 



CTCTCCTAGA CTTCCACTTC 5820 

AAAAATCTTG TCTGCAAAAC 5880 

AATAACTTCG AGATTGATCA 5940 

CAGTTGCAGA CTTTTTACGA 6000 

AGGAATTTTG ACAATACCTA 6060 

CATACCTGCT TCTTTCAAAT 6120 

CACCAA7CTT CTAAT T TTT T 6180 

CCCTCTCTAC TGCTTCTTTA 6240 

TCAAATACGC TAAAAATTCA 6 300 

CAAGGACTGA AAAACCTACC 6 360 

GAACCTTAGC ATCTCGAATA 6420 

GAGCTTTGAC AAGTGCTACC 6480 

AAATCAGACT AAGGGAAATG 6540 

AATCAGTCTT TTCACCATAG 6600 

GGCGTAATTT CCAAGCCAAC 6660 

TATTCTGTAC CATGACATCG 6720 

CGCCATCCAC CGACAAATCA 6780 

TGACATACTT GAGTTTCTCC 6840 

CTGGCTTGTC AAACCGTTCT 6900 

GCTTGGCCTG CTCTCTCGTT 6960 

CCTTAGCCAT TGATTCTCAA 7 020 

AGCTGCTGGG CAATTTCTTC 7 080 

GCAATGGACT CTTCCAAGCC 7140 

TTTTGACGTG TCTTGCCGAT 7 200 

ACTTGAAAAG CAAGTCCAAT 72 60 

GACAATTCAG CTATAATAGC 7 3 20 

TTGGCATGAA TAGTCTCAAG 7 390 

AAAACTTCCC CTGCTACCAT 7440 

TCCACCTTAA TCTGACTTGG 7 500 

AAGGCATCTC CAGCCAAAAT 7560 
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GGCCATAGCT TCACCCAATT TCTTG7GATT GGTTAACCGC CCTCTTCGAT AATCGTCATC 7620 

ATCCATAGCA GGAAGGTCAT CGTGAATCAA GCTCCCTGTA TGAATCATCT CTAAGGCAGT 7680 

AGCTACCTGC GCGTGAGCAG GTTTGATGGT AACCTGCAAG CCTTCCAGAA CTTCTAACAA 77 40 

GAGAAAAGGC CGAATACGCT TGCCACCAGC ATCAATAGAA TAGAGAACAG ACTCCCGTAA 7800 

ACTAGAGGCA AACTGCTGGT CTCCATAAAA ATCTTCCAAA GCCGACTCGA CAAGAGCTAA 7860 

TTTTTCTTGC TTTTTCATTC AAAA7CACTT TCTGTTCCGT CTTCTTGCAT GACCTTGACC 7920 

AAGGTCTTTT CAGCCTTGTC CAGCGTAGCT TGGAGCTCTT TTGACAAGAC CATGCCCTTT 7980 

TCAAAGGCAG TAATCGCATC TTCCAGAGCA ATTTCACCAT TTTCCAAACT TTGGACAATG 804 0 

GTTTCCAGTT CTGCTAGATT TTCCTCAAAT TTCTTTTGTT TTGACATCTT TAACCTCTAA 8100 

TTCTACTTGA CCATCTCGCA TCAAAAGCGT TACTTGGTCT TTTTTCTTCA AACTCTCAAC 8160 

CGAATCTACA ACGCACTCTT CTTTTTTGAC AATAGCATAA CCACCCGCCA CGATTCGGCT 8220 

AGTATCCAAC ATGAGCAAAG CTTCCGAAAG TCGCTTGGCC TCAGCAACCT TGGCGTCATA 8280 

AACTAACGCC ATTTCCCTAC CTAAGACCTT GTCCAACTGT CCTAAACGGT CTTGATAGCG 8 340 

TTGGATTTTG GTAACAGGTG ATAATTGTAC TAATTGATGA CTTCTTGCTT GAACTAATTC 8400 

TTTGTTATCA GAAATCCGAG TTCGCAAACT TTCTTTCAAA CGCAGTTGCA GTTGGTCCAA 8460 

GCGTTGCAAA TAACCGTCAT ACAAGCGCTC AGGT7CTCTA AAGATAACAG ACTGACTGCA 8520 

TTTTTTCAAA GCCTCTTGTT TCTTAGATAG AACATTTCGG ACTGCCGTTA CCATCCGTTT 8580 

TTCCTGATTT TGCAAATGAC CTAATACATC CAACTTGGTC ACAGGTGTTG CCAGTTCACC 8640 

CCCCGCTGTT GCCGTTCCAG CGCGTCGATC TGCCACAAAA TCTCCCAAGG TCACATCCGT 8700 

CTCATGCCCC ACACTAGAGA TAACTGGCAA ACGACATTCA AAAATAGCTC GTACCACAAT 8760 

TTCTTCGT7A AAGGCCCAGA GATCCTCAAT AGAACCACCT CCACGACCAA TAATGAGCAA 8820 

ATCCAAATCG TCCCGTTGAT TAGCACGCGC AATATTTCTA GCAATTTCCT CCGCAGCCCC 8880 

TTCACCTTGA ACCTTGGTCG GATAAAGAAG GATGTCAACA CCTGGGAATC CCCTGCTGAC 8940 

GGTCGTGATA ATATCTCGAA TAACGGCTCC ACTACGGCTG GTTACTACAC CAATTCTCTT 9000 

AGAAAATTGG GGCAGAGCTT GCTTGAAGCG TTCTTCAAAC AGGCCTTCTT CTGTCAATTT 9060 

TTTCTTAAGT TGTTCAAACT GAATCGCAAG CGCCCCAACC CCATCAGGCT CAG C TTTTTC 9120 

AATGATGATG GAGTAGCTAC CACTTGGTTC ATAGACCTGT ACACGCCCAA TCACATTCA? 9180 

CTTCATTCCT TCTTCCACCT CAAACCCTAA TTTCTGATAA ATCCCAGACC AGATGGTCGC 9240 

7TGAATAACT GCATGGTCAT CCTTTAGGGA GAAATATTGG TGAGTAGGTC GTTTACGAAA 93 00 
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GTTGGAAACT TCACCACTTA AATAGACCCG TTCCAAGTAT CGGTCTTTAT CGAATTTCAT 

TTTCAGATAC TTGGTCAAAG TTCTTACCGA TAAATACTTT TCCATCTCCA CCTACTATTC 

ATTTACTTGC TCTTTCATGG GT ATT ATT AT ACCAAAAATA TGCCTAAAAA TCTCCATTTA 

TGTACCATTA TGAGGGAAAA ATAGAAAAAG GAGGCAAGGC CTCCACATGT GATTATTTGC 

7GTTTCCAGC TTCTTCCAAA ATCTTTGCAA TCTTGGTCGT CAACAGGTCG ATAGCCACGG 

TATTGCTAAC CCCTTCAGGA ATGACGATAT CAGCATAACG CTTAGTTGAC TCGATAAACT 

GGTGGTACAT TGGTTTGACC ACACCTAAGT ACTGCTTAAT AACGCTATCA AGGCTACGGC 

CACGCTCCTC CATATCACGC TTGATACGAC GAATAATGCG CACATCGTCA TCCGTATCCA 

CAAAAATCTT GATATCCATC AAATCGCGCA GACGCTTGTC CTCCAAGACC AAAATACCCT 

CAACCATAAA GACATCTTGA GGTTCCTGAC GATAGGTCTT GCTACTCCGT GTATGCTCTC 

TATAGTCGTA GGTCGGGATG TCCACCGGAC GCCCTGCCAA CAATTCCTTA ATCTGCTCGA 

TCATCAAGTC TGTATCAAAG GCAAAAGGAT GGTCATAGTT GGTTTTGACG G 

(2) INFORMATION FOR SEQ ID NO: 77: 

{ i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5365 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
CGTGTGGTCT TAAAAATAGA AGACAAAGAA CAAACTGTTG GAGGCTTTGT CCTTCCAGGC 
TCAGCCCAAG AAAAAACCAA AACAGCTCAA GTTGTGGCTA CTGGACAAGG TGTTCGTACC 
TTGAACGGTG ACTTGGTTGC TCCAACTGTT AAAACTGGAG ATCGTGTCTT AGTTGAAGCC 
CACGCAGGTC TTGATGTCAA AGATGGCGAT GAAAAGTACA TCATCGTAGG CGAcTAACAT 
TTTGGCAATC ATTGAGGAAT AGAAGGAGAA AGTAACTATG TCAAAAGAAA TTAAATTTTC 
ATCAGATGCC CGTTCAGCCA TGGTTCGTGG TGTCGATATC CTTGCAGACA CTGTTAAAGT 
AACCTTGGGA CCAAAAGGTC GCAATGTCGT TCTTGAAAAG TCATTCCGTT CACCCTTGAT 
TACCAATGAC GGTGTGACCA TTGCCAAAGA AATCGAATTG GAAGACCATT TTGAAAATAT 
GGGTGCTAAG TTAGTATCAG AAGTACCTTC TAAAACCAAT GATATCGCAG GTGACGGAAC 
TACGACTGCA ACAGTCTTGA CCCAAGCTAT CGTCCGTGAA GGAATCAAAA ACGTCACAGC 
AGGTGCAAAT CCAATCGGTA TTCGTCGTGG GATTGAAACA GCAGTTGCCG CAGCAGTTGA 
AGCTTTGAAA AACAACGCCA TCCCTGTTGC CAATAAAGAA GC7ATCGCTC AACTTGCACC 
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CGTATCTTCT CGTTCTGAAA AAGTTGGTGA GTACATCTCT CAAGCAATGG AAAAACTTGG 

CAAAGACGGT GTCATCACCA TCGAAGAGTC ACGTGGTATG GAAACAGAGC TTGAACTCGT 840 

AGAAGGAATG CAGTTTGACC CTGGTTACCT TTCACAGTAC ATGGTGACAG ATAGCGAAAA 900 

AATGGTGGCT GACCTTGAAA ATCCGTACAT TTTGATTACA GACAAGAAAA TTTCCAATAT 960 

CCAAGAAATC TTGCCACTTT TGGAAAGCAT TCTCCAAAGC AATCGTCCAC TCTTGATTAT L020 

TCCGGATCAT GTGGATGGCG AGGCTCTTCC AACTCTTGTT TTGAACAAGA TTCGTGGAAC 1080 

CTTCAACGTA CTAGCAGTCA AGGCACCTGG TTTTGGTGAC CGTCGCAAAG CCATCCTTCA 1140 

AGATATCGCC ATCTTAACAG GCGGAACAGT TATCACAGAA GACCTTGGTC TTGAGTTGAA 1200 

AGATGCGACA ATTGAACCTC TTGGTCAAGC AGCGAGAGTG ACCGTGGACA AAGATAGCAC 12 60 

GGTTATTGTA GAAGGTGCAG GAAATCCTGA AGCGATTTCT CACCGTGTTG CGGTTATCAA 13 20 

GTCTCAAATC GAAACTACAA CTTCTGAATT TGACCGTGAA AAATTGC AAG AACGCTTGGC 1380 

CAAATTGTCA CGTGGTGTAG CGGTTATTAA GGTTCGACCC GCAACTGAAA CTGAGTTGAA 1440 

AGAAATGAAA CTCCGCATTG AAGATGCCCT CAACCCTACT CCTGCAGCTC TTGAAGAAGG 1500 

TATTGTTGCA GGTGGTGGAA CAGCTCTTGC CAATGTGATT CCAGCTGTTG CTACCTTGGA 1560 

ATTGACAGGA GATGAAGCAA CAGGACGTAA TATTGTTCTC CGTGCTTTGG AAGAACCCGT 1620 

TCGTCAAATT GCTCACAATG CAGGATTTGA AGGATCTATC GTTATCGATC GTTTGAAAAA 1680 

TCCTCAGCTT GGTATAGGAT TTAACGCAGC AACTGGCGAG TGCGTTAACA TGATTGATCA 1740 

AGGTATCATT GATCCAGTTA AAG7GAGTCG TTCAGCCCTA C AAAATGC AG CATCTGTAGC 1800 

CAGCTTGATT TTGACAACAG AAGCAGTCGT AGCCAATAAA CCAGAACCAG TAGCCCCAGC I860 

TCCAGCAATG GATCCAAGCA TGATGGGCGG GATGATGTAA GCTTTCTATA GAAAACAACT 1920 

7ATAAAAAAC ACAAAAGGAG GGAATGACTA ACCCTTCTTT TTATAGGCTC TTTGTCAACT 1980 
CTACTCCCrr GAAGTCAGCT AAGCTCGAGA AAGGACAAAT ttcgtccttt cttttttgat 
GTTCAAAGCG ATAAAAATCC GTTTTTTGAA GTTTTCAAAG TTTCGAAAAC CAAAGGCATT 

GCGCTTGATA AGTTTGATGA GATTATTGGT CGCTTCCGGT TTGGCGTTAG AATAGTGTAG 2160 

TTGAAGGGCG TTGATAATCT TTTCTTTATC TTTGAGGAAG GTTTTAAAGA CAGTCTGAAA 2 220 

AATAGGATGA ACTTGCTTAA GATTGTCCTC AATAAGTCCG AAAAA7TTCT CCGGTTCCT? 2280 

ATTCTGAAAG TGAAACAGCA AGAGTTGATA GAGCTGATAG TGATGTTTCA AGTCTTGTGA 2340 

ATAGCTCAAA AGCTTGTCTA AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGACG 2 400 

ATAAAATCGC TTATCACTCA GTTTACGGCT ATCCTGTTGT ATGACCTTCC AGTAGCGCTT 2 4 60 



2040 
2100 
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GATAGCCTTG TATTCATGGG ATTTTCGATC CAATTGGTTC ATAATTTGAA CACGCACACG 2 520 

ACTCATAGCA CGGCTAAGAT GTTGTACAAT GTCAAAGCGA TCCAACACGA TT7TAGCATT 2580 

CGGGAGTGAA ACAGTCTGGG AGACTCTTTC AGCCTGAGCC TAGAAATTTG AAAGCGAAGC 2640 

TCTTTACCCA AGTCATAGTA AGGACTAAAC ATATCCATCG TAATGATTTT CACTTGACAA 2700 

CGAACGGCTC TATCGTAGCG AAGAAAGTGA TTTCGGATGA CAGCTTGTGT TCTGCCTTCA 2 760 

AGAACAGTGA TAATATTAAG ATTATCAAAA TCTTGCGCAA TGAAACTCAT CTTTCCCTTA 2820 

GTGAAGGCAT ACTCATCCCA AGACATAATC TTTGGAAGCC GAGAAAAATC ATGCTCAAAG 2 880 

TGAAACTCAT TGAGCTTGCG AATGACAGTT GAAGTTGAAA TGGCCAGCTG ATGGGCAATA 294 0 

TCAGTCATAG AAATTTTTTC AATTAACTTT TGAGCAATCT TTTGGTTCAT GATACGAGGG 3000 

ATTTCCTGA? TTTTCTTTAC CAGGGGAGTC TCAGCAACCA TCATTTTTGA ACAGTGATAG 3060 

CACTTGAAAC GACGCTTTCT AAGGAGAATT CTAGAAGGCA TACCAGTCGT TTCAAGATAA 3120 

GGAATTTTAG AAGGTTTTTG AAAGTCATAT TTCTTCAATT GGTTTCCGCA CTCAGGGCAA 3180 

GATGGGGCGT CGTAGTCCAG TTTGCCCATC ATTTCCTTGT GTGTATCCTT ATTGATGATG 3 240 

TCTAAAATCT GGATATTAGG GTCTTTAATA TCGAGCAGTT TTCTGATAAA ATGTAATTGT 3300 

TCCATATGAA TCTTTCTAAT CAC7TCTTTT CTCGCTTTTC ATTATAGGTC ATATGGGACT 33 60 

TTTTTTCTAC AACAAAATAG GCTCCATAAT ATCTATAAGG GATTTACCCA CTACAAATAT 3 4 20 

TATAGAGCCG AAAATTCACA TCTAATATAT GCAGACTACT TTGAAATGAA ATTAAAAAAA 34 80 

TTATTAAAGG ATGACACAAA AGTTTTTGAA AAATCTACAT TCAAATTTGT AGAAGGATAT 3 540 

AAAATATACC TGACAGAATC TAAAGAATCT GGAATTAAAC AAATGGACAA TCTCATAAAA 3 600 

TATTTTGAGT TTATTGAATC TAAAAGTATT GCTTTATATT TTCAAAAACC ATTAAATGAG 3 660 

CTGATAGATT AAATACCATT TTCTCTGTTG AGATATTGTT TTTAAAATAT TGTACTAAAT 37 20 

GATTCATGCT ATGTGGAAAT ACAAAAAAAT GTTTTTGATA CGAACTTCAC CTGTATTTTT 37 80 

TATACTAATC ATTTTCGTAT TTTTTGTATT AAACGATATA AGTTTGTTGT AAACTTACAA 384 0 

GGAATAAAGA CATTAAAAAA TAACAGTATA TCTATTTGTT TTATATATTT TACGAATTCT .'900 

GCATAAATCT CTTTCTAGTA ATGTGTTGTA ACTCTGC7A? AATAGATTTA TTCCTTTTTG 3960 

TGTTTACACA ATTTATTTTA TACTACCAAA AAAGGTCAGG ATTTTGTTCC TGACCTTTGA 4020 

CAACTTTACC GATTCTTTAG TTCTACATAG CCCTTGTACC AAATCTTTAC ATAGGCTTCT 4 080 

GAGAAAGCAC CACGTCCATT GTTAATCCAA TCAACAAGAA 7TTTGACATG TTCTTTTAAA 4 140 

ATATAGTCCA AGTCATCAGA ATAATTCATT TTGCGTTTGT GACGCTCGTA CTCTTCAACG 4 200 

TCCAAGAGAC GTTTTTCCCC ATCTGTAAAA ATTTTAACAT CCAAATCGTA ATCAATATAC 4 2 60 
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4 



TTCAGTGCTT 


CTTCATCCAG 


ATAGTAGGGG 


CTAGCCATAT 


TGCAATAGTA 


AGAAGTTCCA 


4320 


TTATCACGAA 


TCATGGCAAT 


GATATTAAAC 


CAATATTTCT 


TGTGAAAGTA 


AACAATAGCC 


4380 


GGTTCTCGAG 


TGACCCAACG 


ACGACCATCA 


CTTTCGGTAA 


CAAGTGTATG 


ATCGTTGACA 


4440 


CCAATAATGG 


CCTTTTCTCT 

WW 4 4 4 4 W 4 w 4 


TGTTTTTAGT 

1 4 4 4 4 4 rv\J 4 


ACCATGGTGT 


CCCGCCAAGT 


TCGGTGGAGA 


4500 


CTC CC ATC AT* 


CfTTATAACT 


"Tr.AATTCTA 


ATAAAGTCGC 


CTTCTTTTGG 


AAGCTTCATA 


4560 


XCTXACCAAC 




4 4 ** 4 «^»V *J a 4 4 


ATC ATT TACT 


ATTGTACCAT 


AAAATTACCC 


4620 


AAAATGTGTG 


n#\ i i 4 w n w i 4 


CC AAATATTA 


AAGATATTCT 


CTAAGAGCGC 


TTGCTATATC 


4680 


CGAAAAATGG 




GTCCTAAAAC 


TTGAGTTAAA 


CGCTGCTTCA 


GTTCGTATCC 


4740 




GGGGPATACT 
v«^j\jviv>n i nw 4 


TAC'ATATTG 


CTTATCAAGT 


TCCTTGAAGA 


TGAGTTCCTG 


4800 


AG T CGTT^GT 


TCATCAACT*** 


GACTATCCAA 


TTCGTCAAAG 


GCAATTTTAC 


CATCAAAATA 


4860 


AGAGAAGCCC 


TTGTTAGTCA 


AGTTCTGGAT 


AATCTTATCT 


TGCAGGGCAC 


GAGCTGGAAG 


4920 


TTTTCCCTCA 


TATTTTTTCA 


ATAGTTTATT 


GGCTACACGT 


TGAGCAACTT 


CCGAAAAATC 


4980 


AAAATCATTC 


AAGATTTCTT 


CTATAGTAGA 


TTTTGAAATT 


CCTTTTTGTG 


CTAATTTCTG 


5040 


AGTCAGTACA 


TAAGGTCCCT 


TGTCTCCTGA 


AAGTTGATTG 


GCATTGATCA 


TAGCATAAGC 


5100 


GTACTGGCTA 


TCATTAATCC 


ACTTCTCTTC 


TTT \AG ATTA 


GCAATGACTT 


GAGAAACGAT 


5160 


GTTTTCATTA 


ATATCATATT 


TTTTCAGATA 


TTCTCTGACC 


TCTTTTTCAG 


TACGTGCTTT 


5220 


AAAGGATAAG 


TGGTAGAGGG 


CCAGATTCTT 


ACCATAAGAA 


AATTGAGCAA 


AGTCTTGAAT 


5280 


CTCTTTCAAT 


TCCTCTTCGC 


TTATCACCTT 


ATCTCTCGAT 


AACATAAAAC 


GAACAATTGT 


5340 


GTCTTCCGTG 


ATATAGCATT 


TGTCG 








5360 


(2) INFORMATION FOR SEQ ID NO: 78: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

TTTCCAGAAA GAAGTTGACT AAAGTCTTTA TCAAAGAGAA TCACTTCCGT ATTCGAACTC 60 

ACATTAGCTT TTATTTCTAC TTTACTAGCC TCCGCCCTAG CATT7TCTAA ATCTTTAATC 120 

TCTTCTGTTG CCCTATTTAT AGCCAGCTGA ATAACTCCTT CAGCATTTTC ACTCAGTCCA 180 

TGAAGCTTAT CGTCCACCGA AGTATAAAGA CTCGAATGCA TGACTTGTAA AATAATCAGA 240 
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GTCATTGTAG 


AAAAAATCAG 


GGTGAAGACA 


CCGAAGTTGC 


GGATAAAATA 


ACTAAAGTCA 


300 


TCCGCATACC 


ATGTTTTTTT 


AAGTTTACTG 


AACATCTTTT 


AAAAGATACC 


CAACACTACG 


36C 


CAAAGTTTGC 


AAATTCTCTG 


CAAAAGTGGT 


TCCCTTTAAT 


TTCTTACGGA 


CTTTTGAAAC 


420 


ATAGACTTCG 


ACAACCGAAA TCCTTGTATC 


ACTATCAAAT 


CCCCATAGAC 


GGTCAAAAAT 


480 


CTGCGTCTTA 


GGCAAAATCA 


CATTTTGATT 


TTGAAGGAAA 


TAAACTAGTA 


AATCGAACTC 


540 


TTTCCCCAGC 


AATTCGACAG 


GAGTATCTTC 


AACTTTAACG 


GTATTGGTTG 


ATAAATTAAC 


600 


CACGATATTC 


CCATAAGTCA 


AGGTGTTTTC 


ATTAAACTTC 


CCTGAACGTT 


TGAGAAGGGC 


660 


CTGAATCCGC 


ATT7TAAGTT 


CTTCTAGGTA 


GAAACGTTTC 


GTCAGATAAT 


CATCCGCTCC 


720 


CAGTTCAAAT 


CCATCTCCCT 


TGTCATCCAA 


ACTTTCCTTG 


GCAGTCATAA 


TCAGAACTGG 


780 


TGTCGTAATT 


CCCTTTTCAC 


GCAATTCTTT 


TAAGACTTGC 


AAACCATTTT 


TTTCTGGCAA 


340 


CATCAAATCC 


AGCAAAATCA 


AGTCATAGAC 


ACCACTCTCA 


GCTTCGTAGA 


GACCTTCTTC 


900 


TCCATCAAAT 


ACCTGCATAA 


CATCCGCAAA 


ATCGTCTAAA 


AAGTCAAATA 


CTGAATTTGA 


960 


CACACCTAGC 


TCATCCTCAA 


CCAATAAGAT 


TTTTATCATG 


AGAAACTCCT 


CCTTATTAAA 


1020 


ACTATTATAC 


CAAATTTCCC 


TTAAAAAAAA 


CTCAACTCTC 


TGCATTTTAC 


ATGAGATAGC 


1080 


TGAGTTTTCT 


TTTTATTTTA 


GGCTTATTTA 


TGCATTTCCG 


TATTGAAGAA 


CAACTGCTTC 


1140 


GACTGCAGCT 


TTTTCACGGC 


TAATCAAGTC 


AACACGCGCT 


GCAATTTCCT 


TGATTCCCAT 


1200 


ACCGATGTTA 


CGGCTAAGAG 


CAAGGTCAGA 


AAGTTGCGGT 


TCAAAGAACT 


CCTTGTATTC 


1260 


CGCCAAGCGT 


TGCTGAGTCT 


TAAATACATG 


AGCAGGAAGG 


ATAACAAAGC 


TATCAAAGCT 


1320 


CATATCTCCT 


CCAAGGGCTG 


CCTTAATCCA 


AGCCCAGTTT 


TCACGCGCCC 


AAGACCAAGC 


1380 


TGTTTTCTGA 


GTTGCTTGAT 


GAGCTAGGAA 


TTGGTAATAC 


CAAGCAGACA 


AGTCCTGTGG 


1440 


TTTGACCACA 


AATTTGTCCT 


TCCAAGAAGT 


AATCAGGTTT 


TGGATATTAT 


CCGCATCTGT 


1500 


ACTGTATGCA 


AGAGCTCCTG 


CCAACTGGCG 


TTTAAAGACA 


GCATCTGTTG 


CGTGAGTATA 


1560 


AGTATCAAGA 


TAAAGTGCTA 


ACAAGTCTTT 


AGTCTCATGA 


TGTTTCATCT 


CATTAATCAG 


1620 


AACTTGTGAG 


CGAATAGCTG 


CTGGGAGTCC 


TGCAAGATTC 


TCCTTGTGTG 


TTGCGAAGAT 


1680 


TTGGCTAGCG 


ACTTGACTAG 


CTTCTGCATC 


ATTTGAGCCA 


ATCATCATCG 


AAACAGCCAG 


1740 


CTGACGAACC 


AATTCATCCT 


CATCTGATTC 


TCCGTCTTTA 


GCTTCAAAAC 


CAAGACGGTC 


1800 


ATAGTTATGA 


CGAGCCAATT 


TAGCAACCAG 


TCCTTTGAAG 


GCTGTTTCAG 


CATCCGTTCC 


1860 


TTCATCAATA 


AAGCGCTCAA 


GGGCTGAAAT 


CACT7GAGAA 


ACAGCTGAAA 


CCACCAGATA 


1920 


AGACTCTTCC 


TTAGCAAGTT 


TATCAAGAAC 


TGGAAGCAAG 


TCTGCATAAG 


AAATGTGCCC 


1980 


TGCCTCAGCC 


AACAAACGAC 


GTTCTTGAAC 


AATTTGCAGT 


TTGCTTCTCT 


TATCAAGTGT 


2040 
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CTCTAGCTCA GCAAGAACAG CTGCTAACAA GTCTCCTTGA TAGTCGGTAA TATAGTGGGC 2100 

AGTATTTTCA GTGTTGAGAC GAAGAGCTCC TTCATTTTCA GCAAGAACAG CTGCGTAGCC 2160 

AGGGATTTCG ATACTTTCAG TTTCGAGTGT ATCAGGCAAG CCTTTCCAGT TGCTATTGAG 22 20 

GGGCACCACC CAGAGACGGT TCTTGTCTTC GTTCTCACCG ATGAAGAATT GTTTTTGTGA 22 80 

AATCTTCAAG ACATCATTTT CAACTTTAAC AGTAAGAACT GGGTAACCAG GCTGTTCCAA 2 340 

CCAAGAATCC ATGAAGCCTG CGACATCACG TCCTGACGCT TGACCAAGGG CATCCCAAAG 2 4 CO 

GTCACTACCA ATGGTCTTGC TGTATTGGTG TTTTTCAAAG TAGGCGTGCA AACCTTTAGC 24 60 

AAAATCAGCA TCTCCTAGCC AACGGCGAAG CATGTGCATG AGACGGCTTC CTTTGGCATA 2 520 

GACGATAGCG CCGTCAAAGA GTGTATTGAT TTCATCTGGA TGTTTAACTT CCACGTGGAC 2580 

AGACTGAACG C CATC ACT AG CGTCACGTTC AAGAGCAAGA GGTACTCCAC CTGTTTGGAA 2640 

ATCTTCAAAG ATATTCCAGC TTGGTTCGAT GGTATCCACA CAGACGTATT CCATCATATT 2700 

ACCGAAACTT TCATTGAGCC AAAGGTCATC CCACCATTTC ATAGTCACCA GGTTCCCAAA 2760 

CCATTGGTGA GCCAATTCAT GGGCCACAAC AAGGGCAACT TGTTGACGGC TAGCAAATGT 2820 

AGAGTTCTCA TCGACAACCA AGTAAACTTC ACGGTAGGTC ACAAGACCCC AGTTTTCCAT 2 880 

AGCACCAGCT GAGAAGTCAG GAAGGGCGAT GTGGAGAGAT TGAGGAATTG GGTACTTAAC 2940 

TCCATAGTAA TCTTCGTAAA ACTCGATAGA GCGAACAGCG ATATCCAGTG AGAAATCAAG 3000 

ATTTGAAAGT GGATGTGCTT TGGTTGAGTA GACACCTACC AGGGTACCAT TTTTAGTTTT 3060 

AGCGGTCACC CCTTGCAAAT CACCAGCAAC AAAGGCCAAC AAGTAAGAAC ACATGCGAGG 3120 

TGTTGTCTCA AACTTCCAGA TACCTGTTTC CTTACCGTTT TCAACATCCA TTTCTGGCAT 3190 

GTTTGACAAG GCCAATTCAC CTTCTGCTTG GTCAAAGCCA AGAGAGACGT CAAAAGTTGC 3240 

TTTGGCTTCA GGCTCATCCA CACATGGGAA AGcTTCGCGC GCAAAATGGC TCTCGAACTG 3 300 

AGTAGACAAG ACCTCCTTCT TGACTCCATC AACTGTATAA TAAGAAGGCT AAATCCCTCT 3 3 60 

CATGTTGTCT GTAATTTTAC CAGAAAAGGC AAGAACCAAT TCAACTTCAC CACCCTCAGC 3420 

CAATTCGATA TGAAGGGCTT CATTGTCATG GTCAACTGTA AATGGACGAG CTTGACCTGC 3480 

AACTTCTACA GAGGTGATTT CCAAATCTTT TTGGTGGAGG GAGATGCCGT CACTCTGTCC 3540 

TTGACCAGTG ATGGTCACTT TCCCAGAAAA AGTCTTGGTC TCACGACTCA AATCTAAAAA 3600 

TAAATCATAA TGTTCAGGAA CAAATTGCTT AATGGG 3 636 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5066 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



ATAGCGTGTA 


ATAATCGATT 


TTAGAGGTAC 


CATAAGCCAC 


CTCCTACAAA 


TAGAAACCGA 


6C 


TATAAATCAA 


TGCCTTCCAC 


CCTTAGACTT 


CCCTAGTTCC 


TGTCTCAAGC - 


GAAACATTTC 


120 


TTTGAAACAG 


GAATAAGTTA 


ACCAATTCAT 


ACCAATAGCT 


AGCAGAATAA 


AAAGAAACCA 


180 


AATGCCCCAT 


AACTTCATAT 


CTCTCACATT 


TCTCAAGACG 


GTATTGAAAA 


ACAGAACTGA 


240 


AACAACTGTC 


CAAGCAAGGC 


TAAAAAGAGA 


ATAGAAGGGG 


ATGTAAAACC 


AG7AAAAATA 


3C0 


ATAAAAAATT 


GGAAAAAACT 


TACTATTTCT 


GTTGGCCTTT 


TCAATCCAGT 


TATCAAAATA 


360 


AAAGTACGGT 


GCTAAAAGTA 


AGAATTTAAA 


CAAATGTTCC 


ATCACCGACA 


TCCCCCCTTC 


420 


TTTTGATAGC 


GTTTTCTATT 


ATTTTATTAT 


ATCAAAAAAA 


TCCGGAACTG 


TCATTCCAGA 


480 


TTCTACTTTT 


TTATTTGCGT 


TTTCTTCCGA 


TGAGATCAAT 


CGGTGTTCCC 


TCAAAAACAA 


540 


AGGCCTTGCG 


GATTTGATTT 


TCCAAGAAAC 


GCAGGTAACA 


AAAGTGCATG 


AGTTCTTCTT 


600 


C ATT G AC AAA 


GATGACAAAG 


GTTGGTGGTT 


TGGTTGCCAC 


TTGGGTCGCA 


TAGAAAATCT 


660 


TGAGACCTTT 


TCCTTTGTCT 


GTCGGTGTTG 


GGTTGATCGC 


AATGGCATCC 


ATGATGACAT 


720 


CGTTCAAGAC 


AGCTGATGGA 


ATACGTGTAT 


TTTGACTTTC 


GCTGATTTGC 


TTAATCATCT 


780 


CAGGAAGTTT 


GTGGAGACCT 


TGCTTGCTTA 


AAGCTGATAC 


AAAGATAATC 


GGTGCGTAAG 


840 


GCAGGTATTG 


GAACTGCTCA 


CGGATATCTT 


CTTCCCAGTT 


TTTCATAGTG 


TGGTTATCTT 


900 


TTTCAAGCGT 


ATCCCACTTG 


TTGACCACGA 


TAATCATCCC 


TTTACCAGCT 


TCATGGGCAA 


960 


ATCCTGCGAT 


ACGCTTGTCG 


TACTCACGAA 


TGCCTTCTTC 


CGCATTGATG 


ACCATCAAGA 


1020 


CCACATCTGA 


ACGGTCAATA 


GCACGCATGG 


CACGCATAAC 


AGAGTATTTC 


TCAGTATTTT 


1080 


CATAAACCTT 


ACCAGACTTA 


CGCATACCAG 


CCGTATCAAT 


CATGGTAAAC 


TCTTGACCAT 


1140 


CTGTATCTGT 


AAAGTGGGTA 


TCAATGGCAT 


CACGAGTTCT 


TCCAGCAACA 


GGACTAGCAA 


120C 


TAACACGGTC 


TtCTCCCAAG 


ATAGCATTGA 


TCAAGCTTGA 


TTTTCCAACG 


TTAGGACGAC 


1260 


CAATCAACCT 


AAACTTAATG 


ACATCTGGAT 


TTTCTTCCTC 


ATATTCATTT 


GG AAGATTTT 


1320 


CTACGATCGC 


ATCTAGCACA 


TCCCCTGTAC 


CGATTCCATG 


GACAGATGAC 


ATAGGCAATG 


1380 


GTTCACCCAA 


ACCGAGAGCA 


TAGAAATCAT 


ATATATCATT 


TCTCATCTCA 


GGGTTGTCCA 


1440 


CCTTGTTGAC 


! TGCGAGGATA 


. ACTGGTTTGT 


' GGGTCTTATA 


. AAGCTTACGA 


. GCTACGTATT 


1500 


CGTCTGCATC 


: AGTAATTCCT 


1 TCCTTACCAG 


[ ACACGACAAA 


. AACCATAACA 


> TCTGCTTCTT 


1560 
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CCATGGCAAT TTCTGCCTGG TGCTTGATTT GTTCCATGAA AGCAGCATCG ACATCATCAA 1620 

TTCCTCCTGT ATCAATCATC CTAAAAGAAC GATTGAGCCA C7CACCCGTT GCATAAATAC 1680 

GGTCACGTGT CACTCCTTCG ACATCTTCTA CAATGCAGAT TCGCTCACCA GCGATCCGAT 1740 

TAAATACGGT TGATTTCCCA ACATTGGGAC GTCCTACAAT GGCAATAGTT GGTAGGGCCA 1800 

TAATTTCTCA CTTTCTACAA TAATTTCTTC TGTTCAAGAT TTTTTCTAGT TGAGCTTGGT 1860 

TCAGCTTGAC CAAACTGTTC TGCTAGGCGC TGACTCCAGC TTGTGGTCCC ACGCGCCCCA 1920 

GCATAGTCAG CCTGAACACG GTCATAACCT TGGATTGCCT CAGTTGACTG TTCTTGGTAT 1980 

TCTTCCTCAA AGACAACATT CTCTAGTGGC ACTCTCGCTT TCATATCATG ATGTTGATTT 204 0 

GGCACACCCA GTCCCATCCC AAAGACAGAA TAGGTGTAGT CAGGTAGGTT AAAGAGCTCT 2100 

GCCACTTCTT CAGACTTGTA TCGAACCAAA CCGATAATCA CACCACCATA GCCCAAGCTT 2160 

TCAGCTGCCA ACAAGGCCTT TTGTCCAGCA AGAGCTGCAT CGACCGAACT AATCAAGAGA 22 20 

i CCTTCCACAC CTTGGGGTTG GAAGGTGTCG GTATGAAGTC GGCCTCCCTT TTCTGCTCGG 2280 

TTCAAATCTC CGACAAAGAG AACCAAAACA GCAGACTCCC GAATCCCTTC TTGAGGTACC 2 340 

AATTCATACA AGGCATCTTT CTTCTCTTCA CTTCGTACCA CAATCACAGA GTAGGATTGG 2400 

AAATTCTTCC AAGATGATGC CATCTGGCCT GCTGTCAAAA TCTCATTTAA GTCTACTTGG 2460 

CGAATTTCTT GCTCTTTAAA CCTGCGCACT GAAGTATGAG CCTTCATCAA TTTAATGGTT 2S20 

TCTGTCA7CG ACGGTTTACT CCTTCTAAAC GAGTCTCCTC AGCCAAATAA CGGATGCGTT 2580 

CCATGACCCG TCTGGCTTCC CACCTTTCGT CATTTCCATG TTTCACTTTC GCAAAATGCT 2640 

TCTCCAAATC TTCAAAGTTC AAGTTGGATG TGAAAAAGGT CGGTAAATTT TCCTGCATCC 2700 

GATATTGGAG AATGACCTCC AGGATTTCGT CACGCACCCA AACCCTTGAT TGCTCGGCGC 27 60 

CAATATCATC TAAAATCAGG ACCTCAGACA GCTTAATCTC ATCCACCAAG CTCTTAACAT 2820 

TGC CATC ACT CA7AGCATTT TTGACATCAA TGACAAAGCT AGGATAGTGG AGCACAGTTG 2880 

ATGAAACACC ACGTTTTTCT GATAAATCAT GAGCTAAGGC CGCCACCATG AAACTTTTAC 2940 

CCACACCAAA GTCTCCATAT AAGTAAAGAC CTTTTCGAAT ACCTGGATAT TGCTCCACGA 3000 

AGGCTAGTAG CTTTTCAAAA ACTGGTAAGC GCCCCAAATC ATCCAAGTCA ACTTGAGCCA 3060 

AACTAGCTTT CTTGAGACTG GCTGGTAGAT TGATTAACTT GAGACGGTTC TTAATAGCCG 3120 

CTTCTTTTTC AGCCGCGATT AGCTCAGGAG TTTCTTCATA TGAAACATCT GCATAACCAT 3130 

CATTCTTAAC CAAAATCCGC TTGTAGCCTT 7GGCAATATA ATCCGTATCC CCACGGAGAA 3240 

ACTTG7CACG CTCGGTGATG TACTGATTAA ACTTGGAGAT ACTGCGATTT AATTCCTTTG 3 300 
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GAGTTAAGGA TTCTTGCTGG ATAAAGGCCG CAACATCAGG GTCCTTCATG ATTTTCTGGA 3 360 

CCAAATCTTG ATAATAAAAA CGGCTGGGTT GACGTTTGAG TACCTCTCCG ACACTTTCCA 3420 

TCTAATCTCC TCCTTTTTCT AATCGAGCTA ATAGTTCTTG CTTCTTACGT TCTAGTTCCA 3480 

GACGAGTTTC CTCGCTGGTT TCATTCTTAT ATTCAGGATT ACTCCATTTA GGAACATTGC 3 540 

TTTTTTCTGG GGCAGTCTGA TTCTGTTTTT CTGTTTTTGC TT TC TGCCCT CGATCACGAA 3 600 

• TTCGTAAAAC GGCCTCTTCT GCCGAATGAA TCTTTTGATA GGCATAGTCA TTGGCTACCT 3660 

TCATCGCATA TTTCTCATTG ATATTTGCCG AATCCACCTT ATTAAAGGTC AATAAGAGAA 3720 

TAATATTGAT GACTTCGTCC AGTAAGCCCA AGCCAGCCAT CTGTTGCAAG AGTTCTCTTT 3780 

CTGTTTGGGT AATGGTTCCC TTGCGTGTTT GCTTGATTTC TGCTAAGAAC TGCACGGCAG 3840 

TTTTACTTTT AGCTTCTTTG ATAATGGTCG CTTCCTTAAG ACTAAAGTCA GAGGAAACTG 3900 

GTTTTTGAGC AATTTTTTCA CGCATGCGTT TGCTTGAAAT AACCTGGGAA ACAGCTGTTG 3960 

ACTTGGCCAA TTGATAGGTT TCAAACCAAG TCCATTTCTT CTCCTCGGCA ATAGCAAAGA 4020 

GGTTTAAGAC ATCGGACTGC TCATCCGCAA AACGAAGTCC ATCTCGAGCC ATCACCTGCC 4080 

GAAAATCTTC CAAGTCAAAA TCATTGGCCA CTTTCTTCTT GAGACCAAGG TCTTCTTGAC 4140 

TGCCTAGTTC TCCCAATTCT GGAAAGACTT GATTCACTCA GACAGGTATT TCTTCACCAT 4200 

CAGCACTTTC AACTTTCAAA TCCTCCACAG CTACATCGCC AATCTTTTTC TCTAAGAGTC 4260 

TCCCATAAAC AGGATGCCCC AAGAAGTCTT GACTAGATAG AGCAGCATGG AGGGCTACCT 4320 

GATAAACATC ACCCTTTTGA TAGAGGGTCA AC AG ATT AAA AGCAGATAAG ATTTTCAATG 4 3 30 

ATTTTATCAG TCTATCCATC CCAAAGTTCA GATGGTTGAG AATGCTTGAA AAAAGATATT 4440 

CCTTTCTACC ATTATCCCAA AAACTGATTG TATAAACATA AAGGCTCAGT GCCTCCTGAC 4500 

CGATAATCGG GAGGTAGCAC TGTACCAGAG ATGAGGTATC TTGCGACACC CGATTATTCT 4560 

TTAGATAAGA AAAACGGTCA ATTGGCTTCA TTTATCTTTC CTTTTTCTTT TTAGAGGACT 4620 

GGGTGATTTG TTGGAGCAAG CTCTCTAACT CACTGACATC CTTAAAACTA CGATAGACAC 4680 

TAGCAAAACG TACATAGGTA ATCTCGTCCA ATTCAGCCAA CTCCTCCATG ACGAGTGAAC 4740 

CAATGTCCTC ACTTTGAATT TCATTTTCAT TTCGACCACG GAGTTTCTGT TCGATACGAT 4800 

TG ACT AC CAT GTTGATTTCA TCACTTGACA CAGGACGTTT CTGGGCTGAG CGGATAATCC 4860 

CATTAAAGAT TTTATCTCTG GAGAATTGTT CCCGTGTGCC ATCTTTTTTA ACAACCACTA 4920 

AGGTTCTTTC TTCTACTCGT TCGTAGCTTG TAAAACGGTG TTGGCATTCG TCGCACTCAC 4980 

GTCTTCTACG AATGGTGTTC CCTTCTTCTG CTTGGCGACT ATCGATAACA CTTGACTTGG 5040 

TAGCCCCACA TTTTGGACAG GGTACC 5066 
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(2) INFORMATION FOR SEQ ID NO: 80: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 960*7 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



LAL l *VjAAu 4 


iTTTrii apa 


GGf ATGG AAA 


ACATCATG CC 


TGTACTTGAA 


GTACGTGCAC 


60 




I Vjti l L \_ I AAL 




CAGTTGAAGT 


TCGTCCAGAA 


CCTCGTACAA 


120 


V, Av. 1 1 VjL»AV_ k 


f T % ^ m f^^P^P^*/ *P*1*^T 


GTAAGAATCG 


CTCGTCTTCG 


TGGTGAACAC 


ACAATGCAAG 


180 




AAAnunnn 1 


TTGGATGCTG 


CTAACAACAC 


TGGTGCAGCA 


GTTAAGAAAC 


240 


(j 1 ^iAAUA 1 At. 




GCTGAAGCTA 


ACCCTGCATT 


CGCACACTTC 


CGTTGGTAAG 


3 00 




A A AGGGT*** A A 


GAAAGTCCCA 


GAGAAAATAG 


GGAATCGAAG 


CAGGTTGCGG 


360 


TT AAt. L AA 


tv* a r* a ttt* a i* 
IUAL»Ai i 


L 1 i i 1 l 1 *. 


ACAC^TTTAG 


CTTGAGCTCA 


ACTAAATCAT 


420 






TGGAAGCTAA 


AAATACGAAA 


CTGACGCAGT 


ATTCCACGAA 


480 


T*af a i/VlPT 
TAt. AAviVjAlr 1 


1 1 1A1LI 1 L 1 


TG AC GC AGC A 


TCCCGTTCCA 


GCTCACATCG 


CCTAACTAAC 


540 


TTTAGCCCGG 


CTTCAAATTA 


GCTAAATCGA 


TTAGTATTAG 


CTATAACTCA 


GCTTACCATC 


600 


TCGTAACTTG 


AAACCAACAA 


TAGCATGAAA 


ACATTGACAA 


CCCGTAGGTC 


CTGCCTATCC 


660 


CTTTTTATTA 


AAATCGTGTT 


ATAATAGAAT 


AGAAATCAAA 


AATAAATAGG 


AGAAACAAAC 


720 


CTCATGGCAC 


GCCAATTTTC 


ACTTGAAAAA 


ACTCGTAATA 


TCGGTATCAT 


GGCTCACGTC 


780 


GATGCCGGTA 


AAACAACAAC 


TACTGACCGT 


ATTCTTTACT 


ACACTGGTAA 


AATCCACAAA 


840 


ATCGGTGAAA 


CTCACGAAGG 


TGCGTCACAA 


ATGGACTGGA 


TGGACCAACA 


CCAACAACCT 


900 


GGTATCACGA 


TCACATCTGC 


TGCGACGACA 


GCTCAATCCA 


ACAACCACCG 


CGTAAACATC 


960 


AT CG AC AC AC 


CAGGACACGT 


GGACTTCACA 


ATCGAAGTAC 


AACGTTCTCT 


TCGTGTATTG 


1020 


GATGGTGCGG 


TTACCGTTCT 


TGACTCACAA 


TCAGGTGTTG 


AGCCTCAAAC 


TCAAACAGTT 


1080 


TGGCGTCAAG 


CAACTGAGTA 


CGGAGTTCCA 


CGTATCGTAT 


TTGCCAACAA 


AATGGACAAA 


1140 


ATCGGTGCTG 


ACTTCCTTTA 


CTCTGTAAGC 


ACACTTCACG 


ATCGTCTTCA 


AGCAAATGCA 


12C0 


CACCCAATCC 


AATTGCCAAT 


CGGTTCTGAA 


GATGACTTCC 


GTGGTATCAT 


TGACTTGATC 


1260 


AAGATGAAAG 


CTGAAATCTA 


TACTAACGAC 


CTTGGTACGG 


ATATCCTTGA 


AGAAGACATC 


1320 


CCAGCTGAAT 


ACCTTGACCA 


AGCTCAAGAA 


TACCGTGAAA 


AATTGATTGA 


AGCAGTTGCT 


1380 
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GAAACTGACG AAGAATTCAT GATGAAATAC 
TTGAAAGCTG GTATCCCTAA AGCGACTATC 
TCAGCCTTCA AAAACAAAGG TGTTCAATTG 
AGCCCACTTG ACATCCCAGC AATCAAAGGT 
CGTCCAGCAT CTGACGAAGA GCCATTTGCA 
TTCGTAGGTC GTTTGACATT CTTCCGTGTT 
CTATTGAATA CTTCTAAAGG TAAACGTCAA 
AACAGCCGTC AAGAAATCGA CACTGTTTAC 
AAAGATACTA CAACTGGTGA CTCATTGACA 
ATCAACGTTC CAGAACCAGT TATCCAATTC 
GACAAGATGG GTATCGCCCT TCAAAAATTG 
ACAAACGTTG AAACTGGTGA AACAGTTATC 
CTTGTTGATC GTATGCGTCG TCAGTTCAAA 
TCTTACCCTG AAACATTCCG CGCTTCTACT 
GGTGGTAAAG CTCAATTCCG TGATGTATGG 
GGATTCGAAT TCGAAAACGC AATCGTCGGT 
GTTGAAAAAG GTTTGGTAGA ATCTATGGCT 
GACGTTAAAG CTAAGCTTTA TGATGGTTCA 
TTCAAGATTG CGGCTTCACT TTCCCTTAAA 
CTTGAACCAA TGATGCTTGT AACAATCACT 
GGTCACGTAA CTGCTCCTCG TGGACGTGTA 
ATCGTTCGTG CTTACGTTCC ACTTGCTGAA 
GCATCTCAAG GACGTGGTAC ATTCATGATG 
TCACTACAAG AAGAAATTAT TAAGAAAAAT 
GAAGGAAGTC ACTTAGTGCC TTCCTTTTGT 
TAGTAGAAGA ATAATGTGAG GAAAATGAAT 
ATTGGGGATG CCTGCTGAAA TGAGACAGGC 
AGTTGTGGTT CATAAAATTA GTAAGGTATG 
ACCGATTGAA ATCTTTTTAG AATTAAAGAA 
CAATAAAGCT GTTTTTGAAA TTAAGGCTCG 
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CTCGAAGGTG AAGAAATCAC TAACGAAGAA 1440 

AACGTTGAAT TC7TCCCAC? ATTGTGTGGT 1500 

ATGCTTGATG CGGTTATCGA CTACCTTCCA 1560 

ATTAACCCAG ATACAGACGC TGAAGAAATT 1620 

GCTCTTGCCT TCAAGATCAT GACTGACCCA 1680 

TACTCAGCTC TTCTTCAATC AGGTTCATAC 1740 

CGTATCGGAC GTATCCTTCA AATGCACGCT 1800 

TCAGGTGATA TCGCTGCTGC CGTTGGTTTG 1860 

GATGAAAAAG CTAAAATCAT CCTTGAGTCA 1920 

ATGGTTGAGC CAAAATCTAA AGCTGACCAA 1980 

CCTGAAGAAG ATCCAACATT CCGCGTTGAA 2040 

TCAGCTATGG GTGAACTTCA CCTTGACGTC 2100 

GTTGAAGCCA ACGTAGGTGC TCCTCAAGTA 2160 

CAAGCACGTG GATTCTTCAA ACGTCAGTCT 2220 

ATTGAATTTA CTCCAAACGA AGAAGGTAAA 2 280 

GGTGTGGTTC CTCCTGAATT TATCCCAGCG 2340 

AACGGTGTTC TTGCAGGTTA CCCAATGGTT 24 00 

TATCACGATG TCGACTCATC TGAAACTGCC 24 60 

CAACCTGCTA AATCACCACA ACCAGCTATC 2520 

GTTCCAGAAC AAAACCTTGG TGATGTTATG 2580 

GATGGTATCG AAGCACACCG TAACAGCCAA 2 640 

ATGTTCGGTT ACGCAACACT TCTTCCTTCT 2700 

GTATTTGACC ACTACGAAGA TGTACCTAAG 27 60 

AAAGGTGAAG ACTAATCCGT CCTCACTCTA 2820 

CTTTAGAAAA TACCTCTAAA TATGGTAAAA 2 880 

GTCAAATAGT TTTGAAATTT TGATGAATCA 2940 

TCCTGCTTTA GCACAGCCCA ATATTGACCG 3000 

GGAGTTTCAT TTCGTA7TTT CTAATATTTT 3060 

AGGTTTGAGC GAAGAATTTT CTAAGACAGG 3120 

GTCTCAAGAA TTTTCAAATC AGCTCTTGCA 3130 
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GTCCTACTAT AGGGACCCTT TCTCTGAAGG TCCATGTGCT AGTCAAGGTT TTAAGTCCCT 3240 

TTATCAAAAT TTCCAAGTTC GTGCTGAGGG TAATCAGCTA TTTATTGAAG GATCTGAAGC 3 300 

GATTGATAAG GAACATTTTA AGAAGAATCA TCTTCCTAAT TTAGCCAAAC AACTTGAAAA 3 360 

GTTTGGTTTT CCAACTTTTA ACTGTCAAGT CGAGAAGAAT GATGTCCTGA CCCAAGAGCA 3420 

GGAAGAGGCC TTTCATGCTG AAAATGAGCA GATTGTTCAA GCTGCCAATG AGGAAGCGCT 3480 

CCGTGCTATG GAACAACTGG AGCAGATGGC ACCTCCTCCA GCGGAAGAGA AACCAGCCTT 3 540 

TGATTTTCAA GCGAAAAAAG CTGCAGCTAA ACCCAAGCTG GATAAGGCGG AGATTACTCC 3 600 

TATGATCGAA GTGACGACAG AGGAAAATCG TCTGGTATTT GAAGGGGTTG TTTTTGATGT 3 66C 

GGAGCAAAAA GTGACTAGAA CAGGTCGTCT TTTAATCAAC TTTAAAATGA CGGACTATAC 3720 

TTCAAGTTTT TCTATGCAAA AGTGGGTTAA AAACGAGGAA GAGGCCCAGA AGTTTGACCT 3 "7 80 

CATCAAGAAG AATTCTTGGC TCCGAGTTCG AGGGAATGTG GAGATGAATA ACTTCACACG 3340 

CGATTTCACT ATGAACGTAC AGGATCTGCA GGAAGTTGTT CACTATGAGC CGAAGGATTT 3900 

GATGCCAGAA GGTGAGCGTC GGGTTGAGTT TCATGCTCAT ACTAACATGT CGACTATGGA 3960 

TGCTTTGCCA GACGTCGAAG AGATTGTTGC AACAGCTGCT AAGTGGGGAC ACAAGGCGGT 4020 

TGCTATCACG GACCATGGGA ATGTCCAGTC CTTTCCACAT GGCTATAAGG CGGCTAAGAA 4080 

AGCGGGAATC GAGCTGATCT ATGGGATGGA AGCCAATATC GTGGAGGACC GTCTCCCTAT 4140 

CGTCTATAAC GAAGTGGAGA TGGACTTCTC AGAAGCAACC TACGTGGTCT TTGACGTCGA 4200 

AACGACGGGA CTTTCAGCTA TCTATAATGA CTTGATTCAG GTTCCGGCTT CTAAGATGTA 4260 

CAAGGGGAAT GTTATTGCTC AATTTGATGA ATTTATCAAT CCTGGGCATC CCTTGTCAGC 4 320 

CTTTACTACA GAGTTAACTG GAATTACAGA TGATCATGTC AAAAATGCCA AACCACTAGA 4 380 

ACAAGTTTTG CAAGAATTCC AAGAATTTTG CAAGGATACG GTCCTAGTTG CCCACAATGC 4 440 

TACCTTTGAC GTTGGCTTTA TGAATGCTAA TTATGAGCGG CATCATCTTC CAAAGATTAG 4 500 

TCAGCCAGTT ATTGATACGC TGGAGTTTGC TAGAAACCTC TATCCTGAGT ATAAACGCCA 4 560 

TGGTTTGGGG CCTTTGACCA AGCGTTTTGG TGTGGCCTTG GAACATCACC ACATGGCCAA 4 620 

CTACGATGCG GAACCGACTC G7CGTCTCCT TTTCATCTTT ATCAAAGAGG TAGCAGAAAA 4680 

ACATGGTGTG ACCGATTTAG CTAGACTCAA CATTGATCTA ATCAGTCCAG ATTCTTACAA 4 74 0 

AAAAGCTCGG ATCAAGCATG CGACCATCTA TGTCAAGAAT CAGGTAGGTC TAAAAAATAT 4 800 

CTTTAAGC7G GTTTCCTTGT CTAATACCAA GTATTTTGAA GGAGTGCCAC GGATTCCGAG 4 860 

AACGGTTCTA GATGCCCATC GAGAGGGCTT GATTTTAGCT TCAGCCTGTT CAGAGGGTGA 4 920 
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AGTTTTTGAC GTGGTCGTTT CTCAAGGTGT 
TGATTTTATC GAGGTCATGC CACCGGCTAT 
CAAGGATATG GAGGAACTCC AGACCATTAT 
TGGCAAGCCT GTTCTGGCTA CGGGAAATGT 
TCGTGAAATT ATCGTCCGTA GTTTGGGACA 
TGGTGAACAT GCCCAACCAG CACCACTTCC 
GTTGGATGAA TTTGCCTTTT TGGGAGAGGA 
CAATGCCTTG GCAGAAATAT TTGAATCCGT 
TTTCATCGAC AAGGCTGAAG AAACAGTTGC 
TTATGGAAAT CCGCTGCCAG ATATTGTTGA 
ACTGGGGAAT GGATTTGCTG TGATTTATCT 
TGAACGGGGT TATTTGGTTG GTTCTCGTGG 
GATTGGGATT ACGGAGGTCA ATCCTCTCTC 
CAGTGAGTTT ATCACAGATG GTTCGTACGG 
TCCAAACTGT GGTCACAAAC TCAGTAAAAA 
TGGTTTTGA? GGGGATAAGG TTCCTGATAT 
TAGCGCCCAC TTCGATGTCC GTGATATCTT 
GGTTGGTACG GTAGCTGCCA AGACTCCCTA 
TCGCAAGTTT TATCGTGATG CAGAAGTAGA 
GCGGACAACA GGCCAACACC CGGGGGGAAT 
CGATTTTACG CCTGTCCAGT ATCCAGCAGA 
CTTTAACTTC CACGATATCG ATGAGAACGT 
TCCGACTATG ATTCGAAAAC TTCAGGATTT 
GGATGACGAA GGCGTGATCG CACTCTTTTC 
ACAAATTGGA ACGCCTACGG GTATGTTGGG 
TGGAATGGTA GACGAAACCC ATCCGACAAC 
GTCCCACGGT ACTGATGTTT GGTTGGGGAA 
GGACCTATCG ACTGTTATCG GTTGTCGGGA 
TCTGGAACCT AAGATGGCCT TTACCATTAT 
GATTTCAGAA GAGGACAGAA ATGGCTATAT 
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GGATGCGGCG GTTGACCTCG CCAAGTATTA 4980 

CTATGCACCC TTGATTGCCA AAGAGCAGGT 5040 

CAAGAGTTTG ATAGAGGTTG GAGACCGCCT 5100 

TCACTATATC GAACCGGAAG AAGAGATTTA 5160 

GGGTGCGATG ATTAATCGAA CTATCGGTCA 5220 

AAAGGCTCAT TTTCGAACGA CTAATGAGAT 5 280 

ACTGGCTCGT AAACTCCTTA TTGAAAACAC 53 40 

TGAAGTCGTT AAGGGTGACT TGTATACGCC 5400 

TGAGTTGACC TATAAGAAAG CTTTTGAGAT 5460 

TTTGCGGATT GAAAAAGAAT TAACATCCAT 5520 

GGCATCGCAG ATCCTGGTGC AACGTTCTAA 5580 

GTCTGTCGGA TCTAGTTTCG TTGCCACCAT 564 0 

TCCTCACTAT GTCTGTGGTC AGTGTCAGTA 5700 

TTCAGGATTT GATATGCCCC ATAAGGACTG 5760 

CGGACAGGAT ATTCCGTTTG AGACCTTCCT 5820 

TGACTTGAAC TTCTCGGGAG AAGATCAGCC 5880 

TGGTGAAGAA TATGCCTTCC GTGCGGCAAC 5940 

TGGATTTGTC AAAGGTTACG AGCGAGATTA 6000 

ACGCCTCGCT CAAGGAGCGG CGGGTGTCAA 6060 

CCTTGTTATT CCGAACTACA TGGATGTCTA 6120 

TGATGTCACG GCTGAATGGC AGACCACTCA 6180 

CCTCAAACTC GATGTACTGG CACATGATCA 624C 

GTCTGGTATT GACCCTAATA AAATTCCTAT 6300 

TGGGACTGAT GTGCTAGGGG TAACACCTGA 6 3 60 

GATTCCAGAG T7TGGAACAA ATTTCGTACG 64 20 

CTTTGCGGAA TTGCTTCAGC TGTCTGGTCT 64 80 

TGCTCAGGAT CTGATTAAGC AAGGAATAGC 6540 

CGACATCATG GTTTACCTCA TGCATGCGGG 6600 

GGAACGGGTA CGTAACGGTT TGTGGCTAAA 6660 

CGAAGCAATG AAGGCTAATA AGGTGCCAGA 6720 
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GTGGTATATC GAATCCTCTG GGAAAATTAA GTACATGTTC CCTAAGGCCC ATGCGGCACC 6780 

CTACGTTATG ATGGCCTTGC GTGTAGCTTA CTTCAAGGTT CACCATCCTA TTT ATT ACTA 684 0 

CTGTGCTTAC TTCTCCATTC GTGCTAAGGC TTTTGATATC AAGACCATGG GTGCGGGCTT 6900 

GGAGGTCATC AAGCGCAGAA TGGAAGAAAT CTCTGAAAAA CGGAAGAACA ATGAAGCCTC 6960 

TAATGTGGAA ATCGATCTCT ATACAACTCT TGAGATTGTC AATGAGATGT GGGAACGAGG 7020 

TTTCAAGTTT GGTAAATTAG ATCTCTACTG TAGTCAGGCG ACAGAGTTCC TCATCGACGG 7080 

GGATACCCTT ATCCCACCAT TTGTAGCAAT GGATGGTCTG CGAGAGAACG TTGCCAAGCA 7140 

ACTGGTGCGG GCGCGTGAAG AGGGAGAATT CCTCTCTAAA ACAGAACTAC GCAAGCGTGG 7 200 

TGGACTCTCA TCAACCTTGG TTGAAAAGAT GGATGAGATG GGTATTCTTG GAAATATGCC 7260 

ACAGGATAAC CAGTTGAGTT TGTTTGATGA GTTGTTTTAA AAAATTGCTT AATAATCTAT 73 20 

TAAAAGAGGC TAACGTATAT CCAATAGATT TACATTAGCT IT C TTT ITT G TTAAAATAGT 73 80 

CTATGGAAAG AGGGTGAGAG TATGTCAAAG ATGAGTATAA GCATCCGTCT CGATAGTGAG 74 40 

GTTAAGGAGC AGGCCCAACA GGTCTTTAGT AATCTGGGAA TGGATATGAC AACAGCTATT 7 500 

AATATTTTCC TTCGTCAGGC AATTCAATAT CAGGGATTAC CTTTTCATGT TAGACTAGAC 7 560 

GAAAATCCGA AGTTGCTCCA AGCGTTAACG GATTTAGACC AAAATCCTAA TATGAGCCAG 7 620 

TCTTTTGAAT CAGTCTCAGA TTTGATGGAG GACTTACGTG CTTAAGATTC GTTATCATAA 7 680 

ACAGTTTAAA AAAGATTTTA AGTTGGCTAT GAAGCGTGGT TTGAACGCAG AATTATTAGA 774 0 

AGAAGTTTTG AATTTTCTGG TTCAAGAAAA AGAACATCCT CCCAGAAATC G70ATCATTC 7 800 

ATTGACGGCA TCCAAGCATT TTCAAGGAGT TCCTGAATGC CATACCCACC CAGATTGGCT 7860 

TTTGGTTTAT AAAGTAGACA AGTCGGAATT GATTTTAAAT TTGCTGAGGA CAGGCAGTCA 7 920 

CAGTGATTTA TTTTAATCTA TTTTAAGGGG CTTCTCATGA AACTAAGAAT ATTTGCGGAA 7980 

GATAAGCCGG CTAAGAAGGT ATTTGAATAT CAATTAGAAC TTGCTCATCG TACAATTCTT 8040 

CTATCGACAG CACTCTTGTC AGGTGCTATT GCTTTAGCAG GAATCTTTTC TGCTTTGAAA 8100 

GAAAAATAAA AATAGAAAAG AGAAAACAGA ATGGTTTTAC CAAATTTTAA AG AAAATCT A 8160 

GAAAAATATG CGAAATTGTT GGTTGCGAAC GGAATTAACC TGCAACCTCG TCACACTTTG 8220 

GCTCTCTCTA TTGATGTGGA GCAACGTGAA TTGGCACATC TAATCGTGAA AGAAGCTTAT 82 80 

GCCTTGGGTG CGCATGAGGT CATCGTTCAG TGGACAGATG ATGTGATTAA CCGTGAGAAA 834 0 

TTCCTCCATG CCCCGATGGA GCGTTTGGAC AATGTGCCAG AATACAAGAT TGCTGAGATG 84 00 

AACTATCTCT TGGAGAATAA GGCTAGCCGT CTTGGAGTTC GTTCATCTGA TCCAGGTGCC 84 60 
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TTGAACGGAG TGCACGCTGA CAAGCTTTCA GCTTCTGCTA AAGCTATGGG ACTTGCCATG 3520 

AAGCCTATGC GTATCGCAAC TCAATCTAAC AAGGTTAGCT GGACTGTAGC AGCTGCAGCA 8580 

GGACTTGAGT GGGCTAAGAA AGTCTTCCCA AATGCTGCGA GCGACGAAGA AGCAGTTGAT 8640 

TTCCTTTCGG ACCAAATTTT CAAAACTTGC CGTGTCTACG AAGCAGATCC TGTTAAGGCT 8700 

TGGGAGGAAC ATGCAGCCAT TCTCAAGAGC AAGGCCGATA TCCTTAATAA GGAGCAATTT 8760 

TCACCCCTTC ACTACACAGC GCCAGGAACA GATTTAACAC TTGGTTTGCC AAAGAACCAC 8820 

GTTTGGGAAT CAGCTGGTGC TGTCAATGCA CAGGGCGAAG AATTCTTGCC AAATATGCCA 8880 

ACAGAAGAGG TCTTCACAGC GCCTGACTTC CGTCGTCCAG ATGGTTATGT CACTTCTACA 8940 

AAACCGCTTA GCTACAACGG AAATATCATT GAAGGCATTA AGGTGACCTT TAAGGATGGA 9000 

CAAATCGTAG ATATCACTGC TGAGAAGGGT CATCAGGTTA TGAAAGACCT TCTCTTTCAA 9060 

AATGCGGGTG CCCGTGCCTT GGGTGAATGT GCCTTGGTAC CAGATCCAAG TCCAATTTCT 9120 

CACTCAGGCA TTACCTTCTT TAACACCCTT TTCGATGAAA ATGCCTCAAA CCACTTGGCT 9180 

ATCGGTGCAG CCTATGCGAC TACCCTTGTT GATCGAGCCG AGATGAGCGA AGACGAGCTT 9240 

GAAGCTCCAG GGCTTAACCG TTCAGATGTT CACGTAGACT TTATGATTGG TTCTAACCAA 9 300 

ATGGATATCG ATGGTATTCG TGAGGATGGA ACGCGGGTAC C7CTTTTCCG TAATGGGAAT 9360 

TGGGCAAATT AAGGAGATAA TATGTTAGGA AGTATGTTCG TTGGTCTCCT AGTGGGATTT 9420 

TTACCAGCTG CTATGACCAA TCGTGGAGAG CGAATGGGAT CTTTTCGAAA AATCTTTCTC 9480 

CGTTCGATCC CACCCTTTCT AGCTCACTTG CTCTTTGGAA CTTGGGGGCC AGTTTTATCA 9540 

GGAACAGCTA TTATCCCAGC GATTTTAGGA GCCATGATTG TTTTAGCTAT TTTTTGGACA 9600 

CGAGGAA 9607 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i> SEQUENCE CHARACTERISTICS : 

< A) LENGTH: 14231 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
{ D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

CTACAAGATA ATTCCAGCTA TAACATCCGC TATAATACTA ACACCGAGC7 CTATCATAAC 60 

GCTCATTAGT TTCACCTCCT CTCACGAACC CATAGCAACG TAATCGGTAA CCGATGACAA 12 0 

AAATAGTATA CCACAATACA TTTAGATCAT CAAGGTCACT TAATTCTTGA AATATCAGAT 180 

CTAAGAGAAA AATCTTTAAA ATCAGAAAAA CGCATAATAT CAGGTGTGCA AAAACTTGAT 24 0 
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ACTATGCCTT 


TT ATT GTCGG 


AAGGTTTACT 


C C ATTTTCTC 


CTGAAATTGA 


GTTTTTGTCC 


300 


ACCCTCTGTT 




CTAAGAAAAT 


AATGTCATGT 


GCTGAATATT 


TGTAAATCAG 


360 


TCACCACACA 


G AACG AT ACT 


CTTCGAAAAT 


CTCTTCACAT 


CATGTCAGCT 


TCGTCTTTCC 


420 




CACTCACTTC 


ATC AGTT CT A 


TCTACAACCT 


CAAAACAGTG 


TTTCGAGCTG 


480 


Al. i lVj/\iUAA 


111 I LrtAAlV- 


TGTACTTTGA 

1 \_l & nW 4 4 4 


GCAAGCTGAG 


ACTAGCTTCC 


TATTTGATTT 


540 


tt* iTTr a at & 

ILnl lunnln 




ATTCTCCATC 


AAATAATTCG 


ACTGCGTCTA 


ATAATTTTTG 


600 


All. uuLntu 




T A A AGGTTGT 


GTATTTGGAG 


AGGGGATTAA 


TTTTAAAAAA 


660 


l\_l_Alj 1 L t Ivj 


TA A AATTT AG 


AACTATCAAT 


CAGTAAGATG 


CTTTCATGGG 


CTTTGTCAAT 


720 


AATATTCTTT 


ill Vj AAA 1 /*Vj 


CTTGGCTGAG 


AGAAGCTTCA 


TAAACATATT 


GGTCATCAAT 


780 


AC— Iwl I \J<- L 




CTAAATCG AT 


ATTAAAATGA 


TCTAATAAAG 


AATTTTCCTT 


840 




ACC ACflC AAC 


AGGATTGATG 


TTTGACCTCG 


CCAGATGTGA 


TAAACATTTT 


900 




TT A AC ACTTT 


CAGATAGGGT 


TTGTGCAGTA 


TGTAAACCAT 


TTGTAAAAAT 


960 


AAlLAAAl I A 


Tru AnTTCAC 

I \- AAvi I I U/w 


AAACATAGGG 


ACAGAGTTCG 


TAGACAGTAG 


TACTAGAATC 


1020 


1 AUn 1 ALj A 1 A 


rariTArrAr. 

V_ n\»A iALWnU 


ACCGAATAAA 


GTCTTTAGCG 


AGACTAGCGA 


TTAGTCTTTT 


1080 * 


T Lvj(.l_ 1 At* i A 


I **^^P^PP T Hp 


C ACGT ATTTG 


ATGAGAAAGT 


TCAATTCTGT 


TCATAGAGGA 


1140 




l A I V.LU 1 I 


1 1 \— I I 11 \jn i 


AAGACCTTGA 


TTTTCTAAGA 


AAATTAAATC 


1200 




r"T AfTTCTfir 
L> I AV- I lul VjI. 


TGG AG AAAGT 


GATTTCTGCC 


AGCTCTTTTA 


CGCCAATTCT 


1260 • 


TTTTTTCTTT 


f*T*P Xfil T**"** 
1 1 UA I AA lit. 


CAATCAJVTTC 


AAGTACACGT 


*^C, A^OTTTT A 


TCATAAGCTC 


1320 ^ 


CTC CT AATTT 


A TP a TTT A A 
A 1 V_A I 1 * V_ AA 


CT AT ATT ATA 


GCACAAATTG 


GAGGAATTTG 


AATTATTTTT 


1380 


ITT*!! ITITTP 
A i L» An 1 A I I Vj 


CCTTAACATT 


TGAACATTAT 


TCAAGTAAGC 


GTTCACATAT 


TGAAAAAATA 


1440 


AAA CCTCfW! 


AT^ATAATAA 


AGTTAATCmA 


CGACGAAGAG 


ACAACAAAAA 


TGGAAGCGGT 


1500 


TTTAGCAATA 


GATTTAGGTG 


CGACTTCTGG 


AAGAGCAATC 


GTTGGTTACC 


TTTCTGAAAA 


* S6n 


TAAACTAGTA 


ATGGAAGAAA 


TAAATCGCTT 


TTCTAATCTA 


CCTATTAGAG 


TAAAAGGGCA 


1620 


TTTATCTTGG 


G AT ATT G ACT 


TTCTACTAGC 


TAAAATTCTT 


GAAAGTATCC 


GCTTGGCTAA 


1680 


TACTAGTTAC 


AAGATTTTAT 


CTATCGGTAT 


TGACACATGG 


GGAGTTGATT 


TTGGACTGAT 


1740 


TGATAATGAA 


GGTAAGCTGT 


TATTACAACC 


TGTTCATTAT 


CGTGATGAAA 


GAACAAAGGG 


1800 


AGTGTTAAAG 


GAAATATCTG 


AAATGACTGA 


ATTAGAAAAA 


CTGTATTCAG 


AGACAGGAAA 


1360 


TCAGATTATG 


GAGATAAATA 


CCTTGTTTCA 


ACTCTTTAAG 


GCACGTCAAC 


AATCTCCTGA 


1920 


CTCTTTCTAT 


AAGACCAATA 


AGATTCTTTT 


AATGCCAGAT 


TTGTTTAATT 


ATCTCTTGAC 


1980 
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AGGTAAGTTT GCTACACAAA AAAGCATTGC TTCAACAACT CAATTATTTG ATCCTAGGAG 2040 

TCAAAATTGG AATCAGAATA TCTTAAAACT ATTTGAATTG GATTCATCTT TACTTCCTGA 2100 

AATTGTTTCA GAGGGAAATG TTCTTGGAAG GATAAAAGAG GAGTATGGTT TAGGCGATAT 2160 

TCCTGTTGTG AATGTTTGTA GTCATGATAC ACCAACCCCG ATTGTCTCAG TACCTAAGAC 2220 

AGAAGGTAGT TTATTTATTT CATCAGGTAC TTGCTCTTTC GTTGGAGTGG AACTTACTTC 2280 

ACCGATTCTT ACTACCGAAT CCTTCAGTTA TGGATTTACA AATGAAGTCG GTAAAGATGG 2 340 

AGTGATTACA TTTCTGAAGA ATTGTACAGG GTTGTGGATC ATAGAGGAAC TAAGACGTTC 2400 

ATTTGAACGA AGAGGGAAAG CCTATTCTTT TGATGATATT AGGACAATGG TGGAGAAAGA 24 60 

AAAAGAAAAT CTTCCTCTGA TTGATACTGA ATCAACTGAA TTTGCAACAC AATCTGATAT 2520 

GCACAACAC? TTGACAGAAT ATCTAGCTTA TCATCATGAA ACTAGAGACT GGACAGATGG 2580 

ACAACTATTT AAGATTGTTT ATGAAAGCCT AGCTGAAACG TATAGGAAAG CCATAGAGTT 2640 

ACTAGAACAA CTAACTCATA AGGTTTATAA GAG GAT AT AT CTCATTGCAC GAGGTGCTAG 2700 

AGCCAGTTAC TTTAACCAAA TGATTGCTGA TAGAACTGGT AAAGAGGTTC TTACAGGTTT 2760 

GACTGAGGCT ACAGCTGTGG GGAATATTGT TGTGCAGCTC ATAGCTATGG G AC AATT AAA 2820 

AGGGATGGAA GAGGCTCACC ATGTTATTGA GGAGTTTCTA CAATTAGACA GTTATTACTC 2 880 

CCAAAAGAAT TAAAAAGATT GAGAGTTTGT AAATTTGCCT CCCTCCCCCT TCTTACCTTT 2940 

TGTGCAGGAA GGGGGGATAA TTGGTGAATT GAAAAATATT TAGT G TTTTG ATATGAGGAG 3000 

GACAAGGATG TCAGATGTAA AACAAGAATT AATTAAATAT GGTAAGAAGC TAGTAGAAAC 3060 

AGATTTGACG AAAGGAACAG GTGGGAATCT CACCGTTTTC GATCGTGAAA AACAATTGAT 3120 

GGCAATTACC CCGTCGGGTA TTGATTTCTT TGAAATCAAA GAATCCGATA TTGTAGTGAT 3130 

GGATATTAAT GGAAATGTTG TAGAGGGAGA ACGCTTGCCA TCTAGCGAAT GGTATATGCA 3240 

TTTGATTCAA TATCAAACTC GTGATGATAT CGATGCAATT ATCCATGCTC ATACAACTTA 3 3 00 

TGCAACAGTA TTAGCTTGTC TCAGAGAACC ACTTCCAGCG AGTCATTATA TGATTGCAGT 3 360 

GGCAGGGAAA GATGTTCGGG TAGCTGAGTA TGCAACATAT CGCACGAAAG AATTGGCTGT 3420 

GAATGCAGCT AAAGCAATGG AAGGTCGTAG AGCAGTTTTA CTAGCGAATC ATGGAATTTT 34 30 

AGCAGGTGCA CAAAATTTAT TGAATGCATT TAATATTGTT GAAGAAGTTG AATATTGTGC 3540 

AAAAATTTAT TGTTTAGCTA AGAATTTTGG AGAGCCAGTA GTTCTTCCTG ATGAGGAGAT 3 600 

GGAATTGATG GCAGAAAAAT TTAAAACATA CGGTCAGAGA AAATAGGGAG GATATTAATC 3 6 60 

TTAAAACATA TACCGAAAAA TATTTCTCCA GATTTATTGA AGACTTTAAT GGAAATGGGA 37 20 

CATGGAGATG AAATAGTATT AGCTGACGCG AATTATCCT7 CTGCCTCATG TGCAAATAAG 3 780 
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5220 


ACTATTTCAG 


GTGCGATTAT 


TGGTACTGCA 


TTTGGTATTT 


TATCTGGTCA 


AGGAGAAACT 


5280 


GCTGGTATCG 


CTATAGCAGT 


TCCAATTGCA 


GTTGCTACCC 


AACAGTTGGA 


TCTTCTTGCA 


5340 


AAAACTTTAG 


ATGTTTATTT 


TGTGAAAAAA 


GCTGATAATG 
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CCACCAATCG 


TTATGCAGCG 


ACTTAACTCT 


GCAGGTGCTT 


TACTACCTTC 


AATTGGTTTT 


5580 


GGTATGCTTT 


TAAATATGAT 


GCTCAAGAAA 


AATATGTGGG 


TATTCTTGTT 


GATTGGATTC 


5640 


ATTTCTTCTG 


TGTATGGAGG 


AATGTCAACC 


ATTGGGATCT 


CACTAGTTGG 


TATTGCGGTA 


5700 


GCATACTTCT 


ACGATATGAT 


TGGAAGCAAA 


CCACAAGAAA 


CAACTTCAAG 


TAGTGATGTT 


5760 


GAGGAGGATC 


TTGATCTATG 


ATGAATAATA 


AAGTAACTAA 


AGTTGAACTT 


AAAAAAGTTT 


5820 


TCAAACGAAG 


TTTTATGTAT 


GGTTCTTCAT 


GGAACTATGA 


GAGAATGCAG 


AACCTAGGTT 


5880 


TTCTATATAC 


AATTCTTCCA 


GTATTGAAAA 


AACTATACCC 


AGACAAAGAT 


TCAGCTTCTC 


5940 


CTGCAATGAA 


ACGTCACCTT 


GACTTTTTCA 


ATACTCATCA 


AACAGCGGCA 


CCATTTATTC 


6000 


TTGGAGTTAC 


TTCCGCTATG 


GAAGAACAAG 


AAGGAAATGA 


AGGTGCAGCT 


TCAATTACTG 


6060 


GTATTAAAGT 


TGGCTTGATG 


GGGCCACTGG 


CTGGTCTACG 


AGATAGTTTG 


TTCTGGCTGA 


6120 


CACTAGTTCC 


TATCTGTTTT 


AGTATTGGTG 


CCTCTTATTC 


TAAAGACGGC 


GGTGCTTTAG 


6180 


CTATCTTTAT 


CGCCTTAATA 


TTGTTTAATA 
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TCCTGTTAAA 
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6240 


TGAAATATGG 


GTATACTAAG 
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TGTGTAAATT 
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GGAAAGAATC 


CGGTTGTACT 
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TTAGGAATTT 


TCAAGTAGTA 
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'S960 


TGTGCTCTTT 


CATTGTATGA 


AAGAGAGCTA 


GATTTAGATA 


CACCTATTTC 


TAATGTTGTG 


7020 


TTTGAGCCTA 


ATAGTTGTAA 
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GTTGCCGTA7 
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CTTC AAAAC A 
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7140 
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ATAAACGGTT 


TAATGACGAA 


GTATCTAAAC 
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TAGAAAAGGT 
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; TGTATTGGGA 


, GAGACAATAG 


1 TTATTCGGAA 


k TGCTACAGAG 


7320 
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GTATTTCTTT ATCTCAAATC AATGACGGAT TATTCGGCAA ATATAGATAT TTCTTCTCTT 7 380 

CACGGAGAAT TT ACT ACT AT TCATTACTTT ACAGAAAAAG ATGAACATGT AAAAAAATAT 7440 

CAGGAGCAAT TTAATAGAGT TGATTTTAAA CTAGACTATA GTAAACCTTG TCTTAGCATT 7500 

CCAACGAATC TACTTCTTGA AAACACTAAA AAGTATAGTA ACTACTTGAC TAACTTGTTA 7560 

TTTCATTATG GAAGATATCT GTTAATATCG TCTACTCAAC CGAATGGTTT ACCTGCCAAT 7620 

CTTCAAGGAA TATGGTGTGA TGAATTAAAT CCAATTTGGG GTTCTAAATA TACGATTAAT 7680 

ATTAATACTC AAATGAATTA TTGGATGGTA GGTCCATGTG ATTTACCACA AGTAGAATAT 7740 

CCATTATTTG ATATGCTCGA AAGAATGAGA GAACCGGGAA GACTAACCGC TAAGAAAATG 7800 

TATGGAGCTA GAGGTTTTAC AGCACATCAT AATACGGATG GTTTTGGCGA TACGGCTCCC 7860 

CAATCTCATG CCATGGGGGC TGCAATTTGG GTATTAACTA TTCCATGGTT ATGTACTCAT 7920 

ATTTGGGAAC ACTATTTATA TTTCCAAGAT GAGCGTATTC TTACGGAACA TTTTGAAATG 7980 

ATAAAAGAAG CATTTCTTTT CTTTGAAGAT TATTTATTTG AGCTGGATGG CTACTTGATG 8040 

ACAGGTCCAA GTGTCTCACC GGAAAATAAA TATCGCTTAA AAAATGGTAT TGAAGCAAAT 8100 

GCTTGTCTAT CATCTACAAT TGATAATCAA ATTCTAAGAT ATTTTTGTGA TTCATGCATT 8160 

GGCATTGCAA AACAATTAGG AGACAATTCG CATTTTATTA GTCGTGTGAA GGAGTTAAAA 8220 

AAGAAACTAC CTAAAACAAA AATAGGTAGT AATGGGCAAA TCCAAGAATG GTTAGAAGAT 8280 

TATGAAGAAG TAGAGCCTGG GCATAGACAC ATTTCACCTC TATTTGGGCT TTATCCTTAT 8 340 

AATGAGATTG ATATTCATAA AACTCCGCAA TTAGCAGAAG CAGCTAAAAT CACTATCAAT 8400 

ACCAGATTAT CAAACGCTAA TTTTTTATCT TCACAGGAGA GGG AGCAAGC GATTAATAAT 84 60 

TGGTTAGTAA GTGGTTTGCA TGCTAGTACA CAAACAGGTT GGACTGCTGC ATGGCTGATT 8520 

CATTTTTTTG CGAGACTATA TCAAGGTGAA CCTGCTTATA ACCAGATTAA TGGTTTGTTA 8S80 

AATAATGCGA CTCTTGGCAA TTTATTTCTT GACCATCCAC CATTTCAAAT TGATGGTAAT 8640 

TTAGGTTTGG TGAGTGGAAT TTGTGAATTA TTACTACAGA GCCATCATAA TTGGTTATCA 8700 

CTAATTCCAG CTTTACCTTC TGCTTGGTCA GAAGGAGAAG TGAAAGGTTT CAGAGTAAGA 87 60 

GGAGGATATA AGGTATCGTT TGCTTGGAAA AATGGGGATA TAACATTCCT AAAATTGGAA 8820 

GGAGGAAACA AAGATCAAAA AGTAAGAGTA AGAATATATG GCAAAAATAC TGATGTACAA 8880 

AATATTGAAT TGGTATTTAA TTCAGAAAAA ATTATTGAGT TAAATTTTTA GGTATAAGTC 8940 

ATGAATAAAG AAAAAATAAA AAGAAAATTA ATCACAATAT TGTTTGTATG TATTGGGATG 9000 

TTATGTTT7G GATTGTTAGC AGGAGTTAAG GCTGATAATC GTGTTCAAAT GAGAACGACG 9060 
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ATTAATAATG 
TTATGGTGGG 
TATCCAGCGA 
GATACGAAAG 
ATTCCAGTTT 
TTAGATGAAC 
TGGATTTACA 
TATGCAGCGC 
GATCCGACAT 
AATACGCCAA 
GGCTTATGTG 
ACAAACACAT 
TCAATCATTG 
GAATGTGCCG 
ATTATTCCTT 
AATAGAACAA 
TATCAAGGAC 
ATTCTTCCTC 
GCAAAAATTT 
CTTTATCCAA 
ATTTATAATA 
AATAATACAA 
AATCCAAATA 
GCATTATCAG 
AACTGGATAA 
CTTACATTAA 
CATTATACTT 
CATAATGGAA 
ACACCAGATA 
AGTTCTGTAG 



AATCGCCAT7 
GGAACACATT 
TTGAACTTCA 
AATTGAGAGA 
TCTTGGTTAT 
AATTCCAAAA 
ATAACCAGTT 
ATTTTATCTG 
TCTTTGAAGC 
TAAGAGATGA 
ATAACTGGGG 
TTGAAACTGG 
CTATGGAAAT 
CGTATACATT 
TCTTTAGACA 
AAGCTCTATT 
TTTATTCGAA 
TAATACATGA 
TGACTAAAAA 
AACTTTATGA 
GTAATCCTAA 
AGTCGTTATC 
ATTTACATAT 
GAAATTTTGA 
GCAAAAATTA 
AAGGGCATAC 
ATACAGAAAA 
TGCTAGAGAT 
AATTTAATGA 
ATTACAATGG 



GTTGCTTTCT 
GAAGGGAGCA 
TCCTGCAAAA 
ATGGTATGTC 
TATGTCGGCT 
GTATAGTGTG 
AGCTCCGCAT 
GCATGATCAT 
GAGTCAAAAA 
TGCGGGTACA 
CTCATCAACA 
AAGAGCTAGG 
GATCAATGTA 
TATGACAAAT 
TGCTATACAA 
TTGGAATGGA 
TGATGAAACA 
GAAAATTGAT 
TAGTGAGGAA 
AGGAGATGGG 
TATCAATAAA 
GTTAGATTTG 
TTTATTGAAT 
TGCATCAAAA 
TTCCA7CAAT 
TGGTCATAAA 
TTGGGATGAG 
GTCTATAAAT 
TCGTAATTTG 
AGACCCTAAT 
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CCGTTGTATG 

TGGGAAGCTA 

GTCTGTAAAC 

AAGATGTTGG 

GGAGAGCGTA 

TTAAAAGGTG 

AGTGCTAAAT 

GAAAAATGGT 

TATCATAAAA 

GATTCTATCG 

GATACATGGA 

GATATGACAT 

TATACTGGGG 

GATGTACCAA 

AATCCAGCTC 

GAAGGTAGGA 

ATCCCTTTAT 

AAGGAAAAGA 

TTGTCTAGTA 

TATCCTCAGC 

AATCAGCAAG 

ACGCCACATA 

AATTACAGGA 

AGTTGGAAGA 

CCTGTAGATA 

CCTCAGATAA 

AATACCCATG 

ACTGAGGGGA 

AATATAGCAT 

AGAGCTGTGG 



GCAATGATAA 
TTCCTGAAGA 
CAACAAGTTG 
AGGAAGCTCA 
ATRCAGTTCC 
TTTTAAATAT 
ATTTGGAAGT 
TCTGGGAAAC 
ATTTGGTGTT 
TTAGTGGATT 
AATGGTGGGA 
CCTATGCATC 
GAGGCACACT 
CTCCAGCATT 
CAAGTAAGGA 
TTAGTTCATT 
ATAATAATGG 
TTTCATCTAT 
AAGTCAACTA 
GTGTAGGTAA 
TAATGTTGCC 
CTTACGCTGT 
CAGATAAGAC 
AAGAAGAATT 
ATGACTTTAG 
ATATAAGTGG 
TTTATACCAT 
CAGGTCCAGT 
ATGCAAAACC 
ATGGTAACAG 



TGGTAACGGA 
TGTAAAGCCA 
TATTCCACGA 
AAGTCTAAAC 
TCCAGAGTGG 
TGAGAATTAT 
TTCTGCCAAA 
TATTATGAAT 
GGCAACTAAA 
TTGGTTGAGT 
AAAACATTAT 
GGAACCAGAA 
TTATAATTTC 
TACTAAAGGT 
AGAAGTTGTA 
AAACGCATTT 
CAGATATCAT 
ATTCCCTAAT 
TTTAAACTCC 
TTCCTGGTAT 
TATGTATACT 
TGTTAAAGAA 
AGCTATGTGG 
AGAGTTAGCG 
GACAACAACA 
CGATAAAAAT 
TACGGTTAAT 
CTCTTTCCCA 
AACAACACAA 
AAATGGTAAT 
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TTTAACTCTG GTTCGGTAAC ACACACTAGG GCAGATAATC CCTCTTGGTG GGAAGTCGAT 10920 

TTCAAAAAAA TGGATAAAGT TGGGCTTGTT AAAATTTATA ATCGCACAGA TGCTGAGACT 10980 

CAACGTCTAT CTAATTTTGA TGTGATTCTA TATGACAATA ATAGAAACGA AGTTGCTAAG 11040 

AAACATGTTA ATAATTTGTC GGGTGAATCT GTTAGTCTAG ATTTCAAAGA AAAAGGAGCA 11100 

AGGTATATTA AAGTTAAATT ACTAACGAGT GGAGTGCCTT TGAGTTTAGC AGAAGTAGAG 11160 

GTTTTTAGAG AATCAGATGG TAAGCAATCT GAAGAGGATA TAGATAAAAT AACAGAAGAT 11220 

AAAGTAGTCT CTACAAATAA GGTAGCTACT CAAAGTTCAA CCAATTATGA GGGTGTAGCT 11280 

GCTTTAGCAG TTGATGGTAA TAAAGATGGA GATTACGGAC ATCATTCGGT GACTCATACT 11340 

AAGGCAGATT CTAACCCTTG GTGGCAGGTC GATCTGGGAG AAGAGTTTAC GGTTTCTAAA 11400 

GTTGATATTT ATAATAGAAC AGATGCCGAA CCTCAGCGTT TATCTAATTT TGATGTTATT 11460 

TTTCTATCTT CATCAGGAGA AGAAGTTTTT AGAAGACATT TTGATAAAGT AGTTGATGGT 11520 

TTGTTATCTT TAAAAGTACC TTCTGTAGGG GCTAAGCTAG TCAAAATAGA ATTAAAATCA 11580 

GCAGCTATTC CGTTAAGTTT AGCGGAAGTT GAAGTCTATG GTTCAAAGAG AACTCCGAAG 11640 

AAACTTTCTA ATATTGCATT AACAAAAGAA ACTCGACAGA GTTCAACGGA TTACAATGGT 11700 

TTTTCTCGTC TAGCAGTTGA TGGAAATAAA AACGGAGATT ATGGTCATCA TTCAGTGACT 11760 

CATACCAAAG AAGATTCTCC TTCATGGTGG GAGATAGATT TAGCACAAAC CGAAGAATTA 11820 

GAAAAGTTAA TTATTTATAA TAGAACAGAT GCTGAAATTC AG AG ATT AT C AAATTTTGAT 11880 

ATT ATT AT AT ATGATTCAAA TGATTATGAA GTTTTTACAC AACATATTGA CAGTTTAGAA 11940 

AGCAATAATC TATCCATAGA CTTAAAAGGA CTGAAGGGAA AAAACGTTAG AATTTCTTTG 12000 

AGAAGCGCAG GAATTCCTTT AAGTTTAGCA GAGGTAGAGG TTTATACTTA TAAGTAATTT 12060 

TAAAAATTAT CACCCAGGCT ACCGTAAATA TAATGGAGAT GGTAGTATGA AACAAACAGA 12120 

AAAATAAGAG GAAAATAGTA TGATTCAACA TCCACGTATT GGGATTCGTC CGACTATTGA 12180 

TGGTCGTCGT CAAGGTGTAC GCGAATCACT TGAAGTGCAA ACAATGAACA TGGCTAAAAG 122 40 

TGTGG C AG AT TTGATTTCAA GCACATTGAA ATATCCAGAT GGGGAACCTG TGGAATGCGT 12300 

GATTTCTCCA TCTACTATTG GCCGTGTACC AGAGGCTGCA GCTTCCCATG AGTTGTTTAA 123 60 

AAAATCAAAT GTTTGCGCAA CAATTACAGT TACACCATGC TGGTGTTATG GTAGTGAAAC 12420 

TATGGATATG TCTCCAGATA TTCCTCATGC TATTTGGCCA TTTAATGGGA CAGAACGCCC 12480 

AGGAGCTGTC TATCTTGCAG CTGTACTAGC TTCACATGCT CAAAAAGGGA TTCCAGCCTT 12540 

TGGGATTTAT GGAAGAGATG TTCAGGAAGC TAGTGACACA GATATTCCAG AAGATGTCAA 12 600 
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AGAAAAACTT TTACGCTATC CGCCTCCAGC TCTTGCAACT GGCTTGATGA GAGACACTGC 12 660 

TTACCTATCA ATGGGTAGTG TTTCGATGGG GATTGGTGGT TCTATTGTAA ATCCGGATTT 12720 

CTTCCAAGAA TACTTAGGAA TGCGAAATGA ATCGGTAGAT ATGACGGAGT TCACGCGCCG 127 80 

TATGGACCGT GGTATTTACG ACCCTGAAGA GTTCGAACGT GCGCTCAAAT GGGTGAAAGA 12 840 

AAACGTAAAA GAAGGATTCG ACCATAACCG TGAAGACCTT CTTTTAAGCC GTGAAGAAAA 12900 

AGATAGACAA TGGGAATTTG TTATTAAGAT GTTCATGATT GGACGTGACT TAATGGTTGG 12960 

TAACCCAAGA CTTGCTGAAC TTCGTTTTGA GGAAGAAGCG GTTGGTCACC ATGCTTTAGT 13020 

AGCTGGTTTC CAAGGTCAAC GTCAGTGGAC AGACCATTT? CCAAATGGGG ACTTTATGGA 13080 

AACTTTCCTC AATACTCAGT TTGACTGGAA TGGTATTCGA AAACCATTTG TATTTGCGAC 13140 

AGAGAATGAT TCACTAAATG GTGTGTCTAT GCTCTTTAAT TATCTATTAA CAAATACTCC 13200 

ACAAATCTTT GCTGATGTGC GTACTTATTG GAGCCCAGAG GCTGTTAAAC GTGTAACGGC 13 2 60 

ACATACTTTA GAGGGTCGTG CTGCAGCTGG CTTCTTACAT CTAATCAACT CTGGTTCTTG 13 320 

TACATTGCAT GGTACAGGTC AAGCTACTCG AGATGGCAAA CCTATTATGA AACCATTCTG 13380 

GGAGTTGGAA GAAACTGAAG TGCAGGCTAT GCTTGAAAAT ACAGACTTCC CACCAGCAAA 13 440 

CCGCGAATAC TTCCGTGGAG GAGGATTCTC AACTCGTTTC TTGACGAAGG GGGATATGCC 13 500 

AGTAACAATG GTACGTCTCA ATCTTCTAAA AGGGGTTGGT CCAGTGCTAC AAATTGCAGA 13560 

AGGTTACACA CTTGAACTTC CTGAAGATGT TC AC CAT ACT TTAGATAATC GTACAGATCC 13620 

AGGATGGCCA ACTACTTGGT TTCCTCCACG TTTGACAGGA AAAGGTGCTT TCAACTCTGT 13 680 

CTATGACGTC ATGAATAATT GGGGAGCTAA TCACGGAGCC ATAACATATG GACACATTGG 13740 

AGCAGACTTG ATTACCTTGG CTTCTATGTT GAGAATTCCT GTCAATATGC ATAATGTACC 13800 

TGAGGAAGAT ATCTTTAGAC CTAAAAATTG GTCCTTATTT GGAACAGAAG ATCTAGAATC 13860 

AGCAGACTAT CGTGCATGTC AGTTGTTGGG GCCACTACAT AAATAAAACT TGTTTATATA 13 920 

GGAGGTGAAC TTACGTCCCT CCTATCCTTT TAAAAAGATT TGTTAAACAA TTCACAAATA 13 980 

ATTGAAAACG AATACAAAAA GTAATATAAT GATGTTAAAT AGATACCGCG GAGGCGCAGG . :040 

AGGAAAATTA TATGGCTATA TTTTATGTTC CGGCACTCAA CCTTATTGGA AAAGGTGTTG 14100 

TAAATGAAGT GGGTCCTTAT ATCAAGGAAC TTGGCTATAA AAAGGCACTT TTGGTGACAG 14160 

ATAAGTACAT CGAAGGCACT GATATTTTAC CTAAGACTTT AAAACCACTG GATACAGAAG 14220 

GAATCGAATA T 1423 I 
(2) INFORMATION FOR SEQ ID NO: 82: 
U> SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 16995 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



AGTTCTCTTA 


ACTTTTTTAG 


GATGGCATTC 


TCCGCTCTCA 


GGTACTCATT 


TTCTGCTgAA 


60 


GACGTTCTAA 


TTCTGTCCTC 


TCTTCAGGTC 


TCGTTTTTGG 


CTTACGTCCC 


ATTTTAGGTA ~ 


120 


CTCTCCCTCT 


TGTTTTCTCA 


ACAATAGTAT 


ACCCGTTTTT 


CCTGTATTGT 


GCTAGCCAGT 


180 


TAAGAAGTAT 


CGTACGACTT 


GGGAGACCGT 


ATTCAAGAGA 


AACTCTATCT 


TTAGTCCAGC 


240 


CTTCATGTCA 


GACTTTATTA 


CTCATTTCTT 


GTTTTAAATC 


AGGAGAATAG 


TAACGATTTT 


300 


TTCCTTTTTT 


GACGAACTCT 


ATTCCGTAAC 


GATCAATCAA 


TTTAATCATG 


TACCTAATAT 


360 


TAGAATTGCT 


TATCCCAAAT 


TTATTTGAAA 


GCTTCTCTAA 


GCTATATCCT 


TGTTTTCTAA 


420 


GTTCATAGAT 


CTGAACTTTA 


TCATCATAAG 


TTAGTTTCAT 


AATAAAAACA 


CCCCAAAAGT 


480 


TAGATTTTTT 


CTGTCTAACT 


TTTCGGGTGT 


AGTTCATGTA 


CACCTGATAT 


GATGCGTTTT 


540 


ATAA'I^TTTTA 


AGCCTTTTTG 


CCCAGCCTCG 


TCAAAAGTAA 


TCTTTTGACA 


CAAAATCTGT 


600 


GACAAAACTT 


TAGTTTTAAA 


GGTTTTTAAC 


TTTGTATATA 


CTAGTTTTAA 


GAAAAGGAGC 


660 


ATCATCTAAT 


GGAAGAAAAA 


GTATCATTGA 


AAGTCAGGGT 


TCAAAAACTA 


GGGACATCGC 


720 


TTTCAAATAT 


CGTTATGCCC 


AATATTCGAG 


CATTTATTGC 


TTGGGGAGTA 


TTGACTGCCC 


780 


TCTTTATCCC 


TGATGGCTAT 








GGTCCTATGT 


840 


TAACGTATTT 


ATTGCCAATC 


CTGATTCGTT 


ACACAGGTGG 


ATATATGATC 


CATGGCCAAC 


900 


GTGGTGCCGT 


TGTAGGAGCT 


ATTGCTACTG 


TTCCTGCAAT 


CACAGGTTCT 


AGTGTTCCTA 


960 


TGTTTATCGG 


AGCTATGGTA 


ATGGGCCCAC 


TGGGAGGATG 


GACTATCAAG 


AAATTTGATG 


1020 


AGAAGTTCCA 


GGAAAAAATT 


CGTCCCGGAT 


TTGAAATGTT 


AGTTAATAAC 


TTCTCAGCTG 


1080 


GTCTCGTTGG 


TTTTGCATTA 


TTGCTTTTGG 


CTTTCTACGC 


AATCGGTCCA 


GTCCTATCGA 


1140 


CTCTTACTGG 


AGCTGTTGGG 


AATGGTGTTG 


AGGCTATTGT 


CAATGCTCGC 


CTCCTTCCTA 


1200 


TGGCTAATAT 


TATCATCGAA 


CCGGCTAAAG 


TCCTTTTCCT 


CAATAATGCC 


CTCAATCATC 


1260 


GCATTTTTAC 


TCCTCTGGGA 


GTAGAACAGG 


TAGCTCAAGC 


TGGTAAGTCA 


ATTCTCTTCC 


1320 


TATTGGAAGC 


TAATCCTGGA 


CCAGGTCTGG 


GAATTCTATT 


AGCTTATGCT 


GTATTCGGTA 


1380 


AAGGTTCTGC 


TAAATCTTCT 


TCTTGGGGGG 


CAATGGTTAT 


TCATTTCTTC 


GGAGGGATTC 


1440 


ATG AAATTT A 


CTTTCCTTAT 


GTTATGATGA 


AGCCTACTCT 


ATTTTTAGCT 


GCTATGGCAG 


1500 
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GAGGTATCTC TCGAACTTTT ACTTTTCAAC TCTTAGACGC TGGTCTTAAA TCTCCAGCTT 1560 

CACCAGCTTC TATTATTGCG ATTATAGCTA CGGCGCCAAA AGGTGTTTGG CCCCATCTAA 1620 

ATGTTCTTTT AGGTGTTTTA GTGGCAGCAG TTGTTTCTTT CCTTGTAGCA GCCCTTATTC 1680 

TTCATGCAGA CAAGTCAACT GAGGATTCGC TCGAACCTGC TCAGGCGGCT ACCCAAGCAG 1740 

CTAAGGCTCA GTCTAAAGGT CAGTTAGTAT CAACTTCTCT TGATGCAGTT GTTTCGACAC 1800 

ACTCAGTGGA AAAAATCATT TTCGCCTGCG ATGCTGGTAT GGGAAGCTCT GCTATGGGAG 1860 

CTAGTATTCT TCGAGATAAG GTTAAAAAAG CAGGTCTAGA GATTCCAGTA TCTAATCAGG 1920 

CAATCTCAAA TTTGCTTGAT ACACCAAAAA CATTAATTGT TACTCAGGAA GAACTGACAC 1980 

CAAGAGCTAA AGACAAGAGT CCAAGTGCTA TTCATGTTTC TGTTGATAAT TTCTTAGCGT 2040 

CCTCTCGTTA TGATGAAATT GTAGCTTCAT TAACAGGAGC TTCTCCAATA GCAGAAATTG 2100 

AAGGAGATAT ACCAACTTCA GCACCAGTAG ATAGTCAGGA AAGTGACCTT AACCATATTG 2160 

ATGCTGTAGT AGTTGCTTAT GGTAAAGCAC AGGGAACTGC AACTATGCGC TGTGAAACGA 2220 

TTCGGGCTAT TTTTAGAAAC AAGAATATTC GTATTCCAGT TTCTACTGCC AAAATTTCAG 2280 

AATTAGCTGA ATTTAATTCT AAAAACATAA TCATTGTAAC AACTATTTCT TTACACGCAG 2340 

AAGTGCAGCA AGCAGCACCG AATTCTCAAT TTCTTATTGT GGATAGTTTA GTAACAACAC 2400 

CACAATATGA CAAAATGGCT GCTAGAATGT ACAAATAGAA CTAGAGGTTT CTAAATTACC 24 60 

AATGCTATTA ACCAAACGAG AAGAACAATT ATTGAAGGCT TTCCTACATG TAGGGAAGCT 2520 

TTCAATGCAA GATATGACTG AAATCTTACA GGTTTCATCT AGAACAATTT ATCGAACTTT 2580 

ATCAGATTTG ACAGATAGCA TGGAGCAATA TGGAATCGAA ATAACGAAGC ATGGGAAATA 2640 

CTATATTTTG ACTGGAGAGT TGGATGATTT CCCGACAGAA CTTGAAGTGT TAGTTGAGTA 2700 

TAGTCCCCAA GAAAGACAAG AGTTGATTAC CTATCGCCTT CTGACTGAGA GTGGTTTTGT 2760 

CACCAATGAA GCATTGCAAC AGTGCACGAA AGTCAGTAAT CTAACTATTA TTCAGGATAT 2820 

TTCAGATATT GATAAGCGTC TTTTAGACTT TCATCTCAAA ATTGAACGAC AAAAAGGTTA 2880 

TCGGATTTCT GGTCATTCAG TTGGTAAGAG AAGATTTTTG GCTATTTTAC TGACAAACTG 2940 

TATCTCAGTA GCAGATTTTT CAACCGGTAA TTTTGGGAGC TTTGATATTT TAGAAGCAGA 3000 

TAGAACTGGG CTGGCCAGTC AGATTGTTAA 7AAGCAACTG TCAGGTTTTC CAGATATGGA 3060 

TGCTAGGATG AAGATGTTTT TTGCGATCTT GTTATCTCTT ATAGGTCAGG ACCAAAACAT 3120 

TGAAAATTCA CCTAATACTA GTAAGCAGGC ITT GG AAA TT TCTCAAAAAA TTTTTCAAGC 3180 

TTACTCTAAG CAGACTGCAC AATTTTATAG TATTCACGAA ATTATCTATT TTGCGAGCAT 3240 

CTTGGATGAA TTAATCATTA AACGTCAGGA CAATCCCCTC TTTACGGAGA AATTTGATGG 3 300 
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TCAATTTTTC TACAATATTT CAAATCTGAT TGATACGCTT TCCATCTATA CCAAGATTGA 3 3 60 

CTTTTTTAAG GACAAGGTTT TATTCAATTT TCTTTTCCAT CATATTCCCC TCAGTTTAGG 3 420 

CGTCCCTATC CTTTTTCACG GTGAAAATTT GCCAGAATCT ATCCAGATTT TAGTTGAAAG 34 80 

GAATAAATTT CTTTATACAG TCATCAGTCT TTTAGTGAAT GATATTTTTC CGAAATATCT 3540 

TCATACAGAG TATGAGTATG GCATGATTGC CCTACATTTT ATCTCTAGCT TAGGCCGTAG 3 600 

TC CAG AG ATT TATCCAGTCC GTGTTTTGCT TTTAACGGAT GAACGTCGGG TCACTAGAGA 3660 

TTTATTAGTC AGTAAAATTA AGAGTGTTGC TCCTTTTGTA GAGTTGATAG ATATTCAATC 3720 

TCTAGTAGAT TACCACAGTA TTGATCTCAC TCAGTATGAT TATATTTTAT CTACCAAGCC 3 780 

GCTGACTAAT CAGGAAATCG ATGTAATTTC TAGTTTTCCA ACCGTCAAAG AATTCCTTGA 3 840 

ATTACAGGAA CGACTTCAGT ATGTACAGGC ACATCGTACA ATTGTCGCGC GTGATGCTAT 3 900 

CGCTCCAGAG AAAAGTTATG ACTTGCAAGA TTATTTAATA TCTAGTAGTC AGCTTTTGAG 3960 

TCAATTCGAG TTGGTTCAAT TGGAGAATAA TCAATCATTT GAGCACACGG TAGAACAAAT 4020 

CATCCAATAT CAGAAGAATG TGAGTGACAG AGCTTACCTA ACAAGAAAAT TGTTATCTCA 4080 

CTTCCAGAAT AGTCCTATGG CTATTCCTAA TACTGGTCTG GTGCTTTTAC ATAGTCAGTC 4140 

TAGCAAAGTA ACAACAAATA GTTTTACTAT GTTTCAACTC AAACTACCTA TCTCCGCATT 42 00 

GTCAATGAAA CGAGAGGAAG AAGAGGTCAA AAGGTGTCTG CTAATGCTAA TCTCTAAAGA 42 60 

AGCTAGCGAG GAAGCTAGAG ATTTAATGAC AGCTATTAGT CAGTCGATTA TTGAAAATCA 4 320 

TCTTTATACA GAGATTTACA AGACGGGAAA TCAATCCATT ATTTATCAGA TGCTAAATAC 4 380 

TATTTTTAAC GAAAAAATTA AGAAATTGGA GAACTAATAT GAAACTTGAA AAACATTTGA 4 4 40 

TTAAGCTTAA TAAACAATTT TCTAACAAGG AGGAAGCTAT TTCTTATTGT GCGCAAGTTC 4500 

TTTATGAGGC TGGATATGTT AATGAAGACT ATATTGAAGC CATGATTGAG CGAGATAAAC 4 560 

AGCTATCTGT TTACATGGGT AACTTTATCG CCATACCGCA TGGAACAGAT GCAGCAAAAA 4 620 

ATGATGTCCT CAAGTCTGGT ATTACAGTCG TTCAAGTCCC TAGAGGGGTT GATTTTGGCA 4 680 

ATGTATCTAA CCCTCAAGTG GCAACGGTTC TTTTTGGTAT TCCTGCTATT GGTAATGAAC 4 740 

ACTTAGAAAT TATTCAGAAA ATTTCTATCT TCTGTGCAGA TGTAGATAAT GTTCTTAAAC 4 800 

TAGCAGATGC TCAGTCAAAA GAGGAAGTAT TGCGCTTATT TGATGCTGTT GAATAATTGA 4860 

ATTTAGTCAT TTGTCATCTA GTATATATGT CCCTCAAATA GGAAAAGGAG AAATTGAATG 4920 

AAACATTCTG TTCATTTTCG TGCCGGTAAT ATCGGTCGTG GTTTTATAGG TGAAATTCTA 4 980 

TTTAAAAATG GTTTCCATAT TGATTTTGTG GATGTCAATA ATCAGATAAT TCATGCTCTG 5040 
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AATGAAAAGG 


GCAAGTATGA 


AATTGAAATT 


GCACAGAAAG 


GACAGTCTCG 


TATAGAAGTA 


5100 


ACTAATGTGG 


CTGGCATTAA 


TAGCAAAGAA 


CATCCTGAGC 


AAGTCATTGA 


AGCGATTCAA 


5160 


AAGACGGATA 


TTATTACTAC 


TGCAATCGGA 


CCTAATATAC 


TCCCTTTTAT 


CGCCGAACTT 


5220 


CTAGCCAAAG 


GAATCGAAGC 


TCGCCGAGTT 


GCAGGAAATA 


CACAGGCATT 


GGATGTTATG 


5280 


GCCTGTGAAA 


ATATCATTCG 


CGGGTCTCAA 


TTTCTTTATC 


AAGAAGTCAA 


GAAATATTTA 


5340 


AGTCCGGAAG 


GTTTGACATT 


TGCTGATAAC 


TACATAGGTT 


TTCCAAATGC 


TGCAGTAGAC 


5400 


AGGATTGTTC 


CAGCACAAAG 


TCACCAAGAT 


TCCCTTTTTG 


TTGTGGTCGA 


GCCCTTTAAT 


5460 


GAATGGGTCG 


TGGAAACCAA 


GCGTCTTAAA 


AATCCAGATT 


TACGTCTAAA 


AGATGTGCAT 


5520 


TATGAAGAAG 


ATTTAGAACC 


CTTTATTGAC 


CGAAAACTTT 


TTTCAGTCAA 


TTCTGGACAT 


5580 


GCAACTTCAG 


CTTACATTGG 


TGCGCATTAT 


GGTGCCAAGA 


CAATTTTGGA 


AGCTCTTCAA 


5640 


AATCCTAATA 


TTAAATCTCG 


GATTGAATCT 


GTATTAGCTG 


AAATTCGGAG 


TCTCTTGATT 


5700 


GCCAAATGGA 


ACTTTGATAA 


AAAAGAATTG 


GAGAATTATC 


ACAAAGTCAT 


TATAGAACGA 


5760 


CTTGAAAACC 


CTTTCATAGT 


GGACGAGGTT 


AGTCGCGTAG 


CTCGTACTCC 


AATCCGAAAA 


5820 


TTAGCCTATA 


ATGAACGATT 


CATCCGGCCG 


ATACGTGAAT 


TGAAAGAACT 


CAGTTTGTCA 


5880 


TATAAAAACC 


TACTTAAAAC 


AGTTGGCTAT 


GTCTTTGACT 


ATCGCGATGT 


AAATGATGAA 


5940 


GAAAGTATTC 


GATTAGGTGA 


ATTGTTGGCT 


AAACAATCAG 


TCAAAGATGT 


TGTTATACAA 


6000 


GTTACAGGTT 


TAGACGACCA 


AGAATTGATT 


GAGCAAATTC 


TAGACTATAT 


TTAATCTTTT 


6060 


TCGAAAATCT 


CTTCAAATCA 


GGTTAGCATC 


GCTTTGTCTT 


AGGCATATGT 


TGTTCTATCT 


6120 


ACAACCTCAA 


AGCAGTGCTT 


TGAGCTGACT 


CCGTCAGTCT 


TATCTGCAAT 


CTCAAAACAC 


6180 


TGTTTGAGTT 


ATCTGCGGTA 


ATCTTTCTAG 


CTTGTCTT7G 


ATTTTTGTTG 


TTATTTATAA 


6240 


GGTAAAAGAA 


GCTCGACAAA 


AAGTCTTCAA 


AATCGGGAAA 


AGGCAGCCTA 


TCGGGTGTTC 


6300 


AAAAATCTTG 


ATAGGATGTC 


CTTTATTATG 


GAAAGCCTTA 


TTGGATTTTC 


TCCTCAGATT 


6360 


GAGTTTTTGA 


TCAGCTTTAT 


GAGATAGGTC 


TTGCTAGAGA 


TGTAGCCCAT 


CATGTTAT1T 


6420 


TTATGGACAG 


TGGGAAAATT 


GTTGAAAAAA 


ATAATGCCCA 


TCAATTCTTT 


AGTCGTCCAA 


6480 


GAGAAGAACG 


AACCAAGCAA 


TTTTGGAACG 


AATTCTTTCG 


AATGCGATCT 


ATATAGTAAA 


6540 


ATGAAACAAG 


AACAGGACAA 


ATCGATCAGG 


ACAGTCAAAT 


CGATTTCTAA 


AAATGTTTTA 


6600 


GAACTAGAGG 


TGTACTATTC 


TAGTTTCAAT 


CT ACT AT ATA 


ACTGAAAAAT 


TAGATAAATT 


6660 


AGTTTTGGAA 


AATGACTAAC 


CAAAAGATAT 


CCAAAGTAGT 


CTAAAATTGT 


CTATACTTTA 


6720 


TGAGTGTTTT 


AGTTAGGAAA 


AAGGCTTGTT 


GTCTATAATT 


GTCTGCATTA 


GTCTAGATTT 


6780 


TATTTATAGA 


AAATGTTATA 


ATAGACTGTA 


TTTAAAAAAT 


TTTAAGGAGA 


AATGACAGAA 


6840 
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TGTCTGTATC ATTTGAAAAC AAAGAAACAA ACCGTGGTGT CTTGACTTTC ACTATCTCTC 6900 

AAGACCAAAT CAAACCAGAA TTGGACCGTG TCTTCAAGTC AGTGAAGAAA TCTCTTAATG 6960 

TTCCAGGTTT CCGTAAAGGT CACCT7CCAC GCCCTATCTT CGACCAAAAA TTTGGTGAAG 7020 

AAGCTCTTTA TCAAGATGCA ATGAACGCAC TTTTGCCAAA CGCTTATGAA GCAGCTGTAA 7080 

AAGAAGCTGG TCTTGAAGTC GTTGCCCAAC CAAAAATTGA CGTAACTTCA ATGGAAAAAG 7140 

GTCAAGACTG GGTTATCACT GCTGAAGTCG TTACAAAACC TGAAGTAAAA TTGGGTGACT 7200 

ACAAAAACCT TGAAGTATCA GTTGATGTAG AAAAAGAAGT AACTGACGCT GATGTCGAAG 7260 

AGCGTATCGA ACGCGAACGC AACAACCTGG CTGAATTGGT TATCAAGGAA GCTGCTGCTG 7 320 

AAAACGGCGA CACTGTTCTG ATCGACTTCG TTGGTTCTAT CGACGGTGTT GAATTTGACG 7 380 

GTGGAAAAGG TGAAAACTTC TCACTTGGAC TTGCTTCACC TCAATTCATC CCTGGTTTCG 7440 

AAGACCAATT GGTAGGTCAC TCAGCTGGCG AAACCGTTGA TGTTATCGTA ACATTCCCAG 7500 

AAGACTACCA AGCAGAAGAC CTTGCAGGTA AAGAAGCTAA ATTCGTCACA ACTATCCACG 7560 

AAGTAAAAGC TAAAGAAGTT CCGGCTCTTG ACCATGAACT TGCAAAAGAC ATTGATGAAG 7620 

AAGTTGAAAC ACTTGCTGAC TTGAAAGAAA AATACAGCAA AGAATTGGCT GCTGCTAAAG 7 680 

AAGAAGCTTA CAAAGATGCA GTTGAAGGTG CAGCAATTGA TACAGCTGTA GAAAATGCTG 7740 

AAATCGTAGA ACTTCCAGAA GAAATGATCC ATGAAGAAGT TCACCGTTCA GTAAATGAAT 7 800 

TCCTTCGGAA TTTGCAACGT CAAGGGATCA ACCCTGACAT GTACTTCCAA ATCACTGGAA 7860 

CTACTCAAGA AGACCTTCAC AACCAATACC AAGCAGAAGC TGAGTCACGT ACTAAGACTA 7920 

ACCTTGTTAT CGAAGCAGTT GCCAAAGCTG AAGGATTTGA TGCTTCAGAA GAAGAAATCC 7980 

AAAAAGAAGT TGAGCAATTG GCAGCAGACT ACAACATCGA AGTTGCACAA GTTCAAAACT 8040 

TGCTTTCAGC TGACATGTTG AAACATGATA TCACTATCAA AAAAGCTGTT GAATTGATCA 8100 

CAAGCACAGC AACAGTAAAA TAATCTTAAT AAACAGAAAA CCCACCTGAA TTGGTGGGTT 8160 

TTCTGATGCA CTATTTTCCA AAAATCTCTT TGAGGTCTGT GTCTGTAATC CCAATCATGG 822 0 

CTGGGATGCG GTCCCAGTTT TCTTCGGTTA GGATGTAGGA TTGTTCAGAG GCACTTGATG 8280 

TGACTGTTTC AGAGACAGCT TGTTGCTTTT CTTCAACATT CTCCAGTAGA TCACTGAAGC 8340 

GTTCAATCAG ATAGGTTTTT CGGGCAGTTC CGATGTGTTG GGTAGCATAG TCGAAGGCTT 8400 

GTAATTCGCC TAGTAAGATG AGTTTGCTTT TGGCACGTGT AATGGCTGTG TAGATGAGAT 8460 

TTCGCTCCAG CATACGTCGG CTAGCACTAG TAATCGGTAG GATGACAACT GGGAACTCAC 8520 

TTCCCTGAGA CTTATGAATA CTCATGGCAT AGGCCAAGCG AATCTTGTAC CATTCGTTAC 8580 
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GGGGGTAAGA GACTTCATTA CCATCAAAAT CAATGACAAT CTCGTCTTGT TTCGATTCGG 8640 

TGTATTTACC AGGAATCAGG TCTGTCATAG CTCCTAAATC CCCATTAAAG ACA7TCATTT 8700 

CAGCATCGTT AACCAAATGA ATGACCCTGT CTCTCTTACG ATAGTGACAC TGAGGAGCTT 8760 

CAAAACTGAG TTGATCTTTT TGTGGGGGAT TGAGCAGGTC TTGCATGAGC TGATTGATAG 8820 

CATCAATCCC TGCCGTCCCT CGGTACATAG GAGCCAGAAC TTGGATATCA CCGGCGGGAA 8880 

TACCATTTCT GAGGGCGGCA CCTAAGATTT TTTCAATGGT GGCAGGAATA TGGCCACTAG 8940 

CAATTTCAAA GTAGGAACGG TCAGCTTTTT TTTGGGTGAA ATCAGCTGGC AAGATGCCCT 9000 

GTCGAATCTG ACTAGCTAGG GTGACCATCC TTGATTCTTT GCTTTGTCGA TAAATTTTTT 9060 

CCAAGCGAGT CTGAGGAATC AAAGGAATAT GAAGTAGATC CGCTAGAACC TGTCCAGGAC 9120 

TGACAGAAGG TAGCTGATCA CTGTCACCTA CGATGAGGAT CTTACTGTTA GAAGAGATAT 9180 

TGGAGAAGAG TTGATTGGCC AGCCAAGTAT CTACCATAGA GAATTCATCC ACGATGATAA 9240 

AGTCAGCATC TAGGTAATCT TCCAGATGAC TGGTATCATC GTCACCTCTC ATTCCCAAGT 9 300 

GGCGATGTAT GGTCGCGCTA GGCAAACCTG TCAATTCATT CATGCGACGA GCAGCTCGAC 93 60 

CAGTTGGAGC AGCAAGAAGA ATGGGCAGAT TGCTTTTCTT CCTGAAGTCA ACTCCTTCTA 9420 

AAAGGGCATA AACAGCAATG ATTCCATTGA TAACAGTTGT CTTACCAGTA CCAGGCCCAC 9480 

CTGTCAGGAT AAAGACCTTA TTCTGGATAC CATCACAGAT AGCCTGTTTT TGAATGTTAT 9540 

CATACTCAAT TCCCAGTTCT TGCTCGACAG TAGTGATATG TTTTTGAATG GTTTCTAAAT 9600 

CATGACTCTT CTGTTTTCCT TTTTCAAGGA TACGAACCAA GTGACTGCGG ATGCCTTCCT 9660 

CAGCGAAAAA GAGCCTCTTG TCAAAGATCT TGGTATCAAT CTGCTGAACC TTGTCTTCTT 9720 

CGATCAGGTA GGAGAGCTCT TGGGCAACTT GGCTGGGGTC TAGTTCCACG GGACGGGAAG 9780 
ACTCAAGGAG AGTAAGGGTT TGTTCCAGCA AATCCCGTGC TTCAACATAG GTGTCCCCTG 9840 
TTTCCATACA GGCCTGAAAA AGACTGTGAA CTAGACCGGC GCGGAAGCGT TCAGGAGCCT 9900 
GACTTTCGAT GCCTAGTTCC TCAGCTAGTT GGTCAGCAAT GCTAAAGCCC AAACCCTTGA 9960 

TATCCTCAAC CAGCTGGTAG GGATAATTTT CAACCACATC AAGGGTTTCT TCCTTGTAAA 10020 

AGTCTTGAAT CTGAAAGGCT AGTTTGTTGG GAATGCCGTA GTTGGCTAGT TTGGCCAAAA 10080 

TCATCTCCGT TCCGTAGTTG AGACGGAGAG TGGAGACGAA AGCCTCGCGA TTTTTGGCAG 10140 

AGAGTCCTGC GATGCCTTCT AACTTTTCTG GGTGTTGCAA AATTTCGTCA ATGGTATTTT 10200 
CGCCATAGGT ATCCACGATT TTCTGAGCTG TCTTGAGACC AATCCCCTTG AAATGGCTAC 
TTGAAAAGTA CTTGACCAAG CCCTTACTAG TTGGTTTTGC GCGATCATAA CGACTGATTT 

GCAGTTCTTC TCCATACTTG GAGTGCTGGA CAATTTGCCC CCAAAAAGTA TAGTCTTCGC 10380 



10260 
10320 
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CCTCAATTAC ATCAGCCATG GTTCCTGTGA CAATGATTTC AAAATCATCA AAATCCTCTG 104 40 

CCTCCGTATC GTCGATTTCT AGGAGGAGGA TGCGATAAAA ATTGCTGGGA TTTTCAAAAA LOS 00 

TAATCCGTTC AATAGTTCCT GAAAAATAAA CTTCCATAAA ATTCCTTTGC ATGAATAGGT 10560 

GACACTTCGG ATTGTTTTTA TTTTATACTC TTCGAAAATA TCTTCAAACC ACGTCAGCTT 10620 

CCATCTGCAA CC7CAAAACA GTATTTTGAG CTGACTTCGT CAGTTCTATC CACAACCTCA 10680 

AAACACTGTT TTAAGCAGCC TACGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTCATTGAG 10740 

TATTTGTAAA TAAACAATCA CTTCTCACGA TAGAAGAAGA GGCTGAGATT CGTGATTCTC 10800 

TGCCTCTTAG GTTTCTTAAA ATGTTCCGAT ACGGGTGATT GGCCATAAGC GGAAT7TAGC 10860 

TTCCCCTGTG ATATCTTTTG CTTTGAAGGT ACCTACGTGG CGGCTGTCGC TCGAAACCAA 10920 

GCGGTCATCT CCGAGGAGAA GGTATTCTCC TTCTGGAACA GTAAAGCTAA AGTTGGTGTT 10980 

GTAGTTGACA TCAACTGTGA AGGCTTGAGC TTTTTGAGCG ATACTTCTAA AGAAAGTTCC 11040 

TTTATTTCCT TCAAAGCCCT TGCCTGAGTA AGTGCTTTGG AGTTTGTCAT CCTTGAAGCG 11100 

TTTGATATAG TCTGCTAGAT AAGGCTCGTC CGTTTCTTTG TCATTGATGT AGAGTTTATC 11160 

ATTTTCGTAA CGAATGGTGT CGCCAGGCAT TCCAATCACC CGCTTGACGA TGTCCTTATT 112 20 

GCCATCTTCC TCATGGGCCA CCACGATATC AAAACGGTCA ATAGGAAGGT GTTTTACAAC 11280 

GAAGAGAATT TCGCCATCCG CTAGGGTCGG ATCCATGGAA TGTCCTTCTA CGCGAACATT 11340 

GCTCCAAAAA AAGATACGAC TTAAAGCTAG TAATGACAGA ATTAGGAGGA ACAATCCCCA 11400 

CTCTTTTAAG AAATTTTTAA ATGAATTCAT AACTTACCTT TCTAAGCGTT TTTTCGCTTT 11460 

TTCAGTGTTT TTAAAGTGCA ATTTGGCGCA GAAGCTGAGT CCCTGCATAC CATAGGCTTG 11520 

CAAAATCTGG CTAGCCACCT TGTCAGAAGC CGTTCCAGCT CCACTTGGCA GCTGATAACC 11580 

CAGTTCTCGT CCCAAATTTT CAAGATTTTC CAGAAAGAGA TCACGCGCAA TGACAGAAGA 11640 

AACTGCGACA GACAAGTATT TGCCCTCAGC CTTTTCTTCT AAGCTGATAG CATTCCTGAA 117 00 

ACCATTGGCC TCTTGTGCCA AGTACTTGTC ATAATTTTTA GCACTGGTAA AGGCATCAAT 117 60 

CACAATTTTC TCAGGCTGAA CACCTTTTTG AAGGAGGAGA TAGATAGCCT GATTATGGAG 11820 

GGCAACCTTA ACCGAAACAG CGTTGTAGCG GTCTCCGATG ACCTCGTTGT ACTTGCTGGG 11880 

TGAGAGAAGG AGTCCCTGCT GCTGAATTTT TTCCTTGAGA ATAGGAGTAA TCTGACGGAT 11940 

CTTTTGGTCG GTCAGAGTCT TAGAATCCCC CACACCCAGT TTTCGTAAAA AGTCGTGCTG 12000 

GTCAGGTGTG ACAAACGCAG CCACAACTGC AAGCCCACCA AAGTAGGAAC CATTTCCCAC 12060 

CTCATCTGTC CCAATTAAAG GAAGATTTTG TCCGCTGGTT TGCTCTACAG CTTGATAGCC 12120 
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AAAGAAACTG 


CCCTATTTTT 


CAGCCCCTTC 


ACCCTGAAGC 


AAGATTTTTC 


CAGAAGTATA 


L2130 


GATAGAAACC 


GTTGCTTGAG 


GTAGTTTCAA 


AAAGTAGCGG 


ATATAGGGAT 


TCTTCCTACC 


12240 


AGCCAGACTG 


GTTTGATAGT 


GTTCAAGAAA 


AGCCTGAATA 


TCCTTTTCCC 


TTGGTGTGAG 


L2300 


TGTGATACTT 


GCCATAGTTT 


CTATTGTACC 


ACAAAAGCAG 


TAAAATTTGT 


AAAAACTGAC 


12360 


AAAATTAGCG 


AATTTTGGTA 


TAATATCGTG 


AGGTGAATTT 


TATGGCAAAT 


CTAAATCGAT 


12420 


TCAAATTTAC 


ATTCGGGAAA 


AAATCGTTAA 


CCTTGACAAG 


CGAACATGAC 


AACCTTTTTA 


12480 


TGGAGGAAAT 


CGCTAAGGTT 


GCGACAGAAA 


AATACCAAGC 


AATTAAAGAA 


CAAATGCCTA 


12540 


GCGCAGATGA 


TGAAACAATC 


GCTCTTTTGT 


TGGCAGTCAA 


CTGTTTATCA 


ACTCAGCTCA 


12600 


GCCGTGAGA7 


TGAATTTGAC 


GATAAGGACC 


AACACCTAGA 


AGAACTCCCT 


CACAAGCTTG 


12660 


TGACTTGTAA 


GCAAGAACAG 


AGCAAGATTG 


AGGATTCCTT 


ATGATTTCAT 


TCCTTCTTCT 


12720 


ATTGGTCTTG 


GTTTGGGGAT 


TTTATATCGG 


CTATCGGAGA 


GGCCTGCTCT 


TACAGGTTTA 


12780 


TTACCTGATT 


TCAGCCATGG 


CATCGGCTTT 


TATGGCTGGC 


CAGTTTTATA 


AGGGGCTTGG 


12340 


AGAGCAATTC 


CATTTATTGC 


TCCCTTATGC 


AAATTCGCAG 


GAAGGTCAGG 


GGACTTTCTT 


12900 


TTTCCCATCG 


GATCAACTCT 


TTCAGCTGGA 


TAAGGTCTTT 


TATGCACCTA 


TCGGCTACTT 


12960 


GCTTGTATTT 


GGGATTGTCT 


ATAGCATTGC 


TCGTTTACTT 


GGTCTTCTCT 


TACACTTGAT 


13020 


TCCTAGCAAA 


AAACTGGGTG 


GTAAGTTGTT 


CCAAGTTTCA 


GCAGGTATCT 


TGTCCATGTT 


13080 


GCTGACCTTA 


TTTGTCTTGC 


AAATGGCCTT 


GACAATCTTG 


GCGACCATCC 


CCATGGCAGT 


13140 


TATACAAAAT 


CCTCTTGAAA 


AGAGTATCGT 


CGCAAAACAC 


ATCATCCAGA 


GCATACCGGT 


13200 


AACAACCAGT 


TGGCTCAAAC 


AAATCTGGGT 


GACAAATTTA 


ATCGGATAAA 


AAGGGCAGGA 


13260 


GTTTTCCTAG 


CCCTTTGTTT 


ACAGATTTGA 


CTCGAATCTA 


TCAGAATGTA 


AAAAGCTACC 


13320 


ACACCTAGAC 


ATTCAAAGAC 


AAGGAAATAA 


AGATGAATAA 


GAAAATATTA 


GAAACATTAG 


13380 


AG7TCGATAA 


GGTCAAGGCC 


TTGTTTGAGC 


CTCATTTGTT 


GACCGAGCAG 


GCCTTGCACC 


13440 


AATTGAGACA 


ACTGGCTCCG 


ACTGCCAAAG 


CAGATAAAAT 


CAAACAGGCT 


TTTGCTGACA 


13500 


TGAAGGAAAT 


GCAGGCTCTT 


TTCGTCGAGC 


AACCGCATTT 


TACTATTCTC 


TCAACTAAGC 


13560 


AAATTGCAGG 


AGTCTGCAAG 


AGGTTGGAGA 


TGGGAGCGGA 


TCTCAATATC 


GAGGAGTTCC 


13620 


TACTCTTGAA 


ACGCGTGCTT 


CTTGCCAGCC 


GAGAACTTCA 


AAATTTTTAC 


ACCAATCTGG 


13680 


AAAATGTCAG 


CTTGGAAGAA 


TTAGCCCTTT 


GCTTTCACAA 


ATTACATGAT 


TTTCCGCAAT 


13740 


TACAAGGAAA 


TCTTCAGGCC 


TTTAATGATG 


CGCGTTTCAT 


TGAAAATTTT 


GCCAGTGAAG 


13800 


AATTGGCGCG 


AATCCGTCGA 


AAAATACATG 


ATAGCGAGAG 


TCAGGTACGC 


GATGTTTTAC 


13860 


AAGACTTGCT 


CAAGCAAAAA 


GCGCAGCTGT 


TGACGGAAGG 


AATTGTTGCT 


AGCAGAAATG 


13920 
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GCCGTCAGGT TTTACCACTC AAAAACACCT ACCGCAATAA GATTGCAGGT GTCGTTCATG L3980 

ATATTTCTGC TAGTGGAAAC ACCGTCTATA TCGAACCCCG TGAGGTAGTC AAACTGAGCG 14040 

AAGAAATTGC TAGTCTGCGA GCAGATGAGC CCTATGAAAT GCTTCGCATT CTCCAAGAAA 14100 

TTTCTCAGCC TGTCCCCCC? CATGCGGCTG AGATTGCTAA TGACGCTTGG ATTATCGCTC 14160 

ATCTGGACTT GATTCGTCCC AAGGTTCGAT TTATCCAAGA AAGACAAGCA GTCGTGCCTC 14220 

ACCTCTCAGA AAATCAAGAG ATTCAACTGC TCCATGTCTC CCATCCTTTG GTCAAAAATG 14280 

CCGTCGCAAA TGATGTCTAT TTTGGTCAAG ATTTAACAGC TATTGTCATT ACAGGTCCCA 14 340 

ATACAGGTGG GAAGACCATC ATGCTCAAAA CTCTGGGCTT GACACAGGTC ATGGCCCAGT 14400 

CAGGATTGCC GATTTTAGCA GACAAGGGAA GTCGTGTTGG TATTTTTGAA GAAATCTTTG 144 60 

CTGATATTGG AGATGAGCAG TCTATTGAGC AGAGCTTGTC TACCTTCTCT AGTCATATGA 14520 

CCAATATCGT GGATATTCTT GGCAAGGTCA ACCAACATTC ACTCTTACTT TTGGATGAGT 14580 

TGGGGGCTGG TACTGATCCC CAAGAGGGAG CAGCCCTTGC CATGGCTATT CTGGAGGACC 14640 

TTCGCCTGCG TCAAATCAAG ACCATGGCGA CCACCCACTA TCCAGAACTC AAGGCCTACG 14700 

CTATTGAGAC AGCCTTTGTG CAAAATGCCA GTATGGAGTT TGATACTGCA ACTCTTCGCC 14760 

CGACCTATCG CTTTATGCAG GGTGTTCCTG GCCGAAGTAA TGCCTTTGAA ATTGCCAAAC 14820 

GTCTAGGCCT ATCTGAAGTT ATCGTAGGAG ATGCCAGTCA GCAGATCGAT CAGGACAATG 14880 

ACGTCAATCG TATCATTGAG CAATTAGAAG AGCAGACGCT GGAAAGCCGC AAACGTTTGG 14940 

ACAATATCCG TGAGGTGGAG CAAGAAAATC TCAAGATGAA CCGTGCGCTA AAAAAACTCT 15000 

ACAACGACCT TAATCGTGAA AAGGAAACCG AGCTTAACAA GGCGCGTGAA CACGCTCCTG 15060 

AGATTGTGGA TATGGCCCTA AGTGAAAGTG ACCAGATTCT CAAAAATCTC CACAGTAAAT 15120 

CCCAACTCAA GCCCCACGAA A7CATTCAAG CCAAGGCCAA CTTGAAAAAA TTGGCTCCTG 15180 

AAAAAGTGGA CTTGTCTAAA AATAAGGTCC TTCAAAAGCC CAAGAAAAAA CGAGCTCCAA 15240 

AGGTGGCAGA TGATATCGTG GTTCTCAGTT ATGGTCAGCG TGGTACCTTG ACCAGTCAAC 15300 

TCAAGGACGG TCGCTGGGAA GCCCAAGTTG GCTTGATTAA GATGACCTTG GAAGAGAAAG 15360 

ACTTTGATC7 TGTTCAAGCC CAGCAAGAAA AACCAGTCAA GAAGAAACAG CTCAATCTTC 15420 

TGAAACGAAC TTCTGGGCGA GGACCTCAAG CTAGACTGGA TCTTCGAGGC AAGCGCTATG 15480 

AAGAAGCCAT GAATGAGCTA GATACCTTCA TCGACCAAGC CTTGCTTAAC AATATGGCTC 15540 

AACTTGATAT CATCCATGGT ATCGGAACAG GAGTCATCCG TGAAGGAGTT ACCAAATACT 15600 

TGCAAAGAAA CAAACATGTC AAGAGTTTCG GCTATGCCCC ACAAAATGCT GGAGGCAGTG 15660 
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GTGCGACTAT 


TGTCACTTTT 


AAAGGATAGC 


AGTATTCTGG 


ACTTTATAAA 


GTAAAAACTG 


15720 


TTGAACTAAT 


TTTTACTAAT 


AAACACATTG 


ACAAAAGCCA 


ACATTTTTTG 


TAAAATTAGA 


L5780 


ATCAATTAAA 


TACCAACACC 


GAATGAAGTT 


TAATAGAAGT 


GGGGAATCGT 


TTGATTTTCC 


15840 


ATGACTGTAA 


ATGGACGGAA 


CTCTGGAGAG 


ACCGTAAAGG 


CACCGAAGGG 


CAAGGCAGGC 


15900 


AACTGCTCAA 


ACTCTCAGGT 


AAAAGGACAG 


AGCTAGGATA 


GACCGCTTTT 


TAGCATTTAT 


15960 


CTAAGCATTC 


CAGAGTACAT 


GTATCTTGCA 


TGTGCTCTTT 


CTTTTGGGGT 


TGAAACGATA 


16020 


GGAGAAGGAA 


ATGTTAGAAT 


TGCTTAAATC 


AATCGATGCT 


TTTGCTTGGG 


GACCGCCCCT 


16080 


CTTGATTTTA 


TTGGTCGCAA 


CAGGGATTTA 


CCTAACTATT 


CGGCTAGGAC 


TCTTGCAGGT 


16140 


TTTGCC7CTA 


CCCAAGCCCT 


TTCAGCTTAT 


TTTTATCCAG 


GATAAGGGAC 


ATGGTGATGT 


16200 


ATCCAGTTTT 


GCAGCTCTGT 


GTACAGCCTT 


GGCATCAACT 


GTTGGAACAG 


GAAATATCAT 


16260 


AGGAGTTGCG 


ACGGCTATCA 


AGGTTGGTGG 


ACCAGGAGCT 


CTATTTTGGA 


TGTGGATGGC 


16320 


GGCTTTCTTT 


GGAATGGCTA 


CCAAGTATGC 


GGAAGGACTC 


TTCGCCATCA 


AATACCGCAC 


16380 


CAAGGACGAC 


CATGGTGCAG 


TAGCGGGAGG 


TCCCATGCAT 


TATATCCTTC 


TAGGGATGGG 


16440 


AGAAAAGTGG 


CGACCACTTG 


CTGTTTTGTT 


TGCAGTAGCA 


GGAGTATTGG 


TTCCTCTCTT 


16500 


GGGAATCGGA 


ACCTTCACCC 


AAGTCAACTC 


GATTGCAGAA 


TCTATCCAAA 


ATACAACGAC 


16560 


GATTTCGCCA 


GCCATCACAG 


CTCTCGTCTT 


GTCTGTCTTT 


GTAGCGATTG 


CAGTCTTTGG 


16620 


TGGACTCAAG 


TCTATTTCTA 


AGGTTTCAAC 


TACTGTTGTT 


CCTTTTATGG 


CCATCATTTA 


16680 


TATCTTAGGA 


ACTCTTACAG 


TTATTTTCTT 


TAATATCGGA 


AAAATCCCTG 


GCACAATCGC 


16740 


TTTAGTCTTT 


ACCTCAGCTT 


TTAGTCCCCT 


TCCTGCGGTA 


GGTGGATTTG 


CTGGTCCTAG 


16800 


CGTTCGGATG 


GCTATTCAAA 


ATGGTGTGGC 


GCGTGGTGTG 


TTCTCAAACG 


AATCTGGTCT 


16860 


GGGTTCTGCT 


CCTATTGCAG 


CTGCAGCTGC 


CAAGACAAAT 


GAACCAG7AG 


AGCAAGGTTT 


16920 


GATTTCCATG 


ACAGGAACCT 


TTATTGATAC 


CCTCATCATT 


TGTACTCTAA 


CTCCTTTGAC 


16980 


CATCTTGGTA 


ACTGC 










16995 


(2) INFORMATION FOR SEQ ID NO: 83: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(Di TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
CCGGGCCTTT TGTAGTATAA TAGAGATACG TTTTGAAAGT AGGAGGTATC TATGGACTTA 
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ACTAAGCGCT TTAATAAACA GTTAGATAAA ATTCAAGTTT CGTTGATTCG TCAGTTTGAC 120 

CAGGCTATTT CGGAGATTCC TGGGCTCTTG CGTTTGACCT TGGGGGAACC TGATTTTACA 180 

ACGCCAGACC ATGTCAAGGA GGCGGGCAAG CGAGCGATTG ATCAGAACCA ATCCTACTAT 2 40 

ACAGGGATGA GTGGTCTGCT GACTCTACGT CAGGCAGCCA GTGACTTTGT TAAGGAAAAG 3 00 

TACCAACTGG ACTATGCTCC TGAAAATGAA ATCTTGGTTA CAATTGGGGC GACAGAGGCT 3 60 

TTATCTCCCA CTTTGACGGC TATTTTGGAA GAGGGAGACA AGGTACTTTT GCCAGCTCCT 4 20 

CCTTATCCAG GCTATGAACC GATTGTTAAC TTAGTTGGGG CAGAAATTCT TGAGATTGAT 4 80 

ACGACTGAAA ATGGTTTTGT CTTGACTCCT GAGATGTTGG AGAAGGCCAT TTTGGAGCAG 54 0 

CGTGATAAGC TCAAGGCGGT TATTCTCAAC TATCCAGCCA ATCCGACAGG AATTACCTAC 600 

AGTCGAGAGC AGTTAGAGCC CTTGGCAGCT GTTTTACGCA AGTACGAAAT TTTTGTTGTC 660 

TGTGATGAGG TTTACTCAGA ATTGACCTAC ACAGGCGAAG CCATGTGTCT CTAGGAACGA 720 

TGTTGAGAGA CCACGCTATT ATTATCAATG GTTTGTCTAA ATCGCATGCC ATGACAGGTT 7 80 

GGCGTTTGGG GCTGATTTTC GCTCCTGCGA CCTTCACAGC CCAGTTAATC AAGAGTCACC 84 0 

AGTACTTGGT CACTGCCGCA AATACCATGG CCCAACATGC TGCGGTAGAA GCCTTGACGG 900 

CTGGTAAAAA CGATGCGGAC CCATGAAGAA GGAATATATC CAACGTCGGG ACTATATCAT 960 

CGAAAAAATG ACTCCTCTTG GTTTTGAGAT TATCAAACCA GACGGTGCCT TCTATATTTT 1020 

TGCTAAAATT CCACCGGGCT ACAATCAAGA CTCCTTTGCT TTTCTGAAGC ATTTTGCTCA 1080 

GAAGAAGGCC GTTGCCTTTA TCCCTGGTGC AGCCTTTGGA CGTTACGGGG AAGCCTACCT 1140 

CCCCC7ATCT TATGCAGCCA GCATGGAGAC TATCAAAGAA GCCATGAAAC GACTTGAGGA 1200 

GTACATGAGA GAAGCATGAT TCAGTCTATC ACGACTCAAG GCTTGGTGCT TTACAATCGC 12 60 

AATTTTCGTG AGGATGACAA GCTCGTCAAA ATTTTTACAG AGCAGGTTGG CAAACGCATG 13 20 

TTTTTTCTCA AACACGCTGG TCAGTCTAAG CTGGCCCCTG TTATTCACCC CTTGGTGCTG 13 80 

GCACGATTTC TCTTGCGAAT CAATGATGAC GGACTCAGTT ACATCGAAGA CTATCATGAC 14 4 0 

GTCATGACTT TTCCCAAGAT TAATAGTGAC CTCTTTGTCA TGGCCTATGC GACCTATGTG 1500 

GCAGCTCTTG CAGATGCTAG TTTGCAGGAC AATCACCACC ATGCTCCCTT GTTTCCTTTT 1560 

TTGCAAAAGA CTTTGGAGTT GATGGAAGCA GGCTTGGATT ATCAGGTTTT GACCAATATT 1620 

TTTGAAATTC AAATTTTGAC TCGATTTGGA ATCAGCCTCA ATTTTAATGA GTGTGTCTTC 1680 

TGCCATCGGG TTGGTCAGGC TTTTGACTTT TCTTTCAAAT ATGGAGCCTG CCTCTGTCCA 1740 

GAGCATTATC ATGAGGATAA GAGACGTTGT CATCTCAATC CCAATATCCC CTATCTGCTC 1800 
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AATCAATTTC AACCTATTGA TTTTGAGACT TTGGAGACCA TTTCGCTCAA GCCTGGAATC 1860 

AAGCAAGAGC TACGCCAATT TATGGATCAA TTATATGAAG AGTACGTTGG GATTCACCTA 1920 

AAATCAAAGA AATTTATTGA TTCCCTAGCA GACTGGGGAC AATTACTAAA AGAGGAAAAG 1980 

AAATGAAAAA AATCGCAGTA GATGCCATGG GCCGCGATTA CGCACCTCAG GCCATTGTTG 2040 

AGGGTGTCAA TCAAGCCCTA TCTGACTTTT CAGATATCGA GGTTCAACTT TACGGAGATG 2100 

AAGCTAAAAT CAAGCAATAT CTGACAGCGA CAGAGCGCGT CAGCATTATC CATACGGATG 2160 

AGAAGATTGA TTCGGATGAT GAACCTACGA GAGCTATTCG GAATAAGAAA AATGCCAGTA 2220 

TGGTATTGGC AGCCAAGGCT GTCAAAGATG GTGAAGCAGA CGCTGTCCTT TCGGCTGGGA 2280 

ATACAGGTGC C77C7TGCCA GCAGGATTCT TCATCGTGGG TCGTATCAAG AATATCGACC 2340 

GTCCTGGACT CATGTCTACC TTGCCTACCG TTGATGGAAA AGGTTTTGAC ATGCTAGACC 2400 

TTGGTGCCAA TGCAGAAAAT ACAGCCCAGC ACCTCCATCA ATATGCGGTT CTAGGTTCCT 24 60 

TCTATGCTAA AAATGTCCGT GGCATTGCGC AACCACGCGT TGGTTTGCTC AACAACGGAA 2520 

CAGAGAGTAG CAAGGGCGAC CCGCTTCGTA AGGAAACTTA TGAATTACTG GCGCCTGATG 2580 

AAAGTTTGAA CTTTATCCCA AACGTGGAAG CGCGTGATTT GATGAATGCC GTTGCAGATG 2640 

TTGTTGTGGC AGATGGTTTC ACGGGAAACG CTGTGCTCAA ATCCATCGAA GGGACAGCTA 2 700 

TGGGAATCAT GGGCTTGCTC AAGACAGCTA TTACAGGTGG TCGTCTTCGA GCGAAACTAG 27 60 

GTGCCCTCCT TCTCAAGGAC AGCCTCAGTG GTTTGAAAAA ACAGCTCAAT TATTCAGATG 2820 

TTGGTGGAGC GGTCTTGTTT GGTGTTAACG CACCTGTTGT CAAGACTCAT GG CTCAAGCG 2880 

ATGCCAAGGC TGTTTATAGT ACGATTCGTC AGA7CCGTAC CATGCTAGAA ACACACC7GC 2 940 

TTGCCCAGAC TGCGCGTGAA TTTTCAGGAG AATAAAAGAG ATGACAGAAA AAGAAATTTT 3000 

TGACCGTATT GTGACCATTA TCCAAGAGCG ACAGGGAGAC GACTTTGTCG TGACAGAATC 3060 

CTTGAGTCTG AAAGACGATT TGGATGCGGA TTCTGTTGAC TTGATGGAGT TTATCTTGAC 3120 

TCTGGAAGAT GAATTTAGTA TCGAAATCAG CGATCAAGAA ATTGACCAAC TCCAAAACG? 3180 

AGGAGATGTG CTTAAAATCA TTCAAGGAAA ATAGCAATCG GAGTTCCAAG TCAACGGAAG 1240 

TAGATGCTTT TTAGAAATGA GAAATATCGG ACAAGC7CGT AAAATC77GG CTGACACTCC 3 300 

77ATCAAGGG CTCATCAAGA TATA7CC7CA AGCACAAAC7 CCACG7AAAT CCAGCAAACT 3360 

CAAGCCGC7A ACAGTTGAAG ATAAAGCCTG 7AA7CA7GCG C7A7CTAAGG AGA7AAGCAA 34 20 

GG7TGAGAA7 ATCTT7GCCA AAC7AAAAAC G777AAAATC TT77CAACAA CCTA7CGAAA 3480 

7CATCG7AAA CGC77CGGA7 TACGAA7GAA TTTGATTGCT GGTAT7A7CA ATCATGAAC7 3540 

AGGATTC7AG T7TTGCAGGA AGTCTAA7AG 7AAAAAAGTG ATTAGAAAAC ATCT7T777A 3 600 



WO 98/18931 



PCT/US97/19588 



663 

AAAATAGAGA TGATTTTGAA ACAAAAAACC TAATTCAACA CGTTTCGATG CCAATTCAAG 3 660 

ATTTGGATGA AAAAAATTAA TAGATACTGT TATACTAAAC TTGTCAAGTT TGTAACAAGA 3720 

CAAATATTAA AAATAAAAAA GAGGTATTCG TTATGAATAC AAAAACGATG TCACAATTTC 37 80 

AAATTATGGA TACTGAGATG CTTGCTTGCG TTGAAGGTGG CGGATGCAAT TGGGGAGATT 3840 

TTGCCAAAGC AGGTGTTGCA GGAGGAGCAG CACGAGGTCT TCAGCTAGGA ATTAAAACAA 3 900 

GAACATGGCA AGGTCCACCA ACTGGTGCTG TGGGAGGAGC TATACTTGGA GGTGTGGCCT 3 960 

ATGCAGCGAC ATGTTGGTGG TAATTATGGA TTTTAAAAGT TTTATTATTG GTTTAGTAGT 4 020 

TGGTATATTT GGTCCTTATA TCGATGATTT AATTAGAAAA AAATTTTTAA AG7CTTCGGA 4080 

GAAGAAAACA GAAAAATCTG TTAAAAAATA ATCAAAACTA TAAATGATGA ATCTGAATCA 4140 

AAATTATTTT GCGCATGTAA AGAGGAGTCT TATAGTAACG AGTCAAAAAA GGAGTAACTA 4 200 

TGAATCGTAA TTTAGAACGG TGTTATCTAT TCTGACTAGG AATAGATCAT ACCAGAGGTA 4 260 

GCTTAGAAAT AGCAGAGACA TTAGAAATTG AAGTAATAAA TAGGATGTCG TAACTCTTAC 4320 

TATCAATGAT TTATTTGTTT CAAGCTTGCC TAGGGTGACA GTAAAAAATC AATTTCCTTT 4 3 80 

CAATAGCATA TTTTTAGTGG GCAGGACTCT TGTTCTGCCT ATTTTTTTAT CCAAAAAGTG 4440 

CAGTTGGGAG GGAGATAGGC TCATTTGGGA AGGAAGTCCA GTTTTTGTTT AGTGATTGGG 4 500 

GTAAGATAGT TGTTATCACA TGAGTTAATA CTCTTCGAAA ATCAAATTCA AACCACGTCA 4 560 

ACGTCGCCTT GCCGTATATA 7GTGACTGAC TTCGTCAGTC CTATCTACAA CCTCAAAACA 4 620 

GTGTTTTGAG CAGCCTACGG CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA 4680 

GGGAAAAGGA GATGAATATG AAATTTGGGA AACGTCATTA TCGTCCGCAG GTGGATCAGA 4740 

TGGACTGCGG TGTAGCTTCA TTAGCCA7GG TTTTTGGCTA CTATGGTAGT TATTATTTTT 4800 

TCGCTCACTT GCGAGAATTG GCTAAGACGA CCATGGATGG GACGACGGCT TTGGGCTTGG 4 860 

TCAAGGTGGC AGAGGAGATT GGTTTTGAGA CGCGAGCCAT TAAGCCACAT ATGACGCTTT 4920 

TTGACTTGCC GGATTTAACT TTTCCTTTTG TTGCCCATGT GCTTAAGGAA GGGAAATTGC 4980 

TCCACTACTA TGTGGTGACT. GGGCAGGATA AGGATAGCAT TCATATTGCC GATCCAGATC 5040 

CCGGGGTGAA GTTGACTAAA CTGCCACGTG AGCGTTTTGA GGAAGAATGG ACAGGAGTGA 5100 

CTCTTTTTAT GGCACCTAGT CCAGACTATA AGCCTCATAA GGAACAAAAA AATGCTCTCC 5160 

TCTCTTTTAT CCCTATATTA GTGAAGCAGC CTGCCTTGAT TGCCAATATC GTTTTGGCAA 5220 

CACTCTTGGT AACCGTGATT AACATTGTGG GTTCTTATTA TCTGCAGTCT ATCATTGATA 52 80 

CCTATGTGCC AGATCAGATG CGTTCGACAC TAGGGATTAT TTCTATTGGG CTAGTCATCG 5 34 0 
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TCTACATCTT CCACCAAATC TTGTCTTACG CTCAGGAGTA TCTCTTGCTT GTTTTGGGGC 5400 

AACGCTTGTC GATTGACGTG ATTTTCTCCT ATATCAAGCA TGTTTTTCAC CTCCCTATGT 5460 

CCTTCTTTGC GACACGCAGG ACAGGGGAGA TCCTGTCTCG TTTTACAGAT GCTAACAGTA 5520 

TCATCGATGC GCTGGCTTCG ACCATCCTTT CGATTTTCCT AGATGTGTCA ACGGTTGTCA 5580 

TTATTTCCCT TGTTCTATTT TCACAAAATA CCAATCTCTT TTTCATGACT TTATTGGCGC 5640 

TTCCTATCTA CACAGTGATT ATCTTTGCCT TTATGAAGCC GTTTGAAAAG ATGAATCGGG 5700 

ATACCATGGA AGCCAATGCG GTTCTGTCTT CTTCTATCAT TGAGGACATC AACGGTATTG 57 60 

AGACTATCAA GTCCTTGACC AGTGAAAGTC AGCGTTACCA AAAAATTGAC AAGGAATTTG 5820 

TGGATTATCT GAAGAAATCC TTTACCTATA GTCCAGCAGA GAGTCAGCAA AAGGCTCTGA 5880 

AAAAGGTTGC CCATCTCTTG CTTAATGTCG GCATTCTCTG GATGGGGGCT GTTCTGGTCA 5940 

TGGATGGCAA GATGAGTTTG GGGCAGTTGA TTACCTATAA TACCTTGCTG GTTTACTTTA 6000 

CTAATCCTTT GGAAAATATC ATCAATCTGC AAACCAAGCT TCAGACAGCG CAGGTTGCCA 6060 

ATAACCGTCT AAATGAAGTG TATCTAGTAG CTTCTGAGTT TGAGGAGAAG AAAACAGTTG 6120 

AGGATTTGAG CTTGATGAAG GGAGATATGA CCTTCAAGCA GGTTCATTAC AAGTATGGCT 6X80 

ATGGTCGAGA TGTCTTATCG GATATCAATT TAACCGTTCC CCAAGGGTCT AAGGTGGCTT 6240 

TTGTGGGGAT TTCAGGGTCA GGTAAGACGA CTTTGGCCAA GATGATGGTT AATTTTTACG 6300 

ACCCAAGTCA AGGGGAGATT AGTCTGGGTA GTGTCAATCT CAATCAGATT GATAAAAAAG 6360 

CCCTGCGCCA GTACATCAAC TATCTGTCTC AACACCCCTA TGTCTTTAAC GGAACGATTT 6420 

TGGAGAATCT TCTTTTGGGA GCCAAGGAGG GGACGACACA GGAAGATATC 7TACGGCCGG 64 80 

TCGAATTGGC AG AG ATT CG A GAGGATATCG AGCGCATGCC ACTGAATTAC CAGACAGAAT 6 540 

TGACTTCGGA TGGGGCAGGG ATTTCAGGTG GTCAACGTCA GAGAATCGCT TTGGCGCGTG 6600 

CTCTCTTGAC AGATGCGCCG GTCTTGATTT TGGATGAGGC GACTAGCAGT TTGGATATTT 6660 

TGACAGAGAA GCGGATTGTC GATAATCTCA TTCCTTTGGA CAAGACCTTG ATTTTCATTG 6720 

CTCACCGCTT GACTATTGCT GAGCGGACAG AGAAGGTAGT TGTCTTGGAT CAGGGCAAGA 6780 

TTGTCGAACA AGGAAAGCAT GCTGATTTGC TTGCACAGGG TGGCTTTTAC GCCCATTTGG 6840 

TCAATAGCTA GAAAGAGGAG AGGATGAAAC CAGAATTTTT AGAAAGTGCG GAGTTTTATA 6900 

ATCGTCGTTA CCATAATTTT TCCAGTAGTG TGATTGTACC CATGGCCCTT CTGCTTGTGT 6960 

TTTTACTTGG CTTTGCAACT GTTGCAGAGA AGGAGATGAG TTTGTCCACT AGAGCTACTG 7020 

TCGAACCTAG TCGTATCCTT GCAAATATCC AGTCAACTAG CAACAATCGT ATTCTTGTCA 7080 

ATCATTTGGA AGAAAATAAG CTGGTTAAGA AGGGGGATCT TTTGGTTCAA TACCAAGAAG 7140 
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GGGCAGACGG TCTCCAACCG GAGTCCTATG CCAGTCACTT GGACATGCTA AAGGATCAAA 7 200 

AAAAGCAATT GGAGTATCTG CAAAAGAGCC TGCAAGAAGG GGAGAACCAC TTTCCAGAGG 72 60 

AGGATAAGTT TGGCTACCAA GCCACCTTTC GCGACTACAT CAGTCAAGCA GGCAGTCTTA 7 3 20 

GGGCTAGTAC ATCGCAACAA AATGAGACCA TCGCGTCCCA GAATGCAGCA GCTACCCAAA 7380 

CCCAAGCCGA AATCGGCAAC CTCATCAGTC AAACAGAGGC TAAAATTCGC G ATT AC C AG A 74 40 

CAGCTAAGTC AGCTMTGAA ACAGGTCCTT" CCTTCK3CCGG TCAGAATCTA~~GCCTACTCTC 7500~ 

TTTACCAGTC CTACAAGTCT CAGGGCGAGG AAAATCCCCA AACTAAGGTT CAGGCAGTTG 7 5 60 

CACAGGTTGA AGCACAGATT TCTCAGTTAG AATCTAGTCT TGCTACTTAC CGTGTCCAGT 7 620 

ATGCAGGTTC AGGTACCCAG CAAGCCTATG CGTCAGGGTT AAGCACTCAA TTGGAATCCC 7 680 

TTAAATCCCA ACACTTGGCA AAGGTTGGTC AGGAATTGAC CCTTCTAGCC CAGAAAATTT 77 40 

TGGAGGCAGA GTCAGGTAAG AAGGTACAGG GAAATCTTTT AGACAAGGGG AAAGTTACGG 7 800 

CGAGTGAGGA TGGGGTGCTT CATCTTAATC CTGAGACCAG TGATTCTAGC ATGGTTGCAG 7 860 

AAGGTGCCCT ACTAGCCCAA CTTTATCCAT CTTTCCAAAG AGAAGGGAAA GCCAAACTCA 7920 

CAGCTTATCT AACTTCAAAA TATGTAGCAA GAATCAAGGT CGCTGATTCT CTTCGCTATA 7980 

CTACGACTCA TGATGCCGCG AATCAACTTT TCCTAGATTC TACTATTACA AGTATTGATG 804 0 

CCACAGCTAC TAAGACTGAG AAAGGGAATT TCTTTAAAAT CGAGGCGGAC ACTAATCTAA 8 LOO 

CTTCGGAGCA GGCTGAAAAA CTTAGGTACG GGGTGGAAGG CCGCTTGCAG ATGATTACGC 8160 

GCAAGAAAAG TTACCTACGT TATTATTTGG ATCAATTTTT GAACAAAGAG TAATGTTCGT 82 20 

GTTTTTAGAG TTAAATAATT TTTAAACTGT GAGAAAGATT CTTCTTGCAG TTTTTTCTTT 82 80 

ACAATTTTTG AAAAACATCT ACTATTTATT CGGTTAAATT CTTGTGTTTT TTGGTTTTTT 8 340 

GTGGTAAAAT GTGCTCAAGT AATACGAAAG CCGAACTTTA AAATGTCAAA ACAATTGATC 8400 

TATTCGGGAA AAGCTAAAGA TATCTATACA ACTGAGGATG AAAATCTTAT TATTTCAACT 8460 

TACAAGGACC AGGCGACTGC TTTCAACGGT GTCAAGAAGG AGCAGATTGC AGGTAAGGGA 8520 

GTCTTGAATA ATCAGATCTC ATCTTTTATT TTTGAGAAAT TAAATGTGGC TGGTGTGGCG 8580 

ACTCACTTTC TGGAGAAACT TTCAGACACG GAACAACTCA ATAAAAAGGT TAAGATTATT 8640 

CCTTTGGAAG TCGTGCTCCG CAACTATACT GCTGGTTCCT TTTCAAAACG TTTTGGTGTG 87 00 

GATGAGGGAA TCGCCTTGGA GACTCCGATT GTCGAATTTT ACTACAAAAA TGATGATTTG 87 60 

GATGATCCAT TTATCAATGA TGAGCATGTG AAATTCCTAC AGATTGCGGG TCACCACCAC 8820 

ATTGCCTACT TGAAGGAAGA AACGCGTCGT ATCAATGAAC 7ATTGAAAGT CTCGTTTGCT 8880 
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GAGATTGGCC TTAAATTGAT TGACTTTAAG CTAGAGTTCG GTTTTGACAA GGATGGCAAG 8940 

ATTATCTTGG CAGACGAATT TTCACCAGAT AACTGCCGCT TGTGGGACGC TGATGGCAAC 9000 

CACATGGATA AGGATGTTTT CCCTAGAGGA TTGGGAGAAC TAACCGACCT TTATGAGATT 9060 

GTTTGGGAAA AGTTGCAGGA ATTGAAATAA TCTGTTTGCA ACGGAAAACC TTCGTCTCTC 9120 

AACTAAAAGG ACTCAGGCTG AAAAGGTCCC CCAGACCTTT TCACTCTGTA GAGAACTAGG 9180 

TGAACTAACA GATGTTTACG AAATTGTCTG GGAAAAGTTG CAGGGTTTAA AATAACAACC 9240 

TCAACGCTC? TTGGGAATAT TGCAAGAGCT GAAATAAAGG AATAACAATT GATGGATAAA 9 300 

CGTATTTTTG TTGAAAAAAA GGCTGATTTT CAGGTCAAGT CAGAGAGTTT GGTTAGAGAG 93 60 

CTCCAGCACA ACTTGGGACT GTCAAGCTTG AAAAGTATTC GTATTGTGCA AGTATATGAT 94 20 

GTATTTGACT TGCCTGAGCA CTTGTTTGCA CCTGCAGAGA AGCACATTTT CTCTGAGCAG 9480 

GTAACCGACC ATCTTTTAGA TGAAGTATCT GTGCAGGCGG ATCTTGCTAA CTATGCTTTC 9540 

TTTGCCATTG AAAGTCTGCC AGGGCAGTTT GACCAGCGTC CAGCTTCGTC ACAGGAAGCC 9600 

TTGCTTTTGT TGGGAAGTTC GAGTGACGTG ACACTCAACA CAGCCCAACT TTACTTGGTG 9660 

AATAAAGATA TTGATGCGAC TGAGTTGGAA GCTGTCAAAA ACTACCTGCT CAATCCAGTT 9720 

GATTCTCGTT TCAAGGATAT CACGACAGGG ATTGCCAAGC AGGAGTTTTC AGAGTCAGAC 9780 

AAGACCATTC CCAAATTCAC TTTCTTTGAA AGCTATGCAG CAGAAGACTT TGCTCGCTAC 9840 

AAGGCCGAAC AAGGGATGGC CATGGAAGTG GATGATTTGC TCTTTATCCA AGACTACTTT 9900 

AAGTCAATCG GGCGCGTGCC AACTGAGACT GAACTCAAGC TTTTGGACAC TTACTGGTCT 9960 

GACCACTGCC GTCATACGAC TTTTGAGACA GAGTTGAAAC ACATCGACTT TTCAGCTTCT 10020 

AAATTTCAAA AGCAATTGCA GTCAACCTAT GACAAGTATA TTGCCATGCG CGAGGAATTA 10080 

GGTCGGTCTG AAAAACCACA AACCTTGATG GATATGGCGA CTATTTTCGG TCGTTATGAG 10140 

CGTGCTAATG GACGATTGGA TGATATGGAA GTCTCTGACG AAATCAATGC CTCCTCAGTT 10200 

GAAATTGAAG TGGACGTTGA TGGTCTCAAC GAACCTTGGC TCCTCATGTT TAAAAACGAA 10260 

ACCCACAACC ATCCAACAGA AATTGAGCCA TTTGGTGGAG CGGCTACCTG TATTGGTCGA J-0320 

GCTATTCCTG ATCCGTTGTC AGGCCGTTCC TATGTTTACC AAGCCATGCG TATTTCAGGT 10380 

GCTGGTGATA TTACAGCACC GATTTCGGAA ACTCGCGCTG GGAAATTGCC ACAACAAGTC 10440 

ATTTCTAAAA CAGCAGCTCA TGGTTATTCT TCATATGGTA ACCACATTCC GCTTGCAACA 10500 

ACCTACGTTC GTGAATACTT CCACCCAGGC TTTCTACCTA AACGTATGGA ACTTGCTGCC 10560 

GTTGTTCCTG CGACTCCCAA GGGCAATGTT GTCCGTGAAA AACCTGAAGC AGGTGATGTG 10620 

ATCATCCTTC TCGGAGGCAA AACAGGTCGT GATGGTGTCG GTGGTGCGAC GGGCTCTTCT 10680 
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AAGCTTCAAA CAGTTCAGTC TCTAGAGACT GCTCGTGCTG AGGTTCAAAA AGGAAATGCC 10740 

ATCGAAGAAC CCAAGATTCA GCGCCTCTTC CGTAATGGCA ATGTCACTCG TCTGATCAAG 10800 

AAGTCCAATG ACTTTGGGGC AGGCCGTGTC TCTC7GGCTA TCGGTGAATT GGCAGACGGT 10860 
CTTGAAATCG ACCTCAACAA GGTGCCTCTT AAATACCAGG GCTTGAATGG TACAGAAATT ' 10920 

GCCATCTCTG AATCACAAGA ACGGATGGCG GTCCTCCTTC GTCCTGAAGA TGTGGATGCC 10980 

TTCGTTGCCG AATGTAACAA AGAAAATATT GATGCTGTTC TGGTGGCGAC ACTAACTGAA 11040 

AAACCAAATC TTGTCATGCA CTGGAATGGT GAGACAATCG TTGACTTGGA CCGTCGTTTC 11100 

CTTGACACCA ATGGTGTGCG CGTGGTTGTC GATGCCAAAG TTGTGGACAA GGATGTCAAA 11160 

CTCCCAGAAG AGCGTCAAAC ATCTGCTGAA ACACTGGAAT CAGATACCCT TACGGTTCTA 11220 

TCTGACCTCA ACCATGCAAG TCAAAAAGGA TTACAGACTA TCTTTGACTG CTCTGTTGGA 11230 

CCCTCAACGG TTAATCACCC ACTTGGTGGT CGTTACCAAC TCACACCAAC TGAGCCATCT 11340 

GTGCAGAAAT TGCCAGTTCA ACACGGTGTG ACTCATACTG CGTCGGTCAT TGCTCAAGG7 11400 

TTCAACCCAT ATGTAGCTGA ATGGTCTCCA TACCACGGTG CTGCTTATGC GCTTATCGAA 114 60 

GCAACTGCTC GTTTCGTGGC TCCTGGTGCC AACTGGTTCA AGGCTCGTTT CTCTTACCAA 11520 

GAGTATTTCG AGCGTATGGA TAAACAACCA GAGCGTTTCG GTCAGCCAGT AGCTGCTCTT 11580 

CTAGGTTCTA TTGAAGCACA AATTCAGCTT GGCTTGCCAT CTATCGGTGG TAAGGACTCC 11640 

ATGTCTGGTA CCTTTGAAGA ATTGACCGTT CCGCCAACCT TGGCTGCCTT TGGGGTGACC 11700 

ACGGCAGATA GCCGTAACCT GCTCTCTCCA GAATTTAAAG CTGTTGGGGA AAATATCTAC 11760 

TACATCCCAG GTCAAGCCCT CTCTGCAGAG ATTGATTTTG ACTTGATTAA GAAAAATTTT 11820 

GCTCAGTTTG AAGCCATCCA AGCTGACCAT AAAGTGACAT CTGCATCAGC TGTCAAATAC 11880 

GGTGGTGTAG TTGAAAGTTT GGCTCTTGCT ACCTTTGGAA ACTATATTGG TGCAGAGGTG 11940 

ACCTTGCCTG AACTTGAAAC AGCTTTGACA GCTCAATTAG GCGGCTTTCT CTTCACATCT 12000 

CCTGAAGAAA TTGCTGGAGT AGAGAAGGTT GGACAAACGA AAGCAGACTT TACACTGACT 12 060 

GTCAACGGTG TGAAGCTAGA TGGACACAAG CTTGACAGTG CATTTCAAGG GACATTGGAA 12120 

GAAGTTTACC CAACAGAATT TACCCAAGCG AAAGAACTAG AAGAAGTACC AGCTGTGGCA 12180 

TCAGATGTTG TGATTAAAGC CAAAGAAAAG GTTGAAAAAC CTGTGGTTTA CATCCCAGTC 12240 

TTTCCAGGAA CCAACTCAGA ATATGATTCA GCTAAGCCCT TCGAAAAAGA ACGTCCACAG 12300 

GTCAATTTGG TGCCATTCG? GACCTTGAAT GAAGAAGCTA TTGTCAACTC AGTTGAAACT 123 60 

ATGGTTGACA ATATCGACAA GACTAATATT CTCTTCTTTG CTGCTCCATT CTCGGCTGCG 12420 
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GATGAACCAG ATGGTTCACC TAACTTTATC GTCAATATCC TGCTTAATGA AAAAGTGCGT 12 480 

GTGGCTATTG ATAGCTTTAT CGCCCGTGGT GGTTTGATTA TCGGTATTTG TAATGGATTC 1254 0 

CAAGCCTTAG TCAAATCGGG TCTCCTACCC TACGGAAACT TTGAAGCTGC TAACAGTACT 12600 

AGCCCAACCC TCTTCTACAA TGATGCCAAC CAACACGTGG CCAAGATGGT GGAAACTCGC 12 660 

ATTGCCAATA CCAACTCACC ATGGTTGGTT GGTGTGCAAG TGGGCGATAT CCACGCTATT 12720 

CCTGTTTCGC ACGGTGAAGG GAAGTTTGTC GTGACGGCTG AGGAATTTGC AGAGCTCCGT 12780 

GACAATGGAC AAATTTTCAG CCAATACGTT GACTTTAACG GTAAACCAAG TATGGATTCT 12840 

AAGTACAATC CGAATGGTTC TCTCCATGCC ATCGAAGGAA TTACCAGCAA GAATGGTCAA 12900 

ATCATCGGTA AGATGGGCCA CTCAGAACGT TATGAGGATG GTCTTTTCCA AAATATCCCA 12960 

GGCAATAAAG ACCAACACCT GTTCGCATCA GCGGTTAAAC ATTTCACTCG AAAATAAGAC 13020 

TTACAGATTT TCTAATAGAT AGTATCAGTA ATGTAAAAGT CATGTAAATC TAGCTCTTGA 13080 

TG ATT AC AAA TGAAAATTAG GTATAAAAAA TGACATACGA AGTAAAATCT CTTAATGAAG 13140 

AATGTGGTGT TTTCGGTATT TGGGGACATC CAGATGCTGC TAAGTTGACC TATTTTGGAC 13 200 

TCCACAGTCT TCAACACCGT GGTCAGGAGG GGGCAGGAAT CCTCTCCAAT GATCAAGGAC 13260 

AACTGAAGCG CCATCGTGAC ATGGGGCTTT TATCAGAAGT TTTCAGAAAT CCAGCTAATT 13 320 

TGGATAAATT GACAGCAGCT GGTGCGATTG GGCATGTGCG TTATGCGACT GCTGGCGAAG 13 380 

CTTCTGTAGA TAACATCCAG CCCTTCCTCT TCCGTTTTCA CGATATGCAG TTTCCTTTCG 13440 

CTCATAATGG AAATCTGACC AATGCAGCCT CTCTCAAGAA AGAACTGGAA CAAAGAGGAG 13500 

CAATTTTCAC CGCGACTTCG GACTCTGAAA TCTTGGCTCA CCTCATTCGT CGCAGTCATA 13560 

ATCCTAGCCT GATGGGCAAA ATCAAGGAAG CGCTCAGCCT TGTCAAAGGT GGTTTTGCCT 13620 

ATATCTTGCT GTTTGAGGAC AAGTTGATTG CGGCTCTTCA CCCAAATGGA TTCCGACCGC 13680 

TTTCGATTGG TAAAATGGCT AATGGAGCAG TTGTTGTATC TTCTGAAACC TGTGCTTTTG 1374 0 

AGGTCATTGG TGCCGAGTGG ATTCGTGATT TGAAGCCAGG TCACATTGTC ATCATTGATG 13800 

ACGAGGGCAT TCAGTATGAC AGCTATACAG ATCATACCCA GTTGGCGGTT TCTTCTATCG 13860 

AGTATATCTA CTTTGCTCGC CCTGATTCTA ATATCCACGG TGTCAATGTC CATACGGCAC 13920 

GTAAGAGAAT GGGAGCGCAA TTGGCGCGAG AATTTAAGCA TGAGGCAGAT ATTGTAGTTC 13 980 

GTGTGCCCAA TTCTTCCCTA AGCGCGGCTA TGGGATTTGC GGAAGAATCA GGCTTACCAA 14040 

ATGAAATGGG TCTGATCAAA AACCAATACA CCCAGCGAAC TTTTATCCAA CCGACTCAAG 14100 

AATTGCGGGA GCAAGGAGTG CGGATGAAAC TGTCTGCTGT TTCGGGTGTT GTCAAAGGCA 14160 

AACGTGTGGT CATGGTGGA? GATTCCATTG TACGTGGAAC AACCTCTCGT CGTATCGTTC 14 2 20 
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AGCTCTTGAA AGAAGCGGGT GCGACTGAGG TTCACGTTGC CATTGGAACT CCTGCACTAG 14280 

CGTATCCATG TTTCTACGGG ATTGATATCC AGACCCGTCA GGAGCTGATT GCAGCCAATC 14 340 

ATACGGTCGA AGAAACTCGC CAAATCATTG GTGCGGACAG TCTGACTTAT CTTTCAATTG 14400 

ATGGCTTGAT TGAGTCGATT GGTATCGAAA CAGATGCGCC GAACGGTGGT CTCTGTGTCG 14 4 60 

CTTACTTTGA CGGTGACTAC CCAACGCCTC TTTATGACTA CGAAGAAGAC TATCGTAGAA 14520 
GTTTGGAAGA AAAGACCAGT TTTTACAAGT AGGCGACAGA TTCTCCATTA AAGAAAAGGA " 14580 

AAAAATAAAT GACAAATAAA AATGCATATG CCTCACGTCT CACTACTGAC TAAAGGCTTA 14640 

AGCATTTAGT CAGTAGACCC TTTGTCCTAT AGGATCAAAG CTAGAGCCGT G ACT ACT ATT 14700 

TTTAGATAAA AAGATGCTTT ATCTAAAAAT ACGTCGCAGT CTTTCTCAAA AAAAGAAAAC 14760 

GAAAAATAAA ATGGCAAATA AAAATGCGTA CGCTCAATCT GGTGTGGATG TTGAAGCGGG 14820 

TTATGAAGTT GTTGAACGGA TTAAAAAGCA CGTGGCCCCT ACGGAGCGTG CAGGTGTCAT 14880 

GGGAGCTCTT GGTGGCTTTG GTGGTATGTT TGACCTTTCC AAGACTGGGG TTAAAGAACC 14 940 

CGTCTTGATT TCAGGGACTG ACGGTGTCGG AACCAAGCTC ATGTTGCCTA TCAAGTACGA 15000 

CAAGCACGAT ACCATCGGGC ACGACTGTGT GGCCATGTGT GTCAACGACA TCATTGCTGC 15060 

AGGTGCGGAA CCCCTCTATT TTCTCGACTA CGTAGCGACA GGGAAGAATG AACCAGCTAA 15120 

GCTAGAACAA GTGGTTGCTG GTGTGGCAGA AGGTTGTGTG CAGGCTGGTG CTGCCCTCAT 15180 

CGGTGGGGAA ACGGCTGAAA TGCCGGGCAT GTACGGCGAA GACGACTATG ACTTGGCTGG 15240 

TTTTGCCGTC GGTGTGGCTG AAAAATCTCA AATCATTGAC CCTTCAAACC TCGTAGAGGG 15 300 

AGATGTTCTT CTCGGACTTG CTTCAAGTGG GATTCACTCA AATGGTTACT CTTTGGTTCG 153 60 

TCGTGTCTTT GCGGATTACA CAGGTGAGGA AGTCCTACCA GAATTCCAAG GCAAGAAACT 15420 

TAAGGAAGTT CTACTTGAGC CGACTCGTAT CTATCTCAAG GCTGTCTTGC CGCTCATCAA 15480 

AG AAG AGTTG GTCAACGGCA TTGCCCACAT CACAGGTGGT GGCTTTATCG AAAATGTCCC 15540 

TCGTATGTTT GCAGATGACC TAGCTGCTGA AATTCATCAA AGTAAAGTTC CAGTCCTTCC 15600 

AATTTTCAAA ACCCTTGAAA AATACGGTCA GATTAAACAC GAAGAAATGT TTGAAATCTT 15660 

CAATATGGGT GTGGGACTTA TGTTGGCGGT CAGCCCTGAA AATGTAGACC GTGTAAAAGA 15720 

ATTGTTGGAT GAAGCAGTCT ATGAAATTGG TCGCATCGTC AAGAAAGAAA ACGAAAGTGT 15780 

C ATT AT C AAA TGAAAAAAAT ACCG GT TT T T GCCTCTGGTA ATGGCTCAAA TTTTCAGGTG 15840 

ATTGCCGAAG AATTTCCAGT GGAGTTTGTC TTTTCAGACC ATCGTGATGC CTATGTGCTT 15900 

GAGCGTGCAA AGCAGCTCGG CGTTCTGTCC TATGCTTTTG AACTCAAGGA GTTTGAGAGC 15960 
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AAGGCAGACT ACCAAGCACC CCTTGTCGAA CTCTTGGAAG AACACCAGAT TGACTTGGTT 16020 

TGCCTAGCAC GCTACATGAA AATCGTTGGA CCAACCTTAT TGTCGCCTTA TGAAGGTCGG 16080 

ATTGTCAACA TTCATCCAGC CTACTTCCCA GAATTTCCAG GAGCTCATGG GATTGAGGAT 16140 

CCTTGGAATG CTGGCGTGGG TCAGTCTGGT GTGACCATTC ACTGGGTGGA TTCGCGTGTG 16200 

GATACAGGCC AGGTCATCAA ACAGGTTCGT GTGCCACGAC TAGCTGATGA TACCATTGAC 16260 

AGATTTGAAG CTCGCATCCA TGAAGCAGAG TACAGGCTGT ATCCGGAAGT AGTGAAGGCT 16320 

CTATTTACAG ATTGACTTTT TGATGATTCA TATGATATCT TTGATTTTAA ATTGGAGTCA 16 380 

GTGTTTGTTG AAGACGGCTT CAAACGGAGG TATTTGTAAT GTTAGAATCT AAAAAAACAA 16440 

CTCGATATGT ATT7TATCTC TATCTGATGT TATTAACTTG GGGAATCTTA TTTAAGTTTG 16500 

AAACAAATCC TGAATTTATA GCATTTTTCT TAGCTCCAAG GTATATCAAT TGGATTCCAT 16560 

TTTCAGAACC ACTAATAGTC GATGGAAAAA TTGTTTTTGC TGAAATGTTA TTTAATCTGA 1662 0 

TTTTCTTTAT TCCATTAGGT GTTTGTTTCC CTTTGATAAA AACTAATTTA TCTAGTTTAA 16680 

GAATAGTCGG GACAGGTTTC TTGATTAGTT TATTGTTTGA GTGCTTACAG TATATTTTAG 16740 

CAATAGGTAT AACAGATATA ACGGATTTGA CTTTAAATAC GCTAGGTGTC TGTCTAGGCT 16800 

TACTGATTTA TCAAATTTTT ATAAGAGTGT TCAAATCACA GACTAGAAAA TGGATCAATA 16860 

TCTTAGGTAT GCTTAGCCTT GGTTTTGCTT ATCTTGTTTT ACTGTTACTG CATTTACTTA 16920 

GTGTTTAACT AATGATTAAA AAGGAGAATA TAATCACTAA ACGCGTCTTA ATCAGCGTCT 16980 

CAGACAAAGC GGGCATTGTT GAATTTGCCC AAGAACTCAA AAAACTTGGT TGGGAGATTA 17040 

TCTCAACAGG TGGAACTAAG GTTGCCCTTG ATAATGCTGG GGTGGATACC ATTGCTATCG 17100 

ATGATGTGAC TGGTTTCCCA GAAATGATGG ACGGTCGTGT GAAGACCCTC CACCCAAATA 17160 

TCCACGGAGG GCTTCTCCC7 CGTCGTGACT TGGATAGCCA CTTGGAAGCG GCTAAGGACA 17220 

ACAAGATTGA GCTCATTGAC CTTGTGGTGG TCAACCTTTA CCCATTTAAG GAAACTATCC 17 280 

TTAAACCAGA TGTGACTTAT GCTGATGCAG TTGAAAATAT CGATATTGGT GGGCCATCTA 17 340 

TGCTTCGTTC AGCAGCGAAA AATCATGCCA GTGTTACAGT TGTGGTAGAT CCTGCTGACT l7400 

ACGCTGTGGT TTTGGATGAA TTGGCAGCAA ACGGCGAAAC CTCTTATCAA ACTCGCCAAC 17460 

GTTTAGCAGC CAAAGTATTT CGTCACACAG CGGCTTATGA CGCCTTGATT GCAGAATACT 17520 

TCACAGCTCA AGTGGGTGAA AGCAAGCCTG AAAAACTCAC TTTGACTTAT GACCTCAAGC 17580 

AACCAATGCC TTACGGTGAG AATCCTCAAC AAGACGCGGA CTTTTACCAG AAAGCTTTGC 17 640 

CTACAGACTA CTCCATTGCT TCAGCCAAAC AGCTCAACGG GAAAGAATTG TCATTTAATA 17700 

ATATCCGTGA TGCAGATGCT GCTATCCGTA TCATCCGTGA CTTCAAAGAT AGTCCAACCG 177 60 
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TTGTGGCTCT 
CTTGGGACTA 
ACCGTGAGGT 
TTGCACCAAG 
GTATCCTTGC 
GTGTAGTCGG 
GGCAAGTGGT 
GGAACGCTAT 
TTGGTGTTGG 
CCAAAGATCG 
ACGTGGAAGA 
GTGACCAAGA 
TGAGACATTT 
CTTAAAATAC 
ATTCGCAAAA 
CCGATTGCTA 
AATGATGGGA 
AAATTGATTG 
GCCCTTGCTG 
ACTAGGGCTG 
TACGGCGTTC 
ATCGAAAAGC 
GTCGTCGTTG 
AATAAATTTG 
TTTTCACTCT 
CACAAACGTG 
CCAGTCCCAC 
GTTCTAGAAG 
ATCCTGACAG 



CAAACACATG 
CCCTTATGAG 
GGATGCTGCG 
CTATACGGAT 
CTTGCCATTT 
TGGACTTCTC 
GACTAAACGT 
CAAGTACGTC 
TCCAGGTCAA 
TCTGGACGGG 
AATCGCCAAA 
ATCCATCGAA 
TAGACATTAA 
TAACTGAAAC 
GAGGTTGAGG 
AAAAGTTACT 
TGACTCTGGA 
ACTTCGCAAA 
CTGGTATCGT 
CAGCGGAGCT 
CGACAGCAAC 
ATGGTGCGCC 
CTGAGACGGT 
GTGACTCAGG 
TTGCCTTTGT 
CCTATGATGG 
ACTTACCACA 
GGGTGATTAA 
CTGATGGACC 



AATCCATGTG 
TCTGACCCAG 
ACAGCTGAGA 
GAAGCGCTAG 
AATGCTCAAG 
GTGCAAAATC 
CAGCCAACTG 
AAATCAAATG 
ACCAACCGTG 
CCGGTCCTTC 
GCAGGAATTA 
GCAGCGGATA 
GAAGATAAAA 
AAGATTAAAA 
AATGAAACTG 
TGAATCAAAA 
TGGTTTGGAA 
GACCAATGAT 
GGATGATTTT 
GGAGTGGTCC 
ATATGGCACA 
TATCGTAGTC 
TGAGCAAGCG 
TGCGCGCGTG 
CAATGGTGAT 
CGACAAAGGG 
GAGTGTAGTT 
AGAAGGTCGC 
GAAAGTCATT 
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GAATTGGTCA 
TGTCTATCTT 
AGATGCACGG 
CCATTTTGAT 
AGGCTAGCGA 
AAGACGTGGT 
AGACAGAAGC 
GTATTATCGT 
TGGCTTCTGT 
CTTCAGATGC 
AGGCCATCAT 
AATACGGCTT 
GGGAAGAAAA 
CGAACTTTTT 
CTTGTTGTCG 
GACGTGGAAA 
TTGGTAAATA 
GTTGCTTGGA 
AACCAAGCTG 
AAGGATTTCG 
TTTTCAGATT 
AAGGCCCATG 
GTCGAAGCCG 
GTTATTGAGG 
AAGTTCTACA 
CCTAACACGG 
GATACAGCGG 
CCTTATCTGG 
GAGTTCAACG 



AGCTGATCAC 
TGGTGGGATT 
CGTTTTCCTC 
CAATAAAAAG 
AGTGGAAGCA 
CAAGGAAAGC 
GACTCCTCTT 
GACCAACGAC 
TCGCCTTGCC 
CTTCTTCCCA 
CCAGCCCGGT 
GACTATGGTC 
CAGTTTCTTT 
TGATATAATG 
GTTCTGGTGG 
AAGTCTTTGT 
TCTCTATTTC 
CCTTTATCGG 
GACTTAAGGC 
CCAAGGAAAT 
TCGAGGAAGC 
GCTTGGCACT 
CTCATGAGAT 
AATTCCTTGA 
TCATGCCAAC 
GTGGTATGGG 
TTGACACCAT 
GAGTTCTTTA 
CTCCGTTCCG 



ATCGAGACTG 
GTCGTCCTCA 
GAAATCATCA 
AAAAACTTGC 
GAATACACAG 
CCAGCTGACT 
CAGTTCGCTT 
CACATGACAC 
ATTGACCAAG 
TTTGCGGATA 
GGCTCTGTCC 
TTTACAGCTG 
CCTTTTTTCG 
TTGGTAAATA 
TCGTGAGCAT 
AGCTCCTGGG 
CGAACATTAT 
TCCAGATGAT 
CTTTGGTCCC 
CATGGTCAAA 
CAAAGCCTAT 
TGGGAAGGGT 
GCTTTTGGAC 
AGGAGAGGAA 
GGCTCAGGAC 
TGCCTATGCG 
TGTCAAGCCA 
CGCAGGGCTT 
AGATCCAGAA 
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ACTCAGATTA 


TCTTGCCTCG 


CTTGACCTCT 


GACTTTGCTC 


AAAATATCAC 


AGATATCCTG 


19560 


GATAGCAAGG 


AGCCAAATAT 


CATGTGGACG 


GACAAGGGTG 


TGACTCTGGG 


TGTGGTTGTC 


19620 


GCATCCAAGC 


GCTACCCGCT 


AGACTATGAA 


AGGGGCCTTG 


AGTTGCCAGC 


CAAGACAGAA 


19680 


GGCGATGTCA 


TCACCTACTA 


TGCAGGGGCT 


AAGTTTGCGG 


AAAATAGCAG 


AGCACTGCTC 


19740 


TCAAACGGCG 


GACGAGTTTA 


TATGCTCGTT 


ACCACAGCAG 


ATJtCCGTCAA 


AGAAGCCCAA 


19800 


GCCAGCATAT 


ACCAAGAACT 


ATACCAACAA 


AAAATAGAAG 


GACTCTTCTA 


CCGAACAGAT 


19860 


ATCGGAAGCA 


AGGCAATTAA 


GTAAAGATAT 


AAGAATAACG 


CGCCGTAGTC 


GCCAAACACG 


19920 


ATAATGGTCG 


TCGTGGTGAA 


AAGACCAGAA 


CAGTGAATGT 


TCTGGTCAGG 


GGGAAACTTG 


19980 


CACACCTTAG 


GCTCAAAGTT 


TAGGAATGAA 


ACCGAAGGTT 


TGCTTCCGCC 


TCCATCACCT 


20040 


AAGACCATTA 


TCAAAAAGAA 


AAATAAAAAT 


TCACAAAATA 


CCTTAATGAT 


CGTATGGTTT 


20100 


GCGAGCGTTA 


GCGAGCTAAT 


ATAGAACAAT 


CACCGCCGTT 


GTGAAAGAAC 


GATTGGATGA 


20160 


TAATCCAATC 


CTTCAGGGAA 


ATTGGAAGAC 


CTTGGCTTTC 


CAATTTACGC 


ATGACACACC 


20220 


TTTGGTGGCT 


GCTGCCGTCC 


CTCACAAGCT 


AAGGTGATTG 


TTGAAAAAGA 


GGAAAAAGGA 


20280 


GAAGAAATGA 


AACCAGTAAT 


TTCCATCATC 


ATGCCCTCAA 


AATCCGACTG 


GGCAACCATG 


20340 


CAAAAAACAG 


CAGAACTCCT 


AGACCGCTTC 


GGTGTAGCCT 


ACGAAAAGAA 


AGTTGTTTCC 


20400 


GCACACCGTA 


CACCAGACCT 


CATGTTCAAA 


CATGCAGAAG 


AAGCCCGTAG 


TCGTGGCATC 


20460 


AAGATCATCA 


TCGCAGGTGC 


TGGTGGCGCA 


GCGCATTTGC 


CAGGCATGGT 


AGCTGCCAAA 


20520 


ACAACCCTTC 


CAGTCATTGG 


TGTGCCAGTC 


AAGTCTCGTG 


CTCTTAGTGG 


AGTGGATTCA 


20580 


CTCTATTCTA 


TCCTTCAGAT 


GCCGGGTGGG 


GTGCCTGTTG 


CCACCATGGC 


TATCGGTGAA 


20640 


GCTGGAGCGA 


CTAACGCAGC 


TCTCTTTGCC 


CTCCGTCTCC 


TCTCTGTAGA 


ACATAACTCC 


20700 


ATTGCGGATG 


CACTTGCCAA 


CTTTGCTGAA 


GAACAAGGAA 


AAATCGCAGA 


GCAGTCGTCA 


20760 


AATGAGCTCA 


TCTAAAACAA 


TCGGAATTAT 


CGGTGGCGGT 


CAACTGCCTC 


AGATGATCGC 


20820 


CATTTCTGCT 


ATCTACATGG 


GCCACAAGGT 


TATCGCGCTG 


GATCCTGCGG 


CGGATTGCCC 


20880 


GGTCTCTCGT 


GTGGCGGAAA 


TCATTGTGGC 


ACCTTATAAC 


GATGTAGACG 


CCCTCCGTCA 


20940 


GTTGGCAGAC 


CCTTGCGATG 


TCCTCACTTA 


TGAGTTTGAA 


AATGTCGACG 


CTGACGGTTT 


21000 


GGATGCCGTT 


ATCAAGGATG 


GACAACTCCC 


TCAAGGAACA 


GATCTGCTCC 


GCATTTCGCA 


21060 


AAATCGTATT 


TTTGAAAAGG 


ACTTTTTGTC 


AAACAAGGCT 


CAAGTCACTG 


TGGCACCCTA 


21120 


CAAGGTCGTG 


ACTTCTAGCC 


TAGACTTGGC 


AGATATCGAC 


TTGTCGAAAA 


ACTATGTCCT 


21180 


CAAGACTGCG 


ACTGGTGGCT 


ACGATGGTCA 


TGCACAAAAG 


GTTATTCGTT 


CAGAAGCAGA 


21240 


CTTGGAAGCA 


GCCTATGCGC 


TAGCAGACTC 


AGCAGACTGC 


GTCTTGGAAG 


AATTTGTCAA 


21300 
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CTTTGACCTT GAGATTTCTG TCATCGTGTC AGGAAATCGC AAGGAGGTCA CGTTTTTCCC 213 60 

ACTTCAGGAA AATATCCACC CCAACAATAT CCTGTCTAAG ACCATCGTAC CAGCCCGCAT 21420 

TTCTGAAAGT CTAGTAGACA AGCCTAAACC TATGGCAGTG CCAATCGCAG AACAACTCAA 214 80 

CTTGTCTGGA ACTCTCTGTG TGGAAATGTT TGCGACAGCT GATGACATCA TTGTCAATGA 21540 

AATCGCCCCA CGACCACATA ACTCTGGGCA CTATTCTATT GAAGCCTGTG ATTTCTCTCA 21600 

"GTTTGACACC CATATTCTGG GTGTTCTCGG AGCACCATTA~CCACTCATCA AACTCCATGC 21660 

GCCAGCCGTT ATGCTTAATG TCCTCGGTCA GCATGTCGAG GCTGCTGAAA AATATGTCAC 217 20 

AGAAAATCCA AGCGCCCACC TCCACATGTA TGGTAAAATA GAAGCAAAGC ATAATCGTAA 21780 

GATGGGACAT GTCACTTTGT TTAGTGATCT GCCGGATAGT CTGGAAGAGT TTGGGGAAGG 21840 

GATTGATTTT TAGGACAAGT CTATGATACA AATTATCGTT AATACATTTA TTGAAAAGTA 21900 

TAAGACTGGA GCAGTTGTTG AAGTGTTGTA TGCCAGTGCT GACCAAGATA AGGTACAAGC 21960 

TAAATATGAA CAACTACCTG CACAATACCC CGAAAATTAT TTAGCTATCT ATAATGTACC 22020 

CCTCGATACG GATTTGAATA CACTAGATCA TTACCCCTC7 GTGTTTATTG GAAAAGAGGA 22 080 

GTTTGAGTAG AAATCTTGGT TTACCTAGAT ACCTTATTCC CAACAGCTTA AGAAGAAAGG 22140 

AAAAATTAAC ACATGATCAA CCGTTACTCT CGCCCTGAGA TGGCGAATAT TTGGAGTGAA 22200 

GAAAATAAAT ACCGTGCTTG GCTTGAGGTG GAAATCCTCT CTGACGAGGC ATGGGCTGAG 222 60 

TTGGGGGAAA TCCCTAAGGA AGATGTGGCT TTGATTCGCA AGAAGGCGGA CTTTGACATC 22 320 

GACCGTATTT TGGAAATTGA CCAGGAGACG CCCCACGATG TGGTGGCTTT CACGCGTGCG 223 80 

CTTTCTGAGA CTCTTGGTGA AGAGCGCAAG TGGGTTCACT ATGGGTTAAC TTCTACTGAC 22440 

GTGGTGGA7A CTGCTTATGG TTACCTCTAC AAGCAGGCCA ACGACATCAT CCCTCGTGAC 22 500 

CTTGAAAACT TCACTAATAT CATCGCTGAC AAGGCCAAGG AGCACAAGTT CACCATCATC 22560 

ATGGGGCGTA CTCATGGTGT GCACGCTGAG CCCACAACCT TTGGTCTTAA ATTAGCAACT 22 620 

TGGTACACCG AAATGAAACG CAATATCGAG CGCTTCGAGC ATCCGGCTGC TGGTGTAGAA 22 680 

GCTGGTAAGA TTTCTGGTGC GGTTGGGAAC TTTGCCAATA TCCCACCATT TGTAGAGGAG 22740 

TATGTCTGCG ATAAACTTGG CATCCGTGCC CAAGAAATCT CTACACAAGT CCTTCCTCCT 2 2 800 

GACCTTCACG CTGAGTACTT TGCGGTTCTT GCCAGCATTG CGACTTCAAT CGAACGTATG 22860 

GCGACTGAGA TTCGTCCTCT ACAAAAATCT GAGCAACGCG AAGTAGAAGA GTTCTTTGCT 22 920 

AAAGGGCAAA AAGGGTCTTC AGCAATGCCT CACAAACCCA ACCCAATCGG TTCTGAAAAT 22 980 

ATGACTGGTC TGGCGCGTGT CATTCGTGGT CACATGATTA CGGCTTATGA AAACGTCGCT 2 3040 
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CTCTGGCATG AACGCGATAT TTCTCACTCA TCACCTGAGC GTATCATCAC AC C AG AT AC G 2 3100 

ACCATTTTCA TTGACTACAT GCTCAACCGT TTTGGAAATA TCGTCAAGAA CTTGACACTC 23160 

TTCCCAGAAA ATATGATCCG AAACATGAAC TCCACTTTTG GTCTTATCTT TAGCCAACGG 2 3220 

GCTATGTTGA CATTGATTGA AAAAGGCATG ACCCGTGAGC AAGCCTATGA CTTGGTGCAA 23 280 

CAAAAACAGC CTACTCTTGG GACAACCAAG TAGACTTTAA ACCACTTCTT GAGGCAGATT 23 340 

CAGAAGTAAC ATCACGTCTC ACACAAGAAG AAATCGATGA AATCTTCAAC CCAGTTTATT 2 3400 

ACACCAAACG AGTGGATGAT ATCTTTCAAC GTCTTGGACT AGGTGATTAA TTAAAAAATA 2 3460 

AACAGCGAGC TTCAATCTCG CTGTTTATTT TTTATCGAAA AGACTTACTC TTCTTTTCTT 23520 

TTAGTGAGTC CATAGGCTGC TAGTGTCGAC ATGAGTCCTG CGACTACTAG TCCTCCAGAA 2 3 530 

TCGTGAGTTC CTGTTTCAGG AAGTTTTTTC TCTGTTACCA CAGGACCTGG ATCTTGAGGA 2 3640 

AGAACTTTGC TTTCCTCAGC AGGAGCAGTT GATGGAGCTG GTTGGCTTGG GATTTCTAGT 2 3700 

TTTGGTTTTT CTTCAGCAAT AGCGGCTTGT CCGTTTTCAT CGCCTACATG TGTTACCATA 2 3760 

GTTCCGACTT CGACTATTTG AGTAACGGCT TCCTGTGCTA CGACACTATT TACAAGTGTT 2 3820 

TTCACT7CCT TACCATCGGC AGAAGTGCTC ACAGAGTAGA AGTTGCTACG ATGTCCATTG 2 3880 

ACGCCCTTAG TAATGACTTG TGTTTTTCCT TTGAGTAAGA GTGGATTTTC ACAAGTCACT 2 3940 

GTGGTAAATG GAATTTCTTC TTCTTGGATA TCCAGTCTAG GTTTTACCTC AGTAGTTGGT 24000 

GCAAGACCAC TTTCATCACC CTTGTGAGTT ACAGGAGCGC CAACTTCAAC CACTTGGTTT 24060 

ATAACTTCTT TGGTTACCTG GCTATCAAGG ACTGTTTCTG TTGTTTTTCC ATTTTCAGTG 24120 

AGTACAGAGA TGTAATGAGT TCGTTCACCT TTGACTCCTG CTGTGATAAT ATTTTCCTCA 24180 

CCGGCTGGGA GGTTAGGATT TTCTTTCTTG ATAACTTCAA ATGGAATTTC T7CAGTTCTT 24 2 40 

GTGATGAGTT CTGGTCTGGT TTCAACATTG GCAGCCACTT CATTTTCATC TACCCTTCCT 2 4 300 

GAATGAGTTA CAGCTGGTTT GAGGCCTTGA AGAGCGGCTT TTAGGTTGGC TACAAGCGTG 24 360 

TCAAGCTCAG CTTGTTTATT ACGGTTCAGG TTGTAATTTA GAGCTGTTTT AGCTGCGTCA 24 4 20 

AGGGCCTCAA GACTTTCTTT ACTATATCCT TCTAAGTTTG TAGGAATTTT AGCTAATTCT .:44 80 

TCCCGGAGAG CATTATAATT AGCACGAAAG TACTCTT7CT TGTGGTCTGC AAAGGCAGTC 2 4540 

ATGAGTTCAA AGATTTCCTC TTCCTTCTAT TCAGCGCTTG GTCTATCTGC CCAGATTGAA 24600 

AGCATACTTC CGACTGTTGG AAGATCTACT TCAGGATATT TGGTAGAAGC TAGTTGATTG 24 660 

AATGGTGTTT TTCCAGTATT CTCAATAGCT TTCTTGAGGA AACCACCACC ATCTTCTCGT 2 4 720 

TTTTGACCAA GAATGTAC7A CCAGTCACCG TTGGTATTCA AGAATTTATA GCCTTTGCTT 2 4780 

GCTAGGTATT GAGGTGATGC GAGGTTATAT CCCCACCAGC CTTTAGACCA GTAAGAAATC 2484 0 
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AAGACATCTT TGTCAAACTG AACATCGTCC TTGTCTTCAT AGTAGAAGCC ATCGTTGAAG 24900 

GCCATTGGTT GAAGCCCTCT TTCTTTGGCC ATAGCTGCGA GGGTGTTGGC ATATTCGGCA 24960 

AATTTGCCAT AGAGTTGATA CCACTTGAGG TAGTACCAGC CTTCGGCACT AGTCGCATCG 25020 

TTGCCGTATT CGTCAGTACC AAAGTTGAAA ATCTTTGTTT TACCTGCAAA GAAGTCCATG 25080 

TATTTACCGA TGAGGGCTTT TACAAAGTTC ATCGCTTCTT CGTTTTTCAA GTCCATAGTT 25140 

GTTTTTGAAA CTTTATCAAA GTGGGCTTGA CGATTTTTAA TACCTAATTT TTCCATGGCA 25200 

ACCAGCATAG CATCCATGTG ACCTGGACTG TTAATAGCTG GGATGAGACC CATGTCCTTA 25260 

GATTTAGCGT ATTCAATTAG CTCTGTTACT TCTGCCTCTG TTAGTGCAGT ACCGTTTGGA 25 320 

TCGTCGTAGT AACCTTTAGT TCCTTCGATA ATAGCTTTTT TAACGTCATC ACTAGCATAG 2 5380 

GTTTTTCCGT TGGCAGTAAT GGTCATATCA TCGAGTACAA AGCGAAGTCC GTCATTTCCT 25440 

AGAAGGAGAT GGACATCAGA ATATCCGAGC TCACTGCCCT TGTCTACGAT CCGTTTGAGC 25500 

TGGTTCAGAG TAAAGTATTT GCGTCCAGCA TCGATTGAGA TTACCTTGTT TTTGGCAAGT 25560 

TTTTCAACCT CACGTTTAGC TTCTTCTTCT TTTTGAGCTT CAGGCGTGAG GGTCAAGTTG 25620 

TTGACACTTT CTTGAAGTTT AGCAATGGCT TGATCAATCC TATCTTCTTG GGCACGGCTA 25680 

AGGTTGCTAT CGAGAGAGCG AATAGCTTTT TCAGCTTCTT TTACGGCCGT GACGCTTTCT 25740 

GCAGTATAAC GGTTCAGGTC TTTTGGTACC TCGTTAAGTG CTTGCTCTGC AGATTCATAA 25800 

TCAGCTGCGA AGTATTCAGC GTTGGCATTT GCAAAATGAC GCATGACTTT GAAGAGGCGT 25860 

GATGGTGAAT AACGTGCAGA TGGAGTGTCA GCCCAAGCAG CTACCATACC ACCGATGATT 25920 

GGGATATCAG CTCCTTCTGT TTTTGGTACA GAAGTGATTG GTGTGTTTTT AATACCATTG 25980 

AGCCCCTGAT CCACATTGTA CCAGCCTTGG CCATCAGCGT TTCGTCCAAG AACGTAGTAC 26040 

CAAGCATCAT TGGTATTAAG GATTTGGTGA CCTTTTTCAG CTAGTAGTT7 AGAAGAAGCG 26100 

ACATCGTAGC CTCCCCAACC ACCAGTCCAC ATAGAAACGA TGATGTCTTT GTCAAAACTA 26160 

CCAAAGCTTG TGTCGCTATT GTAGTAGATA CCGTCGTTAA AAGCCATTCG TTTGAGACCG 26220 

TGCGATTTTA CAATACGAGC GAGGTCATTG GCGTAGGCAA TAAATTTTTC ATAGCCTTTT 262 80 

ACAGGGTAGC CTTCGTTTGG ATAGTATTTA TCAGCTTGAA GCACACTCCA ACCTTTAGCA 26340 

TCTGTCGCAT CATTGGCATA TTCATCAAGT CCGATGTTGA AGATTTCAGT CTTTTTCGCC 26400 

AAATAAGCAG CATACTTGTC GATAAGGGCT TTTGTAAAAG CGACAGCTTG TTCGTTGTCA 264 60 

AGATCGACAG TACGGGCTGA TTTCTTCCCA AAATAGCTAA AGTTAGGGTT TTGGATTCCC 26520 

AATTCTTTCA TGGCATTGAG AATCGCATCC ATGTGTCCAG GACTATTTAC TGTCGGAATG 2 6580 
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ACACCGATAC 


CTTTATCTTT 


GGCATAGTTA ATCAGATCTG 


TCATTTGACT 


TTCTGTTAAG 


26640 


TGATTGCCCT 


TTGGATCGTT 


GTAATAATCA TTTGTACCTT 


TTTCAATGGC 


GCGTTTCACA 


26700 


TCGTCACTCG 


CATAGGTCTT 


GCCGTTAGCT GTGATGCTCA 


TATCGTCCAA 


CATGAAACGG 


26760 


AGTCCATCAT 


TTCCGACTAA 


TAGGTGTAAA TCAGTGTAGC 


CATAATGTTT 


CGCTTTATCG 


26S20 


ATGATTTCCT 


TGAGCTGTTC 


TGGTGAGAAA TATTTACGTC 


CAGCATCAAT 


AGAAACAATT 


26880 


TTCTTTTTCG 


CTAGTTTTTC 


ATTTACAGTT GCAGCACGTT 


CCTTTCCTGC 


CTCTGTTGCC 


26940 


GGTTTGTCAG 


CCTCTGCTTT 


CGCTTCATCT TTTT7AGCTG 


GTTTATCCTT 


GTCAGTCTTG 


27000 


TCTGTATTTG 


ACTCTTTAGA 


ATCAACCTCT TTCGCTTCTT 


CCTTTTTAGG 


GCTAGCTTCT 


27060 


TCTGCCTTTT 


TATTAGCAGT 


TTCTTTTTCA GCAGAAGTTC 


GAGTTACCAC 


TTCTGCTTTA 


27120 


TCACTAGGAG 


TTGAACTAAC 


TTCCTCTTGT GGTTTTTCTT 


CTGTTTTTGG 


AAGACTAGCT 


27180 


ACCTTATCAG 


TAGCTGGAGT 


TTCTGTTTCT ACAGTTTTTG 


GAGCTTCTGG 


TTGAAGCACT 


27240 


GCTTTAGGTG 


TTTCCTCAGT 


CCGATTTTCG GATGATTGAG 


GGGAATCACA 


AACCGTATGG 


27300 


ATGGTCCCTT 


GGTTTTCTGT 


AGTAGTAGGA GTAACTCCAT 


CGGCTGCAAC 


AGTCTGTGCT 


27360 


TGGAAGGCAA 


ATCCAATTAG 


AACAGAAGCT GCTCCTACAG 


CGTATTTACG 


AATAGAAAAA 


27420 


CGCTGTTGTT 


TTTCATGTTT CATTGCAAAA CCTCCTGATT GCATTGTTAT ATTGATAGCG 


27480 


ATTATATAAA 


TCAACGCCTT 


TATTTTATTT CTTATATTAA 


TTTCTTATAT 


TAACGAGAGT 


27540 


CAAGAGGAGA 


TGACAAAAAA 


CTATAATAAG TATAAAAAAA 


TATAAAATTT 


AAACTTAAGA 


27600 


TTTCAGATTG 


GTCGGAAAAA 


ATACGTATAT ATATCTAGTA 


TAATTTTTGG 


TTCTATTTCT 


27660 


ATAAAATATT 


CCACAAATTA 


TAGAATTTTC CAAAAATAGG 


TAAGCGCTAC 


CTTTTTGGTG 


27720 


TAGTATAATA 


AGCATAGAAA 


AAGCCCAAGC GATTAGCTCA 


GGTTTTCTTC 


TTAGTGATCA 


27780 


CGGTCACATG 


AGATAAATTT 


AATCTTGTAG TAATCAGATC 


GTTTGTAACT 


TTCACTGTAT 


27840 


TCTAAAACTT 


GGCCAGTTGA 


TTCGAGTTTG GTGATTTTAG 


TTTGTAGGAC 


AGTAGGGAAT 


27900 


TGTTCATCGA 


CTCCGAGGAC 


TGAAGCTGCA TGTTCTGCAG 


TTGGAAAGAC 


TATTTCGTTG 


27960 


ATTTCTTCAA 


AGTGTTCATC 


ATTCATGTGA ATCTGCTAGT 


CTAACTTGAA 


ACGATTATAG 


28020 


ATAGAACTAT 


AGTATTCAAG 


GTTTGGATAA TTTCCCTTGA 


TATATTGTTC 


TGGGATGTAG 


28080 


GATGTATGGT 


AGATATAAAC 


GACACCGTTT GATTCGCGGA 


TACGTTCAAT 


CTTGTAGTAG 


28140 


AATTGATCGC 


CGCGTAGACC 


CAATTTTTCC AAGTAAACAA 


GCTTGTTTCC 


GCGTTCAATT 


282C0 


GAAAGAACAG 


TTACCTTATC 


ATCTTTAGCA TTGAAGAGTT 


CAATATCTGA 


AAACTCTACA 


28260 


AGCTTGTGTT 


TGCGTGCACG 


TGAAACGAAG GTTCCTTTTC 


CTTGTTGGCG 


GACAATATAG 


28320 


CCATCTTTGG 


CAAGGTCGTT 


TAAGGCGCGA ACAACTGTGA 


, TACACCTCAC 


ATCGTACATT 


283B0 
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GAAATGAGTT CTGCTTCAGT GTAAAATTTA TCTCCACTCC TAAACTGCCC AG AG AT G ATT 28440 

TTATTTTTTA ATTCGTCTTT TATGTATTGA TGG 2 8473 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6749 base pairs 

(B) TYPE: nucleic acid 

(C-)— STRANDEDNESS :- double-- - - - - - 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CCTCATGCCT GGTATGCGAG GATACAGTTC TGAAAATCCC CGTTACTTAA TTAATGGACG 60 

CGAAGTCACA CCTCAGGAAT TTGCTCACTA TCGTGCGACT GGTCAATTAC CAGGAAATGC 120 

AGAAACTGAT GTGCAAATGC CACAACAGGC ATCAGCTATG AAACAAGGCG GTGTCCTTGC 180 

AAAACTAGGT CGAAACTTAA CAGCAGAAGC GCGTGAGGGC AAGTTGGATC CTGTTATCGG 240 

ACGAAACAAG GAAATTCAAG AAACATCTGA AATCCTCTCA CGCCCCACCA AGAACAATCC 300 

TGTTTTCGTC GGAGATGCAG GTGTTGGTAA GACAGCAGTT GTCGAAGGTC TAGCCCAAGC 3 60 

CATTGTCAAC GGAGATGTTC CTGCTGCTAT CAAGAACAAG GAAATTATTT CTATTGATAT 420 

CTCAGGTCTT GAGGCTGGTA CTCAATACCG TGGTAGCTTT GAAGAAAATG TCCAAAACTT 480 

AGTCAATGAA GTGAAAGAAG CAGGGAATAT TATCCTCTTC TTTGATGAAA TTCACCAAAT 540 

TCTTGGTGCT GGTAGCACTG GTGGAGACAG TGGTTCTAAA GCACTTCCGG ATATTCTCAA 600 

GCCAGCTCTC TCTCGTGGAG AATTGACAGT GATTGGGGCA ACAACTCAAG ACGAATACCG 660 

TAACACCATC TTGAAGAATG CTGCTCTTGC TCGTCGTTTC AACGAAGTGA AGGTCAATGC 7 20 

TCCTTCGGCA GAGAATACTT TTAAAATTCT TCAAGGAATT CGTGACCTCT ATCAACAACA 7 80 

CCACAATGTC ATCTTGCCAG ACGAAGTCTT GAAAGCAGCG GTGGATTATT CTGTTCAATA 840 

CATTCCTCAA CGTAGCTTGC CAGATAAGGC TATTCACCTT GTCGATGTAA CGGCTGCTCA 900 

CTTGCCGGCT CAACATCCAG TAACAGATGT GCATGCTGTT GAACGAGAAA TCGAAACGGA 960 

AAAAGACAAG CAAGAAAAAG CAGTTGAAGC AGAAGATTTT GAAGCAGCTC TAAACTATAA 1020 

AACACGCATT GCAGAATTGG AAAGGAAAAT CGAAAACCAC ACAGAAGATA TGAAAGTGAC 1080 

TGCAAGTGTC AACGATGTGG CTGAATCTGT GGAACGAATG ACAGGTATCC CAGTATCGCA 1140 

AATGGAAGCT TCAGATATCG AACGTTTGAA AGATATGGCT CATCGCTTGC AAGACAAGGT 1200 

GATTGGTCAA GATAAGGCCG TAGAAGTTGT AGCTCGTGCT ATCCGTCCTA ACCGTGCTGG 12 60 
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TTTTGATGAA 


GGAAATCGCC 


CAATCGGCAA 


CTTCCTCTTT 


GTAGGGTCTA 


CTGGGGTTGG 


1320 


TAAGACGGAG 


CTTGCTAAGC 


AATTGGCACT 


CGATATGTTT 


GGAACCCAGG 


ATGCGATTAT 


1380 


CCCTTTACAT 


ATGTCTGAAT 


ACAGTGACCG 


CACACCTCTT 


TCTAAGCTAA 


TTGGTACAAC 


1440 


AGCAGGCTAT 


GTGGGTTATG 


ATGACAATAG 


CAATACCTTA 


ACACAACGTG 


TTCGTCGCAA 


1500 


TCCATACTCT 


ATCATTCTCT 


TGGATGAAAT 


TGAAAAGGCT 


GACCCTCAAG 


TTATTACCCT 


1560 


TCTCCTCCAA 


GTTCTAGATG 


ATGGTCGTTT 


GACAGATGGT 


CAAGGAAATA 


CAGTAAACTT 


1620 


CAAGAACACT 


GTCATTATTG 


CGACCTCAAA 


TGCTGGATTT 


GGCTATGAAG 


CCAACTTGAC 


1680 


AGAAGATGCG 


G ATAAACC AG 


AATTGATGGA 


CCGTTTGAAA 


CCCTTCTTCC 


GTCCAGAATT 


1740 


CCTCAACCGC 


TTTAATGCAG 


TCATCGAGTT 


CTCACACTTG 


ACTAAGGAAG 


ACCTTTCTAA 


1800 


GATTGTAGAT 


TTGATGTTGG 


CTGAAGTTAA 


CCAAACCTTG 


GCTAAGAAAG 


ACATTGACTT 


1360 


GGTAGTCAGT 


CAAGCGGCTA 


AAGATTATAT 


CACAGAAGAA 


GGTTACCACG 


AAGTCATGGG 


1920 


GGTTCGTCCT 


CTCCGTCGCG 


TGGTTGAACA 


AGAAATTCGT 


GATAAGGTGA 


CAGACTTCCA 


1980 


CTTGGATCAT 


TTAGATGCTA 


AACATCTGGA 


AGCAGATATG 


GAAGATGGCG 


TTTTGGTTAT 


2040 


TCGTGAGAAA 


GTCTAAGACA 


GAATTTTGAG 


GATAAAAAAG 


AAGGAGCCAG 


CTGAAAAAAA 


2100 


CTGGTTCCTT 


TTTAGGTACG 


ACAGGCATGT 


CGTATAGTAG 


AAGTGTATTA 


TTCTAGTTTC 


2160 


AATATACTAT 


AGTAGCTCAG 


AAGTCGGTAC 


TTAAACGTGC 


TATATCAAAA 


CCAGTCCTGG 


2220 


AAAAACGTGG 


ACTGGTTTCG 


TGTTTGGATT 


ATTACCTTGA 


ACGACATGCG 


TTAAAAGTTA 


2280 


GTTGAACCGC 


CGTATGCCGA 


ATGGTACGTA 


CGGTGGTGTG 


AGAGGGGCTA 


GAGATTATCC 


2340 


CCTACTCGAT 


TTTAAATCAC 


ATGACGTTCA 


AAGGCATCAT 


CTGAAATCCC 


TTGTTCCAAG 


2400 


ATGAGTTTTG 


CCCATTCTTT 


AGCAGAGAAG 


AGGCTGTGGT 


CCTTGTAGTT 


TCCGCAAGAT 


2460 


TCGATGGTTG 


TCCCTGGGAC 


ATCTTCCCAA 


GTAGTAGTTT 


CAGCGATTTC 


CTTGAGCGAA 


2520 


TCCTTGATAA 


CAGCTGCGAT 


TTTAGCACTG 


GTGTGACGTC 


CCCACATAAT 


CATGTGGAAC 


2580 


CCTGTGCGGC 


AACCAAATGG 


TGAACAGTCA 


ATCATGCCGT 


CAATGCGGGT 


ACGGATGAG7 


2640 


TTGGCTAAGA 


GGTGCTCGAT 


AGTGTGAAGG 


CCGGCAGTAG 


GGATAGAGTC 


TTCGTTTGGT 


7700 


TGCACCAAGC 


GAATATCATA 


ATTGGAGATG 


ATGTCTCCTT 


TTGGTCCTGT 


TTCTTCCCCA 


2760 


ATCAAGCGAA 


CATAGGGTGC 


TTTGACAATG 


GTGTGGTCAA 


GTTCAAAACT 


TTCGACAATA 


2820 


ACTTCTTTTG 


ACATGGTAAA 


TCCTTTCAGT 


TTTCTTCTCT 


CATTATATCA 


TAAAGGTTGC 


2880 


TCCTGAGACA 


GAGAGAAAAC 


CTCTCCGACG 


CTGGAGAGGT 


TGAAATCTTT 


ACTTACGATA 


2940 


TAAGCGGTCG 


TATTCCTACT 


ATGGGTCAAA 


GGTTACGTTG 


ATACCCAGTT 


TACGAAGGAC 


3000 


ATTCTTGTCT 


TCATCAGTCA 


AGATGATGGT 


TGAGTGGGCT 


TCGCTTCCTT 


TGAGGTTGCC 


3060 
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GAGTTCTTCC ATAGCGCGGG CAGCATCAGG ATTTTCTGTA GCTCTGATAG CAAGTGCAAT 3120 

CAGGATTTCA TTTGAATGAA GGCGTGGATT GCGGCTACCG AGATGATCGA TTTTAAGACC 3180 

TTGGATTGGC TTAACAACTT CAGGCTCGAT TAGTTTTACT TCTTTAGCGA TGTCAGCTGA 3240 

TTTTTTGATG GCGTTGATCA AGGCAGCGGC TGTAGGACCA AAGAGTTCTG AGTTCTTACC 3 300 

AGTGATGATT TCCCCATTTG GCAATTCAAA GGCTAGGGCT GGTCCACCAG TTTCTTCTGC 33 60 

TTTTTGCCGC GCAACGACAG CAACCTTACG GTCTGCAGGT GTGATACCGA GGTCGTTCAT 3 420 

GAGCAACTCA ATTTTCTTGA CGGCAGCTTC GCCAACTTTT TCAGCTTTGA AGTCAAGAAC 3480 

TGTTTGATAG TAACGGCGGA TGATTTCTTG TTTAGAAGCT TCGACAGCGG CCTCGTCATC 3 S40 

TGTAATAGCG AAACCAACCA TGTTGACACC CATATCTGTC GGTGAAGCGT ATGGTGATTT 3600 

TCCGAGAATA CGTTCCAACA TGCGTTTGAG CACTGGGAAG ATTTCGATAT CACGGTTGTA 3660 

GTTGACAGTG GTTTCTCCAT AGGTTTGAAG ATGGAAGGGG TCAATCATGT TGACATCATC 3 720 

AAGGTCAGCT GTGGCAGCTT CATAAGCCAA GTTAACTGGA TGATGAAGGG GAAGATTCCA 3 780 

AACAGGGAAG GTTTCAAATT TAGCGTAGCC ACATTTCATG CCATTGATTT GGTCGTGGTA 3840 

CATATTGGAC ATACACGTTG CCAATTTTCC AGAACCAGGT CCAGGAGCGG TTACGACAAT 3900 

CAAGTTGCGA CTGGTTTTGA TCTAGTCGTT TTTGCCCATG CCTTCTGGGG AAATGATGTG 3960 

ATCCATATCC GTCGGATATC CTTTGATTGG ATAATGAAGA TAAGAATCAA TTCCGTTTTT 4020 

CTCAAGTTGA TTCCGGAAGC CATCTGCAGC GGGTTGGCCA GCGTATTCTG TAATGACAAC 4080 

GGAACCAACA AAAATCCCTA ATTCATTGAA TTTATCAATC AAACGAACAA CTTCTTGGTC 4140 

ATAAGAAATG CCTAAGTCGC CACGTGCTTT GGAATGTTCA ATGTTGCTAG CATTAATGCC 4200 

AATCACAACC TCAACCTGCT CTTTCAATTC TTGCAAGAGC TTGATTTTCT TGTCAGGTTC 4 2 60 

ATAACCAGGA AGGACACGAG CACCCTCGAA ATCTTCTAAC ATTTTACCGC CAAACTCTAA 4 320 

CTAGAGCTTG CCGTCAAATT GGTTAATGCG CTCCAAAATA TGGTCGCCTT GTAAATTCAA 4 380 

ATATTGTTCA GAACTAAAAG CTTGTTTTTT CATTTTTTTA CCTCTGGACT CTATTATAAT 4440 

AAAAAATTGG AAGTTAGGAA ACTACGGAGC TAAAAAAGAA ATTAAAAACA TTAAGCAAAC 4500 

GCTTGCACAA AATTTTAAAA AGTGCTATCA TAGACTATAG ATTATGAAAA TAATGAGGTA 4560 

AACAGATGCA AGAAAAATGG TGGCACAATG CCGTAGTCTA TCAAGTCTAT CCAAAGAGTT 4 620 

TTATGGATAG TAATGGAGAT GGAGTTGGTG ATTTGCCAGG TATTACCAGT AAGTTGGACT 4680 

ATCTAGCTAA GCTAGGAATC ACAGCAATTT GGCTTTCTCC CGTTTATGAC AGCCCTATGG 4740 

ATGATAATGG CTATGATATT GCTGATTATC AAGCGATTGC GGCTATTTTT GGAACCATGG .4800 
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AGGACATGGA 


TCAGCTGATT 


GCAGAAGCTA 


AGAACCGTGA 


CATTCGTATC 


ATCATGGACT 


4860 


TGGTGGTCAA 


TCATACCTCA 


GATGAACATG 


CTTGGTTTGT 


CGAAGCCTGT 


GAAAATACTG 


4920 


ACAGCCCTGA 


GCGAGACTAC 


TATATCTGGC 


GCGATGAACC 


CAATGACCTA 


GATTCTATCT 


4980 


TTAGTGGGTC 


TGCTTGGGAA 


TACGATGAAA 


AGTCAGGTCA 


ATACTATCTC 


CACTTTTTCA 


5040 


GCAAGAAACA 


GCCGGATCTC 


AACTGGGAAA 


ATGAAAAACT 


TCGCCAGAAA 


ATTTATGAGA 


5100 


TGATGAACTT 


CTGGATTGAT 


AAAGGTATTG 


GTGGTTTCCG 


TATGGATGTT 


ATTGACATGA 


5160 


TTGGCAAAAT 


TCCTGACCAG 


AAGCTAGTCA 


ATAATGGTCC 


TATGCTCCAT 


CCCTATCTCA 


5220 


AGGAAATGAA 


TCAGGCGACC 


TTTGGAGATA 


AGGATCTCTT 


GACAGTAGGG 


GAGACTTGGG 


5280 


GAGCAACTCC 


AGAGATTGCC 


AAGTTCTACT 


CTGATCCAAA 


GGGGCAAGAA 


TTGTCTATGG 


534C 


TCTTCCAGTT 


TGAACATATC 


GGTCTTCAGT 


ATCAGGAACG 


TCAGCCTAAA 


TGGCACTATC 


5400 


AAAAAGAGCT 


GAATATCGCT 


AAGTTAAAAG 


AAATCTTCAA 


CAAATGGCAG 


ACAGAGTTAG 


5460 


GAGTTGAGGA 


CGGCTGGAAT 


TCCCTCTTCT 


GGAACAACCA 


TGACCTCCCT 


CGTATTGTCT 


5520 


CAATCTGGGG 


AAATGACCAA 


GAATACCGCG 


AAAAATCTGC 


CAAAGCCTTT 


GCAATCTTAC 


5580 


TTCATCTCAT 


GAGAGGAACT 


CCTTATATCT 


ACCAAGGTGA 


GGAGATTGGG 


ATGACCAACT 


5640 


ATCCGTTTGA 


AACACTGGAT 


CAAGTAGAAG 


ATATTGAATC 


TCTCAACTAT 


GCGCGTGAGG 


5700 


CTCTTGAAAA 


AGGTCTTCCG 


ATTGAAGAAA 


TCATCCACAG 


TATCCGTCTT 


ATTGGACGTG 


5760 


ACAATGCCCG 


TACCCCTATG 


CAATGGGACG 


AGAGCAAAAA 


CGCTGGTTTC 


TCAACAGGTC 


5820 


AACCTTGGTT 


GGCGGTTAAT 


CCAAATTACG 


AGATGATCAA 


TGTCCAAGAA 


GCGCTGGCAA 


5880 


ATCCAGATTC 


TATTTTCTAT 


ACCTATCAGA 


AACTCGTCCA 


AATTCGCAAG 


GAGAATAGCT 


5940 


GGCTAGTTCG 


AGCTGACTTT 


GAATTGCTTG 


ATACGGCTGA 


TAAGGTCTTT 


GCTTATATAC 


6000 


GTAAGGATGG 


CGACCG7CCC 


TTCCTAGTTG 


TGGCTAACTT 


GTCCAATGAA 


GAGCAAGACT 


6060 


TGACAGTACA 


AGGAAAAGTC 


AAATCTGTCT 


TGATTGAAAA 


CACTGCGCCT 


AAAGAAGTAC 


6120 


TTGAAAAACA 


GGTCTTGGCT 


CCATGGGATG 


CTTTCTGTGT 


GGAATTACTA 


TAAATATTTT 


6180 


TTGCAGAAAA 


ATTTAAAATT 


GAAATCGTAT 


AAAAACAAGG 


GAGGACTGTA 


TAAAAGACAG 


6240 


AAATCCTTTG 


TTTTTTATAA 


CCAAAGTTTA 


TAAACTTTCA 


TTCTTCAAAT 


TCAATTAACT 


6300 


TTACAAATTC 


CCACTATTAA 


GCAGAAAGAA 


GATGAACATA 


AAGAAGCGTG 


TCCTTAGTGC 


6360 


AGGCCTGACT 


TTTGCATCTG 


CTTTGCTTTT 


ACCCAAATCA 


TTCATACCTC 


TCTCAACTAG 


6420 


ATGTAACTTA 


CAAAACCCCT 


GACCTCATGA 


GCCACTTTCT 


TCCTCCTCAT 


GAGGTCAGTT 


6430 


TTACTTTCTG 


CTGTTCCAGT 


ATCGTTTTTC 


CTCGCTAGAT 


TTCCTCAAAA 


GGGCAGACTC 


6540 


CTCCCTTGGT 


GCGTCACACG 


ATTTTTTCAT 


CTCGACTGTT 


CTTTAATGCA 


TCATTAACGA 


6600 
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CGCTTTTCTT CTAGGTGGTT CATAAGGAAC AGGAAGATTC AGGTTGACTT TTCTAATCCT 6660 

AGAATAAAGT CCTGAAAACA ATTCGGAATA GGCATAGAGA CTAGACAATT TGAGGAGCTG 6720 

CTTGCGTCCT GTTCCAACAC ATTTTCCGG 674 9 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 
(A)— LENGTH : — 1842 base pairs - - — 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 85: 

TCTACCCATG GACTTTGAGG CATTCATTGT TCCATCTTCT AGTGGCGAAT CTTTTCATAC 60 

AAACGATTCA ATTCACTTGG ATAGTGAAAC TCTCCCGCAA ACATTTTTCT GGTTAACTCA 120 

ATCCAGCTGA TATTTCTTTC AGCCAAAATA ATGGACAAGT TCTCCCAAAA TCGTTCAGCC 180 

ATATTCCTTC TCCTTTAGTT AGATAAATAA TGTGTTTGCG CCATGTAAAT CAATTGTTTC 240 

GTATCTCTTG GCAATAGAGC TCTAGCCTCT TCCAAATTCA GACTTGGATA AACTCGCTTA 300 

TTTGAAACCG CAAGAGGAAG TCTGATGGTT AGTTCACGAT TTTTTAAAAT TATCTCAACG 360 

AAATCCGTTA ATCTTAGATT GTCACGGTTC TTAAATCGTA ATAAATTGGG AGATAAAAAC 420 

TCAAAACAAT CTGAAGAATA GCT CAT CATC TCAATTAATT TGTCCTTTGT CATTTCAGAA 480 

ACTGAATGAC AACATACCTC TATGCCATAG TTTTGGAAGA AATCTAAAAG AAGTTGATTT 540 

CTTTGTCTAT TTTTACTTAG ATAGAGATCA ATCATCGGAG ACCTCCCAAA GATTCGGTTC 600 

CATTTGATAT TCTGACACGA TTAAGGAATC TAATAAATTA AGGAATCTAA TAAATTTGCG 66C 

AAGTTAATCG GTTTCTTGTC TTCATCATAA GCTTTTACAG TTACTTGGGT TGTAAGTATT 720 

CCCTCTTTTC CCTCCCCTCG ATAGCCTTGT CCATATAAAA C AAAAAC G AG ATTTTGATGA 780 

TCATCTACAA AGGCATCAAC CCCATTCTTT ATGTCTTGAC TTTCAAGGAA TTCCATAACG 840 

TTTTGAAGAT AGGATTCGTA AAATAGTGGG TAGTTATCTT TTTTATGGTA ATCATCTAAA 900 

AATGTCACTT CAAACTCACA TGGAGAGTAA TTTTGACTTT GAACAGCCTA AAAGTGCCAT 960 

CAAATTTGAA TTGGAATAAA TCAAATAAAT ACCCCCATCC TCATCAATCC AACCTTTGCT 1020 

CAAAGACAAC TCCAACCGAT CTTTTAAAAC TGAGTAAACC ACCTTAACCT CCAGTTTCAT 1080 

ATTCTTATAC CGTTCACTCT CAAATAAAAG TTTGGGGAGC TTATAATAAC GCTCTGATGT 1140 

CTGATATTGA TTAGCGGTAA TACGCTTCAT TATTGTCCCT CCAAGACTAA AATTCCAACA 1200 
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TTTCCAAATT CATCAAATCG CATTAAACCT ACTTGTTCCA TTTCATCAAC TAACTGAGTT 1260 

GCTTTTACCC AAATCATTCA TACCTCTCTC AACTAGATGT AACTTACAAA ACCCCTGACC 1320 

TCATGAGCCA CTTTCTTCCT CCTCATGACG TCAGTTTTAC TTTCTGCTGT TCCAGTATCG 1380 

TTTTTCCTCG CTAGATTTCC TCAAAAGGGC AGACTCCTCC CTTGGTGCGT CACACGATTT 1440 

TTTCATCTCG ACTGTTCTTT AATGCATCAT TAACGACGCT TTTCTTCTAG GTGGTTCATA 1500 

AGGAACAGGA AGATTCACCT TGACTTTTCT AATCCTAGAA TAAAGTGCTG AAAACAATTC 1560 

GGAATAGGCA TAGAGACTAG ACAATTTGAG GAGCTGCTTG CCTCCTGTTC GAACACATTT 1620 

TCCCACCACG TGAAGAAAAA GATGGCGGAA GCGTTTGATT GTTAAAGTTT GGAAGTCACC 1680 

TCCAGCTAGA TGTTTGAGAA AAAGATAGAG ATTGTAGGCG ATACAGCTCA TCATCATACG 17 40 

AACTTCGTTT TTGATTAAGG TTGAACTATC CGTTTTATCG CCAAAAAATC CCTCCTTCAT 1800 

CTCCTTGATG AAATTCTCGG CTTGACCACG TCCACGATAA AG 1842 

(2) INFORMATION FOR SEQ ID NO; 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19390 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

{xi> SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
TCATCTTTAT CTCCTCGAAA TTTTCTAATA TAGCCATTAT AACAGAATTT TGTGAAAATT 60 
CCTATTATAG TAAATCACTA TTTCAGTATA AAAAGAAAAA ACGAATCAGA CGATTCGCTC 120 
TTCTTAAAAT CTGAAAATAG CTTTCCAGAA AGGATTACCC GATTTTTTCC AGATTGAGCA 180 
CTGCATCGTG ACTCATCAAG ACTTGACCAT ACTCTTGTAA GACTGAGCGA CTGATATCAC 240 
TATCGTCTGC AAACTCGCGC ATACGGGCCA ACAGCCAAGC TGGATATGGG CTTGGATGAT 300 
TTTCAATATC CACTAAAATG GTCAAATAAT AGCGCTCGTT CATTTTGTAG AGTTCAGAAG 360 
TTTCCATTTC AAAAGTCACT GTCTTGGCAA AAGCTACCAA GTCAGCCAAC TTAGCAAAAG 4 20 

AAAGGATGTA GTAGATGTAA GGTTCTTTCT TACTCTCAGC TTCTTGTTCA GCCTGCTCTT 480 
GCTCTTCTTC CTTGACTTCA ACTTGCTCAA GAGATTGAAT GGCTTCGATA TCATCCTTGG 540 
TTTTGTCTGC GATGCTTTTT TCCAGGCTTT TGATAAATTC ATCTGGAGAC ATTTGAGCCA 
ATTCTTCCAT ATCTGGCAAA TCCGATAAGT CTTCAAAATC TACATTTTCG TCAATCTTTG 
ACTTGGTCAC AAAGACATCT ACCTTATCAG GTTTTGCAGT CACACGGAAG CTCAACATGC 
CTGTATCCAG AAAGCTATCA GGCATCTCTA GCTCATCCAA GATAGCATAA AAGAACTCTT 



600 
660 
720 
780 
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CTCTTTTTTC TTGAGGAACG AGAAAGTCAG CAATCTCCAT TCCACGATCC ATCAAATCCT 84 0 

CTAAAGATAT CGTGATTTTT AAACTTCTA? CACTAATTTG TTTCATTTTC ATTGCTAGTA 900 

ACCTCATACT' TTCAGTTCTA TCTATTATAC TAGATTTTTA CGATTTTATC AAAAGAAGGC 960 

TCCTCTATAC GGATAGATTT TCCCTAGGGT CTTTCTATAG GACACTCCAA AAGAAAATTT 102 0 

CTGCAGACAG ATAGAAAAAG CCTTCAAAAT CGCCTAAGAG CCGACTTTGA AGACCTTATA 1080 

CATCAGAATA CTTATAATTT AAAGGTTGCT ACACCCAGGA TAGAACGATT TAAGTTTCTG 1140 

AGAATTTGAA GACTTTGCTC AAATTTCTTA TAACGAGTCA CTCCGTACTC TTCAACAAGA 1200 

AGGACTGTAT CTCTTTCCAA AAGAGATGAT ACATCCTGTA AATCTACAAA ATGCATTCCT 1260 

TTTAAACCTT CTTCACTCTG TTTCAATTTA TCTAACATAG CTTTATTTGA GCTAACGATC 1320 

GTCAATTCCT GTCCAGTATT TTTGTATCAC AAAACATCTG CTAGGTTAGC AATTCTTGTA 13 80 

ATCTCTGTTA CAAAATCAA7 TTGATACTGA GAAAAATCAC CTACTCTATT GATTGTTGGA 1440 

TTAAAGAGAT AAACTAACAC ATTTCCCATC ACAACCAAAA TCACACAAAC CACTCCAATA 1500 

ACAACTAAAC CAAGAATCAC AT7TTTCACA TTTAAGCCAA CCGCTGTTTC ACCATTTGCG 1560 

TTCAATTCTT TAGACTTGA? GGTTTCCAGT TTTTCAATTT TCACATTTGC ATAGGCATGT 1620 

TTAAATTTCT CAATCAACCC ATCAATTTTT TTCTCTAACA AGTTATTGGC ATCTTTACTT 1680 

GATGTCAAAA TTTTCACACC AACCCCTGCA TCGTCAATCA TATAGTAGAC GGTCAATTTT 1740 

TTCCACCAAT AGTCATTCGT TGAATTTTTC AAGGTTGTTT CTGTCGTGTC TAATTCACTG 1S00 

GCAATTTTTT TCAACTCACT GGGTTCTACA TCATTGAAAA GATAAGCTCC ATTCAAATTA i860 

CCATCAATCA ATTTCCCATA AAAATCACTA TAACCACCAA TTTGATGATT CAAAATCGTT 1920 

TTGTCCGACT CTTTTGGAGG AGTGATTTTA TAGATAAGAT AAGTTGAATA ACTTGTTGTA 198C 

TCTTTGACAG TGTTTTTATT CCTAACTGCT TTAATTGTAA ATGGTACAGC AATGAGAGCA 204 0 

AATAAAGCGA TGAGAGCTAA AATATTTGCT TTTCGCTTTT TATAAAGATT TGCAAACAAA 2100 

TCAGCTACTG AATAATGTTC AAACATGATT TTTTTCTCCT TTGTTTAGTA GATACTAGTT 2160 

TTCCTTTGTA AGCATTTTTG CTACAAATAT AATCACAAGA ACAATTCCCC AGAATTGCAT 22 20 

TGTAAA7AAA TTGAAGAAAC TTTCTGAAAA GCTGCTTCTT GGCATAAAGA ATAGATTATT 2280 

CAAGATGAGT AGGGATAAAG CAAATAGGAT TGTCCTTGAG CGATAGGCTA CTTGCAGCAT 2340 

GGCTATAAAT AATACGCCGA GTAAGAAACT AAGCAGAAAG ACTCCAATCA TACCATAGTC 24 00 

GGTATACAAC TCCATGATAT AACTACTTCC GATACCATGC CCTTTCAAGT ATTCCTTGTT 2 4 60 

CAAGACAAGA TAGGATAGAT TGTGGGCATA ACT ATT ACTA TCAATAGCTA CTTCCACACT 252 0 
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ATTGCTTCTA 
ATAATCAAGA 
AAATAGAAAA 
TAAGATATCC 
TCCCATCGCT 
AATCCATTTT 
ACTTAAAATA 
GGAGACAAGC 
TGCATAGACC 
TGCATAGTAG 
GAAATAGAAA 
ATAAACCTCT 
GTAACGAGCC 
CGCAAAACGA 
GGTCGGTCTT 
ATACTTGATA 
AAGT AAGCT A 
ATAACTGACT 
GTTCTAATGT 
AATATTTATG 
AGAAATAGCA 
TTTATACTGA 
GAAAAAAGCG 
GTAGATATCA 
TCTATCGAAA 
TAAATCAAAA 
TGCAATCTGT 
GTATTCCTTT 
TGGGCTTTCA 
TTGAACTAAT 



TGTTCAAAGG 
ACAGGACCAA 
CCTCGAGCCA 
CAGAAACCTG 
AACATGAGAA 
CCTTTTTCAG 
AAGGGATTTC 
ACTGCTGTGG 
GTAAAGCTAG 
GCATAGTAGG 
GGATAACTTA 
TTTAGAGAAT 
AGAATGCCTC 
TAGGCTATTG 
GATACCAGAA 
TCATTCCAAC 
ACATTATTAT 
ATGGTCAAAC 
AATTTTTTAG 
AATACGCCCA 
AAGGCATGGC 
TCATCTAGCA 
ACCTCTTTCA 
ATATAGGCTA 
TAGTAATAAT 
TGGTAATAGG 
CTCTTGATTA 
AGAAATGTTT 
TCATTAGCAT 
TGCTCATATA 



CTTTTCCTCC 
AAGTAAAATT 
GAACACCAAA 
TATGGGAAAC 
TAGGAGAACC 
TTTGCTCCCG 
GTGTCCCAAT 
CCTGCAATTT 
ACAAAATGTA 
AAGTCTGCAA 
GAAGAAAAAC 
TTCCTATATT 
CTGTGGTCAA 
GATGATAGGT 
ATACAAAAAT 
AAGCAATTAA 
TATTAAACAG 
TAAATAATAA 
ATTTTTCAAT 
AAGCACTAAT 
CTTTTAAAAA 
ACATCTTATC 
ACTCAGG AT A 
AATCCTTCTC 
AGGGTTTAGT 
CATCTTCGTA 
GCTTATTGCA 
GAGAATCACA 
AGACATTCAT 
AGCTCTGAAT 
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GAAAATGGCT 

ACGGAAATCT 

ACTAGTCCCT 

TTGGACATTA 

TACAAAAATC 

CATAAAGTAA 

TGCCAAATGA 

CTTTGGCTTG 

CGTAAAATAA 

ACGATACAAG 

TCCTAGTGAT 

TGCTACTTTT 

GCCCAGAATC 

ATCCAAAGCA 

CGTTAAATAG 

CCTACTAACC 

ATACACAATT 

TCGTTTCCCA 

ATTTTTCAGT 

TCTTCTCCCC 

ACGATGAATC 

CAGAATAAAG 

GTTTTTCACA 

TGCATAGGGT 

ATTAACCACA 

AATCAACCCC 

AATCGTCCCA 

GACAAAATAG 

GACACCACAG 

CATTTCTGGA 



CCCAAACTCC 
CGGTAAGGGA 
TGTTTATACA 
TCCCGTACAT 
GCTAACTTTT 
TAAACAAAAG 
ATAGTATTAG 
GTTGCCAGAT 
GGCAGT7TAC 
AGCCGTTCAA 
ACAAAGCGTA 
ATTTTCTTCC 
GAAATCATGA 
CCATCCCTAA 
AAAATAAAAT 
AACAAGAACA 
CCACTTACTA 
TCAATCACTT 
AATAAGAATC 
TTACGGAAAA 
TGAGAATAGG 
AAGTGGGCAT 
ACTTCATTAT 
TTGGTCGTAA 
TACTTCTTGG 
TTACGAAAGG 
GGTATTTTTT 
TCATCCTGAT 
CTCGAAACAT 
TGGATATAAT 



CCCTTGCAAA 
GGCTACrGTT 
TAAAGTCAAC 
AATTGAGTAC 
CTTTAAACCC 
CAAATmAAAT 
CTGCAATAAA 
ACATACACAT 
TTTCAAAATT 
ATAACCGAAT 
ACCGCTTGAT 
TAGCTATGAA 
CAACTATAAA 
AATAATCAAT 
GGATTAAGTA 
ATAAAGTAGA 
GCGTCAAGGC 
CGTCACCCCC 
GATATAAGGA 
TTGGATTCCT 
CTTCAAACTG 
AGGCCAATCT 
AAAACTTTTG 
TACTATCCCC 
CCAACTTGAT 
ATAGGGCAGT 
CACCTATGAG 
TGGCTGACTG 
CCGCATCTTC 
CATCTGAGTC 



2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
306C 
3120 
3180 
3240 
3300 
3360 
3420 
34S0 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
414C 
4200 
4260 
4320 
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AATAAAAATC AGATAATCCC CCTCAGCCTG CTTCATCCCA TCATTTCCTG CTTGCGACAA 4380 
TCCTTCGTTC TTTTTATGAA GCACTGACAC CCTGTCATCT TGTTCAGCGA TTGAATCACA 4440 
CAAGCGACCA CTTTCATCTG TTGCACCATC ATCAACAAGA ATAATTTCCA GATTTTGATA 4 500 

GGTCTGCTTC TGAATGGAAG CTATCGATTT TTCTAGGTAC TGCGCCACAT TATAGACTGG 4560 
CACAATCACA CTAATTAATG CAGTTTCCAT GCTACTCCTC TAATAGTTTT TCTACTTGTT 4620 
CGATTTGTTT TGWATTGTA AATTGTTCAA TGAATTGGCT AGCCTCATCG ACATCAAAGT 4680 
TTGAGGCACA AGTCATGTAA TTAGTAATCG CCTGAGCTGC CTCTTCATTG CTCTCAATGA 4740 
T7TGTCCAAA TCGTCCTTCT TGGGATAATT CCTCACCCCC TCCAACGTCC GTAGAGATAA 4800 
AAGGGAGTCC CAGACTCAAG GCCTCCACAT ACACTCCAGG AAAACCTTCT TGTTTAGACA 4860 
TAGACAAAAG AACTTTCGTC TGAGATAGAT ACTGATAAGG ATTTTTTTGA TAACCAAGGA 4920 
AATGTACATA GTCCTCAATC CCATACTCTT TGACTCGTTT TTTCAGTTCC TCTTCCATAT 4 980 

CACCAGCCCC CATAAAATAG AGATGATAGT TTTTTCCCTC TTGGTGTAAT AATCGTATCA 5040 

CTTCCACTAC ACGGTCAGAA CCCTTATTTT CCTCAATCCG TCCGATAGTA CAGATACTTT 5 LOO 

GAGGAGCAAT CTCGATATCG ATCTTCTCTT GAGATTTTTC TAGAATAGTC TGAAAATCAT 5160 

ATCCATTGTA GATTGTCTGT AATTTAGAAG TATAATCTGG ATAAACTTCC TTGATAGAAT 5220 

TGCTGGTCTT TTTTGAAATC CCTACAATTG TATTCGCAGC ATCCAACTGG CTTCTATCTC 5280 

ATTCTCTTTT AGAGCTATCC TTAAGAAGTT CTTCAATACT TCCATGAATC CAAGATATCT 5340 

TCTTGACTTC TCTTCTTTTA GAGAACAACA GTCCTCGATT CATAATGGTA AAAGAAACTT 5400 

CAACATCATA ATCATCTTTT ACAAGCAAAC GACGAGTCAG TCTTGGAAAA T AAATTCT C A 54 60 

TTCTCCACAA AAAAGCTCGT AACCATCTGG TTTGGCGATA ATCTTGAAGG GATTTTAAAA 5520 

TGCGTACATG CTTTGGAACA GATTCATATC CCTTGTCAAA GTGCTCCATT TCAAGAATAT 5580 

CAATATCATA CTTTTCTGGA TCCAGATTTG AAACAATCGT TGATAGAATC TTCTCTGCAC 5640 

CACCTCCAAG AGAAAAAGAC CACATAAAAA ATAAGATTTT TTTCTTAGCC ACCATATTCT 5700 

CCCTTGTATT CTGTATAAGA CTTATCCATA TCAGCGATGA CAGCATCATG ATGCGGTACC 5760 

TGCTTGTCTG CTGGTGGAGG CGTCATATAA TCCCCAAAAG CACTTCTGAG ATAGACATCA 58 20 

TAGCCGATTG GAATAGCCAT CTCTGTTCCT TCAAATGGCA AGAAAAGATT GTCTTCAAAA 5880 

GATGTGATTG GGTACTTGTT TCTCATGTAC CCACGACCTG AGCATAATTC TCTAATGCCA 5940 

TCACAATCAG CCAAATCATA CTTAGTCATT TCTTTCTCAG CTTTTTTCCA GATGCGATAA 6000 

CGGAGAGATT TTGGAGTCAA ACCCACTAAA ATGCGACTTC CCCATTTCAT GAGATCACCA 6060 
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TCCTTTTCTG GAATAGTTTG CGCACAAAAG AGTGAATAAA TCAAGGCCCA ACGAACCTGT 6120 

TTTTTCCGCT CAGCTGGATT TTTCGGATAA TAATCCAAAG GCAAAACATC CAAGGCCAGA 6180 

CCATGTGGCA AATCCAAATC CTGCTGATAA GGCTTGATAC AGGTGGTTTT CTTGTCACGA 6240 

ATGGTAATAA AAAGATTACG ATCAACAAAA TCCTTGTGAC TCTTTGACAA GAAATAACCT 6300 

TCATCTGCAT AACGAGGCCA TAATTCTGCT AATTTCTCAT AATCTTTACG AGGCATAAAA 6360 

AAGTCTAGGT CGTCGTCCCA AGGAATAAAT CCCTTGTTTC GAAGGGCACC AATAGCGCCT 6420 

CCGCCACAGA GATAACAGAG CAAATCATGT TCTTTACAAA AGGCCACAAA ATATTCAGCC 6480 

ATCTCCAGAC TACGAGCCTG AATTGCTTTT AAATCAGTCA TATTGTTCAT TATTCTTTCT 6540 

ATCGTATCGT TTCATTA7AC CACAAACAAG GGGTGAAAAT CTATTGCAGA CTGTAAAAAA 6600 

TCAAAGCCTG ACTGCTATCC AAATAGCTAT CAAACTTTGA TTTTTCTGTC TTATACTCTT 6660 

CGAAAATCTC TTCAAACCAC GTCAGCTTCA CCTTGCCGTA GGTATAGCTA ACTGACTTCG 6720 

TCAGTCTTAT CTACAACCTC AAAACTGTGT TTTTAGCAGC CTGCGGCTAG CTTCCTAGTT 6780 

TGCACTTTGA TTTTCATTGA GTATTATCTT ATCTTAAGCC CATTTGAGCG AGCTTGGTTT 6840 

GATATTTGTT TTGATCAACC AGCAGGCCCA AGCCCCCATA AACATCATAG GCATCTACCC 6900 

AGTCACCCAG TTCTGGAATC GTCAATTTTT CAATACCATT TTTTGCTCCA TCCAAAACAG 6960 

ATAAACCGTT TGTTACGAGG AAAGTATAGG GTACGTTGGT TGAGGTCATA GCAAAAACCT 7020 

TTCCAAGAGC TTCAGAACCA GTGAAAAGTT TACTGGGATC TTTAATTTGC TCTAAAATTG 7080 

CTGTTAAAAC TTGTTGCTGT CTTTTTGTAC GGCCGTAATC TGCCTCATCA TCATCACGGA 7140 

AACGAGCATA ATTGAGCAGG GTCGAGCCAT TCATCTGCTG TTTTCCGACT TTAATGGTTT 7200 

GGGTTGGAGA CTCAGTCTCG GTAGCGTATA AATCATCTCC GACTGTAGCT TCTGTTAGGG 72 60 

GACGCCCATT CAATGTTGAA AATTGAGCAT CAATCGTCAC CCCATCAGGG AAAAGCGTGT 7320 

CAATCGCTGT GGCAAAGGCC TGGAAATCAA CCAAGGCGTA GTACTTAATG TCCAAGTCAA 7 380 

AATTATCTTT CAAGACTTGG CGAACCATTT CTGCCCCTTT TTGCCCCTCT TGTTCTCCTA 7440 

ACTCGTAGGC TACGTTTAAC TTGTTATCTG TCTGTTTTCT ACCATTAATC ACTTGACTAT 7500 

AACCATCTAT ATAGACCAAA TTATCACGCA TGAAACTGAC TAGCTTCATT TTCTTATCTG 7 560 

AGCCCCCGAC ATTTAATACC ATAATAGAGT CAGTTCGTGT CTCAACACTG TTCTGGCCGA 7620 

TTCGACCATC AGTACCCATG ATTAAAATAT TAACTCCATC TCTAGTCTCC TGACCATTAA 7680 

ACACTTCTAC TTGAGCTGCC CGGGCATCAG CAGTTTTCTT TGCGCTAGCA TCTTGGTAAC 77 40 

CACGCAAAAA CATGAATACC ATGGCCAAAG CCACACAGAC CAAAAGTGAA AAAATCACCA 7800 

TAAAAATTCG TTTAAGACGG AGCTTCCGTC TrTTClT T T T TGGAGGGAAA GAGAGTGCTT 7860 
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Gf GATTTGGA TTCTGAGCGA 
TTTCTTGTTC CAAGCTAGAG 
AGGCAAACTC ATTTTTTTCT 
AACGCAACTG CTCATGATGA 
GTCTGATATT GGATAAATAG 
TAATTCTTTT TCGGCAAAGT 
AAAGTATTCA CCCAGTGCTG 
TCGCCAAGCA AAGGTCCACA 
CAAGGCCAAA CTCATCTTTT 
TAGAACCCAG AAACGTGTGA 
CAATCCATAT TCTTCAGCAG 
TTCGGAAATA AAACGGGCCG 
ATATTGTTCA TCGATGAATT 
TTCAATCTTA GTATGGCCTC 
TGCTACTGCT TGGATGTGAG 
CTCAATAGAA TTTTCAACTG 
GATGACATCT GTAATGTTGG 
AACGTGGTGT GAAAATGATC 
ATAATTTCCT CTGGGCTTAG 
CAAGCCTGTC TTTCTTGGAA 
GGTCGCTGAT TGTCCTTATC 
AGGTAGATGT TATCTGTATT 
CCTCCTCCAG TTGACACGAC 
ACCTGACGAA AGGCTGTTTC 
CGATTCTCAA TCAGAGCATC 
GATTTTCCAG CCCCCATAAA 
ATCAAAGAAA CTAGGATAGC 
ATCTGCAACC AAGAGCGCTG 
ATTGACTCTA GCACCGTGAA 



687 

CTCCGGTTCG CATAGCTTGG TAAGTCAACC TCCTCTTCTC 7920 

CTACTATTTC CCCTAGCAAG AGTTAGCTTT TCTTCCAAAT 7980 

CTCTCATTGA GATAGTGAAT ATTTTTTAGC AAATAATCAT 8040 

CTTAAGGGAT TTTCTTTACT CATCTTCTCT CCTTTCCATG 8100 

GATAGGCACC CAGAATTTTA TACTGGATTC CAATCGCTTC 8160 

GGACCAAGTC CTTATCGGTA TAATCCACAT CGATAATGAA 82 20 

TCTTGAGTGG ACAACTTTCA ATTTTTCTCA AGTCAATTCC 8280 

GGGCCTTATA AAGTGCACCT GGAAGGTTGT CAGGTAATGT 83 40 

CAGTTTGTGC TTGCAAGGGA ATACTAGGCT TTTCAGCTCC 8400 

AATTGGCTTC CATTTCCTGA ATATCCTCGG CAATCAGTTC 84 60 

AACTTCTAGG TGCAACTGCT GCAAAGGGCT GGTCTGGATG 8520 

CATAAGCTGT ACTAGCTGTT ACCTCGATTT GAGCCTCTGG 8580 

TCTTTCCTTG AGCCAAGCCC TGTGGATGTG AAAAAATCTT 8640 

GAACCACCAT CAACTGCTGA TGAATAGCCT GAACGATTTC 8700 
CCTGATGAAA AAGATACTCC AAGCTTTCAT GAACACTACC - 8760 

GCACCACAGA ATACTTCACT AATCCTTGCT CATAAGCCTT 8820 
CAAAAGCCTG CAATTCCTCA TGACGAAAAG CTGTCTCCAC ^ 8880 

CCTTGGGACC TAGATAAGCA ATTTTCATCT TAGTTCCTCT 8940 

CTTGGTCACA TCCAAAACCC GACTAGCCAC TTCCTCATAC 9000 

AATACCTACT AGTTCTTCCT TGCTA7TATT TAGAAAAAGC 9060 

ACCTGCGATA CGTTGGTAGA GGGTTTCAAA ATCTGCTCTC 9120 

AGTCTTGAGT AAGTCACGAT TTCTCTGAGA AATAACCACT 9180 

TTGGTCTGTT TGTAGTAAAT CAGCTAGGAC TTCTGATTCT 9240 

TCCCTTTTCA GCGAAAAAAT TCGCAATGGA CATACCTAGG 9 300 

CATATCAAGG TAATTAGGGT CCAAGCCTCT TGCAATAGTC 9 3 60 

CCCTAATAAC ACCTTAGCCA TGAATCAAGC TCTCCAAATC 9420 

TGGTATTGAT GGCTTCTGCA CGGTCAAGCT CCACCTCTCC 9480 

CGATAGCTGT CATCATGCCG ATACGGTGGT CACCAAACGT 9540 

GAGCTGATTT TCCTTTGATA ATCATCCCAT CTGCCGTAGG 9600 
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ACTAATATCT 


GCTCCCATAC 


TATTTAAGGC 


GTCTGCCACA 


ACCTGAATAC 


GGTCTGTTTC 


9660 


CTTGACCTTG 


AGCTCCTCAG 


CATCCTTGAT 


AACTGTTACA 


CCTTGGCCTT 


GGCTCGCAAG 


9720 


CAGGGCAATA 


ATGGGCAATT 


CATCAATCAA 


TCGTGGAATC 


AAAGCGCCAC 


CAATCTCTGT 


9780 


TCCTTTCAAG 


TCAGAAGACT 


CAACAATCAA 


GGTACCAGAT 


TTAGCGACTG 


GATCGATTTC 


9840 


AGTTATTTCC 


AATTTTCCAC 


CCATGGCACG 


AATGACATCA 


ATAATACCGG 


TGCGAGTTTC 


9900 


GTTGATCCCC 


ACATTCTGCA 


GCACTAGACG 


AGAATTTGGA 


GCAATCAAAC 


CTGCGACTAA 


9960 


CCAAAAGGCT 


GCACTGGAAA 


TATCTCCTGG 


TACGACCACC 


TTCTCTCCTG 


TCAATTTTTG 


10020 


TGGCCCCTGG 


ACTGTGATTT 


TCTTACCATC 


CACACTTAAA 


TGACCACCAA 


ATTGTTTCAA 


10080 


CATATCTTCA 


GTATGATTAC 


GGGTGTACTC 


TTTTTCGATA 


ATAACTCACT 


CCCCCTTACC 


10140 


TTGTAAGGCT 


GCAAACATCA 


AGGCTGACTT 


GACTTGGGCA 


GAGGCAATTG 


GCAACTCATA 


10200 


ATGAATAGGT 


CTTAGCTTTT 


TCGTCCCTTT 


TAAGCGAAGG 


GGAGGCAAGT 


CTCCTTCAGT 


10260 


TTGCCCTGAA 


ATGCTGACGC 


CCATTTTTTT 


CAGTGGAACG 


GTCACACCCT 


CCATAGGACG 


10320 


TTTGGAAAGA 


CTATCATCTC 


CAAACATCTC 


TACTTCGAAA 


TCTGCACCAG 


CAAGGACACC 


10380 


TGAAATCAGG 


CGAATCGAGG 


TGCCAGAATT 


TCCCATATTA 


AGGGCATTTT 


GTGGCGCTTT 


10440 


TAAGCCAGCC 


ATGCCTACAC 


CTTGAATGGT 


AATAACCCCA 


TCTTTATCCT 


CAATTTCAAC 


10500 


ACCAAGGTCA 


CGAAAAACCT 


GCATGGTCGA 


AAGAACGTCT 


TCACCTCGCA 


GAATATCATA 


10560 


AACCTTCGTC 


TCACCCTCAG 


CCAAACTTCC 


AAAGATAATG 


GAACGCTGGC 


TGATAGACTT 


10620 


GTCACCTGGG 


ACGCCCATAC 


TACCATGTAA 


ATGGCGAATG 


TTTGTTTTTA 


GTTTCATACT 


10680 


GGACCTCATA 


CTTGCAATAC 


TTTTACCTAT 


TTTATCATAA 


AAAGCCAGAA 


ATTCCTTAAA 


10740 


AATTCCTGAC 


TTTAGGATCG 


TTCTTTTCTT 


ATTTCAGCAA 


TTCTGAAACT 


GGTTCAAAAA 


10800 


CAATTTTTTC 


AATATCAGAA 


AGGTAAATGG 


CCAATTGTTG 


TTGCTTGGTA 


AAGAATTCTG 


10860 


ACAAGAGGCT 


ATTTCCTTGA 


ATCTGTTTAC 


CAAAGCCTTC 


CATCTTAGCT 


TGGAAGGACG 


10920 


CATCTGGCAT 


TTGACCTGTC 


TGTGCTAGTT 


TTTGAATTTC 


CTCTTGAAAG 


GCAAGATAAT 


10980 


CTGTAAAGAT 


TTTGCTTGCC 


TCAGCATCTG 


CTGCAATCGC 


ATCTTTAGCT 


GCTTTAACAG 


U040 


CCTTGTATTC 


TGGTAATCCG 


CGTAGACCGC 


GACTGAGTTC 


GTTTGCACTA 


TCGTAAATAT 


11100 


TTGACATGTT 


CTTCTCCTTA 


TTTGATGACG 


ACTGTATACT 


CAGTATTTTC 


TGTTATGAGA 


11160 


TGCTCAGCTC 


TTTCCAAGTC 


TTCAGCATTT 


TTAAATGAAA 


TTTGTAGGAT 


TCCGTGAATA 


11220 


TCCTCACCAT 


TTTCCTCGTT 


GATGTGGATA 


TTAACCAAGG 


AAGTTCCACG 


TAGCAGTTCC 


11280 


AAAATCCGCA 


GGATGACATC 


TTCTTCATCA 


GGAACGTCAA 


CATAGAGGTC 


GTAAGAGCTA 


11340 


TCCACACCAC 


CACGCTTATG 


GATTTCCATG 


GTCTGGCGTT 


GTTCACGCGC 


TTGGTTAAAA 


11400 
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AAGTTCCAAA 


TTTGCTCTTC 


ATCTCCCTTA 


CTAATGGCCT 


GACCAATCCC 


TTCCAAACGT 


1X460 


TCCTTGAAAT 


CCTCAATTCT 


ATCCAGAATG 


ATCTCGCTAT 


TGGACAAGAG 


AATGGAGGTC 


11520 


CACATTCCTG 


GCTCGCTTTC 


CGCAATTCGG 


GTCATATCTC 


GAAAACCACC 


TGCCGCAAAG 


11580 


CGCCTTGCCA 


TCTCATGCTC 


TTGAGCATAG 


ACCGCAGTCT 


GCTCCATGAG 


ACTAGAAGCC 


11640 


AAAATATGAG 


GAAAATGGCT 


AATCTGAGAA 


GTGACACGAT 


CATGCTCCTT 


GGCATCAATC 


11700 


TCGATAAAAC 


GAGCATGAAG 


ACCTGAAAGC 


AGATCCTTCA 


TTTCCTTAAG 


CGTGTCCTGA 


11760 


C7TGTCAGGC 


TTGAAGGTGT 


AAAGATATAA 


TAGGCATTTT 


CAAAAAGATT 


GACATCTGCC 


11820 


GAAGCAGCCC 


CTCTCTTGTG 


ACTACCAGCC 


ATGGGATGGG 


CCCCGACAAA 


GCGAACAGAC 


11880 


TTGCCAGCCA 


AATACTGCTC 


CGCCGCATCC 


ACAATGGTTG 


ACTTGGTCGA 


ACCAGCATCT 


11940 


GAAATAATAA 


CGCCTTCTCG 


CAAATCCAAA 


TTGGCCAACT 


CCTTAATGAA 


AGCAATAGTT 


12000 


TGTTTGATTG 


GCAAGCTGAG 


GATAATGACA 


TCTGCCAAAG 


GAGCAAAACT 


AGCAAAATCA 


12060 


TCCGTTGCAC 


GGTCAATCAT 


ACCTTCTTTC 


AAGGCGATAT 


CTCTCGAAGC 


TTGACTACGA 


12120 


TTATAACCTA 


AAATTTCATA 


ATCTGGATGA 


TCGCGTTTGA 


TACCAAGTGC 


CATAGAGCCT 


- 12180 


CCAATCAACC 


CAAGACCTGC 


GATATAGATT 


GTTTTTCCCA 


TAGGAACTCC 


TTAATACTTC 


12240 


TTTGTATAGT 


CTCGGTGTTT 


GGCTACCGCT 


TCTTTTAGTT 


CCTCAAGATT 


ATCTGATGAG 


12300 


AATTTTTCGA 


GGATTTCTTG 


CGCCAGAACC 


GTTGCTACAA 


CTGCTTCCAT 


GACCATTCCT 


12360 


GCAGCTGGAA 


GAGCAGTCGG 


ATCACTTCTC 


TCCACGGTTC 


CCTTGTAAGG 


TTCGTGGGTT 


12420 


TCGATATCCA 


CACTCATAAG 


AGGTTTATAA 


AGAGTAGGAA 


TGGCTTTCAT 


GACCCCACGA 


12480 


ACAACGATGG 


GTTGCCCATT 


AGTCATACCA 


CCTTCAAAAC 


CACCTAGATT 


ATTGGTACGG 


12540 


CGAGTATAAC 


CGTCTTCTTT 


AGACCAGAGA 


ATTTCATCCA 


TAACTTGGCT 


GCCTTTACGA 


12600 


TAACCACCCT 


CAAAGCCAAG 


ACCAAATTCC 


ACCCCTTTAA 


AGGCATTGAT 


AGAGACAACA 


12660 


GCTTGAGCCA 


ATCTTGCATC 


CAATTTTCTA 


TCCCATTGGA 


CATAGGAACC 


AAGACCAACT 


12720 


GGAACGCCTC 


CGACGACTGT 


CTCCACAACC 


CCACCGATCG 


TATCACCATC 


ACGTTTGATT 


12780 


TGGTCAATAT 


AGTCCTTGAT 


TTCCTGTTCT 


CCTTCTTCCT 


TGACAATAGA 


AACTTCAGAC 


12840 


TGGGCAGCTC 


TTTGCTTAAT 


TTCAGCGACT 


GTCAGATTTT 


CAGGAACATC 


GATTTCCTTG 


12900 


CCACCAAAGA 


CCACGACATG 


GTTGGCAATC 


TCCATATCCA 


GCTCAGCCAA 


GAGGCGTTTG 


12960 


GCTACTGCAC 


CAACTGCCAC 


CCGCATGGTG 


GTTTCACGAG 


CTCATCAACC 


CTCCAAAGAA 


13020 


TTTCGCAAAT 


CATCAAAACG 


GTACTTAATC 


CCCCCAACCA 


AATCGGCATG 


ACCTGGGCGA 


13080 


GGATGAGTAA 


TTTTCCCCTT 


GCTTTTAAGG 


CGGTCTTCAA 


TGTCCTCCGC 


AGACATGATG 


13140 
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TCCAGCCATT TCTCGTCCTC CTTATTGATG ACATCCATAG TAATAGGCGC CCCTGTCGTC 13 200 

TTCCCGTGGC GAACGCCCGA AGTAAAGACA ACCTGGTCAT TCTCAATCTT CATACGACCA 13 2 60 

CCACGACCGT AGCCACCCTG ACGGCGTCTA AGCTCCTCAT TGATATCCTC ACCTGTCAAT 133 20 

GGAACTCCAC CTGCAATTCC CTCAATAATA GCTGTTAGAC GGCGGCCGTG TGATTCTCCT 13 380 

GCAGTTAAAT ATCTCATACA CTCTCCTTAT TTTACCAACT AGTCTTTCAT CTCTTCCAGA 13440 

GAAACTGGGT GAATGGTCGC TGAACCAAGC TCTGGCACCA AGACCAATTT CAAGGTGTTA 13 500 

CCACGCGCTT TCTTGTCATG AGTAAGAGCC TGATAAAGCT TGCCAACTTC CCAATTTTCA 13560 

TAGTCAACAG GCAAACCGAA TTTCTGACAC ATCTCTGTGA TAGATTGGGT AATGCCAGCT 13 620 

CGCATGAGGC CTTTTTCCTC AGCAACCTTG GAAATCTCTA CCATTCCCAT GGCAACAGCC 13 680 

TCTCCATGCA TGACCTTGCC ATAACCGGCA GTCCCTTCGA TGGCATGGCC AATAGTGTGG 13740 

CCAAAATTGA GGTAAAGACG AATACCATTG TCCAACTCAT CTTCAACCAC CATCTTGCGC 13800 

TTCACCTGAC AAGAATGTTC AATCAAGGTC TCTGCATGTT CCAAAATACT CTCAACAGAA 13860 

CCATTCAGTC CCGTCAAGAG AGCCCACACT TCTGGATCCT CAATCAAGCC ATACTTGATA 13920 

ACTTCACCCA TCCCTTCAAT CAACTCTCTT TTTCCGAGCG TTTCAAGAAC AAGTGGATCA 13980 

ATCAGAACCC CATCTGGTTG GGCAAAGGTC CCCACCATAT TTTTAGCAAA TGGTGTATTA 14040 

ACCCCTGTCT TTCCACCGAT AGAAGAATCA ACCTGAGCTG TCAAACTAGT CGGAATCTGA 14100 

ACAAAGTGAA TACCCCGCAT ATAGGTAGAG GCTACAAATC CAGCCAGGTC CCCAACAACG 14160 

CCACCACCAA GAGCAACGAT TCCATCGCTA CGAGTCAGAC CTTGCTTGAC TAGAAATTCA 14220 

TAGACTTTCT GAACAGTAGT TAAATTCTTT CTTTCTTCAC CTTCTAAGAA ATCAAAAACA 14 2 80 

GCTACCTGAA AACCAGCATC TTCTAGGCTG AGCTTGACCT TCTCTGCATA GAGAGAGGCT 14340 

ACATGGTTAT CTCTCACAAT GACTACCTTT TGCGGTTGCC AGAGTTCTCG CAACCACTGA 14400 

CCAGCCTGGG CCATACAACC TTTTTCAATC TGAATATCAT AAGGATGGTG AGGAATATCG 14460 

ATTCTCATTT TCATAGGAGA GTCTCCCTTT CTTTATTGGT A T TTTTCTGT TAAAGACTGC 14520 

CAAATCTCTT CTGTCGGCAT TTCCTTGCCT GTCCACACTT GAAAAGCTTC TGCAGCTTGA 14 580 

TAGAGTAACA TTCCCAGACC ATTGACTGCT GGATTGCCCT GACTTCTAGC CCATTTCAAA 14 640 

AACGGTGTTT CAAAGGGTTG GTATATGATA TCTGCAACTA AAAGAGTTTC TGGTAAGACT 14700 

ATGTTTTCAG GAACAGGAGA GGATTGCCCA TCCATGCCCA CACTGGTGGC ATTAACTAGC 14760 

AAATCCGACT CGGCAATCCT TGCTTGCAGT TCAGAAACAT ATTCTAAAGC ACACAAATCC 14 820 

ACTTTAAAAC CTGTCTGCTC CTGTAACTTG TCTACGTAAG GTCTTGTTTT TTCCATAGAA 14 8 80 

ACGGAACGAA CAAAGACCGA AATCTGACTG ACGCCATCCA AAATAGCCTC TGCCAAGATT 14940 
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GATTTAGCCC CACCACCTGC ACCCAGCAGG GTCATCTTTT TACCTGAAAT TGTAAAAGAA 15000 

GCCAAGCACT TAAAAAATCC CTTGCCATCT GTATTATATC CAATTAAATT GCCATTCTCA 15060 

TTGACAACCG TATTAACCGC ACCAATCAAG CGCGCTTCAT CGCTCAGCTT ATCCAAATAA 15120 

GGAATCACCT GCTCCTTATA GGGCATGGAC AGATTGATCC CAAACATCTG GTAGCGACCA 15180 

ATATTGGCCA CTGTTTCTAC CAAGTCACTC GCTTCAATCT CCCAAGCCAC ATAAGCACCG 15240 

TTGGTAGCTG TCGCCTCAAA GGCTCTATTG TGGATGAAGG GAGAAATAGA ATGCTTAATA 153 00 

GGATTGGCAA CAACTGCACC TAAACGTGTA TAGCCATCAA GCTTCATCCA AAATCTCCCT 153 60 

GATTTTTTTC ATCCTAGC7A GAGAAATCTG CCCAGGGGCA CTAACCTCAT CCAGACTGGC 15420 

AAAAGACCAA CTCGAACCAG TCACATCCGC AGTGATACGA GAGACCTTGC CCACCTTACC 15480 

CATAGAAATG GTCACATATT CCTGTTCAGG ATTGAGGGTT TTAAAGCCTC GTGTATAGTT 15540 

CATCAAGTCT AAGACATCCT GCTCCGTGTG AGCCATCACC GC AACCTTAA CAAGTTTTGG 15600 

ATTTAGGATC GTCAACTCTG ACAAGATTTC CATCATGTTC TCAGGTGTTT CTTGGAAATT 15660 

ATGGTAACTC AAAACAAGAT TTGGGAAGTC CACCATTTCC TCAAAAACAT CCTTGTAGCT 15720 

ATAGTACTCA AAATCAATAT AGTCTGGTTG ATAGAGTTCC GCAACTTCCT TGATTAGATG 15780 

GATATACTCT TCTGGAGAAA GGTCGATTTC TCCACCTTCG GAGCCAGTTC GTAGCGTGAA 15840 

AACCAACTCA CGGCCTGCCA ATTTTTCAAA AATGGCTGGA GCTACCTGCA AAATCCCTTC 15900 

TTTAGGCAGA TAGTCGGCAC GCCATTCAAT GATGTCGGCA TCCAGGTACC TCGTGGCATC 15960 

CAGAGCCTGA GCCTCCTCTA AACTTCTTGG CATTACTGAA ACGATTAATT TCATTTACTA 16020 

ACCTTCATAC TAATCACCTT GAGGTAATTA CTACTTTCAT CTTTTTTATT ATAGGCAAAA 16080 

TCTGCTGGAA GACCATATTT GTTTAAAATC TGGTAACTTC TTCCTGCAAA ACCTTTATCA 16140 

ATTTGTTCTG TAAATTTCTG ACGGGAAACA TTGGCAGCAT TGGTACTGGC AATGATAATC 16200 

CCTCCCCGAT TTAAAATCTC AACACTCTGG GAAATCAACT TGTGATAATC CTTCCCCACA 162 60 

GAGAAAGTTT CTTTTTTATT CCGAGCAAAG CTACGCGGAT CTAGGACAAT CACATCGTAC 16320 

GTCAAGTCTT TCCGTTTGGC ATATTTGAAA TACTCAAAGA CATCCATGAC TATAAAACGA 163 80 

TGCTCCTCTG TGCTGAGCCC ATTTGCCTGA AAATGCGCTT GAGACAATTC TCCTGAACGT 16440 

TTGGCTAGAT CAACAGAAGT TGTATGGCTA GCTCCTCCCA TCGCCGCACC TACTGAAAAA 16500 

GCCGCTGTGT AGGAAAACAT ATTGAGTAAG GATTTACCCA TAGCCAAGCC GTCAACTAAA 16560 

CTACCGCGAA CCTCATGCTG GTCTAGGAAA ATTCCTGTCA TCAAGCCATC ATTCATAAAG 16620 

ACTTG AT AC A GGACACCATT TTCTAAAACA TTCAAAAAGT CAGGTGCTTC TTGACCATAA 16680 
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ACATCGCCAC ATTCATAGTC CAAACCC7TA AACCGGATTT TCTCATAAGC TCCTAAAACC 16740 

TCACGCAAAA CCTGTCTAAA GGCTTCTGAT ATAGTCTCAC GAATCTGATA AACATAAGAG 16800 

TTATACCAAG AAAAGACGGC GTAGTCGCCA TAAAGGTCCA CTGTCAGACC CCCAAAGCCA 16860 

TCTCCCTCTT GATTAAAGAG ACGAAAGGCA GTTGTCAAAT CATCTTGATA GTAGGCGTTT 16920 

CTCTTTTCTT TGGCTTTTCT AAACAACGTT TCAAAGAAAG CTTGATTGAA GGCCACCTTG 16980 

TCTTTGCTGA TAAACCAGCC CAAGCCCTTG TTTTGCTGAG AAAGGTAGGC AGTCCCAAGA 17040 

AAGTTTCCTT CCTGACCCTG CACCTCTACT TCCTGATCCT TAAGATTGAC ATTCTCAAGA 17100 

TCACTGGCTT CTAGTAAAAC TAGCCCCTTA GCAAGCTTCT TTTCAACCCT TTTGCTGACT 17160 

CTTATTCTA.T TCATAACTAC CATTATATCA AACTTTTAGA CAATTCTCAA' AAAAGAAACT 172 20 

ACCCTTCCTT TTTTACTCTT CTTTTAAAAA ATGCTATACT AGACTTCCTG CAAAACTAGG 17280 

AAGTAAATGT GTAAGAATCA CAGTAAAAAA TGCTCTTCCC TCTTGGAGGA GCATTTCTTT 17 340 

TTATCAACGA AAATCAAATA GCAAACTATG AAACTAGCCT CACGTTAACT CTGAGATTAT 17400 

AGGTAGAGAG GTTCTATCAG CAATATGTGT CTGTCAAATT TAGTGACAAA GGTAGTAGAA 17460 

GAAAGATAAA GAAATAAATC AGCTTCACTA GGTATCTGGA AAATTTGATT TTATAGAGAA 17520 

GCCTTTTGTT ACAAACTCAA TATACTATCA ATAAATAATA TTATAGAACC AACAATAATT 17580 

ATAATTTCAC CTATCTGCAT CATTCTATTT CCAACTCTAA ATATATCTTC TATCAAAAAT 17640 

ACTTGGAACA CACACATTAT AGGAATTAAC GTTTTTGAAA TTGAAAAATA TCCAAATAAA 17700 

TAAACTATAA ACAACAAAAA TAGAACTATG TTATATTTCT TATTCAAAAC ATTCCTCCCT 177 60 

ATATATTTTT GATTACCAAT CTTAATCATT TACAACTACA TTCTAACAAA CTATAAAAGC 17 820 

GTTTGTCGAA TTGAATTTAT CAAGCAAGCG ACCAACCAGT TCAT C TTTTT TCTATTTCTG 17880 

CCAATATGCG TGACAGGTAA TAATGATAGC CAAAAATAGC AAGAGCAAGC AAGACGATAA 17940 

GAGCTCCTAC TCCCAAGCTG ATGGCAAGGA TAGGGGAGAG AGACTGAACC AAGAATATGC 18000 

TCCCAATTAC AAGGGCCATC AGGATTGCAC TATAAATAAA CAATAAAACT ATCGCGAC f A 18060 

TGCCATTTGA ACGATTCACC AGGTCCGTAA TGCTACTCCA ATTGGTTGAC ACATTTTTAA iB120 

CGTCCTTAAA GTAATGCTGG CAAGAAAGGA TCACACTCGC AATGATCCAG ACTACAAGAA 18180 

GGTAAATCAT CGAAATGATG GGCAAGCCTA GATATAGAGA AAGACCAAGC AAAGTCAGAA 18240 

CTGGTAAAAA GGACTGGACA GCATATATAA TCCAAAATTT CACTTTCACA TAACGAGCAA 18300 

AGTCAAAGGG TAAACTCTTA AGAAAATCAA CATTTTCCCT CTCCAAGGAC AAGGCAATTG 18360 

AATGCAGGCT GGTGATATTG TTATTGACAA CTGCTATAAA GAGAGCTATA AAAAACAAGG 184 20 

GTAACCAGTA TGGAGGATGA ATGTCTGGAA CTATCTCAGA ATCTCGGATT TTGGAAATCA 184 80 
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CACCGATCAT CATC AG AT AA GGAAGGAAAG CACTTGTAAA AAGCACTGTA ATCACGCCAG 18540 

TCCCCTGTCC CAAGAGGGTG AGGTGGTAGC GTAAAACCAT GCGAAAAAAT CCCTTTTTAG 18600 

TGGTTGAAAT TCTCTCCTTG CTGCGACGTT CTTTTTTGAC CTTCTCCTCA CTATTAAGCA 18660 

GGATCACGTC ATAAAAACGA GGAAGGACCT TCTTTTTGGT CAGATAAAGC AGGAAGAGAG 18720 

TTAGTCCTAT CCAAGCGAGC AGACCCACTA AGGCTTCTGT CGAAAAAGGC TCCACTGCTA 18780 



TTTTGTAAAA GATATGAAGA GGATAAAGGA GAAATGGAAT GTCTCTAACT TTGTCAACAA 18840 

TACTTCCAAA AGTCGACTGA AGAAAGAAGA TAAATATTAA AGGTATGAGA ACTCCTATCC 18900 

CAATCATCAC ATTCGAAAAA ATACACTGAT ACTTTCTGAA GACCCTAGTT TGAGCCAAGA 18960 

AATGCACTGC CACTACCATC ACTAGAGCCA GAG AG AC AAA TAATAAGGTC AAGGACAGTA 19020 

GCATCAAAGG CAAACCCAGC CATAGAGAAG GAGCTAGCCT AATGTAGAGG ACCAGAAAAT 19080 

AAGCTAGGAT TCGTACAATT CCAGTTAGAG CTGCCAAAAG GACAGACAGT CCTTTAGCAA 19140 

TTATAATCTC TGATTCTTTA AAGGCATAGG GCCTATACGA TACCAAATCC TTACTCTCAT 19200 

AAAAGACATT GTAAAAGGCC GTTAAAGAAG TTGAAAAGGC AATCACTACT AAAATAGCAA 192 60 

TCATCGAGCT AAAATAAATA GGTATTTCCT CAAAAGGAAA ATGAATGGCT ATATTACTAA 19320 

AACAGATGAT CATCAAGAGA CTGGAAAAAA TGTAAGAACT TAAGACTCTA GCGGAAACAT 19380 
TTACTTTTTT 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 18436 base pairs 
<8) TYPE: nucleic acid 
(CJ STRANDEDNESS: double 
(D) TOPOLOGY: linear 



19390 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

CCGAGCGTCG TTACAGACTT TATCAAGATT GGACGCAAGA AGAAATTCAA CATATAAAGC 60 " 

AAAATATGGC ACAATCTCCA TGGCATACTC ATTACCATGT TGAGCCAAAA ACAGGACTTC 120 

TCAACGACCC AAATGGCTTT TCTTACTTTG ATGGCAAGTG GATCCTCTTT TACCAGAATT 180 

TTCCTTTTGG TGCAGCCCAC GGTTTAAAAT CTTGGGCACA GCTAGAAAGT GATGATTTGA 240 

TTCACTTTAA AGAAACTGGA ATCAAAGTTT TACCAGATAC TCCATTAGAT AGCCACGGTG 300 

CCTACTCTGG TTCTGCCATG CAATTTGGCG ATAACTTATT CCTATTTTAT ACAGGAAATG 3 60 

TTCGCGATAA AAACTGCATC CGTCACCCAT ACCAGATCGG TGCTTTGATG GACAAGGAGG 420 
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CTAACATTAC AAAGATTGAC AAGATCTTGA TTGACCAGCC ACCAGACTCT ACTGACCACT 4 90 

TCCGCGATCC ACAAATTTTT AACTTTCAGG GTCAATATTA TGCCATTGTC GGCGGACAAG 540 

ACTTGGAGAA AAAAGGTTTC CTTCCTCTCT ACAAGGCTGT CAATAACGAC TACACAAACT 600 

GGCAAGCAGT TGGCGACCTT GACTTTGCTA ACGACCGTAC TGCCTACATG ATGGAATGTC 660 

CTAATTTGGT CTTTGTAGAG GAACAACCTG TCCTTCTCTA CTGTCCACAA GGATTGGATA 720 

AGAAAGTTCT AGACTACCAT AATATCTTTC CAAATATGTA TAAGATCGGG GCTTCCTTTG 780 

ACCCTAAAAA TGCCAAAATG GTAGATGTGT CTCAACTTCA AAACATGCAT TACGGTTTCG 840 

AAGCCTATGC AACTCAAGCC TTCAACGCTC CTGATGGGCG TCCTCTAGCA GTTAGCTGGC 900 

TTGGTTTGCC AGATGTTTCT TACCCATCTG ACCGTTTTGA CCACCAAGGA ACCTTCTCTT 960 

TGGTCAAGGA ACTCACTATC AAAGACGACA AGCTCTACCA GTATCCAGTC GCTGCTATTA 1020 

AGGACCTTCG TGCTTCTGAA GAAGCCTTCT CAAACCCTTC CCAAACCAAG AACACTTACG 1080 

AACTTGAACT CAACTTGGAA GCTAATAGCC AGAGCGAGAT TGTCTTACTT GCTGATAAAG 1140 

AAGGTAAGGG ACTTTCAATC AACTTTCACC TTGTAAACGG TCAAGTAACA GTGGATCGTA 1200 

GCCAGGCTGG AGAACAGTAT GCCCAAGAAT TTGGGACAAC TCGTTCTTGC CCTATCGAGA 1260 

ATCAGGCTAC TACTGGTACA ATCTTCATCG ATAACTCTGT CTTTCAAATT TTCATCAATA 1320 

AAGGAGAAAA AGTATTTTCT GGTCGTGTCT TCCCACATGC GGACCAAAAT GGTATCCTGA 1380 

TTAAATCTGG AAACCCAACT GGAACTTACT ATGAATTAGA TTATGGTCGC AAAACTAACT 1440 

GATGTCGCCA AACTTGCAGG CGTCAGTCCT ACTACCGTTT CTCGGGTTAT CAATAAAAAA 1500 

GGGTATCTAT CTCAGAAAAC CATCCAAAAA GTCAATGAAG CCATGCGAGA ATTGGGCTAT 1560 

AAACCCAACA ACCTGGCTCG TAGTCTGCAA GGAAAATCAG CTAAGTTAAT CGGCTTGATT 1620 

TTCCCCAATA TTTCCAATGT TTTCTATGCA GAATTCATTG ATAAATTGGA ACACCAACTC 1680 

TTCAAAAATG GTTACAAGAC CATCATCTGC AACAGTGAAC ATGATTCTGA GAAGGAACGC 1740 

GAATACATCG AAATGTTGGA AGCCAATCAC GTGGACGGCA TCATTTCTGG TAGTCACAAC 1800 

CTAGGAATCG AAGACTACAA TCGTGTGACA GCGCCGATTA TTTCCTTTGA CCGAAACCTA 1860 

TCGCCAGACA TCCCTGTCGT CTCCTCTGAC AACTATGCTG GTGGGGTTCT TGCTGCCCAA 1920 

ACCTTGGTCA AGACAGGTGC CCAGTCTATC ATCATGATTA CAGGGAATGA CAATTCTAAT 1980 

TCGCCAACCG GACTGCGCCA CGCTGGTTTT GCATCCGTAC TCCCAAAAGC TCCTATTATC 2040 

AATGTTTCCA GTGACTTTTC TCCCGTCAGA AAAGAAATGG AAATCAAGAA TATCTTGACC 2100 

CGGGAAAAAC CAGATGCCAT TTTTGCTTCC GATGATTTGA CAGCTATTCT GGTCATTAAA 2160 

ATCGCTCAAG AATTGGGCAT TTCTGTCCCA AAAGAGCTCA AGGTCATCGG CTATGATGGG 2220 
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ACCTACTTTA 


TCGAAAATTA 


CTACCCTCAA 


TTGGCTACTA 


TCAAGCAACC 


TTTGGAAGAG 


2280 


ATTGCTTGTC 


TCACTATTGA 


TCTTCTCTTG 


CAAAAGATTG 


AAGGCAAGGA 


AGTCGCCACA 


2340 


ACTGGTTACT 


TCTTACCACT 


TACGCTATTA 


CCAGGAAAAA 


GTATTTAAAC 


ACAAGAAAAC 


2400 


TCAGACCGAT 


TCGTCTGAGT 


TTTTATGATC TTAAATTTTC 


GAGATAGCGC 


TGGGCTGTCT 


2460 


CTAGGTTAAA 


GGTTTTATCT 


GAGATGAGCC 


GCTCTACTAG 


GGGAGCAACT 


TCAGATTCAC 


2520 


TAGCCCCAGC 


TAGGAGAGCT 


AGGGATTTGG 


CCTGTAGTTT 


CATGTGGCCT 


TGCTGCATCr 


2580 


CCGTACTTAC 


CAAGCCTTTG 


AGGGCTGCAA 


AATTTTGAGC 


AAGACCGATG 


GACACGATAA 


2640 


TCTGGGCTAA 


TTCTCTGGCA 


GAAGGATTTC 


CTAGTAGATC 


ATGACTGAGA 


ACTACACGTG 


2700 


GGTTGAGGCC 


CATAGAGCCA 


CCCTTAGTCG 


CTACAGGCAT 


GGGCAGGCTC 


ATCTCACCCA 




CCAATTCTTC 


TCTTTCAAGG 


TCCAGCGTCC 


AGCAGCTAAG 


ACCTTGATAG 


CGTCCATCTC 


2 820 


GACTGGCAAA 


GGCATGGGCC 


CCAGCTTCGA 


TGGCACGCCA 


GTCATTACCA 


GTGGCAATCA 


5 ft an 


AAATCGCATC 


AATACCATTA 


AAAATTCCTT 


TATTATGAGT 


AGCAGCTCGG 


TAAGGATCAG 


2940 


CCTGCGCAAA 


CTCACTAGCC 


AACGCAATTT 


TCTCCGCAAT 


CTCTCGTCCT 


TG AT C f^*F*FW 

* \ 4 h* 4 k 4 i 


3000 


GGCGGCTCAA 


GTAGCGAAAG 


GCGATGCGAC 


ACCTTGCACT 


CACCAGAGAA 


TCGGTCGCGT 


3060 


AGTTGGACAG 


GATTCCCATG 


AGACTCTGTC 


CCTGACTGAG 


TTCTTCTAAG 


ACTGGTTT C A 


3120 


AGGCTTCCAG 


CATGGTGTTG 


AGCATATTGG 


CACCCATGGC 


TTCCTGGGTA 


TCGACATGAA 


3180 


TATAAACAAC 


GAGAAAGTCT 


GGTTCGCCTT 


TTATCTCCTC 


GACATGCAGA 


TCACGCGCCC 


3240 


CACCTCCACG 


TTTAACGATA 




Ll 1 UM I 1 L»Ot- 


AAGCTCCAAG 


ACCTCCCCTT 


3300 


TCTTGCTGGC 


AATCTTCTCT 


TGCGCTAGTT 


TAGGATTAGC 


AACTTGATAA 


AGGGCTACCT 


3360 


GCCCAATCAT 


CTGTCGCTGA 


TGGACTTGTG 


CAGTAAAACC 


ACCTGCACGC 


TTCATGATTT 


3420 


TGCTGGCATA 


GCTGGCCGCC 


GCAACCACAG 


AGGGTTCTTC 


TGTCACATAG 


GGAACGGTGT 


3480 


ATTCCTGACC 


GTTGACAAGT 


ACCTCCGGAA 


CCAGTGAATA 


AGGCAGAGAA 


AAAGTTCCCA 


3540 


CTACATTCTC 


ACTCAGCTCC 


TCTGCCACAG 


TCACCCTCAT 


CTGTTCATCC 


TTCTCCAGAC 


3600 


TAGCTTGTCT 


CTCAGGACTA 


AGGAGCGCCT 


GAGCTTTTAA 


CAGCTCGAGG 


CGCTCTTGGT 


3660 


ATGATTTTTT 


AGAAAATCCA 


TTCCAACTTA 


TCTTCATTAT 


TTTTCAACCT 


TGCTATAACG 


3720 


GCGTTGGTGG 


TCGAGAATTT 


CAACCAAGGC 


AAAATCTTGA 


TTTTCATAGC 


CAGCAAACTG 


3780 


GGCAGAGTTA 


GTTTCATCCA 


AGTTTACTTC 


CTCAAAAAAG 


ACCTTTTCAT 


AGTCTGCAAC 


3840 


GGATAGGGCA 


GTTCGTTGGT 


TGAGCTTGTT 


CAAACGGTCT 


TTATCCAAAT 


AAGCTTCATA 


3900 


TCCTTCAACC 


AATTCACCAC 


TGAAGAACTC 


AGCCACAGCT 


CCACTTCCGT 


AACTATAAAG 


3 960 
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CGCCATTTTA 


TCCCCAGCTT 


TCAAGCTATC 


TCTATTTTCC 


AAGAGAGACA 


AAAGTCCAAG 


4020 


GAAAAGTGAA 


CCTGTGTAGA 


TATTCCCCAC 


CTTTTGACTC 


TAGAGAATAG 


ACTGGTCAAA 


4080 


ATGCTTTTGT 


AAGAGGTCTT 


TTTTCTCTTG 


AGGCAGCCTC 


TTATCCATGA 


TTTTTTTCAA 


4140 


GCCTTTTAGC 


GCTAATTTAG 


GATAAGGCAA 


GTGGAAACAA 


ACAGCCGCAA 


AATCATCCAA 


4200 


AGTAAGCTCG 


TAGCGTTTTT 


GATATTCAAG 


CCAAGTCGTT 


TTCAAACTAT 


CCAAGTATTG 


4260 


TTGGGTAGAA 


TAGACACCAT 


TTACATAAGG 


AGTTGTCGAC 


TAATTTGGTC 


GCCAGAAATC 


4320 


CATGATGTCA 


CGGGTCTGAG 


CTACATTGTC 


ATTATTAAAC 


GCCATCATGC 


GTGGATTTTG 


4380 


TGTAATCAAC 


ATAGCTACAC 


TTCCAGCACC 


TTGAGTTGGT 


TCTCCTGCAG 


TTTCAATACC 


4440 


GTATTTGGCA 


ATATCACTGG 


CAATGACCAA 


GACCTTGGAC 


TCCGGAGAAT 


TTTCCACATG 


4500 


CAATTTGGCA 


TAATGGAGGG 


CAGCAGTCGC 


TCCGTAGCAG 


GCTTCTTTAA 


TCTCGAAACT 


4560 


ACGAGCAAAG 


GGCTGGATGC 


CCAGCAAGCC 


ATGCACAAAG 


ACGGCCGCAG 


CCTTACTCTG 


4620 


GTCAATTCCT 


GACTCGCTCG 


CCACAATGAC 


CATGTCAACT 


TCTTGTCTTT 


CTTGCTCAGT 


4680 


TAAAATAGAG 


TCACTAGCAC 


TCGCCGCCAA 


GGTCACGATA 


TCCTCAGTTA 


GGGGCGCAAT 


4740 


ACTCAATTCC 


TTGAGTAAGA 


GTCCTTTACT 


TAATTTTTCA 


GGGTCAATTC 


CCCTCGCTTC 


4800 


TGCTAAGTCT 


TGTAATTTCA 


AGACATATTG 


ACTGGTCGCA 


AAACCAATCT 


TATCAATACC 


4860 


GATTGTCATA 


TTTACCTCTG 


TTTPATCATT 


CATGTAAAAA 


ATCGTTCTAT 


ACTATTTTAT 


4920 


CACAAATGGC 


AGTAAAAGAG 


AGAAAAAAGA 


CTTGATTCAC 


CAAATCAAGC 


CTCTTATTGG 


4980 


TCATCATTTT 


AAAGAATGAT 


TAGTTGCTAG 


AGAGTTCACC 


GATATAAGTA 


GCTTTATAAG 


5040 


CTCCATTCAC 


AGTTATCAGC 


TCCTGGAGGA 


TCAAATTTCC 


TGAGTAAGTC 


CTTCCCATCT 


5100 


CATCTACAAA 


TTTTTGATAA 


AACTGACTGG 


TCGGAATTTC 


TCTGACATCC 


TTATCAAATG 


5160 


TCTTATCAAG 


TGTTTTACTA 


ACCTTCTCAG 


CAATCAATTG 


ATGCTCTTGC 


CATCCACTTT 


5220 


GAAACTCTGA 


GCCCGAACTA 


GAAACCATGA 


CTGGGATAAA 


CAACAAGGTC 


AGTAGATTTA 


S280 


CAGACAATAA 


GGAAAGTAGT 


AGACTTCCTG 


CAAAACTAGA 


ATCCTACTTC 


ATGATTGATA 


5340 


ATACCAGCAA 


TCAAATTCAT 


TCGTAATCCG 


AAGCCTTTAC 


GATGATTTCG 


ATAGGTTGTT 


5400 


GAAAACATTT 


TAAACGTTTT 


TACTTTGGCA 


AAGATGTTCT 


CAACC7TGCT 


TCTCTCCTTA 


5460 


GATAGCGCAT 


GGTTACAGGC 


TTTATCTTCA 


GCTGTTAGCG 


GCTTGAGTT7 


GCTGGATTTA 


5520 


CGTGGAGTTT 


CTGCTTGAGG 


ATATATCTTC 


ATGAGCCCTT 


GATAATCACT 


GTCAGCCAAG 


5580 


ATTTTACCAG 


CTTGTCCGAT 


ATTTCTGCAA 


CTCATTTTGA 


ACAACTTCAT 


ATCATGACTA 


5640 


TAGTTCACAG 


CGATATCCAA 


AGAAACAATT 


CTCCCTTGAC 


TTGTGACAAT 


CGCTTGAGCC 


5700 


TTCATACCGT 


GAAATTTCTT 


TTTACCAGAA 


TCATTCGCTA 


ATTGTTTTTT 


AGGGCGATTG 


5760 



WO 98/18931 



PCT/US97/19588 



697 



ATTTTTACTT 


CCGTCACATC 


AATCATTATC 


GTGTCCTCAA 


AGCTGAGAGG 


AGTTCTTGAA 


5820 


ATCGTAACAC 


CACTTTGAAC 


AAGAGTTACT 


TCAACCCATT 


GGCTCCGACG 


G ATT AAGTTG 


5880 


CTTTCGTGCA 


TACCAAAATC 


AGCCGCAATT 


TCTTCATAAG 


TGCCCTATTC 


TCGCACATAT 


5940 


AGAAAGCGTT 


ATCAATTTAT 


TTATCTCATT 


TTTCAGAAAA 


TTCTTT T ATT 

* * ^ 4 4 4 4 4% 4 4 


TCTCTAAACT 


6000 


CTACGATACT 


CGATGTGTTT 


TTATATAATG 


ATAGAGTCTG 


AGAATCACTC 






CATTCCAATA 


GAGATTACCA 


AAGCCAACAT 


GACAACCAAG 






oi«u 


ATT AT ACT CC 


CCTGTCACAA 


AAAAGGCAGT 


TGTTCGGTAG 


GAGAGATAAC 


CTGGAAPCAG 


6 180 


CGGTGCCAAA 


ATGGCCAAGA 


TAAAGACCAC 


AGCAGGTGTC 


TTATAAAGAA 


TACTTAAAAT 


6240 


CTGGCTGACA 


CAAGAACCAA 


TAATGGCTGC 


AATGAAGGTA 


GCTACAATGA 


CATTGGTCGG 


9 JVU 


TTCCTTGAGC 


AAGAGATAGA 


TTAGCCAGAC 


AGTCATGCCC 


AAAATCCCTC 


CAGGTAAGAG 


Q J OU 


CATAGACCGT 


TGCACATTGA 


GTACGATTAA 


AAAAGTGATA 


ATGGCAAGAA 


AACTTGCTAC 


6420 


TGCTTGTAAT 


AAAAAGGTTG 


TTAGTGTCAT 


ATTAGTTCAT 


C AAT AC CAAG 


GCGACAGAAG 


6480 


TTCCTGCCCC 


TAAAGCGAGG 


GTAATGAGCA 


GGGATTCAAA 


CATCTTACTC 


ATACCAGAGT 


6540 


TTATGTGGTT 


GGTCATAATA 


TCACGGACCG 


CATTGGTCAA 


GGCAATACCT 


GGTACAAACG 


6600 


GCATGACCCC 


ACCAGCTATA 


ATCAAATCTG 


CCCTTGAAGG 


AAAACCTGTG 


TACCCAGCCC 


6660 


AAAACTGGGC 


AATTATCCCA 


AAGACAAAGG 


CTCCAGCAAA 


GGCTGTCACA 


AAGGGAATTC 


6720 


GGATAAATTT 


TTCCACATAG 


AGGGAAAAGG 


CAAAACCAAA 


TAAGGTCGCC 


ACTCCTGCCC 


6780 


CAAGTGCGTC 


GTAGATATTT 


CCGCTAAACA 


TAACTGAAAA 


GAAAGGAGCA 


CTAAAGGTCG 


6840 


CAGCCAGAGT 


TACCTGCAAC 


TTAGTATAGG 


GAAGGGGTTT 


GGCTTGCAAG 


GCCGTCAATT 


6900 


GCTTAAAGGC 


TGTTTCTAAG 


TCAATCTCCC 


CCCCAACTAG 


CTGACCAGAA 


ATCTGGTTCA 


6960 


CATCGCAGAC 


TTTTTCGATG 


TTATAAGAAG 


AGGAGGTCAC 


GCGCTTCATG 


CCCAAATATT 


7020 


GGTATTTTCA 


ATAGAGAAAA 


AGATAGCGGC 


AGGCATGGCA 


AGGACATTGC 


AAT CC AC AAT 


7080 


CCCCTGCGAA 


TGCGCGATTC 


GAATCATGGT 


ATCTTCTACA 


CGATGGATTT 


CTGAGCCACT 


7140 


TTTAAGGAGA 


ATAGTCCCCG 


CTAGCATAAT 


CACATCAATG 


ACGGCATTTA 


ATTCTTTTGA 


7200 


TTCTTCCATG 


CTTTCCTCCT 


TTTATCAACT 


CCCTCTATTC 


TATCACAAAT 


CCGGACTCAA 


7260 


AAAAAATCTT 


TGCCATGAAA 


TCATGACAAA 


GATTGATTAC 


TCATTTTGAT 


TATCCATCTG 


7320 


CTTTTAAGGA 


GTAGCTGAAG 


TTGTTTTAGG 


TTTGTAGATT 


GAAATCTTGA 


CTCTAGTCTT 


7380 


ATTGAGGTCT 


ACCTTTTCAC 


CTGCTCTAGG 


ACTTTGTTCA 


ACAACCATGC 


CTTCTGCACT 


7440 


ACCTGCAGGC 


GCTGTCGTCA 


CTTCTACAAC 


TTCTATATTA 


GCTTCCTTAA 


TCCCAACAAT 


7500 
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TTGAATCAAA TTGTTCTTAG TAAACTCCAA GCTAGAACCA ATGTAACTCG GCATCGCAAC 7 560 

ACTTGTAACT TTTTTAGCTA CTGTCAAGAC AATTTGAC7A GGTTTACTCA CATCATAAGT 7620 

CGTTCCGGCA CCTGGACTTT CTTTCATAAT CGTTCCTGGT TCGCTTTCGC TGGACTCTTC 7680 

TTCCTCTATC TTAATCAAAT TCTCAGGAAC CTTCTTCTCC TTGAGTTCTC AGATTACTTC 7740 

TGTAGAGTTC CGTCCAATAT AGTTCCCTAA TTGAATCCTC bTAUC ' riTlT TAGCTACTGT 7800 

CAAAACAATT TGAGTTGCCT TGCTCAAGTC ATAGGTCGTA CCTTCTGGTA GACTTTGCTT 7860 

CAGGACCCTT CCAGCCTCAC TCTCATTCGA CT CTT CT TC C TCAATTTTAA TCAAATTATC 7920 

TGGAACTTTT TTCTCTTTTA ATTCCGCAAT GACATCAGAG GATTTCCGAC CGACATAATT 7980 

ACTAATTTGG AAAGATTGCT TCCCTCATGA GACAACCAAA TTGATTTTCG TTCCTTCTTT 8040 

TCGACCAGTT CCAGCGCCAG GATCTGTACG GATAATCCGC CCTTCTTCCA CCTTTTCACT 8100 

AGCCTCTGTC TTCTCCTCAC CAATCTCAAA ATTGGCTTTT TTGAGCCTTG CCTTGGCCTC 8160 

TGCAACTGTC TGACCTGCCA CATCTGGAAT GGCAATGGTT GCAGGAGTTC TGGATAGTAT 8220 

CCAAATAAGA GAAGCTGCCA CCAATACAAG GCTGGCCAAC AAAATCAGGT AACGCATCTT 8280 

AAATCTATGT TTTTTCGGTG CTTGTGGTTG GTAAGTTTCC TCTCTCACAG CCTGGCTTGG 3340 

GTTTTTGATT GATTTGTGTT CTGTTTGCGC TTGAACCTTA GGAATAGATG TCAAGGTACT 8400 

CTGAGAAACC TTCGGCAAGG TCTTGGTATC TGCCTTGCTC GTTTCATCAA AGATTAACTT 84 60 

ACTTTCATTT CTACGATTGT AGCACAACCT ACTAGACAAG TCCACATACA TCTCTGAAAC 8520 

CGAGCGGTAG CCATTGGTCA ACTTTTTACC ACTTGCCTTG ATAATAACAT TTTCTAAAGC 8580 

CTGAGGTACA GATGGATTTT CTGCAATAAC GGACGGCAGG GGTTTCTGGA AATGCTGGAG 8640 

GGCAATGGTC ACCGCGCTAT CCCCGTCATA AGGGATATGG CCTGTCAGCA TCTCATAGAA 8700 

AATAATCCCC ATCGCATAGA TATCACTCTG CACAGTCCCC TTCGAACCAC GCGCCTGCTC 8760 

TGGTGACAAG TAATGAACTG AGCCCAACAT CGAGTTAGTC TGGCTCAGAC TTGTCTCTGC 8820 

AAAGGCTACA GCAATCCCAA AGTCTCTGAC CTTGGCAGTC CCATCTGCTG TCAAGAGGAT 8880 

ATTTTGAGGT TTCAAGTCCC TGTGAACAAT TCCTCGAGTA TGGGCCAAGC GCATAGCCAA 8940 

CACAATTTGT CCCATGATAC GGACTGCTTC TTCATTAGAA AGAGGATAAT GTTCCTTGAT 9000 

ATAGCGTTTG AGGTCCAGTC CAGCCACATA CTCCATAGCT AGGTACTGTT GACCGTCTTC 9060 

CTCGCCAATA TCTG7TATCC GAACGATATG AGGATGGTCT AGATCTGCCA TAGCTCTCGC 9120 

TTCACGCTGA AAACGAGCTA CACCTATCGG GTCCGTCTGG TAGTTCCTCC TCAGAACCTT 9180 

CACTGCCACT TCTTCCCCAT CTAAGATTAA CTCTTTGGCT AGGTAGACAT CCCCCATACC 9240 

TCCTCGACCA ATCTGTTTGA CAATCCGATA GCGTCCGGCA AAAATCTTGC CGATTTGGAT 9 300 
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CATTCTGCAT CCTCCTCCTT CATAGAAACA AGGGCAACCG TAATGTTGTC TAAACCTCCT 9360 

GCATTCTTAG CAAAACGAAC AAGTGTCTCC GTTTTATCTG CTAAAGGAAT ATCACTGGTT 9420 

ACAATATCAC GAATCTCACT CCCTGAAATC ATGTTGGTCA AGCCGTCACT ATTGAGCAAG 9480 

AGATAGTCAC CTGACTCAAG GATAACTGTC CCAAAATCAG GCTGAATTTC ATCTTTTTGC 9540 

CCAATAGACT GGGTGATAAT ATTTTTTTGC GGATCAGCTT CTGCCTCTTC TGGTGTCAAT 9600 

TGACCAGCCT TGAGCAATTC ATTAACCAAG GAATGATCGC TCGTCAACTG ATGGTATTCT 9660 

TCTCCACGAA TCAAGCCGAT ACGCGAATCA CCAATATGAG CATAGATAGC CTGATTATCA 9720 

ATAATAGCAA GGACTTCCAA AGTAGTTCCC ATGCCTCTGT AAGCTTCATC CTGACCAAGC 9780 

TGGTGAATCT TTTGATTTTC AATTTCTAGG TAATGGGCCA ACCATTCACG CACTTCATTG 9840 

ACTGTATCGA TCTGGGTATC AACCCAAGCT ACACCCAGGT CTGTGACCGC CATTTCACTA 9900 

GCGATATTCC CTGCCCCATC ACCTCCCATC CCATCAGCTA AAATAATCAT GGTACGTCCA 9960 

GCTCTATTGA CATAGTGCTT GACATAGTCT TGGTTATTTG TTCGTTTCTG ACCAACATCT 10020 

GTTAATAATG AAATTTCCAT GTGTCAGTTC CTTCCTAATC CGATATCTTG CGAAATTGAC 100 BO 

TGATGAAGAA TCCATCACTT CCATACAATT CAGGTGTAAT GAGGATACAG CCGTCTTTCA 10140 

TGATATCCTT ACATTCATGT TCTAGTTTTA CCTGCTCGAA CTCGGCATGA CTCTCTAAAA 10200 

AGGCCTTAAC GACTTGAAAA TTCTCCTCTC AGACGATAGT GCAGGTGCTA TAAGTrATTA 10260 

TACCACCTTT GCCTAGTATT TGACAAACAC TACCTAATAT TTCTAACTGA ATTTCCTGCA 103 20 

AGGACGCGAA ATCTGCCGTT TCTTTATTGT ATTTGATATC TGGTTTTCGC CGCAAAAGAC 103 80 

CGATTCCTGA ACAAGGAGCA TCCACCAAAA TCTTATCAAA GGAATCCTGG TCAAAAAACT 10440 

CATGCACCTT TCTGGCATCC AATTTTTGAG TTTGAACCCC ATCTTCAACT CCCAGACGTT 10500 

GGGCATTTTC TTGAATTAAA TCCAACTTGT GGTCGTACAA GTCCAGAGCA CTAACCTGAC 10560 

CTGTCGTAAG ATAAGAGCCT ATATGGGCTG TTTTCCCACC TGGAGCCGCA CAGGCATCAA 10620 

GCACTCGCTC ATCACCTTGT AAATCAAGCG TCGGAGCAAC CAGCTGACTG GACTCATCTT 10680 

GGATGGTAAT GGCTCCATCC GCAAACAAAT TATGCCCTGC AAAATGCCCC TGCTCCTTAA 10740 

CCAGACCAGT GGTTGCTAAA AGGGAATTAT TCGCCTCCAA CAAGGCTTGG ATTTCCTCTT 10800 

TTCGACTTAG GTCTGTTACA CGAATACTGG CTTTGTTTCG CACTAACAGG CTTTCAAAGA 10B60 

TGGCTTTTGC TCTCTCCTCT CCGTATTCTT CCTTGAGTTT GGCAACTAGC CAAACTGGGA 10920 

GAGAATAGGC AATCCACTCA CGCTTGTTTT TTCGCTTGAT GCTAGCAATA TCTGGCCAGC 10980 

CTTCACGCAA GATACGGCGA AGGACAGCGT TGACCAATTT TTCACTGCCT TTTTTACGGA 11040 
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GTTTGGCCAA TTCCACTCCT TCATTAACCA CAGCATGATC TGGAATCTTG TCCAAATAGC 11100 

GGAGTTGGTA GGCACTCATG AGAAGAAGGA CATAGAGCCA CCTGTCTAAC TGGTCTCTGT 11160 

CTTCGATAAA GTGGGATAGG TACCATTCCA GAGTCAGTTT ACGGGCTACC GTTCCATAGA 11220 

CCAGCTCGGT CACTAAGCCC TTGTCTGCTG CCAAAAGTTG ACTTCCCTTT AGATGCTTAT 11280 

TTAAGGCGAT ATTTGAATAT GCTTGGTTCA CAAAAACATC CTCTAGCACT GCTAGAGCTA 11340 

AACTTCTAGC CGTTTCTACT TTAGTCACCA AATCGTTCTC CTACAGTCAA TGTACGTCCA 11400 

ACTCCGTTGA GGAAGGAAGC AATGTCCATC TTAGGCTTAC CAGCTGGCTG CACTTGTTTG 114 60 

AGGGATAGAG CCCCT7CAGC CGTTGCGACA ATCAATTCTT TCTTGCCGAT AGAGAGAATC 11520 

TCACCTGGAT TTCCCTGACC TTCTACTGGT AGGGCTTCAT AAATCTTAAA GCGGTCGCCC 11530 

TTAAGGAAAG TATGGGCAAC AGGCCAGGGG TTCATTCCAC GAATTTGGTT AAAGAGTTGA 11640 

CGATTGGTTT TGTTCCAGTC CAGTTTTTCT TCCTCTGGCT TTATATTTGG AGAGAAGGTA 11700 

ACCTGACTCG TATCCTGCGG TTCAGGTTTG ATATCACCAG CAATATAGGC AGGCAGAGTG 11760 

TCCAAAAGCA AATCACGACC AACTAGCGCC AATTTTTCAA ACAAGGTGCC AACATTGTCC 11820 

TCATCTGTGA TCGGAATGCT GCGACGAGAA ATCATATCTC CTGCATCCAT TTCCTTAACC 11880 

ATTTCCATGA TGGTCACACC AGCTTCCTCA TCCCCTTGAA TCAAGGCATA ATGGATAGGC 11940 

GCACCACCAC GGTGTCTAGG AAGGAGGGAG GCATGAACCT TGACAGCAAA GTCCATGCTA 12000 

TCAAGGAGTT TGCTTGCGAG AAACTGCCCA AAAGCAGCAG TCACAATTCC ATCTGCTCCT 12060 

AGCTTCATAA GATCTTCCAT CTCTGGACTT CCAGATAATT TTTCAGGTTG CTACATACAT 12120 

AGTCCTGCTT CCTTGGCAGC CTCCTTGACT GCGGTTTCTT GGATAACTTT TTTACGACCA 12180 

ACAGCACGGT CTGGCTGGGT CACAACGGCT AGAATTTCGT AACGGTCATC TGTCAAAAGT 12240 

CCTTTTAAGA CTCTTCCTGA AAAGTCGGGG GTCCCCATAA AGATTAGTTT TGTCATATCT 12300 

TCTCCTTCTT ATAAAAATTG CTGCGGCTCA TGGTCAATCC TCAGACCGAG CTCACTATTT 12360 

TCCCGTTCTT GAGTCAAGGC TAAAACCTGG TTGAGGGTCG ACCCCAGCTC ATCTTCTAAA 12420 

CGGTATTTAA TTAAAATCTG GTAATGATAG AGGTTGTGGG TACGGGCAAT CGGTTTTGGC 12480 

GTTGGCCCCA GAATGGGACT GGTCTCTGAC AAGCCTGACC GCAAAATGTT CATGACTTCA 12540 

TAGGCACGTT TGAAAACCTC TTCTTCTTTC TTGTGAGAAA GGGTAATACC AATCGTGAAA 12600 

TAGTAAGGTG GATAGCCGAG TTGTCGTCTG ATTCCCATTT CATAGGCATA AAAGCCTTCG 12660 

TAATCTTGAT CCTTGGCAAA TCGAATAGCA TAGTGCTGCG GATTGTAGGA CTGTATCAAG 12720 

ACTTGACCTG CCTTTTCAGC CCGACCTGAT CGACCTGCCA CCTGAGTCAA GAGCTGGAAG 12780 

GTTCTCTCAG AAGAACGGAA ATCACGCAGA TTCAAGGCCG TATCCGCATT TAGAACTCCG 12840 
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ACTAGGGTAA CATTCGCAAA ATCCAAACCC TTTCCAATCA TCTGAGTACC AAG7AAAATA 12900 

TCCGCTTCCC CTCGCCCAAA CTGGTCAAGC AAGGCTTGGT CACTCCCTTT CTTTCGAGTC 12960 

GTATCCACAT CCATCCTCAG AATGCGAGCT TGGGGAAAGA GTTCTGCTAG CTCATCATAA 13020 

GCCTTCTGAG TTCCCGTCCC ATAGTAACGA ATACTGCGGC TCTTACAGTT AGGACAGACC 13080 
TGAGGAATAT CCTTCGAGAA ACCACAATAA TGGC ACTTCA TAGTCT TGGT ATCCATATGC . 13140 

AAGGTCAGAG AAATATCGCA GTTGGGACAA GTATCCACCG TCCCACACTC CCGACACATG 13200 

ACAAACCTAC AATAACCACG GCCATTGACC ATGAGAACCA CCTCCTCTTT TTTAACCACA 132 60 

CGGTCTTGGA TACCCTCTAG CAAACGAGGC GTAAACTTTG ACGTCTCATT TTGTCCGATA 13 320 

TAGTCTCCAA AGTCAATCAC TTGAACCTCA GGGATTGTAG CCAAAGGATT GGCACGTTGG 13 380 

GTTAGACCTA AGTGTTGATA GACGCCTTTG CCAGCACCTG CCCGGCTCTC TAACCTCGCC 13440 

GTTCCACATC CAAGTACCAG AGTTGCTTGA TTATACTGAG CCCCTAAAAT AGCTACCTCT 13500 

CTGGCATGGT AACGGGGATT GCTGTCCTCC TTATAAGCCG CTTCATGCTC TTCATCAATA 13560 

ATCATGACAC CCAGATTTTT CACAGGAGCA AAGATAGCAG ATCTGGCACC AACAACAACT 13 620 

TGGGCATCGC CACCCTCCAC CTTCCCCCAT TCATCATACT TTTCACCATT GGATAATCCT 13680 

GAGTGAAGAA TGGCTACCTT CTCCCCAAAA CGTGCTATAA AACGCTCGGT CATCTGAGGA 13740 

GTCAAGGAAA TCTCAGCTAC CAGCAAAATA GCTGTCTTGC CCTTATCCAG GGCACCTTGC 13800 

ATAATCTGCA AGTAAACCTC GGTCTTCCCA CTTCCTCTAA TCCCTTGAAG TAGAAAGGGA 13860 

GGTTGAGAAC TGCCAATAGA ACTCACAACC GCATCACCCC CCTGTCTTTG TTCTGGATTT 13 920 

AACTCCAAAG GTCTACTTGC TTCAATTCCT TCAAAATAAG CAGCCGAGCG TTGAACTTCC 13 980 

TTTTGGACTA TGGTAACAGC ACCTTGATCC ACAAAGAAGT TGACTTGCTC TCGCGACTAC 14040 

GACTCTAACA AGCTACCCAA GGAAGCGCTC TCTGCATGAG ACAGCAGATA ATCTCTCAGT 14100 

TCCAACTTTT TCTTGGCACC TGTAGAAATC TCAACACCTT CTAATTGAGC ATGGTCAACC 14160 

TCATACCAAG ACTGGGTCTT GACCTTCTTT TGATCCACTG CCTCATATTC CAGACCAACC 1422 0 

AGGCCnTTC TACTCAAACG CATCATTTCA GCTTCCTTGG CAAGGTCTAG TGAAGAAAAC 14230 

GCTAGCGAAT CTTCTGAACC AAACAGGCGC ACTCGTTCTT CCTGACTCAA GCCTTCCAGA 14 3 40 

GGATAGACAA TCTTGTCATA GCTAGAATTC AGAAACCCTG CAAGCATGGC CTTGAGGATA 144 00 

GAGATTTTGT AGGAGAAGAC AGATTTGCGT AACTCCTCAG CCAGCCAGAG TTCTTCTCCC 14 4 60 

GTGAGAACAG GAGAAAAATC CAGCACCTCT GCAATATCTT TTAAATCTTG CTCCATCTCT 14520 

TCTCCATCTG ATTGGGACTT CAAACCAAGA ACAATCCCTT CAATCACCCG ATTACCCTTA 14530 
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CTGTAACTAT AGGGCTGGTC CGTCTGCATC AAGGGCACAT CTACGATAAT CTTACCTAGG 14700 
GCCATCTTCT CACCTCCTCC TTGTCAGTAC ATTCTTGCAA TAGAAAAAAT AAGATTGAGT 14760 
CCCCCCAACC TTAAATTTTT TCACCATCTT CTTTH' C m' AGCAATTTGC TCTTTGATTT 14820 
TCTTTTCTTC TTCTTCTTTG CGGCGTTTTT CTTCTTCGAT ACGGCGACGC ACTGCTTCAC 14880 
GTTTTCCTTC TGGATCTGGG TCAATTGTAA CGTTTCCTGA TTCGATTTCT TCTAAAGCGC 14940 
GAAGAGTTGA TTTTTCAGAC TTGAAACCTT GAGTTGCTGG GGCACCTGCT TCCAATTCCT 15000 
GGGCACGTTT TGCTTCCAAG ATTACGAGTG AATATTTTGA AGGAACCTTG TCGAGCAAGG 15060 
TATCAATAGA CGGTTTTAAC ATCATTTGCT TGTACCTATT TTCTAAATTT TATCGGGTAG 15120 

TTGGAGATTT TGGTAACATC TCCTGATAGT GACCAATGAC ACGATCCACA CAGAAGTGTT 15180 

CTGCTTCAAT CACACATTTG ACACGTTCAG CAGCTAGGGG TACCTGATCG TTGACAATCG 15240 

CATAATCATA CTCACGCATG AGGGCAATTT CTTCCTTGGC CTTTTCGATT CGTTGGGCAA 15300 

TCACTTCTGC ACTATCTGTT CCACGACCTA CCAAGCGATC TTGCAATTCA TCCAAATCTG 153 60 

GTGGTGTCAG GAAGATAAAG ACAGCATCTG GAACCTTTTT CTTGACCTGA AGAGCACCCT 15420 

GAACTTCAAT TTCAAGGAAA ACATCGATTC CCTTGTCCAA GGTTTCATTG ACATAGCTCA 15480 

GAGGAGTTCC ATAGTAGTTA CCGACATATT CTGCGTATTC CAACATCTGT CCTTGACGAA 15540 

TCAGCTCTTC AAATTCTTCA CCAGTACGGA AGAAATAGTC AACACCGTCC ACTTCTCCAG 15600 

GACGTTGTGC GCGTGTCGTC ATCGATACAG AATATTGAAA TTGGTTTTCA GAACTCTCAA 15660 

AAATCTCTCT TCTAACCGTT CCTTTTCCAA CCCCTGAAGG ACCAGAAAAA ACGATTAGTA 15720 

AGCCTCGGTC TGCCATTGTG TCTCCTTTTA CTCAATCTGT GAAATAACAT TTCTCTAGAA 15780 

TAATGGCAAA AAGCCAGATT ATCCTTTACA GTCTTTCTAT CTAGTGTAAC AAAAAAGCAC 15840 

TAATTTTTCA ACTGCTCTTT CTTATTTATT TAGCATAATC TACTGCACGA ACCTCGCGAA 15900 

TCACGGTTAC CTTGATATTT CCTGGATAAT CGAGATTGTT TTCAATTTTC TTACGAACTT 15960 

TGTGAGCCAA CATTGTGACT TTCTCGTCCT TGATTTTTCC TGGATTGACC ATGATACGAA 16020 

TTTCACGTCC TGCTTGAAGG GCAAAGCTAG TTTGCACTCC TTCAAAGCCC TTAGCAATTT 16080 

CTTCCAAATC ATGGAGACGC TTGATGTAGC TTTCAAGAGA CTCACTACGA GCACCTGGAC 16140 

GGGCTGCCCT CAAGGCATCT GCTGCAGCGA CGATAACTGC TATCACGCTC TCAGCTTCAA 16200 

CATCTCCGTG GTGACTAGCA ATCGTATTCA CCACAACTCC GGGTTCCTTG TACTTACGGG 16260 

CCAATTCCAT ACCGATTTCA ACGTGGCTAC CTTCAACCTC ATGGTCAATG GCTTTCCCGA 16320 

TATCGTGAAG GAATCCAGCA CGACGGGCAA GAGCCGCATT TTCACCAAGT TCGCTCGCCA 16 380 
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TGATACCAGC CAACTTAGCA ACCTCAATCC AATGGCGCAA AACATTTTG? CCATATGAAG 16440 

TACGGAACTG CAAACGTCCC ATAATCTTCA TCAAGTCTGG ATCAAGGTTT GGCGCACCAA 16500 

TTTCATAGGC AGCAGCCTCA CCGTATTCAC CAATCTTATT GTCAATCTCT TGACGGTTTT 16560 

TCTCAACCAA CTCTTCGATA CGAGCTGGAT GTATACGACC ATCTTTGAGC AACATTTCCA 16620 

TAGTCATACG GGCAATCTCA CGACGAATCC GATCAAATCC TGACAAGGTC ACCACTTCTC 16680 

GTGTATCGTC GATAATCACA TCGACCCCTG TCAAACTTTC AAAGGTACGA ATGTTACGAC 16740 

CTTCACGACC AATAATGCGT CCCTTCATAG TATCGTCTGG CAGATGAACT GTTGAGTTTC 16800 

TTGACTCCCC TACATATTCA CCAGCGATAC GTTGCATAGC TTGAACCAAG ATGTCCTTGG 16860 

CCATTTTGTC AGAACGTTCC TTGACCTCTT GCTCACCTTC GCGAATGCGA CTCGCAATCT 16 920 

CCCTGGTCAA GTTTTCCTCT GTCTGAGCCA AGATAATATC TCCTCCTTCT GCCTGAGACA 16980 

GCGCACCAAT ACGCTCTAGT TCTCCTTCTT TTTGTCTTTC GACTTCCTCT AATTGCTCTT 17040 

CACGCGCATC AAGCTTTTTC GCTCTATCAG AAATACTTTG TTCTTTTTGT TCAAGTGTTT 17100 

GTTCTTTACT CGTCAAATTG TCGTCCTTAC GGTCAAGCCT AGTAGCTCTC TCTGTCAAAC 17160 

GACTTTCGAT TTGTTTGAGT TCTTGACGTT CTGATTTGAA TTCAGCGTCC ACTTCTTCAC 17220 

GGTATTTTCT GGCTTCTTCT TTGGCCTCCA ATAGTGCTTC TTTTTTAAGA GACTTGCTTT 17280 

CACGTTTGGC TTCATTAACA AGTAAATCCG CTTCACGCTC AGCTTGTCCA CGTAAATTAG 17 340 

TTGCTTCTTG TTCAGCATTT AAAAGCATCA ACTCTGCAGC TTCCTGAGAT GATTTCATCT 17400 

TAGCTGAGAT GCTGACATAT CCAATGACTA AACCAATGAT GACGGCAAAA ACAGCAATCG 17460 

CAAGCGACAT GATTTCCATG TTTTTACCTC ATTTTATTGT TATTCCGAAT CACATACATT 17 520 

CTTTTACATT CT AC CAT AAA AAAGTGATTT TCACAAACCT AAAATAGAAT ATGTTTTGAC 17580 

GAATTTGGAA CACATTTACC AAAATAAACT TGTTGTTTAG AAATAGTAGT TTAGTAGAGA 17 640 

CTTGAGAAAA AGCCTACCTT TCAATAGACT TAGTAATGAT CTTTAAAGGA CAAGAAAGCC 17700 

ACCCTATCTC CATCCATCAT ATAAATCAAC CGATTTTCTG CATCAATACG CCGTGACCAG 17760 

GCTCCTTGGT AATCATATTT GAGTGGTTCT GGTTTACCTA TTCCTGTAAA GGGATCACGT 17820 

TGAATATCCT TGATTAGTTT ATTGATTCTT TTTAACGTTT TCTTATCCTG ATTTTGCCAG 17880 

TAGCAATAAT CTGCCCAGGC ATCTTCTGTA AACTTGAGCA GCATTTCTTA CTCCTCAATA 17940 

ACATGGACCT GAGTACTTCC AGCACGAACT TGAGCCATTC CTCGCAAAAC CTTATCAGAA 18000 

AGTTCCTTAT TTTGAGCAAT TCTCAGGGTT TCTTGGATAC TATCCCACTC ACTCTTTGAA 18060 

AGGACTACAA TGTCCTCATC TGGATTTTTA TTGACCACCG TCAAAGGCTC AAATTCATCA 18120 
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TTTACCTTCT TCATGTAGTC CTTTAAATGA TTTCGGAATG TTGAGTAAAG GACTGCTTCC 13 ISO 

ATAACCATAC CTCGTTTTAC CTCTTTTCCA CTATTATACA CGAAAAGAAA GAAATTCTCA 18240 

GGAACTTGTA CAAGATTTTC TTTTCTATCT ATTTATACTC AATGAAAATC AAAGAGCAAA 18300 

CTAGGAAACT AGCCGCAGGC TGTACTTGAG TACGGCAAGG CGACGTTGAC GCGATTTGAA 18 360 

TTTGATTTTC GAAGAGTATT ATTCGTAAAA AATCTCAAAA AGCCTACCTT TCGCTAGACT 18420 

TAGTTTGTTT CTATTC 18436 
(2) INFORMATION FOR SEQ ID NO: 88: 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7001 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

ACGTAGAAAA ACTATTTCTA TCACAGATAA TATTCCGTAT GTTGTTCGAG GTATTGAAAT 60 

AAACGTCCTA GGTATCTTTC TCACTCTATG TGACTTACAA GGGAAAACTC TTTTCGAGAC 120 

AGAAATTTTG AATGAAGATT ATCCTATTTC AGAAATCAAT TCCACCATTA CCAATATGAT 180 

AAAAACACCT ATAGAGTACG TCCCTTTGGA AACAAAATTA CTTGGATTTG GCTTATCAAT 240 

ACCTGGACAT TATAACAAAG ACTCCGGAAG TATCATTACA AACAACCCCA TATGGGAATC 300 

TTTTAATTTA TTAAATGTAA TTAAAAGATT CAATTTTCCT TTTATTGTAA AAAATAATAT 360 

CGATTGTATG GCTATAGGAC AATACCTTTT TAATCCACAC AATACCCCCG ATAACTTTAT 420 

TTTCCTACAC GCTGGATTAG GTATTTACAC TTCCTTTTTC ACAAAAGAAA AAATAGGAGC 480 

CTCTAAAAAT CCTTATATCG GAGAAATTGG ACACACCATT GTCGAATTGA ATGGGCAATA 540 

TTGTGAATGC GGAAAAAAAG GTTGTTTACA AACATATATT TCGGATGCTT GGTTAATCAA 600 

ACACGCCCAA TTATTATTTA AAAATTCCCA ACTAACTGTA CTAAAAAGCC TTCTAAAGAC 660 

TGAAAAAGAC ATTCATTTAG ACACCCTTTT AACGGCTTAT AATTTAGGCG ACTCCGCTTT 720 

ACGTCAACAA ATTGATAAAG GAGTCAATTT ATTAGCCACT TCTATTGCAA ATCTCCTCCT 780 

CATCAATCCT GCTGATAAAA TCTATATCAA CAGTCAATTC CTTAATTATC AACCTTTCAC 840 

TCATGAAGTC AGGGATAAAA TCCAAGACCA GCTCCACTTC GTTCCCTTTA CTCGTAATAT 900 

AGAAATTGAA ATTTTACCTT ACAACAAACA TCGTGGAAGT ATAGGACCTT GTGCATTAGC 9 60 

TATCGTCGCT TTTTTCATAG AACATAGCAA TGTATTACAA GATATTATTT CACCTTAATA 1020 

TATTAGAAAT CTATAGACCT GTTTAAATCA ACTATAACCT GTAGTAGATA TCTCCTATTT 1080 
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AGACAATATG AAAACAAGAC GACTTCCATA TAGGAAACCG CCTTCTCGCT ATGTTGAGTG 1140 

ATTTATATTA AAATAACTTT TCTTCTAGCT GCATTTTATT ATTATAAAAA CATTCATCAT 1200 

AACCCCCAGA ACTTAAATAA CAATTTTTAT TCAAGATACA TACTCCTAGA ATAAACTTTA 12 60 

TATCAAATTC TCATTTTTGT TTTTACAATT CTCCTTAGTT AAATCTTGTT TAATATATGT 1320 

TTTACATATA GTATTTAGCG CCACATAGTA CTGAACTCTC TCCAAAAACG GTTATTCCTC 1380 

TTTGAATAGG GCGTTATCAC AAGAAAAGCA TCTCCACGTT TCAACTTCAT ATGGCTCAAA 1440 

AACAATCAAT TCATGCTAAA ACCTGTACCT AGATGTTTCG GTTCATAAAA CCATGAAACT 1500 

GTAAAAGTGG ATGAAATTGA TAGCGATAGT CAAATCAAGA GGCATCATAA CTCTAAAAAC 1560 

TCACAATATA TAAGTTCATC CTCGGAAAAA TATCATTCTA ATTGTTGAAA TGCCTACATG 1620 

AAAAGAAACG TCAAATGCTC ATGAAACAAC GAATACAGGT ATCAAAACTA TGACAAAACA 1680 

AATCCCTAAA TTTACTAAAC ACACTGCTCA ACTTTACACC TGT AAATGGT TGTTGTATAA 1740 

TAAAGTTACA AAGATGTACG ACCACACTGT TGTAAATCAT AGTGTTCGCG AATATATTAC 1800 

TGATAGCATT TCTACAAATA CAACTAAACA GAGCGGATGA GATTCAAACG AAATATGTCA i860 

GTGCTTTGGC ATTCCTAGCC TTCATATCAT TTAAAGAATT CTATAGACAA AATTTTTTCC 1920 

AATACAGACA CTCGTAACAA CTGCTTCATT TTTCTACCAA CATATTTAGG AACAGGATAA 1980 

GATACAAGAG TATTAATCCA TAGCTCAGTT CTATACCAAT CTAAGACAAA TAACCTAAAA 2040 

AAACGATTGA TAATAAGCAA ATAGATTCCA AATTTTCTCT ATCTGCTCAT TTTAATAAAC 2100 

AATACTAGTG TAACTATCCT TCCAGTCAGA ACCTTGTCAA ATCACACCGA AAATTCTTCT 2160 

AAAATTTATC TCGTTAGCCA ATCAAGCAAA AACTCGACGA TAGTACAAAC ATTATCATAC 2220 
AGGATTGACT TCCTAAATTA TATACTTTAG TAACGTTTTC CCATAAGAAA AAAGGTTCAT 
TTTACATTTC TAAACATTCT TTTCTAAGAT GAAAAACAGA ATTTTTCGAT TGTGATTTAA 

AGCAACAAGA AGATTTTCAG TATCATCCTA TAGATACGAG CTAATTAAGA AAAACTACAT 2400 
TTTTGAATAT AAACTACAAT AATATAAAC7 AAATTTTATA GGACCAAGAC AATGGATTGG 2460 
TACGATTA7A TGATACACGC ATCCAAACAA TCACAATTCA ACGCAAGCCA TTGGTTTCGC 2 520 

TATTTGCGAA AAGTTATTTT TGAAGACTAT TCTTATTTAA CAAACCAAGA TGTAGAAAAG 2 580 

TTGCTAGACT CCAAAGAACT AACCCGTTTT CAAAAAATTA GCTTGAAGTA TGCCTTTCAA 
GAGCATACTC CAACTCATAA ATATGTGATT TCATTAAATA AACCTGCTAA GTTAACCAAT 
GTTCAAAAAT TGATGGAGAA ATACAAACAT GGATAAAATG AAACCGGTCT TCCAAGCCCT 
AAATAAGGAA TTAATTCAGG AAAATCTGAC TTTAACAATT ATCTGTGTCG CTCCTTATCT 



2280 
2340 



2640 
2700 
2760 
2320 
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CTTAGAATAT CATCGTTTAC GTGCCACACA AGATGTTGAT GCTTTTATGG CTCTATAATA 2880 

TTTCTAGTGG GTAAATCCCC TATGGATATT ATGGAGCCTA TTTTTGTGTA GAAAAAAAGT 2940 

CCCATATGAC CTATAATGAA AAGCGACAAA ACAACTCATT AGAAAGAATC ATATGGAACA 3000 

ATTACATTTT ATCACAAAAT TACTAGACAT TAAAGACCCT AATATCCAGA TTTTAGACAT 3060 

CATCAATAAG GATACACACA AGGAAATCAT CGCCAAACTG GACTACGACG CCCCATCTTG 3120 

CCCTGAGTGC GGAAACCAAT TGAAGAAATA TGACTTTCAA AAACCGTCTA AGATCCCTTA 3180 

CCTCGAAACA ACTGCTATGC CTTCTAGAAT TCTCCTTAGA AAACGCCGTT TCAAGTGCTA 3 240 

TCACTGTTCA AAAATGATGG TCGCTGAAAC TTCTATCGTC AAGAAGAATC ATCAAATTCC 3300 

TCCTATTATC AACCAAAAAA TTGCGCAAAA GTTGATTGAG AAGATTTCTA TGACCGATAT 3 3 60 

TGCTCATCAG CTGGCCATTT CAACTTCAAC TGTCATTCGC AAGCTCAATG ATTCTCACTT 3420 

TGAGCATGAT TTTTCGCGTC TTCCTGAGAT TATGTCCTGG GACGTTGAAA CAGTCCGGGG 3480 

AGTGACTGTT TCAATCGGGA GATGGAGATG AGCTTTATTG CGCAAGATTT TGAAAAGCTC 3540 

GATATCATCA CTGTTCTTGA AGGTAGAACA CAAGCTGTCA TCCGAGATCA CTTTCTTAAA 3600 

TATGATAGAG CCGTCCCATC TCGCGTCAAA ATTATTACTA TGGATATGTT TAGTCCTTAC 3660 

TATGACTTAG CTAGACAACT TTTCCCGTGT GCTAAAATCG TTCTTGATCG CTTTCACATT 3 720 

GTACAACATC TTAGCCGTGC TATGACTCGT GTCCGTGTCC AAATCATGAA TCAGTTTCAT 37 80 

CGAAAATCCC ATGAATACAA GGCTATCAAG CGCTACTGGA AACTCATTCA ACAGGATAGC 3840 

CGTAAACTCA GCGATAAACA TTTTTATCCC CCTACTTTTC GTATGCATTT AACCAATAAA 3 900 

GACATTTTAG ACAAGCTTTT GAGCTATTCA CAAGACTTGA AACATCACTA TCAGCTCTAT 3 960 

CAACTCTTGC TGTTTCACTT TCAGAATAAG GAACCGGAGA AATTTTTCGA ACTTATCGAG 4020 

GACAATCTTA AGCAGGTTCA TCCTATTTTT CAGACTGTCT TTAAAACCTT CCTCAAAGAT 4080 

AAAGAAAAGG TTATCAACGC CCTTCAACTA CACTATTCTA ATGCCAAACT GGAAGCGACC 4140 

AATAATCTCA TCAAACTTAT CAAGCGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 4200 

AAAAAACGGA TTTTTATCCC TCTGAATATC AAAAAAGAAA GGACAAAATT TGTCCTTTCT 4260 

CGAGCTTAGC TTTTTTTCAA CCCACTACAG TTGACAAAGA GCCGGAAAAA GGAACAGCCT 4320 

TAGCTTTCCT TTCATTTCTT TTTATTTCCC TCGTAGTAAA CGTGCTAGCT TCCACAAAAC 4380 
AAACAGGATT CCCAGAAATG CCAGTACCAC TAGCCCACGG TACAACCATT GAGAGGTTGC 4440 
AACACGCGAT ACAGATTGTC CTTCTTTCGT AAAAGCAACC CTCGCAACTG CACCTGTTTC 4 500 

TGGATCTGAT TTTTGATAAA CAGCGACTCG TTCAAAATTC ACTAATAAGC GTTTATTAAA 4560 
GGTAGGAATC GCATCGCAGG TTATCAAGGT CATGATATTT TTAGAGCTAA CCGATTCTAA 4620 
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TTTTTCCCAT TCCGACGCTA AAATAATCTC TGTGTCCATC ATCTCATATT CTACAATTTC 4 680 

CTGGCCATTA TCATAATAAA GAGCATCTCC AACTTTTAGC TGATCCAAAT GGCGGAAAAA 4 740 

CACATGGCTT GGCTCTGCAC GGTGCCCAGC AATCACTCAG CGAATCCCTG TACCATCCAG 4 800 

AGGCAGCGGT GTACCATCCA CATGAGCCAA GCCCATCCCT AAATGATGAT AATCTGCTCC 4 8 60 

CAAATAAACC GGCTCCATGA TTTCCAAACT TGGAATAGAC AAGTAACCAT AGACTGCATC 4 9 20 

AGGGTCGTCA GACACTTGGT AATTGACCTC ATATCCCTCC GCCAAAAAAG GATCTACAAT 4 980 

CCGATTTTGC GAAGCCAAGC GTTGATTGTA GGCGAGAGAA TGGTTCTGTT GTTCTTGGTA 5040 

CATTTCAGTT GTCATGGATT TCACAAATGT AGCATGACCT TTCACCTGTC CAAGAGACTG 5100 

CAACACCATC TGTCCAAAAC AATAAATAGG AATCAAACAG GCTACCAACA TCAACAAGTA 5160 

TCCCAATAAG GCTCGTAGTT TAGTCCTTGA CATGACGCCC CTCCAATTGC TTTTCTAGTC 5220 

CTTTGACAAT CCGTCGATTA CGATACACGC GATACAGCAA GAGAAGGATG ACCGCCATCG 52 80 

CTCCTACTAA TAACCACAAC CAGAATTCCC CACGCTCTCT CACCGCTCGA TTCCGCTCTG 5340 

CAATTGGTGC CGTATACGGA ATCCCCTTCC CACGTACCAA CAGACGATGA CTGTTAATCA 5400 

TATACGGTGT ACAAGTCAAC AAGGTCGCAT AATCTTCCCC ATGTTGAATC AAGACAGGCT 54 60 

CAAAGTCATT CGGCTCCACC GTCACTATCT GATCCACTTG GTAGGCCAAC ACCTGATCTA 552 0 

AAACGTGAAG ATAAAAGATA TCCCCTTTTT TCATCTTATC CAATTGACTG AACAAT TCTG 5580 

CCGTTGGCAA TCCTCTGTGA GCACTGATCA CTGTATGGGT ATTTTCACCT CCAACAGGCA 564 0 

GCGAAGCCCC TTCTAACACC CCTGCCCCTT TCTGAAGAAT GTCCTCACTC CTTCCGACAT 5700 

ACATCGGAAT TTCCTGATCA ATCGCAGGAA TTTCCACATA GCCAATCCGC TCATGGACCT 57 60 

TTAGCATATT GGCATATTCT GAGACGCCTT TCTTTTTCTC TTGCTCTGTA AAAGGATCAA 5820 

GAATTTCAGA TGGTTTCAAG GTCGCATTGA AGGCTTGAGC CAAGCGCCAA CGCTCCTCAA 5880 

CTTCTGCCTT ATCCATCTGG GAAACCGTCT CATCAAACTC TTTAATAACC TCGTTTGACT 5 940 

CAATACGATA ATAATAACGA GACACCAATG GATATATCGC AACGGCGAAT CCTACTAAGA 6000 

AAATCAGAAG AAGGATCAGC GGATGTTTCT TCTTTTTTGT GCCTTTTTTT CGTGAACGTC 606C 

TACTGTTGTC CATCCTCCAC CTTCACTTCC TTCCTTGCTG CTTTCAGCGC CTTCAAAGCC 6120 

TTTTCCGGTT GTTTTTTCTT CTTGCGCAAG CGTCGAATAA TCCATAAAAG AATCACAATC 6180 

AAACCAACTG CCACATAAAA CAGGTAGCGA TAGAGATGAC TGAGTTTGTT TGCTGCAATA 62 40 

AATTCTTCCT CAACCTCTGC TACGTACGGT ATCCGATGCC CCCGAACCAA TAGACGATCG 6 3 CO 

GTATTGATCA TGTATGGCGT ACAAGTCAGC AAGGTCACAT AATCATGACC TGGTACAATC 63 60 
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AATAAATCAT CAAAGTTCGT CCGCTCAATC ACCTTTACTT GATCCACTTG ATAGGCCATC 6420 

ACTTCCTTCA TATTGTGCAC ATAAAACTTA TCCCCAACTT TAAGTTTGGT CAAATCCGTA 6480 

AACATCTTAG C7GTTGGCAA ACCTGTATGT GCCGTAATCA CCGCATGGGT CGAATTCCCT 6540 

CCGATCGGCA CAGAAGTTCC CTCTAGATGC CCAGCCCCTT GCTGCAATAC CTCTTCAGCA 6600 

GTACCAGCAT AAACCGGCAA ATCCACGTCA ATAACGGGGA TTTCCACATG CCCCATCCGC 6660 

TCATGGATTT CTAACATACG TGCATACTCT GCTCGCCCTT TTTTCTTCAT TTCTTCCGAC 6720 

CAAGGATCGC CACTCACTAC ATTATTCAAA GAGTCATTGA AGGCTTGTGC CAATTTCATT 6780 

CGTTCATCAA TGTCAGCCTC ATCCAACGTT GCTTTTTCCT TATCAAAGTC AGCAATTTGT 6840 

TGATTTGATT CCACTCGATA ATACAAGCGA GACACCAGCG GATACGCCAT TACCGCCATT 6900 

CCAATGAAAA ATACCACTCC TAATAGGAGA TTATTTCGTT TTTGCTTTTT TGTTTTTACC 6960 

ATTTTTATCA GCATCCCTTT ATCTTCAAAC TTCAGGGTAT C 7001 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 10411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



GAGGCAGCTT 


AAGAAGTTAC 


CACCGTCCTC 


TAGCGCCTTA 


TCCGCATCAA 


AGTTAACGTT 


60 


GATATTTTTA 


AAACTGTCGC 


CAGCTTGTGA 


TACGATCCTT 


TCTTTAAGGT 


CATTTAGGGT 


120 


TTTAGTGAAA 


TCTCCATTGC 


TGAGGATATC 


ACTCTTTGAG 


AGATTCAAGG 


CAAAATTGAT 


180 


GATGATATTG 


ATCTGGTTTC 


CTCTTATGAC 


CTGATCAACT 


TTGTAATTTT 


TTAAGGTATC 


240 


TTCAACAATC 


TTGCGGATAT 


CTTCTTCTGT 


CAGATTTCCC 


TTACTTTCTT 


TAGCTTTGGC 


300 


GAGTCCTGAC 


TTCATATCAG 


CTAGGGCAAC 


GTTTAATTTA 


TTAGCATCAT 


AGCCTGAT7T 


360 


GTCCTTGTTT 


TCAGCATTGA 


TATCTGACAA 


AGCTTTTAGC 


TCTTCTTGAG 


CCAAATCTTT 


420 


ATTAGCTTGT 


GGCACCTTGG 


CTCCATTAGC 


CTCTAGCGAA 


TAGTAAATCC 


CTGCTAAACC 


480 


ACTTTCTCCT 


GTAACTGGAA 


TAGGCCCTGC 


TACAGTGATT 


TTGGCATGTT 


CCATACCCAG 


540 


CGTTACTGC7 


GCGTTTCGGT 


ACATATCCTG 


AGTCACCTTA 


GTGATATTTT 


CTGCTCTTTC 


600 


AATCTTGACC 


TCAAGTGGCG 


ATTTGTCACC 


TAGCTTTTGA 


ATCTTGGCTG 


ATGAATACAA 


660 


CTGTAAGCTA 


GAGTCATTGG 


CCACATTCAT 


GATTTTAGAA 


TAAACATCAG 


GTGTCATGGT 


720 


CTTGAGTTC7 


TTGGTATCTG 


TTCAGGCATT 


GTAGCCCAGT 


TTTTTAACAG 


TTTGATTTTT 


780 
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TTCCTCTTCA GATAGGGAGG AACCTAGGAC ATATTCAGGT TCGACATAGG TTTCATCCAT 84 0 

AACTTTTTGA ACATCTGTTG CTGCATGGAC GCTATTCATA GCTGTTACTG CCCACAAGAT 900 
CGCAGCGCTA GTCAGAAAGA GTTTCTTTCT CATAGGGAA7 TTCCTCCTTT ACTTCTTTAG 9 60 

AGTAATATAT CTATCTTAAA GAAAACTTAT AACAAAAACA CCTCGTCTAG CCAGATGTTG 1020 

AAAAGAGAGT GAAACATTTG ATGATGTAAA GCTTAAGTCG TACCTCTCTA GAATAATAAT 1080 

AGTTTCCTCC ATTTACATAG AGTTCAGCAC CGTGAAAAAT GGAAATGGCG TGAATATAAC 1140 

TATAAGTCTT TCCAGTCCTA TTACCAAGCA AGGGGGCAAC AGTCTCACGA GAGTACTGTT 1200 

TGGCTAGAGC CAGGGTATTT TCCTTGCCAT TTTGGGCGAT AAAATCGATA TAGGCAGGTC 12 60 

CAAAATTATA GGCTTGAACA GCTGTCCAGA TATCTACCCC CTTCTTCTGC GCCAGATAGA 1320 

GATTGCCTGT CAGAGTTTGA ATGCCTTGCC GAATGCTAGA GGCATTATCA TTGATGGTGT 1380 

TGGTGGAACC ACTTGCAGAC TC ACT AG ACT GCATAACATC GCCTTCTTTT CCTTTTGTTT 1440 

C AG TAT AAA? CATAGCAAGC ACAAGCTCTT CGTTTGCTGG GGTGTCTTGT TCACTCAATA 1500 

TTTCTCGCAC CATGGGTTGA TAGGTCATGA CTTCTTTGAC ATCTTGATGA ACGCGGTAAG 1560 

CTTTATAGCC AGCAAAAAGG AAGACTGCTA GTACAAGCAC TCTTCGAATT CGTTTAAACA 1620 

TTATTTACTT TGGATATCCT CGATATTTTT GATTAACATA GAGTAGGTTC CATTTTCGTT 1680 

TTGGATAAAC TCAACAGACT CGGCGTCTTG ATAGACGTTA TTGGGAACGA TGAGCTCAAT 1740 

TCCATTTGAT AAGGAGAGTT TTTGGTTTTC AAATTTCTTT AATTGGCGAC TGGCATCAAT 1800 

TTCATCAAAT TGAACAGGTT CTGGTACGGC TTCTTTGACT TGGTCAATAA AGCTCAAACG 1860 

AGCCGTCAGA TTCTTGTCAA AAAGGTCATT AGCCAATTTC TCACGTGACA ATTCATTGCT 1920 

TTCTTCTAGG TTGTTGAAAA TAGCTGATTT GACCTTGGAT TGAAATTGAA AATCATCTGT 1980 

GTTAAAAGAT TTAGCAATTC TCTGGGCTGT TTTTTCCAGT TCCTTGATAG ATTTTTTAGG 2 040 

AGAAATCTTA GGAGCGACAG CAAGAAGATT ATCTGAAAAA TAGTTCAAAA AAGTCCCGTT 2100 

GTACTTGATT CGTTTTTCAA TCAGGTGATA CTTGCTACTC TGAAGATTGA CCACCAAGGC 2160 

CTCATCAGCT CCTGTTCCAA ATCCAGGCAG GTTATTCTCA GTTAGCTTGA TTGGATTATC 2 220 

AACTTCTCCT CCGAGGTGGG TCAAGGTCTC CCGCAGGGCA ATTCGCAAGA AAGCGAAATG 22 80 

TTCTACACCT TCTTTAGAAA ATTGCACAAA AATCAAGTCA TTGGTCTTGA GATTTTCAGA 2 340 

AATGCTAAAC TCCTCTTTCC AGAGATTAGC CAGCGTTACT GATGTCTCCA ACAAATCGTC 2400 

TGTAATATGA TTGAAGAAGG GATTTTCTTC TTCGAAAATC CCAGTCTTGG CTTCATCTGA 2 460 

ATACACATGT TCAATTTTTT 7ACGCAGGTA TTCTTCGATT TTTGGAGTAA T ATT GAG AAA 2 520 
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CTTATCTGCT AAGAACACTT CGGTATCATC CGGACTGAAC TGGTGAATAA TGGCTTTCTT 2580 

AATATAAATG TCCATAAAAG TTTTAGTCCT CGTATAATGG GAAGGCATCT GTCAATTCTT 2640 

TGACTGCACT TCTCACTTCT TCTAATACAG CCTCATTTTC TGAATTCTTA AGGGTTTTAA 2700 

TGATGAGTTC AGCCACTTTG CGACTTTCTT CTTCACCAAA TCCACGTGCA GTAATGGCTC 27 60 

CTGCTCCGAT ACGAATCCCA CTTGTCTTGA ATGGTGACAA GCTTTCGTAA GGGATTGAGT 2920 

TTTTATTTAA GGTAATATTG ACTTCATCCA ACAAGTTTTG AGCAACTTTG CCGTTTTCTA 2880 

CAACTTTAGT CACATCAACA AGGAAGAGAT GGTTTTCAGT TCCACCTGAA ATAATACGGA 2940 

AATCAGGGTC TTGCAAGAAG ACATCTGCCA TAGCCTTGCT GTTCTTAATT ACATTGGCAG 3000 

CATATTCCTT GAAGGCTGGA TCCAAAACTT CTTTGAAGGA AACTGCCTTA GCCGCCACAA 3060 

CATGCTCTAA AGGACCGCCC TGAATACCTG GGAAAATAGC TGAATTGATT TTTTTAGCAA 3120 

GTTCTTCGTC ATTGGTCAAA ATCAAACCAC CACGAGGTCC ACGAAGGGTT TTGTGGGTCG 3180 

TTGTTGTTGT GATATGAGCG TATGGAACTG GGCTTGGATG AACGCCAGCC GCAACCAAGC 3240 

CAGCGATATG GGCCATGTCC ACCATGAGCT TCGCACCGAC AGCATCTGCG ATTTCACGGA 3300 

ATTTTGAAAA ATCGATAATT TGAGAATAGG CTGAAGCACC AGCTACAATC AGTTTTGGTT 3 360 

TTACTTCTTG GGCTT G TTTC AAGATAGCAT CAAAGTCTAA GAGTTCCGTT TTAGGATCAA 3420 

CACTATAAGA AACAAAGTTG TAGGTTTGAC CAGAGAAGCT AACAGGAGCC CCATGAGTCA 3480 

AATGACCACC TGATGCCAAA TCCATTCCCA TAACCGTATC ACCTGGCTCA ATCAAGGACA 3 540 

TGTAAGCCGC ACAGTTACCT TGGCTTCCTG AATGTGGTTG AACATTGGCA AATTTAGCAC 3 600 

CGAAAATTTC TTTTGCGCGT TCAATAGCAA GAGTCTCTAC AACGTCTACT ACATCAGTTC 3 660 

CACCATAATA ACGGCGTCCT GGGTAACCCT CGGCATATTT ATTTGTCAAG ATAGACCCTT 3720 

GAGCTGCCAT AACAGCCTTG GAAACTACGT TTTCCGAACC AATTAACTCG ATATTATTTT 3780 

GTTGGCGTTC TTCTTCTTTG GCAATAGCAT TCCAGAGATC AGCATCATAT GCTTTAAAAT 3840 

CATCTTTGTC AAAAATCATA GGTCTTCTCC TTTATTCTCT GACTAGTCCA TTAGTTTGAT 3900 

TTTACAATAA GAAAATCAAA CTAACAGATG CGAATAAACC GTTTCTGCAT TTTATCACAA 3960 

GTATAGCCAA CTTTTTCATA AAATGCATGA GCACCCAGAC GATGATTGCC AGAATTTAAG 4020 

CGGATAAACC CATAACCACA TCTTTTTCCT TCTTCTTCCA ACCCTTGTAG TAAACTTTTA 4080 

CCAATACCTT GACCTTGCGC TTGAGGTGAA ACTGCTAAAG CTAAGATATT AAATCCTGCT 4140 

TTGGAATAGA GTGATTCGTA AACTTCAGCG TGGACATATC CAAGTAAGAC ATGATTAGCT 4 200 

GCATCCTCAT AGCCAAGTAG GAAATGATGG GAATCCTGAG ACAGTCTAGC TAGTTGGCTA 4260 

GCCGTTTCCT CTGGACTAAA AGTATAACCC AAAGCCTCTT GGTTGATCTC AC AT AT AG CT 43 2C 
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TTCACATCAG TTTCTCTTAA ATCTCTTAGC ATCTCATTCC TCCTCAAAAG AAATCTTTGG 4380 

CAACCGAGCA AGAATATCTT CTCCCTTAAT GGCCCCTTGA CCTAAGATTT TCACCTTGTC 4440 

TCCCGACAAA TTCAAAATAG TTGAATCCTG TCCAGTTAGA AAAGCATCGT CTTCCAGACC 4500 

GAGAACCTCT TGGTCAAAAT CCTCTAGAAT TTGATTAAAG GTCACTCCAC TCGCCTGACC 4560 

TGAGATATTG GCAGACGGCC CAATCAAGGG ACCTGTCTCT CGAATCAAAT CAAGGGTAAT 4 620 

GGGATGACTA GCCATCCGAA ATCCAACAGT TGCAAGGCCA GAATTGACCC AATAGGGAAC 4 680 

TCGGTCATTA GCTTCGAGAA TAATGGTCAA GGGACCTGGT AAAAAGATCT CTACAAGTTT 4 740 

TTGAAGATAA CTTGCC7GAT TCTTTGAAAA GTACAAGATG TCCTCTAAAG AGGCAACATT 4800 

GAGATTGAGC GCCTTGTCTC TACGTCGACG TTTAAGCTGG TAAACATGGT CAACTGCTTT 4860 

TTCGTCTAGC GCCTTAGCAA AGAGACCGTA AACTGTCTCT GTAGGCAAAA CGACAGCTCC 4920 

ACCATTTTCC AACTCTTGTC TAATCCTGTC CATCATCAAC GACAACCATC CTATCTTGAC 4 980 

CAAATTGGTC CTTCAGTCTT CGTACTCGCT TTTCAGGAAG ATGTTTCCTA AAAACTTCAG 5040 

GAACACTTTG ACCTTGCTTG TATCCAATTT CAAGGTAAAT CTTACCACCA TCTTTGAGAT 5100 

AGTCTTTTGC ATCTTCCGCA ATTCTACGGT AAATAGCTAG GCCATCCTCA TCTCCAAAGA 5160 

GAGCTAGATG AGGCTCCGAA TACAAGACAT TCAACCCTAC CTCTGACTCA TCTTCACGAG 5220 

AGATATAGGG TGGATTGGAA ACAATTATAT CATATTTTTC AGAAATTTCT GTAAAACAGT 5280 

CACATTTTTT TAAAAATATT TGAAGATTTT GATTTTTAGC ATTTTCGCTA GCTACATCTA 5340 

AAGCATCTTG GGAAATATCT GCTGCCGTCA CTGACCAATC TGGTCTGTTT TTTGCTAGAG 5400 

CGAGAGCAAT AGCTCCACTA CCTGTTCCGA TATCTAGGAC CATAAGATTT TTCACAGGAT 54 60 

TTTCAGCCAG GATAAGCTCC ACCAACTCCT CTGTTTCTGG ACGAGGAATC AAAACCCGTT 5520 

CATCCACCTT TAAATGCATT CCATAAAAAT CTGCCTGTCC AATGATGTAC TGAGCTGGCT 5580 

TGTGAGCTGC TAGTTGCTGG TAAATATCTT CTACAAATTG T7TTTCTTCC TCTGTTGTCA 5640 

CCTCCTGCTG GAGGGCAAAA ATAAAGTCTC TAAAAGATAG ATTTTTCAGA CTACGATAGA 5700 

CAAAAGAGAG GCTTTCCGCT TCCTCTCCTT GTCTTATCAA CTCTTCTTCA AAATTTGAAA 57 60 

ATAATTGAGC TAATTTCATT ATTTGTTTAA TTCTTCTAGT TT7TGTGTTT GGTCATAAAG 5820 

CACCAAGGCA TCCACAACTT CGTCCAATTT ACCAGACAAA ATCGTATCTA GTTTTTGGAG 5880 

GGTCAAGCCG ATACGGTGGT CTGTGACACG GTTTTGTCGG AAGTTATAAG TTCGGATCCG 5940 

TTCTGAACGG TCACCAGTAC CGATTGTCGA CTTACGCTCA GCGTCCTGCT CATCTTGAGC 6000 

AATCTGAGCA AAGTGGTCAG CAACACGGGC ACGGATCAT? TTCATCCCCT TCTCACCGTT 6060 
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CTTCTCCTCG GTACGTTCTT CCTGCATCTC AACCTTGATA TTGCTTGGCA ACTGAACGAT 6120 

ACGAACGGCA GTCGCAACCT TATTGACGTT CTGTCCACCA GCACCAGAGG CGTGATAGAT 6180 

GTCGACACGA AGGTCTTTTG GATCAATGTC GTATTCAACC TCTTCAACTT CTGGCATAAC 6240 

AAGAACTGTC GCTGTCGAAG TATGAACACG GCCTTGGCTT TCTGTCACAG GAACACGTTG 6300 

CACACGGTGG GCACCTGATT CATACTTAAG CTTAGACTAT ACAGACTGAC CTCAAACCAT 6360 

AGCAACCACT TCTTTAAAAC CACCGACACC ATTCATAGAG GCTTCCATGA CTTCAAAGCG 6420 

CCAACCTTGG GCTTCCGCAT ACTTTTGGTA CATAGTTAGC AAATCTCCAG CGAAAAGTGC 64 80 

CGCTTCGTCT CCACCAGCTG CTCCACGGAT TTCAAGGATG ATATTCTTGT CATCGTTTGG 6540 

ATCCTTTGGA AGGAGCAAAA TTTTCAGTTT TTCTTCATAT TCTTCTTTTT CAGCCTTCGC 6600 

ATCTTTGAGT TCTTGCTTGG CCAATTCTTC CAAGTCCGCA TCTCCGCCTG ATTCCTTAAT 6660 

CATCTCTTCG GCATCGACGA TATTTTGAAG GACTTGTTTA TACTCACGGT AGGCTATTAC 6720 

GGTGTCACGA TTGGAAGCTT CTTCTTTTGA AACCTCCATA AAACGCTTCG TGTCTGAAAC 6780 

GACATCAGGG TCACTCAGCA ATTCTCCTAA TTCTTCATAA CGGTCTTCTA CAACTTGTAG 6840 

TTGATCATAG ATGTTCATTT TTTCTCCTTA TTTCTCAATT GTTAAATCAT ACATTGCTAC 6900 

TACTTCATTC TCGGATATTT CCCCAGTTTC TTTAAATCCA TAACTGAGGT AACAAAATCT 6960 

TGCCTGTTCA TTTTCTGGTT CATArGACAA CCAAAGTTTA TTGCTTAAAC CTGCTGGCGC 7020 

TGTTCGAACA TAGTCTAGTA CTTTATCCAT AATTGGTTTA AAATATCCTT GATTTTGAAA 7080 

ATTCTTATCA ATCATAAAAC GAAATAGTAA ATAATTTCCA CTACTAATTC CGATCTTTTT 7140 

ATCATAAGCT ATCATCACAA AACCTATAAT TGCATCATTA TCATAAACTG CCAATGGACC 72 00 

TACAAAATCT CCATTTTTAG TGTAGACGTA TCCTTCAGCT AAACTAATTG CGTTGGTTGC 72 60 
AATGAATTGT TTTTCATATT CCTTGACATC CAAATTTAAA ACATCAAAAT AATTTTCCAT - 7 320 

TGTAACATCT CTTAGTTCAA TTGTCATAGT TTTGCTCCTT GTTAGAGGTT ATCATTGGCG 7 3 80 

CAAAATAATG TTTACGGCAA ACTGAGATAT AGGTTTCGTT ACCACCAATC TGGATCTG'lT 7440 

CTCCATCGTA AACGGGCAGT CCATCCTGTG TTCGCAACAC CATGGTCGCC TTTTTCTTGC 7500 

AATACTGACA GATGGTCTTG ATTTCGTCAA TCTTGTCTGC TAAAAGCAAG AGATATTTGG 7 560 

AACCTTCGAA CAATTCATTG CGAAAGTCAT TTTTCAAGCC AAAAGCCATG ACGGGTATGT 7620 

CTAACTCGTC CACAACACGA GCTAGGTCGT AAACATGGTG GCGTTTGAGA AACTGGGCTT 7680 

CATCGACCAA AACACAGTAA GGTTTTTCTG GTAGCTCTCG GATATAGCCA AAGATATCCG 77 40 

TTGTTTCCTC AATCGCAAgG GCAGGGCGTT TCATGCCAAT TCGACTCGAC ACATAGCCAA 7 800 

CGCCGTCACC CGTATCCAGA GCCGAGGTCA TAATCACAAC ACCTTTTCCT TGCTCCTCGT 7860 
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AGTTATAGGC CACTTTGAGA ATCTCAATCG TTTTACCAGA GTTCATGGTC CCATAACCAT 7920 

AGTACAACTG TGCCATGTTT CTTGCTTCAC GTCCATTTCT AAATTTTTGC TACATTCTAG 7980 

TATATCATAA TTTTCTTAAG CTTTAAACGG CAAAATGTCG TAAAATAGAA GAAATCAAAA 8040 

ACTAGTGGAG GAAGCTATTA TGCCATTTGT ACGCATCGAT TTATTTGAAG GACGCACGCT 8100 

CGAGCAAAAG AAAGCTCTTG CTAAGGAAGT AACGGAAGCA GTTGTCCGCA ACACTGGAGC 8160 

CCCTCAATCT GCTGTCCATG TCATCATCAA CGACATGCCA GAAGGAACTT ACTTCCCACA 8220 

AGGGGAAATG CGTACTAAAT AAGCTAGCTT AAGCAGAATT GCTTAGGCTT TTTCAATCTC 82 30 

CAAGTAGCAT TCATTGAAGA AATATCCTAA ATTTGTTACA ATTTGAAAAG AAACTTGGAG 8 340 

AATTTCCAAG AAAAGAGCTA TTAATTAAAG GAAACATTAT GATTACACGT GAATTTGATA 8400 

CCATCGCTGC TATCTCTACT CCACTAGGTG AAGCGGCTAT TGGTATTGTC CGCCTGAGCG 84 60 

gAACAGACAG TTTTGCTATT GCGCAAAAGA TTTTTAAAGG AAAAGACTTG AACAAGGTTG 8S20 

CCAGCCACAC TCTCAACTAC GGTCACATTA TTGATCCTCT GACTGGTAAA GTCATGGACG 8580 

AGGTTATGGT TGGGGCTATG AAGTCTCCAA AGACCTTCAC TCGTGACGAT ATTATCGAGA 8640 

TTAACACCCA CGGTGGGATT GCGGTGACCA ATGAAATTCT CCAGCTAGCT ATTCCTGAAG 8700 

GGGCTCGGTT GGCAGAACCT GGTGAATTTA CCAAACGTCC TTTTTTAAAC GGTCGCGTAG 87 60 

ACTTGACACA GGCAGAGGCT GTGATGGATA TCATCCGTGC CAAGACTGAC AAGGCCATGA 8820 

ACATTGCGGT CAAACAATTA GACGGCTCCC TTTCTGACCT CATTAACAAT ACCCGTCAAG 8880 

AAATCCTCAA TACACTTGCC CAAGTTGAGG TCAATATCGA CTATCCTGAG TATGACGATG 3940 

. TTGAGGAAGC CACTACTGCT GTTGTCCGAG AGAAGACAAT GGAGTTTGAG CAATTACTAA 9000 

CCAAACTCCT TAGGACAGCA CGTCGTGGTA AAATCCTTCC TGAAGGAATT TCAACGGCTA 9060 

TCATTGGACG TCCCAACGTT GGGAAATCAA GCCTTCTCAA CAACCTCTTG CGTGAGGACA 9120 

AGGCTATCGT AACAGATATC GCTGGGACAA CACGAGATGT CATCGAAGAG TACGTCAACA 9180 

TCAATGGTGT ACCTCTCAAA TTGATTGATA CAGCCGGTAT TCGTGAAACG GATGATATCG 92 40 

TTGAACAAAT TGGAGTTGAG CGTTCGAAAA AAGCTCTTAA GGAAGCTCAC CTACTTCTGC 93 00 

TAGTACTAAA CGCTAGTGAA CCACTAACCG CCCAAGATCG CCAACTCCTA GAAATCAGTC 93 60 

AGGAGACTAA TCGCATTATT CTTCTTAACA AAACTGACCT GCCTGAAACG ATTGAAACTT 94 2 0 

CGGAACTACC TGAAGATGTC ATCCGCATTT CAGTTCTTAA AAATCAAAAC ATCGATAAAA 9480 

TCGAAGAGAG AATCAACAAC CTCTTCTTTG AAAATGCTGG TTTGGTTGAG CAAGATGCTA 9540 

CCTACTTGTC AAACGCCCGT CACATTTCCT TGATTGAGAA GGCCGTTGAA AGCCTACAAG 9600 
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CTCTTAACCA AGGTCTTCAA CTAGGCATGC CAGTTGACTT GCTTCAAGTT GACTTGACCC 9660 

GTACTTGGGA AATTCTAGGA GAAATCACTG GAGATGCTGC TCCAGATGAA CTCATCACCC 97 20 

AACTCTTTAG CCAATTCTGT TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA 9780 

TGGATTTTAG GTTCTATAAT ATTTGTAGTG GGTAAATCCA CTATAGATAT TATGGAGCCT 9840 

ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA AACAACTCAT 9900 

TAGAAAGAAT CATATGGAAC AATTACATTT TATCACAAAA TTACTAGACA TTAAAGACCC 9 960 

TAATATCCAG ATTTTAGACA TCATCAATAA GG AT AC AC AC AAGGAAATCA TCGCCAAACT 10020 

GGACTACCAC CCCCCATCTT GCCCTGAGTG CGGAAACCAA TTGAAGAAAT ATGACTTTCA 10080 

AAAAACCTTC TAAAATTCCT TATCTTGAAA CGACTCGTAT GCCCACTACA ATTCTCCTTA 10140 

GAAAGCGTCG ATTCAAGTGC TATCACTGTT CAAAAATGAT GGTCGCTGAA ACTTCTATCG 10200 

TCAAGAAGAA TCACCAAATC CCTCGTATCA TCAACCAAAA GATTGCTCAA AAGTTAATTG 10260 

AAAAGATTTC TATGACTGAT ATTGCCCATC AGCTTTCCAT CTCAACTTCA ACTGTTATTC 10320 

GTAAGCTCAA TGACTTTCAC TTTAAACATG ATTTTTCTTG TCTTCCTGAG ATTATGTCTT 10380 

GGGATGAGTA TGCTTTTACA AAAGGGAAGA T 10411 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2393 base pairs 

(B) TYPE: nucleic acid 
(CJ STRAND EDN ESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

GTTTTCGCTT CTGGAAATTA TCAGATGGTT GGAAAAGCCG TCCACATCAA GATAGTGTTC 60 

GGAGATTTAA GTTTAAATTG AAGAAACTAA CACAGAGGAA ATGGACTATA GACCTAACAA 120 

GACGTATTGA GCAACTGAAT TTGTCTATTC GAGGATGGAT AAACTATTGC TCATTGGGAA 180 

ATATGAAAAG TATAGTCGCC AGCATAGATG AGCGCTTGCG TACTCGCCTA CGAGTGATTA 240 

TCTGGAAGCA ATGGAAGAAG AAATCGAGAC GATTATGGGG ATTGCTTAAG TTAGGAGTTC 300 

CTAAATGGAT AGCAGATAAG GTATCTGGCT GGGGCGACCA TTATCAATTA GTAGCTCAGA 3 60 

AGTCGGTACT TAAACGTGCT ATATCAAAAC CAGTCCTGGA AAAACGTGGA CTGGTTTCGT 4 20 

GTTTGGATTA TTACCTTGAA CGACATGCGT TAAAAGTTAG TTGAACCGCC GTATGCCAAA 4 80 

CGGCACGTAC GGTGGTGTGA GAGGGGCTAG AGATTATCCC CTACTCGATT AACTCCCCTG 540 

AAATTTATTT TAATTATGCA AATTTCACGT ATTTTTGATG CTGAGACGAC GATCCTGCGA 600 
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ACTTTTCAGA TATTTTTTTG ACTATCTAAA TCTATCATTA GAAAACCTTA GAGCGCCAAA 660 

GGATTTGAGC GTTTTTCTGA TTTTTAAGAC TTTTTCCAGT CTCTTTTTCG ATTGAAGATG 720 

TAATTATTCT ACTAACTAAC TAACTTCTTA GTACTACCCA ACAACGATAA TCATAATTCC 780 

TCCTAAAATT AGGAATAATA AAGGCAATAG TT T TT G TTTT TTCATGTAAA AAACCTCACT 840 

TTTGTTTTCT GCTATTTTAT GCTAAAATAT TAAAAATCAA ATTTAATTCC AAAGTTTGTA 900 

ACTAAAGGGG GAGCGCTACA TGTCTAATTC ATTTGTCAAG TTGTTAGTCT CTCAATTATT 960 

TGCAAATTTA GCAGATATTT TCTTTAGAGT AACAATCATT GCTAACATAT ACATTATTTC 1020 

AAAATCAGTA ATTGCCACAT CACTAGTTCC TATCTTAATA GGAATATCCT CTTTTGTTCC 1080 

GAGTCTTTTA GTTCCGTTGG TTACTAAAAG GTTAGCGCTA AATAGGGTTT TATCTTTATC 1140 

TCAATTTGGA AAGACTATAT TATTGGCGAT ACTCGTAGCA ATGTTTACCG TAATGCAATC 1200 

CGTAGCGCCT TTGGTGACCT ATCTATTTGT TCTTGCAATT TCCATACTAG ATGGTTTTGC 1260 

AGCACCCGTT TCCTATGCTA TTGTGCCACG CTATGCGACC CATTTGCCTA AGGCTAATTC 13 20 

AGCCTTATCA ATGACTGGTG AAGCTGTTCA ATTGATAGGT TGGGGATTAG GTGGACTCTT 1380 

GTTTGCAACA ATTGGTCTGT TACCTACCAC GTGTATCAAT TTAGTCTTGT ATATCATTTC 14 40 

TAGCTTTCTG ATGTTATTTC TTCCTAACGC TGAAGTGGAG GTGTTAGAGT CAGAAACTAA 1500 

TCTTGAAATT TTGCTCAAAG GTTGGAAGTT AGTTGCTAGA AATCCTAGAT TAAGACTTTT 1560 

TGTATCACCA AATTTATTGG AAATTTTTTC AAATACGATT TGGGTTTCTT CC ATT AT ACT 162 0 

TGTTTTTGTA ACGGAGTTAT TAAATAAAAC GGAAACTTAC TGGGGATATT CTAATACAGC 1680 

ATACTCTATT GGTATTATAA TTAGTGGCTT AATTGCTTTT AGGCTATCTG AAAACTTCCT 1740 

TGCTGCTAAA TGGGAACCCC AATTATTCAC CCCAAATCTA AAAACCATCC AGAATCCTTG 1800 

CCTTAGCTTA GATCCTGGAT GGTTTCTTTT TTCACCCAAT GGGTGTTTTT TACTAGACAA 1860 

AAAAGAGTTT CCCCTTTATG GTATAAGTGT AGAAAAAAAC ACAAAAAGAA AGGAAACTCA 1920 

CATGAACAGT TTACCAAATC ATCACTTCCA AAACAAGTCT TTTTACCAAC TATCTTTCGA 1980 

TGGAGGTCAT TTAACCCAGT ATGGTGGTCT TATCTTTTTT CAGGAACTTT TTTCCCAGTT 2040 

GAAACTAAAA GAGCGGATTT CTAAGTATTT AGTAACGAAT GACCAACGCC GCTACTGTCG 2100 

TTATTCGGAT TCAGATATCC TTCTCCACTT CCTCTTTCAA CTGTTAACAG GTTATGGAAC 2160 

GGACTATCCT TGTAAAGAAT TCTCAGCTGA TGCCTACTTT CCAAAATTGT TGGAAGGAGG 2220 

GCAGCTTGCT TCACAGCCAA CCTTATCCCG TTTTCTTTCC AGAACTGACG AGGAAACAGT 2280 

CCATAGTTTG CGATGCCTCA ACCTTGAATT GGTCGAATTC TTTTTACAGT TTCACCAGCT 2 340 
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AAACCAACTC ATTCTAGATA ACGATTCTAC CCATTTCACA ACTTATGGCA AGC 2393 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

TTTGTATCTT TTTAGGTCTC TTTCAATCCA AACCCTTTAA ACTATACGTC ATTTCGGTTC 60 

CTGCAAGTCT TGTGGTAATT TTAGGTTTGA TTTTACTTTT CTTTTCACAA GAGCCTCTGC 120 

ACGCTTCTTA TTTGATGGTC GTCTTCCCTG TTTTCCTACT TTTATTGGTA ACCAATATTA 180 

AGAGTCAACA GAGGGGGCGT AGTGCTAGAA GAACCCGAAG AGAAACGCCA TTATGCCTAT 240 

GGAGTCGTTT CTTCAAAGGA AATCTATATC TGCTAGTTTT TGCGTTTCTC TAT C TTTTGT 300 

CTGTTCCTTT TTTGATGAAG TTTGTCCTTT ATCCAGTACC TTATCAAGAA CGTAATCGTC 3 60 

TTGCTGATTT GGTAAAAGAG CAGACAAATA CCGAAGATGC TATCTCATGC ATGGGATGAT 420 

ACTGCGACTC TTTATCGTAA GAGTGAGCGC TTCTCCCATC GGCCATTTTG TCCCCGTTGC 480 

ACT AT AC AGC AACTGAGGAA AATCGTAATA AGTTACTTAA TGACTTGAAA GAAAAACAAC 540 

CTAAGGTGAT TGTGGTAAAT GATAAGGTGG TAGTCTGCTC TGAAGTGGAA ACACTCTTAA 600 

AAGAAAATTA CCAACAAGTA AAGACTGATT ACTCAGAGTT TAAAGTCTAT AAAATTAAAT 660 

AACCAAATCA ATATCTTGTG TATTTTTAAA AATTTTAGGA TTTTTAACAC AAGATATTGA 720 

TTTTTCTTTT TAGAGTGGTA TAATACTTTT TAGAAAGAAC ATTTTAGAAA AGAGCATGCA 780 

TATGATTGCA CTAGAAGAAA AAATTACAAT TTTCCCAACT CTCTTCGTCG ACAAACGAGA 840 

TGGGAGACGT GTTGTATTTC ATGTGGACAA GATTGACAAG GCTCTCCACA AGGCGGCTGA 900 

CAAGGTTATG GATGTGACAC CCCTGGTTGA AAAATGCCTC AATGATCTGA CTGACCCAAT 960 

TATTACAGAA ATTCATAGTC GCTTTCCACA GGGAATTAAG ATTTACGAAA TTCAAAATAT 1020 

CGTAGAACAT GAACTCCTTG AAGCCAAAGA ATATGCGCTG GCTCAGGAGT ATATTACTTA 1080 

TCGGACACAG AGGGATTTTG AGCGCTCAAA AGCGACGGAT ATCAACTTTA GTATTCATAA 1140 

ACTTCTCAAC AAAGACCAGA CAGTTGTCAA TGAAAACCCT AATAAAGACA GTGATGTCTT 1200 

TAACACTCAG CGTGATTTGA CAGCAGGGAT TGTTGGGAAA TCAATCGGAC TGCAAATGCT 1260 

TCCTAAGCAC GTAGCCAATG CCCACCAAAA GGGGGATATC CACTATCACC ATTTGGACTA 1320 

CAGTCCCTAT ACCCCTATGA CCAACTGCTG TTTGATTGAT TTTAAGGGTA TGTTGGAAAA 1380 
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TCGTTTTAAG ATTGGAAATG CAGAGGTAGA CACTCCCAAC TCTATCCAGA CTGCGACACC 1440 

ACAGATTTCT CAAATCATTG CCAACGTTGC TTCTAGCCAG TACCGTCCCT GTTCAGCTGA 1500 

CCGTATCGAT GAAATTTTGG CGCCTTATGC A^AGAAGAAT TATCAAAAAC ATCTCAAAGA 15 60 

TGCAGAAGAG TGGGTATTGC CTGAAAAACA GGAAGATTAC GCTTGGAAGA AAGCGCAAAA 1620 

GGACATCTAC GATGCCATGC AATCTCTTGA GTATGAAATC AATACTCTCT TCACTTCAAA 1680 

TGGACAAACA CCTTTTACTT CGTTAGGTTT TGGTCTGGGA ACCACTCCTT TTGAACGAGA 17 40 

AATTCAAAAA GCTATTTTAA ACATTCGCAT CAAGGGTCTT GGTTCAGAAC ACCGTACGGC 1300 

T ATCTTTC CT AAACTTATCT TTACGCTTAA AAGAGGCCTC AACTTAGAGG AACGAAC7CC I860 

CAACTATGAC ATCAAGCAGT TGGCTCTAGA GTGTGCAACC AAGCGGATGT ATCCAGACGT 1920 

CTTGTCTTAT GATAAGATTG TTGATTTGAC AGGTTCTTTC AAGGTGCCTA TCGGCTGCCG 1980 

TTCTTTCCTT CAAGGGTGGA AGGATGAAAA TGGTGTAGAA GTCAATTCAG GTCGCATGAA 2040 
TCTGGGTGTT GTGACGGTTA ATCTGCCTCG TATTCCTCTT GAGTCTGAAG GTGATATGAA • 2100 

TAAGTTCTGG GAAATCTTCA ACGAGCGAAT GAATATCGCA GAAGATGCTC TTGTTTACCG 2160 

TGTCGAACGC ACTAAAGAGG CGACACCAGC GAATGCTCCT ATTCTTTATC AGTACGGTGC 2 220 

TTTTGGCCAT CGTCTAGGTA AAGAAGAAAG TGTTGACCAG CTCTTTAAGA ATCCTCGTGC 2280 

GACCGTTTCG CTGGGCTATA TCGGCTTGTA TGAAGTAGCG ACACTTTTCT TTGGTAACAG 2 340 

CTGGGAAAGT AATCCAGATG CTAAGGAATT CACGCTAGAC ATCATTCACG ATATGAAACG 2400 

CCGTGTAGAA GAGTGGTCAG ACCAATATGG CTACCATTTC TCTATCTACT CAACACCATC 2460 

CGAAAGTCTG ACAGACCGTT TCTGCCGACT AGATATAGAC AAGTTTGCCT CTATTCCTGA 25 20 

TATCACAGAC AAGGAATACT ACACCAACTC TTTCCACTAC GATGTTCGTA AAAATCCAAC 2580 

ACCGTTTGAA AAATTGGACT TTGAGAAAGT CTATCCCGAA CCACGTCCCT CAGGTGGTTT 2640 

CATCCATTAT TGTGAGTATC CAGTCCTTCA GCAAAATCCA AAGCCCTTGG AAGCTGTCTG 27 00 

GGATTATGCT TATGACCGTG TAGCCTATCT AGGCACCAAT ACTCCGATTG ACCCTTGCTA 27 60 

CAAGTGTGAC TTTGAAGGGG ATTTTGAACC AACTGAGAGA GGGTTTGCTT GTCCAAACTG 2820 

TGGCAATAGC GACCCTAAAA CAGTAGATGT GGTGAAACGA ACTTGTCGCT ACCTAGGTAA 2880 

TCCTCAACCA AGACCGATGG TCAACGGGCG TCACAAGGAA ATCGCTGCGC GTGTCAAACA 2940 

TATGAATGGT TCAACGATTA AAATAGCTCG GCATCAAGTA ACAAATTAGA AAGAAATGAA 3000 

ATGGGAAAAT ATCAACTAGA CGATAAGGGG CGCGCACAAG TGACCCGTTA TCACGAGAAA 3060 

CACTCTAAAG GTGGAGCTGG TAAGAAAGAA CGCTTGCTTA CCTTCACAGA ACAATTTTTA 3120 
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AACAACAACA AGAAAAAATA AAAGTGAGAG CCAGCTCTCG CTTTTCTCAT AGTGGGAGGT 3180 

AACCATGGAA TTACGCAGAC CAAGATTAGC GGATAAGAAA GCTGTTTTAG ATATGATGAC 3240 

AGAGTTTGAA AAATTTCAGT CGCCTCACGA CGGCGGTTTC TGGGATACAG AGAACTTTGT 3300 

GTATGAAGAC TGGTTAGAAA GCAATCAGGA ACAGGAAATG GGGATTAATC TGCCTGAAGG 3360 

ATGGGTTTCT CCAATTCAGT TAGTGGCTTT TTCTGAGAAA GGTCAAGCAG TTGGATTTCT 3420 

TAATCTCCGG TTGCGCCTCA GTAACTTTCT ACTAGAAGAA GGTCGCCACA TTGGCTACTC 3480 

CATTCGTCCA TCTGAAAGAC GCAAGGGTTA TGCAAAAGAG ACTCTCCGTC AGGGCTTGCA 3540 

AGTTGCTAAG GAAAAGAACA TCAAGAAAGC TCTGGTGACC TGTAGTGTGA ATAATCCTGC 3600 

TAGCAGAGCA GTCATTCTAG CAAATGGTGG AATATTTGAG GATGCTCGCA ATGGAGTCGA 3660 

GCGTTATTGG ATAGAGGTAG CGAATGAATA ATCCAAAACC ACAAGAATCG AAAAGCGAGG 3720 

AACTTAGTCA AGGTCGTATC ATTGACTACA AGGCCTTTAA CTTTGTGGAC CGCGAAGGCG 3780 

TGCGCAACTC TCTCTATGTA TCAGGCTGTA TGTTTCACTG CGAGGGATGT TATAATGTTC 3840 

CGACTTGGTC TTTTAATGCT GGCATTCCCT ATACAGCAGA ATTAGAAGAG CAGATTATGG 3900 

CAGACCTTGC CCAACCCTAT GTTCAAGCCT TGACTTTGCT GGGAGGGGAG CCTTTTCTCA 3960 

ATACTGGGAT TCTCTTGCCA CTTGTTAAGC GGATTCGGAA GGAATTGCCA GACAAGGACA 4020 

TCTGCTCCTG GACCGGCTAC ACTTGGGAAG AAATGATGTT GGAAACTCCA GATAAACTGG 4080 

AATTCTTGTC ACTGATTGAC ATTCTTGTCG ATGGAAGATA TGATCGAACT AAGAGAAATC 4140 

TTATGCTCCA GTTTCGACGT TCATCTAACC AACGAATTAT CGATCTGCAA AAATCGCTCA 4200 

AAAGTGGGCA AGTAGTGATT TGGGACAAGC TCAATGACGG AAAAGAAAGC 7ATGAACAGC 4260 

TGAAGAGAGA ATGAAGAAAA AGGACTTACT AGACCAACTA GTCTCACAGA TCCAGACGGG 4320 

GAAAGTCAGG ACACTGGGAA TATACGGTCA TGGAGCTTCA GGTAAATCAA CCTTTGCACA 4380 

GGAATTGTAC CAAGCTTTAG ATTCTACTAC AGTAAATTTG CTAGAGACAG ATCCTTATAT 4440 

CACCTCAGGA CGCCATCTGG TAGTACCCAA GGACGCGCCG AATCAAAACG TGACAGCCAC 4500 

TCTGCCAGTG GCCCATGAAC TGGAGAGTTT GCAGAGAGAT ATCCTTGCTT GCAGGCGGCT 4560 

ATGGATGTCT TGACAATTGA AGAACCTTGG AAGGCTAGTG AGGTCTTGTC TGGAGCCAAA 4620 

CCAATTTTGA TTGTCGAAGG GATGTCTGTT GGCTTTCTAC CCAAGGAACT CTTTGAAAAA 4680 

ACCATCTGTT TCTACACGCA TGAGGACACC GAATTAAAGC GACGCCTTGC TAGAGATACG 4740 

ACTGTGAGAA ATCGCGATGC GG 47 g 2 
(21 INFORMATION FOR SEQ ID NO: 92: 
(i) SEQUENCE CHARACTERISTICS: 
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CGTATCTTCf 


CGTTCTGAAA 


AAGTTGGTGA 


GTACATCTCT 


GAAGCAATGG 


AAAAAGTTGG 


780 


CAAAGACGGT 


GTCATCACCA 


TCGAAGAGTC 


ACGTGGTATG 


GAAACAGAGC 


TTGAAGTCGT 


840 




CAGTTTGACC 


GTGGTTACCT 


TTCACAGTAC 


ATGGTGACAG 


ATAGCGAAAA 


900 




GACCTTGAAA 


ATCCGTACAT 


TTTGATTACA 


GACAAGAAAA 


TTTCCAATAT 


960 


TCAAGAAATC 


TTGCCACTTT 


TGGAAAGCAT 


TCTCCAAAGC 


AATCGTCCAC 


TCTTGATTAT 


1020 






AGGCTCTTCC 


AACTCTTCTT 


TTGAACAAGA 


TTCGTGGAAC 


1080 


l_ * L Lnn^.U 1 r\ 


C.TAGCAGTCA 


AGGCACCTGG 


TTTTGGTGAC 


CGTCGCAAAG 


CCATGCTTCA 


1140 


AflATATCGCC 


ATCTTAACAG 


GCGGAACAGT 


TATCACAGAA 


GACCTTGGTC 


TTGAGTTGAA 


1200 


& f A T CI CG A C A 


ATTGAAGCTC 


TTCCTCAAGC 


AGCGAGAGTG 


ACCGTGGACA 


AAGATAGCAC 


1260 




GAAGGTGCAG 


GAAATCCTGA 


AGCGATTTCT 


CACCGTGTTG 


CGGTTATCAA 


1320 




GAAACTACAA 


CTTCTGAATT 


TGACCGTGAA 


AAATTGCAAG 


AACGCTTGGC 


1380 




nGTGCTGTAG 


CGGTTATTAA 


GGTTGCAGCC 


GCAACTCAAA 


CTGAGTTGAA 


1440 


& r* & a & Tr. & & & 


rtrrccATTG 


AAGATGCCCT CAACGCTACT ; CGTGCAGCTG TTGAAGAAGG 


1500 


I Al 1 L> I 1 Vjl— rt 


GGTHGTCGAA 

4 VJ\J 4 \J\J*\m* 


CAGCTCTTGC 


CAATGTGATT 


CCAGCTGTTC 


CTACCTTGGA 


1560 




r.ATHAAC C AA 


CAGGACGTAA 


TATTGTTCTC 


CGTGCTTTGG 


AAGAACCCGT 


1620 




GCTC ACAATC 


CAGGATTTGA 


AGGATCTATC 


GTTATCGATC 


GTTTGAAAAA 


1680 




GGTATAGGAT 


TTAACGCAGC 


AACTGGCGAC 


TGGGTTAACA 


TGATTGATCA 


1740 


AlAi Inl V- rt I 1 


GA^CCAGTTA 


AAGTGAGTCG 


TTCAGCCCTA 


CAAAATGCAG 


CATCTGTAGC 


1800 


LnuL 1 1 vjrt * 1 


TTG AC AAC AG 


AAGCACTCGT 


AGCCAATAAA 


CCAGAACCAG 


TAGCCCCAGC 


1860 


TCfAPfAATfl 


GATCCAAGCA 


TGATGGGCGG 


GATGATGTAA 


GCTTTCTATA 


GAAAACAACT 


1920 




ACAAAAGGAG 


GGAATGACTA 


ACCCTTCTTT 


TTATAGGCTC 


TTTCTCAACT 


1980 


GTAGTGGGTT 


GAAGTCAGCT 


AAGCTCGAGA 


AACGACAAAT 


TTCGTCCTTT 


CTTTTTTGAT 


2040 


GTTCAAAGCG 


ATAAAAATCC 


GTTTTTTGAA 


CTTTTCAAAG 


TTTCGAAAAC 


CAAAGGCATT 


2100 


GCGCTTGATA 


AGTTTGATGA 


GATTATTCGT 


CGCTTCCGGT 


TTGGCGTTAG 


AATAGTGTAG 


2160 


TTGAAGGGCG 


TTGATAATCT 


TTTCTTTATC 


TTTGAGGAAG 


GTTTTAAAGA 


CACTCTGAAA 


2220 


AATAGGATGA 


ACTTGCTTAA 


GATTGTCCTC 


AATAAGTCCG 


AAAAA7TTCT 


CCGGTTCCTT 


2280 


ATTCTGAAAC 


TGAAACAGCA 


AGAGTTGATA 


GAGCTGATAG 


TGATGTTTCA 


AGTCTTGTGA 


2340 


ATAGCTCAAA 


AGCTTGTCTA 


AAATCTCTTT 


ATTGGTTAAA 


TGCATACGAA 


AAGTAGGACG 


2400 


ATAAAATCGC 


TTATCACTCA 


, GTTTACGGCT 


1 ATCCTGTTGT 


ATGACCTTCC 


AGTAGCGCT? 


2460 
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GATAGCCTTC TATTCATGGG 
ACTCATAGCA CGGCTAAGAT 
CCGGAGTGAA ACAGTCTGGG 
TGTTTAGCCA AGTCATAGTA 
CGAACGGCTC TATCGTAGCG 
AGAACAGTGA TAATATTAAG 
GTGAAGGCAT ACTCATCCCA 
TGAAAGTCAT TGAGCTTGCG 
TCAGTCATAG AAATTTTTTC 
ATTTGGTGA? TTTTCTTTAC 
CACTTGAAAC GACGCTTTCT 
GGAATTTTAG AAGGTTTTTG 
GATGGGGCCT CGTAGTCCAG 
TCTAAAATCT GGATATTACG 
TCCATATGAA TCTTTCTAAT 
TTTTTTCTAC AACAAAATAG 
TATAGAGCCG AAAATTCACA 
TTATTAAAGG ATGACACAAA 
AAAATATACC TCACAGAATC 
TATTTTGACT T7ATTGAATC 
CT GAT AG ATT AAATAGCATT 
GATTGATGCT ATGTGGAAAT 
TATACTAATC ATTTTCGTAT 
GGAATAAAGA CATTAAAAAA 
GCATAAATCT CTTTCTAGTA 
TGTTTACACA ATTTATTTTA 
CAACTTTACC GATTCTTTAG 
GAGAAAGGAC CACGTCCATT 
ATATAGTCCA AGTCATCAGA 
TCCAAGAGAC GTTTTTCCCC 



630 

A7TTTCGATC CAATTGGTTC ATAATTTGAA CACGCACACC 2 520 

GTTGTACAAT GTCAAAGCCA TCCAACACCA TTTTAGCATT 2580 

AGACTGTTTC AGCCTGAGCC TAGAAATTTC AAAGCGAAGC 2640 

AGGACTAAAC ATATCCATCG TAATGATTTT CACTTGACAA 2700 

AAGAAAGTGA TTTCGGATGA CAGCTTGTGT TCTGCCTTCA 2 760 

ATTATCAAAA TCTTGCGCAA TGAAACTCAT CTTTCCCTTA 2820 

AGACATAATC 7T7CGAAGCC GAGAAAAATC ATGCTCAAAG 2 880 

AATGACAGTT GAAGTTGAAA TGGCCAGCTG ATGGGCAATA 294 0 

AATTAACTTT TGAGCAATCT TTTGGTTGAT GATACGAGGG 3000 

CAGGGGAGTC TCAGCAACCA TCATTTTTGA ACAGTGATAG 3060 

AAGGAGAATT CTAGAACGCA TACCAGTCGT TTCAAGATAA 3120 

AAAGTCATAT TTCTTCAATT GGTTTCCGCA CTCAGGGCAA 3180 

TTTGGCGATG ATTTCCTTGT GTGTATCCTT ATTGATCATG 3240 

GTCTTTAATA TCGACCAGTT TTGTGATAAA ATGTAATTGT 3300 

GAGTTGTTTT GTCGCTTTTC ATTATAGGTC ATATGGGACT 3 3 60 

GCTCCATAAT ATCTATAAGG GATTTACCCA CTACAAATAT 34 2 0 

TCTAATATAT GCAGACTACT TTGAAATGAA ATTAAAAAAA 3 480 

AGTTTTTGAA AAATCTACAT TCAAATTTC7 AGAAGGATAT 3 540 

TAAAGAATCT GCAATTAAAC AAATGGACAA TGTCATAAAA 3 600 

TAAAAGTATT GCTTTATATT TTCAAAAACG ATTAAATGAG 3 660 

TTCTCTGTTG AGATATTGTT TTTAAAATAT TCTACTAAAT 3720 

ACAAAAAAAT GTTTTTGATA CGAAGTTGAC CTGTATTTTT 37 80 

T7TTTGTATT AAACGATATA AGTTTGTTGT AAACTTACAA 384 0 

TAACAGTATA 7CTATTTC7T TTATATATTT TACGAATTCT .'900 

ATGTGTTGTA ACTCTGC7AT AATAGA7TTA TTCCTTTTTG 3960 

TACTACCAAA AAAGGTCAGG ATTTTG7TCC TGACCTTTGA 4020 

TTCTACATAG CGCTTGTACC AAATGTTTAC ATAGGCTTCT 4080 

GTTAATCCAA TCAACAAGAA TTTTGACATG TTCTTTTAAA 4140 

ATAATTCATT TTGCGTT7GT GACGC7CGTA CTCTTCAACG 4 2 00 

ATCTGTAAAA ATTTTAACAT CCAAATCGTA A7CAATATAC 4 2 60 
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1 1 UaVj I VvV_ I I 


Ul 1 \—A I ULAu 


ATAftTAfWIC 


CTAGrfATAT 


TGCAATAGTA 


AGAAGTTCCA 


4320 


TT ATC ACG AA 


TLA 1 


OA TA 1 1 ArtnL 


L. AA 1 Al 1 I *w A 


tgtgaaagta 


AACAATAGCC 


4380 


GGTTCT CGAG 


TGAL.L.L. AAL,G 


ACGACv- A 1 LA 


L. l 1 l LVjLl * AA 


C A A (TTYIT ATG 


ATCGTTGACA 


4440 


CCAATAATGG 


CGTTTTCTGT 


TGTTTTTAGT 


A A T/*f**TY1T 

ML, L. A iuu ly 1 


rclCC A AfIT 
LLLViLLAAo 1 


TCGGTGGAGA 


4500 


CTCCCATCAT 


GCTTATAACT 


TTGAATTGTA 


A i AAAVj 1 L,UL> 


L I ILl 1 1 • w 


AACCTTCATA 


4560 


ACTAACCAAC 


TTTCTACAAT 


TTATAAGTTT 


A1LA1 1 1 AL 1 


Al LVj I AL^Al 


AAAATTAC*CC 


4620 


AAAATCTGTG 


AATTTCACTT 


GGAAATATTA 


A A/* ITITTC" 
AAUA iAl 1L > 


PT A A n AftPflC* 
L AAuAuLuL 


TTGCTATATC 


4680 


CGAAAAATCG 


TAGCCCTTTC 


GTGCTAAAAL. 


TTf* AfT m AAA 
I 1 LlALf I * AAA 


f*r;pTr;pTT*f*A 


GTTCCTATCC 


4740 


TTCATACTTT 


CGGGCATAL. r 


TAuTAiAl ITj 


C*"T ATC AACT 

^ * l AIL AAU 1 


T P CTTC AAG A 


TGAGTTCCTG 


4800 


AGTCGTTTCT 


TCATCAACTT 


GACTA^ LLAA 


1 1 1.U 1 UAAAu 


flPAATTTTAC 

OL/V\ 1111 Aw 


CATCAAAATA 


4 8 60 


AGALrAAL»CL,L, 


L l^jU ALi 1 V. A 




AATCTT AT CT 


TGCAGGGCAC 


GAGCTGGAAG 


4920 


rp^p/pm^f-p-p/- * 

r I I * L,L,L I *-A 


LA; 1 n U *-A 


ATA fITTT ATT 


GGCTACACGT 


TGAGCAACTT 


CCGAAAAATC 


4980 


AAAATCATTC 


AAGATTTCTT 


CTATAGTAGA 


TTTTGAAATT 


CCTTTTTGTG 


CTAATTTCTG 


5040 


AGTCAGTACA 


TAAGGTCCCT 


TGTCTCCTGA 


AAGTTGATTG 


CCATTGATCA 


TAGCATAAGC 


5100 


GTACTGGCTA 


TCATTAATCC 


ACTTCTCTTC 


TTTAAGATTA 


GCAATGACTT 


GAGAAACGAT 


5160 


CTTTTCATTA 


ATATCATATT 


TTTTCAGATA 


TTCTCTGACC 


TCTTTTTCAG 


TACGTGCTTT 


5220 


AAAGGATAAG 


TGGTAGAGGG 


CCAGATTCTT 


ACCATAAGAA 


AATTGAGCAA 


AGTCTTGAAT 


5280 


CTCTTTCAAT 


TCCTCTTCGC 


TTATCACCTT 


ATCTCTCCAT 


AACATAAAAC 


GAACAATTGT 


5340 


GTCTTCGGTG 


ATATAGCATT 


TGTCG 








5365 


(2) INFORMATION FOR SEQ ID NO: 78: 









ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3636 base pairs 
(S) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
TTTCCAGAAA GAAGTTGACT AAAGTCTTTA TCAAAGAGAA TGACTTCCGT ATTGGAACTC 
ACATTACCTT TTATTTCTAC TTTACTAGCC TCCGCCCTAG CATTTTCTAA ATCTTTAATC 
TCTTCTGTTG CCCTATTTAT AGCCAGCTGA ATAACTGCTT GAGGATTTTC ACTCAGTCCA 
TGAAGCTTAT CGTCCACCGA AGTATAAAGA CTCGAATGCA TCACTTGTAA AATAATCAGA 
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< 



CTCATTGTAG 


AAAAAATCAG 


CGTGAAGACA 


CCGAAGTTGC 


GGATAAAATA 


ACTAAAGTCA 


300 


TCCGCATACC 


ATGTTTTTTT 


AAGTTTACTG 


AACATCTTTT 


AAAAGATACC 


CAACACTACG 


36C 


CAAAGTTTGC 


AAATTCTCTG 


CAAAAGTGGT 


TCCCTTTAAT 


TTCTTACGGA 


CTTTTGAAAC 


420 


ATAGACTTCG 


ACAACCGAAA 


TCGTTGTATC 


ACTATCAAAT 


CCCCATAGAC 


GGTCAAAAAT 


480 


CTGCGTCTTA 


GGCAAAATCA 


CATTTTGATT 


TTGAAGGAAA 


TAAACTAGTA 


AATCGAACTC 


540 


TTTCCCCAGC 


AATTCGACAG 


CAGTATCTTC 


AACTTTAACG 


GTATTGGTTG 


ATAAATTAAC 


600 


CACGATATTC 


CCATAAGTCA 


AGGTGTTTTC 


ATTAAACTTC 


CCTGAACGTT 


TGAGAAGGGC 


660 


CTGAATCCGC 


ATTTTAAGTT 


CTTCTAGGTA 


GAAAGGTTTG 


GTCAGATAAT 


CATCCCCTCC 


720 


CAGTTCAAAT 


CCATGTCCCT 


TGTCATCCAA 


ACTTTCCTTG 


GCACTCATAA 


TCAGAACTGG 


780 


TGTCGTAATT 


CCCTTTTCAC 


GCAATTCTTT 


TAAGACTTGG 


AAACCATTTT 


TTTCTGGCAA 


840 


CATCAAATCC 


AGCAAAATCA 


AGTCATAGAC 


ACCACTCTCA 


GCTTCCTAGA 


GACCTTCTTC 


900 


TCCATCAAAT 


ACCTGCATAA 


CATCCGCAAA 


ATCGTCTAAA 


AAGTCAAATA 


CTGAATTTCA 


960 


CAGACCTAGG 


TCATCCTCAA 


CCAATAAGAT 


TTTTATCATG 


AGAAACTCCT 


CCTTATTAAA 


1020 


ACTATTATAC 


CAAATTTGCC 


TTAAAAAAAA 


CTCAACTCTC 


TGCATTTTAC 


ATGAGATACC 


1080 


TGAGTTTTCT 


TTTTATTTTA 


GGCTTATTTA 


TGCATTTCCG 


TATTGAAGAA 


CAACTGCTTC 


1140 


GACTGCAGCT 


TTTTCACGGC 


TAATCAAGTC 


AACACCCCCT 


GCAATTTCCT 


TGATTCCCAT 


1200 


ACCGATGTTA 


CGGCTAAGAG 


CAAGGTCAGA 


AAGTTGCGGT 


TCAAAGAACT 


CCTTGTATTC 


1260 


CGCCAAGCGT 


TGCTGAGTCT 


TAAATACATG 


AGCAGGAAGG 


ATAACAAAGC 


TATCAAAGCT 


1320 


CATATCTCCT 


CCAAGGGCTG 


CCTTAATCCA 


AGCCCAGTTT 


TCACGCGCCC 


AAGACCAAGC 


1380 


TGTTTTCTGA 


GTTGCTTGAT 


GAGCTAGGAA 


TTGGTAATAC 


CAAGCAGACA 


AGTCCTGTGG 


1440 


TTTGACCACA 


AATTTGTCCT 


TCCAAGAAGT 


AATCAGGTTT 


TGGATATTAT 


CCGCATCTGT 


1500 


ACTGTATGCA 


ACACCTCC7G 


CCAACTGGCG 


TTTAAAGACA 


GCATCTGTTG 


CGTGAGTATA 


1560 


ACTATCAAGA 


TAAAGTGCTA 


ACAAGTCTTT 


AGTCTCATGA 


TGTTTCATCT 


CATTAATCAG 


1620 


AACTTGTGAG 


CGAATAGCTG 


CTGGGAGTCC 


TGCAAGATTC 


TCCTTGTGTG 


TTGCGAAGAT 


1680 


TTGGCTAGCG 


ACT TG ACT AG 


CTTCTGCATC 


ATTTGAGCCA 


AT CAT CAT CG 


AAACAGCCAG 


1740 


CTGACGAACC 


AATTCATCCT 


CATCTGATTC 


TCCGTCTTTA 


GCTTCAAAAC 


CAAGACGGTC 


1800 


ATAGTTATGA 


CGAGCCAATT 


TAGCAACCAG 


TCCTTTGAAG 


GCTGTTTCAG 


CATCCGTTCC 


1860 


TTCATCAATA 


AAGCGCTCAA 


GGGCTGAAAT 


CACTTGAGAA 


ACAGCTGAAA 


CCACCACATA 


1920 


AGACTCTTCC 


TTAGCAAGTT 


TATCAAGAAC 


TGGAAGCAAG 


TCTGCATAAG 


AAATGTGCCC 


1980 


TGCCTCAGCC 


AACAAACGAC 


GTTCTTGAAC 


AATTTGCAGT 


TTGCTTGTGT 


TATCAAGTGT 


2040 
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CTCTAGCTCA 


GCAAGAACAG 


CTGCTAACAA 


CTCTCCTTGA 


TAGTCGGTAA 


TATAGTGGGC 


2100 


A G T AT^T^C A 


GTGTTGAGAC 


GAAGAGCTCC 


TTCATTTTCA 


GCAAGAACAG 


CTGCGTAGCC 


2160 


AVjVj A 1 » i 


ATACTTTCAG 


TTTCGAGTGT 


ATCAGGCAAG 


CCTTTCCAGT 


TGCTATTGAG 


2220 


UuwCaL V- AL V. 




T (""TTGTCTTC 


GTTCTCACCG 


ATGAAGAATT 


GTTTTTGTGA 


2280 


nn 111 1 L AAO 


ALA * LA till 


CAACTTTAAC 


AGTAAGAACT 


GGGTAACCAG 


GCTGTTCCAA 


2340 


LLMunn ILL 


ATfiAAfinrTG 


CG AC ATC AC C 


TCCTGACGCT 


TGACCAAGGG 


CATCCCAAAG 


2400 


L . LAL I nv.«.n 


ATnnT n ,Tv rG r 

A t L*L ly • i 


TGTATTGGTG 


TTTTTCAAAG 


TAGGCGTGCA 


AACCTTTAGC 


2460 


MnA I L ALL A 


L L 1 LL 1 nUL >~ 


AACGGCGAAG 


CATGTGCATG 


AGACGGCTTC 


CTTTGGCATA 


2520 


LALLA 1 AuLu 


PCHTCAAACA 


GTGTATTGAT 


TTCATCTGGA 


TGTTTAACTT 


CCACGTGGAC 


2580 


AviAL i LAALlj 


LL A * nu 1 r\\j 


CGTCACGTTC 


AAGAGCAAGA 


GGTACTCCAC 


CTGTTTGGAA 


2640 


» Tr"PTP A A A^l 


ATATTCCACC 


TTGGTTCGAT 


GGTATCCACA 


CAGACGTATT 


C CATC AT ATT 


2700 


a n r*n a a a pt^ 1 


TCATTGAGCC 

1LA1 


AAAGGTCATC 


CCACCATTTC 


ATAGTCACGA 


GGTTCCCAAA 


2760 


LLAI 1 LL* i LA 


VjLLAA 1 1LA1 


GGGCCACAAC 


AAGCGCAACT 


TGTTGACGGC 


TAGCAAATGT 


2820 


ALinLi 1 11.1 




AGTAAACTTC 


ACGGTAGGTC 


ACAAGACCCC 


AGTTTTCCAT 


2880 


ALtLALLAUL I 


/■ A A A GT C A G 


GAAGGCC GAT 


GTGGAGAGAT 


TGACGAATTG 


GGTACTTAAC 


2940 


TL L AT AGT AA 


TCTTCGTAAA 


AGTGGATAGA 


GCGAACAGCG 


ATATCCAGTG 


AGAAATCAAC 


3000 


A 11 1 vjAAAb i 




TGCTTGAGTA 

4 VJW * * WAV * 


GACACCTACC 


AGGGTACCAT 


TTTTAGTTTT 


3060 




P^TTf-.r A AA' 7 * 

L v— I IH-A/W * 


CACCAGCAAC 


AAAGGCCAAC 


AAGTAAGAAC 


ACATGCGAGG 


3120 


1 Vj 1 1<J 1 L 1 LA 


AACT? CC AC A 


TACCTGTTTC 


C7TACCGTTT 


TCAACATCGA 


TTTCTGGCAT 


3190 


GTTTGACAAG 


GCCAATTCAC 


CTTCTGCTTG 


GTCAAAGCGA 




phi ar.TT.r 


3240 


TTTGGCTTCA 


GGCTCATCCA 


CACATGGGAA 


AGcTTCGCGC 


GCAAAATGGC 


TCTCGAACTG 


3300 


AGTAGACAAG 


ACCTCCTTCT 


TGACTCCATC 


AACTGTATAA 


TAAGAAGGGT 


AAATCCCTGT 


3360 


CATGTTGTCT 


GtAATTTTAC 


CAGAAAAGGC 


AAGAACCAAT 


TCAACTTGAC 


CAGCCTCAGC 


3420 


CAATTCGATA 


TGAAGGGCTT 


CATTGTCATG 


GTCAACTGTA 


AATGGACGAG 


CTTGACCTGC 


3480 


AACTTCTACA 


GAGGTGATTT 


CCAAATCTTT 


TTGGTGGAGG 


GAGATGCCGT 


CACTCTGTGC 


3540 


TTGACCAGTG 


ATGGTCACTT 


TCCCAGAAAA 


AGTCTTCCTC 


TCACGACTCA 


AATCTAAAAA 


3600 


TAAATCATAA 


TGTTCAGGAA 


CAAATTGCTT 


AATGGG 






3636 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5066 base pairs 
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(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
( 0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

ATACCCTGTA ATAATCGATT TTAGAGGTAC CATAAGCCAC CTCCTACAAA TAGAAACCGA 6C 

TATAAATCAA TGCCTTCCAC CCTTAGACTT CCCTAGTTCC TGTCTCAAGC GAAACATTTC 120 

TTTGAAACAG GAATAAGTTA ACCAATTCAT ACCAATAGCT AGCAGAATAA AAACAAACCA 130 

AATGCCCCAT AACTTGATAT CTCTCACATT TCTCAAGACG CTATTGAAAA ACAGAACTGA 2 40 

AACAACTGTC CAAGCAAGGC TAAAAAGAGA ATAGAACGGG ATGTAAAACC AGTAAAAATA 3C0 

ATAAAAAATT GGAAAAAACT TACTATTTCT GTTGGCCTTT TCAATCCAGT TATCAAAATA 360 

AAACTACGGT GCTAAAAGTA AGAATTTAAA CAAATGTTCC ATCACCGACA TCCCCCCTTC 4 20 

TTTTGATAGC GTTTTCTATT ATTTTATTAT ATCAAAAAAA TCCGGAACTG TCATTCCAGA 4 80 

TTCTACTTTT TTATTTGCGT TTTCTTGCGA TGAGATGAAT CGGTGTTCCC TCAAAAACAA 54 0 

AGGCCTTGCG GATTTGATTT TCCAAGAAAC GCAGGTAAGA AAACTGCATG AGTTCTTCTT 600 

CATTGACAAA GATGACAAAG GTTGGTGGTT TGGTTGCCAC TTGGCTCGCA TAGAAAATCT 660 

TGAGACGTTT TCCTTTGTCT GTCGGTGTTG GGTTGATGGC AATCGCATCC ATGATGACAT 720 

CGTTCAAGAC AGCTGATGGA ATACGTGTAT TTTGACTTTC GCTGATTTGC TTAATCATCT 7 80 

CAGGAAGTTT GTGGAGACGT TGCTTGGTTA AAGCTGATAC AAAGATAATC GGTGCGTAAG 840 

GCAGGTATTG GAACTGCTCA CGGATATCTT CTTCCCAGTT TTTCATAGTG TGGTTATCTT 900 

TTTCAACCCT ATCCCACTTG TTGACCACGA TAATCATCCC TTTACCAGCT TCATGGGCAA 960 

ATCCTGCGAT ACGCTTCTCG TACTCACGAA TGCCTTCTTC CGCATTGATG ACCATCAAGA 1020 

CCACATCTGA ACCGTCAATA GCACGCATGG CACCCATAAC AGAGTATTTC TCAGTATTTT 1080 

CATAAACCTT ACCAGACTTA CGCATACCAG CCGTATCAAT CATGGTAAAC TCTTGACCAT U40 

CTGTATCTGT AAAGTGGGTA TCAATGGCAT CACGAGTTCT TCCAGCAACA GGACTAGCAA 12 0C 

TAACACGGTC TTCTCCCAAG ATAGCATTGA TCAAGCTTGA TTTTCCAACG TTAGCACCAC 12 SO 

CAATCAAGCT AAACTTAATG ACATCTGGAT TTTCTTCCTC ATATTCATTT GGAAGATTTT 1320 

CTACGATCGC ATCTAGCACA TCCCCTGTAC CGATTCCATG GACAGATGAG ATACCCAATC 1380 

GTTCACCCAA ACCGAGAGCA TAGAAATCAT ATATATCATT TCTCATCTCA GGGTTGTCCA 14 40 

CCTTGTTGAC TGCGAGGATA ACTGGTTTGT GGGTCTTATA AAGCTTACGA GCTACGTATT 1500 

CGTCTGCATC AGTAATTCCT TCCTTACCAG ACACGACAAA AACCATAACA TCTGCTTCTT 1560 
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CCATGGCAAT TTCTGCCTGG TGCTTGATTT GTTCCATGAA AGGAGCATCG ACATCATCAA 1620 

TTCCTCCTGT ATCAATCATG CTAAAAGAAC GATTGAGCCA CTCACCCGTT GCATAAATAC 1680 

GGTCACGTGT CACTCCTTCG ACATCTTCTA CAATGGAGAT TCGCTCACCA GCGATCCGAT 1740 

TAAATAGGGT TCATTTCCCA ACATTGGGAC GTCCTACAAT GCCAATAGTT GGTAGGGCCA 1800 

TAATTTCTCA CTTTCTACAA TAATTTCTTC TGTTCAAGAT TTTTTCTAGT TGAGCTTGGT 1860 

TCAGCTTGAC CAAACTGTTC TGCTAGGCGC TGACTCCAGC TTGTGGTCGC ACGCGCCCCA 1920 

GCATAGTCAG CCTGAACACG GTCATAACCT TGGATTGCCT CAGTTGACTG TTCTTGGTAT 1980 

TCTTCCTCAA AGACAACATT CTCTAGTGGC AGTCTCGGTT TCATATCATG ATGTTGATTT 2 040 

GGCACACCCA GTGCCATCCC AAAGACAGAA TAGGTGTAGT CAGCTAGGTT AAAGAGCTCT 2100 

GCCACTTCTT CAGACTTGTA TCGAACCAAA CCGATAATCA CACCACCATA GCCCAAGCTT 2160 

TCAGCTGCCA ACAAGGCGTT TTGTCCAGCA AGAGCTGCAT CGACCGAACT AATCAAGAGA 2 2 20 

CCTTCCACAC CTTGGGGTTG GAAGGTGTCG GTATGAAGTC GGCCTCCCTT TTCTGCTCGC 2 280 

TTCAAATCTC CGACAAAGAG AACGAAAACA GCAGACTCCC GAATGGCTTC TTGAGGTACC 2 340 

AATTCATACA AGGCATCTTT CTTCTCTTGA CTTCGTACCA CAATCACAGA GTAGGATTGG 2 400 

AAATTCTTCC AAGATGATGC CATCTGGGCT GCTGTCAAAA TCTCATTTAA GTCTACTTGG 2 460 

GCAATTTCTT GCTCTTTAAA CCTCCGCACT GAAGTATGAG CCTTCATCAA TTTAATGGTT 2 520 

TCTGTCATCC ACGGTTTACT CCTTCTAAAC GAGTCTCCTC AGCCAAATAA CGGATGCCTT 2580 

CCATGACCCG TCTGGCTTCC CAGGTTTCGT CATTTCCATC TTTCACTTTC GCAAAATGCT 2640 

TCTCCAAATC TTCAAAGTTG AAGTTGGATG TGAAAAAGCT CGGTAAATTT TCCTGCATCC 2700 

GATATTGGAG AATGACCTGC AGGATTTCGT CACGCACCCA AACCCTTGAT TGCTCGGCGC 27 60 

CAATATCATC TAAAATCAGG ACCTCACACA GCTTAATCTC ATCCACCAAG GTCTTAACAT 2820 

TGCCATCACT GATAGCATTT TTGACATCAA TGACAAAGCT AGGATAGTGG ACCACAGTTC 2 880 

ATGAAACACC ACGTTTTTCT GATAAATCAT CACCTAACCC CCCCACCATG AAACTTTTAC 2 940 

CCACACCAAA CTCTCCATAT AAGTAAAGAC CTTTTCGAAT ACC7GGATAT TGCTCCACGA 3000 

AGGCTAGTAG CTTT7CAAAA ACTGGTAAGC GCCCCAAATC ATCCAAGTCA ACTTGAGCCA 3060 

AACTAGCTTT CTTGAGACTG GCTGGTAGAT TGATTAACTT GAGACGGTTC TTAATAGCCG 3120 

CTTCTTTTTC AGCCGCGATT AGCTCAGGAG TTTCTTCATA TGAAACATCT GCATAACCAT 3130 

CATTCTTAAC CAAAATCCGC TTGTAGCCTT TGGCAATATA ATCCGTATCC CCACGCAGAA 3 240 

AC7TGTCACC CTCCGTGATC TACTGATTAA ACTTGGAGAT ACTGCGATTT AATTCCTTTG 3 300 
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GAGTTAACGA TTCTTGCTGG ATAAACGCCG CAACATCAGG GTCCTTCATG ATTTTCTGGA 3 3 60 

CCAAATCTTG ATAATAAAAA CCGCTGGGTT GACGTTTGAG TACGTCTCCG ACACTTTCCA 3 420 

TCTAATCTCC TCCTTTTTCT AATCCACCTA ATAGTTCTTG CTTCTTACGT TCTAGTTCCA 3 480 

GACGAGTTTC CTCGCTGGTT TCATTCTTAT ATTCAGGATT ACTCCATTTA GGAACATTGC 3 540 

TTTTTTCTGG GGCAGTCTGA TTCTGTTTTT GTGTTTTTGC T TTC TGCCCT CGATCACGAA 3 600 

TTCGTAAAAC GGCCTCTTCT GCCGAATGAA TCTTTTGATA GGCATAGTCA TTGCCTACCT 3 660 

TCATCGCATA TTTCTCATTG ATATTTGCCG AATCCACCTT ATTAAAGGTC AATAAGAGAA 37 20 

TAATATTCAT GACTTCGTCC AGTAAGCCCA AGCCAGCCAT CTGTTGCAAG AGTTCTCTTT 3780 

CTGTTTGGGT AATCGTTCCC TTGCGTGTTT GCTTGATTTC TGCTAAGAAC TGCAGGGCAG 3340 

TTTTACTTTT AGCTTCTTTG ATAATGGTCG CTTCCTTAAG ACTAAAGTCA GAGGAAACTG 3900 

GTTTTTGAGC AATTTTTTCA CGCATGCGTT TGCTTGAAAT AACCTGGGAA ACAGCTGTTG 3960 

ACTTGGCCAA TTCATAGCTT TCAAACCAAG TCCATTTCTT CTCCTCGGCA ATAGCAAAGA 4020 

GGTTTAAGAC ATCGGACTCC TCATCCGCAA AACGAAGTCC ATCTCGAGCC ATCAGCTGGC 4080 

CAAAATCTTC CAAGTCAAAA TCATTGGCCA CTTTCTTCTT GAGACCAAGG TCTTCTTGAC 4140 

TGCCTAGTTC TGCCAATTCT GGAAAGACTT GATTCAGTCA GACAGCTATT TCTTCACCAT 4200 

CAGCACTTTC AACTTTCAAA TCCTCCACAC CTACATCGCC AATCTTTTTC TCTAAGAGTC 42 60 

TGCGATAAAC AGGATGCCCC AAGAAGTCTT GACTAGATAG AGCAGCATCG AGGGCTACCT 4 32 0 

GATAAACATC ACC C TTTTGA TAGAGGGTCA AC AG ATT AAA AGCAGATAAG ATTTTCAATG 4 3 30 

ATTTTATCAG TCTATCCATC CCAAAGTTGA GATGGTTGAG AATGCTTGAA AAAAGATATT 44 40 

CCTTTCTACC ATTATCCCAA AAACTGATTG TATAAAGATA AAGGCTCAGT GCCTCCTGAC 4 500 

CGATAATCGG GAGGTAGCAC TGTACCAGAG ATGAGGTATC TTGCGACACC CGATTATTCT 4560 

TTACATAAGA AAAACGGTCA ATTGGCTTCA TTTATCTTTC CTTTTTCTTT TTAGAGGACT 4620 

GGGTGATTTG TTGGAGCAAG CTCTCTAACT CACTGACATC CTTAAAACTA CG AT AG AC AC 4680 

TAGCAAAACG TACATAGCTA ATCTCGTCCA ATTCAGCCAA CTCCTCCATG ACGAGTGAAC 47 4 0 

CAATCTCCTC ACTTTGAATT TCATTTTCAT TTCCACCACC GAGTTTCTGT TCGATACGAT 48 00 

TGACTACCAT GTTGATTTCA TCACTTGACA CAGGACGTTT CTGGGCTGAG CGGATAATCC 4860 

CATTAAAGAT TTTATCTCTG GAGAATTGTT CCCGTGTGCC ATCTTTTTTA ACAACCACTA 492 0 

AGGTTCTTTC TTCTACTCGT TCGTAGGTTG TAAAACGGTG TTGGCATTCG TCGCACTCAC 4980 

GTCTTCTACG AATGGTGTTC CCTTCTTCTG CTTGGCGACT ATCGATAACA CTTGACTTGG 5040 

TAGCCCCACA TTTTGGACAG GGTACC 506 6 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9607 base pairs 
( B > TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



LAL 1 «. Onnu i 


nl 1 1 UrtArtLn 


GPT ATGC AAA 


ACATCATGCC 


TGTACTTGAA 


GTACGTGCAC 


60 




TCOTTPTAAP 


TACCAAGTCC 


CAGTTGAAGT 


TCGTCCAGAA 


CGTCGTACAA 


120 


LAV. 1 I tit»At. 1 




GT AAC AATC G 


CTCGTCTTCG 


TGGTCAACAC 


ACAATGCAAG 


180 


At.t.t»Tt. n. t»t. 




TTGGATGCTG 


CTAACAACAC 


TGGTGCAGCA 


GTTAAGAAAC 


240 


ti 1 (jAAIjA 1 At, 


tp & ppgt atg 


GCTGAAGCTA 


ACCCTGCATT 


CGCACACTTC 


CGTTGGTAAG 


300 


ATAtiOA I tit.t» 


A5.APPPT m AA 


GAAAGTCCCA 


GAGAAAATAG 


GGAATCGAAG 


CAGGTTGCGG 


360 


TTGt.AAt.t-AA 


TP1PA1TAT 


PTTTTTPTCC 


ACACTTTTAG 


CTTGAGCTCA 


ACTAAATCAT 


420 


OA * t»C I aUuA 


APPPTAAflfiA 


TGC AAGGT AA 


aaATAGGAAA 


CTGACGCAGT 


ATTCGACGAA 


480 


TAt. AAtitaAt» 1 


t 1 lAlV.i 1 l l 


TCACGCAGCA 


TCCCGTTCCA 


GCTCACATCG 


CCTAACTAAC 


540 


TTTAGCCCGG 


CTTCAAATTA 


GCTAAATCGA 


TT ACT ATT AG 


CTATAACTCA 


tit. 1 I nvUn 1 *• 


600 


TCGTAACTTG 


AAACCAACAA 


TAGCATGAAA 


ACATTGAGAA 


CCCCTAGGTC 


CTGCCTATCC 


660 


GTTTTT ATT A 


AAATCGTGTT 


ATAATAGAAT 


AGAAATCAAA 


AATAAATAGG 


AGAAACAAAC 


720 


CTCATGGCAC 


GCGAATTTTC 


ACTTGAAAAA 


ACTCGTAATA 


TCGGTATCAT 


GGCTCACGTC 


780 


GATGCCGGTA 


AAACAACAAC 


TACTCAGCGT 


ATTCTTTACT 


ACACTGGTAA 


AATCCACAAA 


840 


ATCGGTGAAA 


CTCACGAAGG 


TGCGTCACAA 


ATGGACTGGA 


TGGAGCAAGA 


GCAAGAACGT 


900 


GGTATCACGA 


TCACATCTCC 


TGCGACGACA 


GCTCAATGGA 


ACAACCACCG 


CGTAAACATC 


960 


ATCGACACAC 


CAGGACACGT 


GGACTTCACA 


ATCGAAGTAC 


AACGTTCTCT 


TCCTGTATTG 


1020 


GATGGTGCGG 


TTACCGTTCT 


TGACTCACAA 


TCAGGTGTTG 


AGCCTCAAAC 


TGAAACAGTT 


1080 


TGGCGTCAAG 


CAACTGAGTA 


CGGAGTTCCA 


CGTATCGTAT 


TTGCCAACAA 


AATCCACAAA 


1140 


ATCGGTGCTG 


ACTTCCTTTA 


CTCTGTAAGC 


ACACTTCACG 


ATCGTCTTCA 


AGCAAATGCA 


12C0 


CACCCAATCC 


AATTCCCAAT 


CGGTTCTGAA 


GATGACTTCC 


GTGGTATCAT 


TGACTTGATC 


1260 


AAGATGAaAC 


CTGAAATCTA 


TACTAACGAC 


CTTGGTACGG 


ATATCCTTGA 


AGAAGACATC 


1320 


CCAGCTGAAT 


ACCTTGACCA 


AGCTCAAGAA 


TACCGTGAAA 


AATTGATTGA 


AGCAGTTCCT 


■380 
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GAAACTGACG AAGAATTGAT GATGAAATAC CTCGAACGTG AAGAAATCAC TAACGAAGAA 1440 

TTGAAACCTG GTATCCGTAA AGCGACTATC AACGTTGAAT TCTTCCCAGT ATTGTGTGGT 1500 

TCAGCCTTCA AAAACAAAGG TGTTCAATTG ATGCTTGATG CGGTTATCGA CTACCTTCCA 1560 

AGCCCACTTG ACATCCCAGC AATCAAAGGT ATTAACCCAG ATACAGACGC TCAAGAAATT 1620 

CGTCCAGCAT CTGACGAAGA GCCATTTGCA GCTCTTGCCT TCAAGATCAT GACTGACCCA 1630 

TTCGTAGGTC GTTTGACATT CTTCCGTGTT TACTCAGGTG TTCTTCAATC AGGTTCATAC 1740 

GTATTGAATA CTTCTAAAGG TAAACGTGAA CCTATCCGAC GTATCCTTCA AATCCACCC? 1800 

AACAGCCGTC AAGAAATCGA CACTGTTTAC TCAGGTGATA TCGCTGCTGC CGTTGGTTTG 1860 

AAA. GAT ACT A CAACTGGTGA CTCATTGACA GATGAAAAAG CTAAAATCAT CCTTGAGTCA 1920 

ATCAACGTTC CAGAACCAGT TATCCAATTG ATGGTTCACC CAAAATCTAA AGCTGACCAA 1980 

GACAAGATGG GTATCGCCCT TCAAAAATTG CCTGAAGAAG ATCCAACATT CCGCGTTGAA 2040 

ACAAACCTTG AAACTGGTGA AACAGTTATC TCAGCTATGG GTGAACTTCA CCTTGACGTC 2100 

CTTCTTGATC GTATGCGTCG TCAGTTCAAA GTTGAACCGA ACGTAGGTGC TCCTCAAGTA 2160 

TCTTACCCTG AAACATTCCG CGCTTCTACT CAAGCACGTG GATTCTTCAA ACCTCAGTCT 22 20 

GGTGGTAAAG GTCAATTCGG TGATGTATGG ATTGAATTTA CTCCAAACGA AGAAGGTAAA 2 280 

GGATTCCAAT TCGAAAACCC AATCGTCGGT GGTGTGGTTC CTCGTGAATT TATCCCAGCG 2340 

GTTGAAAAAG GTTTGGTAGA ATCTATGGCT AACGGTGTTC TTGCAGGTTA CCCAATGGTT 2 400 

GACGTTAAAG CTAAGCTTTA TCATGCTTCA TATCACGATG TCGACTCATC TGAAACTCCC 2 4 60 

TTCAAGATTG CGGCTTCACT TTCCCTTAAA GAAGCTGCTA AATCAGCACA ACCAGCTATC 2 520 

CTTGAACCAA TGATGCTTGT AACAATCACT GTTCCAGAAG AAAACCTTCG TGATGTTATG 2 580 

GGTCACGTAA CTGCTCCTCG TGGACGTGTA CATGGTATGG AAGCACACGG TAACAGCCAA 2 640 

ATCGTTCGTG CTTACCTTCC ACTTGCTGAA ATGTTCGGTT ACGCAACACT TCTTCGTTCT 2700 

GCATCTCAAG GACGTGGTAC ATTCATGATG GTATTTGACC ACTACGAAGA TGTACCTAAG 2 7 60 

TCAGTACAAG AAGAAATTAT TAAGAAAAAT AAAGGTGAAG ACTAATCCGT CCTCACTCTA 2820 

GAAGGAAGTC ACTTAGTGGC TTCCTTTTGT CTTTAGAAAA TACCTCTAAA TATGGTAAAA 2 880 

TAGTAGAAGA ATAATGTGAG GAAAATGAAT GTCAAATAGT TTTGAAATTT TGATGAATCA 2 940 

ATTGGGGATG CCTGCTGAAA TGAGACAGGC TCCTGCTTTA GCACAGGCCA ATATTGAGCG 3000 

AGTTGTGGTT CATAAAATTA GTAAGGTATG GGAGTTTCAT TTCGTATTTT CTAATATTTT 3 060 

ACCGATTGAA ATCTTTTTAG AATTAAAGAA AGGTTTGAGC GAAGAATTTT CTAAGACAGG 3i20 

CAATAAAGCT GTTTTTGAAA TTAAGGCTCG GTCTCAACAA TTTTCAAATC AGCTCTTGCA 3130 
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GTCCTACTAT 


AGGGAGGCTT 


TCTCTGAAGG 


TCCATGTGCT 


AGTCAAGG TT 


TTAAC* n»V-U i 




TTATCAAAAT 


TTGCAAGTTC 


GTGCTGAGGG 


TAATCAGCTA 


ill ATTGAAG 


(jATL. luAAuL 


inn 


GATTGATAAG 


GAACATTTTA 


AGAAGAATCA 


TCTTCCTAAT 


TTAGCCAAAC 


AAV. L 1 unnAn 


J J o U 


GTTTGGTTTT 


CCAACTTTTA 


ACTGTCAAGT 


CGAGAAGAAT 


GATGTCCTGA 


CCCAAGAGCA 




GGAAGAGGCC 


TTTCATGCTG 


AAAATGAGCA 


GATTGTTCAA 


GCTGCCAATG 


AliOAAuCGt- * 


j *i o yj 


CCGTGCTATG 


GAACAACTGG 


AGCAGATGGC 


ACCTCCTCCA 


GCGGAAGAGA 


AAt.L.At»t.v_ T I 


J J ** VJ 


TGATTTTCAA 


GCGAAAAAAG 


CTGCAGCTAA 


ACCCAAGCTG 


GATAAGGCGG 


AGATTACTCL 


JOUU 


TATGATCGAA 


GTGACGACAG 


AGGAAAATCG 


TC7GGTATTT 


GAAGGGGTTG 


TTTTTOATO I 


1 fi 
J Oct 


GGAGCAAAAA 


GTGACTAGAA 


CAGGTCGTCT 


TTTAATCAAC 


TTTAAAATGA 


CGGACTATAC, 


J /iU 


TTCAAGTTTT 


TCTATCCAAA 


AGTGGGTTAA 


AAACGAGGAA 


GAGGCCCAGA 


AG i I 1GACCT 




CATCAAGAAG 


AATTCTTGGC 


TCCGAGTTCG 


AGGGAATGTG 


GAGATGAATA 


At. 1 I UALALIj 


j o**vj 


CGATTTGACT 


ATGAACGTAC 


AGGATCTGCA 


GGAAGTTGTT 


CACTATGACC 


CJCjAACKjA III 


JJuu 


GATGCCAGAA 


GGTGAGCGTC 


GGGTTGAGTT 


TCATGCTCAT 


ACTAACATGT 


CGACTATGGA 


i q fin 


TGCTTTGCCA 


GAGGTCGAAG 


AGATTGTTGC 


AACAGCTGCT 


AAGTGGGGAC 


ACAAC*OCL»tj a 




TGCTATCACG 


GACCATGGGA 


ATGTCCAGTC 


CTTTCCACAT 


GGCTATAAGG 


CGGCTAAUAA 




AGCGGGAATC 


CAGCTGATCT 


ATGGGATGGA 


AGCCAATATC 


GTGGAGGACC 


GTGTCCCTAT 


414U 


CGTCTATAAC 


GAAGTGGAGA 


TGGACTTGTC 


AGAACCAACC 


TACGTGGTCT 


rruACo tcaja 




AACGACGGGA 


CTTTCAGC?A 


TCTA7AATGA 


CTTGATTCAG 


GTTGCGGCTT 


CT AAUA 1 O 1 A 


4 260 


CAAGGGGAAT 


GTTATTGCTG 


AATTTGATCA 


ATTTATCAAT 


CCTGGGCATC 


C ClltlTCAU 




CTTTACTACA 


GAGTTAACTG 


GAATTACAGA 


TGATCATGTC 


kill m ^^f~*f*f** fi 

AAAAATGCC- A 




4 380 


ACAAGTTTTG 


CAAGAATTCC 


AAGAATTTTG 


CAAGGATACG 


GTCCTAGTT G 


\.\.LACAA k Vj*- 


■1440 


TACCTTTGAC 


GTTGGCTTTA 


TGAATCCTAA 


TTATGAGCGG 


CATGA7CTTC 


CAAAGATTAG 


4500 


TCAGCCAGTT 


ATTGATACGC 


TGGAGTTTGC 


TAGAAACCTC 


TATCCTGAGT 


ATAAACGCCA 


4560 


TCGTTTGGGG 


CCTTTGACCA 


AGCGTTTTGG 


TGTGGCCTTG 


GAACATCACC 


ACATGGCCAA 


4620 


CTACGATGCG 


GAAGCGACTG 


GTCGTCTGCT 


TTTCATCTTT 


ATCAAAGAGG 


TAGCAGAAAA 


4680 


ACATGGTGTG 


ACCGATTTAG 


CTAGACTCAA 


CATTGATCTA 


ATCAGTCCAG 


ATTCTTACAA 


4740 


AAAAGCTCGG 


ATCAAGCATG 


CGACCATCTA 


TGTCAAGAAT 


CAGGTAGGTC 


TAAAAAATAT 


4800 


CTTTAAGCTG 


GTTTCCTTGT 


CTAATACCAA 


GTATTTTGAA 


GGAGTGCCAC 


GGATTCCGAG 


4860 


AACGGTTCTA 


GATGCCCATC 


GAGAGGGCTT 


GATTTTAGCT 


TCAGCCTGTT 


CAGAGGGTGA 


4920 
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AGTTTTTCAC GTGGTCCTTT CTCAAGGTGT GGATGCGGCG GTTCAGCTGG CCAAGTATTA 4980 

TCATTTTATC GAGGTCATGC CACCGGCTAT CTATCCACCC TTGATTGCCA AAGAGCAGGT 5040 

CAAGGATATG GAGGAACTCC AGACCATTAT CAAGAGTTTG ATAGAGCTTG GAGACCGCCT 5100 

TGGCAAGCCT GTTCTGCCTA CGGGAAATGT TCACTATATC GAACCGGAAG AAGAGATTTA 5160 

TCGTGAAATT ATCGTCCGTA GTTTGGGACA GGGTCCGATG ATTAATCGAA CTATCGGTCA 5220 

TCGTGAACAT GCCCAACCAG CACCACTTCC AAAGGCTCAT TTTCGAACGA CTAATGAGAT 5280 

GTTGGATGAA TTTCCCTTTT TGGGAGAGGA ACTGGCTCGT AAAC7GGTTA TTGAAAACAC 53 40 

CAATGCCTTG GCAGAAATAT TTGAATCCGT TGAAGTCCTT AAGGGTGACT TGTATACGCC 5400 

TTTCATCGAC AAGGCTGAAG AAACAGTTGC TGAGTTGACC TATAAGAAAG C7TTTCAGAT 5460 

TTATCCAAAT CCGCTGCCAG ATATTGTTGA TTTGCCGATT GAAAAAGAAT TAACATCCAT 5520 

ACTGGGGAAT GGATTTCCTG TGATTTATCT GGCATCCCAG ATGCTGGTGC AACGTTCTAA 5580 

TGAACGGGGT TATTTGGTTG GTTCTCGTGG GTCTGTCCGA TCTAGTTTCG TTGCGACCAT 5640 

GATTGGGATT ACGGAGGTCA ATCCTCTCTC TCCTCACTAT GTCTGTGGTC AGTGTCAGTA 5700 

CAGTCAGTTT ATCACAGATG GTTCGTACGG TTCAGGATTT GATATGCCCC ATAAGGACTG 5760 

TCCAAACTGT GGTCACAAAC TCAGTAAAAA CGGACAGGAT ATTCCCTTTG AGACCTTCCT 5820 

TCCTTTTGA? GGGGATAAGG TTCCTGATAT TGACTTGAAC TTCTCGGGAG AAGATCAGCC 5880 

TAGCGCCCAC TTGGATGTGC GTGATATCTT TGGTGAAGAA TATGCCTTCC GTGCGGGAAC 5940 

GGTTGGTACG GTAGCTGCCA ACACTGCCTA TGGATTTGTC AAAGGTTACG AGCGAGATTA 6000 

TCGCAAGTTT TATCCTGATG CAGAACTAGA ACCCCTCGCT CAAGGAGCGG CCGGTCTCAA 6060 

GCGGACAACA GGCCAACACC CGGGGGGAAT CGTTGTTATT CCGAACTACA TGGATGTCTA 6120 

CGATTTTACG CCTGTCCAGT ATCCAGCAGA TGATGTCACG GCTGAATGGC AGACCACTCA 6180 

CTTTAACTTC CACGATATCG ATGAGAACGT CCTCAAACTC GATGTACTGG GACATGATGA 624C 

TCCGACTATG ATTCGAAAAC TTCAGGATTT GTCTGGTATT GACCCTAATA AAATTCCTAT 63 00 

GGATGACGAA GGCGTGATGG CACTCTTTTC TGGGACTCAT GTGCTAGGGG TAACACCTGA 6 3 60 

ACAAATTGGA ACGCCTACGC GTATGTTGGC GATTCCAGAG T7TGGAACAA ATTTCGTACG 64 20 

TGGAATGGTA GACGAAACCC ATCCGACAAC CTTTGCGGAA TTGCTTCAGC TGTCTGGTCT 64 80 

GTCCCACGGT ACTGATCTTT GGTTGGGGAA TGCTCAGGAT CTGATTAAGC AAGGAATAGC 6540 

GGACCTATCG ACTGTTATCG GTTGTCGGGA CGACATCATG GTTTACCTCA TGCATGCGGC 6600 

TCTGGAACCT AAGATGGCCT TTACCATTAT GGAACGGGTA CGTAAGGGTT TGTGGCTAAA 6660 

GATTTCACAA GAGGAGAGAA ATGGCTATAT CGAAGCAATG AAGGCTAATA AGGTGCCAGA 6720 
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GTGGTATATC GAATCCTGTG GGAAAATTAA GTACATGTTC CCTAAGGCCC ATGCGGCAGC 6780 

CTACGTTATG ATGGCCTTGC GTGTAGCTTA CTTCAAGGTT CACCATCCTA TTT ATT ACTA 6840 

CTGTGCTTAC TTCTCCATTC GTGCTAAGGC TTTTGATATC AAGACCATGG GTGCGGGCTT 6900 

GGAGGTCATC AAGCGCAGAA TGGAAGAAAT CTCTGAAAAA CGGAAGAACA ATCAAGCCTC 6960 

TAATGTGGAA ATCGATCTCT ATACAACTCT TGAGATTGTC AATGAGATGT GGGAACGAGG 7020 

T7TCAAGTTT GGTAAATTAG ATCTCTACTG TAGTCAGGCG ACAGAGTTCC TCATCGACGG 7 080 

GGATACCCTT ATCCCACCAT TTGTAGCAAT GGATGGTCTG CGAGAGAACC TTGCCAAGCA 7140 

ACTGGTGCGG GCGCGTGAAG AGGGAGAATT CCTCTCTAAA ACAGAACTAC GCAAGCGTGG 7 200 

TGGACTCTCA TCAACCTTGG TTGAAAAGAT GGATGAGATG GGTATTCTTG GAAATATGCC 72 60 

AGAGGATAAC CAGTTGAGTT TGTTTGATGA GTTGTTTTAA AAAATTGCTT AATAATCTAT 7 320 

TAAAAGAGGC TAACGTATAT CCAATAGATT TACATTAGCT TTCTTTTTTG TTAAAATAGT 7380 

CTATGGAAAG AGGGTGAGAG TATGTCAAAG ATGAGTATAA GCATCCGTCT GG ATAGT GAG 74 40 

GTTAAGGAGC AGGCCCAACA GCTGTTTAGT AATCTGGGAA TGGATATGAC AACAGCTATT 7 500 

AATATTTTCC TTCGTCAGGC AATTCAATAT CAGGGATTAC CTTTTGATGT TAGACTAGAC 7 560 

GAAAATCGGA AGTTGCTCCA AGCGTTAACG GATTTAGACC AAAATCGTAA TATGAGCCAG 7 620 

TCTTTTGAAT CAGTCTCAGA TTTGATGGAG CACTTACCTG CTTAAGATTC GTTATCATAA 7 680 

ACAGTTTAAA AAAGATTTTA AGTTGGCTAT GAACCGTGGT TTCAAGGCAG AATTATTAGA 774 0 

AGAACTTTTG AATTTTCTGG TTCAAGAAAA AGAACATCCT GCCAGAAATC C7CATCATTC 7 800 

ATTGACGGCA TCCAAGCATT TTCAAGCAGT TCGTGAATGC CATACCCACC CAGATTGGCT 7860 

TTTGGTTTAT AAAGTAGACA AGTCGGAATT GATTTTAAAT TTGCTGAGGA CAGGCAGTCA 7920 

CAGTGATTTA TTTTAATCTA TTTTAAGGGG CTTCTCATGA AACTAAGAAT ATTTGCGGAA 7980 

GATAAGCCGG CTAAGAAGGT ATTTGAATAT CAATTAGAAC TTGCTGATCG TACAATTCTT 8040 

CTATCGACAG CACTCTTGTC AGGTGCTATT GCTTTAGCAG GAATCTTTTC TGCTTTGAAA 8100 

GAAAAATAAA AATAGAAAAG AGAAAACAGA ATGGTTTTAC CAAATTTTAA AGAAAATCTA 8160 

GAAAAATATG CGAAATTGTT GGTTGCGAAC GGAATTAACG TGCAACCTGG TCACACTTTG 8220 

GCTCTCTCTA TTGATGTGGA GCAACGTGAA TTGGCACATC TAATCGTGAA AGAAGCTTAT 8280 

GCCTTGGGTG CGCATGAGGT CATCGTTCAG TGGACAGATG ATGTGATTAA CCGTGAGAAA 8340 

TTCCTCCATG CCCCGATGGA GCGTTTGGAC AATGTGCCAG AATACAAGAT TGCTGAGATG 84 00 

AACTATCTCT TGGAGAATAA GGCTAGCCGT CTTGGAGTTC GTTCATCTGA TCCAGGTGCC 84 60 
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TTGAACGGAG TGGACGCTGA CAAGCTTTCA GCTTCTGCTA AAGCTATGGG ACTTGCCATG 8520 

AAGCCTATGC GTATCGCAAC TCAATCTAAC AAGGTTAGCT GGACTGTAGC AGCTGCAGCA 8530 

GGACTTGAGT GGGCTAAGAA AGTCTTCCCA AATGCTGCGA GCGACGAAGA AGCAGTTGAT 8640 

TTCCTTTGGG ACCAAATTTT CAAAACTTGC CGTGTCTACG AAGCAGATCC TGTTAAGGCT 8700 

TGGGAGGAAC ATGCAGCCAT TCTCAAGAGC AAGGCCGATA TGCTTAATAA GGAGCAATTT 87 60 

TCAGCCCTTC ACTACACAGC GCCAGGAACA GATTTAACAC TTGGTTTGCC AAAGAACCAC 8820 

GTTTGGGAAT CAGCTGGTGC TGTCAATGCA CAGGGCGAAG AATTCTTCCC AAATATGCCA 8880 

ACAGAAGAGG TCTTCACAGC CCCTGACTTC CGTCGTGCAG ATGGTTATGT CACTTCTACA 8940 

AAACCGCTTA GCTACAACGG AAATATCATT GAAGGCATTA AGGTGACCTT TAAGGATGGA 9000 

CAAATCGTAG ATATCACTGC TGAGAAGGGT CATCAGGTTA TGAAAGACCT TCTCTTTGAA . 9060 

AATGCGGGTG CCCCTGCCTT CCGTGAATGT GCCTTGGTAC CAGATCCAAG TCCAATTTCT 9120 

CAGTCAGGCA TTACCTTCTT TAACACCCTT TTCGATGAAA ATGCCTCAAA CCACTTGGCT 9180 

ATCGGTGCAG CCTATGCGAC TAGCGTTGTT GATGGAGCGG AGATGAGCGA AGAGGAGCTT 924 0 

GAAGCTGCAG GCCTTAACCG TTCAGATGTT CACGTAGACT TTATGATTGG TTCTAACCAA 9 300 

ATGGATATCG ATGGTATTCG TCACGATGGA ACGCGGGTAC CTCTTTTCCG TAATGGGAAT 9 360 

TGGGCAAATT AAGGAGATAA TATGTTAGGA AGTATGTTCG TTGGTCTCCT AGTGGGATTT 9420 

TTAGCAGGTG CTATGACCAA TCGTGGAGAG CGAATGGGAT GTTTTGGAAA AATGTTTCTC 9480 

GCTTGGATCG GACCCTTTCT AGGTCACTTG CTCTTTGGAA CTTGGGGCCC AGTTTTATCA 954 0 

GGAACAGCTA 7TATCCCAGC GATTTTAGGA GCCATGATTG TTTTACCTAT TTTTTGGAGA 9600 

CGAGGAA 9607 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14231 base pairs 
Oj TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(Di TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

CTACAAGATA ATTCCAGCTA TAACATCCGC TATAATAGTA AGAGCCAGC7 CTATGATAAG 60 

CCTCATTAGT TTCACCTCCT CTCACGAACC CATAGGAACG TAATCGGTAA CCGATGACAA 12 0 

AAATAGTATA CCACAATACA TTTAGATCAT CAAGGTCACT TAATTCTTGA AATATCAGAT 180 

CTAAGAGAAA AATCTTTAAA ATCAGAAAAA CGCATAATAT CAGGTGTGCA AAAACTTGAT 24 0 
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ACTATGCCTT TTATTGTGGG AAGGTTTACT CCATTTTCTC CTGAAATTGA GTTTTTGTCC 3 00 

AGCCTCTGTT TTTAGGGTTG CTAAGAAAAT AATGTCATGT GGTGAATATT TGTAAATCAC 3 60 

TCAGCAGACA GAACGATACT CTTCGAAAAT CTCTTCACAT CATGTCAGCT TCGTCTTTCC 4 20 

GTATATATGT GACTGACTTC ATCAGTTCTA TCTACAACCT CAAAACAGTG TTTCGAGCTG 480 

ACTTGATCAA TTTTCAAATC TGTACTTTGA GCAAGCTGAG ACTAGCTTCC TATTTGATTT 540 

^TCATTGAATA TCAGAAACCC ATTCTCCATC AAATAATTCG ACTGCGTCTA ATAATTTTTG 600 

ATCTGGCACG GTGTCTGAAA TAAAGGTTGT GTATTTGGAC AGGGCATTAA TTTTAAAAAA 660 

TCCAGTCTTG TAAAATTTAG AACTATCAAT CAGTAAGATG GTTTCATGGG CTTTGTCAAT 7 20 

AATATTCTTT TTTGAAATAG CTTGGCTGAG AGAAGCTTCA TAAACATATT GGTCATCAAT 7 80 

ACCTCTTGCT GAACAAAATG CTAAATCGAT ATTAAAATGA TCTAATAAAG AATTTTCCTT 840 

ATCATAGTTG ACCACGGAAC AGGATTGATG TTTGACCTCG CCAGATGTGA TAAAGATTTT 900 

GGACCTATCT TTAACAGTTT CAGATAGGGT TTGTGCAGTA TGTAAACCAT TTGTAAAAAT 960 

AATCAAATTA TCAAGTTCAG AAAGATAGGG ACAGAGTTCG TAGACACTAG TACTAGAATC 1020 

TAG AT AG ATA C AC AT AC C AG ACCGAATAAA GTCTTTAGCG AGACTAGCGA TTACTCTTTT 1080 

TTGCCTAGTA CTTTCTCCTT CACGTATTTG ATGAGAAAGT TCAATTGTGT TCATAGACGA 1140 

CAGGGTCACG TATCCCTCCT TTCTTTTGAT AAGACCTTGA TTTTCTAAGA AAATTAAATC 1200 

ACCACCTAAC GTACTTGTGC TGGAGAAAGT GATTTCTGCC ACCTCTTTTA CCGCAATTCT 1260 

TTTTTTCTTT TTGATAATTT CAATCAATTC AAGTACACGT TCATCTTTTA TCATAAGCTC 1320 

CTCCTAATTT ATCATTTCAA CT AT ATT AT A GCACAAATTG GAGGAATTTG AATTATTTTT 13 SO 

ATGAATATTG GGTTAACATT TGAACATTAT TCAACTAACC GTTCACATAT TGAAAAAATA 14 40 

AAACGTGGGG ATTATAATAA AGTTAATCmA GGACGAAGAG AGAAGAAAAA TGGAAGCGGT 1500 

TTTAGCAATA GATTTAGGTG CGACTTCTGG AAGAGCAATC GTTGCTTACC TTTCTGAAAA 1560 

TAAACTAGTA ATGGAAGAAA TAAATCGCTT TTCTAATCTA CCTATTAGAG 7AAAAGGGCA 1620 

TTTATCTTGG GATATTGACT TTCTACTAGC TAAAATTCTT GAAAGTATCC CCTTGGCTAA 1680 

TACTAGTTAC AAGATTTTAT CTATCGGTAT TGACACATGG GCAGTTCATT TTGGACTGAT 1740 

TGATAATGAA GCTAAGCTCT TATTACAACC TGTTCATTAT CGTGATGAAA GAACAAAGGG 1300 

AGTGTTAAAG GAAATATCTG AAATGACTGA ATTAGAAAAA CTGTATTCAC AGACAGGAAA 1360 

TCAGATTATG GAGATAAATA CCTTGTTTCA ACTCTTTAAG GCACGTCAAG AATCTCCTGA 1920 

CTCTTTCTAT AAGACCAATA AGATTCTTTT AATGCCAGAT TTGTTTAATT ATCTCTTGAC 1980 
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ACGTAAG7TT GCTACACAAA AAAGCATTCC TTCAACAACT CAATTATTTG ATCCTAGGAG 2040 

7CAAAATTGG AATCAGAATA TCTTAAAACT ATTTGAATTG GATTCATCTT TACTTCCTGA 2100 

AATTGTTTCA GAGGGAAATG TTCTTGGAAG GATAAAAGAG GAGTATGGTT TAGGCGATAT 2160 

TCCTGTTGTG AATGTTTGTA GTCATGATAC AGCAAGCGCG ATTGTCTCAG TACCTAACAC 2220 

AGAAGGTAGT TTATTTATTT CATCAGGTAC TTGCTCTTTC GTTGGAGTGG AACTTACTTC 2280 

ACCGATTCTT ACTACCGAAT CCTTCAGTTA TGGATTTACA AATGAAGTCG GTAAAGATGG 2 3 40 

AGTGATTACA TTTCTGAAGA ATTGTACAGG GTTGTGGATC ATAGAGGAAC TAAGACGTTC 2400 

ATTTGAACGA AGAGGGAAAG CCTATTCTTT TGATGATATT AGGACAATGG TGGAGAAAGA 24 60 

AAAAGAAAAT CTTCCTCTGA TTGATACTGA ATCAACTGAA TT7GCAACAG AATCTGATAT 2520 

GCACAAGACT TTGACAGAAT ATCTAGCTTA TCATCATGAA ACTAGAGAGT GGACAGATGG 2580 

ACAACTATTT AAGATTGTTT ATGAAAGCCT AGCTGAAACG TATAGGAAAG CGATAGAGTT 2640 

ACT AG AAG AA CTAACTCATA AGGTTTATAA GAGGATATAT GTCATTCGAC GACGTGCTAG 2700 

AGCCAGTTAC TTTAACCAAA TGATTGCTGA TAGAACTGGT AAAGAGGTTC TTACAGGTTT 2760 

GACTGAGGCT ACAGCTGTGG GGAATATTGT TGTGCACC7C ATACCTATCG GACAATTAAA 2820 

AGGGATGGAA GAGGCTCACC ATCTTATTGA GGAGTTTCTA CAATTAGAGA GTTATTACTC 2 880 

CCAAAAGAAT TAAAAAGATT GAGAGTTTGT AAATTTGCCT CCCTCCCCCT TCTTAGCTTT 2940 

TGTGCAGGAA GGGGGGATAA TTGGTGAATT GAAAAATATT TACTGTTTTG ATATGAGGAG 3000 

GACAAGGATG TCAGATGTAA AACAAGAATT AATTAAATAT GGTAAGAAGC TAGTAGAAAC 3060 

AGATTTGACG AAAGGAACAG GTGGGAATCT CAGCGTTTTC CATCCTGAAA AACAATTGAT 3120 

GGCAATTACC CCGTCGGGTA TTGATTTCTT TGAAATCAAA GAATCCGATA TTCTAGTGAT 3180 

CGATATTAAT GGAAATGTTC TAGAGGGAGA ACGCTTGCCA TCTAGCGAAT GGTATATGCA 3240 

TTTGATTCAA TATCAAACTC GTGATGATAT CGATCCAATT ATCCATGCTC ATACAACTTA 3 3 00 

TGCAACAGTA TTAGCTTGTC TCAGAGAACC ACTTCCAGCG AGTCATTATA TGATTGCAGT 3 360 

CCCAGGGAAA GATCTTCCGG TAGCTGAGTA TGCAACATAT GGCACGAAAG AATTGGCTGT 3420 

GAATGCAGCT AAAGCAATGG AAGGTCGTAG AGCAGTTTTA CTAGCGAATC ATGGAATTTT 3480 

AGCAGGTGCA CAAAATTTAT TGAATGCATT TAATATTGTT GAAGAAGTTG AATATTGTCC 3540 

AAAAATTTAT TC7TTAGCTA AGAATTTTGG AGAGCCAGTA GTTCTTCCTG ATGAGGAGAT 3 600 

GGAATTGATG GCAGAAAAAT TTAAAACATA CCCTCACACA AAATAGGGAG GATATTAATC 36 60 

TT AAAAC AT A TACCGAAAAA TATTTCTCCA GATTTATTGA AGACTTTAAT GGAAATGGGA 3720 

CATGGAGATG AAATAGTATT AGCTGACGCG AATTATCCTT CTGCCTCATC TGCAAATAAG 3 780 
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TTATGTTTTG 


GATTGTTAGC 


AGGAGTTAAG 


GCTGATAATC 


GTCTTCAAAT 


GAGAACGACG 


9060 
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ATTAATAATG AATCCCCATT GTTCCTTTCT 
TTATGGTGGG GGAACACATT GAAGGGAGCA 
TATGCAGCGA TTGAACTTCA TCCTGCAAAA 
GATACGAAAG AATTGAGAGA ATGGTATGTC 
ATTCCAGTTT TCTTCCTTAT TATGTCGGCT 
TTAGATGAAC AATTCCAAAA GTATAGTGTG 
TGGATTTACA ATAACCAGTT AGCTCCGCAT 
TATGGAGCGC ATTTTATCTG GCATGATCAT 
GATCCGACAT TCTTTGAAGC GAGTCAAAAA 
AATACGCCAA TAAGAGATGA TGCGGGTACA 
GGCTTATGTG ATAACTGGGG CTCATCAACA 
ACAAACACAT TTGAAACTGG AAGAGCTAGG 
TCAATGATTG CTATGGAAAT GATGAATGTA 
GAATGTGCCG CGTATACATT TATGACAAAT 
ATTATTCCTT TCTTTAGACA TGCTATACAA 
AATAGAACAA AAGCTGTATT TTGGAATGGA 
TATCAAGGAC TTTATTCGAA TGATGAAACA 
ATTCTTCCTG TAATACATGA GAAAATTGAT 
GCAAAAATTT TGACTAAAAA TAGTGAGGAA 
CTTTATCCAA AACTTTATGA AGGAGATGGG 
ATTTATAATA GTAATGCTAA TATCAATAAA 
AATAATACAA AGTCGTTATC GTTAGATTTG 
AATCCAAATA ATTTACATAT TTTATTGAAT 
GCATTATCAG GAAATTTTGA TGCATCAAAA 
AACTGGATAA GCAAAAATTA TTCCATCAAT 
CTTACATTAA AAGGGCATAC TGGTCATAAA 
CATTATACTT ATACAGAAAA TTGGGATGAG 
CATAATGGAA TGGTAGAGAT GTCTATAAAT 
ACACCAGATA AATTTAATGA TGGTAATTTG 
AGTTCTGTAG ATTACAATGG AGACCCTAAT 
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CCGTTGTATG GCAATGATAA TGGTAACGGA 9120 
TGGGAAGCTA TTCCTGAAGA TGTAAAGCCA 9180 
GTCTGTAAAC CAACAAGTTG TATTCCACGA 9240 
AAGATGTTGG AGGAAGCTCA AAGTCTAAAC 9300 
GGAGAGCGTA ATKCACTTCC TCCAGAGTGG 93 60 

TTAAAAGGTG TTTTAAATAT TGAGAATTAT 9420 

AGTGCTAAAT ATTTGGAAGT TTGTGCCAAA 94 80 

GAAAAATGGT TCTGGGAAAC TATTATGAAT 9540 

TATCATAAAA ATTTGGTCTT GGCAACTAAA 9600 

GATTCTATCG TTAGTGGATT TTGGTTGAGT 9660 

GATACATGGA AATGGTGGGA AAAACATTAT 9720 

GATATGAGAT CCTATGCATC GGAACCAGAA 9780 

TATACTGGGG GAGGCACACT TTATAATTTC 9840 

GATGTACCAA CTCCAGCATT TACTAAAGGT 9900 

AATCCACCTC CAAGTAAGGA AGAAGTTGTA 9960 

GAAGGTAGGA TTAGTTCATT AAACGGATTT 10020 

ATGCCTTTAT ATAATAATGG CAGATATCAT 10080 

AAGGAAAAGA TTTCATCTAT ATTCCCTAAT 10140 

TTGTCTACTA AAG7CAACTA TTTAAACTCG 10200 

TATGCTCAGC GTGTAGGTAA TTCCTGGTAT 102 60 

AATCAGCAAG TAATGTTGCC TATGTATACT 103 20 

ACGCCACATA CTTACGCTGT TGTTAAAGAA 10380 

AATTACAGGA CAGATAAGAC AGCTATGTGG 1044 0 

AGTTGGAAGA AAGAAGAATT AGAGTTAGCG 10500 

CCTGTACATA ATGACTTTAG GACAACAACA 10560 

CCTCAGATAA ATATAAGTGG CGATAAAAAT 1062 0 

AATACCCATG TTTATACCAT TACGGTTAAT 10680 

ACTCAGGGCA CAGGTCCAGT CTCTTTCCCA 10740 

AATATAGCAT ATGCAAAACC AACAACACAA 10800 

AGAGCTGTGG ATGGTAACAG AAATGGTAAT 10860 
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TTTAACTCTG GTTCCGTAAC ACACACTAGC GCAGATAATC CCTCTTGGTG GGAAGTCCAT 10920 

TTGAAAAAAA TGGATAAAGT TGGGCTTGTT AAAATTTATA ATCGCACAGA TGCTGAGACT 10980 

CAACGTCTAT CTAATTTTGA TGTGATTCTA TATGACAATA ATAGAAACGA ACTTCCTAAG 11040 

AAACATGTTA ATAATTTGTC GGGTGAATCT GTTAGTCTAG ATTTCAAAGA AAAAGGAGCA 11100 

AGGTATATTA AAGTTAAATT ACTAACGAGT GGAGTGCCTT TGAGTTTAGC AGAAGTAGAG 11160 

GTTTTTAGAG AATCAGATGG TAAGCAATCT GAAGAGGATA TAGATAAAAT AACAGAAGAT 11220 

AAAGTAGTCT CTACAAATAA GGTAGCTACT CAAAGTTCAA CCAATTATGA GGGTGTAGCT 11280 

GCTTTAGCAG TTGATGGTAA TAAAGATGGA GATTACGGAC ATCATTCGGT GACTCATACT 11340 

AAGGCAGATT CTAACCCTTG GTGGCAGGTC GATCTGGGAG AAGAGTTTAC GGTTTCTAAA 11400 

GTTGATATTT ATAATAGAAC AGATGCCGAA CCTCAGCCTT TATCTAATTT TGATGTTATT 11460 

TTTCTATCTT CATCAGGAGA AGAAGTTTTT AGAAGACATT TTGATAAAGT AGTTGATGGT 11520 

TTGTTATCTT TAAAAGTACC TTCTGTAGGG GCTAAGCTAG TCAAAATAGA ATTAAAATCA 11580 

GCAGCTATTC CGTTAA G TTT AGCGGAAGTT GAACTCTATG GTTCAAAGAG AACTCCGAAG 11640 

AAACTTTCTA ATATTGCATT AACAAAAGAA ACTCGACAGA GTTCAACGGA TTACAATGGT 11700 

TTTTCTCGTC TAGCAGTTGA TGGAAATAAA AACGGAGATT ATGGTCATCA TTCAGTGACT 11760 

CATACCAAAG AAGATTCTCC TTCATGGTGG GAGATAGATT TAGCACAAAC CGAAGAATTA 11820 

GAAAAGTTAA TTATTTATAA TAGAACAGAT GCTGAAATTC AGAGATTATC AAATTTTGAT 11880 

ATTATTATAT ATGATTCAAA TGATTATGAA GTTTTTACAC AACATATTGA CAGTTTAGAA 11940 

AGCAATAATC TATCCATAGA CTTAAAAGGA CTGAAGGGAA AAAAGGTTAG AATTTCTTTG 12000 

AGAAGCGCAG GAATTCCTTT AAGTTTAGCA GAGGTAGAGG TTTATACTTA TAAGTAATTT 12060 

TAAAAATTAT CACCCAGGCT ACCGTAAATA TAATGGAGAT GGTAGTATGA AAGAAACAGA 12120 

AAAAT AAG AG GAAAATAGTA TGATTCAACA TCCACGTATT GGGATTCGTC CGACTATTGA 12180 

TGGTCGTCGT CAAGGTGTAC GCGAATCACT TGAAGTGCAA ACAATGAACA TGGCTAAAAG 12 240 

TGTGGCACAT TTGATTTCAA GCACATTGAA ATATCCAGAT GGGGAACCTG TGGAATCCGT 12300 

GATTTCTCCA TCTACTATTG GCCGTGTACC AGAGGCTGCA GCTTCCCATG AGTTGTTTAA 123 60 

AAAATCAAAT GTTTGCGCAA CAATTACAGT TACACCATGC TGGTGTTATG GTAGTGAAAC 12420 

?ATGGATATG TCTCCAGATA TTCCTCATGC TATTTGGGGA TTTAATGGGA CAGAACGCCC 12480 

AGGAGCTGTC TATCTTGCAG CTGTACTAGC TTCACATGCT CAAAAAGGGA TTCCAGCCTT 12540 

TGGGATTTAT GGAAGAGATG TTCAGGAAGC TAGTGACACA CATATTCCAG AAGATGTCAA 12 600 
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AGAAAAACTT 


TTACGCTATG 


CGCCTGCAGC 


TCTTGCAACT 


GCCTTCATGA 


GAGACACTGC 


12660 


TTACCTATCA 


ATGGGTAGTG 


TTTCGATGGG 


CATTGGTCGT 


TCTATTGTAA 


ATCCGGATTT 


12720 


CTTCCAAGAA 


TACTTAGGAA 


TGCGAAATGA 


ATCGGTAGAT 


ATGACGGACT 


TCACGCGCCG 


12780 


TATGGACCGT 


GGTATTTACG 


ACCCTGAAGA 


GTTCGAACGT 


GCGCTCAAAT 


GGGTGAAAGA 


12840 


AAACGTAAAA 


GAAGGATTCG 


ACCATAACCG 


TGAAGACCTT 


GTTTTAAGCC 


CTGAAGAAAA 


12900 


AGATAGACAA 


TGGGAATTTG 


TTATTAAGAT 


GTTCATGATT 


GGACGTGACT 


TAATGGTTGG 


12960 


TAACCCAAGA 


CTTGCTCAAC 


TTGGTTTTGA 


GGAAGAAGCG 


GTTGGTCACC 


ATGCTTTAGT 


13020 


AGCTGGTTTC 


CAAGGTCAAC 


GTCAGTGGAC 


AGACCATTTT 


CCAAATGGGG 


ACTTTATGGA 


13080 


AACTTTCCTC 


AATACTCAGT 


TTCACTGGAA 


TGGTATTCGA 


AAACCATTTC 


TATTTGCGAC 


13140 


AGAGAATGAT 


TCACTAAATG 


GTGTGTCTAT 


GCTCTTTAAT 


TATCTATTAA 


CAAATACTCC 


13200 


ACAAATCTTT 


GCTGATGTGC 


GTACTTATTG 


GAGCCCAGAG 


GCTGTTAAAC 


GTGTAACGGG 


13260 


ACATACTTTA 


GAGGGTCGTG 


CTGCAGCTGG 


CTTCTTACAT 


CTAATCAACT 


CTGGTTCTTG 


13320 


TACATTGCAT 


GGTACAGGTC 


AAGCTACTCG 


AGATGGCAAA 


CCTATTATGA 


AACCATTCTG 


13380 


GGAGTTGGAA 


GAAAGTGAAG 


TGCAGGCTAT 


GCTTGAAAAT 


ACACACTTCC 


CACCAGCAAA 


13440 


CCGCGAATAC 


TTCCGTGGAG 


GAGGATTCTC 


AACTCGTTTC 


TTGACGAAGG 


GGGATATGCC 


13500 


AGTAACAATG 


GTACGTCTCA 


ATCTTCTAAA 


AGGGGTTGGT 


CCAGTGCTAC 


AAATTGCAGA 


13560 


AGGTTACACA 


CTTCAACTTC 


CTGAAGATGT 


TCACCATACT 


TTAGATAATC 


GTACAGATCC 


13620 


AGGATGGCCA 


ACTACTTGGT 


TTCCTCCACG 


TTTGACAGGA 


AAAGGTGCTT 


TCAACTCTGT 


13680 


CTATGACGTC 


ATGAATAATT 


GGGGAGCTAA 


TCACGGAGCC 


ATAACATATG 


CACACATTGG 


13740 


AGCAGACTTG 


ATTACCTTGG 


CTTCTATGTT 


GAGAATTCCT 


GTCAATATGC 


ATAATGTACC 


13800 


TGAGGAAGAT 


ATCTTTAGAC 


CTAAAAATTG 


GTCCTTATTT 


GGAACAGAAG 


ATCTAGAATC 


13860 


AGCAGACTAT 


CGTGCATGTC 


AGTTGTTGGG 


GCCACTACAT 


AAATAAAACT 


TGTTTATATA 


13920 


GGAGGTGAAC 


TTACGTCCCT 


CCTATCCTTT 


TAAAAAGATT 


TGTTAAACAA 


TTCACAAA1A 


13980 


ATTGAAAACG 


AATACAAAAA 


GTAATATAAT 


GATGTTAAAT 


AGATACCCCG 


GAGGCGCAGG 


. 5040 


AGGAAAATTA 


TATGGCTATA 


TTTTATGTTC 


CGGCAGTCAA 


CCTTATTGGA 


AAAGGTGTTC 


14100 


TAAATGAAGT 


GGGTCCTTAT 


ATCAAGGAAC 


TTGGCTATAA 


AAAGGCACTT 


TTGGTGACAG 


14160 


ATAAGTACAT 


CGAAGGCAGT 


GATATTTTAC 


CTAAGACTTT 


AAAACCACTG 


GATACAGAAG 


14220 


GAATCGAATA 


T 










14231 


(2) INFORMATION FOR SEQ ID NO: 82 











U> SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 16995 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



AGTTCTCTTA 


ACTTTTTTAG 


OA I uuLn 1 1U 




GGTACTCATT 


TTCTGCTgAA 


60 


GACGTTCTAA 


TTCTGTCCTC 


I v. 1 1 V_ AVjVj 1 V. 


I V.U 1 I I 1 1 \JU 


CTTACGTCCC 


ATTTTAGGTA 


120 


CTCTCCCTCT 


TGTTTTCTCA 


ACAATAGTAT 


ACCCGTTTTT 


C C TGT ATTGT 


GCTAGCCAGT 


180 


TAAGAAGTAT 


CGTACGACTT 


(juuAl»Al.l.lj I 


All 


AAC^CTA^CT 


TTACTCCAGC 


240 


CTTCATGTCA 


G ACTTTAT T A 


CTCATTTCTT 


rTTTT A AATC 


AGGAGAATAG 


TAACGATTTT 


300 


TTCCTTTTTT 


GACGAACTCT 


ATTCt-O I AAL 




TTTAATCATG 


TACCTAATAT 


3 60 


TAGAATTGCT 


TATCCCAAAT 


1 pt * <wwpr a & s 
l 4 A i l i y/w\ 




GCTATATCCT 


TGTTTTCTAA 


420 


GTTCATAGAT 


C TGAA CTTT A 


T\_A 1 U A 1 AAU 


l l i\\J ILL ^-rt I 


AATAAAAACA 


CCCCAAAAGT 


480 


TAGATTTTTT 


CTGTCTAACT 


Pi Lvjuuu * \J 1 


AO 1 1 \-t\ i vj I /A 


CACCTGATAT 


GATGCGTTTT 


540 


ATAATTTTTA 


AGCCTTTTTG 




1 v. AAAAO I nn 


TCTTTTGACA 


CAAAATCTGT 


600 


GACAAAACTT 


TAGTTTTAAA 


GO l~l~l"l w l AAv. 


TTTTTATATA 

L 1 IVjiAlAlA 


PTAGTTTTAA 


GAAAAGGAGG 


660 


ATGATCTAAT 


GGAAGAAAAA 


GTATCATTGA 


AAO 1 Lnuuu l 


TPAJVAAACTA 


GGGACATCGC 


720 


TTTCAAATAT 


GGTTATGCCC 


AA 1 A I 


f* ATTT ATTG C 


TTGGGGAGTA 


TTGACTGCCC 


780 


TCTTTATCGC 


TGATGGCTAT 


CTGCCAAATG 


AACAGTTAGC 


TACTGTTGTT 




840 


TAACGTATTT 


ATTGCCAATC 


CTGATTGGTT 


ACACAGGTGG 


ATATATGATC 


CATGGCCAAC 


900 


GTGGTGCCGT 


TGTAGGACCT 


ATTGCTACTG 


TTGCTGCAAT 


CACAGGTTCT 


AGTGTTCCTA 


960 


TGTTTATCGG 


AGCTATGGTA 


ATGGGCCCAC 


TGGGAGGATG 


GACTATCAAG 


AAATTTGATG 


1020 


AGAAGTTCCA 


GGAAAAAATT 


CGTCCCGGAT 


TTGAAATCTT 


AGTTAATAAC 


TTCTCAGCTG 


1080 


GTCTCGTTGG 


TTTTGCATTA 


TTCCTTTTCG 


CTTTCTACGC 


AATCGGTCCA 


GTCGTATCGA 


1140 


CTCTTACTGG 


AGCTGTTGGG 


AATGGTGTTG 


AGGCTATTGT 


CAATGCTCCC 


CTCCTTCCTA 


1200 


TGGCTAATAT 


TATCATCGAA 


CCGGCTAAAG 


TCCTTTTCCT 


CAATAATGCC 


CTCAATCATG 


1260 


GCATTTTTAC 


TCCTCTGGGA 


GTAGAACAGG 


TAGCTCAAGC 


TGGTAAGTCA 


ATTCTCTTCC 


1320 


TATTGGAAGC 


TAATCCTGGA 


CCAGGTCTGG 


GAATTCTATT 


AGCTTATGCT 


GTATTCGGTA 


1380 


AAGGTTCTGC 


TAAATCTTCT 


TCTTGGGGGG 


CAATGGTTAT 


TCATTTCTTC 


GGAGGGATTC 


1440 


ATGAAATTTA 


CTTTCCTTAT 


GTTATGATGA 


AGCCTACTCT 


ATTTTTAGCT 


CCTATGGCAG 


1500 
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GAGGTATCTC TCGAACTTTT AC TTTTCAAC TCTTAGACGC TGGTCTTAAA TCTCCAGCTT 1560 

CACCAGGTTC TATTATTGCG ATTATAGCTA CGGCGCCAAA AGGTGTTTGG CCCCATCTAA 1620 

ATGTTCTTTT AGGTGTTTTA GTGGCAGCAG TTCTTTCTTT CCTTGTAGCA GCCCTTATTC 1680 

TTCATGCAGA CAAGTCAACT GAGGATTCGC TCGAAGCTGC TCAGGCGGCT ACCCAAGCAG 1740 

CTAAGGCTCA GTCTAAAGGT CAGTTAGTAT CAACTTCTGT TGATGCAGTT GTTTCGACAG 1800 

ACTCAGTGGA AAAAATCATT TTCGCCTGCG ATGCTGGTAT GGGAAGCTCT GCTATGGGAG 1860 

CTAGTATTCT TCGAGATAAG GTTAAAAAAG CAGGTCTAGA GATTCCAGTA TCTAATCAGG 1920 

CAATCTCAAA TTTGCTTGAT ACACCAAAAA CATTAATTGT TACTCAGGAA GAACTGACAC 1980 

CAAGAGCTAA AGACAAGAGT CCAAGTGCTA TTCATGTTTC TGTTGATAAT TTCTTAGCCT 2040 

CCTCTCGTTA TGATGAAATT GTAGCTTCAT TAACAGGAGC TTCTCCAATA GCAGAAATTG 2100 

AAGGAGATAT ACCAACTTCA GCACCAGTAG ATAGTCAGGA AAGTGACCTT AACCATATTG 2160 

ATGCTGTAGT AGTTGCTTAT GGTAAAGCAC AGGGAACTGC AACTATGGGC TGTGAAACGA 2220 

TTCGGGCTAT TTTTAGAAAC AAGAATATTC GTATTCCAGT TTCTACTGCC AAAATTTCAG 2280 

AATTAGGTGA ATTTAATTCT AAAAACATAA TCATTGTAAC AACTATTTCT TTACAGGCAG 2340 

AAGTGCAGCA AGCAGCACCG AATTCTCAAT TTCTTATTGT GGATACTTTA GTAACAACAC 2400 

CAGAATATGA CAAAATGGCT GCTAGAATGT ACAAATAGAA CTAGAGGTTT CTAAATTACG 2460 

AATGCTATTA ACCAAACGAG AAGAACAATT ATTGAAGGCT TTCCTACATG TAGGGAAGCT 2520 

TTCAATGCAA GATATGACTG AAATCTTACA GGTTTCATCT AGAACAATTT ATCGAACTTT 2580 

ATCAGATTTG ACAGATAGCA TGGAGCAATA TGGAATCCAA ATAACGAAGC ATGGGAAATA 2640 

CTATATTTTG ACTGGAGAGT TGGATGATTT GCCGACAGAA CTTGAAGTGT TAGTTGAGTA 2700 

TAGTCCCCAA GAAAGACAAG AGTTGATTAC CTATCGCCTT CTGACTGAGA GTGGTTTTGT 2760 

CACCAATGAA GCATTGCAAG AGTGCACGAA AGTCAGTAAT GTAACTATTA TTCAGGATAT 2820 

TTCAGATATT GATAAGCGTC TTTTAGACTT TGATCfGAAA ATTGAACGAC AAAAAGGTTA 2880 

TCGGATTTCT GGTGATTCAG TTGGTAAGAG AAGATTTTTG GCTATTTTAC TGACAAACTG 2940 

TATCTCAGTA GCACATTTTT CAACCGGTAA TTTTGGGAGC TTTGATATTT TAGAAGCAGA 3000 

TAGAACTGGG CTGGCCAGTC AGATTGTTAA 7AAGCAACTG TCAGGTTTTC CAGATATGGA 3060 

TGCTAGGATG AAGATGTTTT TTGCGATCT7 GTTATCTCTT ATAGGTCAGG AGCAAAACAT 3120 

TGAAAATTCA CCTAATACTA GTAAGCAGGC TTTGGAAATT TCTCAAAAAA TTTTTCAAGC 3180 

TTACTCTAAG CAGACTGCAC AATTTTATAG TATTCAGGAA ATTATCTATT TTGCGAGCAT 3240 

CTTGGATCAA TTAATCATTA AACGTCAGGA CAATCCGCTC TTTACGGAGA AATTTGATGG 3 300 
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TGAATTTTTC TACAATATTT CAAATCTGAT TGATACGGTT TCCATGTATA CCAAGATTGA 3 3 60 

CTTTTTTAAG GACAAGGTTT TATTCAATTT TCTTTTCCAT CATATTCGGC TCAGTTTACG 3420 

CGTCCCTATC CTTTTTCAGG GTGAAAATTT GCCAGAATCT ATCCAGATTT TAGTTGAAAG 34 80 

GAATAAATTT CTTTATACAG TCATCAGTCT TTTAGTGAAT GATATTTTTC CGAAATATCT 3540 

TCATACAGAG TATGACTATG GCATGATTGC CCTACATTTT ATCTCTAGCT TAGGCCGTAG 3600 

7CCAGAGATT TATCCAGTCC GTGTTTTGCT TTTAACGGAT GAACGTCGCG TCACTAGAGA 3660 

TTTATTAGTC AGTAAAATTA AGAGTGTTGC TCCTTTTGTA GAGTTGATAG ATATTCAATC 3720 

TCTAGTAGAT TACCACAGTA TTGATCTCAC TCAGTATCAT TATATTTTAT CTACCAAGCC 3 7 80 

GCTGACTAAT CAGGAAATCG ATGTAATTTC TAGTTTTCCA ACCGTCAAAG AATTGCTTCA 3 840 

ATTACAGGAA CGACTTCAGT ATGTACAGGC ACATCGTACA ATTGTCGCGC GTGATGCTAT 3900 

CGCTCCAGAG AAAAGTTATG ACTTGCAAGA TTATTTAATA TCTAGTAGTC AGCTTTTGAG 3 960 

TCAATTCGAG TTGGTTCAAT TGGAGAATAA TCAATCATTT GAGCACACGG TAGAACAAAT 4020 

CATCCAATAT CAGAAGAATG TGAGTGACAG AGCTTACCTA ACAAGAAAAT TGTTATCTCA 4080 

CTTCCAGAAT AGTCCTATGG CTATTCCTAA TACTGGTCTG GTGCTTTTAC ATAGTCAGTC 414 0 

TAGCAAAGTA ACAACAAATA G TTT T ACTAT GTTTCAACTC AAACTACCTA TCTCCGCATT 4200 

GTCAATGAAA CGAGAGGAAG AAGAGGTCAA AACGTGTCTG CTAATGCTAA TGTCTAAAGA 4 260 

AGCTAGCGAG GAAGCTAGAG ATTTAATGAC AGCTATTAGT CAGTCGATTA TTGAAAATCA 4 320 

TCTTTATACA GAGATTTACA AGACGGGAAA TCAATCCATT ATTTATCAGA TGCTAAATAC 4 3 80 

TATTTTTAAC GAAAAAATTA AGAAATTGGA GAACTAATAT GAAACTTGAA AAACATTTGA 4440 

TTAAGCTTAA TAAACAATTT TCTAACAAGG AGGAAGCTAT TTCTTATTGT GGGCAAGTTC 4 500 

TTTATGAGGG TGGATATGTT AATGAAGACT ATATTGAAGC CATGATTGAG CGAGATAAAC 4 560 

AGCTATCTGT TTACATGGGT AACTTTATCC CCATACCGCA TGGAACAGAT GCAGCAAAAA 4 620 

ATGATGTCCT CAAGTCTGGT ATTACAGTCG TTCAAGTCCC TAGAGGGGTT GATTTTGGGA 4 680 

ATGTATCTAA CCCTCAAGTG GCAACGGTTC TTTTTGGTAT TGCTGGTATT GGTAATGAAC 4 7 40 

ACTTAGAAAT TATTCAGAAA ATTTCTATCT TCTGTGCAGA TGTAGATAAT GTTCTTAAAC 4 800 

TAGCAGATGC TCAGTCAAAA GAGGAAGTAT TGCGCTTATT TGATGCTGTT GAATAATTGA 4860 

ATTTAGTCAT TTGTCATCTA GTATATATGT CCCTCAAATA GGAAAAGGAG AAATTGAATG 4920 

AAACATTCTG TTCATTTTGG TGCCGGTAAT ATCGGTCGTG CTTTTATACC TGAAATTCTA 4 980 

TTTAAAAATG GTTTCCATAT TGATTTTGTG GATGTCAATA ATCAGATAAT TCATGCTCTG 5040 
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AATGAAAACG GCAAGTATGA AATTGAAATT GCACAGAAAG GACAGTCTCG TATAGAAGTA 5100 

ACTAATGTGG CTGGCATTAA 7AGCAAACAA CATCCTGAGC AAGTCATTGA AGCGATTCAA 5160 

AACACGGATA TTATTACTAC TGCAATCGGA CCTAATATAC TCCCTTTTAT CGCCGAACTT 5220 

CTAGCCAAAG GAATCGAAGC TCGCCGAGTT GCAGGAAATA CACAGGCATT GGATGTTATG 52 80 

GCCTGTGAAA ATATGATTGG CGGGTCTCAA TTTCTTTATC AAGAAGTCAA G AAA? ATT T A 5340 

AGTCCGGAAG GTTTGACATT TGCTGATAAC TACATAGGTT TTCCAAATGC TGCAGTAGAC 5400 

ACCATTGTTC CAGCACAAAG TCACGAAGAT TCCCTTTTTG TTGTGGTCGA GCCCTTTAAT 5460 

GAATGGGTCG TGGAAACCAA GCGTCTTAAA AATCCAGATT TACGTCTAAA AGATGTGCAT 5 520 

7ATCAAGAAG ATTTAGAACC CTTTATTGAG CGAAAACTTT TTTCAGTCAA TTCTGGACAT 5580 

GCAACTTCAG CTTACATTGG TGCGCATTAT GGTGCCAAGA CAATTTTGGA AGCTCTTCAA 5640 

AATCCTAATA TTAAATCTCG GATTGAATCT GTATTAGCTG AAATTCGGAG TCTCTTGATT 5700 

GCCAAATGGA ACTTTGATAA AAAAGAATTG GAGAATTATC ACAAAGTCAT TATAGAACGA 57 60 

CTTGAAAACC CTTTCATAGT GGACGAGGTT AGTCGCGTAG CTCGTACTCC AATCCGAAAA 5820 

TTAGGCTATA ATGAACGATT CATCCGGCCG ATACGTGAAT TGAAAGAACT CAGTTTGTCA 5880 

TATAAAAACC TACTTAAAAC AGTTGGCTAT GTCTTTGACT ATCGCGATGT AAATGATGAA 5940 

GAAAGTATTC GATTAGGTGA ATTGTTGGCT AAACAATCAG TCAAAGATGT TCTTATACAA 6000 

GTTACAGGTT TAGACGACCA AGAATTGATT GAGCAAATTG TAGACTATAT TTAATCTTTT 6060 

TCGAAAATCT CTTCAAATCA GGTTAGCATC GCTTTGTCTT AGGCATATGT TGTTCTATCT 6120 

ACAACCTCAA AGCAGTGCTT TGAGCTGACT CCGTCAGTCT TATCTGCAAT CTCAAAACAC 6180 

TGTTTGAGTT ATCTGCGGTA ATCTTTCTAG CTTGTCTT7G ATTTTTGTTG TTATTTATAA 6240 

GGTAAAAGAA GCTGGACAAA AAGTCTTCAA AATCGGGAAA AGGCAGCCTA TCGGGTGTTC 6300 

AAAAATCTTG ATAGGATGTC CTTTATTATG GAAAGCCTTA TTGGATTTTC TCCTCAGATT 6 3 60 

GAGTTTTTGA TCAGCTTTAT GAGATAGGTC TTGCTAGAGA TGTAGCCCAT CATGTTATTT 6420 

TTATGGACAG TCGGAAAATT GTTGAAAAAA ATAATGCCCA TCAATTCTTT AGTCGTCCAA 6480 

GAGAAGAACG AACCAAGCAA TTTTGGAACG AATTCTTTCG AATGCGATC? ATATAGTAAA 6540 

ATGAAACAAG AACAGGACAA ATCGATCAGG ACAGTCAAAT CGATTTCTAA AAATGTTTTA 6600 

GAAGTAGAGG TGTACTATTC TAGTTTCAAT CT ACT AT ATA ACTGAAAAAT TAGATAAATT 6660 

AGTTTTGGAA AATGACTAAC CAAAAGATAT CCAAAGTAGT CTAAAATTGT CTATACTTTA 6720 

TGAGTGTTTT AGTTAGGAAA AAGGCTTGTT GTCTATAATT GTCTGCATTA GTCTAGATTT 6780 

TATTTATACA AAATGTTATA AT AG ACTGT A TTTAAAAAAT TTTAAGGAGA AATGACAGAA 6840 
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TC Tf*Tn T ATP 


iTTrraa a a/** 


AAAGAAACAA ACCGTGGTGT 


CTTGACTTTC 


ACTATCTCTC 


6900 


AAGACCAAAT* 


LnnnL wnunA 


TTGGACCGTG 


TCTTCAAGTC 


AGTGAAGAAA 


TCTCTTAATG 


6960 


TTrrAHftTTT 

1 1 \~4\<I\J 111 


rrfT& A AI~^**T» 
1 AAnuVj i 


CACCTTCCAC 


GCCCTATCTT 


ft ^ A ft ft ft ft ft 

CGACCAAAAA 


TTTGGTGAAG 


7020 


A A f '"^T^f ft 
e\/\\3\, 1LU in 


TCAAGATGCA 


ATGAACGCAC 


TTTTGCCAAA 


CGCTTATGAA 


GCAGCTGTAA 


7080 


ftnunnuL I Liu 


TCTTGAAGTG 


GTTGCCCAAC 


CAAAAATTGA 


CGTAACTTCA 


ATGGAAAAAG 


7140 




GGTTATCACT 


GCTGAAGTCG 


TTACAAAACC 


TGAAGTAAAA 


TTGGGTGACT 


7200 


AL.AAAAACCT 


TGAAGTATCA 


GTTGATGTAG 


AAAAAGAAGT 


AACTGACGCT 


GATGTCGAAG 


7260 


AGI.G TAT LGA 


ACGCGAACGC 


AACAACCTGG 


CTGAATTGGT 


TATCAAGGAA 


GCTGCTGCTG 


7320 


AAAAL ijtjt- G A 


CACTG iTGTG 


ATCGACTTCG 


TTGGTTCTAT 


CGACGGTGTT 


GAATTTGACG 


7380 


C'WLr' A A & A O/* 
G i \jG AAAALvO 


TGAAAACTTC 


TCACTTGGAC 


TTGGTTCAGG 


TCAATTCATC 


CCTGGTTTCG 


7440 


A A^* Af*f* A A TT 

AAG At. t. AATT 


GGTAGGTCAC 


TCAGCTGGCG 


AAACCGTTGA 


TGTTATCGTA 


ACATTCCCAG 


7500 




AGt- AGAAG AC 


CTTGCAGGTA 


AAGAAGCTAA 


ATTCGTGACA 


ACT ATC CACG 


7560 


AAGTAAAAGC 


ft R Jh ft. % ft 

TAAAGAAGTT 


CCGGCTCTTG 


ACGATGAACT 


TGCAAAAGAC 


ATTGATGAAG 


7620 




ft J 'VIVI^^H^ ft A 

ACTTGCTGAC 


TTGAAAGAAA 


AATACAGCAA 


AGAATTGGCT 


GCTGCTAAAG 


7680 


AAGAAGL. i l A 


CAAAGATGCA 


GTTGAAGGTG 


CAGCAATTGA 


TACAGCTGTA 


GAAAATGCTG 


7740 


AAATC GT AGA 


ft AiMlAA ft y** ft ft 

ACTTCCAGAA 


GAAATGATCC 


ATGAAGAAGT 


TCACCGTTCA 


GTAAATGAAT 


7800 


TCCTTGGGAA 


TTTGCAACGT 


CAAGGGATCA 


ACCCTGACAT 


GTACTTCCAA 


ATCACTGGAA 


7860 


CTACTt-AAGA 


AGACCTTCAC 


AACCAATACC 


AAGCAGAAGC 


TGAGTCACGT 


ACTAAGACTA 


7920 


ACt- i 1\jTTAT 


CGAAGCAGTT 


GCCAAAGCTG 


AAGGATTTGA 


TGCTTCAGAA 


GAAGAAATCC 


7980 


A A A A & f & a /"^T 

AAAAAGAAGT 


TGAGCAATTG 


GCAGCAGACT 


ACAACATCCA 


AGTTGCACAA 


GTTCAAAACT 


8040 


TGCTTTCAGC 


TGACATGTTG 


AAACATGATA 


TCACTATCAA 


AAAAGCTGTT 


GAATTGATCA 


8100 


CAACCACAGC 


AACAGTAAAA 


TAATCTTAAT 


AAACAGAAAA 


CCCACCTGAA 


TTGGTGGGTT 


8160 


TTCTGATGCA 


CTATTTTCCA 


AAAATCTCTT 


TGAGGTCTGT 


CTCTGTAATC 


CCAATCATGG 


8220 


CTGGGATGCG 


GTCCCAGTTT 


TCTTCGGTTA 


GGATGTAGGA 


TTGTTCAGAG 


CCACTTGATC 


8280 


TGACTGTTTC 


AGAGACAGCT 


TGTTGCTTTT 


CTTCAACATT 


CTCCAGTAGA 


TCACTGAAGC 


8340 


GTTCAATCAG 


ATAGGTTTTT 


CGGGCAGTTC 


CGATGTGTTG 


GGTAGCATAG 


TCGAAGGCTT 


8400 


GTAATTCGCC 


TAGTAAGATG 


AGTTTGCTTT 


TGGCACGTGT 


AATGGCTGTG 


TAGATGAGAT 


8460 


TTCCCTCCAG 


CATACGTCGG 


CTAGCACTAG 


TAATCGGTAG 


GATGACAACT 


GGGAACTCAC 


8520 


TTCCCTGAGA 


CTTATGAATA 


CTCATGGCAT 


AGGCCAAGCG 


AATCTTGTAC 


CATTCGTTAC 


3580 
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GGGCGTAAGA 
TCTATTTACC 
CAGCATCGTT 
CAAAACTGAG 
CATCAATCCC 
TACCATTTCT 
CAATTTCAAA 
GTCGAATCTG 
CCAAGCGAGT 
TGACAGAAGG 
TGGAGAAGAG 
AGTCAGCATC 
GGCGATGTAT 
CAGTTGGAGC 
AAAGGGCATA 
CTGTCAGGAT 
CATACTCAAT 
CATGACTCTT 
CAGCGAAAAA 
CGATCAGGTA 
ACTCAAGGAG 
TTTCCATACA 
GACTTTCGAT 
TATCCTCAAC 
AGTCTTGAAT 
TCATCTCCGT 
AGAGTCCTGC 
CGCCATAGGT 
TTGAAAAGTA 
GCAGTTGTTC 



GACTTCATTA 
AGGAATCAGG 
AACCAAATGA 
TTGATCTTTT 
TGCCGTCCCT 
GAGGGCGGCA 
GTAGGAACGG 
ACTAGCTAGG 
CTGAGGAATC 
TAGCTGATCA 
TTGATTGGCC 
TAGGTAATCT 
GGTCGCGCTA 
AGCAAGAAGA 
AACAGCAATG 
AAAGACCTTA 
TCCCAGTTCT 
CTGTTTTCCT 
GAGGCTGTTG 
GGAGAGCTCT 
AGTAAGGGTT 
GGCCTGAAAA 
CCCTAGTTCC 
CAGCTGGTAG 
CTGAAAGGCT 
TCCGTAGTTG 
GATCCCTTCT 
ATCCACGATT 
CTTGACCAAG 
TCCATACTTG 



CCATCAAAAT 
TCTGTGATAG 
ATGACCCTGT 
TGTCGGGGAT 
CGGTACATAG 
CCTAAGATTT 
TCAGCTTTTT 
GTGACGATGG 
AAAGGAATAT 
CTGTCACCTA 
AGCCAAGTAT 
TCCAGATGAC 
GGCAAACCTG 
ATGCGCAGAT 
ATTCCATTGA 
TTCTGGATAG 
TGCTCGACAG 
TTTTCAAGGA 
TCAAAGATCT 
TGGGCAACTT 
TGTTCCAGCA 
AGACTGTGAA 
TCAGCTAGTT 
GGATAATTTT 
AGTTTGTTGG 
AGACGGAGAG 
AACTTTTCTG 
TTCTGAGCTG 
CCCTTACTAG 
GAGTGCTGGA 



656 
CAATGACAAT 



CTCCTAAATC 
CTCTCTTACG 
TGAGCAGGTC 
GAGCCAGAAC 
TTTCAATGGT 
TTTGGGTGAA 
TTGATTCTTT 
GAAGTAGATC 
CGATGAGGAT 
CTACCATAGA 
TGGTATCATC 
TCAATTCATT 
TGCTTTTCTT 
TAACAGTTGT 
CATCACAGAT 
TAGTGATATG 
TACGAACCAA 
TGGTATCAAT 
GGCTGGGGTC 
AATCCCGTGC 
CTAGACCGGC 
GGTCAGCAAT 
CAACCACATC 
GAATGCCGTA 
TGGAGACGAA 
GGTGTTGCAA 
TCTTGAGACC 
TTGGTTTTGC 
CAATTTGCCC 



CTCGTCTTGT 
CCCATTAAAG 
ATAGTGACAC 
TTGCATGAGC 
TTGGATATCA 
GGCAGGAATA 
ATCAGCTGGC 
GCTTTGTCCA 
CGCTAGAACC 
CTTACTGTTA 
GAATTCATCC 
GTCACCTGTC 
CATGCGACGA 
CCTGAAGTCA 
CTTACCAGTA 
AGCCTGTTTT 
TTTTTGAATG 
GTGACTGCGG 
CTGCTGAACC 
TAGTTCCACG 
TTCAACATAG 
GCGGAAGCGT 
GCTAAAGCCC 
AAGGGTTTCT 
GTTGGCTAGT 
AGCCTCGCGA 
AATTTCGTCA 
AATCCCCTTG 
GCGATCATAA 
CCAAAAAGTA 



TTCGATTCGG 
ACATTGATTT 
TGAGGAGCTT 
TGATTGATAG 
CGGGCGGGAA 
TGGCCACTAG 
AAGATGCCCT 
TAAATTTTTT 
TGTCCAGGAC 
GAAGAGATAT 
ACGATGATAA 
ATTCCCAAGT 
GCAGCTCGAC 
AGTCCTTCTA 
CCAGGCCCAC 
TGAATGTTAT 
GTTTCTAAAT 
ATGCCTTCCT 
TTGTCTTCTT 
GGACGGGAAG 
CTGTCCCCTG 
TCAGGAGCCT 
AAACCCTTGA 
TCCTTGTAAA 
TTGGCCAAAA 
TTTTTGGCAG 
ATGGTATTTT 
AAATGGCTAC 
CGACTGATTT 
TAGTCTTCGC 



8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
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CCTCAATTAC ATCAGCCATG GTTCCTGTGA CAATGATTTC AAAATCATCA AAATCCTCTG 10440 

CGTCCGTATC GTCGATTTCT AGGAGGAGGA TGCGATAAAA ATTGCTGGGA TTTTCAAAAA 10500 

TAATCCGTTC AATAGTTCCT GAAAAATAAA CTTCCATAAA ATTCCTTTGC ATGAATAGGT 10560 

GAGACTTGGG ATTGTTTTTA TTTTATACTC TTCGAAAATA TCTTCAAACC ACGTCAGCTT 10620 

CCATCTGCAA CC7CAAAACA GTATTTTGAG CTGACTTCGT CAGTTCTATC CACAACCTCA 10680 

AAACACTGTT TTAAGCAGCC TACGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTCATTGAG 10740 

TATTTGTAAA TAAACAATCA CTTCTCACGA TAGAAGAAGA GGCTGAGATT GGTGATTCTC 10800 

TGCCTCTTAG GTTTCTTAAA ATCTTCCGAT ACGGGTGATT GGCCATAAGC GGAATTTAGC 108 60 

TTCCCCTGTG ATATCTTTTG CTTTGAAGGT ACCTACGTGG CGGCTGTCGC TCGAAACCAA 10920 

GCGGTCATCT CCGAGGAGAA GGTATTCTCC TTCTGGAACA GTAAACCTAA AGTTGGTGTT 10980 

GTAGTTGACA TCAACTGTGA AGGCTTGAGC TTTTTGAGCG ATACTTCTAA AGAAAGTTCC 11040 

TTTATTTCCT TCAAAGCCCT TGCCTGAGTA AGTGCTTTGG AGTTTGTCAT CCTTGAAGCG 11100 

TTTGATATAG TCTGCTAGAT AAGGCTCGTC CGTTTCTTTG TCATTGATGT AGAGTTTATC 11160 

ATTTTCGTAA CGAATGGTGT CGCCAGGCAT TCCAATCACG CGCTTGACGA TGTCCTTATT 11220 

GCCATCTTCC TCATGGGCCA CCACGATATC AAAACGGTCA ATAGGAAGGT CTTTTACAAC 112 80 

GAAGAGAATT TCCCCATCCG CTAGGGTCGG ATCCATGGAA TCTCCTTCTA CGCGAACATT 11340 

GCTCCAAAAA AAGATACGAC TTAAAGCTAG TAATGACAGA ATTAGGAGGA ACAATCCCCA 11400 

CTCTTTTAAG AAATTTTTAA ATGAATTCAT AACTTACCTT TCTAAGCGTT TTTTCGCTTT 11460 

TTCAGTCTTT TTAAAGTGCA ATTTGGCGCA GAAGCTGAGT CCCTGCATAC CATAGCCTTG 11520 

CAAAATCTGG CTAGCCACCT TGTCAGAAGC CGTTCCAGCT CCACTTGGGA GCTGATAACC 11580 

CAGTTCTCGT CCCAAATTTT CAAGATTTTC CAGAAAGAGA TCACGCGCAA TGACAGAAGA 11640 

AACTGCGACA GACAAGTATT TGCCCTCAGC CTTTTCTTCT AAGCTGATAC CATTCCTCAA 117 00 

ACGATTGGCC TCTTGTCCCA ACTACTTGTC ATAATTTTTA GCACTGGTAA AGGCATCAAT 117 60 

CACAATTTTC TCAGGCTGAA CACCTTTTTG AAGGAGGAGA TAGATAGCCT CATTATGGAG 11820 

GGCAACCTTA ACCGAAACAG CGTTGTAGCG GTCTCCGATG ACCTCGTTCT ACTTGCTGGG 11880 

TGAGAGAAGG AGTGCCTGGT GCTGAATTTT TTCCTTGAGA ATAGGAGTAA TCTGACGGAT 119 40 

CTTTTGGTCG GTCAGAGTCT TAGAATCCCC CACACCCAGT TTTCGTAAAA AGTCGTGCTG 12000 

GTCAGG7CTG ACAAAGGCAG CCACAACTGC AAGCCCACCA AAGTAGGAAC CATTTCCCAC 12060 

CTCATCTGTC CCAATTAAAG GAAGATTTTG TCCGCTGGTT TGCTCTACAG CTTGATAGCC 12120 
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AAACAAACTG 


GCGTATTTTT 


CAGCCCCTTC 


ACCCTGAAGC 


AAGATTTTTC 


CAGAAGTATA 


12180 


GATAGAAACC 


GTTGCTTGAG 


GTAGTTTCAA 


AAAGTAGCGG 


ATATAGGGAT 


TCTTGCTAGC 


12240 


ACCCACACTG 


GTTTGATAGT 


GTTCAAGAAA 


AGCCTGAATA 


TCCTTTTCCC 


TTGGTGTGAG 


12300 


TGTGATACTT 


GCCATAGTTT 


CTATTGTACC 


ACAAAAGCAG 


TAAAATTTGT 


AAAAACTGAC 


12360 


AAAATTAGCG 


AATTTTGGTA 


TAATATCGTG 


AGGTGAATTT 


TATGGCAAAT 


CTAAATCGAT 


12420 


TCAAATTTAC 


ATTCGGGAAA 


AAATCGTTAA 


CCTTGACAAG 


CGAACATGAC 


AACCTTTTTA 


12480 


TGGAGGAAAT 


CGCTAAGGTT 


GCGACAGAAA 


AATACCAAGC 


AATTAAAGAA 


CAAATGCCTA 


12540 


GCGCAGATGA 


TGAAACAATC 


GCTCTTTTGT 


TGGCAGTCAA 


CTGTTTATCA 


ACTCAGCTCA 


12600 


GCCCTCACAT 


TGAATTTGAC 


GATAAGGAGC 


AAGAGCTAGA 


AGAACTCCCT 


CACAAGCTTG 


12660 


TGACTTGTAA 


GCAAGAACAG 


AGCAAGATTG 


AGGATTCCTT 


ATGATTTCAT 


TCCTTCTTCT 


12720 


ATTGGTCTTG 


GTTTGGGGAT 


TTTATATCGG 


CTATCGGAGA 


GGCCTGCTCT 


TACAGGTTTA 


12780 


TTACCTGATT 


TCAGCCATGG 


CATCGGCTTT 


TATGGCTGGC 


CAGTTTTATA 


AGGGGCTTGG 


12840 


AGAGCAATTC 


CATTTATTGC 


TCCCTTATGC 


AAATTCGCAG 


GAACGTCAGC 


GGACTTTCTT 


12900 


TTTCCCATCG 


GATCAACTCT 


TTCAGCTGGA 


TAAGGTCTTT 


TATGCAGCTA 


TCGGCTACTT 


12960 


GCTTGTATTT 


GGGATTGTCT 


ATAGCATTGG 


TCGTTTACTT 


GGTCTTCTCT 


TACACTTGAT 


13020 


TCCTAGCAAA 


AAACTGGGTG 


GTAAGTTGTT 


CCAAGTTTCA 


GCAGGTATCT 


TGTCCATGTT 


13080 


GGTGACCTTA 


TTTCTCTTCC 


AAATGCCCTT 


GACAATCTTG 


GCGACCATCC 


CCATGGCAGT 


13140 


TATACAAAAT 


CCTCTTGAAA 


AGAGTATCGT 


CGCAAAACAC 


ATCATCCAGA 


GCATACCGGT 


13200 


AACAACCAGT 


TGGCTCAAAC 


AAATCTGGGT 


GACAAATTTA 


ATCGGATAAA 


AAGGGCAGGA 


13260 


GTTTTCCTAG 


CCCTTTGTTT 


ACAGATTTGA 


CTCGAATCTA 


TCAGAATGTA 


AAAAGCTACC 


13320 


ACACCTAGAC 


ATTCAAAGAC 


AAGGAAATAA 


AGATGAATAA 


GAAAATATTA 


GAAACATTAG 


13380 


AGTTCGATAA 


GGTCAACGCC 


TTGTTTGAGC 


CTCATTTGTT 


GACCGAGCAG 


GGCTTGGAGC 


13440 


AATTGAGACA 


ACTGGCTCCG 


ACTGCCAAAG 


CAGATAAAAT 


CAAACAGGCT 


TTTGCTGAGA 


13500 


TGAAGGAAAT 


GCAGGCTCTT 


TTCGTCGAGC 


AACCGCATTT 


TACTATTCTC 


TCAACTAAGG 


i.3560 


AAATTGCAGG 


AGTCTGCAAG 


AGGTTGGAGA 


TGGGAGCGGA 


TCTCAATATC 


GAGGAGTTCC 


13620 


TACTCTTGAA 


ACGCGTGCTT 


CTTGCCAGCC 


GAGAACTTCA 


AAATTTTTAC 


ACCAATCTGG 


13680 


AAAATGTCAG 


CTTGGAAGAA 


TTAGCCCTTT 


GCTTTGAGAA 


ATTACATGAT 


TTTCCGCAAT 


13740 


7ACAAGGAAA 


TCTTCAGGCC 


TTTAATGATG 


CGGGTTTCAT 


TGAAAATTTT 


GCCAGTGAAG 


13800 


AATTGGCGCG 


AATCCGTCGA 


AAAATACATG 


AT AG CG AG AG 


TCAGGTACGC 


GATGTTTTAC 


13860 


AAGACT7CCT 


CAAGCAAAAA 


GCGCAGCTGT 


TGACGGAAGG 


AATTGTTGCT 


AGC AG AAATG 


13920 
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CCCGTCAGGT TTTACCACTC AAAAACACCT ACCGCAATAA GATTGCAGGT GTCGTTCATG L3980 

ATATTTCTGC TAGTGGAAAC ACCGTCTATA TCGAACCCCG TGAGGTACTC AAACTGAGCG 14040 

AAGAAATTGC TAGTCTGCGA GC AG ATGAGC CCTATGAAAT GCTTCGCATT CTCCAAGAAA 14100 

TTTCTGAGCG TGTCCGCCCT CATGCGGCTG AGATTGCTAA TGACGCTTGG ATTATCGCTC 14160 

ATCTGCACTT GATTCGTGCC AAGGTTCGAT TTATCCAAGA AAGACAAGCA GTCGTGCCTC 14220 

AGCTGTCAGA AAATCAAGAG ATTCAACTGC TCCATGTCTG CCATCCTTTG GTCAAAAATG 142 80 

CCGTCGCAAA TGATGTCTAT TTTGGTCAAG ATTTAACAGC TATTGTCATT ACAGGTCCCA 143 40 

ATACAGGTGG GAAGACCATC ATGCTCAAAA CTCTGGGCTT GACACAGGTC ATGGCCCAGT 14400 

CAGGATTGCC GATTTTAGCA GACAAGGGAA GTCCTGTTGG TATTTTTGAA GAAATCTTTG 144 60 

CTGATATTGG AGATGAGCAG TCTATTGAGC AGAGCTTGTC TACCTTCTCT AGTCATATCA 14 520 

CCAATATCGT GGATATTCTT GGCAAGGTCA ACCAACATTC ACTCTTACTT TTGGATGAGT 14580 

TCCGGGCTGG TACTGATCCC CAAGAGGGAG CAGCCCTTGC CATGGCTATT CTGGAGGACC 14640 

TTCGCCTGCG TCAAATCAAG ACCATGGCGA CCACCCACTA TCCAGAACTC AAGGCCTACG 147 00 

CTATTGAGAC AGCCTTTGTG CAAAATGCCA GTATGGAGTT TGATACTGCA ACTCTTCGCC 14760 

CGACCTATCG CTTTATGCAG GGTGTTCCTG GCCGAAGTAA TCCCTTTGAA ATTGCCAAAC 14 820 

GTCTAGGCCT ATCTGAAGTT ATCGTAGGAG ATGCCACTCA GCAGATCGAT CAGGACAATG 14 880 

ACGTCAATCG TATCATTGAG CAATTAGAAG AGCAGACGCT GGAAAGCCGC AAACCTTTGC 14940 

ACAATATCCG TGAGGTGGAG CAAGAAAATC TCAAGATGAA CCGTGCGCTA AAAAAACTCT 15000 

ACAACGAGC? TAATCGTGAA AAGGAAACCG ACCTTAACAA GGCGCGTGAA CAGGCTGCTG 15060 

AGATTGTGGA TATGGCCCTA AGTGAAAGTG ACCAGATTCT CAAAAATCTC C AC ACT AAA T 15120 

CCCAACTCAA GCCCCACGAA ATCATTGAAG CCAAGGCCAA GTTGAAAAAA TTGGCTCCTG 15180 

AAAAAGTGGA CTTGTCTAAA AATAAGGTCC TTCAAAAGGC CAACAAAAAA CGAGCTCCAA 15240 

AGGTGGGAGA TGATATCGTG GTTCTCAGTT ATGGTCAGCG TGGTACCTTG ACCAGTGAAC 15 300 

TCAAGGACGG TCGCTGGGAA GCCCAAGTTG GCTTGATTAA GATGACCTTG GAAGAGAAAG 153 60 

AGTTTGATCT TGTTCAAGCC CAGCAAGAAA AACCAGTCAA GAAGAAACAG GTCAATGTTG 15420 

TGAAACGAAC TTCTGGGCGA GGACCTCAAG CTAGACTGGA TCTTCGAGGC AAGCGCTATG 15480 

AAGAAGCCAT GAATGACCTA GATACCTTCA TCCACCAAGC CTTCCTTAAC AATATGGCTC 15 540 

AACTTGATAT CATCCATGGT ATCGGAACAG GAGTCATCCG TGAAGGAGTT ACCAAATACT 15600 

TGCAAAGAAA CAAACATGTC AAGAGTTTCG GCTATGCCCC ACAAAATGCT GGAGGCAGTG .15660 
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GTGCGACTAT 


TGTCACTTTT 


AAAGGATAGC 


AGTATTCTGG 


ACTTTATAAA 


GTAAAAACTG 


15720 


TTGAACTAAT 


TTTTACTAAT 


AAACACATTG 


ACAAAAGCCA 


ACATTTTTTG 


TAAAATTAGA 


15780 


ATCAATTAAA 


TACCAACACC 


GAATGAAGTT 


TAATAGAAGT 


GGGGAATCGT 


TTGATTTTCC 


15840 


ATGACTGTAA 


ATGGACGGAA 


CTCTGGAGAG 


ACCGTAAAGG 


CACCCAAGGG 


CAAGGCAGGC 


15900 


AACTGCTCAA 


ACTCTCAGGT 


AAAAGGACAG 


AGCTAGGATA 


GXCCGCTTTT 


TAGCATTTAT 


15960 


CTAAGCATTC 


CAGAGTACAT 


GTATCTTGCA 


TGTGCTCTTT 


CTTTTGGGGT 


TGAAACGATA 


16020 


GGAGAAGGAA 


ATGTTAGAAT 


TGCTTAAATC 


AATCGATGCT 


TTTGCTTGGG 


GACCGCCCCT 


16080 


CTTGATTTTA 


TTGGTCGGAA 


CAGGGATTTA 


CCTAACTATT 


CGGCTAGGAC 


TCTTGCAGGT 


16140 


TTTCCCTCTA 


CCCAAGCCCT 


TTCAGCTTAT 


TTTTATCCAG 


GATAAGGGAC 


ATGGTGATGT 


16200 


ATCCAGTTTT 


GCAGCTCTGT 


GTACAGCCTT 


GGCATCAACT 


GTTGGAACAG 


GAAATATCAT 


16260 


AGGAGTTGCG 


ACGGCTATCA 


AGGTTGGTGG 


ACCAGGAGCT 


CTATTTTGCA 


TGTGCATGGC 


16320 


GGCTTTCTTT 


GGAATGGCTA 


CCAAGTATGC 


GGAAGGACTC 


TTCGCCATCA 


AATACCGCAC 


16380 


CAAGGACGAC 


CATGGTGCAG 


TAGCGGGAGG 


TCCCATGCAT 


TATATCCTTC 


TAGGGATGGG 


16440 


AGAAAAGTGG 


CGACCACTTG 


CTGTTTTGTT 


TGCAGTAGCA 


GGAGTATTGG 


TTCCTCTCTT 


16500 


GGGAATCGGA 


ACCTTCACCC 


AAGTCAACTC 


GATTGCAGAA 


TCTATCCAAA 


ATACAACGAC 


16560 


GATTTCGCCA 


GCCATCACAG 


CTCTCGTCTT 


GTCTGTCTTT 


GTAGCGATTG 


CACTCTTTGG 


16620 


TGGACTCAAG 


TCTATTTCTA 


AGGTTTCAAC 


TACTGTTGTT 


CCTTTTATGG 


CCATCATTTA 


16680 


TATCTTAGGA 


ACTCTTACAG 


TTATTTTCTT 


TAATATCGGA 


AAAATCCCTG 


GCACAATCGC 


16740 


TTTAGTCTTT 


ACCTCAGCTT 


TTAGTCCCCT 


TCCTCCGGTA 


GGTGGATTTG 


CTGCTGCTAG 


16800 


CGTTCGGATG 


GCTATTCAAA 


ATGGTGTGGC 


GCGTGGTGTG 


TTCTCAAACG 


AATCTGGTCT 


16860 


GGGTTCTGCT 


CCTATTGCAG 


CTGCAGCTGC 


CAAGACAAAT 


GAACCAGTAG 


AGCAAGGTTT 


16920 


GATTTCCATG 


ACAGGAACCT 


TTATTGATAC 


CCTCATCATT 


TCTACTCTAA 


CTGGTTTGAC 


16980 


CATCTTGGTA 


ACTGC 










16995 


(2) INFORMATION FOR SEQ ID NO: 83: 









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28473 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 
{ D ) TOPOLOGY: linear 



(xii SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
CCCGGCCTTT TCTACTATAA TAGAGATACG TTTTGAAAGT AGGAGGTATC TATGGACTTA 
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ACTAAGCGC T 


TTa i\ T* a a ap a 


nvy ?\ f a t a a & 
g it aga i >w 


/Ml W/vMj lit 




TCAr.TTTGAC 


120 


C AGGCT ATTT 


hp attpp 
cggag a 11ll 


t»PPP PT PTTP. 

*GGGG 1 i ly 


pp.tttp & rnr 


Tfir:cinci aac c 


TGATTTTACA 
i un i k i k n\>n 


180 


ACGCCAGACC 


ATGTCAAGGA 


GGCGGGCAAG 


PP a PPP a 
GGAGCGATTG 


A 1 \*ALiAAl-\./\ 


rtiLLl Inl 


240 


ACAGGGATGA 


GTGGTCTGCT 


GACTCTACGT 


CAGGCAGCCA 


G 1 GAG I i 1\m I 


TA irj" A A A RP 


JwU 


TACCAACTGC 


ACTATGCTCC 


Ftl^ * ft. A ft ^/** ft. A 

TGAAAATGAA 


ATCTTGG1 iA 






360 


TTATCTGCGA 


CTTTGACGGC 


TATTTTGGAA 


« ft ft ft 

G AGGG AG AC A 


aggtactttt 


PTP&PTTrPT 
Wh^auL ILL1 


4 20 


GCTTATCCAG 


GCTATGAACC 


GATTG 1 I AAC 


mm a/ , *T»TPPPP 

TTAGTT\jGGv> 




L unvin 1 l l 


480 


ACGACTGAAA 


ATGGTTTTGT 


CTTGACTCCT 


pap ATPTTP/1 
GAGA 1 vj I I Wj 




TTTr.fi acp Art 


540 


GGTGATAAGC 


TCAAGGCGGT 


TATTCTCAAC 


i»a T*f*r~ & cr~c a 
TA TCCAGCC A 


A 1 LvunLnuu 


AATTAPPTAP 


600 


AGTCGAGAGC 


AGTTAGAGGC 


CTTGGCAGCT 


G I I TTACGC A 


ML» I L 




660 


TGTGATGAGG 


TTTACTCAGA 


ATTGACCTAC 


AL AGGCU AAU 


LLA 1 \J 1 1 \. 1 


PT A flCl AA PP. A 


720 


TGTTGAGAGA 


CCAGGCTATT 


ATTATCAATG 


GTTTGTCT AA 


ATGGGA rvjGG 


ATPAPAPPTT 


7 80 


GGCGTTTGGG 


GCTGATTTTC 


GCTCCTGCGA 


^wrr & p app 

CCTTCACAUC 


AVj 1 1/U\l\. 


AAP,A(tTPAPP 


84 0 


AGTACTTGGT 


CACTGCCGCA 


AATACCATGG 


CGC AAC AT G G 




f P PT TP. AP P-P. 


900 


CTGGTAAAAA 


CGATGCGGAC 


CCATGAAGAA 


/■^ ft ft ft m ft 

GGAATATAT C 




APT AT ATP AT 


960 


CGAAAAAATG 


ACTGCTCTTG 


GTTTTGAGAT 


ft A «^/^ ft ft k a 

TATCAAACCA 


CiAGGG 1GGU I 




1020 


TGCTAAAATT 


CCAGCGGGCT 


A » ft #f*^ ft ft ^ ft 

ACAATCAAGA 


CTCCTTTGCT 


I 1 L \_ 1 \jr\J\\j<J 


A TTTTP.PTP A 


1080 


GAAGAAGGCC 


GTTGCCTTTA 


TCCCTGGTGC 


AGCCTTTGGA 


priyT a frCTT" 
1 1 AWuuuVj 


AAnPP m APPT 


1140 


CCGCCTATCT 


TATGCAGCCA 


GCATGGAGAC 


n & ms* \ a a ^* a a 
TATCAAAGAA 


A TP A A A f* 


PAPTTfiAHP-A 


1200 


GTACATGAGA 


GAAGCATGAT 


T C AGTCT ATC 


app aptp a ap 




TTAPAATCGC 


1260 


AATTTTCGTG 


& & H ^ ft ft 

AGGATGACAA 


GCTI-UTI.AAA 


a 1 M P*^T & P A P 




CAAACGCATG 


1320 


TTTTTTCTCA 


AACACGCTGG 


TCACTCTAAC 


CTCGCGCCTG 


TTATTCAGCC 


CTTGGTGCTG 


1380 


GCACGATTTC 


TCTTGCGAAT 


CAATGATGAC 


GGACTCACTT 


ACATCGAAGA 


CTATCATGAG 


1440 


GTCATGACTT 


TTCCCAAGAT 


TAATAGTGAC 


CTCTTTGTCA 


TGGCCTATGC 


GACCTATGTG 


1500 


GCAGCTCTTG 


CAGATGCTAG 


TTTGCAGGAC 


AATCAGCAGG 


ATGCTCCCTT 


GTTTGCTTTT 


1560 


TTGCAAAAGA 


CTTTGGAGTT 


GATGGAAGCA 


GGCTTGGATT 


ATCAGGTTTT 


GACCAATATT 


1620 


TTTGAAATTC 


AAATTTTGAC 


TCGATTTGGA 


ATCAGCCTCA 


ATTTTAATGA 


GTGTGTCTTC 


1680 


TGCCATCGGG 


TTGGTCAGGC 


TTTTGACTTT 


TCTTTCAAAT 


ATGGAGCCTG 


CCTCTGTCCA 


1740 


GAGCATTATC 


ATGAGGATAA 


GAGACGTTGT 


CATCTCAATC 


CCAATATCCC 


CTATCTGCTC 


1800 
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AATCAATTTC 
AAGCAAGAGC 
AAATCAAAGA 
AAATGAAAAA 
AGGGTGTCAA 
AAGCTAAAAT 
AGAACATTGA 
TGGTATTGGC 
ATACAGGTCC 
GTCCTGGACT 
TTGGTGCCAA 
TCTATGCTAA 
C AG AG ACT AG 
AAAGTTTGAA 
TTGTTGTGGC 
TGGGAATCAT 
GTGCCCTCCT 
TTGGTGGAGC 
ATGCCAACGC 
TTGCCCAGAC 
TGACCGTATT 
CTTGAGTCTG 
TCTGGAAGAT 
AGGAGATGTG 
TAGATGGTTT 
TTATCAAGGG 
CAAGCCGCTA 
CCTTGAGAA? 
TCATCGTAAA 
AGGATTCTAG 



AACCTATTGA TTTTGAGACT 
TACGCCAATT TATGGATCAA 
AATTTATTGA TTCCCTAGCA 
AATCGCAGTA GATGCCATGG 
TCAAGCCCTA TCTGACTTTT 
CAAGCAATAT CTGACAGCGA 
TTCGGATGAT GAACCTACGA 
AGCCAAGGCT GTCAAAGATG 
CTTG7TGGCA GCAGGATTCT 
CATGTCTACC TTGCCTACCG 
TGCAGAAAAT ACAGCCCAGC 
AAATGTCCGT GGCATTGCGC 
CAAGGGCGAC CCGCTTCGTA 
CTTTATCGCA AACGTGGAAG 
AGATCGTTTC ACGGGAAACG 
GGGCTTGCTC AAGACAGCTA 
TCTCAAGGAC AGCCTCAGTG 
GGTCTTGTTT GGTGTTAAGG 
TGTTTATAGT A CG ATTCGTC 
TGCGCGTGAA TTTTCAGGAG 
GTGACCATTA TCCAAGAGCG 
AAAGACGATT TGGATGCGGA 
GAATTTAGTA TCGAAATCAG 
GTTAAAATCA TTCAAGGAAA 
TTAGAAATGA GAAATATCGG 
CTCATGAAGA TATATCCTCA 
ACAGTTGAAG ATAAAGCCTG 
ATCTTTGCCA AACTAAAAAC 
CGCTTCGGAT TACGAATGAA 
TTTTGCAGGA AGTCTAATAG 
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TTGGAGACCA TTTCCCTCAA 
TTATATGAAG AGTACGTTGG 
GACTGGGGAC AATTACTAAA 
GGGGCGATTA CGCACCTCAG 
CAGATATCGA GGTTCAACTT 
CAGAGCGCGT CAGCATTATC 
GAGCTATTCG GAATAAGAAA 
GTGAAGCAGA CGCTGTCCTT 
TCATCGTGGG TCGTATCAAG 
TTGATGGAAA AGGTTTTGAC 
ACCTCCATCA ATATGCCGTT 
AACCACGCGT TGGTTTGCTC 
AGGAAACTTA TGAATTACTG 
CCCGTGATTT GATGAATGGC 
CTGTGCTCAA ATCCATCGAA 
TTACAGGTCG TCGTCTTCGA 
GTTTGAAAAA ACAGCTCAAT 
CACCTGTTGT CAAGACTCAT 
AGATCCGTAC CATGCTAGAA 
AATAAAACAG ATGACAGAAA 
ACAGGGAGAG GACTTTGTCG 
TTCTGTTGAC TTGATGGAGT 
CGATGAAGAA ATTGACCAAC 
ATAGCAATCG GAGTTCCAAG 
ACAAGC7CCT AAAATCTTGG 
AGCACAAACT CCACGTAAAT 
TAATCATGCG CTATCTAAGG 
GTTTAAAATC TTTTCAACAA 
TTTGATTGCT GGTATTATCA 
TAAAAAAGTG ATTAGAAAAC 



GCCTGGAATC 1860 

GATTCACCTA 1920 

AGAGGAAAAG 1980 

GCCATTGTTG 2040 

TACGGAGATG 2100 

CATACGGATG 2160 

AATGCCAGTA 2220 

TCGGCTGGGA 2280 

AATATCGACC 2 340 

ATGCTAGACC 2400 

CTAGGTTCCT 24 60 

AACAACGGAA 2520 

GCGGCTGATG 2580 

CTTCCACATC 2640 

GGGACAGCTA 2 700 

GCGAAACTAG 27 60 

TATTCAGATC 2820 

GGCTCAAGCG 2880 

ACAGACGTGG 2940 

AAGAAATTTT 3 000 

TGACAGAATC 3060 

TTATCTTGAC 3120 

TCCAAAACCr 3180 

TCAACGGAAG 3240 

CTGACAGTGC 3 300 

CCAGCAAACT 33 60 

AGATAAGCAA 3420 

CCTATCGAAA 3480 

ATCATGAACT 3540 

ATCTTTTTT A 3 600 
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AAAATAGAGA 


TGATTTTGAA 


ACAAAAAACC 


TAATTCAAGA 


CGTTTCGATG 


CCAATTCAAG 


3660 


ATTTCGATCA 


AAAAAATTAA 


TAGATACTGT 


TATACTAAAC 


TTGTCAAGTT TGTAACAAGA 


3720 


CAAATATTAA 


AAATAAAAAA 


GAGGTATTCG 


TTATGAATAC 


AAAAACGATG 


TCACAATTTG 


3780 


AAATTATGGA 


TACTGAGATG 


CTTGCTTGCG 


TTGAAGGTGG 


CGGATGCAAT 


TGGGGAGATT 


3840 


TTGCCAAAGC 


AGGTGTTGGA 


GGAGGAGCAG 


CACGAGGTCT 


TCAGCTAGGA 


ATTAAAACAA 


3900 


GAACATGGCA 


AGGTGCAGCA 


ACTGGTGCTG 


TGGGAGGAGC 


TATACTTGGA 


GGTGTGGCCT 


3960 


ATGCAGCGAC 


ATGTTGGTGG 


TAATTATGGA 


TTTTAAAAGT 


TTTATTATTG 


GTTTAGTAGT 


4020 


TGGTATATTT 


GCTCCTTATA 


TGGATGATTT 


AATTAGAAAA 


AAATTTTTAA 


AGTCTTCGGA 


4080 


GAAGAAAACA 


GAAAAATCTG 


TTAAAAAATA 


ATCAAAACTA 


TAAATGATGA 


ATCTGAATCA 


4140 


AAATTATTTT 


GCGCATCTAA 


AGAGGAGTCT 


TATAGTAACG 


AGTCAAAAAA 


GGAGTAACTA 


4 200 


TGAATCGTAA 


TTTAGAACGG 


TGTTATCTAT 


TCTGACTACG 


AATAGATCAT 


ACCAGAGGTA 


4260 


GCTTAGAAAT 


AGCAGAGACA 


TTAGAAATTG 


AAGTAATAAA 


TAGGATGTCG 


TAAGTGTTAC 


4320 


TATCAATGAT 


TTATTTGTTT 


CAAGCTTGCC 


TAGGGTGACA 


GTAAAAAATC 


AATTTCCTTT 


4380 


CAATAGCATA 


TTTTTAGTGG 


GCAGGACTCT 


TGTTCTGCCT 


ATTTTTTTA? 


CCAAAAAGTC 


4440 


CAGTTGGGAG 


GGAGATAGGC 


TCATTTGGGA 


AGGAAGTCCA 


GTTTTTGTTT 


AGTGATTGGG 


4500 


GTAAGATAGT 


TGTTATCAGA 


TGAGTTAATA 


CTCTTCGAAA 


ATCAAATTCA 


AACCACGTCA 


4560 


ACGTCGCCTT 


GCCGTATATA 


TGTGACTGAC 


TTCGTCAGTC 


CTATCTACAA 


CCTCAAAACA 


4620 


GTCTTTTGAG 


CAGCCTACGG 


CTAGTTTCCT 


AGTTTGCTCT 


TTGATTTTCA 


TTGAGTATTA 


4680 


GGGAAAAGG A 


GATGAATATG 


AAATTTGGGA 


AACGTCATTA 


TCGTCCGCAG 


GTGGATCAGA 


4740 


TGGACTGCGG 


TGTAGCTTCA 


TTAGCCA7GG 


TTTTTGGCTA 


CTATGGTACT 


TATTATTTTT 


4800 


TGGCTCACTT 


GCGAGAATTG 


GCTAAGACGA 


CCATGGATGG 


GACGACGGCT 


TTGGGCTTGG 


4860 


TCAAGGTGGC 


AGAGGAGATT 


GGTTTTGAGA 


CGCGAGCCAT 


TAAGGCAGAT 


ATCACGCTTT 


4920 


TTCACTTCCC 


GGATTTAACT 


TTTCCTTTTG 


TTGCCCATGT 


GCTTAAGGAA 


GGGAAATTGC 


4980 


TCCACTACTA 


TGTGGTCACT 


GGGCAGGATA 


AGGATAGCAT 


TCATATTGCC 


GATCCAGATC 


5040 


CCGGGGTGAA 


GTTGACTAAA 


CTGCCACGTG 


AGCGTTTTGA 


GGAAGAATCC 


ACAGGAGTGA 


5100 


CTCTTTTTAT 


GGCACCTAGT 


CCAGACTATA 


AGCCTCATAA 


GGAACAAAAA 


AATGGTCTGC 


5160 


TCTCTTTTAT 


CCCTATATTA 


GTGAAGCAGC 


GTGGCTTGAT 


TGCCAATATC 


GTTTTGGCAA 


5220 


CACTCTTGGT 


AACCGTGATT 


AACATTGTGG 


GTTCTTATTA 


TCTGCAGTCT 


ATCATTGATA 


5280 


CCTATGTCCC 


AGATCAGATG 


CGTTCGACAC 


TAGGGATTAT 


TTCTATTGGG 


CTAGTCATCG 


5340 
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TCTACATCTT 


CCAGCAAATC 


TTGTCTTACG 


CTCACGAGTA 


TCTCTTGCTT 


GTTTTGGGGC 


5400 


AACCCTTGTC 


GATTGACGTG 


ATTTTGTCCT 


ATATCAAGCA 


TGTTTTTCAC 


CTCCCTATGT 


5460 


CCTTCTTTGC 


GACACGCAGG 


ACAGGGGAGA 


TCGTGTCTCG 


TTTTACAGAT 


GCTAACAGTA 


5520 


TCATCGATGC 


GCTGGCTTCG . 


ACCATCCTTT 


CGATTTTCCT 


AGATGTGTCA 


ACGGTTGTCA 


5580 


TTATTTCCCT 


TGTTCTATTT 


TCACAAAATA 


CCAATCTCTT 


TTTCATGACT 


TTATTGGCGC 


5640 


TTCCTATCTA 


CACAGTGATT 


ATCTTTGCCT 


TTATGAACCC 


GTTTGAAAAG 


ATGAATCGGG 


5700 


ATACCATGGA 


AGCCAATGCG 


GTTCTGTCTT 


CTTCTATCAT 


TGAGGACATC 


AACGGTATTG 


5760 


AGACTATCAA 


GTCCTTGACC 


AGTGAAAGTC 


AGCGTTACCA 


AAAAATTGAC 


AAGGAATTTG 


5320 


TGGATTATCT 


GAAGAAATCC 


TTTACCTATA 


G7CGAGCAGA 


GACTCAGCAA 


AAGGCTCTGA 


5880 


AAAAGGTTGC 


CCATCTCTTG 


CTTAATGTCG 


GCATTCTCTG 


GATGGGGGC7 


GTTCTGGTCA 


5940 


TCGATGGCAA 


GATGAGTTTG 


GGGCAGTTGA 


TTACCTATAA 


TACCTTGCTG 


GTTTACTTTA 


6000 


CTAATCCTTT 


GGAAAATATC 


ATCAATCTGC 


AAACCAAGCT 


TCAGACAGCG 


CAGGTTGCCA 


6060 


ATAACCGTCT 


AAATGAAGTG 


TATCTAGTAG 


CTTCTGAGTT 


TGAGGAGAAG 


AAAACAGTTG 


6120 


AGGATTTGAG 


CTTGATGAAG 


GGAGATATGA 


CCTTCAAGCA 


GGTTCATTAC 


AAGTATGGCT 


6180 


ATGGTCGAGA 


TGTCTTATCG 


GATATCAATT 


TAACCGTTCC 


CCAAGGCTC7 


AAGGTGGCTT 


6240 


TTGTGGGGAT 


TTCACGCTCA 


GGTAAGACGA 


CTTTGGCCAA 


GATGATGGTT 


AATTTTTACG 


6300 


ACCCAAGTCA 


AGGGGAGATT 


AGTCTGGGTA 


GTGTCAATCT 


CAATCAGATT 


GATAAAAAAG 


6360 


CCCTGCGCCA 


GTACATCAAC 


TATCTCTCTC 


AACAGCCCTA 


TGTCTTTAAC 


GGAACGATTT 


6420 


TGGAGAATCT 


TCTTTTGGGA 


GCCAAGGAGG 


GGACGACACA 


GGAAGATATC 


TTACGGGCGG 


6480 


TCGAATTGGC 


AGAGATTCGA 


GAGGATATCG 


AGCGCATGCC 


ACTGAATTAC 


CAGACAGAAT 


6540 


TGACTTCGCA 


TGGGGCAGGG 


ATTTCAGGTG 


GTCAACGTCA 


GAGAATCGCT 


TTGGCGCGTG 


6600 


CTCTCTTGAC 


AGATGCGCCG 


GTCTTGATTT 


TGGATGAGGC 


GACTAGCAGT 


TTGGATATTT 


6660 


TGACAGAGAA 


GCGGATTGTC 


GATAATCTCA 


TTGCTTTGGA 


CAAGACCTTG 


ATTTTCATTG 


6720 


CTCACCGCTT 


GACTATTGCT 


GAGCGGACAG 


AGAAGGTAGT 


TGTCTTGGAT 


CAGGGCAAGA 


6780 


TTGTCGAAGA 


AGGAAAGCAT 


GCTGATTTGC 


TTGCACACGG 


TGGCTTTTAC 


GCCCATTTGC 


6843 


TCAATAGCTA 


GAAAGAGGAG 


AGGATGAAAC 


CAGAATTTTT 


AGAAAGTGCG 


GAGTTTTATA 


6900 


ATCGTCGTTA 


CCATAATTTT 


TCCAGTAGTG 


TGATTGTACC 


CATGGCCCTT 


CTGCTTGTGT 


6960 


TTTTACTTGG 


CTTTGCAACT 


GTTGCACAGA 


AGGAGATGAG 


TTTGTCCACT 


AGAGCTACTG 


7020 


TCGAACCTAG 


TCGTATCCTT 


GCAAATATCC 


AGTCAACTAG 


CAACAATCGT 


ATTCTTGTCA 


7080 


ATCATTTGGA 


AGAAAATAAG 


CTGGTTAAGA 


AGCGGGATCT 


TTTCGTTCAA 


TACCAAGAAG 


7140 
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GGGCAGAGGG TGTCCAAGCG GAGTCCTATG CCAGTCAGTT CCACATCCTA AAGGATCAAA 72 00 

AAAAGCAATT GGAGTATCTG CAAAAGAGCC TGCAAGAAGG GGAGAACCAC TTTCCAGAGG 72 60 

AGGATAAGTT TGGCTACCAA GCCACCTTTC GCGACTACAT CAGTCAAGCA GGCAGTCTTA 7 3 20 

GGGCTAGTAC ATCGCAACAA AATGAGACCA TCGCGTCCCA GAATGCAGCA GCTACCCAAA 73 80 

CCCAAGCCGA AATCGGCAAC CTCATCAGTC AAACAGAGGC TAAAATTCCC GATTACCAGA 74 40 

CAGCTAAGTC AGCTATTGAA ACACGTGCTT CCTTCCCCCG TCAGAATCTA CCCTACTCTC 7 500 

TTTACCAGTC C7ACAAGTCT CAGGGCGAGG AAAATCCCCA AACTAAGGTT CAGGCAGTTG 7 560 

CACAGGTTGA AGCACAGATT TCTCAGTTAG AATCTAGTCT TGCTACTTAC CGTGTCCAGT 7 620 

ATGCAGGTTC AGGTACCCAG CAAGCCTATG CGTCAGGGTT AAGCACTCAA TTGGAATCCC 7 680 

TTAAATCCCA ACACTTGGCA AAGGTTGGTC AGGAATTGAC CCTTCTAGCC CAGAAAATTT 7740 

TGGAGGCAGA GTCAGGTAAG AACGTACAGG GAAATCTTTT AGACAAGGGG AAAGTTACGG 7800 

CGAGTGAGGA TGGGGTGCTT CATCTTAATC CTGAGACCAG TGATTCTAGC ATGGTTGCAG 7 860 

AAGGTGCCCT ACTAGCCCAA CTTTATCCAT CTTTGGAAAG AGAAGGGAAA GCCAAACTCA 7 920 

CAGCTTATCT AACTTCAAAA TATGTAGCAA GAATCAAGGT CGGTGATTCT CTTCGCTATA 7 980 

CTACGACTCA TGATCCCGGG AATCAACTTT TCCTACATTC TACT ATT AC A AGTATTGATG 804 0 

CGACAGCTAC TAAGACTGAG AAAGGGAATT TCTTTAAAAT CGAGGCGGAC ACTAATCTAA 3100 

CTTCGGAGCA GGCTGAAAAA CTTAGGTACG GGGTGGAAGG CCCCTTGCAG ATGATTACGG 8160 

GCAAGAAAAG TTACCTACGT TATTATTTGG ATCAATTTTT GAACAAAGAG TAATGTTCGT 8220 

GTTTTTAGAG TTAAATAATT TTTAAACTGT GAGAAAGATT CTTCTTCCAG TTTTTTCTTT 82 80 

ACAATTTTTG AAAAACATCT ACTATTTATT CGGTTAAATT CTTGTGTTTT TTGGTTTTTT 8 340 

GTGGTAAAAT GTGCTCAAGT AATACGAAAG GCGAACTTTA AAATCTCAAA ACAATTGATC 8400 

TATTCGGGAA AAGCTAAAGA TAT CT AT AC A ACTGAGGATG AAAATCTTAT TATTTCAACT 84 60 

TACAAGGACC AGGCGACTGC TTTCAACGGT GTCAAGAAGG AGCAGATTGC AGGTAAGGGA 8520 

GTCTTGAATA ATCAGATCTC ATCTTTTATT TTTGAGAAAT TAAATGTGGC TGGTGTGGCG 8 580 

ACTCACTTTG TGGAGAAACT TTCAGACACG GAACAACTCA ATAAAAAGGT TAAGATTATT 8640 

CCTTTGGAAG TCGTGCTCCG CAACTATACT GCTGGTTCCT TTTCAAAACG TTTTGGTGTG 87 00 

GATGAGGGAA TCGCCTTGGA GACTCCGATT GTCGAATTTT ACTACAAAAA TGATGATTTG 87 60 

GATGATCCAT TTATCAATGA TCAGCATGTG AAATTCCTAC AGATTGCGGG 7GACCAGCAG 8820 

ATTGCCTACT TGAAGGAAGA AACGCGTCGT ATCAATGAAC TATTCAAAG7 CTGG7TTGCT 8880 
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CAGATTGGGC TTAAATTGAT TGACTTTAAG CTAGAGTTCG GTTTTGACAA GGATGGCAAG 8940 

ATTATCTTCG CAGACGAATT TTCACCAGAT AACTGCCGCT TGTGGGACGC TGATGGCAAC 9000 

CACATGGATA AGGATGTTTT CCGTAGAGGA TTGGGAGAAC TAACCGACGT TTATGAGATT 9060 

GTTTGGGAAA AGTTGCAGCA ATTGAAATAA TCTGTTTGCA ACGGAAAACC TTCGTCTCTC 9120 

AACTAAAAGG ACTCAGGCTG AAAAGGTCCC CCAGACCTTT TCACTCTGTA GAGAACTAGG 9180 

TGAACTAACA GATCTTTACG AAATTGTCTG GGAAAAGTTG CAGGGTTTAA AATAACAACC 9240 

TCAAGGCTGT TTGGGAATAT TGCAAGAGCT GAAATAAAGG AATAACAATT GATGGATAAA 9 300 

CGTATTTTTG TTGAAAAAAA GGCTGATTTT CAGGTCAAGT CAGAGAGTTT GGTTAGAGAG 9 3 60 

CTCCAGCACA ACTTGGGACT GTCAAGCTTG AAAAGTATTC GTATTGTGCA AGTATATGAT 94 20 

GTATTTGACT TGGCTGAGGA CTTGTTTGCA CCTGCAGACA AGCACATTTT CTCTGAGCAG 9480 

GTAACCGACC ATGTTTTAGA TGAAGTATCT GTGCAGGCGG ATCTTGCTAA CTATGCTTTC 9540 

TTTGCCATTG AAAGTCTGCC AGGGCAGTTT GACCAGCGTG CACCTTCCTC ACAGGAAGCC 9 600 

TTGCTTTTGT TGGGAAGTTC GAGTGACGTG ACACTCAACA CAGCCCAACT TTACTTGGTG 9660 

AATAAAGATA TTGATGCGAC TGAGTTGGAA GCTCTCAAAA ACTACCTCC7 CAATCCAGTT 9720 

GATTCTCGTT TCAAGGATAT CACGACAGGG ATTGCCAAGC AGGAGTTTTC AGAGTCAGAC 9780 

AAGACCATTC CCAAATTGAC TTTCTTTGAA AGCTATGCAG CAGAAGACTT TGCTCGCTAC 9S40 

AAGGCCGAAC AAGGGATGGC CATGGAAGTG GATGATTTGC TCTTTATCCA AGACTACTTT 9900 

AAGTCAATCG GGCGCGTGCC AACTGAGACT GAACTCAAGG TTTTGGACAC TTACTGGTCT 9960 

GACCACTGCC GTCATACGAC TTTTGAGACA GAGTTGAAAC ACATCGACTT TTCAGCTTCT 10020 

AAATTTCAAA AGCAATTGCA GTCAACCTAT GACAAGTATA TTGCCATGCG CGAGGAATTA 10080 

GGTCGGTCTG AAAAACCACA AACCTTGATG GATATGGCGA CTATTTTCGG TCGTTATGAG 10140 

CGTGCTAATG GACGATTGGA TGATATGGAA GTCTCTGACG AAATCAATGC CTCCTCAGTT 10200 

GAAATTGAAG TGGACGTTGA TGGTGTCAAG GAACCTTCCC TCCTCATGTT TAAAAACGAA 10260 

ACCCACAACC ATCCAACAGA AATTGAGCCA TTTGGTGGAG CGGCTACCTG TATTGGTCGA 10320 

GCTATTCCTG ATCCGTTGTC ACGCCGTTCC TATGTTTACC AAGCCATGCG TATTTCAGGT 10380 

GCTGGTGATA TTACAGCACC GATTTCGGAA ACTCGCGCTG GGAAATTGCC ACAACAAGTC 10440 

ATTTCTAAAA CAGCAGCTCA TGGTTATTCT TCATATGGTA ACCAGATTGG GCTTGCAACA 10500 

ACCTACGTTC GTGAATACTT CCACCCAGGC TTTCTACCTA AACCTATGGA ACTTGGTGCC 10560 

GTTGTTGGTG CGACTCCCAA GGGCAATGTT GTCCGTGAAA AACCTGAAGC ACGTCATGTG 10620 

ATCATCCTTC TCGGAGGCAA AACAGGTCGT GATGGTGTCG GTGGTGCGAC GGGCTCTTCT 10680 
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AAGCTTCAAA CAGTTCAGTC TGTAGAGACT GCTCGTGCTG AGGTTCAAAA AGGAAATCCC 10740 

ATCGAAGAAC GCAAGATTCA GCGCCTCTTC CGTAATGGCA ATGTCACTCG TCTGATCAAG 10800 

AAGTCCAATG ACTTTGGGCC AGGCGGTGTC TGTC7GGCTA TCGCTGAATT GGCAGACGGT 10860 

CTTGAAATCC ACCTCAACAA GGTGCCTCTT AAATACCAGG GCTTGAATGG TACAGAAATT 10920 

GCCATCTCTG AATCACAAGA ACGGATGGCG GTCGTGGTTC GTCCTGAAGA TGTGGATGCG 10980 

TTCGTTGCCG AATGTAACAA AGAAAATATT GATGCTGTTG TGGTGGCGAC AGTAACTGAA 11040 

AAACCAAATC TTGTCATGCA CTGGAATGGT GAGACAATCG TTGACTTGGA GCGTCGTTTC 11100 

CTTGACACCA ATGGTGTCCG CGTGGTTGTC GATGCCAAAC TTGTGGACAA GGATGTCAAA 11160 

CTCCCAGAAG AGCGTCAAAC ATCTGCTGAA ACACTGGAAT CAGATACCCT TACGCTTCTA 1122 0 

TCTGACCTCA ACCATGCAAG TCAAAAAGGA TTACAGACTA TCTTTCACTG CTCTGTTGGA 11280 

CGCTCAACGG TTAATCACCC ACTTGGTGGT CGTTACCAAC TCACACCAAC TGAGGCATCT 11340 

GTGCAGAAAT TGCCAGTTCA ACACGGTGTG ACTCATACTG CGTCGGTCAT TGCTCAAGGT 11400 

TTCAACCCAT ATGTAGCTGA ATCCTCTCCA TACCACGGTG CTGCTTATGC GGTTATCGAA 114 60 

GCAACTGCTC GTTTGGTGGC TCCTGGTGCC AACTGGTTCA AGGCTCGTTT CTCTTACCAA 11520 

GAGTATTTCG AGCGTATGGA TAAACAAGCA GAGCGTTTCG GTCAGCCAGT AGCTGCTCTT 11580 

CTAGGTTCTA TTGAAGCACA AATTCAGCTT GGCTTGCCAT CTATCGGTGG TAAGGACTCC 11640 

ATGTCTGGTA CCTTTGAAGA ATTGACCGTT CCGCCAACCT TGGCTGCCTT TCGGGTGACG 11700 

ACGGCAGATA GCCGTAAGGT GCTCTCTCCA GAATTTAAAG CTGTTGGGGA AAATATCTAC 117 60 

TACATCCCAG CTCAAGCCCT CTCTGCAGAG ATTGATTTTG ACTTGATTAA GAAAAATTTT 11820 

GCTCAGTTTG AAGCCATCCA AGCTGACCAT AAACTGACAT CTGCATCAGC TGTCAAATAC 11880 

GGTGGTGTAG TTGAAAGTTT GGCTCTTGCT ACCTTTGGAA ACTATATTGG TGCAGAGGTG 11940 

ACCTTGCCTG AACTTGAAAC AGCTTTCACA GCTCAATTAG GCGGCTTTCT CTTCACATCT 12000 

CCTGAAGAAA TTGCTGGAGT AGAGAAGGTT GG AC AAACG A AAGCAGACTT TACACTGACT 12060 

GTCAACGGTG TGAAGCTAGA TGGACACAAG CTTGACAGTG CATTTCAAGG GACATTGGAA 12120 

GAAGTTTACC CAACAGAATT TACCCAAGCG AAAGAACTAG AAGAAGTACC AGCTGTGGCA 12130 

TCAGATGTTG TGATTAAAGC CAAAGAAAAG GTTGAAAAAC CTGTGGTTTA CATCCCAGTC 12240 

TTTCCAGGAA CCAACTCAGA ATATGATTCA GCTAAGGCCT TCGAAAAAGA ACGTGCAGAG 12300 

GTCAATTTGG TGCCATTCGT GACCTTGAAT GAAGAAGCTA TTGTCAAGTC ACTTCAAACT 123 60 

ATGGTTGACA ATATCGACAA GACTAATATT CTCTTCTTTG CTGGTGGATT CTCGGCTGCG 124 20 
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GATGAACCAG ATGGTTCACC TAAGTTTATC GTCAATATCC TGCTTAATGA AAAAGTGCGT 124 80 

GTGGCTATTG ATAGCTTTAT CGCCCGTGGT GGTTTGATTA TCGGTATTTG TAATGGATTC 12540 

CAAGCCTTAG TCAAATCGGG TCTCCTACCC TACGGAAACT TTGAAGCTGC TAACAGTACT 12600 

AGCCCAACCC TCTTCTACAA TGATGCCAAC CAACACGTGG CCAAGATGGT GGAAACTCGC 12660 

ATTGCCAATA CCAACTCACC ATGGTTGGTT GGTGTGCAAG TGGGCGATAT CCACGCTATT 12720 

CCTGTTTCGC ACCCTGAAGG GAAGTTTGTC CTGACCGCTC AGGAATTTGC AGAGCTCCGT 12 780 

GACAATGGAC AAATTTTCAG CCAATACGTT GACTTTAACG GTAAACCAAG TATCGATTCT 12840 

AAGTACAATC CGAATGGTTC TCTCCATGCC ATCGAAGGAA TTACCAGCAA GAATGGTCAA 12900 

ATCATCGGTA AGATGGGCCA CTCAGAACGT TATGAGGATG GTCTTTTCCA AAATATCCCA 12960 

GGCAATAAAG ACCAACACCT GTTCGCATCA GCGG7TAAAC ATTTCACTGG AAAATAAGAC 13020 

TTACAGATTT TCTAATAGAT AGTATCAGTA ATGTAAAAGT CATGTAAATC TAGCTCTTGA 13080 

?G ATT AC AAA TGAAAATTAG GTATAAAAAA TGACATACGA AGTAAAATCT CTTAATGAAG 13140 

AATGTGGTGT TTTCGGTATT TGGGGACATC CAGATGCTGC TAAGTTGACC TATTTTGGAC 13 200 

TCCACAGTCT TCAACACCGT GGTCAGGAGG GGGCAGGAAT CCTCTCCAAT GATCAAGGAC 13260 

AACTGAAGCG CCATCGTGAC ATGGGGCTTT TATCAGAAGT TTTCACAAAT CCAGCTAATT 13 320 

TGCATAAATT GACAGGAGCT GGTGCGATTG GGCATGTGCG TTATGCGACT GCTGGCGAAC 13380 

CTTCTGTAGA TAACATCCAG CCCTTCCTCT TCCGTTTTCA CGATATGCAG TTTGGTTTCG 13440 

CTCATAATGG AAATCTGACC AATGCAGCCT CTCTCAAGAA AGAACTGGAA CAAAGAGGAG 13500 

CAATTTTCAG CGCGACTTCG GACTCTGAAA TCTTGGCTCA CCTCATTCGT CGCAGTCATA 13 560 

ATCCTAGCCT GATGGGCAAA ATCAAGGAAG CCCTCAGCCT TGTCAAAGGT GGTTTTGCCT 13620 

ATATCTTGCT GTTTGAGGAC AAGTTGATTG CGGCTCTTGA CCCAAATGCA TTCCGACCGC 13680 

TTTCGATTGG TAAAATGGCT AATGGAGCAG TTGTTCTATC TTCTGAAACC TGTGCTTTTG 13740 

AGGTCATTGG TGCCGAGTGG ATTCGTGATT TGAAGCCAGG TGAGATTGTG ATCATTGATG 13800 

ACGAGGGCAT TCAGTATGAC AGCTATACAG ATGATACCCA GTTGGCGGTT TGTTCTATCG 13860 

AGTATATCTA CTTTGCTCGC CCTGATTCTA ATATCCACGG TGTCAATGTC CATACGGCAC 13920 

GTAAGAGAAT GGGAGCGCAA TTGGCGCGAG AATTTAAGCA TGAGGCAGAT ATTGTAGTTG 13980 

GTGTGCCCAA TTCTTCCCTA AGCGCGGCTA TGGGATTTGC GGAAGAATCA GGCTTACCAA 14040 

ATGAAATGGG TCTGATCAAA AACCAATACA CCCAGCGAAC TTTTATCCAA CCGACTCAAG 14100 

AATTGCGGGA GCAAGGAGTG CGGATGAAAC TGTCTGCTGT TTCGGGTGTT CTCAAAGCCA 14160 

AACGTGTGGT CATGGTGGAT GATTCCATTG TACGTGGAAC AACCTCTCCT CGTATCGTTC 14 220 
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ACCTCTTGAA 


AGAAGCGGGT 


GCGACTGAGG 


TTCACGTTGC 


CATTGGAAGT 


CCTGCACTAG 


14280 


CGTATCCATG 


TTTCTACGGG 


ATTGATATCC 


AGACCCGTCA 


GGAGCTGATT 


GCAGCCAATC 


14340 


ATACGGTCGA 


AGAAACTCGC 


CAAATCATTG 


GTGCGGACAG 


TCTGACTTAT 


CTTTCAATTG 


14400 


ATGGCTTGAT 


TGAGTCGATT 


GGTATCGAAA 


CAGATGCGCC 


GAACGGTGGT 


CTCTGTGTCG 


14460 


CTTACTTTGA 


CGGTGACTAC 


CCAACGCCTC 


TTTATGACTA 


CGAAGAAGAC 


TATCGTAGAA 


14520 


GTTTGGAAGA 


AAAGACCAGT 


TTTTACAAGT 


AGGCGACAGA 


TTCTCCATTA 


AAGAAAAGGA 


14580 


AAAAATAAAT 


GACAAATAAA AATGCATATG 


CCTCACGTCT 


CACTACTGAC 


TAAAGGCTTA 


14640 


AGCATTTAGT 


CACTAGACCC 


TTTGTCCTAT 


AGGATCAAAG 


CTAGAGCCCT 


GACTAGTATT 


14700 


TTTAGATAAA 


AAGATGGTTT 


ATCTAAAAAT 


ACGTCGCAGT 


CTTTCTCAAA 


AAAAGAAAAG 


14760 


GAAAAATAAA 


ATGGCAAATA 


AAAATGCGTA 


CGCTCAATCT 


GGTGTGGATC 


TTGAAGCGCG 


14820 


TTATGAAGTT 


GTTGAACGGA 


TTAAAAAGCA 


CGTGGCCCGT 


ACGGAGCGTG 


CAGGTGTCAT 


14880 


GGGAGCTCTT 


GGTGGCTTTG 


GTGGTATGTT 


TGACCTTTCC 


AAGACTGGGG 


TTAAAGAACC 


14940 


CGTCTTGATT 


TCAGCGACTG 


ACGGTGTCGG 


AACCAACCTC 


ATGTTGCCTA 


TCAAGTACGA 


15000 


CAAGCACGAT 


ACCATCGGGC 


AGGACTGTGT 


GGCCATGTGT 


GTCAACGACA 


TCATTGCTGC 


15060 


AGGTGCGGAA 


CCCCTCTATT 


TTCTCGACTA 


CGTAGCGACA 


GGGAAGAATG 


AACCAGCTAA 


15120 


GCTAGAACAA 


GTGGTTGCTG 


GTGTGGCAGA 


AGGTTGTGTG 


CAGGCTGGTG 


CTGCCCTCAT 


15180 


CGGTGGGGAA 


ACGGCTGAAA 


TGCCGGGCAT 


GTACGGCGAA 


GACGACTATG 


ACTTGGCTGG 


15240 


TTTTGCCGTC 


GGTGTGGCTG 


AAAAATCTCA 


AATCATTGAC 


GGTTCAAACG 


TCGTAGAGGG 


15300 


AGATGTTCTT 


CTCGGACTTG 


CTTCAAGTGG 


GATTCACTCA 


AATGGTTACT 


CTTTGGTTCG 


15360 


TCGTGTCTTT 


GCGGATTACA 


CAGGTGAGGA 


AGTCCTACCA 


GAATTGGAAG 


GCAAGAAACT 


15420 


TAAGGAAGTT 


CTACTTGAGC 


CGACTCGTAT 


CTATGTCAAG 


GCTGTCTTGC 


CGCTCATCAA 


15480 


AGAAGAGTTG 


GTCAACGGCA 


TTGCCCACAT 




GGCTTTATCG 


AAAATGTCCC 


15540 


TCGTATCTTT 


GCAGATGACC 


TAGCTGCTGA 


AATTGATGAA 


AGTAAAGTTC 


CAGTGCTTCC 


15600 


AATTTTCAAA 


ACCCTTGAAA 


AATACGGTCA 


GATTAAACAC 


GAAGAAATGT 


TTGAAATCTT 


15660 


CAATATGGGT 


GTGGGACTTA 


TGTTGGCGGT 


CAGCCCTGAA 


AATGTAGAGC 


GTGTAAAAGA 


15720 


ATTGTTGGAT 


GAAGCAGTCT 


ATGAAATTGG 


TCGCATCGTC 


AAGAAAGAAA 


ACGAAAGTGT 


15780 


CATTATCAAA 


TGAAAAAAAT 


AGCGGTTTTT 


GCCTCTGGTA 


ATGGCTCAAA 


TTTTCAGGTG 


15840 


ATTCCCCAAG 


AATTTCCAGT 


GGAGTTTGTC 


TTTTCAGACC 


ATCGTGATGC 


CTATGTGCTT 


15900 


GAGCGTGCAA 


AGCAGCTCGG 


CGTTCTGTCC 


TATGCTTTTG 


AACTCAAGGA 


GTTTGAGAGC 


15960 
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AAGGCAGACT ACCAAGCAGC CCTTGTCGAA CTCTTGGAAG AACACCAGAT TGACTTGGTT 16020 

TGCCTAGCAC CCTACATGAA AATCGTTGGA CCAACCTTAT TGTCGGCTTA TGAAGGTCGG 16080 

ATTGTCAACA TTCATCCAGC CTACTTGCCA GAATTTCCAG GAGCTCATGG GATTGAGCAT 16140 

GCTTGGAATG CTGGCGTGGG TCAGTCTGGT GTGACCATTC ACTGGGTGGA TTCGGGTGTG 16200 

GATACAGGCC AGGTCATCAA ACAGGTTCGT GTGCCACGAC TAGCTGATGA TACCATTGAC 16260 

AGATTTGAAG CTCGCATCCA TGAAGCAGAG TACAGGCTGT ATCCGGAAGT AGTGAAGGCT 16320 

CTATTTACAG ATTGACTTTT TGATGATTCA TATGATATCT TTGATTTTAA ATTGGAGTCA 16380 

GTGTTTGTTG AAGACGGCTT CAAACGGAGG TATTTGTAAT GTTAGAATCT AAAAAAACAA 16440 

CTCGATATGT ATTTTATCTC TATCTGATGT TATTAACTTG GGGAATCTTA TTTAAGTTTG 16500 

AAACAAATCC TCAATTTATA GCATTTTTCT TAGCTCCAAG GTATATCAAT TGGATTCCAT 16560 

TTTCAGAACC ACTAATAGTC GATGGAAAAA TTGTTTTTGC TGAAATGTTA TTTAATCTGA 16620 

TTTTCTTTAT TCCATTAGGT GTTTGTTTCC CTTTGATAAA AACTAATTTA TCTAGTTTAA 16680 

GAATAGTCGG GACAGGTTTC TTGATTAGTT TATTGTTTGA GTGCTTACAG TATATTTTAG 16740 

CAATAGGTAT AACAGATATA ACGGATTTGA CTTTAAATAC GCTAGGTGTC TGTGTAGCCT 16800 

TACTGATTTA TCAAATTTTT ATAAGAGTGT TCAAATCACA GACTAGAAAA TGGATCAATA 16860 

TCTTAGGTAT GCTTAGCCTT GGTTTTGCTT ATCTTGTTTT ACTGTTACTG CATTTACTTA 16920 

GTGTTTAACT AATGATTAAA AAGGAGAATA TAATCACTAA ACGCGTCTTA ATCAGCGTCT 16980 

CAGACAAAGC GGGCATTGTT GAATTTGCCC AAGAACTCAA AAAACTTGGT TGGGAGATTA 17040 

TCTCAACAGG TGGAACTAAG GTTGCCCTTG ATAATGCTGG GGTGGATACC ATTGCTATCG 17100 

ATGATGTGAC TGGTTTCCCA GAAATGATGG ACGGTCGTGT GAAGACCCTC CACCCAAATA 17160 

TCCACGGAGG GCTTCTCCC? CGTCGTGACT TGGATAGCCA CTTGGAAGCG GCTAAGGACA 17220 

ACAAGATTGA GCTCATTGAC CTTGTGGTGG TCAACCTTTA CCCATTTAAG GAAACTATCC 17 280 

TTAAACCAGA TGTGACTTAT CCTCATGCAC TTGAAAATAT CGATATTGGT GGGCCATCTA 173 40 

TCCTTCCTTC AGCAGCGAAA AATCATGCCA GTGTTACAGT TGTGGTAGAT CCTGCTGACT ^7400 

ACGCTGTGGT TTTGGATGAA TTGGCAGCAA ACGCCGAAAC CTCTTATGAA ACTCGCCAAC 17 4 60 

GTTTAGCAGC CAAAGTATTT CGTCACACAG CGGCTTATGA CGCCTTGATT GCAGAATACT 17520 

TCACAGCTCA AGTGGGTGAA AGCAAGCCTG AAAAACTCAC TTTGACTTAT GACCTCAAGC 17580 

AACCAATGCG TTACGGTGAG AATCCTCAAC AAGACGCGGA CTTTTACCAG AAAGCTTTGC 17 640 

CT AC AG ACT A CTCCATTGCT TCAGCCAAAC AGCTCAACGG GAAAGAATTG TCATTTAATA 17700 

ATATCCGTGA TGCAGATGCT GCTATCCGTA TCATCCGTGA CTTCAAAGAT AGTCCAACCG 177 60 
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TTGTGGCTCT CAAACACATG AATCCATGTG GAATTGGTCA AGCTGATGAC ATCGAGACTG 17820 

CrrCGCACTA CGCTTATGAG TCTGACCCAG TGTCTATCTT TGGTGGGATT GTCGTCCTCA 17880 

ACCGTGAGGT GGATGCTGCG ACAGCTGAGA AGATGCACGG CGTTTTCCTC GAAATCATCA 17940 

TTGCACCAAG CTATACGGAT GAAGCGCTAG CCATTTTGAT CAATAAAAAG AAAAACTTGC 18000 

GTATCCTTGC CTTGCCATTT AATGCTCAAG AGGCTAGCGA AGTGGAAGCA GAATACACAG 18060 

GTGTAGTCGG TGGACTTCTC GTGCAAAATC AAGACGTGGT CAAGGAAAGC CCAGCTGACT 18120 

GGCAAGTGGT GACTAAACGT CAGCCAACTG AGACAGAAGC GACTGCTCTT GAGTTCGCTT 18180 

GGAAGGCTAT CAAGTACGTC AAATCAAATC GTATTATCGT GACCAACGAC CACATGACAC 13240 

TTGGTGTTGG TCCAGGTCAA ACCAACCGTG TGGCTTCTGT TCGCCTTGCC ATTGACCAAC 18300 

CCAAAGATCG TCTGGACGGG GCCGTCCTTG CTTCAGATGC CTTCTTCCCA TTTGCGGATA 18360 

ACGTGGAAGA AATCGCCAAA GCAGGAATTA AGGCCATCAT CCAGCCCGGT GGCTCTGTCC 18420 

GTGACCAAGA ATCCATCGAA GCAGCGGATA AATACGGCTT GACTATGGTC TTTACAGGTG 18480 

TGAGACATTT TAGACATTAA GAAGATAAAA GGGAAGAAAA CAGTTTCTTT CCTTTTTTGG 18S40 

CTTAAAATAC TAACTGAAAC AAGATTAAAA CGAACTTTTT TGATATAATG TTGGTAAATA 18600 

ATTCGCAAAA GAGGTTGAGG AATGAAACTG CTTGTTGTCG GTTCTGGTGG TCGTGAGCAT 18660 

GCGATTGCTA AAAAGTTACT TGAATCAAAA GACGTGGAAA AAGTCTTTGT AGCTCCTGGG 18720 

AATGATGGGA TGACTCTGGA TGGTTTGCAA TTGGTAAATA TCTCTATTTC CGAACATTAT 18780 

AAATTGATTG ACTTCGCAAA GACCAATGAT GTTGCTTGGA CCTTTATCGG TCCAGATGAT 13840 

GCCCTTGCTG CTGGTATCGT GGATGATTTT AACCAAGCTG GACTTAACGC CTTTGGTCCG 18900 

ACTAGGGCTG CAGCGGAGCT GGAGTGGTCC AAGGATTTCG CCAAGGAAA? CATGGTCAAA 13960 

TACGGCGTTC CGACAGCAAC ATATGGCACA TTTTCAGATT TCGAGGAAGC CAAAGCCTAT 19020 

ATCGAAAAGC ATGGTGCCCC TATCGTAGTC AAGGCGGATG GCTTGGCACT TGGGAAGGGT 19080 

GTCGTCGTTG CTGAGACGGT TGAGCAAGCG GTCGAAGCCG CTCATGAGAT GCTTrTGGAC 19140 

AATAAATTTG GTGACTCAGG TGCGCGCGTG GTTATTGAGG AATTCCTTGA AGGAGAGGAA 19200 

TTTtCACTCT TTGCCTTTGT CAATGGTGAT AAGTTCTACA TCATGCCAAC GGCTCAGGAC 19260 

CACAAACGTG CCTATGATGG CGACAAAGGG CCTAACACCC CTCGTATGGG TGCCTATGCG 19320 

CCAGTCCCAC ACTTACCACA GAGTGTAGTT GATACAGCCG TTGACACCAT TGTCAAGCCA 19380 

GTTCTAGAAG GGGTGATTAA AGAAGGTCGC CCTTATCTGG GAGTTCTTTA CGCAGGGCTT 19440 

ATCCTGACAG CTGATGCACC GAAAGTCATT GAGTTCAACG CTCGGTTCGG AGATCCAGAA 19500 
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ACTCAGATTA 


TCTTGCCTCG 


CTTGACCTCT 


GACTTTGCTC 


AAAATATCAC 


AGATATCCTG 


19560 


GATAGCAAGG 


AGCCAAATAT 


CATGTGGACG 


GACAAGGGTG 


TGACTCTGGG 


TGTGGTTGTC 


19620 


GCATCCAAGC 


GCTACCCGCT 


AGACTATGAA 


AGGGGCCTTG 


AGTTGCCAGC 


CAAGACAGAA 


19680 


GGCGATGTCA 


TCACCTACTA 


TGCAGGGGCT 


AAGTTTGCGG 


AAAATAGCAG 


AGCACTGCTC 


19740 


TCAAACGGCG 


GACGAGTTTA 


TATGCTCGTT 


ACCACAGCAG 


ATACCGTCAA 


AGAAGCCCAA 


19800 


GCCAGCATAT 


ACCAAGAACT 


ATACCAACAA 


AAAATAGAAG 


GACTCTTCTA 


CCGAACAGAT 


19860 


ATCGGAAGCA 


AGGCAATTAA 


GTAAAGATAT 


AAGAATAACG 


CGCCGTAGTC 


GCCAAACACG 


19920 


ATAATGGTCG 


TCGTGGTGAA 


AAGACCAGAA 


CAGTGAATGT 


TCTGGTCAGG 


GGGAAACTTG 


19980 


GAGACCTTAG 


GCTCAAAGTT 


TAGGAATGAA 


ACCGAAGGTT 


TGCTTCCGCC 


TCCATCACCT 


20040 


AAGACCATTA 


TCAAAAAGAA 


AAATAAAAAT 


TCACAAAATA 


CGTTAATGAT 


CGTATGGTTT 


20100 


GCGAGCGTTA 


GCGAGCTAAT 


ATAGAACAA7 


CACCGCCGTT 


G7GAAAGAAC 


GATTGGATGA 


20160 


TAATCCAATC 


GTTCAGGGAA 


ATTGGAAGAC 


CTTGGGTTTC 


CAATTTAGGC 


ATGAGACACC 


20220 


TTTGGTGGCT 


GCTGCCGTCC 


CTCACAAGCT 


AAGGTGATTG 


TTGAAAAAGA 


GGAAAAAGGA 


20280 


GAAGAAATGA 


AACCAGTAAT 


TTCCATCATC 


ATGGGCTCAA 


AATCCGACTG 


GGCAACCATG 


20340 


CAAAAAACAG 


CAGAAGTCCT 


AGACCGCTTC 


GGTGTAGCCT 


ACGAAAAGAA 


AGTTGTTTCC 


20400 


GCACACCGTA 


CACCAGACCT 


CATGTTCAAA 


CATGCAGAAG 


AAGCCCGTAG 


TCGTGGCATC 


20460 


AAGATCATCA 


TCGCAGGTGC 


TGGTGGCGCA 


GCGCATTTGC 


CAGGCATGGT 


AGCTGCCAAA 


20520 


ACAACCCTTC 


CAGTCATTGG 


TGTGCCAGTC 


AAGTCTCGTG 


CTCTTAGTGG 


ACTGGATTCA 


20580 


CTCTATTCTA 


TCGTTCAGAT 


GCCGGGTGGG 


GTGCCTGTTG 


CGACCATGGC 


TATCGGTGAA 


20640 


GCTGGAGCGA 


CTAACGCAGC 


TCTCTTTGCC 


CTCCGTCTCC 


TCTCTGTAGA 


AGATAAGTCC 


20700 


ATTGCGGATG 


CACTTGCCAA 


CTTTGCTGAA 


GAACAAGGAA 


AAATCGCAGA 


GCAGTCGTCA 


20760 


AATGAGCTCA 


TCTAAAACAA 


TCGGAATTAT 


CGGTGGCGGT 


CAACTGGGTC 


AGATGATGGC 


20820 


CATTTCTGCT 


ATCTACATGG 


GCCACAAGGT 


TATCGCGCTG 


GATCCTGCCG 


CGGATTGCCC 


20880 


GGTCTCTCGT 


GTGGCGGAAA 


TCATTGTGGC 


ACCTTATAAC 


GATGTAGACG 


XCCTCCCTCA 


20940 


GTTGGCAGAC 


CGTTGCGATG 


TCCTCACTTA 


TGAGTTTGAA 


AATGTCGACG 


CTGACGGTTT 


21000 


GGATGCCGTT 


ATCAAGGATG 


GACAACTCCC 


TCAAGGAACA 


GATCTGCTCC 


GCATTTCGCA 


21060 


AAATCGTATT 


TTTGAAAAGC 


ACTTTTTGTC 


AAACAACGCT 


CAAGTCACTG 


TGGCACCCTA 


21120 


CAAGGTCGTG 


ACTTCTAGCC 


TAGACTTGGC 


AGATATCGAC 


TTGTCGAAAA 


ACTATGTCCT 


21180 


CAAGACTGCG 


ACTGGTGGCT 


ACGATGGTCA 


TGGACAAAAG 


GTTATTCGTT 


CAGAAGCAGA 


21240 


CTTGGAAGCA 


GCCTATGCGC 


TAGCAGACTC 


AGCAGACTGC 


GTC7TGGAAG 


AATTTGTCAA 


21300 
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CTTTGACCTT 


GAGATTTCTC 


TCATCGTGTC 


AGGAAATGGC 


AAGGAGGTCA 


, CGTTTTTCCC 


21360 


AGTTCAGGAA 


AATATCCACC 


GCAACAATAT 


CCTGTCTAAG 


ACCATCGTAC 


CAGCCCGCAT 


21420 


TTCTGAAAGT 


CTAGTAGACA 


AGGCTAAAGC 


TATGGCAGTG 


CGAATCGGAG 


AACAACTCAA 


21480 


CTTGTCTGGA 


ACTCTCTGTG 


TGGAAATGTT 


TGCGACAGCT 


GATGACATCA TTGTCAATGA 


21540 


AATCGCCCCA 


CGACCACATA 


ACTCTGGGCA 


CTATTCTATT 


GAAGCCTGTG 


ATTTCTCTCA 


21600 


GTTTGACACC 


CATATTCTGC 


CTGTTCTCGC 


AGCACC ATTA 


CCAGTCATCA 


AACTCCATCC 


21660 


GCCAGCCGTT 


ATGCTTAATG 


TCCTCGGTCA 


GCATGTCGAG 


GCTGCTGAAA 


AATATGTCAC 


21720 


AGAAAATCCA 


AGCGCCCACC 


TCCACATGTA 


TGGTAAAATA 


GAAGCAAAGC 


ATAATCGTAA 


21780 


GATGGGACAT 


GTCACTTTCT 


TTAGTGATGT 


GCCGGATAGT 


GTGGAAGAGT 


TTGGGGAAGG 


21840 


GATTGATTTT 


TAGGACAAGT 


CTATGATACA 


AATTATCGTT 


AATACATTTA 


TTGAAAAGTA 


2 1900 


TAAGACTGGA 


GCAGTTGTTG 


AAGTGTTGTA 


TGCCAGTGCT 


GACCAAGATA 


AGGTACAAGC 


21960 


TAAATATGAA 


GAACTAGCTG 


CACAATACCC 


CG AAAATT AT 


TTAGCTATCT 


ATAATGTACC 


22020 


GCTGGATACG 


GATTTGAATA 


CACTACATCA 


TTACCCGTCT 


GTCTTTATTG 


GAAAAGACGA 


22080 


GTTTGAGTAG 


AAATCTTGGT 


TTACCTAGAT 


AGCTTATTCC 


CAACAGCTTA 


AGAAGAAAGG 


22140 


AAAAATTAAC 


ACATGATCAA 


CCGTTACTCT 


CGCCCTGAGA 


TGGCGAATAT TTGGAGTGAA 


22200 


GAAAATAAAT 


ACCGTGCTTG 


GCTTGAGGTG 


GAAATCCTCT 


CTGACGACGC 


ATCCGCTCAG 


22260 


TTGGGGGAAA 


TCCCTAAGGA 


AGATGTGGCT 


TTGATTCGCA 


AGAAGGCGCA 


CTTTGACATC 


22320 


GACCGTATTT 


TGGAAATTGA 


GCAGGAGACG 


CCCCACGATG 


TGGTGGCTTT 


CACGCGTGCG 


22380 


GTTTCTGAGA 


CTCTTGGTGA 


AGAGCGCAAG 


TGGGTTCACT 


ATGGGTTAAC 


TTCTACTGAC 


22440 


GTCCTCGATA 


CTGCTTATGG 


TTACCTCTAC 


AAGCAGGCCA 


ACGACATCAT 


CCGTCGTCAC 


22500 


CTTGAAAACT 


TCACTAATAT 


CATCGCTGAC 


AAGGCCAAGG 


AGCACAACTT 


CACCATCATC 


22560 




v- i 1 vjVj i\3 I 




C C G AC AACCT 


TTGGTCTTAA 


ATTAGCAACT 


22620 


TGGTACAGCG 


AAATGAAACG 


CAATATCGAG 


CGCTTCGAGC 


ATGCGGCTGC 


TGGTGTAGAA 


22680 


GCTGGTAAGA 


TTTCTGGTGC 


GGTTGGCAAC 


TTTGCCAATA 


TCCCACCATT 


TGTAGAGGAG 


22740 


TATGTCTGCG 


ATAAACTTGG 


CATCCGTGCC 


CAAGAAATCT 


CTACACAACT 


CCTTCCTCGT 


22800 


GACCTTCACG 


CTGAGTACTT 


TGCGGTTCTT 


GCCAGCATTG 


CGACTTCAAT 


CGAACGTATG 


22860 


GCGACTGAGA 


TTCGTGGTCT 


ACAAAAATCT 


GAGCAACGCG 


AAGTAGAAGA 


GTTCTTTGCT 


22920 


AAAGGGCAAA 


AAGGGTCTTC 


AGCAATGCCT 


CACAAACGCA 


ACCCAATCGG 


T7CTGAAAAT 


22980 


ATCACTCGTC 


TGGCGCGTGT 


CATTCGTGGT 


CACATGATTA 


CCGCTTATGA 


AAACGTCGCT 


23040 
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CTCTGGCATG AACCCCATAT TTCTCACTCA TCAGCTCAGC GTATCATCAC ACCAGATACG 2 3100 

ACCATTTTGA TTGACTACAT GCTCAACCGT 7TTGGAAA7A TCGTCAAGAA CTTGACAGTC 23160 

TTCCCAGAAA ATATGATCCG AAACATGAAC TCGACTTTTG GTCTTATCTT TAGCCAACGG 2 3220 

GCTATGTTGA CATTGATTGA AAAAGGCATG ACCCGTGAGC AAGCCTATGA CTTGGTGCAA 23230 

CAAAAACAGC CTACTCTTGG GACAACCAAG TAGACTTTAA ACCACTTCTT GAGGCAGATT 2 3 340 

CAGAAGTAAC ATCACGTCTC ACACAAGAAG AAATCGATGA AATCTTCAAC CCAGTTTATT 2 3400 

ACACCAAACG AGTGGATGAT ATCTTTCAAC GTCTTGGACT AGGTGATTAA TTAAAAAATA 2 34 60 

AACAGCGAGC TTCAATCTCG CTGTTTATTT TTTATCGAAA AGACTTAGTC TTCTTTTCTT 23S20 

TTAGTGAGTC CATAGCCTGC TAGTGTGGAC ATGAGTCCTG CGACTACTAG TCCTGCAGAA 23530 

TCGTGAGTTC CTGTTTCAGG AAGTTTTTTC TCTGTTACCA CAGGAGCTGG ATCTTGAGGA 23640 

AGAACTTTGC TTTCCTCAGC AGGAGCAGTT GATGGAGCTG GTTGGCTTGG GATTTCTACT 2 3700 

TTTGGTTTTT CTTCAGCAAT AGCGGCTTGT CCGTTTTCAT CGCCTACATG TGTTACCATA 2 37 60 

GTTCCGACTT CGACTATTTG ACTAACCGCT TCCTGTGCTA CGACACTATT TACAAGTGTT 2 3820 

TTCACTTCCT TACCATCGGC ACAAGTGCTC ACAGAGTAGA AGTTGCTACG ATGTCCATTC 2 3880 

ACGCCCTTAG TAATGACTTG TGTTTTTCCT TTGAGTAAGA GTGGATTTTC ACAAGTCACT 2 3940 

GTGGTAAATG GAATTTCTTC TTCTTGGATA TCCAGTCTAG GTTTTACCTC AGTAGTTGGT 24000 

GCAAGACCAC TTTCATCACC CTTGTGAGTT ACAGGAGCGC CAACTTCAAC CACTTGGTTT 24060 

ATAACTTCTT TGGTTACCTG GCTATCAAGG ACTGTTTCTG TTGTTTTTCC ATTTTCAGTC 24120 

ACTA C AG AG A TGTAATGAGT TCGTTCACCT TTGACTCCTG CTGTGATAAT ATTTTCCTGA 24180 

CCGGCTGGGA GGTTAGGATT TTCTTTCTTG ATAACTTCAA ATGGAATTTC TTCAGTTCTT 242 40 

GTGATGAGTT CTGCTCTGGT TTCAACATTG GCAGCCACTT CATTTTCATC TAGGCTTCCT 24 300 

GAATGAGTTA CAGCTGGTTT GAGGCCTTGA AGAGCGGCTT TTAGCTTGGC TACAAGCGTG 24 3 60 

TCAAGCTCAG CTTGTTTATT ACGGTTGAGG TTGTAATTTA GAGCTGTTTT AGCTCCGTCA 24420 

AGGGCCTCAA GACTTTCTTT ACTATATCCT TCTAAGTTTG TAGGAATTTT AGCTAATTCT .:44 30 

TCCCGGACAG CATTATAATT AGCACGAAAG TACTCTTTGT TGTGGTCTGC AAAGGCAGTC 24540 

ATGAGTTCAA AGATTTCCTC TTCCTTCTAT TCAGCGCTTC GTCTATCTGC CCAGATTGAA 24600 

AGCATACTTC CGACTGTTGG AAGATCTACT TCAGGATATT TGGTAGAAGC TAGTTGATTG 24 660 

AATGGTGTTT TTCCAGTATT CTCAATAGCT TTCTTGACGA AACCACCACC ATCTTCTGGT 2 4720 

TTTTGACCAA GAATCTAGTA CCAGTCACCG TTCGTATTCA AGAATTTATA CCCTTTGCTT 2 4730 

GCTAGGTATT GAGGTGATGC GAGGTTATAT CCCCACCAGC CTTTAGACCA GTAAGAAATC 24 840 
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AAGACATCTT TGTCAAACTG AACATCGTCC TTCTCTTCAT AGTAGAAGCC ATCGTTGAAG 24900 

GCCATTGGTT GAAGCCCTCT TTCTTTGGCC ATAGCTGCGA GGGTGTTGGC ATATTCGGCA 24960 

AATTTGCCAT AGAGTTGATA CCACTTGACG TAGTACCAGC CTTGGGCACT AGTCGCATCG 2 5020 

TTGGCGTATT CGTCAGTACC AAAGTTGAAA ATCTTTGTTT TACCTGCAAA GAAGTCCATG 2 5080 

TATTTACCGA TGAGGGCTTT TACAAAGTTC ATCGCTTCTT CGTTTTTCAA GTCCATAGTT 25140 

GTTTTTGAAA CTTTATCAAA GTGGGCTTGA GGATTTTTAA TACCTAATTT TTCCATGGCA 25200 

ACCAGCATAG CATCCATGTG ACCTGGACTG TTAATAGCTG CGATGAGACC GATCTCCTTA 2 5260 

GATTTAGCGT ATTCAATTAG CTCTCTTACT TCTGCCTCTG TTAGTGCAGT ACCGTTTGGA 2 5320 

TCGTCGTAGT AAGCTTTAGT TCCTTCGATA ATAGCTTTTT TAACGTCATC ACTAGCATAG 2 5380 

GTTTTTCCGT TGGCAGTAAT GGTCATATCA TCGAGTAGAA AGCGAAGTCC GTCATTTCCT 2 5440 

AGAAGGAGAT GGACATCAGA ATATCCGAGC TCACTGGCCT TGTCTACGAT GCGTTTGAGC 25500 

TGGTTCAGAG TAAAGTATTT GCGTCCAGCA TCGATTGAGA TTACCTTGTT TTTGGCAAGT 25560 

TTTTCAACCT CACGTTTAGC TTCTTCTTCT TTTTGAGCTT CAGGCCTGAG GGTCAAGTTG 2 5620 

TTGACACTTT CTTGAAGTTT AGCAATGGCT TGATCAATCG TATCTTCTTG GGCACGGCTA 25680 

AGGTTGCTAT CGAGAGAGCG AATAGCTTTT TCAGCTTCTT TTACGGCCCT GACCCTTTCT 25740 

GCAGTATAAC GGTTCACGTC TTTTGGTACC TCGTTAAGTG CTTGCTCTGC AGATTCATAA 25800 

TCAGCTGCGA AGTATTCAGC GTTGGCATTT GCAAAATGAC GCATGACTTT GAAGAGGCGT 25860 

GATGGTGAAT AACGTGCAGA TGGAGTGTCA GCCCAAGCAG CTACCATACC ACCGATGATT 2 5920 

GGGATATCAG CTCCTTCTGT TTTTGGTACA GAAGTGATTG GTGTGTTTTT AATACCATTG 25980 

AGCCCCTGAT CGAGATTGTA CCAGCCTTGG CCATCACCGT TTCGTCCAAG AACGTAGTAC 26040 

CAAGCATCAT TGGTATTAAG GATTTGGTGA CCTTTTTCAG CTAGTAGTT7 AGAAGAAGCG 26100 

ACATCGTAGC CTCCCCAACC ACCACTCCAC ATAGAAACGA TGATGTCTTT GTCAAAACTA 26160 

CCAAAGCTTG TGTCGCTATT GTAGTAGATA CCGTCGTTAA AAGCCATTGG TTTGAGACCG 2 6220 

TGCGATTTTA CAATACGAGC GAGGTCATTG GCGTAGCCAA TAAATTTTTC ATAGCCTTTT 26280 

ACAGGGTAGC CTTCGTTTGG ATAGTATTTA TCAGCTTGAA GCACACTCCA ACCTTTAGCA 26340 

TCTGTCGCAT CATTGCCATA TTCATCAAGT CCGATGTTGA AGATTTCAGT CTTTTTCCCG 26400 

AAATAAGCAG CATACTTGTC GATAAGGGCT TTTGTAAAAG CGACACCTTG TTCGTTGTCA 26460 

AGATCGACAG TACGGGCTGA TTTCTTCCCA AAATAGCTAA AGTTAGGGTT TTGGATTCCC 26520 

AATTCTTTCA TGGCATTGAG AATCGCATCC ATGTGTCCAG GACTATTTAC TGTCGGAATG 26580 
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AGACCGATAC 


CTTTATCTTT 


GGCATAGTTA 


ATCAGATCTG 


TCATTTGACT 


TTCTGTTAAG 


26640 


TGATTGCCCT 


TTGGATCGTT 


GTAATAATCA 


TTTGTACCTT 


TTTCAATGGC 


GCGTTTGACA 


26700 


TCGTCACTGG 


CATAGGTCTT 


GCCGTTAGCT 


GTGATGCTCA 


TATCGTCCAA 


CATGAAACGG 


26760 


AGTCCATCAT 


TTCCCACTAA 


TAGGTGTAAA 


TCAGTGTACC 


CATAATGTTT 


CGCTTTATCG 


26820 


ATGATTTCCT 


TGAGCTGTTC 


TGGTGAGAAA 


TATTTACGTC 


CAGCATCAAT 


AGAAACAATT 


26880 


TTCTTTTTCG 


CTAGTTTTTC 


ATTTACAGTT 


GCAGCACGTT 


CCTTTCCTGC 


CTCTGTTGCC 


26940 


GGTTTGTCAG 


CCTCTGCTTT 


CGCTTCATCT 


TTTTTACCTC 


GTTTATCCTT 


GTCAGTCTTG 


27000 


TCTGTATTTG 


ACTCTTTAGA 


ATCAACCTCT 


TTCGCTTCTT 


CCTTTTTAGG 


GCTAGCTTCT 


27060 


TCTGCCTTTT 


TATTAGCAG*? 


TTCTTTTTCA 


CCAGAACTTC 


GAGTTACCAC 


TTCTGCTTTA 


27120 


TCACTAGGAG 


TTGAACTAAC 


TTCCTCTTGT 


GGTTTTTCTT 


CTGTTTTTGG 


AAGACTACCT 


27180 


ACCTTATCAG 


TAGCTGGAGT 


TTCTGTTTCT 


ACAGTTTTTG 


GAGCTTCTGC 


TTGAAGCACT 


27240 


GCTTTAGGTG 


TTTCCTCAGT 


CCGATTTTCG 


GATGATTGAG 


GGGAATCAGA 


AACCGTATGG 


27300 


ATGGTCGGTT 


GGTTTTCTGT 


AGTAGTAGGA 


GTAACTCCAT 


CGGCTGCAAC 


AGTCTGTGCT 


27360 


TGGAAGGCAA 


ATCCAATTAG 


AACAGAAGCT 


GCTCC7ACAG 


CGTATTTACG 


AATAGAAAAA 


27420 


CGCTGTTGTT 


TTTCATGTTT 


CATTGCAAAA 


CCTCCTGATT 


GCATTGTTAT 


ATTGATAGCG 


27480 


ATTATATAAA 


TCAACGCCTT 


TATTTTATTT 


CTTATATTAA 


TTTCTTATAT 


TAACGAGAGT 


27540 


CAAGAGGAGA 


TGACAAAAAA 


CTATAATAAG 


TATAAAAAAA 


TATAAAATTT 


AAACTTAAGA 


27600 


TTTCAGATTG 


GTCCGAAAAA 


ATACGTATAT 


ATATCTAGTA 


TAATTTTTGG 


TTCTATTTCT 


27660 


ATAAAATATT 


CCACAAATTA 


TAGAATTTTC 


CAAAAATAGG 


TAAGCGCTAC 


CTTTTTGCTC 


27720 


TAGTATAATA 


AGCATAGAAA 


AAGCCCAAGC 


GATTAGCTCA 


GGTTTTCTTC 


TTACTCATCA 


27780 


CGGTCACATG 


AGATAAATTT 


AATCTTGTAG 


TAATCAGATC 


GTTTGTAAGT 


TTCACTGTAT 


27840 


TCTAAAACTT 


GGCCAGTTGA 


TTCGAGTTTG 


GTGATTTTAG 


TTTGTAGGAC 


AGTAGGGAAT 


27900 


TGTTCATCGA 


CTCCGAGGAC 


TGAAGCTGCA 


TGTTCTGGAG 


TTGGAAAGAC 


TATTTCCTTG 


27960 


ATTTCTTCAA 


AGTGTTCATC 


ATTCATGTGA 


ATGTGGTAGT 


CTAACTTGAA 


ACGATTATAG 


28020 


ATAGAACTAT 


AGTATTCAAG 


GTTTGGATAA 


TTTGCGTTGA 


TATATTGTTC 


TGGGATGTAG 


28080 


GATGTATGGT 


AGATATAAAC 


GACACCGTTT 


GATTCGCGGA 


TACGTTCAAT 


CTTCTAGTAG 


28140 


AATTGATCGC 


CGCGTAGACC 


CAATTTTTCC 


AAGTAAACAA 


GCTTGTTTCC 


CCGTTCAATT 


28200 


GAAAGAACAG 


TTACCTTATC 


ATCTTTAGCA 


TTGAAGAGTT 


CAATATCTGA 


AAACTCTACA 


28260 


AGCTTGTGTT 


TGCGTGCACG 


TGAAACGAAG 


GTTCCTTTTC 


CTTGTTGGCG 


GACAATATAG 


28320 


CCATCTTTGG 


CAAGGTCGTT 


TAAGGCGCGA 


, ACAACTGTGA 


t TAGAGC7CAC 


ATCGTACATT 


28380 
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GAAATGAGTT CTGCTTCACT GTAAAATTTA TCTCCACTGC TAAACTGCCC AGAGATGATT 28440 
TTATTTTTTA ATTCGTCTTT TATGTATTGA TGG 2 8473 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 6749 base pairs 

(B) TYPE: nucleic acid 

--—■(■CI STRANDEDNESS : double - --■ 

ID) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
CCTCATGGCT GGTATCCGAG GATACAGTTC TGAAAATCCC CGTTACTTAA TTAATGGACG 60 

CGAAGTCACA CCTGAGGAAT TTGCTCACTA TCGTGCGACT GGTCAATTAC CAGGAAATGC 120 

AGAAACTGAT GTGCAAATGC CACAACAGGC ATCAGGTATG AAACAAGGCG GTGTCCTTGC 180 

AAAACTAGGT CGAAACTTAA CAGCAGAAGC GCGTGAGGGC AAGTTCGATC CTGTTATCGG 240 

ACGAAACAAG GAAATTCAAG AAACATCTGA AATCCTCTCA CGCCGCACCA AGAACAATCC 300 

TGTTTTGGTC GGAGATGCAG GTGTTGGTAA GACAGCAGTT GTCGAAGGTC TAGCGCAAGC 360 

CATTGTGAAC GGAGATGTTC CTGCTGCTAT CAAGAACAAG GAAATTATTT CTATTGATAT 4 20 

CTCAGGTCTT GAGGCTGGTA CTCAATACCG TGGTAGCTTT GAAGAAAATG TCCAAAACTT 480 

AGTCAATGAA GTGAAAGAAG CAGGGAATAT TATCCTCTTC TTTGATGAAA TTCACCAAAT 540 

TCTTGGTGCT GGTAGCACTG GTGGAGACAG TGGTTCTAAA GCACTTCCGG ATATTCTCAA 600 

CCCAGCTCTC TCTCGTGGAG AATTGACAGT GATTGGGGCA ACAACTCAAG ACGAATACCG 660 

TAACACCATC TTGAAGAATG CTGCTCTTGC TCGTCGTTTC AACGAAGTGA AGGTCAATGC 720 

TCCTTCGGCA GAGAATACTT TTAAAA7TCT TCAAGGAATT CGTGACCTCT ATCAACAACA 7 80 

CCACAATGTC ATCTTGCCAG ACGAAGTCTT GAAAGCAGCG GTGGATTATT CTGTTCAATA 840 

CATTCCTCAA CGTAGCTTGC CAGATAAGGC TATTGACCTT GTCGATGTAA CGGCTGCTCA 900 

CTTGGCGGCT CAACATCCAG TAACAGATGT GCATGCTGTT GAACGAGAAA TCGAAACGGA 960 

AAAAGACAAG CAAGAAAAAG CAGTTGAAGC AGAAGATTTT GAAGCAGCTC TAAACTATAA 1020 

AACACGCATT GCAGAATTGG AAAGGAAAAT CGAAAACCAC ACAGAAGATA TGAAAGTGAC 1080 

TGCAAGTGTC AACGATGTGC CTGAATCTGT GGAACGAATG ACAGGTATCC CAGTATCGCA 1140 

AATGGAAGCT TCAGATATCG AACGTTTGAA AGATATGGCT CATCGCTTCC AAGACAAGGT 1200 

GATTGGTCAA GATAAGGCCG TAGAAGTTGT AGCTCGTGCT ATCCGTCGTA ACCGTGCTGG 12 60 
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TTTTCATGAA GGAAATCCCC CAATCGGCAA CTTCCTCTTT GTAGGGTCTA CTGGGGTTGG 1320 

TAAGACGGAG CTTGCTAAGC AATTGGCACT CGATATGTTT GGAACCCAGG ATGCGATTAT 13 80 

CCGTTTAGAT ATGTCTGAAT ACAGTGACCG CACAGCTGTT TCTAAGCTAA TTGGTACAAC 14 40 

AGCAGGCTAT GTGGGTTATG ATGACAATAG CAATACCTTA ACAGAACGTG TTCGTCGCAA 1500 

TCCATACTCT ATCATTC7CT TGGATGAAAT TGAAAAGGCT GACCCTCAAG TTATTACCCT 1560 

TCTCCTCCAA GTTCTAGATG ATGGTCGTTT GACAGATGGT CAAGGAAATA CAGTAAACTT 1620 

CAAGAACACT GTCATTATTG CGACCTCAAA TGCTGGATTT GGCTATGAAG CCAACTTGAC 1680 

AGAAGATGCG GATAAACCAG AATTGATGGA CCGTTTGAAA CCCTTCTTCC GTCCAGAATT 1740 

CCTCAACCGC TTTAATCCAG TCATCGAGTT CTCACACTTG ACTAAGGAAG ACCTTTCTAA 1300 

GATTGTAGAT TTGATGTTGG CTGAAGTTAA CCAAACCTTG GCTAAGAAAG ACATTGACTT 1360 

GGTAGTCAGT CAAGCGGCTA AAGATTATAT CACAGAAGAA GGTTACCACG AAGTCATGGG 1920 

GGTTCGTCCT CTCCGTCGCG TGGTTGAACA AGAAATTCGT GATAACGTGA CAGACTTCCA 1980 

CTTGGATCAT TTAGATGCTA AACATCTGGA AGCAGATATG GAAGATGGCG TTTTGGTTAT 2040 

TCGTGAGAAA GTCTAAGACA GAATTTTGAG GATAAAAAAG AAGGAGCCAG CTGAAAAAAA 2100 

CTGGTTCCTT TTTAGGTACG ACAGGCATGT CGTATAGTAG AAGTGTATTA TTCTAGTTTC 2160 

AATATACTAT AGTAGCTCAG AAGTCGGTAC TTAAACGTGC TATATCAAAA CCAGTCCTGG 2220 

AAAAACGTGG ACTGCTTTCG TGTTTGGATT ATTACCTTGA ACGACATGCG TTAAAACTTA 2 2 BO 

GTTGAACCGC CGTATGCCGA ATGGTACGTA CGGTGGTGTG AGAGGGGCTA GAGATTATCC 2340 

CCTACTCGAT TTTAAATCAC ATGACGTTCA AAGGCATCAT CTGAAATCCC TTGTTCCAAG 2 400 

ATGACTTTTG CCCATTCTTT AGCAGAGAAG AGGCTGTGCT CCTTGTAGTT TCCGCAAGAT 24 60 

TCGATGGTTG TCCCTGGGAC ATCTTCCCAA GTAGTAGTTT CAGCGATTTC CTTGAGCGAA 2 520 

TCCTTGATAA CAGCTGCGAT TTTAGCACTG GTCTGACGTC CCCACATAAT CATGTGGAAG 2 580 

CCTGTGCGCC AACCAAATGG TGAACAGTCA ATCATGCCGT CAATGCGGGT ACGGATGAGT 2 640 

TTGGCTAAGA GGTGCTCGAT AGTGTGAAGG CCGGCAGTAG GGATAGAGTC TTCGTTTGGT ?7 00 

TGCACCAAGC GAATATCATA ATTGGAGATG ATGTCTCCTT TTGGTCCTGT TTCTTCCCCA 27 60 

ATCAAGCGAA CATAGGGTGC TTTGACAATG GTGTGGTCAA GTTCAAAACT TTCGACAATA 2820 

ACTTCTTTTG ACATGGTAAA TCCTTTCAGT TTTCTTCTCT CATTATATCA TAAAGGTTGC 2880 

TCCTGAGACA GAGAGAAAAC CTCTCCGAGC CTGGAGAGGT TGAAATCTTT ACTTACGATA 294 0 

TAAGCGGTCG TATTGCTACT ATGGGTCAAA GGTTACGTTG ATACCCAGTT TACGAAGGAC 3000 

ATTCTTGTCT TCATCAGTCA AGATGATGGT TGAGTGGGCT TCGCTTCCTT TGAGGTTGCC 30 60 
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GAGTTCTTCC AT AG CGCGGG CAGCATCAGG ATTTTCTGTA GCTGTGATAG CAAGTGCAAT 3120 

CAGGATTTCA TTTGAATGAA GGCCTGGATT GCGGCTACCG AGATGATCGA TTTTAAGACC 3180 

TTGGATTGGC TTAACAACTT CAGGCTCGAT TAGTTTTACT TCTTTAGCCA TGTCAGCTGA 3240 

TTTTTTGATG GCGTTGATCA AGGCAGCGGC TGTAGGACCA AAGAGTTCTG ACTTCTTACC 3 300 

ACTGATCA TT T CCCCATTTG GCAATTCAAA GGCTAGGGCT GGTCCACCAG TTTCTTCTGC 3 3 60 

TTTTTGGCGC GCAACGACAG CAACCTTACG GTCTGCAGGT GTGATACCGA GGTCGTTCAT 3420 

GAGCAACTCA AT T TTCTTGA CGGCAGCTTC GCCAACTTTT TCAGCTTTGA AGTCAAGAAC 3480 

TGTTTGATAG TAACGGCGGA TGATTTCTTG TTTAGAAGCT TCGACAGCGG CCTCGTCATC 3 540 

TCTAATAGCG AAACCAACCA TGTTGACACC CATATC7CTC GGTGAAGCGT ATGGTCATTT 3 600 

TCCGAGAATA CGTTCCAACA TGCGTTTGAG CACTGGGAAG ATTTCGATAT CACGGTTGTA 3 660 

GTTGACAGTG GTTTCTCCAT AGGTTTGAAG ATGGAAGGGG TCAATCATGT TGACATCATC 3720 

AAGGTCAGCT GTGGCAGCTT CATAAGCCAA GTTAACTGGA TGATGAAGGG GAAGATTCCA 3 7 80 

AACAGGGAAG GTTTCAAATT TAGCGTAGCC AGATTTCATG CCATTGATTT GGTCGTGGTA 3840 

CATATTGGAC ATACACGTTG CCAATTTTCC AGAACCAGGT CCAGGAGCGG TTACGACAAT 3 900 

CAAGTTGCGA CTGGTTTTGA TCTAGTCGTT TTTGCCCATG CCTTCTGGGG AAATGATGTC 3 960 

ATCCATATCC GTCGGATATC CTTTCATTGG ATAATGAAGA TAAGAATCAA TTCCGTTTTT 4020 

CTCAAGTTGA TTGCGGAAGG CATCTCCAGC GGGTTGGCCA GCGTATTGTG TAATGACAAC 4080 

GGAACCAACA AAAATCCCTA ATTCATTGAA TTTATCAATC AAACGAACAA CTTCTTGGTC 4140 

ATAAGAAATG CCTAAGTCGC CACGTGCTTT GGAATGTTCA ATGTTGCTAG CATTAATGGC 4200 

AATCACAACC TCAACCTGCT CTTTCAATTC TTGCAAGAGC TTGATTTTGT TGTCAGGTTC 4 260 

ATAACCAGGA AGGACACGAG CAGCCTGGAA ATCTTCTAAC ATTTTACCGC CAAACTCTAA 4 320 

GTAGAGCTTG CCGTCAAATT GGTTAATGCG CTCCAAAATA TGGTCGCGTT GTAAATTCAA 4 380 

ATATTGTTCA GAACTAAAAG CTTGTTTTTT CATTTTTTTA CCTCTGCACT CTATTATAAT 4440 

AAAAAATTGG AAGTTAGGAA ACTACGGAGC TAAAAAAGAA ATTAAAAACA TTAAGCAAAC 4 500 

GCTTGCACAA AATTTTAAAA AGTGCTATCA TAGACTATAG ATTATGAAAA TAATGAGGTA 4 560 

AACAGATGCA AGAAAAATGG TGGCACAATG CCGTAGTCTA TCAAGTCTAT CCAAAGAGTT 4 620 

TTATGGATAG TAATGGAGAT GGAGTTGGTG ATTTGCCAGG TATTACCAGT AAGTTGGACT 4 680 

ATCTAGCTAA GCTAGGAATC ACACCAATTT GGCTTTCTCC CGTTTATGAC AGCCCTATGG 4740 

ATGATAATGG CTATGATATT GCTGATTATC AAGCGATTGC GGCTATTTTT GGAACCATGG 4800 
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AGGACATGGA 


TCAGCTGATT 


CCAGAAGCTA 


AGAAGCGTGA 


CATTCGTATC 


ATCATGGACT 


4860 


TGGTGGTCAA 


TCATACCTCA 


GATGAACATG 


CTTGGTTTGT 


CGAAGCCTGT 


GAAAATACTG 


4920 


ACAGCCCTGA 


GCGAGACTAC 


TATATCTGGC 


GCGATGAACC 


CAATGACCTA 


GATTCTATCT 


4980 


TTAGTGGGTC 


TGCTTGGGAA 


TACGATGAAA 


AGTCAGGTCA 


ATACTATCTC 


CACTTTTTCA 


5040 


GCAAGAAACA 


GCCGGATCTC 


AACTGGGAAA 


ATGAAAAACT 


TCGCCAGAAA 


ATTTATGAGA 


5100 


TGATGAACTT 


CTGGATTGAT 


AAAGGTATTG 


CTGGTTTCCG 


TATGGATGTT 


ATTGACATGA 


5160 


TTGGCAAAAT 


TCCTGACGAG 


AAGGTAGTCA 


ATAATGGTCC 


TATGCTCCAT 


CCCTATCTCA 


5220' 


AGGAAATGAA 


TCAGGCGACC 


TTTGGAGATA 


AGGATCTCTT 


GACAGTAGGG 


GAGACTTGGG 


5280 


GAGCAACTCC 


AGAGATTGCC 


AAGTTCTACT 


CTGATCCAAA 


GGGGCAAGAA 


TTGTCTATGG 


5340 


TCTTCCAGTT 


TGAACATATC 


GGTCTTCAGT 


ATCAGGAAGG 


TCAGCCTAAA 


TGGCACTATC 


5400 


AAAAAGAGCT 


GAATATCGCT 


AAGTTAAAAG 


AAATCTTCAA 


CAAATGGCAG 


ACAGAGTTAG 


5460 


GAGTTGAGGA 


CGGCTGGAAT 


TCCCTCTTCT 


GGAACAACCA 


TGACCTCCCT 


CGTATTGTCT 


5520 


CAATCTGGGG 


AAATGACCAA 


GAATACCGCG 


AAAAATCTGC 


CAAAGCCTTT 


GCAATCTTAC 


5580 


TTCATCTCAT 


GACAGGAACT 


CCTTATATCT 


ACCAAGGTGA 


GGAGATTGGG 


ATGACCAACT 


5640 


ATCCGTTTGA 


AACACTGGAT 


CAAGTAGAAG 


ATATTGAATC 


TCTCAACTAT 


GCGCGTGAGG 


5700 


CTCTTGAAAA 


AGGTCTTCCG 


ATTGAAGAAA 


TCATGCACAG 


TATCCGTCTT 


ATTGGACGTG 


5760 


ACAATGCCCG 


TACCCCTATG 


CAATGGGACG 


AGAGCAAAAA 


CGCTGCTTTC 


TCAACAGGTC 


5820 


AACCTTGGTT 


GGCGGTTAAT 


CCAAATTACG 


AGATGATCAA 


TGTCCAAGAA 


GCGCTGGCAA 


5880 


ATCCAGATTC 


TATTTTCTAT 


ACCTATCAGA 


AACTGGTCCA 


AATTCGCAAG 


GAGAATAGCT 


5940 


GGCTAGTTCG 


AGCTGACTTT 


GAATTGCTTG 


ATACGGCTGA 


TAAGGTCTTT 


GCTTATATAC 


6000 


GTAAGGATGG 


CGACCGTCGC 


TTCCTAGTTG 


TGGCTAACTT 


GTCCAATGAA 


GAGCAAGACT 


6060 


TCACAGTAGA 


AGCAAAAGTC 


AAATCTGTCT 


TGATTGAAAA 


CACTGCGGCT 


AAAGAAGTAC 


6120 


TTGAAAAACA 


GGTCTTGGCT 


CCATGGGATG 


CTTTCTGTGT 


GGAATTACTA 


TAAATATTTT 


6180 


TTGCAGAAAA 


ATTTAAAATT 


GAAATCGTAT 


AAAAACAAGG 


GAGGACTGTA 


TAAAAGACAG 


6240 


AAATCCTTTG 


TTTTTTATAA 


CCAAAGTTTA 


TAAACTTTCA 


TTCTTGAAAT 


TCAATTAACT 


6300 


TTACAAATTC 


CCACTATTAA 


GGAGAAAGAA 


GATGAACATA 


AAGAAGCGTG 


TCCTTAGTGC 


6360 


AGGCCTGACT 


TTTGCATCTG 


CTTTGCTTTT 


ACCCAAATCA 


TTCATACCTC 


TCTCAACTAG 


6420 


ATGTAACTTA 


CAAAACCCCT 


GACCTCATGA 


GCCACTTTCT 


TCCTCCTCAT 


GAGGTCAGTT 


6430 


TTACTTTCTC 


CTGTTCCAGT 


ATCGTTTTTC 


CTCGCTAGAT 


TTCCTCAAAA 


GGGCAGACTC 


6540 


CTCCCTTGGT 


GCGTCACACG 


ATTTTTTCAT 


CTCGACTGTT 


CTTTAATGCA 


TCATTAACGA 


6600 
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CGCTTTTCTT CTAGGTGGTT CATAAGGAAC AGGAAGATTC AGGTTGACTT TTCTAATCCT 6660 

AGAATAAAGT GCTGAAAACA ATTCGGAATA CCCATAGAGA CTAGACAATT TGAGGAGCTC 6720 

CTTCCGTCCT GTTCGAACAC ATTTTCCGG 67 49 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:- 1842 base pairs- - 

(B) TYPE: nucleic acid 
<CJ STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

TCTACCCATG GACTTTGAGG CATTCATTGT TCCATCTTCT AGTGGCGAAT CTTTTGATAC 60 

AAACGATTCA ATTCACTTGG ATACTGAAAC TCTCCCGCAA ACATTTTTCT CGTTAACTCA 120 

ATCCAGCTGA TATTTCTTTC AGCCAAAATA ATGGACAAGT TCTCCCAAAA TCCTTCAGCC 180 

ATATTGCTTC TCCTTTAGTT AGATAAATAA TGTGTTTGCG CCATGTAAAT CAATTGTTTC 240 

GTATCTCTTG GCAATAGAGC TCTAGCCTCT TCCAAATTCA GACTTGGATA AACTCGCTTA 300 

TTTGAAACCG CAAGAGGAAG TCTGATGCTT AGTTCAGGAT TTTTTAAAAT TATCTCAACG 3 60 

AAATCCGTTA ATCTTAGATT GTCACGGTTC TTAAATCGTA ATAAATTGGG AGATAAAAAC 420 

TCAAAACAAT CTGAAGAATA GCTCATCATC TCAATTAATT TGTCCTTTGT CATTTCAGAA 480 

ACTGAATGAC AAGATACCTC TATGCCATAG TTTTGGAAGA AATCTAAAAG AAGTTGATTT 540 

CTTTGTCTAT TTTTACTTAG ATAGAGATCA ATCATGCGAG ACCTCCCAAA GATTCGGTTC 600 

CATTTGATAT TCTGACACGA TTAAGGAATC TAATAAATTA AGGAATCTAA TAAATTTGCG 660 

AAGTTAATCG GTTTCTTGTC TTCATCATAA GCTTTTACAG TTACTTGGGT TGTAAGTATT 720 

CCCTCTTTTC CCTCGGCTCG ATAGCCTTGT CCATATAAAA CAAAAACGAG ATTTTGATGA 7 80 

TCATCTACAA AGGCATCAAC CCCATTCTTT ATGTCTTGAC TTTCAAGGAA TTCCATAACG 840 

TTTTGAAGAT AGGATTCGTA AAATAGTGGG TAGTTATGTT TTTTATGGTA ATCATCTAAA 900 

AATGTCACTT CAAACTCACA TGGAGAGTAA TTTTGACTTT GAACAGCCTA AAAGTGCCAT 9 60 

CAAATTTGAA TTGGAATAAA TCAAATAAAT AGCCCCATCC TCATCAATCC AACCTTTGCT 1020 

CAAAGACAAC TCCAACCGAT CTTTTAAAAC TGAGTAAACC ACCTTAACCT CCAGTTTCAT 1080 

ATTCTTATAC CGTTCACTCT CAAATAAAAG TTTGGGGAGC TTATAATAAC GCTCTGATGT 1140 

CTGATATTGA TTAGCGGTAA TACGCTTCAT TATTGTCCCT CCAACACTAA AATTCCAACA 1200 
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TTTCCAAATT CATCAAATCG GATTAAACCT ACTTGTTCCA TTTCATCAAC TAACTGAGTT 1260 

GCTTTTACCC AAATCATTCA TACCTCTCTC AACTAGATGT AACTTACAAA ACCCCTGACC 1320 

TCATGAGCCA CTTTCTTCCT CCTCATGACG TCAGTTTTAC TTTCTGCTGT TCCAGTATCG 1380 

TTTTTCCTCG CTAGATTTCC TCAAAAGGGC ACACTCCTCC CTTGGTGCGT CACACGATTT 1440 

TTTCATCTCG ACTCTTCTTT AATGCATCAT TAACGACGCT TTTCTTCTAG GTGGTTCATA 1500 

AGGAACAGGA AGATTCACCT TGACTTTTCT AATCCTAGAA TAAAGTGCTG AAAACAATTC 1560 

GGAATAGGCA TAGAGACTAG ACAATTTGAG GAGCTGCTTG CGTCCTGTTC GAACACATTT 1620 

TCCCACCACG TGAAGAAAAA GATGGCGGAA GCGTTTGATT CTTAAAGTTT GGAAGTCACC 1680 

TCCAGCTAGA TGTTTGAGAA AAAGATAGAG ATTGTAGGCG ATACAGCTCA TGATCATACG 1740 

AACTTCCTTT TTGATTAAGG TTGAACTATC CGTTTTATCG CCAAAAAATC CCTCCTTCAT 1800 

CTCCTTGATG AAATTCTCGG CTTGACCACG TCCACGATAA AG 1942 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) -SEQUENCE CHARACTERISTICS: 

(Ai LENGTH: 19390 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

TCATCTTTAT CTCCTCGAAA TTTTCTAATA TAGCCATTAT AACAGAATTT TGTGAAAATT 60 

CCTATTATAG TAAATCACTA TTTCAGTATA AAAAGAAAAA ACGAATCAGA CGATTCGCTC 120 

TTCTTAAAAT CTGAAAATAG CTTTCCAGAA AGGATTAGCC GATTTTTTCC AGATTGAGCA 180 

CTGCATCGTG ACTCATCAAG ACTTGACCAT ACTCTTCTAA GACTGAGCGA CTGATATCAC 240 

TATCGTCTGC AAACTCGCGC ATACGGGCCA ACAGCCAAGC TGGATATGGC CTTGGATCAT 300 

TTTCAATATC CACTAAAATC GTCAAATAAT AGCGCTCCTT CATTTTGTAG AGTTCAGAAG 3 60 

TTTCCATTTC AAAACTCACT GTCTTGGCAA AAGCTACCAA GTCAGCCAAC TTAGCAAAAG 420 

AAAGGATGTA GTAGATGTAA GGTTCTTTCT TACTCTCAGC TTCTTGTTCA GCCTGCTCTT 480 

GCTCTTCTTC CTTGACTTCA ACTTGCTCAA GAGATTGAAT GGCTTCGATA TCATCCTTGG 540 

TTTTGTCTGC GATGCTTTTT TCCAGGCTTT TGATAAATTC ATCTGGAGAC ATTTGAGCCA 600 

ATTCTTCCAT ATCTGGCAAA TCCGATAAGT CTTCAAAATC TAGATTTTGG TCAATCTTTG 660 

ACTTGGTCAC AAAGACATCT ACCTTATCAG GTTTTGGAGT CACACGGAAG CTCAACATGC 720 
CTCTATCCAG AAAGCTATCA GGCATCTCTA GCTCATCCAA GATAGCATAA AAGAACTCTT 
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CTCTTTTTTC TTGAGGAACG AGAAAGTCAG CAATCTCCAT TCCACGATCC ATCAAATCCT 840 
CTAAAGATAT CGTGATTTTT AAACTTCTAT CACTAATTTG TTTCATTTTC ATTGCTAGTA 900 
ACCTCATACT TTCAGTTCTA TCTATTATAC TAGATTTTTA CGATTTTATC AAAAGAAGGC 960 

TCCTCTATAC GGATAGATTT TCCCTAGCGT CTTTCTATAG CAGACTCCAA AAGAAAATTT 102 0 

CTGCAGACAG ATAGAAAAAG CCTTCAAAAT CGCCTAAGAC CCCACTTTGA AGACCTTATA 1080 

CATCAGAATA CTTATAATTT AAAGGTTGCT ACACCGAGGA TAGAACGATT TAAGTTTCTG 114 0 

AGAATTTGAA GACTTTGCTC AAATTTCTTA TAACGAGTCA CTCCGTACTC TTCAACAAGA 1200 

AGGACTGTAT CTCTTTCCAA AAGAGATGAT ACATCCTGTA AATCTACAAA ATCCATTCCT 12 60 

TTTAAAGCTT CTTGACTCTG TTTCAATTTA TCTAACATAG CTTTATTTGA GCTAACGATC 132 0 

GTCAATTCCT GTCCAGTATT TTTGTATGAC AAAACATCTG CTAGGTTACC AATTGTTGTA 13 80 

ATCTCTGTTA CAAAATCAA7 TTGATACTGA GAAAAATCAC CTACTCTATT GATTGTTGGA 1440 

TTAAAGAGAT AAACTAACAC ATTTCCCATC ACAACCAAAA TCAC AC AAAC CACTCCAATA 1500 

ACAACTAAAC GAAGAATCAG ATTTTTCACA TTTAAGCCAA GCGCTGTTTC ACCATTTGCG 1560 

TTCAATTCTT TAGAGTTGAT GCTTTCCAGT TTTTCAATTT TCACATTTGC ATAGCCATGT 1620 

TTAAATTTCT CAATCAACCC ATCAATTTTT TTCTCTAACA AGTTATTGGC ATCTTTACTT 1680 

GATGTCAAAA TTTTCACACC AACCCCTGCA TCCTCAATCA TATAGTACAC GCTCAATTTT 174 0 

TTCCACCAAT AGTCATTCGT TGAATTTTTC AAGGTTGTTT CTGTCGTGTC TAATTCACTG 1800 

GCAATTTTTT TCAACTCACT GGGTTCTACA TCATTCAAAA GATAAGCTCC ATTCAAATTA 18 60 

CCATCAATCA ATTTCCCATA AAAATCACTA TAACCACCAA TTTGATCATT CAAAATCGTT 1920 

TTGTCCGACT CTTTTGGAGG AGTGATTTTA TAGATAAGAT AAGTTGAATA ACTTGTTGTA 19 80 

TCTTTGACAG TGTTTTTATT CCTAACTGCT TTAATTGTAA ATGGTACAGC AATGAGAGCA 204 0 

AATAAAGCGA TCAGACCTAA AATATTTGCT TTTCGCTTTT TATAAAGATT TGCAAACAAA 2100 

TCAGCTACTG AATAATGTTC AAAC ATG ATT TTTTTCTCCT TTGTTTAGTA GATACTAGTT 2160 

TTCCTTTGTA AGCATTTTTG CTACAAATAT AATCACAAGA ACAATTCCCC AGAATTGCAT 222 0 

TGTAAATAAA TTGAAGAAAC TTTCTGAAAA GCTGCTTCTT GGCATAAAGA ATAGATTATT 22 80 

CAAGATGAGT AGCGATAAAG CAAATAGGAT TGTCCTTGAG CGATAGGCTA CTTGCAGCAT 2 3 40 

GCCTATAAAT AATACGCCGA GTAAGAAACT AAGCAGAAAG ACTCCAATCA TACCATAGTC 24 00 

GGTATACAAC TCCATGATAT AACTACTTCC GATACCATGC CCTTTCAAGT ATTCCTTG7T 2 4 60 

CAAGACAAGA TAGGATAGAT TGTGGGCATA ACTATTACTA TCAATAGCTA CTTCCACACT 252 D 
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ATTGGTTCTA TCTTCAAACC CTTTTCCTCC GAAAATGGCT CCCAAACTCC CCCTTGCAAA 2580 

ATAATCAAGA ACAGGACCAA AAGTAAAATT ACGGAAATCT CGGTAAGGGA GCCTACTGTT 2640 

AAATAGAAAA CCTCGAGCCA GAACACCAAA ACTAGTCCCT TGTTTATAGA TAAACTCAAG 2700 

TAAGATATCC CAGAAACCTG 7ATGGGAAAC TTGGACATTA TCCCGTACAT AATTGAGTAC 27 60 

TCCCATCGCT AACATGAGAA TAGGAGAACC TACAAAAATC GCTAACTTTT CTTTAAACCC 2820 

AATCCATTTT CCTTTTTCAG TTTGCTCCCG CATAAAGTAA TAAACAAAAG CAAATmAAAT 2880 

ACTTAAAATA AAGGGATTTC GTGTCCCAAT TGCCAAATGA ATAGTATTAG CTGCAATAAA 2940 

GGAGACAAGC ACTGCTGTGG CCTGCAATTT CTTTGGCTTG GTTGCCAGAT ACATACACAT 3000 

TCCATAGACC GTAAAGGTAG ACAAAATGTA GGTAAAATAA GGCAGTTTAC TTTCAAAATT 30 60 

TGCATAGTAG GCATAGTAGG AAGTCTGCAA ACGATACAAG AGCCGTTCAA ATAACCGAAT 3120 

GAAATAGAAA GGATAACTTA GAAGAAAAAC TCCTAGTGAT ACAAACCGTA ACCGCTTGAT 3180 

ATAAACCTCT TTTAGAGAAT TTCCTATATT TGCTACTTTT ATTTT C TTCC TAGCTATGAA 3240 

GTAACGAGCC AGAATGCCTC CTGTGGTCAA GCCCAGAATC GAAATCATGA CAACTATAAA 3 300 

CGCAAAACGA TAGCCTATTG GATGATAGGT ATCCAAAGCA CCATCCCTAA AATAATCAAT 3 3 60 

GGTCGGTCTT GATACCAGAA ATACAAAAAT GGTTAAATAG AAAATAAAAT GGATTAAGTA 3420 

ATACTTGATA TCATTCCAAC AAGCAATTAA GCTACTAACC AACAAGAACA ATAAAGTAGA 3480 

AAGTAAGCTA ACATTATTAT TATTAAACAG ATACACAATT CCACTTACTA GCGTCAAGGC 3540 

ATAACTGACT ATGGTCAAAC TAAATAATAA TCGTTTCCCA TCAATCACTT GGTCACCCCC 3600 

GTTCTAATGT AATTTTTTAG ATTTTTCAAT ATTTTTCAGT AATAAGAATC GATATAAGGA 3660 

AATATTTATG AATAGGGCCA AAGCACTAAT TCTTCTCCCC TTACGGAAAA TTGGATTCCT 3720 

AGAAATAGCA AAGGCATGGC CTTTTAAAAA ACCATGAATC TGAGAATAGG CTTCAAACTG 3780 

TTTATACTGA TCATCTAGCA ACATCTTATC CAGAATAAAG AAGTGGGCAT AGGCCAATCT 3 840 

GAAAAAAGCG ACCTCTTTCA ACTCAGGATA GTTTTTCACA ACTTCATTAT AAAACTTITG 3 900 

GTAGATATCA ATATAGGCTA AATCCTTCTC TGCATAGGGT TTGGTCGTAA TACTATCCCC 3 960 

TCTATGGAAA TAGTAATAAT AGGGTTTAGT ATTAACCACA TACTTCTTGG CCAACTTGAT 4 020 

TAAATCAAAA TGGTAATAGG CATCTTCGTA AATCAACCCC TTAGGAAAGG ATAGGGCAGT 4080 

TGCAATCTGT CTCTTGATTA GCTTATTGCA AATCGTCCCA CGTATTTTTT CACCTATGAG 4140 

GTATTCCTTT AGAAATGTTT GAGAATCACA GACAAAATAG TCATCCTGAT TGGCTGACTG 4200 

TCCGCTTTCA TC ATT AG CAT AG AC AT TC AT G AC AC C AC AG CTCGAAACAT CCGCATCTTC 42 60 

TTGAACTAAT TGCTCATATA AGCTCTGAAT CATTTCTCCA TGGATATAAT CATCTGAGTC 4 320 
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AATAAAAATC AGATAATCCC CGTCAGCCTG CTTCATCCCA TCATTTCGTG CTTGCGACAA 4 380 

TCCTTCGTTC TTTTTATGAA GCACTGACAC CCTGTCATCT TGTTCAGCCA TTGAATCACA 444 0 

CAAGCGACCA CTTTCATCTG TTCCACCATC ATCAACAAGA ATAATTTCCA GATTTTGATA 4 500 

CCTCTGCTTC TGAATGGAAG CTATCGATTT TTCTAGGTAC TGCGCCACAT TATAGACTGG 4560 

CACAATCACA CTAATTAATG CACTTTCCAT GCTACTCCTC TAATACTTTT TCTACTTGTT 4620 

CGATTTGTTT TGTAATTGTA AATTGTTGAA TGAATTGGCT AGCCTCATCG ACATCAAAGT 4 680 

TTGAGGCAGA AGTCATGTAA TTAGTAATCG CCTGAGCTGC CTCTTGATTG CTCTCAATGA 4740 

TTTGTCCAAA TCGTCCTTCT TGGGATAATT CCTCAGCCCC TCCAACGTCC GTAGAGATAA 4 800 

AAGGGAGTCC CAGACTCAAC CCCTCCACAT ACACTCCAGG AAAACCTTCT TGTTTAGACA 4860 

TACACAAAAG AACTTTCGTC TGAGATAGAT ACTCATAAGC ATTTTTTTGA TAACCAACGA 4920 

AATGTACATA GTCCTCAATC CCATACTCTT TGACTCGTTT TTTCAGTTCC TCTTCCATAT 4980 

CACCAGCCCC GATAAAATAG AGATGATAGT TTTTTCCCTC TTGGTGTAAT AATCGTATCA 5040 

CTTCCACTAC ACGGTCAGAA CCCTTATTTT CCTCAATCCG TCCGATAGTA CAGATACTTT 5100 

GAGGAGCAAT CTCGATATCG ATCTTCTCTT GAGATTTTTC TAGAATACTC TGAAAATCAT 5L60 

ATCCATTGTA GATTGTCTGT AATTTAGAAG TATAATCTGC ATAAACTTCC TTGATAGAAT 5220 

TGCTGGTCTT TTTTGAAATC CCTACAATTG TATTCGCAGC ATCCAACTGC CTTCTATGTG 5280 

ATTCTCTTTT AGAGCTATCC TTAAGAAGTT CTTCAATACT TCCATGAATC CAAGATATCT 5340 

TCTTGACTTC TCTTCTTTTA GAGAACAACA GTGGTGGATT CATAATGGTA AAAGAAACTT 5400 

CAACATCATA ATCATCTTTT ACAAGCAAAC GACGAGTCAC TCTTGGAAAA TAAATTCTCA 54 60 

TTCTCCACAA AAAAGCTCGT AACCATCTGG TTTGGCGATA ATCTTGAAGG GATTTTAAAA 552 0 

TGCGTACATG CTTTGGAACA GATTCATATC CCTTGTCAAA GTGCTCCATT TCAAGAATAT 5580 

CAATATCATA CTTTTCTCGA TCCAGATTTG AAACAATCGT TGATAGAATC TTCTCTGCAC 5640 

CACCTCCAAG AGAAAAAGAC CACATAAAAA ATAAGATTTT TTTCTTACCC ACCATATTCT . 5700 

CCCTTGTATT CTGTATAAGA CTTATCCATA TCAGCGATGA CAGCATCATG ATGCCGTACC 5760 

TGCTTGTCTG CTGGTGGACG CGTCATATAA TCCCCAAAAG CAGTTCTGAG ATAGACATCA 5820 

TAGCCGATTG GAATAGGCAT CTCTGTTCCT TCAAATGGCA AGAAAAGATT GTCTTCAAAA 5880 

GATGTGATTG GGTACTTGTT TCTCATGTAG CCACCACCTC AGCATAATTC TGTAATGCCA 5940 

TCACAATCAG CCAAATCATA CTTAGTCATT TCTTTCTCAG CTTTTTTCCA GATGCGATAA 6000 

CGGAGAGATT TTGGAGTCAA ACCCAGTAAA ATGCGACTTC CCCATTTCAT GAGATCACCA 6060 
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TGCTTTTCTG 


GAATAGTTTG 


CGCACAAAAG 


AGTGAATAAA 


TCAAGGCCCA 


ACGAACCTGT 


6120 


TTTTTCCGCT 


CAGCTGGATT 


TTTCGGATAA 


TAATCCAAAG 


GCAAAACATC 


CAAGGCCAGA 


6180 


CCATGTGGCA 


AATCCAAATC 


CTGCTGATAA 


GGCTTGATAC 


AGGTGGTTTT 


CTTGTCACGA 


6240 


ATGGTAATAA 


AAAGATTACG 


ATCAACAAAA 


TCCTTGTGAC 


TCTTTGACAA 


GAAATAACGT 


6300 


TCATCTGCAT 


AACGAGGCCA 


TAATTCTGCT 


AATTTCTCAT 


AATCTTTACG 


AGGCATAAAA 


6360 


AAGTCTAGGT 


CGTCGTCCCA 


AGGAATAAAT 


CCCTTGTTTC 


GAAGGGCACC 


AATAGCGCCT 


6420 


CCGCCACAGA 


GATAACAGAG 


CAAATCATGT 


TCTTTACAAA 


AGGCCACAAA 


ATATTCAGCC 


6480 


ATCTCCAGAC 


TACGAGCCTG 


AATTGCTTTT 


AAATCAGTCA 


TATTGTTCAT 


TATTCTTTCT 


6540 


ATCGTATCGT 


TTCATTATAC 


CACAAACAAG 


GGGTGAAAAT 


CTATTGCAGA 


CTGTAAAAAA 


6600 


TCAAAGCCTG 


ACTGCTATCC 


AAATAGCTAT 


CAAACTTTGA 


TTTTTCTGTC 


TTATACTCTT 


6660 


CGAAAATCTC 


TTCAAACCAC 


GTCAGCTTCA 


CCTTGCCGTA 


GCTATAGGTA 


ACTGACTTCG 


6720 


TCAGTCTTAT 


CTACAACCTC 


AAAACTGTGT 


TTTTAGCAGC 


CTGCCGCTAG 


CTTCCTAGTT 


6780 


TGCACTTTGA 


TTTTCATTGA 


GTATTATCTT 


ATCTTAAGCC 


CATTTGAGCG 


AGCTTGGTTT 


6840 


GATATTTGTT 


TTGATCAACC 


AGCAGGCCCA 


AGCCCCCATA 


AACATCATAG 


GCATCTACCC 


6900 


AGTCACCCAG 


TTCTGGAATC 


GTCAATTTTT 


CAATACCATT 


TTTTGCTCCA 


TCCAAAACAG 


6960 


ATAAACCGTT 


TGTTAGGAGG 


AAAGTATAGG 


GTACGTTGGT 


TGAGGTCATA 


GCAAAAACCT 


7020 


TTCCAAGAGC 


TTCAGAACCA 


GTGAAAAGTT 


TACTGGCATC 


TTTAATTTGC 


TCTAAAATTG 


7080 


CTGTTAAAAC 


TTGTTGCTGT 


CTTTTTGTAC 


GGCCGTAATC 


TGCCTCATCA 


TCATCACGGA 


7140 


AACGAGCATA 


ATTGAGCAGG 


GTCGAGCCAT 


TCATCTGCTG 


TTTTCCGACT 


TTAATGGTTT 


7200 


GGGTTGGAGA 


CTCAGTCTCG 


GTAGCGTATA 


AATCATCTCC 


GACTGTAGCT 


TCTGTTAGGG 


7260 


GACGCCCATT 


CAATGTTGAA 


AATTGAGCAT 


CAATCGTCAC 


CCCATCAGGG 


AAAAGCGTGT 


7320 


CAATCGCTGT 


GGCAAAGGCC 


TGGAAATCAA 


CCAAGGCGTA 


GTACTTAATG 


TCCAAGTCAA 


7380 


AATTATCTTT 


CAAGACTTGG 


CGAACCATTT 


CTGCCCCTTT 


TTGCCCCTCT 


TGTTCTCCTA 


7440 


ACTCGTAGGC 


TACGTTTAAC 


TTGTTATCTG 


TCTGTTTTCT 


ACCATTAATC 


ACTTGACTAT 


7500 


AACCATCTAT 


ATAGACCAAA 


TTATCACGCA 


TGAAACTGAC 


TAGCTTCATT 


TTCTTATCTG 


7560 


AGCCCCCGAC 


ATTTAATACC 


ATAATAGAGT 


CAGTTCGTGT 


CTCAACACTG 


TTCTGGCCGA 


7620 


TTCGACCATC 


AGTACCCATG 


ATTAAAATAT 


TAACTCCATC 


TCTACTGTCC 


TGACCATTAA 


7680 


AGACTTCTAC 


TTGAGCTGCC 


CGGGCATCAG 


CAGTTTTCTT 


TGCGCTAGCA 


TCTTGGTAAC 


7740 


CACGCAAAAA 


CATGAATACC 


ATGGCCAAAG 


CCACACAGAC 


CAAAAGTGAA 


AAAATCACCA 


7800 


TAAAAATTCG 


TTTAAGACGG 


AGCTTCCGTC 


TTTTCTTTTT 


TGGAGGGAAA 


GAGAGTGCTT 


7860 
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GTGATT/TOOA 


TT OTO AGO O A 






TAAGTCAACC 


TGCTCTTCTC 


7920 
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flATAfymAAT 


A TTTTTT AGC 


AAATAATCAT 
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AACGCAACTG 


CTCATGATGA 
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TtTCttTACT 
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CCTTTCCATG 
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GTCTGATATT 


GGATAAATAG 


GATAOOCACC 


LAunnl 1 I In 


TAfTWIATTf* 
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r AATC GCTTC 


8160 


TAATTCTTTT 


TGGGCAAAGT 


GGAOOAAGTC 


fTT ATffWT* A 
LI In! Luv 1 A 


TAATCCACAT 


C C AT AATG AA 


8220 


AAAGTATTCA 


CCCAGTGCTG 


T CTTO AGT GO 


AI.AAL » 1 1 Ln 


» T ,T> TTTr;Tr t a 

nl > 1 1 lul 


AGTCAATTCC 


8280 


TCGCCAAGCA 


AAGGTCGACA 


GOOCOTTA vA 






CAGGTAATGT 


8340 


CAAGGCCAAA 


CTCATCTTTT- 


CAGTTTOTOO 




a t a cr Ar:n c t 


TTTCAGCTCC 


8400 


TAGAACCCAG 


AAACGTGTGA 


AAi iuoLl lO 






CAATCAGTTC 


84 60 


CAATCCATAT 


TCTTCAGCAG 


AACTTCT AGO 


FOC AAC I uU L 


r C A A Aftf^ CT 


GGTCTGGATG 


8520 


TTCGGAAATA 


AAACGGGCCG 


*•» R T* A A /"OTV*T 

C AT AAOC I O T 
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CAGCCTCTGG 


8S80 


ATATTGTTCA 


TCGATGAATT 


TCTTTCCTtG 


AGCCAAOOCC 




AAJLAAATCTT 


8640 


TTCAATCTTA 


GTATGGCCTG 


GAACCACCAT 


C AAL IOC 1 v»A 


A AT AGHPT 




8700 


TGCTACTGCT 


TGGATGTGAG 


CCTGATGAAA 


AAOA 1 nU i LL 


A AnnTTTC AT 


GAACACTACC 


8760 


CTCAATAGAA 


TTTTCAACTG 


m >^ m ft ^ ft A ft 

GC AC C AC AGA 


AT AO ITLAL 1 




rATAAGCCTT 


8820 


GATCACATCT 


GtAATGTtGG 


/-^ ft ft ft ft: ^MAM^ 

CAAAAGCCTG 


L AA l 1 CC rCA 


Tf! Ann AAAAG 


CTGTCTGCAC 


8880 


AACGTGGTGT 


GAAAATGATC 


Co L I GGGAcC 


I AO A 1 MAOO A 


*(T»m*p'pf*ATrT 
t\ » 1 1 1 * 1 


TAGTtCCTCT 


8940 


ATAATTTCCT 


CTGGGCTTAG 


/ — ww^^tf*/ 1 * ft ft 

CTTGGTCACA 


TOCAAAAOOO 




TTCCTCATAC 


9000 


CAAGCCTGTC 


TTTCTTGGAA 


AA * AOCTAC I 


AO 1 IL 1 


TflfT A "*T ATT 


TAGAAAAAGC 


9060 


GGTCGCTGAT 


TGTCCTTATC 


AOCTOCOA 1 A 


rr?rnnT AG a 


GGGTTTCAAA 


ATCTGCTCTC 


9120 


AGGTAGATGT 


TATCTGTATT 


ACTCTTGAGT 


AAGTCACGAT 


TTCTCTGAGA 


AATAACCACT 


9180 


CCTCCTCCAG 


TTGACACGAC 


TTGGTCTGTT 


TGTAGTAAAT 


CAGCTACGAC 


TTCTGATTCT 


9240 


ACCTGACGAA 


AGGCTGTTTC 


TCCCTTTTCA 


GCGAAAAAAT 


TCGCAATGCA 


CATACCTAGG 


9300 


CGATTCTCAA 


TCAGAGCATC 


CATATCAAGG 


TAATTAGGGT 


CCAAGCCTCT 


TGCAATAGTC 


9360 


GATTTTCCAG 


CCCCCATAAA 


CCCTAATAAC 


ACCTTAGCCA 


TGAATCAAGC 


TCTCCAAATC 


9420 


ATCAAAGAAA 


CTAGGATAGC 


TGGTATTGAT 


GGCTTCTGCA 


CGGTCAAGCT 


CCACCTCTCC 


9480 


ATCTGCAACC 


AAGAGGGCTG 


CGATAGCTGT 


CATCATGCCG 


ATACCCTGGT 


CACCAAACGT 


9540 


ATTGACTCTA 


GCACCGTGAA 


GAGCTGATTT 


TCCTTTGATA 


ATCATCCCAT 


CTGCCGTAGG 


9600 
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ACTAATATCT 


GCTCCCATAC 


TATTTAAGGC 


GTCTGCCACA 


ACCTGAATAC 


GGTCTGTTTC 


9660 


CTTCACCTTC 


AGCTCCTCAG 


CATCCTTGAT 


AACTGTTACA 


CCTTGGGCTT 


GGGTCGCAAG 


9720 


CAGGGCAATA 


ATGGGCAATT 


CATCAATCAA 


TCGTGGAATC 


AAAGCGCCAC 


CAATCTCTGT 


9780 


TCCTTTCAAG 


TCAGAAGACT 


CAACAATCAA 


GGTAGCAGAT 


TTAGCGACTG 


GATCGATTTC 


9840 


AGTTATTTCC 


AATTTTCCAC 


CCATGGCACG 


AATGACATCA 


ATAATACCGG 


TGCGAGTTTC 


9900 


GTTGATCCCC 


ACATTCTGCA 


GCACTAGACG 


AGAATTTGGA 


GCAATCAAAC 


CTGCGACTAA 


9960 


CCAAAAGGCT 


GCACTGGAAA 


TATCTCCTGG 


TACGACCACC 


TTCTGTCCTG 


TCAATTTTTG 


10020 


TGGCCCCTGG 


ACTGTGATTT 


TCTTACCATC 


CACACTTAAA 


TGACCACCAA 


ATTGTTTCAA 


10080 


CATATCTTCA 


GTATGATTAC 


GGGTGTACTC 


TTTTTCGATA 


ATAACTGACT 


CCCCCTTAGC 


10140 


TTGTAAGGCT 


GCAAACATCA 


AGGCTGACTT 


GACTTGGGCA 


GAGGCAATTG 


GCAACTCATA 


10200 


ATGAATAGGT 


CTTAGGTTTT 


TCGTCCCTTT 


TAAGCGAAGG 


GGAGGCAAGT 


CTCGTTCAGT 


10260 


TTGCCCTGAA 


ATGCTGACGC 


CCATTTTTTT 


CAGTGCAACG 


GTCACACGCT 


CCATAGGACG 


10320 


TTTGGAAAGA 


CTATCATCTC 


CAAACATCTC 


TACTTCGAAA 


TCTGCACCAG 


CAAGGACACC 


10380 


TGAAATCAGG 


CGAATCGAGC 


TGCCAGAATT 


TCCCATATTA 


AGGGCATTTT 


GTGGCGCTTT 


10440 


TAAGCCAGCC 


ATGCCTACAC 


CTTGAATGGT 


AATAACCCCA 


TCTTTATCCT 


CAATTTCAAC 


10500 


ACCAAGGTCA 


CGAAAAACCT 


GCATGGTCGA 


AAGAACGTCT 


TCACCTCGCA 


GAATATCATA 


10560 


AACCTTGGTC 


TCACCCTCAG 


CCAAACTTCC 


AAAGATAATG 


GAACGGTGGC 


TGATAGACTT 


10620 


GTCACCTGGG 


ACGCGGATAC 


TACCATGTAA 


ATGGCGAATG 


TTTCTTTTTA 


GTTTCATACT 


10680 


GGACCTCATA 


CTTGCAATAC 


TTTTACCTAT 


TTTATCATAA 


AAACCCAGAA 


ATTCCTTAAA 


10740 


AATTCCTGAC 


TTTAGGATCG 


TTCTTTTCTT 


ATTTCAGCAA 


TTCTGAAACT 


GGTTCAAAAA 


10800 


CAATTTTTTC 


AATATCAGAA 


AGGTAAATGG 


CCAATTGTTG 


TTGCTTGGTA 


AAGAATTCTG 


10860 


ACAAGAGGCT 


ATTTCCTTGA 


ATCTGTTTAC 


CAAAGCCTTC 


CATCT7AGCT 


TGGAAGGACG 


10920 


CATCTGGCAT 


TTGACCTGTC 


TGTGCTAGTT 


TTTGAATTTC 


CTCTTGAAAG 


CCAACATAAT 


10980 


CTGTAAAGAT 


TTTGCTTGCC 


TCAGCATCTG 


CTGCAATCGC 


ATCTTTAGCT 


GCTTTAACAG 


11040 


CCTTGTATTC 


TGGTAATCCG 


CGTAGACCGC 


GACTGAGTTC 


GTTTGCACTA 


TCGTAAATAT 


11100 


TTGACATGTT 


CTTCTCCTTA 


TTTGATGACG 


ACTGTATACT 


CAGTATTTTC 


TGTTATGAGA 


11160 


TGCTCAGCTC 


TTTCCAAGTC 


TTGAGCATTT 


TTAAATGAAA 


TTTGTAGGAT 


TCCGTGAATA 


11220 


TCCTCACGAT 


TTTCCTCGTT 


GATGTGGATA 


TTAACCAAGG 


AAGTTCCACG 


TAGCAGTTCC 


11280 


AAAATCCGCA 


GGATGACATC 


TTCTTCATCA 


GGAACGTCAA 


CATAGAGGTC 


GTAAGAGCTA 


11340 


TCCACACCAC 


CACGCTTATG 


GATTTCCATG 


GTCTGGCGTT 


GTTCACGCGC 


TTGGTTAAAA 


11400 
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AACTTCCAAA TTTCCTCTTC ATCTCCCTTA CTAATGGCCT GACCAATCCC TTCCAAACGT 11460 

TCCTTCAAAT CCTCAATTCT ATCCAGAATG ATCTCGCTAT TGGACAAGAG AATGGAGGTC 11520 

CACATTCCTG GCTCGCTTTC CGCAATTCGC GTCATATCTC GAAAACCACC TGCCGCAAAG U580 

CGCCTTGCCA TCTCATCCTC TTGAGCATAG ACCGCAGTCT GCTCCATGAG ACTAGAAGCC 11640 

AAAATATGAG GAAAATGGCT AATCTGAGAA CTCACACGAT CATGCTCCTT GGCATCAATC 11700 

TCGATAAAAC GAGCATGAAG ACCTGAAAGC AGATCCTTCA TTTCCTTAAG CGTGTCCTGA 11760 

CTTGTCAGGC TTGAAGGTGT AAAGATATAA TAGCCATTTT CAAAAAGATT GACATCTGCC 11820 

GAAGCAGCCC CTGTCTTGTG ACTACCAGCC ATGGGATGGG CCCCGACAAA GCGAACAGAC 11880 

TTGCCAGCCA AATACTGCTC CGCCGCATCC ACAATGGTTG ACTTGGTCCA ACCAGCATCT 11940 

GAAATAATAA CGCCTTCTCG CAAATCCAAA TTGGCCAACT CCTTAATCAA AGCAATAGTT 12000 

TGTTTGATTG GCAAGCTGAG GATAATGACA TCTGCCAAAG GAGCAAAACT AGCAAAATCA 12060 

TCCGTTGCAC GGTCAATCAT ACCTTCTTTC AAGGCGATAT CTCTCGAAGC TTGACTACGA 12120 

TTATAACCTA AAATTTCATA ATCTGGATGA TCGCCTTTCA TACCAAGTGC CATAGAGGCT 12180 

CCAATCAACC CAAGACCTGC GAT AT AG ATT GTTTTTGCCA TAGGAACTCC TTAATAGTTC 12 240 

TTTGTATAGT CTCGGTGTTT GGCTACCGCT TCTTTTAGTT CCTCAAGATT ATCTGATGAG 12300 

AATTTTTCGA GGATTTCTTG CGCCAGAACC GTTGCTACAA CTGCTTCCAT GACCATTCCT 12360 

GCAGCTGGAA GAGCAGTCGG ATCACTTCTC TCCACGGTTG CCTTGTAAGG TTCCTGGGTT 12420 

TCGATATCCA CACTCATAAG AGGTTTATAA AGAGTAGGAA TCGCTTTCAT GACCCCACGA 12480 

ACAACGATGG GTTCCCCATT AGTCATACCA CCTTCAAAAC CACCTAGATT ATTGGTACGG 12540 

CGAGTATAAC CCTCTTCTTT AGACCAGAGA ATTTCATCCA TAACTTCGCT GCCTTTACGA 12 600 

TAACCAGCCT CAAAGCCAAG ACCAAATTCC ACCCCTTTAA AGGCATTGAT AGAGACAACA 12660 

GCTTGAGCCA ATCTTGCATC CAATTTTCTA TCCCATTGGA CATAGGAACC AAGACCAACT 127 20 

CGAACGCCTC CGACGACTGT CTCCACAACC CCACCGATGG TATCACCATC ACGTTTGATT 127 80 

TGGTCAATAT AGTCCTTGAT TTCCTGTTCT CGTTCTTGGT TGACAATAGA AACTTCAGAC 12840 

TGGGCAGCTC TTTCCTTAAT TTCAGCGACT GTCAGATTTT CACGAACATC GATTTCCTTG 12900 

CCACCAAAGA CCACGACATG GTTGGCAATC TCCATATCCA GCTCAGCCAA GAGGCGTTTG 12960 

GCTACTGCAC CAACTGCCAC CCGCATGGTG GTTTCACGAG CTGATGAACG CTCCAAAGAA 13020 

TTTCGCAAAT CATCAAAACG GTACTTAATC CCCCCAACCA AATCGGCATG ACCTGGGCGA 13080 

GGATGAGTAA TTTTCCGCTT GCTTTTAAGG CCGTCTTCAA TGTCCTCCGC AGACATGATG 13140 
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TCCAGCCATT TCTGGTGGTC CTTATTGATG ACATCCATAG TAATAGGCGC CCCTGTCGTC 13 200 

TTCCCCTCCC GAACGCCCGA AGTAAAGACA ACCTGGTCAT TCTCAATCTT CATACGACCA 13 260 

CCACGACCGT AGCCACCCTG ACGGCGTCTA AGGTCCTCAT TGATATCCTC AGCTGTCAAT 133 20 

GGAAGTCCAG CTGGAATTCC CTCAATAATA GCTGTTAGAC GGGGGCCGTG TGATTCTCCT 13 380 

GCAGTTAAAT ATCTCATACA CTCTCCTTAT TTTACCAAGT AGTCTTTCAT CTCTTCCAGA 13440 

GAAACTGGCT GAATGGTCGC TGAACCAAGC TCTGGCACCA AGACCAATTT CAAGGTGTTA 13 500 

CCACGCCCTT TCTTGTCATG AGTAAGAGCC TGATAAAGCT TGCCAACTTC CCAATTTTCA 13 560 

TAGTCAACAG GCAAACCGAA TTTCTGACAC ATCTCTGTGA TAGATTGGGT AATGCCAGCT 13 620 

GGCATGAGGC CTTTTTCCTC AGCAACCTTG GAAATCTCTA CCATTCCCA7 CCCAACAGCC 13 680 

TCTCCATGCA TGACCTTGCC ATAACCGGCA GTCCCTTCCA TGGCATGGCC AATAGTGTGG 13740 

CCAAAATTGA GGTAAAGACG AATACCATTG TCCAACTCAT CTTCAACCAC CATCTTGCGC 13800 

TTCACCTGAC AAGAATGTTC AATCAAGGTC TCTGCATGTT CCAAAATACT CTCAACAGAA 13860 

CCATTCAGTC CCGTCAAGAG AGCCCACAGT TCTGGATCCT CAATCAAGCC ATACTTGATA 13 920 

ACTTCACCCA TCCCTTCAAT CAACTCTCTT TTTCCGAGCG TTTCAAGAAC AAGTGGATCA 13980 

ATCAGAACCC CATCTGGTTG GGCAAAGGTC CCCACCATAT TTTTACCAAA TGGTGTATTA 14040 

ACGCCTGTCT TTCCACCGAT AGAAGAATCA ACCTGAGCTG TCAAACTAGT CGGAATCTGA 14100 

ACAAAGTGAA TACCCCGCAT ATAGGTAGAG GCTACAAATC CACCCAGCTC CCCAACAACG 14160 

CCACCACCAA GAGCAACGAT TCCATCGCTA CGAGTCAGAC CTTGCTTGAC TAGAAATTCA 14220 

TAGACTTTCT GAACAGTAGT TAAATTCTTT CTTTCTTCAC CTTCTAAGAA ATCAAAAACA 14 280 

GCTACCTGAA AACCAGCATC TTCTAGGCTG AGCTTGACCT TCTCTGCATA GAGAGAGGCT 14 340 

ACATGGTTAT CTGTCACAAT GACTACCTTT TGCGGTTGCC AGAGTTCTCG CAACCACTCA 14400 

CCAGCCTGGG CCATACAACC TTTTTCAATC TGAATATCAT AAGGATGGTG AGGAATATCG 144 60 

ATTCTCATTT TCATAGGAGA GTCTCCCTTT CTTTATTGGT ATTTTTCTGT TAAAGACTGC 14520 

CAAATCTCTT CTGTCGGCAT TTCCTTGCCT GTCCACAGTT GAAAAGC7TC TGCAGCTTGA 14 580 

TAGAGTAACA TTCCCAGACC ATTGACTGCT CCATTCCCCT GACTTCTAGC CCATTTCAAA 14 640 

AACGGTGTTT CAAAGGGTTC GTATATGATA TCTGCAACTA AAAGAGTTTC TGGTAAGACT 14700 

ATGTTTTCAG GAACAGGAGA GGATTGCCCA TCCATGCCCA CACTGGTCGC ATTAACTACC 14 760 

AAATCCGACT CGGCAATCCT TGCTTGCAGT TCAGAAACAT ATTCTAAAGC ACACAAATCC 14820 

ACTTTAAAAC CTGTCTGCTC CTGTAACTTG TCTAGGTAAG GTCTTGTTTT TTCCATAGAA 14880 

ACGGAACGAA CAAAGACCGA AATCTGACTG ACGCCATCCA AAATAGCCTG TGCCAAGATT 14940 
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GATTTAGCCC 


CACCACCTGC 


ACCCAGCAGG 


GTCATCTTTT 


TsrrTfiiA at* 


TfVT AAAAr.AA 


15000 


GGCAAGCACT 


TAAAAAATCC 


CTTGCC A i t- I 


GTATT AT A 1 v. 


r*A ATTiiATT 
Unn 1 1 AAA l l 


VJ V* wn * i W »\*n 


15060 


TTGACAACCG 


TATTAACCGC 


ACCAATCAAG 


CGCCCTTCAT 




ATfCAAATAA 


15 120 


GGAATCACCT 


GCTCCTTATA 


GGGCATGGAC 


AGATTGATCC 


CAAACATCTG 


vjT AOL. L» A\_U A 




ATATTGGCCA 


CTGTTTCTAC 


CAAGTCACTC 


GCTTCAATCT 


C Lv_ AAt»(- L. AL. 


& t & a rr a 

A 1 AA«UAL\.w 


152 40 


TTGGTAGCTG 


TCGCCTCAAA 


GGCTCTATTG 


TGGATGAAGG 


GAGAAATAGA 


» rp/"»/"><T , 'P ft AT* A 

ATGQ.TTAA 1 A 


1 ^ ~k n n 


GGATTGGCAA 


CAACTGCAGC 


TAAACGTGTA 


TAGCCATCAA 


GC rTUATuUA 


AAAlv- I WLL I 


i JJOv 


GATTTTTTTC 


ATGCTAGC7A 


GACAAATCTG 


CCCAGGGGCA 


CI AAl_L I LA I 




154 20 


AAAAGACCAA 


CTCGAACCAG 


TCACATCCGC 


ft f^T* ft IT> ft Z^/"** ft 

AGT G AT AC\j A 


GAvjAd. 1 i Ov. 


LLAV.C 1 InLV. 


154 80 


CATAGAAATG 


GTCACATATT 


CCTGTTCAGG 


ATTGAGGGTT 


i I AAAGC CT U 


^Tr**r ATA 




CATCAAGTCT 


AAGACATCCT 


GCTCCGTGTG 


AGCCATCACC 


GCAACt. 1 I AA 




1 5600 


ATTTAGGATC 


GTCAACTCTG 


ACAAGATTTC 


CATCATGTTC 


TCAGGTGTTT 


CTTGGAAATT 


15660 


ATGGTAACTC 


AAAACAAGAT 


TTCGGAAGTC 


CAGCATTTCC 


^/■^ ft ft ft ft ft ft W 

TCAAAAACAT 




15720 


ATAGTACTCA 


AAATCAATAT 


AGTCTGGTTG 


ATAGAGTTCC 


GCAAt-TTCL. L 


1 on i I nun I u 


15780 


GATATACTCT 


TCTGGAGAAA 


GGTCGATTTC 


TCCACCTTCG 


ft ft / 'Wifp 

GAGt-GAUTTu 




15840 


AACCAACTCA 


CGGCCTGCGA 


ATTTTTCAAA 


AATGGCTGGA 


GCTACCTGCA 


AAATv_uL. I l L 


15900 


TTTAGGCAGA 


TAGTCGGCAC 


GCCATTCAAT 


GATGTCGGCA 


TCCAGGTAH- 


L i V»ul_n & \- 


15960 


CAGAGCCTGA 


GCCTCCTCTA 


AACTTCTTGG 


CATTACTGAA 


AC G ATT AATT 


TCATTTACTA 


16020 


ACCTTCATAC 


TAATCACCTT 


GAGGTAATTA 


CTACTTTCAT 


CTTTTTTATT 


ATAfTTrAAAA 
r\ l nw>-iV.nnnn 


16080 


TCTGCTGGAA 


GACCATATTT 


GTTTAAAATC 


TGGTAACT * C 


T*H.i Gt- AAA 


ni_\_ a i i n k \»n 


16140 


ATTTGTTCTG 


TAAATTTCTG 


ACGGGAAACA 


TTGGC AGC AT 


TUGTAV, rwrtjV. 


AATnATAATC 
/v\ 1 un 1 nn i \». 


1 6200 


CCTCCCCGAT 


TTAAAATCTC 


AACACTCTGG 


GAAATCAACT 


TGTGATAATC 


CTTCGCCACA 


16260 


GAGAAAGTTT 


GTTTTTTATT 


CCGAGCAAAG 


CTACCCGCAT 


CTAGGACAAT 


CACATCGTAG 


16320 


GTCAACTCTT 


TGCGTTTGGC 


ATATTTGAAA 


TACTCAAAGA 


CATCCATGAC 


TATAAAACGA 


16380 


TGCTCGTCTG 


TGCTGAGCCC 


ATTTGCCTGA 


AAATGCGCTT 


GAGACAATTC 


TCGTGAACGT 


16440 


TTGGCTAGAT 


CAACAGAAGT 


TGTATGGCTA 


GCTCCTCCCA 


TGCCCGCAGC 


TACTGAAAAA 


16500 


GCCGCTGTGT 


AGGAAAACAT 


ATTGAGTAAG 


GATTTACCCA 


TAGCCAAGCC 


CTCAACTAAA 


16560 


CTACCGCGAA 


CCTCATGCTG 


GTCTAGGAAA 


ATTCCTGTCA 


TCAAGCCATC 


ATTCATAAAG 


16620 


ACTTGATACA 


GGACACCATT 


TTCTAAAACA 


TTGAAAAAGT 


CAGGTGCTTC 


TTGACCATAA 


16680 
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ACATGGCCAG 


ATTCATAGTC 


CAAACCC7TA 


AAGCGGATTT 


TCTCATAAGC 


TCCTAAAACC 


16740 


TCACGGAAAA 


CCTCTCTAAA 


GGCTTCTGAT 


ATAGTCTGAC 


CAATCTGATA 


AACATAAGAC 


16800 


TTATACCAAG 


AAAAGACGGC 


GTAGTCGCCA 


TAAAGGTCCA 


CTGTCAGACC 


CCCAAAGCCA 


16860 


TCTCCCTCTT 


GATTAAAGAG 


ACGAAAGGCA 


GTTGTCAAAT 


CATCTTGATA 


GTAGGCGTTT 


16920 


CTCTTTTCTT 


TGGCTTTTCT 


AAACAACGTT 


TCAAAGAAAG 


CTTGATTCAA 


GGCCACCTTG 


16980 


TCTTPGCTGA 


TAAACCAGCC 


CAAGCCCTTG 


TTTTGCTGAG 


AAAGGTAGGC 


AGTCCCAAGA 


17040 


AAGTTTCCTT 


CCTGACCCTG 


CACCTCTACT 


TCCTGATCCT 


TAAGATTGAC 


ATTCTCAAGA 


17100 


TCACTGGCTT 


CTAGTAAAAC 


TAGCCCCTTA 


GCAAGCTTCT 


TTTCAACCCT 


TTTGCTGACT 


17160 


CTTATTCTAT 


TCATAACTAC 


CATTATATCA 


AACTTTTAGA 


CAATTCTCAA 


AAAAGAAACT 


17220 


ACCCTTGCTT 


TTTTACTCTT 


CTTTTAAAAA 


ATGCTATACT 


AGACTTCCTG 


CAAAACTAGG 


17280 


AAGTAAATGT 


GTAAGAATCA 


CAGTAAAAAA 


TGCTCTTCCG 


TCTTGGAGGA 


GCATTTCTTT 


17340 


TTATCAACGA 


AAATCAAATA 


GCAAACTATG 


AAACTAGCCT 


CACGTTAACT 


CTGAGATTAT 


17400 


AGGTAGAGAG 


GTTGTATCAG 


CAATATGTGT 


CTGTCAAATT 


TAGTGACAAA 


GGTAGTAGAA 


17460 


GAAAGATAAA 


GAAATAAATC 


AGCTTCAGTA 


GGTATCTGGA 


AAATTTGATT 


TTATAGAGAA 


17520 


CCCTTTTGTT 


ACAAACTCAA 


TATACTATCA 


ATAAATAATA 


TTATAGAACC 


AACAATAATT 


17580 


ATAATTTCAC 


CTATCTGCAT 


CATTCTATTT 


CGAACTCTAA 


ATATATCTTC 


TATCAAAAAT 


17640 


ACTTGGAACA 


CACACATTAT 


AGGAATTAAC 


GTTTTTGAAA 


TTCAAAAATA 


TCCAAATAAA 


17700 


TAAACTATAA 


ACAACAAAAA 


TAGAACTATG 


TTATATTTCT 


TATTCAAAAC 


ATTCCTCCCT 


17760 


ATATATTTTT 


GATTACCAA? 


CTTAATCATT 


TACAACTACA 


TTCTAACAAA 


CTATAAAAGC 


17820 


GTTTGTCGAA 


TTGAATTTAT 


CAAGCAAGCG 


ACCAACCAGT 


TCATCTTTTT 


TCTATTTCTG 


17880 


CCAATATGCG 


TGACAGGTAA 


TAATCATAGC 


CAAAAATAGC 


AAGAGCAAGC 


AAGACGATAA 


17940 


GAGCTCCTAC 


TCCCAAGCTG 


ATGGCAAGGA 


TAGGGGAGAG 


AGACTGAACC 


AAGAATATGC 


18000 


TCCCAATTAC 


AAGGGCCATC 


AGGATTGCAC 


TATAAATAAA 


CAATAAAACT 


ATGGCGAC f A 


18060 


TGCCATTTGA 


ACGATTCACC 


AGGTCCGTAA 


TCCTACTCCA 


ATTGGTTGAC 


AGATTTTTAA 


;ei2o 


CGTCCTTAAA 


GTAATGGTGG 


CAAGAAAGGA 


TGACACTCGC 


AATGATCCAG 


ACTACAAGAA 


18180 


GGTAAATCAT 


CGAAATGATG 


GGCAAGCCTA 


GATATACAGA 


AAGACCAAGC 


AAAGTCAGAA 


18240 


CTGGTAAAAA 


GGACTGGACA 


GCATATATAA 


TCCAAAATTT 


CACTTTCACA 


TAACGAGCAA 


19300 


AGTCAAAGGG 


TAAACTCTTA 


AGAAAATCAA 


CATTTTCCCT 


CTCCAAGGAC 


AAGGCAATTG 


18360 


AATGCAGGCT 


GGTGATATTG 


TTATTGACAA 


CTGCTATAAA 


GAGAGCTATA 


AAAAACAAGG 


18420 


GTAACCAGTA 


TGGAGGATGA 


ATGTCTGGAA 


CTATCTGAGA 


ATCTCGGATT 


TTGGAAATCA 


18480 
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GACCGATCAT CATGAGATAA GGAAGGAAAG CACTTGTAAA AAGCACTGTA ATCACGCCAG 
TCCCCTGTCC CAAGAGGGTG AGGTGGTAGC GTAAAACCAT GCGAAAAAAT CCCTTTTTAG 
TGGTTGAAAT TCTCTCCTTG CTGCGACGTT CTTTTTTGAC CTTCTCCTCA CTATTAAGCA 
GGATCACGTC ATAAAAACGA GGAAGGACCT TCTTTTTGGT CAGATAAAGC AGGAAGAGAG 
TTAGTCCTAT CCAAGCGAGC AGACCCACTA AGGCTTCTGT CGAAAAAGGC TCCACTGCTA 
TTTTGTAAAA ^3ATATGAAGA GGATAAAGGA GAAATCGAAT GTCTCTAACT TTGTCAACAA 
TACTTCCAAA AGTCGACTGA AGAAAGAAGA TAAATATTAA AGGTATGAGA ACTCCTATCC 
CAATCATCAC ATTCGAAAAA ATAGACTGAT ACTTTCTGAA GACCCTAGTT TGAGCCAAGA 
AATGCACTGC CACTACCATC ACTAGAGCCA CAGAGACAAA TAATAAGGTC AAGGACAGTA 
GCATCAAAGG CAAACCCAGC CATAGAGAAG GAGCTAGCCT AATCTAGAGG ACCAGAAAAT 
AAGCTAGGAT TGGTACAATT CCAGTTAGAG CTGGCAAAAG G AC AG AC ACT CCTTTAGCAA 
TTATAATCTC TGATTCTTTA AAGGCATAGG GCCTATACGA TACCAAATCC TTACTCTCAT 
AAAAGACATT GTAAAAGGCC GTTAAAGAAG TTGAAAAGGC AATCACTAGT AAAATAGCAA 
TCATCGAGCT AAAATAAATA CGTATTTCCT CAAAAGGAAA ATCAATGCCT AT ATT ACT AA 
AACAGATGAT CATCAAGAGA CTGGAAAAAA TGTAAGAACT TAAGACTCTA GCGGAAACAT 
TTACTTTTTT 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS : 

<AJ LENGTH: 18436 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



18540 

18600 

18660 

18720 

18780 

18840 

13900 

18960 

19020 

19080 

19140 

19200 

19260 

19320 

19380 

19390 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

CCGAGCCTCG TTACAGACTT TATCAAGATT GGACGCAAGA ACAAATTCAA CATATAAAGC 60 

AAAATATGGC ACAATCTCCA TGGCATACTC ATTACCATGT TGAGCCAAAA ACAGGACTTC 120 

TCAACGACCC AAATGCCTTT TCTTACTTTG ATGGCAAGTG GATCCTCTTT TACCAGAATT 180 

TTCCTTTTGG TGCAGCCCAC GGTTTAAAAT CTTGGGCACA GCTAGAAAGT GATGATTTGA 240 

TTCACTTTAA AGAAACTGGA ATCAAAGTTT TACCAGATAC TCCATTAGAT AGCCACGGTG 300 

CCTACTCTGG TTCTGCCATG CAATTTGGCG ATAACTTATT CCTATTTTAT ACAGGAAATG 3 60 

TTCGCGATAA AAACTGGATC CGTCACCCAT ACCAGATCGG TGCTTTGATG GACAAGGAGG 420 
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CTAAGATTAC AAAGATTGAC AAGATCTTGA TTGACCAGCC ACCAGACTCT ACTGACCACT 4 80 

TCCGCGATCC ACAAATTTTT AACTTTCAGG GTCAATATTA TGCCATTCTC GGCGGACAAG 540 

ACTTGGAGAA AAAAGGTTTC GTTCCTCTCT ACAAGGCTGT CAATAACGAC TACACAAACT 600 

GGCAACCAGT TGGCGACCTT GACTTTGCTA ACGACCGTAC TGCCTACATG ATGGAATGTC 660 

CTAATTTGGT CTTTGTAGAG GAACAACCTG TCCTTCTCTA CTGTCCACAA GGATTGGATA 720 

AGAAAGTTCT AGACTACGAT AATATCTTTC CAAATATGTA TAAGATCGGG GCTTCCTTTG 780 

AC CCT AAAAA TGCCAAAATG GTAGATGTGT CTCAACTTCA AAACATGGAT TACGGTTTCG 840 

AAGCCTATGC AACTCAAGCC TTCAACGCTC CTGATGGGCG TCCTCTAGCA GTTAGCTGCC 900 

TTGGTTTGCC AGA7GTTTCT TACCCATCTG ACCGTTTTCA CCACCAAGGA ACCTTCTCTT 960 

TGGTCAACGA ACTCACTATC AAAGACGACA AGCTCTACCA GTATCCAGTC GCTGCTATTA 1020 

AGCACCTTCG TGCTTCTGAA GAAGCCTTCT CAAACCGTTC CCAAACCAAG AACACTTACG 1080 

AACTTGAACT CAACTTGGAA GCTAATAGCC AGAGCGAGAT TGTCTTACTT GCTGATAAAG 1140 

AAGGTAAGGG ACTTTCAATC AACTTTGACC TTGTAAACGG TCAAGTAACA GTGGATCGTA 1200 

GCCAGGCTGG AGAACAGTAT GCCCAAGAAT TTGGGACAAC TCCTTCTTGC CCTATCGAGA 1260 

ATCAGGCTAC TACTGCTACA ATCTTCATCG ATAACTCTGT CTTTGAAATT TTCATCAATA U2 0 

AAGGAGAAAA AGTATTTTCT GGTCGTGTCT TCCCACATGC GGACCAAAAT GGTATCCTGA 1380 

TTAAATCTGG AAACCCAACT GGAACTTACT ATGAATTAGA TTATGGTCGC AAAACTAACT 1440 

GATGTCGCCA AACTTGCAGG CGTCAGTCCT ACTACCGTTT CTCGGGTTAT CAATAAAAAA 1500 

GGGTATCTA7 CTCAGAAAAC CATCCAAAAA GTCAATGAAG CCATGCGAGA ATTGGGCTAT 1560 

AAACCCAACA ACCTGGCTCG TAGTCTGCAA GGAAAATCAG CTAAGTTAAT CGGCTTGATT 1620 

TTCCCCAATA TTTCCAATGT TTTCTATGCA GAATTGATTG ATAAATTGGA ACACCAACTC 1680 

TTCAAAAATG GTTACAAGAC CATCATCTGC AACAGTGAAC ATGATTCTGA GAAGGAACGC 1740 

GAATACATCG AAATGTTGGA AGCCAATCAG GTGGACGGCA TCATTTCTCG TAGTCACAAC 1800 

CTAGGAATCG AAGACTACAA TCGTGTCACA GCGCCGATTA TTTCCTTTGA CCGAAACCTA 1860 

TCGCCAGACA TCCCTGTCGT CTCCTCTGAC AACTATGCTG GTGGGGTTCT TGCTGCCCAA 192C 

ACCTTGGTCA AGACAGGTGC CCAGTCTATC ATCATGATTA CAGGGAATCA CAATTCTAAT 1980 

TCGCCAACCG GACTGCGCCA CGCTGGTTTT GCATCCGTAC TCCCAAAAGC TCCTATTATC 2040 

AATGTTTCCA GTGACTTTTC TCCCGTCAGA AAAGAAATGG AAATCAAGAA TATCTTGACC 2100 

CGGGAAAAAC CAGATGCCAT TTTTGCTTCG GATGATTTGA CAGCTATTCT GGTCATTAAA 2160 

ATCGCTCAAG AATTGGGCAT TTCTCTCCCA AAAGAGCTCA AGGTCATCGG CTATGA7GCC 2220 
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ACCTACTTTA TCGAAAATTA CTACCCTCAA TTGGCTACTA TCAAGCAACC TTTGGAAGAG 2280 

ATTGCTTGTC TCACTATTGA TCTTCTCTTG CAAAAGATTG AAGGCAAGGA AGTCGCCACA 2340 

ACTGCTTACT TCTTACCAGT TACGCTATTA CCAGGAAAAA GTATTTAAAC ACAAGAAAAC 2400 

TCAGACCGAT TCGTCTGAGT TTTTATGATC TTAAATTTTC GAGATAGCGC TGGGCTGTCT 2460 

CTAGGTTAAA GGTTTTATCT GAGATGAGGC GCTCTACTAG GGGAGCAACT TCAGATTCAC 2520 

TAGCCCCAGC TACGAGAGCT AGGGATTTGG CCTGTAGTTT CATGTGGCCT TGCTCCATCC 2580 

CCGTACTTAC CAAGCCTTTG AGGGCTGCAA AATTTTGAGC AAGACCGATG GACACGATAA 264 0 

TCTGGGCTAA TTCTCTGGCA GAAGGATTTC CTAGTAGATC ATGACTGAGA ACTACACGTG 2700 

GGTTGAGGCC CATAGAGCCA CCCTTAGTCG CTACAGGCAT GGGCAGGCTC ATCTCACCCA 27 60 

CCAATTCTTC TCTTTCAAGG TCCAGCGTCC AGCAGCTAAG ACCTTGATAG CGTCCATCTC 2 820 

GACTGGCAAA GGCATGGGCC CCAGCTTCGA TGGCACGCCA GTCATTACCA GTGGCAATCA 2880 

AAATCGCATC AATACCATTA AAAATTCCTT TATTATGAGT AGCAGCTCGG TAAGGATCAG 2940 

CCTGCGCAAA CTGACTAGCC AACCCAATTT TCTCCGCAAT CTCTCGTCCT TGATCCTTTT 3000 

GGCGGCTCAA GTAGCGAAAG GCGATGCGAC ACCTTGCACT CACCAGAGAA TCGGTCGCGT 3060 

AGTTGGACAG GATTCCCATG AGACTCTGTC CCTGACTGAG TTCTTCTAAG ACTGGTTTCA 3L20 

AGGCTTCCAG CATGGTGTTG AGCATATTGG CACCCATGGC TTCCTGGGTA TCGACATGAA 3180 

TATAAACAAC CAGAAAGTCT GGTTCGCCTT TTATCTGCTC CACATGCACA TCACGCGCCC 32 40 

CACCTCCACG TTTAACGATA GAAGGATAGG CTTCATTGGC AAGCTCCAAG AGCTCCGCTT 3300 

TCTTGCTGGC AATCTTCTCT TGCGCTACTT TAGGATTAGC AACTTGATAA AGGGCTACCT 3 3 60 

GCCCAATCAT CTGTCGCTGA TGGACTTGTG CAGTAAAACC ACCTGCACGC TTGATGATTT 3420 

TGCTGGCATA GCTGCCCGCC GCAACCACAG AGGGTTCTTC TGTCACATAG GGAACGGTGT 3480 

ATTCCTGACC GTTGACAAGT ACCTCCGGAA CCAGTGAATA AGGCAGAGAA AAAGTTCCCA 3540 

CTACATTCTC ACTCAGCTGC TCTGCCACAG TCACGCTCAT CTGTTCATCC TTCTCCAGAC 3 600 

TAGCTTGTCT CTCAGGACTA AGGAGCGCCT GAGCTTTTAA CAGCTCGAGG CGCTCTTGGT 3660 

ATGATTT T TT AGAAAATCCA TTCCAACTTA TCTTCATTAT TTTTCAACCT TGCTATAACG 3720 

GCGTTGGTGG TCGAGAATTT CAACCAAGGC AAAATCTTGA TTTTCATAGC CAGCAAACTG 37 80 

GGCAGAGTTA GTTTCATCCA AGTTTACTTC CTCAAAAAAG ACCTTTTCAT AGTCTGCAAC 3840 

GGATAGGGCA GTTCGTTGGT TGAGCTTGTT CAAACGGTCT TTATCCAAAT AAGCTTCATA 3900 

TCCTTCAACC AATTCACCAC TGAAGAACTC AGCCACAGCT CCACTTCCGT AACTATAAAG 3960 
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GCCGATTTTA 


TCCCCAGCTT 


TCAAGCTATC 


TGTATTTTCC 


AAGAGAGACA 


AAAGTCCAAG 


4020 


GAAAAGTCAA 


CCTGTGTAGA 


TATTCCCCAC 


CTTTTGACTG 


TAGAGAATAG 


ACTGGTCAAA 


4080 


ATGCTTTTGT 


AAGAGGTCTT 


TTTTCTCTTG 


AGGCAGGCTC 


TTATCCATGA 


TTTTTTTCAA 


4140 


GCCTTTTAGC 


GCTAATTTAG 


GATAAGGCAA 


GTGGAAACAA 


ACAGCCGCAA 


AATCATCCAA 


4200 


AGTAAGCTGG 


TAGCGTTTTT 


GATATTCAAG 


CCAAGTCGTT 


TTCAAACTAT 


CCAAGTATTG 


4260 


TTGGGTAGAA 


TAGACACCAT 


TTACATAAGG 


AGTTGTCGAG 


TAATTTGGTC 


GCCAGAAATC 


4320 


CATGATGTCA 


CGGGTCTCAG 


CTACATTCTC 


ATTATTAAAC 


GCCATCATGC 


GTGGATTTTG 


4380 


TGTAATCAAC 


ATAGCTACAC 


TTCCAGCACC 


TTGAGTTCGT 


TCTCCTGGAG 


TTTCAATACC 


4440 


GTATTTGGCA 


ATATCACTGG 


CAATGACCAA 


GACCTTGGAC 


TCCGGAGAAT 


TTTCCACATG 


4500 


CAATTTGGCA 


TAATGGAGGG 


CAGCAGTCGC 


TCCGTAGCAG 


GCTTCTTTAA 


TCTCGAAACT 


4560 


ACGAGCAAAG 


GGCTGGATGC 


CCAGCAAGCC 


ATGCACAAAG 


ACGCCCGCAG 


CCTTACTCTG 


4620 


CTCAATTCCT 


GACTCGCTCG 


CCACAATGAC 


CATGTCAACT 


TCTTGTCTTT 


CTTGCTCAGT 


4680 


TAAAATAGAG 


TCACTACCAC 


TGGCCGCCAA 


GGTCACGATA 


TCCTCAGTTA 


GGGGCGCAAT 


4740 


ACTCAATTCC 


TTGAGTAAGA 


GTCCTTTACT 


TAATTTTTCA 


GGGTCAATTC 


CCCTCGCTTC 


4800 


TGCTAAGTCT 


TGTAATTTCA 


AGACATATTG 


ACTGGTCGCA 


AAACCAATCT 


TATCAATACC 


4860 


GATTGTCATA 


TTTACCTCTG 


TTTTATCATT 


CATGTAAAAA 


ATCGTTCTAT 


ACTATTTTAT 


4920 


CACAAATCGC 


AGTAAAAGAG 


AGAAAAAAGA 


CTTGATTCAC 


CAAATCAAGC 


CTCTTATTGG 


4980 


TCATCATTTT 


AAAGAATGAT 


TAGTTGCTAG 


ACAGTTCACC 


GATATAAGTA 


GCTTTATAAG 


5040 


CTCCATTCAC 


AGTTATCAGC 


TCCTGGAGGA 


TCAAATTTCC 


TGAGTAAGTC 


CTTCCCATCT 


5100 


CATCTACAAA 


TTTTTGATAA 


AACTGACTGG 


TCGGAATTTC 


TCTGACATCC 


TTATCAAATG 


5160 


TCTTATCAAG 


TGTTTTACTA 


ACCTTCTCAG 


CAATCAATTG 


ATGCTCTTGC 


CATCCACTTT 


5220 


GAAACTCTGA 


GCCCGAACTA 


GAAACCATGA 


CTGGGATAAA 


CAACAAGGTC 


AGTAGATTTA 


5280 


CAGACAATAA 


GGAAAGTAGT 


AGACTTCCTG 


CAAAACTAGA 


ATCCTAGTTC 


ATGATTGATA 


5340 


ATACCAGCAA 


TCAAATTCAT 


TCGTAATCCG 


AAGCGTTTAC 


GATGATTTCG 


ATAGGTTGTT 


5400 


GAAAACATTT 


TAAACGTTTT 


TACTTTGGCA 


AAGATGTTCT 


CAACCTTGCT 


TCTCTCCTTA 


5460 


GATAGCGCAT 


GGTTACAGGC 


TTTATCTTCA 


GCTGTTAGCG 


GCTTGAGTTT 


GCTGGATTTA 


5520 


CGTGGAGTTT 


GTGCTTGAGG 


ATATATCTTC 


ATCACCCCTT 


GATAATCACT 


GTCAGCCAAG 


5580 


ATTTTACCAG 


CTTGTCCGAT 


ATTTCTGCAA 


CTCATTTTGA 


ACAACTTCAT 


ATCATGACTA 


5640 


TAGTTCACAG 


CGATATCCAA 


AGAAACAATT 


CTCCCTTGAC 


TTGTGACAAT 


CGCTTGAGCC 


5700 


TTCATAGCGT 


GAAATTTCTT 


TTTACCAGAA 


TCATTCGCTA 


ATTGTTTTTT 


AGGGCGATTG 


5760 
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ATTTTTACTT CCGTCACATC AATCATTATC GTGTCCTCAA AGCTGAGAGG AGTTCTTGAA 5820 

ATCGTAACAC CACTTTGAAC AAGAGTTACT TCAACCCATT GGCTCCGACG GATTAAGTTG 5880 

CTTTCGTGGA TACCAAAATC AGCCGCAATT TCTTCATAAG TGCGGTATTC TCGCACATAT 5940 

AGAAAGCGTT ATCAATTTAT TTATCTCATT TTTCAGAAAA TTCTTTTATT TCTGTAAAGT 6000 

CTACGATACT CGATGTGTT? TTATATAATG ATAGAGTCTC AGAATCACTC TTCCGCTAGC 6060 

CATTCCAATA GACATTACCA AAGCCAACAT GACAACCAAG GTCGCACTTG CCAGTGCTTT 6120 

ATTATAGTCC CCTGTCACAA AAAAGGCAGT TGTTCGGTAG GAGAGATAAC CTGGAACCAG 6180 

CGGTCCCAAA ATGGCCAAGA TAAAGACCAC ACCAGGTGTC TTATAAAGAA TACTTAAAAT 6240 

CTGGCTCACA CAAGAACCAA TAATGGCTGC AATGAAGGTA GCTACAATGA CATTGGTCGG 6 300 

TTCCTTGAGC AAGAGATAGA TTAGCCAGAC AGTCATGCCC AAAATCCCTC CAGGTAAGAG 6360 

CATAGACCGT TGCACATTGA GTACGATTAA AAAAGTGATA ATGCCAACAA AACTTGCTAC 6420 

TGCTTGTAAT AAAAAGGTTG TTAGTGTCAT ATTAGTTCAT CAATACCAAG GCGACAGAAG 64 80 

TTCCTGCCCC TAAAGCGAGG GTAATGAGCA GGGATTCAAA CATCTTACTC ATACCAGAGT 6540 

TTATGTGCTT GGTCATAATA TCACGGACCG CATTGGTCAA GGCAATACCT GGTACAAACG 6600 

GCATGACCGC ACCAGCTATA ATCAAATCTG CCGTTGAAGG AAAACCTGTG TAGCGAGCCC 6660 

AAAACTGGGC AATTATCCCA AAGACAAAGG CTCCAGCAAA GGCTGTCACA AAGGGAATTC 6720 

GGATAAATTT TTCCACATAG AGGGAAAAGG CAAAACCAAA TAAGGTCGCC ACTCCTGCCC 6780 

CAAGTGCGTC GTAGATATTT CCGCTAAACA TAACTGAAAA GAAAGGACCA CTAAAGGTCG 6840 

CAGCCAGAGT TACCTGCAAC TTAGTATAGG GAAGGGGTTT GGCTTGCAAG GCCGTCAATT 6900 

GCTTAAAGGC TGTTTCTAAG TCAATCTGCC CCCCAACTAG CTGACGAGAA ATCTGCTTCA 6960 

CATCGCAGAC TTTTTCGATG TTATAAGAAG AGGAGGTCAC GCGCTTCATG CGCAAATATT 7020 

GGTATTTTCA ATAGAGAAAA AGATAGCGCC AGGCATGGCA AGCACATTGC AATCCACAAT 7080 

CCCCTGCGAA TGCGCGATTC GAATCATGGT ATCTTCTACA CGATGGATTT CTGAGCCACT 7140 

TTTAAGGAGA ATAGTCCCCG CTAGCATAAT CACATCAATG ACGGCATTTA ATTCTTTTGA 7200 

TTCTTCCATG CTTTCCTCCT TTTATCAACT CCCTCTATTC TATCACAAAT CCGGACTCAA 72 60 

AAAAAATCTT TGCCATGAAA TCATGACAAA GATTGATTAC TCATTTTGAT TATCCATCTG 73 20 

CTTTTAAGGA GTAGCTGAAG TTGTTTTAGG TTTGTAGATT GAAATCTTGA CTCTAGTCTT 7 380 

ATTGAGGTCT ACCTTTTCAC CTGCTCTACG ACTTTGTTCA ACAACCATGC CTTCTGCACT 7440 

ACCTGCAGGC GCTGTCGTCA CTTCTACAAC TTCTATATTA GCTTCCTTAA TCCCAACAAT 7500 
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TTGAATCAAA TTGTTCTTAC TAAACTCCAA GCTAGAACCA ATGTAACTCG GCATCGCAAC 7 560 

ACTTGTAACT TTTTTAGCTA CTGTCAAGAC AATTTGACTA GGTTTACTCA CATCATAAGT 7 62 0 

CGTTCCGGCA CCTGGACTTT GTTTCATAAT CGTTCCTGGT TCGCTTTCGC TGGACTCTTC 7680 

TTCCTCTATC TTAATCAAAT TCTCAGGAAC CTTCTTCTCC TTGAGTTCTG AGATTACTTC 7740 

TGTAGAGTTC CCTCCAATAT AGTTCCCTAA TTGAATCGTC O r AU-T lT IT TAGCTACTGT 7800 

CAAAACAATT TGAGTTCCCT TGCTCAAGTC ATAGGTCGTA CCTTCTGGTA GACTTTGCTT 7860 

CAGGACCGTT CCAGCCTCAC TCTCATTCGA CTCTTCTTCC TCAATTTTAA TCAAATTATC 7920 

TGGAACTTTT TTCTCTTTTA ATTCCGCAAT GACATCAGAG GATTTCCGAC CGACATAATT 7980 

ACTAATTTGC AAAGATTGCT TGCCTGATGA CACAACCAAA TTGATTTTCG TTCCTTCTTT 8040 

TCGACCAGTT CCAGCGCCAG GATCTGTACG GATAATCCGC CCTTCTTCCA CCTTTTCACT 8100 

AGCCTCTGTC TTCTCCTCAC CAATCTCAAA ATTGGCTTTT TTGAGCGTTG CCTTGGCCTC 8160 

TGCAACTGTC TGACCTGCCA CATCTGGAAT GGCAATGGTT GCAGGACTTC TGGATAGTAT 8220 

CCAAATAAGA GAAGCTGCCA CCAATACAAG GCTGGCCAAC AAAATCAGGT AACGCATCTT 8280 

AAATCTATGT TTTTTCGGTG CTTGTGGTTG GTAAGTTTCC TCTCTCACAG CCTGGCTTGG 8340 

GTTTTTGATT GATTTGTGTT CTGTTTGCGC TTGAACCTTA GGAATAGATG TCAAGGTACT 8400 

CTGAGAAACC TTCGGCAAGG TCTTGGTATC TGCCTTGCTC GTTTCATCAA AGATTAACTT 8460 

ACTTTCATTT CTACGATTGT AGCACAACCT ACTAGACAAG TCCACATACA TCTCTGAAAC 8520 

CGAGCGGTAG CGATTGGTCA ACTTTTTACC AGTTGCCTTG ATAATAACAT TTTCTAAAGC 8580 

CTGAGGTACA GATGGATTTT CTGCAATAAC GGACGGCAGG GGTTTCTGGA AATCCTGGAG 8640 

GGCAATGGTC ACCGCGCTAT CCCCGTCATA AGGGATATGG CCTGTCAGCA TCTCATAGAA 8700 

AATAATCCCC ATGGCATAGA TATCACTCTG CACAGTCCCC TTCGAACCAC GCGCCTGCTC 8760 

TGGTGACAAG TAATGAACTG AGCCCAACAT CGAGTTAGTC TGGGTCAGAC TTCTCTCTGC 8 820 

AAAGGCTACA GCAATCCCAA AGTCTGTGAC CTTCGCAGTC CCATCTGGTG TCAAGAGGAT 8880 

ATTTTGAGGT TTCAAGTCCC TGTGAACAAT TCCTCGAGTA TGGGCCAAGC GCATAGCCAA 8940 

GAGAATTTGT CCCATGATAC GGACTGCTTC TTCATTAGAA AGAGGATAAT GTTCCTTGAT 9000 

ATAGCGTTTG AGGTCCACTC CAGCCACATA CTCCATAGCT AGGTACTGTT GACCGTCTTC 9060 

CTCGCCAATA TCTGTTATCC CAACGATATG AGGATGGTCT AGATCTGCCA TAGCTCTCCC 9120 

TTCACGCTGA AAACGAGCTA CACCTATCGG GTCCGTCTGG TACTTGGTCC TCAGAACCTT 9180 

CACTGCCACT TCTTCCCCAT CTAAGATTAA GTCTTTGGCT AGGTAGACAT CCGCCATACC 9240 

TCCTCGACCA ATCTGTTTGA CAATCCGATA GCGTCCGGCA AAAATCTTGC CGATTTGGAT 93 00 
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CATTCTGCAT 


CCTCCTCGTT 


CATAGAAACA 


AGGGCAALLLi 


TAATP.TTP.TP 


TAJIAPPTPPT 


9 360 


GCATTGTTAG 


CAAAACGAAC 


AAGTGTL TUu 


vjTTTT AT Liu 




ATPACTP.P.TT 


9420 


ACAATATCAC 


GAATCTCACT 


GCCTGAAATC 


ATGTTGGT C A 


AOLLL* i LAV- 1 


ATTr.Ar.PAAf. 


9480 


AGATAGTCAC 


CTGACTCAAG 


GATAACTGTC 


ym f+ k ft II X ft 

CCAAAAT C AG 


GLl viAA U IL 


niLl I 1 i 1 Itl. 




CCAATAGACT 


GGGTGATAAT 


ATTTTTTTGC 


GGATGAGCTT 


CTGCCTCTTC 


Tr^tnTi AT* 


9600 


TGACCAGCCT 


TGAGCAATTC 


ATTAACCAAG 


GAATGATCGC 


TLG I LAAL L Vj 


A i. I A i 


Q£ fin 


TCTCCACGAA 


TCAAGCCGAT 


ACGCGAATCA 


CCAATATGAG 


CAT AG A rAGL 


fTT &TT iTf ft 
L i\iA 1 lAILn 


9720 


ATAATAGCAA 


GGACTTCCAA 


AGTAGTTCCC 


ATGCCTCTGT 


ft ft C"T TPiT/* 


rrr & r*r" a Ar.r 


9780 


TGGTGAATCT 


TTTGATTTTC 


AATTTCTAGG 


TAATGGGCGA 


ALLA I TLALG 


L AL 1 • Ln I Iw 


9840 


ACTGTATCGA 


TCTGGGTATC 


AACCCAAGCT 


ACACCCAGGT 


LTu ivjAL L G L 


Lnl 1 1 I. AA. i. A 




GCGATATTCC 


CTGCGCGATG 


ACCTCCCATC 


CCATCAGCTA 


AAA I AA 1 L A 1 


pp.tapp/pppa 


9960 


GCTCTATTGA 


CATAGTGGTT 


GACATAGTCT 


TGGTTATTTG 


TTC GTTTCTG 


Af*f* a ArATrT 

AL.I.A/\LA1L L 


10020 


GTTAATAATG 


AAATTTCCAT 


GTGTCAGTTC 


CTTCCTAATC 


CGAT ATL 1 1 G 


r*p A A ft TTr AP 


L0080 


TGATGAAGAA 


TCCATCACTT 


CCATACAATT 


CAGGTGTAAT 


UAOuA I At. Avj 




10140 


TGATATCCTT 


ACATTCATGT 


TCTAGTTTTA 


CCTGCTCGAA 


LTlGGO A rG A 


rtrrrr a a a a 


10200 


AGGCCTTAAC 


GACTTGAAAA 


TTCTCCTCTG 


AGACGATAGT 


* (^r^Tif i***V ft 
uCAutj L i A 


T A A PT-r A TT A 


102 60 


TACCACCTTT GCCTACTATT. TGACAAACAC TACCTAATAT 


T TL 1 AAL i\iA 


All 1 tU lOtn 


10320 


AGGACGCGAA 


ATCTGCCGTT 


TCTTTATTGT 


ATTTGATATC 


1 ut» 1 I I 1 Luu 


PPPAAAAGAP 


10380 


CGATTCCTGA 


ACAAGGAGCA 


TCCACCAAAA 


TCTTATCAAA 


OuAATCv. iGG 


TP A A A AA. A PT 


10440 


CATGCACCTT 


TCTGGCATCC 


AATTTTTGAG 


TTTGAACCCG 


ATL 1 .V.AAL I 


PPPAP.APP.TT 


10500 


GGGCATTTTC 


TTGAATTAAA 


TCCAACTTGT 


GGTCGTACAA 


p«pp a p a P.P a. 


PTAAPPTGAC 


10560 


CTCTCG7AAG 


ATAAGAGGCT 


ATATGGGCTG 


TTTTCCCACC 


TGGAGCCGCA 


CAGGCATCAA 


10620 


GCACTCGCTC 


ATCACCTTGT 


AAATCAAGCG 


TCGGAGCAAC 


CAGCTGACTG 


GACTCATCTT 


10690 


CGATGGTAAT 


GGCTCCATCC 


GCAAACAAAT 


TATGCCCTGC 


AAAATGCCCC 


TGCTCCTTAA 


10740 


CCAGACCAGT 


GGTTGCTAAA 


AGGGAATTAT 


TCGCCTCCAA 


CAAGGCTTGG 


ATTTCCTCTT 


10800 


TTCGACTTAG 


GTCTGTTACA 


CGAATACTGG 


CTTTGTTTCG 


CACTAACAGC 


CTTTCAAAGA 


10860 


TGGCTTTTGC 


TCTCTCCTCT 


CCGTATTCTT 


CCTTGAGTTT 


GGCAACTAGC 


CAAACTGGGA 


10920 


GAGAATAGGC 


AATGGAGTCA 


CGCTTGTTTT 


TTCGCTTGAT 


GCTAGCAATA 


TCTGGCCAGC 


10980 


CTTCACGCAA 


GATACGGCGA 


AGGACAGCGT 


TGACCAATTT 


TTCACTGCCT 


TTTTTACGGA 


11040 
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GTTTGGCCAA TTCCACTGCT TCATTAACCA CAGCATGATC TGGAATCTTG TCCAAATAGC U100 

GGAGTTGGTA GGCACTCATG AGAAGAAGGA CATAGAGCCA GCTGTCTAAC TGGTCTCTGT 11160 

CTTCGATAAA GTGGGATAGG TACCATTCCA GAGTCAGTTT ACGGGCTACC GTTCCATAGA 11220 

CCAGCTCGGT CACTAAGCCC TTGTCTGCTG CCAAAAGTTG ACTTCCCTTT AGATGCTTAT 11280 

TTAAGGCGAT ATTTGAATAT GCTTGGTTCA CAAAAACATC CTCTAGCACT GCTAGAGCTA 11340 

AACTTCTAGC CGTTTCTACT TTAGTCACCA AATCGTTCTC CTACAGTCAA TGTACCTCCA 11400 

ACTCCGTTGA CGAAGGAAGC AATGTCCATC TTAGGCTTAC CAGCTGGCTG CACTTGTTTG 11460 

AGGGATAGAG CCCCTTCAGC CCTTGCGACA ATCAATTCTT TCTTGCCGAT AGAGAGAATC 11520 

TCACCTGGAT TTCCCTGACC TTCTACTGGT AGGGCTTCAT AAATCTTAAA GCGGTCGCCC 11530 

TTAAGGAAAG TATGGGCAAC AGGCCAGGGG TTCATTCCAC GAATTTGGTT AAAGAGTTGA 11640 

CGATTGGTTT TGTTCCAGTC CAGTTTTTCT TCCTCTGGCT TTATATTTGG AGAGAAGGTA 11700 

ACCTGACTCG TATCCTGCGG TTCAGGTTTG ATATCACCAG CAATATAGGC AGGCAGAGTG 11760 

TCCAAAAGCA AATCACGACC AACTAGCGCC AATTTTTCAA ACAAGGTGCC AACATTGTCC 11820 

TCATCTGTGA TCGGAATGCT GCGACGAGAA ATCATATCTC CTGCATCCAT TTCCTTAACC 11880 

ATTTCCATGA TGGTCACACC AGCTTCCTCA TCCCCTTGAA TCAAGGCATA ATGGATAGGC 11940 

GCACCACCAC GGTCTCTAGG AAGGAGGGAG GCATGAACCT TGACAGCAAA CTCCATGCTA 12000 

TCAAGGAGTT TGCTTGGGAG AAACTGCCCA AAAGCAGCAG TCACAATTCC ATCTGCTCCT 12060 

AGCTTCATAA GATCTTCCAT CTCTGGACTT CCAGATAATT TTTCAGGTTG GTAGATACAT 12120 

AGTCCTGCTT CCTTGGCAGC CTGCTTGACT GGGCTTTCTT GG AT AACTTT TTTACGACCA 12180 

ACAGCACGGT CTGGCTGGGT CACAACGGCT AGAATTTCGT AACGGTCATC TGTCAAAAGT 12240 

CCTTTTAAGA CTGTTGCTGA AAAGTCGGGG GTCCCCATAA AGATTAGTTT TGTCATATCT 12 300 

TCTCCTTCTT ATAAAAATTG CTGCGGCTCA TGGTCAATCC TCAGACCGAG CTCACTATTT 123 60 

TCCCGTTCTT GAGTCAACGC TAAAACCTCC TTGAGGGTCG ACCCCAGCTC ATCTTCTAAA 12420 

CGGTATTTAA TTAAAATCTG GTAATGATAG AGCTTGTGGG TACGGGCAAT CGGTTTTGGC 12480 

GTTGGCCCCA GAATGGGACT GCTCTCTGAC AAGCCTGACC GCAAAATGTT CATGACTTCA 12540 

TAGGCACGTT TGAAAACCTC TTCTTCTTTC TTGTGAGAAA GGGTAATACC AATCGTGAAA 12 600 
TAGTAAGGTG GATAGCCGAG TTGTCGTCTG ATTCCCATTT CATAGGCATA AAAGCCTTCG 12660 
TAATCTTGAT CCTTGGCAAA TCGAATAGCA TAGTGCTGCG GATTGTAGGA CTGTATCAAG 12720 
ACTTGACCTC CCTTTTCAGC CCCACCTGAT CGACCTGCCA CCTGAGTCAA GACCTCGAAG 12780 
GTTCTCTCAG AAGAACGGAA ATCAGGCAGA TTCAAGGCCG TATCCGCATT TACAACTCCG 12840 
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ACTAGCGTAA 


CATTGGGAAA 


ATCCAAACCC 


TTTCCAATCA 


TCTGAGTACC 


AAGTAAAATA 


12900 


TCCCCTTCCC 


CTCGCCCAAA 


CTGGTCAAGC 


AAGGCTTGGT 


GACTCCCTTT 


CTTTCGAGTC 


12960 


CTATCCACAT 


CCATCCTCAG 


AATGCGAGCT 


TGGGGAAAGA 


GTTCTGCTAG 


CTCATCATAA 


13020 


CCCTTCTGAG 


TTCCCGTCCC 


ATAGTAACGA 


ATACTGCGGC 


TCTTACAGTT 


AGGACAGACC 


13080 


TGAGGAATAT 


CCTTCCAGAA- 


-ACCACAATAA 


TGGCAGTTCA 


TAGTCTTGGT 


ATCCATATCC 


13140 


AAGGTCAGAG 


AAATATCGCA 


CTTGGGACAA 


GTATCCACCG 


TCCCACACTC 


CCGACACATG 


13200 


ACAAAGCTAG 


AATAACCACG 


GCCATTGAGC 


ATGAGAACCA 


CCTCCTCTTT 


TTTAACCAGA 


13260 


CGCTCTTGCA 


TAGCCTCTAG 


CAAAGGAGCC 


GTAAAGTTTG 


ACGTCTCATT 


TTGTCCGATA 


13320 


TAGTCTCCAA 


AGTCAATCAC 


TTGAACCTCA 


GGGATTGTAC 


CCAAAGCATT 


GGCACGTTGG 


13380 


GTTAGACCTA 


AGTGTTGATA 


GACCCCTTTC 


CCACCACGTC 


CCCGCCTCTC 


TAACCTCGCC 


13440 


GTTGCAGATC 


CAACTACCAG 


AGTTGCTTGA 


TTATACTGAG 


CCCGTAAAAT 


AGCTACCTCT 


13500 


CTGGCATGGT 


AACGGGGATT 


GCTGTCCTGC 


TTATAAGCCG 


CTTCATCCTC 


TTCATCAATA 


13560 


ATCATGACAC 


CCAGATTTTT 


CACAGGAGCA 


AAGATAGCAG 


ATCTGGCACC 


AACAACAACT 


13620 


TGCGCATCCC 


CACGCTCCAC 


CTTGCCCCA7 


TCATCATACT 


TTTCACCATT 


GGATAATCCT 


13680 


GAGTGAAGAA 


TGGCTACCTT 


CTCCCCAAAA 


CGTGCTATAA 


AACGCTCGGT 


CATCTGACCA 


13740 


GTCAAGGAAA 


TCTCAGCTAC 


CAGCAAAATA 


CCTC7CTTCC 


CCTTATCCAG 


CGCACCTTGG 


13800 


ATAATCTGCA 


AGTAAACCTC 


GGTCTTCCCA 


CTTCCTCTAA 


TCCCTTGAAG 


TAGAAACGGA 


13860 


GGTTGAGAAC 


TGCCAATACA 


ACTCACAACC 


GCATCACGCC 


CCTGTCTT7G 


TTCTGGATTT 


13920 


AACTCCAAAG 


GTCTACTTGC 


TTCAATTCCT 


TCAAAATAAG 


CAGCCGAGCG 


TTGAACTTCC 


13980 


TTTTGGACTA 


TGGTAACAGC 


ACCTTGATCC 


ACAAAGAAC? 


TGACTTGCTC 


TCGCGAGTAG 


14040 


GACTCTAACA 


AGCTAGCCAA 


GGAAGCGCTC 


TCTCGATGAC 


ACACCAGATA 


ATCTCTCAGT 


14100 


TCCAACTTTT 


TCTTGGCACG 


TCTAGAAATC 


TCAACACCTT 


OTAATTGAGC 


ATGGTCAACC 


14160 


TCATACCAAG 


ACTGGCTCTT 


GACCTTCTTT 


TCATCGACTC 


CCTCATA7TC 


CACACCAACC 


14220 


AGGCCTTTTC 


TAGTCAAACG 


CATCATTTCA 


GCTTCCTTGG 


CAAGGTCTAG 


TGAAGAAAAG 


14230 


GCTAGCGAAT 


CTTCTGAACC 


AAACAGGCGC 


ACTCGTTCTT 


CCTGACTCAA 


GCCTTCCAGA 


14340 


GGATAGAGAA 


TCTTGTCATA 


GCTAGAATTC 


AGAAACCCTG 


GAAGCATCGC 


CTTGAGGATA 


14400 


GAGATTTTGT 


AGGAGAAGAC 


AGATTTGCGT 


AACTCCTCAG 


CCACCCAGAG 


TTCTTCTGGC 


14460 


GTCAGAACAG 


GAGAAAAATC 


CAGCACCTCT 


GCAATATCTT 


TTAAATCTTG 


CTCCATCTCT 


14520 


TCTCCATCTG 


ATTGGGACTT 


CAAACCAAGA 


ACAATCCCTT 


GAATCACGCG 


ATTACCCTTA 


14530 
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CCAAAAGGCA CATGAACCCC CATCCCAACT TCCAGCATTC CCTCAAATTC CTCCGGAATC 14 640 

CTCTAACTAT AGGGCTGGTC CGTCTGCATC AAGGGCACAT CTACGATAAT CTTAGCTAGG 14700 

GCCATCTTCT CACCTCCTCC TTGTCAGTAC ATTCTTGCAA TAGAAAAAAT AAGATTGAGT 14760 

CCCCCCAACC TTAAATTTTT TCACCATCTT CTTTTTC TH* AGCAATTTGC TCTTTGATTT 14820 

TCTTTTCTTC TTCTTCTTTG CGGCGTTTTT CTTCTTCGAT ACGGCGACGC ACTGCTTCAC 14880 

GTTTTCCTTC TGGATCTGGG TGAATTGTAA CGTTTCCTGA TTCGATTTCT TCTAAAGCGC 14940 

GAAGAGTTGA TTTTTCAGAC TTGAAACCTT GAGTTGCTGG GGCACCTGCT TCCAATTCGT 15000 

GGGCACGTTT TGCTTCCAAG ATTACGAGTC AATATTTTGA AGGAACCTTG TCGAGCAAGG 15060 

TATCAATAGA GGGTTTTAAC ATCATTTGCT TGTACCTATT TTCTAAATTT TATCGCCTAC 15120 

TTGGAGATTT TGGTAACATC TCCTGATAGT GACCAATGAC ACGATCCACA CAGAAGTGTT 15180 

CTGCTTCAAT CACACATTTG ACACGTTCAG CAGCTAGGGG TACCTGATCG TTGACAATCG 152 40 

CATAATCATA CTCACGCATG AGGGCAATTT CTTCCTTGGC CTTTTCCATT CGTTGGGCAA 15300 

TCACTTCTGC ACTATCTGTT CCACGACCTA CCAAGCGATC TTGCAATTCA TCCAAATCTG 153 60 

CTGGTCTCAG GAAGATAAAG ACAGCATCTG GAACCTTTTT CTTGACCTGA AGAGCACCCT 15420 

GAACTTCAAT TTCAAGGAAA ACATCGATTC CCTTGTCCAA GGTTTCATTG ACATAGCTCA 1S480 

GAGGAGTTCC ATAGTAGTTA CCGACATATT CTGCCTATTC CAACATCTGT CCTTGACGAA 15540 

TCAGCTCTTC AAATTCTTCA CCAGTACGGA AGAAATAGTC AACACCCTCC ACTTCTCCAG 15600 

GACGTTGTGC GCGTGTCGTC ATCGATACAG AATATTGAAA TTGGTTTTCA CAACTCTCAA 15660 

AAATCTCTCT TCTAACCGTT CCTTTTCCAA CCCCTGAAGG ACCAGAAAAA ACGATTAGTA L5720 

AGCCTCGGTC TGCCATTGTG TCTCCTTTTA GTCAATCTGT GAAATAACAT TTCTCTAGAA 15780 

TAATGGCAAA AAGCCAGATT ATCCTTTACA GTCTTTCTAT CTAGTGTAAC AAAAAAGCAG 15840 

TAATTTTTCA ACTGCTCTTT CTTATTTATT TAGCATAATC TACTGCACGA AGCTCGCGAA 15900 

TCACGGTTAC CTTGATATTT CCTGGATAAT CGAGATTGTT TTCAATTTTC TTACGAACTT 15960 

TCTCACCCAA GATTGTGACT TTCTCGTCCT TGATTTTTCC TGGATTGACC ATGATACGAA 16020 

TTTCACGTCC TGCTTGAAGG GCAAAGCTAG TTTGCACTCC TTCAAAGCCG TTAGCAATTT 16080 

CTTCCAAATC ATGGAGACGC TTGATGTAGC TTTCAAGAGA CTCACTACGA GCACCTGGAC 16140 

CCGCTCCCCT CAAGGCATCT GCTGCAGCGA CGATAACTGC 7ATCACCCTC TCAGCTTCAA 16200 

CATCTCCGTG GTGACTACCA ATCGTATTCA CCACAACTCG GGGTTCCTTG TACTTACGGG 16260 

CCAATTCCAT ACCGATTTCA ACGTGGCTAC CTTCAACCTC ATGGTCAATG GCTTTCCCGA 163 20 

TATCGTGAAG GAATCCAGCA CGACGGGCAA CAGCCGCATT TTCACCAAGT TCGCTCGCCA 16 380 
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CCAGCGATAC 


GTTGCATAGC 


TTGAACuAAl* 




16960 


CCATTTTGTC 


AGAACGTTCC 


TTGACCTC7T 


GCTCAGCTTC 


GCGAATGCGA 


rflTP A AT/"*T 




CCCTGGTCAA 


GTTTTCCTCT 


GTCTGAGCCA 


AGATAATATC 


TCGTGCTT(_ i 


rr^TnPiP a 
Lit-L i VjAUAwA 


L070U 


GCGCACCAAT 


ACGCTCTAGT 


TCTGCTTCTT 


TTTGTCTTTC 


^ A f^TPPTP^ 

VjALTI lLi 


A A T^P^ / ,fc ^^^^PT 
ni\ 1 I VjV- IV. 1 1 




CACGCGCATC 


AAGGTTTTTC 


GCTCTATCAG 


* ft ft. ff% ft ^H*rtftWft^ 

AAATACTTTG 


TTCTTTTTGT 


TCAAGTGTTT 


1-7 100 


GTTCTTTACT 


CGTCAAATTG 


TCGTCCTTAC 


GGTCAAGGCT 


AGTAGCTtTTC 


TfT^Tf"' MAP 




GACTTTCGAT 


TTGTTTGAGT 


i mm utVJ~l ft ^/NWMW 

TCTTGACGTT 


CTGATTTGAA 




A f ^T^H PTTP A ^ 


17220 


GCTATTTTCT 


GGCTTCTTCT 


TTGGCCTCCA 


ATAGTGCTTC 


I"L~l l'i l AAOA 


IsinL 1 L vjV- 111 


17280 


CACGTTTGCC 


TTCATTAACA 


AGTAAATCCG 


CTTCACGCTC 


AGCTTU rut. A 


LO 1 WW 1 1 MV«r 


17340 


TTGCTTCTTG 


TTCAGCATTT 


AAAAGCATCA 


ACTCTGCAGC 


TTri^TT A/™ AT 


Vinl 1 1 1 v_ l 


17400 


TAGCTGAGAT 


GCTGACATAT 


CCAATGACTA 


AACCAATGAT 


GACGGCAAAA 


nLAwtAA 1 \— O 


17 460 


CAAGCGACAT 


GATTTCCATG 


TTTTTACCTC 


ATTTTATTGT 


Tiwrrr & at 


f^AfATAfATT 


17520 


CTrTTACATT 


CTACCATAAA 


AAAGTGATTT 


TCACAAALW T 


a AAATA^""AAT 


ATftTTTTG AG 


17580 


CAATTTGGAA 


CACATTTACC 


AAAATAAACT 


TGTTGTTTAG 


A A A T &f!T if.T 
AAA 1 rtO 1 MO 1 


TTAdTAGAGA 


17640 


CTTGAGAAAA 


AGCCTACCTT 


TCAATAGACT 


TAGTAATGAT 


CTTTAAAGGA 


CAAGAAAGCC 


17700 


ACCCTATCTC 


CATCCATCAT 


ATAAATCAAG 


CGATTTTCTG 


CATCAATACG 


CCGTGACCAG 


17760 


GCTCCTTGGT 


AATCATATTT 


GAGTGGTTCT 


GGTTTACCTA 


TTCCTGTAAA 


GGGATCACGT 


17820 


TGAATATCCT 


TGATTAGTTT 


ATTGATTCTT 


TTTAACGTTT 


TCTTATCCTG 


ATTTTGCCAG 


17880 


TAGCAATAAT 


CTGCCCAGGC 


ATCTTCTGTA 


AACTTGAGCA 


GCATTTCTTA 


CTCCTCAATA 


17940 


ACATGGACCT 


GAGTACTTCC 


AGCACGAACT 


TGAGCCATTC 


CTCGCAAAAC 


CTTATCAGAA 


18000 


AGTTCCTTAT 


TTTGAGCAAT 


TCTCAGGGTT 


TCTTGGATAC 


TATCCCACTC 


ACTCTTTGAA 


18060 


AGGACTACAA 


TGTCCTCATC 


TGGATTTTTA 


TTGACCACCG 


TCAAAGGCTC 


AAATTCATCA 


18120 
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TTTACCTTCT TCATCTACTC CTTTAAATGA TTTCGGAATG 
ATAACCATAC CTCGTTTTAG CTCTTTTCCA CTATTATACA 
GGAACTTGTA CAAGATTTTC TTTTCTATCT ATTTATACTC 
CTAGGAAACT AGCCGCAGGC TGTACTTGAG TACGGCAAGG 
TTTGATTTTC GAAGAGTATT ATTCGTAAAA AATCTCAAAA 
TAGTTTGTTT CTATTC 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS : 

< A} LENGTH: 7001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 



TTGAGTAAAG CACTCCTTCC 
CGAAAAGAAA CAAATTGTCA 
AATGAAAATC AAAGAGCAAA 
CGACGTTCAC CCGATTTGAA 
AGCCTACCTT TCGGTAGACT 



13180 
18240 
18300 
18360 
18420 
18436 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
ACGTAGAAAA ACTATTTCTA TCACAGATAA TATTCCGTAT CTTCTTGGAG GTATTGAAAT 
AAACGTCCTA GGTATCTTTC TCAGTCTATG TGACTTACAA GGGAAAACTC TTTTCGAGAC 
AGAAATTTTG AATGAAGATT ATCCTATTTC AGAAATCAAT TCCACCATTA CCAATATGAT 
AAAAACAGCT ATAGAGTACG TCCCTTTGGA AACAAAATTA CTTGGATTTG GCTTATCAAT 
ACCTGGACAT TATAACAAAG ACTCCGGAAG T AT C ATT AC A AACAACCCCA TATGGGAATC 
TTTTAATTTA TTAAATGTAA TTAAAAGATT CAATTTTCCT TTTATTGTAA AAAATAATAT 
CGATTGTATG GCTATAGGAC AATACCTTTT TAATCCACAC AATACCCCCG ATAACTTTAT 
TTTCCTACAC GCTGGATTAG GTATTTACAC TTCCTTTTTC ACAAAAGAAA AAATAGGAGC 
CTCTAAAAAT CCTTATATCG GAGAAATTGG ACACACCATT GTCGAATTGA ATGGGCAATA 
TTGTGAATGC GGAAAAAAAG CTTCTTTACA AACATATATT TCGGATGCTT GGTTAATCAA 
ACACGCCCAA TTATTATTTA AAAATTCCCA ACTAACTGTA CTAAAAACCC TTCTAAAGAC 
TGAAAAAGAC ATTCATTTAG ACACCCTTTT AACGGCTTAT AATTTAGCCG ACTCCGCTTT 
ACGTCAACAA ATTGATAAAG GAGTCAATTT ATT AG CC ACT TCTATTGCAA ATCTCCTCCT 
CATCAATCCT GCTGATAAAA TCTATATCAA CAGTCAATTG CTTAATTATC AACCTTTCAC 
TCATGAAGTC AGGGATAAAA TCCAAGACCA GCTCCACTTC GTTCCCTTTA CTCCTAATAT 
AGAAATTGAA ATTTTACCTT ACAACAAACA TCGTGGAAGT ATAGGACCTT CTCCATTACC 
TATCGTCGCT TTTTTCATAG AACATAGCAA TGTATTACAA GATATTATTT CACCTTAATA 
TATTAGAAAT CTATAGACCT GTTTAAATCA ACTATAACCT GTAGTAGATA TCTCGTATTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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AGACAATATG AAAACAAGAC CACTTCCATA TAGGAAACCC CCTTCTCGCT ATGTTGAGTG 1140 

ATTTATATTA AAATAACTTT TCTTCTAGCT GCATTTTATT ATTATAAAAA CATTCATCAT 1200 

AACCCCCAGA ACTTAAATAA CAATTTTTAT TCAAGATACA TACTCCTAGA ATAAACTTTA 12 60 

TATGAAATTC TCATTTTTGT TTTTACAATT CTCCTTAGTT AAATCTTGTT TAATATATGT 1320 

TTTACATATA GTATTTAGCG CC AC AT AGTA CTGAACTCTC TCCAAAAACC GTTATTCCTC 1380 

TTTGAATAGG CCGTTATCAC AAGAAAAGCA TCTCCACGTT TCAACTTCAT"aTCGCTCAAA 1440 

AACAATCAAT TGATGCTAAA ACCTGTACCT ACATGTTTCC GTTCATAAAA CCATGAAACT 1500 

GTAAAAGTGG ATGAAATTGA TAGCGATAGT CAAATCAAGA GGCATCATAA CTCTAAAAAC 1560 

TCACAATATA TAAGTTCATC CTCGGAAAAA TATCATTCTA ATTGTTGAAA TGCCTACATG 1620 

AAAAGAAACG TCAAATGCTC ATGAAACAAC GAATACAGGT ATCAAAACTA TGACAAAACA 1680 

AATCCCTAAA TTTACTAAAG ACACTGCTCA ACTTTACACC TGT AAATGGT TGTTGTATAA 1740 

TAAAGTTACA AAGATGTACG ACCACACTGT TGTAAATCAT AGTCTTCGCG AATATATTAC 1800 

TGATAGCATT TCTACAAATA CAAGTAAAGA GAGCGGATGA CATTCAAACG AAATATGTCA 1860 

GTGCTTTGGC ATTCCTAGCC TTCATATCAT TTAAAGAATT CT AT AG AC AA AATTTTTTCC 1920 

AATACAGACA CTCGTAACAA CTGCTTCATT TTTCTACCAA CATATTTAGG AACAGGATAA 1980 

GATACAAGAG TATTAATCCA TAGCTCAGTT CTATACCAAT CTAAGACAAA TAAGCTAAAA 2040 

AAACGATTGA TAATAAGCAA ATAGATTCCA AATTTTCTCT ATCTGCTCAT TTTAATAAAC 2100 
AATACTAGTG TAACTATCCT TCCACTCAGA AGCTTGTCAA ATCACACCGA AAATTCTTCT 2160 
AAAATTTATC TCGTTAGGCA ATCAAGCAAA AACTCGACGA TAGTACAAAC ATTATCATAC 2220 
AGGATTGACT TCCTAAATTA TATACTTTAG TAAGGTTTTC GGATAAGAAA AAAGGTTCAT 
TTTACATTTC TAAACATTCT TTTCTAAGAT GAAAAACAGA ATTTTTCGAT TGTGATTTAA 
AGCAACAAGA AGATTTTCAG TATCATCCTA TAGATACGAG CTAATTAAGA AAAACTACAT 2400 
TTTTGAATAT AAACTACAAT AATATAAACT AAATTTTATA GGAGGAAGAC AATGGATTGG 24 60 

TACGATTATA TGATACAGGC ATCCAAACAA TCACAATTCA ACGCAAGCCA TTGGTTTCGC 
TATTTGCGAA AAGTTATTTT TGAAGACTAT TCTTATTTAA CAAACCAAGA TGTAGAAAAG 
TTGCTAGACT CCAAAGAACT AACCCGTTTT CAAAAAATTA GCTTGAAGTA TGCCTTTCAA 2640 
GAGCATACTC CAACTCATAA ATATGTGATT TCATTAAATA AACCTGCTAA GTTAACCAAT 
GTTCAAAAAT TGATGGAGAA ATACAAACAT GGATAAAATG AAACCGGTCT TCCAAGCCCT 
AAATAAGGAA TTAATTCAGG AAAATCTGAC TTTAACAATT ATCTGTGTCG GTGGTTATGT 



2280 
2340 



2520 
2580 



2700 
2760 
2320 
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CTTAGAATAT 


CATGGTTTAC 


GTGCCACACA 


AGATGTTGAT 


GCTTTTATGG 


CTCTATAATA 


2880 


TTTGTAGTGG 


GTAAATCCCC 


TATGGATATT 


ATGGAGCCTA 


TTTTTGTGTA 


GAAAAAAAGT 


2940 


CCCATATGAC 


CTATAATGAA 


AAGCGACAAA 


ACAACTCATT 


AGAAAGAATC 


ATATGGAACA 


3000 


ATTACATTTT 


ATCACAAAAT 


TACTAGACAT 


TAAAGACCCT 


AATATCCAGA 


TTTTAGACAT 


3060 


CATCAATAAG 


GATACACACA 


AGGAAATCAT 


CGCCAAACTG 


GACTACGACG 


CCCCATCTTG 


3120 


CCCTGAGTGC 


GGAAACCAAT 


TGAAGAAATA 


TGACTTTCAA 


AAACCGTCTA 


AGATCCCTTA 


3180 


CCTCGAAACA 


ACTGGTATGC 


CTTCTAGAAT 


TCTCCTTAGA 


AAACGCCGTT 


TCAAGTGCTA 


3240 


TCACTGTTCA 


AAAATGATGG 


TCGCTGAAAC 


TTCTATCGTC 


AAGAAGAATC 


ATCAAATTCC 


3300 


TCGTATTATC 


AACCAAAAAA 


TTGCGCAAAA 


GTTGATTGAG 


AAGATTTCTA 


TGACCGATAT 


3360 


TGCTCATCAG 


CTGGCCATTT 


CAACTTCAAC 


TGTCATTCGC 


AAGCTCAATG 


ATTCTCACTT 


3420 


TGAGCATGAT 


TTTTCGCGTC 


TTCCTGAGAT 


TATGTCCTGG 


GACGTTGAAA 


CAGTCCGGGG 


3480 


AGTGACTGTT 


TCAATCGGGA 


GATGGAGATG 


AGCTTTATTG 


CGCAAGATTT 


TGAAAAGCTC 


3540 


GATATCATCA 


CTGTTCTTGA 


AGGTAGAACA 


CAAGCTGTCA 


TCCGAGATCA 


CTTTCTTAAA 


3 600 


TAT GAT AG AG 


CCGTCCGATG 


TCGCGTCAAA 


ATTATTACTA 


TGGATATGTT 


TAGTCCTTAC 


3660 


TATGACTTAG 


CTAGACAACT 


TTTCCCGTGT 


GCTAAAATCC 


TTCTTGATCG 


CTTTCACATT 


3720 


GTACAACATC 


TTAGCCGTGC 


TATGAGTCGT 


GTGCGTGTCC 


AAATCATGAA 


TCAGTTTCAT 


3780 


CGAAAATCCC 


ATGAATACAA 


GGCTATCAAG 


CGCTACTGGA 


AACTCATTCA 


ACAGGATAGC 


3840 


CGTAAACTCA 


GCGATAAACA 


TTTTTATCGC 


CCTACTTTTC 


GTATGCATTT 


AACCAATAAA 


3900 


GACATTTTAG 


ACAAGCTTTT 


GAGCTATTCA 


CAAGACTTGA 


AACATCACTA 


TCACCTCTAT 


3960 


CAACTCTTGC 


TGTTTCACTT 


TCAGAATAAG 


GAACCGGAGA 


AATTTTTCGA 


ACTTATCGAG 


4020 


GACAATCTTA 


AGCAGGTTCA 


TCCTATTTTT 


CACACTGTCT 


TTAAAACCTT 


CCTCAAAGAT 


4080 


AAAGAAAAGG 


TTATCAACGC 


CCTTCAACTA 


CACTATTCTA 


ATGCCAAACT 


GGAAGCGACC 


4140 


AATAATCTCA 


TCAAACTTAT 


CAAGCGCAAT 


GCCTTTGGTT 


TTCGAAACTT 


TGAAAACTTC 


4200 


AAAAAACGGA 


TTTTTATCGC 


TCTGAATATC 


AAAAAAGAAA 


GGACAAAATT 


TCTCCTTTCT 


4260 


CGAGCTTAGC 


TTTTTTTCAA 


CCCACTACAG 


TTGACAAAGA 


GCCGGAAAAA 


GGAACAGCCT 


4320 


TAGCTTTCCT 


TTCATTTCTT 


TTTATTTCCC 


TCGTAGTAAA 


CGTGCTAGCT 


TCCACAAAAC 


4380 


AAACAGGATT 


CCCAGAAATG 


CCAGTACCAC 


TAGCCCACGG 


TACAACCATT 


GACAGGTTCC 


4440 


AACACGCGAT 


ACAGATTGTC 


CTTCTTTCGT 


AAAAGCAACC 


CTCGCAACTG 


CAGCTGTTTG 


450O 


TGGATCTGAT 


TTTTGATAAA 


, CAGCGACTCG 


; TTCAAAATTC 


ACTAATAAGC 


GTTTATTAAA 


4560 


GGTAGGAATC 


GGATCGCAGG 


l TTATCAAGGT 


' CATGATATTT 


1 TTAGAGCTAA 


> CCGATTCTAA 


4620 
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TTTTTCCCAT TCCGACGCTA AAATAATCTC TGTGTCCATC ATCTGATATT CTACAATTTC 4680 

CTCGCCATTA TCATAATAAA GAGCATCTCC AACTTTTAGC TGATCCAAAT GGCGGAAAAA 4740 

GACATGGCTT GGCTCTGCAC CGTCCCCAGC AATCACTGAG CCAATCCCTG TACCATCCAG 4 800 

AGGCAGCGGT GTACCATCCA CATGAGCCAA GCCCATCCCT AAATGATGAT AATCTGCTCC 4 860 

CAAATAAACC GGCTCCATGA TTTCCAAACT TGGAATAGAC AAGTAACCAT ACACTGCATC 4920 

AGGGTCGTCA GACACTTGGT AATTGACCTC ATATCCCTCC GCCAAAAAAG GATCTACAAT 4980 

GCGATTTTGC GAAGCCAAGC GTTGATTGTA GGCGAGAGAA TGCTTCTCTT GTTCTTGGTA 5040 

CATTTCAGTT GTCATGGATT TCACAAATGT AGCATGACCT TTCACCTGTC CAAGAGACTG SI 00 

CAACACCATC TGTCCAAAAC AATAAATAGG AATCAAACAG GCTACCAACA TCAACAAGTA 5160 

TCCCAATAAG GCTCGTAGTT TAGTCCTTGA CATGACGCCC CTCCAATTGC TTTTCTACTC 5220 

CTTTGACAAT CCCTCCATTA CGATACACGC GATACAGCAA GAGAAGGATG ACCGCCATCG 5280 

CTCCTACTAA TAACCACAAC CAGAATTGCC CACGCTCTCT CACCGCTCGA TTCCGCTCTG 5340 

CAATTGGTGC CGTATACGGA ATCCCCTTCC CACGTACCAA CAGACGATGA CTGTTAATCA 5 400 

TATACGGTGT ACAAGTCAAC AAGGTCGCAT AATCTTCCCC ATGTTGAATC AAGACAGGCT 5460 

CAAAGTCATT CGGCTCCACC GTCACTATCT GATCCACTTG GTAGGCCAAC ACCTGATCTA 552 0 

AAACGTGAAG ATAAAAGATA TCCC C TTTTT TCATCTTATC CAATTGACTG AACAATTCTG 5580 

CCGTTGGCAA TCCTCTGTGA GCAGTGATCA CTGTATGGGT ATTTTCACCT CCAACAGGCA 5640 

GCGAAGCCCC TTCTAACACC CCTCCCCCTT TCTGAAGAAT GTCCTCACTC CTTCCGACAT 5700 

ACATCGGAAT TTCCTGATCA ATCGCAGGAA TTTCCACATA GCCAATCCCC TCATGGACCT 57 60 

TTAGCATATT GGCATATTCT GAGACGCCTT TCTTTTTCTC TTGCTCTGTA AAAGGATCAA 5820 

GAATTTCAGA TGGTTTCAAG GTCGCATTGA AGGCTTGAGC CAAGCGCCAA CGCTCCTCAA 5880 

GTTCTGCCTT ATCCATCTGG GAAACCGTCT CATCAAACTC TTTAATAACC TCGTTTGACT 5 940 

CAATACGATA ATAATAACGA GACACCAATG GATATATCGC AACGGCGAAT CCTACTAAGA 6000 

AAATCAGAAG AAGGATCAGC GGATGTTTCT TCTTTTTTGT GCCTTTTTTT CGTGAACCTC 6060 

TACTGTTGTC CATCCTCCAC CTTCACTTCC TTCCTTGCTG CTTTCAGCGC CTTCAAAGCC 6120 

TCTTCCGCTT GTTTTTTCTT CTTGCGCAAG CGTCGAATAA TCCATAAAAG AATCACAATC 6180 

AAACCAACTC CCACATAAAA CAGGTAGCGA TAGAGATGAC TGAGTTTGTT TGCTGCAATA 62 40 

AATTCTTCCT CAACCTCTGC TACCTACGGT ATCCGATGCC CCCGAACCAA TAGACGATCG 6 3 CO 

GTATTCATCA TGTATGGCGT ACAAGTCAGC AAGGTCACAT AATCATGACC TCGTACAATC 63 60 
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AATAAATCAT CAAAGTTCGT CCCCTCAATC ACCTTTACTT GATCCACTTG ATAGGCCATC 64 20 

ACTTCCTTGA TATTGTGCAC ATAAAACTTA TCCCCAACTT TAACTTTGCT CAAATCCGTA 64 80 

AACATCT7AC C7GTTGGCAA ACCTGTATGT GCCGTAATCA CCGCATGGGT CGAATTGCCT 6540 

CCGATCGCCA CAGAAGTTCC CTCTAGATGC CCAGCCCCTT GCTGCAATAC CTCTTCAGCA 6600 

GTACCAGCAT AAACCGGCAA ATCCACGTCA ATAACGCCGA TTTCCACATG CCCCATCCGC 6660 

TCATGGATTT CTAACATACG TGCATACTCT GCTCGCCCTT TT TTCTTCAT TTCTTCCGAC 6720 

CAAGGATCGC CACTCACTAC ATTATTCAAA GAGTCATTGA AGGCTTCTGC CAATTTCATT 67 80 

CCTTCATCAA TGTCAGCCTC ATCCAACGTT GCTTTTTCCT TATCAAAGTC AGCAATTTGT 6840 

TGATTTGATT CCACTCGATA ATACAAGCGA GACACCAGCC CATACGCCAT TACCGCCATT 6900 

CCAATGAAAA ATACCACTCC TAATAGGAGA TTATTTCGTT TTTGCTTTTT TGTTTTTACC 6960 

ATTTTTATCA GCATCCCTTT ATCTTCAAAC TTCAGGGTAT C 7001 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



GAGGGAGCTT 


AAGAAGTTAC 


CACCGTCCTC 


TAGCGCCTTA 


TCCGCATCAA 


AGTTAACCTT 


60 


GATATTTTTA 


AAACTGTCGC 


CAGCTTCTGA 


TACGATGCTT 


TCTTTAAGCT 


CATTTAGGGT 


120 


TTTAGTGAAA 


TCTGCATTGC 


TGAGGATATC 


ACTCTTTGAG 


AGATTCAAGG 


CAAAATTGAT 


180 


GATGATATTG 


ATCTGGTTTC 


CTGTTATGAC 


C7GATCAAGT 


TTGTAATTTT 


TTAAGGTATC 


240 


TTCAACAATC 


TTGCGGATAT 


CTTCTTCTGT 


CAGATTTCCC 


TTACTTTCTT 


TAGCTTTGCC 


300 


GAGTCCTGAC 


TTGATATCAG 


CTAGGGCAAC 


GTTTAATTTA 


TTAGCATCAT 


AGCCTGAirT 


360 


GTCCTTGTTT 


TCAGCATTGA 


TATCTGACAA 


AGCTTTTAGC 


TCTTCTTGAG 


CCAAATCTTT 


420 


ATTAGCTTGT 


GGCACCTTGG 


CTCCATTAGC 


CTCTAGCGAA 


TAGTAAATCC 


CTGCTAAACC 


480 


ACTTTCTCCT 


GTAACTGGAA 


TAGGGGCTGC 


TACAGTGATT 


TTGGCATGTT 


CCATACCCAG 


540 


CGTTACTGCT 


GCGTTTCGGT 


ACATATCCTG 


AGTCACCTTA 


GTGATATTTT 


CTGGTGTTTC 


600 


AATCTTGACC 


TCAAGTGGCG 


ATTTGTCACC 


TAGCTTTTGA 


ATCTTGGCTG 


ATGAATACAA 


660 


CTGTAAGCTA 


GAGTCATTGG 


CCACATTCAT 


GATTTTAGAA 


TAAACATCAG 


GTGTCATGGT 


720 


CTTGAGTTCT 


TTGGTATCTG 


TTGAGGCATT 


GTAGCCCAGT 


TTTTTAAGAG 


TTTGATTTTT 


780 
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TTGGTCTTCA GATAGGGAGG AACCTAGGAC ATATTCAGGT TGGACATAGG TTTCATCCAT 84 0 

AACTTTTTGA ACATCTGTTG CTGCATGGAC GCTATTCATA GCTGTTACTC CCCACAAGAT 900 

CGCAGCGCTA GTCAGAAAGA GTTTCTTTCT CATAGGGAAT TTCCTCCTTT ACTTCTTTAG 9 60 

AGTAATATAT CTATCTTAAA GAAAACTTAT AACAAAAACA CCTCGTCTAG CCAGATGTTG 1020 

AAAAGAGAGT GAAACATTTG ATGATGTAAA GGTTAACTCG TACCTGTCTA GAATAATAAT 1080 

AGTTTCCTCC ATT T AC AT AG AGTTCAGCAC CGTGAAAAAT GGAAATGGCG TGAATATAAC 1140 

TATAAGTCTT TCCAGTCCTA TTACCAAGCA AGGGGGCAAC AGTCTCACGA GAGTACTGTT 1200 

TGGCTAGAGC CAGGGTATTT TCCTTGCCAT TTTGGGCGAT AAAATCGATA TAGGCAGGTC 1260 

CAAAATTATA GGCTTGAACA GCTGTCCAGA TATCTACCCC CTTCTTCTGC GCCAGATAGA 1320 

GATTGCCTGT CAGAGTTTGA ATGCCTTGCC GAATGCTAGA GGCATTATCA TTGATGGTGT 1380 

TGGTGGAACC ACTTGCAGAC TC ACT AG ACT GCATAACATC GCCTTCTTTT CCTTTTGTTT 144 0 

CACTATAAAT CATAGCAAGC ACAAGCTCTT CGTTTGCTGG GGTGTCTTGT TCACTCAATA 1500 

TTTCTCGCAC CATGGGTTGA TAGGTCATGA CTTCTTTGAC ATCTTGATGA ACGCGGTAAG 1560 

CTTTATAGCC AGCAAAAACG AAGACTGCTA GTACAAGCAC TCTTCGAATT CGTTTAAACA 1620 

TTATTTACTT TGGATATCCT CGATATTTTT GATTAACATA GAGTAGGTTC CATTTTCGTT 1680 

TTGGATAAAC TCAACAGACT CGGCGTCTTG ATAGACGTTA TTGGGAACCA TGAGCTCAAT 1740 

TCCATTTGAT AAGGAGAGTT TTTGGTTTTC AAATTTCTTT AATTGGCGAC TGGCATCAAT 1800 

TTCATCAAAT TGAACACGTT CTGGTACGGC TTCTTTCACT TCGTCAATAA AGCTCAAACG 1860 

AGCCGTCAGA TTCTTGTCAA AAAGGTCATT AGCCAATTTC TCACGTGACA ATTCATTGCT 1920 

TTCTTCTAGG TTGTTGAAAA TAGCTCATTT GACCTTGGAT TCAAATTGAA AATCATCTGT 1980 

GTTAAAAGAT TTAGCAATTC TCTGGGCTGT TTTTTCCAGT TCCTTGATAG ATTTTTTAGG 2 040 

AGAAATCTTA GGAGCGACAG CAAGAAGATT ATCTGAAAAA TAGTTCAAAA AAGTCCCGTT 2100 

GTACTTGATT CGTTTTTCAA TCAGGTGATA CTTCCTACTC TGAAGATTGA CCACCAAGGC 2160 

CTCATCAGCT CCTGTTCCAA ATCCAGGCAG GTTATTCTGA GTTAGCTTGA TTGGATTATC 2 220 

AACTTCTCCT CCGAGGTGGG TCAAGGTCTC CCGCAGGGCA ATTCGCAAGA AAGCGAAATG 2 280 

TTCTACACCT TCTTTAGAAA ATTGCACAAA AATCAAGTCA TTGGTCTTGA GATTTTCAGA 2 340 

AATGCTAAAC TCCTCTTTCC AGAGATTAGC CAGCGTTACT GATGTCTCCA ACAAATCGTC 2 400 

TGTAATATGA TTGAAGAAGG GATTTTCTTC TTCGAAAATC CCAGTCTTGG CTTCATCTGA 2 4 60 

ATACACATGT TCAATTTTTT TACGCAGGTA TTCTTCGATT TTTGGAGTAA TATTCAGAAA . 2 520 
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CTTATCTGCT AAGAACAGTT CGGTATCATC CGGACTGAAC TGGTGAATAA TGGCTTTCTT 2580 

AATATAAATG TCCATAAAAG TTTTAGTCCT CGTATAATGG GAAGGCATCT GTCAATTCTT 2640 

TGACTGCACT TCTCACTTCT TCTAATACAG CCTCATTTTC TGAATTCTTA AGGGTTTTAA 2700 

TGATGAGTTC AGCCACTTTG CGACTTTCTT CTTCACCAAA TCCACGTGCA GTAATGGCTG 27 60 

CTGCTCCGAT ACGAATCCCA CTTGTCTTGA ATGGTGACAA GCTTTCGTAA GGGATTGAGT 2820 

TTTTATTTAA GGTAATATTG ACTTCATCCA ACAAGTTTTG AGCAACTTTG CCGTTTTCTA 2880 

CAACTTTAGT CACATCAACA AGGAAGAGAT GGTTTTCAGT TCCACCTGAA ATAATACGGA 2940 

AATCAGGGTC TTCCAAGAAG ACATCTCCCA TAGCCTTCCT GTTCTTAATT ACATTGGCAG 3000 

CATATTCCTT GAAGGCTGGA TCCAAAACTT CTTTGAAGGA AACTGCCTTA GCCGCCACAA 3060 

CATGCTCTAA AGGACCGCCC TGAATACCTG GGAAAATAGC TGAATTGATT TTTTTAGCAA 3120 

GTTCTTCGTC ATTGGTCAAA ATCAAACCAC CACGAGGTCC ACGAAGCCTT TTGTGGGTCG 3180 

TTGTTGTTGT GATATGAGCG TATGGAACTG GGCTTGGATG AAGGCCAGCC GCAACCAAGC 3240 

CAGCGATATG GGCCATGTCC ACCATGAGCT TCGCACCGAC AGCATCTGCG ATTTCACGGA 3 300 

ATTTTGAAAA ATCCATAATT TGAGAATAGG CTGAAGCACC AGCTACAATC AGTTTTGGTT 3 360 

TTACTTCTTC GGCTT G TTTC AAGATAGCAT CAAAGTCTAA GAGTTCCGTT TTAGGATCAA 3420 

CACTATAAGA AACAAAGTTG TAGCTTTGAC CAGAGAAGCT AACAGGAGCC CCATGAGTCA 3480 

AATCACCACC TGATGCCAAA TCCATTCCCA TAACCGTATC ACCTGGCTCA ATCAAGGACA 3 540 

TGTAAGCCGC ACAGTTAGCT TGGCTTCCTG AATGTGGTTG AACATTGGCA AATTTAGCAC 3 600 

CGAAAATTTC TTTTGCGCCT TCAATAGCAA GAGTCTCTAC AACGTCTACT ACATCAGTTC 3660 

CACCATAATA ACGGCGTCCT GGGTAACCCT CGGCATATTT ATTTGTCAAG ATAGACCCTT 3720 

GAGCTGCCAT AACAGCCTTG GAAACTACGT TTTCCGAAGC AATTAACTCG ATATTATTTT 3780 

GTTCGCGTTC TTCTTCTTTG GCAATAGCAT TCCAGAGATC AGCATCATAT GCTTTAAAAT 3840 

CATCTTTGTC AAAAATCATA GGTCTTCTCC TTTATTGTCT GACTAGTCCA TTAGTTTGAT 3 900 

TTTACAATAA GAAAATCAAA CTAACAGATG CGAATAAACC GTTTCTGCAT TTTATCACAA 3960 

GTATAGCCAA CTTTTTCATA AAATGCATGA GCACCCAGAC GATGATTGGC AG AATTT AAG 4020 

CGGATAAACC CATAACCACA TCTTTTTGCT TCTT C TTCCA ACCCTTGTAG TAAACTTTTA 4 030 

CCAATACCTT GACCTTGCGC TTGAGGTGAA ACTGCTAAAG CTAAGATATT AAATCCTGCT 4140 

TTGGAATAGA GTGATTCGTA AACTTCAGCG TGGACATATC CAAGTAAGAC ATGATTAGCT 4 200 

GCATCCTCAT AGCCAAGTAG GAAATGATGG GAATCCTGAG ACAGTCTAGC TAGTTGGCTA 4260 

GCCGTTTCCT CTGGACTAAA AGTATAACCC AAACCCTCTT GGTTGATCTC AC AT AT AG CT 43 20 
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CTTCTGCTGG CTACGTTCTT CCTGCATCTC AACCTTGATA TTGGTTGGCA ACTGAACGAT 6120 

ACGAACGGCA GTCGCAACCT TATTGACGTT CTGTCCACCA GCACCAGAGG CGTGATAGAT 6180 

GTCGACACGA AGGT C TTTTG GATCAATGTC GTATTCAACC TCTTCAACTT CTGGCATAAC 6240 

AAGAACTGTC GCTGTCGAAG TATGAACACG GCCTTGGCTT TCTGTCACAG CAACACGTTC 6300 

CACACGGTGG GCACCTGATT CATACTTAAG CTTAGAGTAT ACAGACTGAC CTGAAACCAT 63 60 

AGCAACCACT TCTTTAAAAC CACCGACACC ATTCATAGAG GCTTCCATGA CTTCAAAGCG 6420 

CCAACCTTGG GCTTCCGCAT ACTTTTCGTA CATAGTTACC AAATCTCCAG CGAAAAGTGC 6480 

CGCTTCGTCT CCACCAGCTG CTCCACGGAT TTCAAGGATG ATATTCTTGT CATCGTTTGG 654 0 

ATCCTTTGGA AGGAGCAAAA TTTTCAGTTT TTCTTCATAT TCTTCTTTTT CAGCCTTGGC 6600 

ATCTTTGAGT TCTTGCTTGG CCAATTCTTC CAAGTCCGCA TCTCCGCCTC ATTCCTTAAT 6660 

CATCTCTTCG GCATCGACGA TATTTTGAAG GACTTGTTTA TACTCACGGT AGGCTATTAC 6720 

GGTGTCACCA TTGGAAGCTT CTTCTTTTGA AAGCTCCATA AAACGCTTGC TGTCTGAAAC 6780 

GACATCAGGG TCACTCAGCA ATTCTCCTAA TTCTTCATAA CGCTCTTCTA CAACTTGTAG 6840 

TTGATCATAG ATGTTCATTT TTTCTCCTTA TTTCTCAATT GTTAAATCAT ACATTGCTAC 6900 

TACTTCATTC TCGGATATTT CCCCAGTTTC TTTAAATCCA TAACTGAGGT AACAAAATCT 6960 

TGCCTGTTCA TTTTCTGGTT CATArGACAA CCAAAGTTTA TTGCTTAAAC CTCCTGGCGC 7020 

TCTTCGAACA TAGTCTAGTA CTTTATCCAT AATTGGTTTA AAATATCCTT GATTTTGAAA 7080 

ATTCTTATCA ATCATAAAAC GAAATAGTAA ATAATTTCCA CTACTAATTC CGATCTTTTT 7140 

ATCATAAGCT ATCATCACAA AACCTATAAT TGCATCATTA TCATAAACTG CCAATGGACC 7 2 00 

TACAAAATCT CCATTTTTAG TGTAGACGTA TCCTTCAGCT AAACTAATTG CGTTGGTTGC 72 60 

AATGAATTGT TTTTGATATT CCTTCACATC CAAATTTAAA ACATCAAAAT AATTTTCCAT 7 320 

TGTAACATCT CTTAGTTCAA TTGTCATAGT TTTGCTCCTT GTTAGAGGTT ATCATTGGCG 7 3 80 

CAAAATAATG TTTACGGCAA ACTGAGATAT AGGTTTCGTT ACCACCAATC TGCATCTG'1*T 7440 

CTCCATCGTA AACGGGCAGT CCATCCTGTG TTCGCAACAC CATGGTCGCC TTTTTCTTGC 7500 

AATACTGACA GATGGTCTTG ATTTCGTCAA TCTTGTCTGC TAAAAGCAAG AGATATTTGG 7 560 

AACCTTCGAA CAATTCATTG CGAAAGTCAT TTTTCAACCC AAAAGCCATG ACGGGTATGT 7620 

CTAACTCGTC CACAACACGA GCTAGGTCGT AAACATGGTG GCGTTTGAGA AACTGGGCTT 7 680 

CATCGACCAA AACACAGTAA GGTTTTTCTG GTAGGTCTCG GATATAGCCA AAGATATCCG 77 40 

TTCTTTCCTC AATCGCAAgG GCAGGGCGTT TCATGCCAAT TCGACTCGAC ACATAGCCAA 7800 

CGCCGTCACG CCTATCCAGA GCCGAGGTCA TAATCACAAC ACCTTTTCCT TGCTCCTCGT 78 6C 
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AGTTATAGGC CACTTTGAGA ATCTCAATCG TTTTACCAGA GTTCATGGTC CCATAACGAT 7920 

AGTACAACTG TGCCATGTTT CTTGCTTCAC GTCCATTTCT AAATTTTTGC TACATTCTAG 7980 

TATATCATAA TTTTCTTAAG CTTTAAACGG CAAAATGTCG TAAAATAGAA GAAATCAAAA 8040 

ACTAGTGGAG GAAGCTATTA TGCCATTTGT ACGCATCGAT TTATTTGAAG GACGCACGCT 8100 

CGAGCAAAAG AAAGCTCTTG CTAAGGAAGT AACGGAAGCA GTTGTCCGCA ACACTGGAGC 8160 

CCCTCAATCT GCTGTCCATG TCATCATCAA CGACATGCCA GAAGGAACTT ACTTCCCACA 8220 

AGGGGAAATG CGTACTAAAT AAGCTAGCTT AAGCAGAATT GCTTAGGCTT TTTCAATCTC 82 80 

CAAGTAGCAT TCATTGAAGA AATATCCTAA ATTTGTTACA ATTTGAAAAG AAACTTGGAG 8 340 

AATTTCCAAG AAAAGAGCTA TTAATTAAAG GAAACATTAT GATTACACGT GAATTTGATA 8400 

CCATCGCTGC TATCTCTACT CCACTAGGTG AAGGGGCTAT TGGTATTGTC CGCCTGAGCG 8460 

gAACAGACAG TTTTGCTATT GCGCAAAAGA TTTTTAAAGG AAAAGACTTG AACAAGGTTG 8520 

CCAGCCACAC TCTCAACTAC GGTCACATTA TTGATCCTCT GACTCCTAAA GTCATGGACG 8580 

AGGTTATGGT TGGGGCTATG AAGTCTCCAA AGACCTTCAC TCGTGAGGAT ATTATCGAGA 8640 

TTAACACCCA CGGTGGGATT GCGGTGACCA ATGAAATTCT CCACCTAGCT ATTCGTGAAG 8700 

GGGCTCGGTT GGCAGAACCT GGTGAATTTA CCAAACGTGC TTTTTTAAAC GGTCGCGTAG 87 60 

ACTTGACACA GGCAGAGCCT GTGATGGATA TCATCCGTGC CAAGACTGAC AAGGCCATGA 8820 

ACATTCCGGT CAAACAATTA GACGGCTCCC TTTCTGACCT GATTAACAAT ACCCGTCAAG 8880 

AAATCCTCAA TACACTTGCC CAAGTTGAGG TCAATATCGA CTATCCTGAG TATGACGATG 8940 

TTGAGGAAGC CACTACTGCT GTTGTCCGAG AGAAGACAAT GGAGTTTGAG CAATTACTAA 9000 

CCAAACTCCT T AGG AC AG C A CGTCGTGCTA AAATCCTTCC TGAAGGAATT TCAACGGCTA 9060 

TCATTCGACG TCCCAACGTT GGGAAATCAA GCCTTCTCAA CAACCTCTTG CGTGAGGACA 9120 

AGGCTATCGT AACAGATATC GCTGGGACAA CACGACATCT CATCGAAGAG TACGTCAACA 9180 

TCAATCGTGT ACCTCTCAAA TTGATTGATA CAGCCGGTAT TCGTGAAACG CATGATATCG 92 40 

TTGAACAAAT TGGAGTTGAG CGTTCGAAAA AAGCTCTTAA GGAACCTCAC CTAGTTCTGC 9300 

T ACT ACT AAA CGCTAGTGAA CCACTAACCG CCCAAGATCG CCAACTCCTA GAAATCAGTC 93 60 

AGGAGACTAA TCGCATTATT CTTCTTAACA AAACTGACCT GCCTGAAACG ATTGAAACTT 94 2 0 

CGGAACTACC TGAAGATGTC ATCCGCATTT CAGTTCTTAA AAATCAAAAC ATCGATAAAA 9480 

TCGAAGAGAG AATCAACAAC CTCTTCTTTG AAAATGCTGG TTTGGTTGAG CAAGATGCTA 9540 

CCTACTTGTC AAACGCCCGT CACATTTCCT TCATTGACAA GGCCGTTGAA AGCCTACAAG 9600 
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CTGTTAACCA AGGTCTTGAA CTAGGGATGC CAGTTGACTT GCTTCAAGTT GACTTGACCC 9660 

GTACTTGGGA AATTCTAGGA GAAATCACTG GAGATGCTGC TCCAGATGAA CTCATCACCC 97 20 

AACTCTTTAG CCAATTCTGT TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA 97 80 

TGGATTTTAG GTTCTATAAT ATTTCTAGTG GGTAAATCCA CTATAGATAT TATGGAGCCT 984 0 

ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA AACAACTCAT 9900 

TAGAAAGAAT CATATGGAAC AATTACATTT TATCACAAAA TTACTAGACA TTAAAGACCC 99 60 

TAATATCCAG ATTTTAGACA TCATCAATAA GG AT AC AC AC AAGGAAATCA TCGCCAAACT 10020 

GGACTACGAC GCCCCATCTT GCCCTGAGTG CGGAAACCAA TTGAAGAAAT ATGACTTTCA 10080 

AAAAACCTTC TAAAATTCCT TATCTTGAAA CGACTGGTAT GCCCACTAGA ATTCTCCTTA 10140 

GAAAGCGTCG ATTCAAGTGC TATCACTGTT CAAAAATGAT GGTCGCTGAA ACTTCTATCG 10200 

TCAAGAAGAA TCACCAAATC CCTCGTATCA TCAACCAAAA GATTGCTCAA AAGTTAATTG 10260 

AAAAGATTTC TATGACTGAT ATTGCCCATC AGCTTTCCAT CTCAACTTCA ACTGTTATTC 10320 

GTAAGCTCAA TGACTTTCAC TTTAAACATG ATTTTTCTTG TCTTCCTGAG ATTATGTCTT 103 80 

GGGATGAGTA TGCTTTTACA AAAGGGAAGA T 10411 
(2) INFORMATION FOR SEQ ID MO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2393 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

GTTTTGGGTT CTGGAAATTA TCAGATGGTT GGAAAAGCCG TCCACATCAA GATAGTGTTC 60 

GGAGATTTAA GTTTAAATTG AAGAAACTAA CACAGAGGAA ATGGAGTATA GACCTAACAA 120 

GACGTATTGA GCAACTGAAT TTGTCTATTC GACGATGGAT AAACTATTGC TCATTGGGAA 180 

ATATGAAAAG TATAGTCGCC AGCATAGATG AGCGCTTGCG TACTCGCCTA CGAGTGATTA 240 

TCTGGAAGCA ATGGAAGAAG AAATCGAGAC GATTATGGGG ATTGCTTAAC TTAGGAGTTC 300 

CTAAATGGAT AGCAGATAAG GTATCTGGCT GGGGCGACCA TTATCAATTA GTAGCTCAGA 3 60 

AGTCGGTACT TAAACGTGCT ATATCAAAAC CAGTCCTGGA AAAACGTGGA CTGGTTTCGT 4 20 

GTTTGGATTA TTACCTTGAA CGACATGCCT TAAAAGTTAG TTGAACCGCC GTATGCCAAA 4 80 

CGGCACGTAC GGTGGTGTGA GAGGGGCTAG AGATTATCCC CTACTCGATT AACTCCCCTG 540 

AAATTTATTT TAATTATGCA AATTTCACG7 ATTTTTGATG CTGAGACGAC GATCCTGGGA 600 
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ACTTTTCAGA TATTTTTTTG 
GGATTTGAGC GTTTTTCTGA 
TAATTATTCT ACTAACTAAC 
TCCTAAAATT AGGAATAATA 
TTTGTTTTCT GCTATTTTAT 
ACTAAAGGGG GAGCGCTACA 
TGCAAATTTA GCAGATATTT 
AAAATCAGTA ATTGCCACAT 
GAGTCTTTTA GTTCCGTTGG 
TCAATTTGGA AAGACTATAT 
CGTAGCGCCT TTGGTGACCT 
AGCACCCGTT TCCTATGCTA 
AGCCTTATCA ATGACTGGTG 
GTTTGCAACA ATTGGTCTGT 
TAGCTTTCTG ATGTTATTTC 
TCTTGAAATT TTGCTCAAAG 
TGTATCACCA AATTTATTGG 
TGTTTTTGTA ACGGAGTTAT 
ATACTCTATT GGTATTATAA 
TGCTGCTAAA TGGGAACCCC 
CCTTAGCTTA GATCCTGGAT 
AAAAGAGTTT CCCCTTTATG 
CATGAACAGT TTACCAAATC 
TGGAGGTCAT TTAACCCAGT 
GAAACTAAAA GAGCGGATTT 
TTATTCGGAT TCAGATATCC 
GGACTATGCT TGTAAAGAAT 
GCAGCTTGCT TCACAGCCAA 
CCATAGTTTG CGATGCCTCA 
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ACTATCTAAA TCTATCATTA 

TTTTTAAGAC TTTTTCCAGT 

TAACTTCTTA GTACTAGCCA 

AAGGCAATAG TTTTTGTTTT 

GCTAAAATAT TAAAAATCAA 

TGTCTAATTC ATTTGTCAAG 

TCTTTAGAGT AACAATCATT 

CACTAGTTCC TATCTTAATA 

TTACTAAAAG CTTAGCCCTA 

TATTGGCGAT ACTGGTAGGA 

ATCTATTTGT TGTTGCAATT 

TTGTGCCACG CTATGCCACC 

AAGCTGTTCA ATTGATAGGT 

TACCTACCAC GTGTATCAAT 

TTCCTAACGC TGAAGTGGAG 

GTTGGAAGTT AGTTCCTAGA 

AAATTTTTTC AAATACGATT 

TAAATAAAAC GGAAAGTTAC 

TTAGTGGCTT AATTGCTTTT 

AATTATTCAC CCCAAATCTA 

GGTTTCTTTT TTCACCCAAT 

GTATAAGTGT AGAAAAAAAC 

ATCACTTCCA AAACAAGTCT 

ATGGTGGTCT TATCTTTTTT 

CTAACTATTT AGTAACGAAT 

TTGTCCAGTT CCTCTTTCAA 

TGTCAGCTGA TGCCTACTTT 

CCTTATCCCG TTTTCTTTCC 

ACCTTGAATT GGTCGAATTC 



GAAAAGCTTA GAGCGCCAAA 660 

CTCTTTTTCG ATTGAAGATG 720 

ACAACGATAA TCATAATTCC 780 

TTCATGTAAA AAACCTCACT 840 

ATTTAATTCC AAAGTTTGTA 900 

TTGTTAGTCT CTCAATTATT 960 

GCTAACATAT ACATTATTTC 1020 

GGAATATCCT CTTTTGTTGC 1080 

AATAGGGTTT TATCTTTATC 1140 

ATGTTTACCG TAATGCAATC 1200 

TCCATACTAG ATGGTTTTGC 1260 

GATTTGGGTA AGGCTAATTC 1320 

TGGGGATTAG GTGCACTCTT 1380 

TTAGTCTTGT ATATCATTTC 1440 

GTGTTAGAGT CAGAAACTAA 1500 

AATCCTAGAT TAAGACTTTT 1560 

TGGGTTTCTT CC ATT AT ACT 162 0 

TGGGGATATT CTAATACAGC 1680 

AGGCTATCTG AAAACTTCCT 1740 

AAAACCATCC AGAATCCTTC 1800 

GGCTGTTTTT TACT AG AC AA 1860 

ACAAAAAGAA AGGAAACTCA 1920 

TTTTACCAAC TATCTTTCGA 1980 

CAGGAACTTT TTTCCCACTT 2040 

GACCAACGCC GCTACTGTCG 2100 

CTGTTAACAG GTTATGGAAC 2160 

CCAAAATTGT TGGAAGGAGG 2220 

AGAACTGACG AGGAAACAGT 2280 

TTTTTACAGT TTCACCACCT 2340 
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AAACCAACTC ATTGTAGATA ACGATTCTAC CCATTTCACA ACTTATGGCA AGC 23 93 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4762 base pairs 
(Bl TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

TTTGTATCTT TTTAGGTCTC TTTCAATCCA AACCCTTTAA ACTATACGTC ATTTCGGTTC 60 

CTGCAAGTCT TGTGCTAATT TTAGGTTTGA TTTTACTTTT CTTTTCACAA GAGCCTCTGC 120 

ACGCTTCTTA TTTGATGGTC GTCTTCCCTG TTTTCCTACT TTTATTGGTA ACCAATATTA 180 

AGAGTCAACA GAGGGGGCGT ACTGCTAGAA GAAGCCGAAG AGAAACGCCA TTATGCCTAT 240 

GGAGTCGTTT CTTCAAAGGA AATCTATATC TGCTA G TTTT TCGGTTTGTC TATCTTTTCT 300 

CTGTTCCTTT TTTGATGAAG TTTGTCCTTT ATCCAGTACC TTATCAAGAA CGTAATCGTC 3 60 

TTGCTGATTT GGTAAAAGAG GAGACAAATA CGGAAGATGC TATCTCATGC ATGGGATGAT 420 

ACTCCGACTC TTTATCGTAA GAGTGAGCGC TTGTCCCATC CCCCATTTTG TCCCCGTTGC 480 

ACTATACAGC AACTGAGGAA AATCGTAATA AGTTACTTAA TGACTTCAAA GAAAAACAAC 540 

CTAAGGTGAT TGTGGTAAAT GATAAGGTGG TAGTCTGGTC TGAAGTGGAA ACACTCTTAA 600 

AAGAAAATTA CCAACAAGTA AAGACTGATT ACTCAGAGTT TAAAGTCTAT AAAATTAAAT 660 

AACCAAATCA ATATCTTGTG TATTTTTAAA AATTTTAGGA TTTTTAACAC AAGATATTGA 720 

TTTTTCTTTT TAGAGTGGTA TAATACTTTT TAGAAAGAAC ATTTTAGAAA AGAGCATGCA 780 

TATGATTGCA CTAGAAGAAA AAATTACAAT TTTGCCAACT CTCTTCGTCG AGAAACGAGA 840 

TGGGAGACGT GTTGTATTTG ATGTGGACAA GATTGACAAG GCTCTCCACA AGGCGGCTGA 900 

CAAGGTTATG GATGTGACAC CCCTGGTTGA AAAATGCCTC AATGATCTGA CTGAGCGAAT 960 

TATTACAGAA ATTCATACTC GCTTTCCACA GGGAATTAAG ATTTACGAAA TTCAAAATAT 1020 

CGTAGAACAT GAACTCCTTG AACCCAAAGA ATATGCGCTG GCTCAGGAGT ATATTACTTA 1080 

TCGGACACAG AGGGATTTTG AGCGCTCAAA AGCGACGGAT ATCAACTTTA GTATTCATAA 1140 

ACTTCTCAAC AAAGACCAGA CAGTTGTCAA TGAAAACCCT AATAAAGACA GTGATGTCTT 1200 

TAACACTCAG CGTGATTTGA CAGCAGGGAT TGTTGCGAAA TCAATCGGAC TGCAAATGCT 1260 

TCCTAAGCAC GTAGCCAATG CCCACCAAAA GGGGGATATC CACTATCACG ATTTGGACTA 1320 

CAGTCCCTAT ACCCCTATGA CCAACTGCTG TTTGATTGAT TTTAACGGTA TGTTGGAAAA 1380 
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1500 
1S60 
1620 
1680 
1740 
1300 
1860 
1920 
1980 
2040 
2100 
2160 
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TGGTTTTAAG ATTGCAAATG CAGAGGTAGA GAGTCCCAAG TCTATCCACA CTGCGACACC 
ACAGATTTCT CAAATCATTG CCAACCTTGC TTCTAGCCAG TACGGTGGCT GTTCACCTGA 
CCGTATCGAT GAAATTTTCG CGCCTTATGC AGAGAAGAAT TATCAAAAAC ATCTCAAAGA 
TGCAGAAGAG TGGGTATTGC CTGAAAAACA GGAAGATTAC GCTTGGAAGA AAGCGCAAAA 
GGACATCTAC GATGCCATGC AATCTCTTGA GTATGAAATC AATACTCTCT TCACTTCAAA 
" TGGACAAACA CCTTTTACTT CGTTAGGTTT TGGTCTGGGA ACCACTCGTT TTGAACGAGA 
AATTCAAAAA GCTATTTTAA ACATTCGCAT CAAGGGTCTT GGTTCAGAAC ACCGTACGCC 
TATCTTTCCT AAACTTATCT TTACGCTTAA AAGAGGCCTC AACTTAGAGG AAGGAACTCC 
CAACTATGAC ATCAAGCAGT TGGCTCTAGA GTGTGCAACC AAGCGGATCT ATCCAGACGT 
CTTCTCTTAT GATAAGATTG TTGATTTGAC AGGTTCTTTC AAGGTGCCTA TGGGCTGCCG 
TTCTTTCCTT CAAGGGTGGA AGGATGAAAA TGGTGTAGAA GTCAATTCAG GTCGCATGAA 
TCTGGCTGTT GTGACGGTTA ATCTGCCTCG TATTGCTCTT GAGTCTGAAG GTGATATGAA 
TAAGTTCTGG GAAATCTTCA ACGAGCGAAT GAATATCGCA CAAGATCCTC TTGTTTACCG 
TGTCGAACGC ACTAAAGAGG CGACACCAGC GAATGCTCCT ATTCTTTATC AGTACGGTGC 2 220 

TTTTGGCCAT CGTCTAGGTA AAGAAGAAAG TGTTGACCAG CTCTTTAAGA ATCGTCGTGC 2280 
GACCGTTTCC CTGGGCTATA TCGGCTTGTA TGAAGTAGCG ACAGTTTTCT TTGGTAACAG 2 340 

CTGGGAAAGT AATCCAGATG CTAAGGAATT CACGCTAGAC ATCATTCACG ATATGAAACG 2 400 

CCGTGTAGAA GAGTGGTCAG ACCAATATGG CTACCATTTC TCTATCTACT CAACACCATC 2460 
CGAAAGTCTG ACAGACCGTT TCTGCCGACT AGATATAGAC AAGTTTGCCT CTATTCCTGA 2 520 

TATCACAGAC AAGGAATACT ACACCAACTC TTTCCACTAC CATGTTCCTA AAAATCCAAC 2 580 

ACCGTTTGAA AAATTGGACT TTCAGAAAGT CTATCCGGAA GCAGGTGCGT CAGGTGGTTT 2 640 

CATCCATTAT TGTGAGTATC CAGTCCTTCA GCAAAATCCA AAGGCCTTGG AAGCTGTCTC 2 7 00 

GGATTATGCT TATGACCGTG TAGGCTATCT AGGCACCAAT ACTCCGATTG ACCGTTGCTA 27 60 

CAAGTGTGAC TTTGAAGGGG ATTTTGAACC AACTGAGAGA GGCTTTGCTT GTCCAAACTG 2820 
TGGCAATAGC GACCCTAAAA CAGTAGATGT GGTGAAACGA ACTTGTCGCT ACCTAGGTAA 2880 
TCCTCAACCA AGACCGATGG TCAACGGGCG TCACAAGGAA ATCCCTGCCC GTGTCAAACA 2 940 

TATGAATGGT TCAACGATTA AAATAGCTGG GCATCAAGTA ACAAATTAGA AAGAAATGAA 
ATGGGAAAAT ATCAACTAGA CGATAAGGGG CGCGCACAAG TGACCCGTTA TCACGAGAAA 
CACTCTAAAG GTGGAGCTGG TAAGAAAGAA CGCTTGCTTA GCTTCAGAGA ACAATTTTTA 



3000 
3060 
3120 
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AACAAGAACA AGAAAAAATA AAAGTGAGAG CCAGCTCTCC CTTTTCTCAT AGTGGGAGGT 3180 
AAGGATGGAA TTACGCACAC CAAGATTAGC GGATAAGAAA GCTGTTTTAG ATATGATGAC 3240 
AGAGTTTGAA AAATTTCAGT CCCCTCACCA CGGCGGTTTC TGCCATACAG AGAACTTTGT 3300 
GTATGAAGAC TGGTTAGAAA GCAATCAGGA ACAGGAAATG GGGATTAATC TGCCTGAAGG 3360 
ATGGGTTTCT GCAATTCAGT TAGTGGCTTT TTCTGAGAAA GGTCAAGCAG TTGGATTTCT 3420 
TAATCTCCGG TTGCGCCTCA GTAACTTTCT ACTAGAAGAA GGTGGCCACA TTGGCTACTC 3480 
CATTCGTCCA TCTGAAAGAG GCAAGGGTTA TGCAAAAGAG ACTCTCCGTC AGGGCTTGCA 3 540 

AGTTGCTAAG GAAAAGAACA TCAAGAAAGC TCTGGTGACC TGTAGTGTGA ATAATCCTGC 3600 
TAGCAGAGCA GTCATTCTAG CAAATGGTGG AATATTTGAG GATGCTCGCA ATGGACTCGA 3660 
GCGTTATTGG ATAGAGGTAG CGAATGAATA ATCCAAAACC ACAAGAATGG AAAAGCGAGG 3720 
AACTTAGTCA ACCTCCTATC ATTGACTACA AGGCCTTTAA CTTTGTGGAC GGCGAAGGCG ' 3780 

TGCGCAACTC TCTCTATGTA TCAGGCTGTA TCTTTCACTG CGAGGGATGT TATAATGTTG 3840 

CGACTTGGTC TTTTAATGCT GGCATTCCCT ATACAGCAGA ATTAGAAGAG CAGATTATGG 3900 

CAGACCTTGC CCAACCCTAT GTTCAAGGCT TGACTTTGCT GGGAGGGGAG CCTTTTCTCA 3960 

ATACTGGGAT TCTCTTGCCA CTTGTTAAGC GGATTCGGAA GGAATTGCCA GACAAGGACA 4020 

TCTGGTCCTG GACCGGCTAC ACTTGGGAAG AAATGATGTT GGAAACTCCA GATAAACTGG 4080 

AATTCTTGTC ACTGATTGAC ATTCTTGTCG ATGGAAGATA TGATCGAACT AAGAGAAATC 4140 

TTATGCTCCA GTTTCGACGT TCATCTAACC AACGAATTAT CGATGTGCAA AAATCGCTCA 4200 

AAAGT GGGC A AGTAGTGATT TGGGACAAGC TCAATGACCG AAAAGAAAGC 7ATCAACAGC 4 260 

TGAAGAGAGA ATGAAGAAAA AGCACTTACT AGACCAACTA GTCTCAGAGA TCGAGACGGG 4320 

GAAAGTCAGG ACACTGGGAA TATACGGTCA TGGAGCTTCA GCTAAATCAA CCTTTGCACA 4380 

GGAATTGTAC CAAGCTTTAG ATTCTACTAC AGTAAATTTG CTACAGACAG ATCCTTATAT 4440 

CACCTCAGGA CGCCATCTGG TAGTACCCAA GGACGCGCCG AATCAAAAGC TGACAGCCAC 4 500 

TCTGCCACTG GCGCATGAAC TGGAGAGTTT GCAGAGAGAT ATCCTTGCTT GCAGGCGGGT 4 560 

ATGGATGTCT TGACAATTGA AGAACCTTGG AAGGCTAGTG AGGTCTTGTC TGGAGCCAAA 4620 

CCAATTTTGA TTGTCGAAGG GATGTCTGTT GGCTTTCTAC CCAAGGAACT CTTTGAAAAA 4 680 

ACCATCTGTT TCTACACGCA TGAGGAGACC GAATTAAAGC GACGCCTTGC TAGAGATACG 4740 

ACTGTGAGAA ATCGCGATGC GG 4762 
(2) INFORMATION FOR SEQ ID NO: 92: 
U) SEQUENCE CHARACTERISTICS : 



WO 98/t8931 



PCT/US97/19S88 



719 

(A) LENGTH: 3832 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 92: 



GATGtTAGG I T 






A A T*f* A C T*TT A 

rtA 1 » 4 4 4 /% 


AAGATGATGA 


TGTTAATCAG 


60 


GTACCTTGTC 


TTTGTTGGTC 






TrAGTAATTG 


GGTAGACCAT 


120 


GCGGTCTATC 


AG G AG ACGCt. 


1111 Vj A I 1 vu 


AP.AAAP.ATAG 


AAG ATG ATG C 


A^CTGCATAT 


180 


GGGTATTTAT 


& »^»/*^n & linn 

AAGAGGAAl I 


&TP ITIITITT 
A 1 Vj At. A I n. 1 1 


AP. A PP CTTT 
t AunLViV. Ill 


TAAATCAGGT 


ACCTTGGTTT 


240 


TACCGAGTGC 


CCTGCTCTTG 


L.A 11*1 ftnw 


A1\V> 1 ^ *. 1 1 V- *«. 


TTf**"AGCGAG 


CATTTTCTGG 


300 


TTTGGCAATT 


TTTCTATTTG 


riaiifs pp a 
l_ AAAA 1 ntun 


LnuuV, 1 1 nyn 


AdAAATCTCG 


CCAAGCCACA 


360 


TTGCTGAAAG 


GATTGGCAAG 




ATP.TCAAGCA 


CTCCATTTCT 


AATCTGACCG 


420 


AAAGGGGACT 


GCTi t»v-AG 1 A i 


\_Ls 1 1 n 1 *w>-» 


AATTAAATnG 


CGAAATTGAA 


TTGCTCTTTG 


480 


ATGCTAGTTT 


GGCCTTGGAA 


Lu 1 1 1 UOn 1 Vj 


APPTP. f PT w PPP. 


AGCAGTTCAT 


TCAAGTTCAG 


540 


AC C AGCT AAt. 


jv r*r*vf~ ft nip 
ACL I uAAAAv. 


p apptpaap.p. 


ATTTGCTGGA 


AACCTTCCAG 


CAGGAGTTGC 


600 


GACGATTGTT 


GACGCCTTTT 


& * t^pp. ap_p. 


ATTTP.APPAA 
nil lunVvVw/v% 


GACACTAAAG 


GAAGATGGAA 


660 


CCAGTGCTGA 


CTTGA™rAAG 


GAlfvrt. 4 V- 1 1 V- 




TTTGAATGGA 


AAACCAAACT 


720 


GGAAGTACAT 




TTCAP.AAAPT 




AGGAATCAAG 


AGTGTGCCTC 


780 


AAATTGAGGC 


CAAGAGAGCA 


GAAAGAGAAG 


CAAGCAATCC 


TCAGTTGACA 


CAGGTATCTG 


840 


CAGATTTCAT 


AAATGCCATG 


GATCTCTGGA 


AGGATTAATC 


CATGCAACTA 


GGCTTGAAAT 


900 


CCGAGTAAGA 


TTTGCAAGCT 


GTGTATAATT 


GTGATAGAAT 


AAATAGAAAA 


TAAATTGAAA 


960 


AAAGAGGTAT 


GTGAAATGTC 


ACCTAAACCA 


TTTATCGCTG 


GTAACTGGAA 


AATGAACAAA 


1020 


AATCCAGAAG 


AAGCTAAAGC 


ATTCGTTGAA 


GCAGTTGCAT 


CAAAACTTCC 


TTCATCAGAT 


1080 


CTTGTTGAAG 


CAGGTATCGC 


TGCTCCAGCT 


CTTGATTTGA 


CAACTGTTCT 


TGCTGTTGCA 


1140 


AAAGGCTCAA 


ACCTTAAAGT 


TGCTGCTCAA 


AACTGCTACT 


TTGAAAATGC 


AGGTGCTTTC 


1200 


ACTGGTGAAA 


CTAGCCCACA 


AGTTTTGAAA 


GAAATCGGTA 


CTGACTACGT 


TGTTATCGCT 


1260 


CACTCAGAAC 


GCCGTGACTA 


CTTCCATGAA 


ACTGATGAAG 


ATATCAACAA 


AAAAGCAAAA 


1320 


GCAATCTTTG 


CGAACGGTAT 


GCTTCCAATC 


ATCTGTTGTG 


GTGAATCACT 


TGAAACTTAC 


1380 


GAAGCTGGTA 


AAGCTGCTGA 


ATTCGTAGGT 


CCTCAAGTAT 


CTGCTGCATT 


GGCTGGATTG 


1440 


ACTGCTGAAC 


AAGTTGCTGC 


CTCAGTTATC 


GCTTATGAGC 


CAATCTGGGC 


TATCGGTACT 


1S00 
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GGTAAATCAG CTTCACAAGA CGATGCACAA AAAATGTGTA AAGTTGTTCG TGACGTTGTA 1560 

GCTGCTGACT TTGGTCAAGA AGTCGCAGAC AAAGTTCGTG TTCAATACGG TGGTTCTCTT 1620 

AAACCTGAAA ATGTTGCTTC ATACATGGCT TGCCCAGACG TTGACGGTGC CCTTGTAGGT 1680 

GCTGCCTCAC TTGAAGCTGA AAGCTTCTTG GCTTTGCTTG ACTTTGTAAA ATAATCAGTA 1740 

AGTAGCAAAA GCTAGGTGCA ACAGCATTCA GATGTCTCTT ACATTTTTTA TAGGAGAGAA 1800 

AGATTGAAAA CAAAAATTGG ATTAGCAAGT ATCTGTTTAC TAGGCTTGCC AACTAGTCAT 1860 

CTCGCTGCAA ATGAAACTGA AGTAGCAAAA ACTTCGCAGG ATACAACGAC AGCTTCAAGT 1920 

AGTTCAGAGC AAAATCAGTC TTCTAATAAA ACGCAAACGA GCGCAGAAGT ACAGACTAAT 1980 

GCTGCTGCCC ACTGGGATCG GGATTATTAT GTAAAGGATG ATGGTTCTAA AGCTCAAAGT 2040 

GAATCGATTT TTGACAACTA CTATAAGGCT TGGTTTTATA TTAATTCAGA TGGTCGTTAC 2100 

TCGCAGAATG AATGGCATGG AAATTACTAC CTGAAATCAG GTGGATATAT GGCCCAAAAC 2160 

CAGTGGATC? ATGACAGTAA TTACAAGAGT TGGTTTTATC TCAAGTCACA TGGGGCTTAT 2220 

GCTCATCAAG AATGGCAATT GATTGGAAAT AAGTGGTACT ACTTCAAGAA GTGGGGTTAC 2280 

ATGGCTAAAA GCCAATGGCA AGGAAGTTAT TTCTTGAATG GTCAAGGAGC TATGATGCAA 2340 

AATGAATGGC TCTATGATCC AGCCTATTCT GCTTATTTTT ATCTAAAATC CGATGGAACT 2400 

TATGCTAACC AAGAGTGGCA AAAAGTGGGC GGCAAATGGT ACTATTTCAA GAACTGGGGC 2460 

TATATGGCTC GGAATGAGTG GCAAGGCAAC TACTATTTGA CTCGAAGTGG TGCCATGGCG 2520 

ACTGACGAAG TGATTATGGA TGGTACTCGC TATATCTTTG CGGCCTCTGG TGAGCTCAAA 2580 

GAAAAAAAAG ATTTGAATGT CGCCTGGGTT CACAGAGATG GTAACCGCTA TTTCTTTAAT 2640 

AATAGAGAAG AACAAGTGGG AACCGAACAT GCTAAGAAAC TCATTGATAT TAGTGAGCAC 2700 

AATGGTCGTA TCAATGATTG GAAAAAGGTT ATTGATGAGA ACGAAGTGGA TGGTGTCATT 2760 

GTTCGTCTAG GTTATAGCGG TAAAGAAGAC AAGGAATTGG CGCATAACAT TAAGGAGTTA 2820 

AACCGTCTGG GAATTCCTTA TGGTGTCTAT CTCTATACCT ATGCTGAAAA TGAGACCCAT 2880 

GCTCAGACTG ACGCTAAACA GACCATTGAA CTTATAAAGA AATACAATAT GAACCTGTCT 2940 

TACCCTATCT ATTATGATGT TGAGAATTGG GAATATGTAA ATAAGAGCAA GAGAGCTCCA 3000 

AGTGATACAG GCACTTGGCT TAAAATCATC AACAAGTACA TGGACACGAT GAAGCAGGCG 3060 

GCTTATCAAA ATGTGTATGT CTATAGCTAT CGTAGTTTAT TACAGACGCG TTTAAAACAC 3120 

CCAGATATTT TAAAACATGT AAACTGGGTA GCGGCCTATA CGAATGCTTT AGAATGGGAA 3180 

AACCCTCATT ATTCAGGAAA AAAAGGTTGG CAATATACCT CTTCTGAATA CATGAAAGGA 3 240 

ATCCAAGGGC GCGTAGATGT CAGCGTTTGG TATTAAGCCA TGATTTGAAA GAGCGATGTG 3 300 
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ATAGTAGCAC CCTCTTTTTC TTTGTTTTAT GATAGTTCAT CCTCGAGTAA ATTCAAGTTC 3 360 

TTGCTCGGAA ATGAAGCTTA TATAGTAGAT TGAATATAGA CAAATACCTT GTGATTGGTA 3420 

AAACATTTTA GAAATTCATT TACCTTTCCT AATCGACTTG CTTTCATCTT ATTTCAATCT 3480 

ATTATAGTAT TGGGGAATTT CTTCAAACCA CATCAGCTTG GTCAGTTCTA CCTGCGACCT 3540 

CAAAACTTGT GCTTTGGTCA AGCTGGGTTT AGTTTCCTAG TTTGCTGATG GATTTCCATT 3600 

GACTATAAGC ATCCAACCCT CTTTTTGTCT TCTAAAGAAT TCTTAAATTA TCAGTCTATT 3 660 

GCAACTTTTC TCATATAAGT TCTTTGTCTT GCTATTGGTT TTCCTTAGTA GTATACTAAG 3720 

GTAGTAATCA TTAAGAAGTG GTTACAAAAA ATAATGAATC AGGTAAAGAA AATGGTAGAA 3 780 

TTGAAAAAAG AAGCAGTAAA AGACGTAACA TCATTGACAA AAGCAGCGCC GG 383 2 

(2) INFORMATION FOR SEQ ID NO: 93: 

( i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10690 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(0) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 



TGAAAAAATC 


CTCATGAACC 


TGGCGCCAAT 


AGACAAGTGT 


CTTGTTTCCC 


TCACCTTCCT 


60 


TATAGGCATG 


GTCAGCTGAC 


ACTCGATTGA 


AGGGTTTAAC 


AGAAACCTTT 


GTAATTTCGA 


120 


CAATGCAGAC 


AGCCTGATTT 


TGACTATCTA 


AAATGACATC 


GAAGGTCCCT 


ACTTGGGGAA 


180 


GTGGTTCGTC 


TTCTAGCACA 


TAGAGGTCAT 


AGGCTGATGC 


TGTTGCTGTC 


TTTTCTCCTT 


240 


TAAACACCAA 


ATCCGCTAAA 


AGGTCTGGTT 


CAACTCCAAA 


AGCCCAGGCA 


TCGATTTCAT 


300 


CTCCGATCAA AGGATTGATT 


TGCTTGTATT 


TATTCCACAT 


TTCTTGCGGT 


ATCATGGGTG 


360 


CTCCTTTGTA 


ATTTTTTACT 


TTCTTCTTTT 


ATGTGTTTAA 


GATCATCTGG 


ATGGTCAATC 


420 


TCTAAATCAA 


AAATCTCTGG 


AATAGAACTG 


TAGTGGATAA 


TGCACTTGAT 


ACCCAACTGA 


480 


TTCATTTTTT 


GTATGAAAGA 


AGTATTCAGA 


TAGCCTGCTA 


CAGCAAAATC 


AATCTTGTTC 


540 


TTTCTTGCTT 


TATCCTGCAT 


ATCTCTTAGC 


ATATCTAACA 


TTATTGGACT 


TTCCATATCA 


600 


TGCCATTGAC 


TGTTTCTCAT 


AGTCGCAAAA 


ACAAAGCAAG 


TCAAATCATT 


CATTCCAACT 


660 


ACAATCTTTG 


AAATGCCCGT 


TTCC AGTAT A 


CTAGATAACT 


CAAAATACGC 


TGACGGTAAT 


720 


TCAATCATCG TTCCGACTTT 


CCCAGTAAAA 


CCCTGCTGAC 


GCAATACTGT 


AATACCTTGT 


730 


7TTAATTGGT 


CGG CATC ATT 


GACAAAAGGA 


AAGATAACAG 


ATACATTGGG 


GTTGGTTTGA 


340 
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TAAACTTCTG TAACGACATG TCCTTCAGCC TGAAATTCAT CCAAACACGC CAGTAAACGC 900 

CTAGTTCCTC TATAGCCAAA CAAGGGATGC CCTTCGTCAA AAAACTCTTT AGTCCCCACT 960 

AAACAATTGG CTTCTGTATT CGTTAATTCA GTAAAACGAT ACCAAACTTC CTTACCTAAG 1020 

TAAAAGGAGC AAATACTATC AAGATAATCT TTCACAAATT CCTGACAACT TTCTAATACT 1080 

ATATTTTGAT TGAGCTCTCT CAATAAGTAT TCCCCACGAA-TCATGCCGAC GTGGTGAAAT 1140 

AGTTGAGGAT AAATTTTTTC AAGAATTTTT TCGCCACTAA CGGCAAGTTG ATTTCTCATC 1200 

ATTCACCTTC CAATTCATGT AAGAAGTCTT GTCCAGTTCT GGAAATCCTA ATAATTCAGA 12 60 

CTTAACCTTC AAGACTAATG GCGATGCATT TTCTTCTGTA ATCTCTTGAA TATCCATCCA 13 20 

AATATATCCA AGTGAATCAT TCGCACCATC AGACACAGCT TCCGAAATCG TAACTTGACC 13 80 

TGCACTCTCA TTCATTTCAA CATCATACAA GGCTATGACA TGGTGAACCA TAAAATTTTT 1440 

TAACTCTTCC CTGACGAAAA CATCGTACAT TCGAGGATTA GAGTAGCTTC T AAC AGT AAA 1500 

TCCCCTCTCT TCCATAACTT CTCTAGTCAG CGTTTCCGTC ACTCCTTCAC CAAGTTGCTG 1560 

ACTGCCTCCA GGTAGATCAT ACCGATGTTG A7AAGGCCC? CTCGTTTTTT CAATGCAAAG 1620 

TAACTTTCCA TTTTCAAAGC AAACACAGTA GACCCCAAAG TGATTTTTGA TTTCCATCCA 1680 

ACTCCTCCTA CTTCAAAGAC CAGCCACCAT CTATTGTCAA GATTTGTCCT TGCATGGCGC 17 40 

TCGCTTTTCC ACTTGCTAAA AAAAGACTAA GCTCTGCTAT TTCCTCTGGC TCAATCCAGC 1800 

GCTTGATTGG GGTTTCACTA GCCACCCAGT CAGCCAAACC ACCTGGTTCA AAATCCGCAG 1860 

CGGTCATAGC TGTCTTGACT GCTCCTGGAG CGATACCAAA GACCTGAATC CCAGCTTCAG 1920 

CATAGTCTAG AGCCAACTGC TTGGTGAAGC CAGCCAAGGC ATCCTTGCAT GAAGTATAGG 1980 

CGTGACCACC TCCACCTGCT AGCCTAGAAC CAATGGAACA CATATTCATC ATGATTCCCT 2040 

TTTTATTTTC CAGCATTTGT GTCAAATAAT ACCGAGTCAA CTCTACTGGA ATAATGTAGT 2100 

TGATTTCAAA AATCTCTTGA ATGTCCTGCG CCGTTTGTTC CAACAGTGGT TTGTAATCAT 2160 

CCAAAACTCC AGCAGTATTA CACAAAACAT CCACCTGAGG GCACCAGTCA AAAATAGGTT 2220 

CCAAGTCCAA GGTCAAATCT CTCTGTAAAA AGCGAAAATC ACCCTCTAAG AGTGGCTTTT 2280 

CACCTTGGTC AACTCCATAA ACTTGATAGC CCTTCTCTAA AAAGAGGCGA GCTTGAGCCA 2340 

ATCCGATCCC TGAACTCACT CCTGTAATGA GTACACGTTT AGTCATGCAC TTCTACCCAA 2400 

TCCCTTGCCA AAACATCACA AACTGTCGGG CTCCACATGG AAAAACCTTC TCCTTCGCCA 2460 

GAAACGTTGA TTAGGAAATA AGGTCTCATT TCAAGTGCAA GCCCATTTTG CTCGATGGTA 2520 

TCAAACACTT GGACATAGTT TTCCGCACCT CCCCAACCAG TTCGTACATA TTTTCTCTTA 2580 

GCCTTTAACC CAGGCAGCAT CTCTTCAAAT GTCATGTTTT TCTCCTTTAA TTCTACATTC 2 640 
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I .LAI l IMl 


T ATAPPAAAA 


aappppttta 


T A C f* r^TTTT 


TGAATGTGAG 


TTATTCAAAC 


2700 


i_ rv*t- 1 au. 1 a*_ 


TT APPfLP AAA 


TTAT*" , PPPTG 


CAGCAAGATA 


AATTTCATAC 


CATTCTTTTC 


2760 


TTGTTAAGCT 


AAAu 1 * luLL 


pptpppptaa 


CTTCTCTCAA 

*w 1 1 *w 1 V* 1 <P%^1 


GTGCTTAGGA 


TTTGTTGTAC 


2820 


CTACGACTGC 


CTG CATTTTT 




GPAATATPPA 


AGAAATGGCA 

rt.VJ ****** A WW^*> 


ATAGTTGAAG 


2880 


AGGTTACTCC 


AT ATTT AAT A 




P AAPT A PTTG 


ATTTAAAGCT 


TGAAATTTCT 


2940 


CATTTCCAAC 


AAAAl 1 CuL. r 


tta a A AT APf* 


PPiAATTPTAA 


GACAGACCAT 


GCTTGAATGA 


3000 


CCACATCGTG 


TAATTGQiLAA 


T A TTr A A A A A 


TPPTP.PP ATP 


TCGCATAGCT 


GCTTGACTAT 


3060 


CTTC C AT ATT 


AAC ATV» AAAA 


PPTP.ATTPAA 


ATCCTGGAGT 


AAAAGCCGCA 


CTCAATTGTA 


3120 


GCTGATTAAU 


auL 1 nALuuL 


tppttgapa.? 


CTTTTTTAAG 


CAACTCCATC 


ATCATAGGAT 


3180 


TTTGATTAGA 


AACTCt-AAAA 


TPTPP A A PT*** 


T APPTTPiTTT 


ATAAAGGAGA 


TTAAAGGCTT 


3240 


CTGCTACTTG 


f"*T/* i/^lTTVT 


ATP A A AGP AT 


PTPPTCGATG 


AAGGAGCAAG 


CTATCTAGAT 


3300 


GATCAATCTT 


C AATCTTTG C 


A A A AT APPPT 


PTAP^PATTT 


TAT AAT AT AG 


TCCTTAGAAA 


3360 


AATCAAAATA 


GCTAAA l 1 v_ T 


T/^A ATPPPA A 


TPPPAPATTT 


GGACTGAATC 


CACATCTTTT 


3420 


CTCTTAAATC 


TGGACGATTT 


TTT APPAPAA 


PAPCTAACAG 


TTCTTCACAA 


CGACCACGAC 


3480 


CATAAATATC 


AGCCAAGTi-<j 


& APPPATTP.A 


TTPPAAPAGA 


AAGTGCTGTT 


TCTACAAGCT 


3540 


CTTCAACTTC 


TTTTACAGAT 


TT AT CTTTT A 


TTPTPATPAT 


TCCGAGAACA 


ATTTCTGATA 


3600 


ATTCTTTGTC 


ATCTTG AC UA 


AP APTATPT 


ATPTCATPAA 


ATTTttCTCC 


TTTAATTTCT 


3660 


AACATTCTTC 


CCTTC ATT AT 


&&PAAAAA AP 


PP PTTG P AA 


CGACTTTTTG 


ACTATACTTC 


3720 


ACTCCATTTT 


ATCTTCTTAA 


a f*f*r* a PPP A A 


PAAPAPAAAG 


A TT CC AAT AA 


AGAGGACAGC 


3780 


T AAAGG AAT A 


ACtTTTGTAA 


PPAAAACATT 


TGAAATTCCC 


ATCCACTCAT 


AATAACGGAG 


3840 


CAGAGAACCC 


AL.L ALAAun 1 


PPP P AAT AAT 


CATACTGACA 


AATGGACGAA 


AGACCGCTTC 


3900 


TTTCCAATTC 


CAAATACCGA 


TAACTAGCGA 


AATCGTAAAG 


ACAGACAAAC 


TATCCCAGGG 


J 70U 


AGCCGGAATA 


taaaaggctc 


CTTCTTGTAT 


GAACCTTGCC 


ATTCCTACAT 


ATCCTAAAAC 


4020 


AACTAGAAGA 


ACTATAGTCC 


CAACAACAAT 


GTAAGTGCCA 


ATTTTCATTT 


TAGGAGAATC 


4080 


TTGGACTAAA 


CTTCTTCGTA 


AAATTGTGGC 


CACAAGTCCA 


AATCCAATCA 


GAAAAATAAG 


4140 


AAGTTGCCCT 


AAAAATGTGA 


GCAAATTGAC 


TGTTAAGAGA 


GGACCTTTAG 


AAAAATCACT 


4200 


TAGTAGTTGA 


TAATAACGTA 


ATACCGCCAG 


GACAAGAATT 


GGCGTCAAAA 


GGGACTCTTT 


4260 


GATAGAACTG 


CGAGGTGCTC 


CCTTGAGAAT 


CTCTTTCATT 


ATTTTTTTAG 


GATTCTTACC 


4320 


TAGATAATCC 


TCTGCACTCA 


TGCCATCTCG 


TTCTGCTTCT 


GAGAAATCTA 


GCATCATCAA 


4380 
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ATAGATCTCC TCTCTCAGAT AGTCTTCATC ATAGAGAAAT CCAGCAAGAT TAAAACTTTC 4440 

CCACAACTCC TCAAAATACT TTTGATTCTC CTCAGAAAAC TCATGTAGCA AAGCGCTTCT 4500 

TTCTTCGTAA TACTTCATTT TCTTCATGGT TTAACCCCCA TTCTTAATCC CTTCTACTTT 4 560 

TTCACTCAAA TCGTCCCATT GTTGCCAAAA GACTGAGACA CGCTCTTCTC CTTCTTTCAT 4620 

TAATGAAAAA TACTTCCGAT CTGGACCATC TGGCGACCCC CGCATGTCGC CTCTTATCCA 4680 

TTGATTTTTT TCTAACTTTT GCAACAAAGG ATAAATAGTT CCTGGAACGA TAGTATCAAA 4740 

TCCAGCCTCT CGCAAAGTCT GAACCAACTC ATAACCATAC CGCTCTTTTT GACCAATCAT 4 SCO 

ATCCAAGACA CAACCTTCAA GAACACCTTT TAATAGCTGA GTTTCTTTCA TCACTTCTCC 4860 

CTTCTAATCT A7TTTCTAAT ACCTACTAGT GACTTCACCT ATAGTATATC ACTTCTACAC 4920 

TAGTTTGTAA AGCATAATAG TTAATACTCT TCGAAAATCT CTTCAAACCA CGTCAGCGTC 4980 

GCCCTACCCT ATCTATCGTT ACTGACTTCG TCACTTTCAT CTACAACCTC AAAAACATGT 5040 

TTTGAGCTGA CTTCGTCAGT TTCATCTACA ACCTCAAAAC AGTGTTTTGA GCTGACTTCG 5100 

TCACTTTCAT CTACAACCTC AAAACACTGT TTTGAGCTGA CTTCGTCAGT TTCATCTACA 5160 

ACCTCAAAAA CATGTTTTGA GCTGACTTCG TCAGTTTCGT CTACAACCTC AAAACAGTGT 5220 

TTTGAGCAAC CTGCGGCTAG CTTCCTAGTT TGCTCTTTGA TTTTCATTCA GTATAAATAA 5280 

AAAAACAGAA CTAGCCTGAA CTAGTCCTGT CTACTTTTAC CCAATCACAC TTCCATTTGG 5340 

TACAGCTGGA TCAACTGTGA GAAGGGTTAA TTTGCCATCA TGTTCAGCTG AGAGAATCAT 5400 

ACCCTGGCTG ACATATTTTT TCATCATTTT ACGTGGTTTG AGGTTACCAA CGATTTGAAC 5460 

TTTCTTGCCG ACCAATTCTT GTTCATTTGG ATAGTATTTT GCAATTCCTG AAAGAATCTG 5520 

ACGATCTTCT CCATCACCAG CATCCAAGCG GAATTGAAGC AACTTATCTG AACCTTCTAC 5580 

TTTAGACACT TCTTTGACTT CTGCGACACG GATTTCAACC TTGTCAAAGT CTTCAAACTT 5640 

GATTTCATCC TTGTTTAGTT TCAGCTCAAC TTCGTCCGGA TTCCATTCTT TTTCGACTGC 5700 

TGGTTTATTG CCTTCCATTT CTTCCTTCAT ATAGGCGATT TCTTCTTCCA TATTTAGACG 5760 

TGGAAAGATA GGTCTTCCTT TGGCAACTAC AGTCACATCT GCTGGGAAGT CAGCCAAACT 5820 

CAAGTTTTCA AGACTAGAAA CTTCTTCCAA ACCAAGTTGA GTCAAAACTG CACCACTAGT 5880 

TTCCATCATA AATGGTTCAA TCAAGTGAGC AACTACACGA ATGCTGGCTG CCAAGTGGCT 5940 

CATGACACTT GCCAATTGGT CACGAACAGC TTCATCCTTG GCCAAGACCC ATGGTGCGGT 6000 

CTCATCGATG TATTTATTGG TACGAGAGAT CAGAGTCCAG ACTGCTTCAA GCGCACGTGG 6060 

ATAGTCAACT GCTTCCATGT GTGTATGGAA GTCTGCGATT GATTGTwCTG CAACCTCACC 6120 

AAGAACATGA TCATATTCAG TCACACCTTC TACATAGGCA GGGATTTGTC CATCAAAGTA 6180 
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CTTATTAATC ATGGAAACCG TACGGTTAAG GAGCTTCCCA AGGTCATTAG CCAATTCATA 6 240 

GTTGATACGG CCGACATAGT CTTCAGGAGT AAAGGTTCCC TCTGAACCAA CTGGAAGGTT 6300 

ACGCATGACG TAGTAACGAA CTGGATCTAG TCCATAACGC TCTACCAACA TTTCAGGGTA 6360 

AACGACATTC CCTTTTGACT TAGACATTTT TCCGTCTTTC ATGACAAACC AACCATGGGC 6420 

AATCAAACGA TCAGGTAATT TAACATCCAA CATCATAAGA AGGATTGCCC AGTAGATAGA 6480 

GTGGAAGCGA AGGATATCTT TTCCTACCAT ATGGAAGACT GTTCCATTCC AGAACTTGTC 6540 

AAAGTTACCA TGTTCGTCTT GAGCGTAGCC AAGAGCTGTC GCATAGTTAA CAAGGGCATC 6600 

AATCCAAACG TAGACAACGT GTTTTGGATT TGATGGGACA GGCACTCCCC ATGTAAAGGT 6660 

TGTACGAGAT ACCGCCAAAT CTTCCAAGCC TGGCTCGATG AAGTTGCGTA GCATTTCATT 6720 

AAGGCGACCA TCTGGCGTGA TAAATTCAGG ATGACCTTTG AAAAATTCGA CCAAACGGTC 6780 

TTGGTATTTG CTAAGGCGAA GGAAGTATGA TTCTTCAGAA AcCCATTCAA CCTCATGACC 6840 

TGATGGAGCA ATACCACCAG TCACATTTCC AGCTTCATCA CGGAAAACTT CTGCCAGCTG 6900 

GCTTTCTGTA AAGAATTCTT CCTCTCATAC TGAATACCAA CCAGAGTATT CACCCAAGTA 6960 

GATATCATCT TGAGCAAGTA AGCGTTCAAA GACTTGTGCG ACAACTTTTT CATGGTAGTC 7020 

ATCAGTTGTA CGGATAAATT TATCGTATGA GATATCTAGT AATTGCCAGA GTTCTTTAAC 7080 

TCCAACCGCC ATTCCATCAA CATAGGCTTG AGGTGTAATA CCAGCTTCTT CCCCTTTCTG 7140 

CTGGATTTTC TGACCATGTT CATCAAGACC TGTCAGATAA AATACATCCT AGCCCATCAG 7200 

GCGTTTGTAA CGTGCTAGGA CATCACATGC GATAGTTGTG TAGGCAGAAC CGATATGAAG 7260 

TTTCCCACAT GGATAGTAAA TCGGCGTTGT AATATAAAAA TTTTTTTCAG ACATAATTTT 732 0 

TCCTTTCCAG GCAAATGAAA CCTGTTTTTC TAACACTTCA TTATATCACA TTTTTAATGA 7380 

ATTTCAATAG GGAAATCCAT ACAAAAACAA GATAGACGAG TGTCCATCTT GTTGATCTCA 74 40 

TTCATAACGA AGGGCTTCAA TTGGATCAAG TTTCGATGCC TTGTTGGCTG GCAAGACTCC 7500 

AAAAATCATA CCAACACTAG CCGAAACTGC AAGACTAAAT AGGGCGACTG GGATTGATAC 7560 

TCCAACTTCT ATACCTTCTA TTAAACCTTG CAGTAACAAA CCTGCTAAGg CAGTTAAACC 7 620 

ACTTGCAATT GTCAAGCCAA TTAAGCCACC TAACAAGGTC AAAATCATGG ATTCAATCAA 7 680 

AAACTGAATT AAAATATTGG CACGTGTTGC ACCCAAAGCC TTACGAAGAC CAATCTCACG 7740 

AGTGCGCTCT GTCACCGAAA CCAGCATGAT GTTCATGACA CCAGTTCCTC CAACAAAGAG 7800 

AGAAATCCCT GCGATGGAAC TAATAATCGT CGTCATAAAA CTAAACGATT GTTGAATTTC 7860 

TGCAAATACA ACGGACTCAT CTGCCAGCTG GTATTCTCCC TGTTGTAAGC CTGCAAGCTC 7 920 
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TGTCATTTTT 


CGTGCCAGTT 


CTGGACCCAG 


AGTTGGGCTT 


AAACTGGTAT 


CATTCACTCG 


7980 


AAACACAATA 


TTAGCTATTT 


CATCTACATT 


AAAATTCGCA 


GCAAGGGAGA 


TATTGGTACT 


3040 


AATAGGCAAG 


CCACCAAACC 


CATATATTTT 


TGATCTTTTA 


GCCTCCGGAC 


TAGTATAAAC 


8100 


CCCAATGACC 


CGGTAACTAA 


ATCCATTGAC 


TTCTACAACC 


TTGTTAATAG 


CCTCTTGAGG 


3160 


AGATTCAAAT 


AAACTAATGG 


ACAATTCCTC 


ATCTAGCAAA 


ATGACACTTG 


CAAACTCTTT 


8220 


GAAATCTTGC 


TCTCTCAGAC 


TACGACCTGC 


AATAATTTCA 


TTCTTAACAG 


CGTCCATGTA 


8230 


AGTTCTGTTT 


CCACCTGTCA 


AATTAGCATT 


CTCAACCTTT 


TTATCTTGAT 


AGGTCAAGAT 


8340 


GGCATTCGTT 


GAATTGGTTA 


CATAGTAACT 


ATCCACTCCC 


TTCAGTTTAG 


CTGCCTCTTG 


3400 


GACCCAGGAT 


TCTTGCGGTT 


TTGGCGGTTC 


AACAGGAACT 


TCCTCTTCCT 


TTCCAGAAAC 


8460 


CGTAAAAGCT 


GATTGTTTCT 


GAGTAAAAGA 


CCCGTCTTTA 


CTTTTTTTAG 


GAGAGAAAAA 


8520 


GACGCTAATA 


TTTTTCTGAG 


ATTTAGTCAT 


ATCTTTATTG 


ACTTGACGAG 


ATAGGGAATC 


8580 


ACCCAAAGCC 


ATAATCACAA 


CAACTGATGA 


AACACCGATA 


ATAATCCCAA 


TCATAGTAAG 


8640 


CAAAGAACGC 


ATCTTGTGAG 


CCATGATAGA 


TGAAAAGGCA 


AATTTCACAT 


TCTGCATCTT 


8700 


AGTTTTCCTC 


CTTTCCTAAC 


TGAGCACTGT 


CAGACGAAAT 


GACCCCATCC 


CGAATCACAA 


8760 


TCTGACGTTT 


GGCATAGCCA 


GCAATCTCAG 


GCTCATGCGT 


TACCATGATA 


ATGGTTTTTC 


8820 


CTTCTTTATT 


CAAATCAACC 


AATAATTGCA 


TAATTTGGTT 


ACCTGTTTTG 


GTATCCAAGG 


8880 


CTCCTCTCGG 


TTCATCCGCT 


AGGATAATAG 


AAGGATTGTT 


TACCAAGGCA 


CGCGCAATGG 


8940 


CTACACGTTG 


CTTTTGACCA 


CCAGATAATT 


CTCAAGGTAA 


ATGGTGACTA 


CGTTCTGTCA 


9000 


ATTCAACCTT 


GTCTAAATAT 


TCCTCAGCCA 


ACTTGCGACG 


TTTTGAAGAC 


GAAACTCCTG 


9060 


CGTAAATCAA 


GGGCAATTCT 


ACATTTTGCA 


GAGCATTGAG 


CTTCGATAGA 


AGAAACAAC? 


9120 


GCTGAAAGAC 


AAAACCGATT 


TGTTGGTTAC 


GGACCTTAGC 


TAGTTGTTTT 


TCACCAAGCC 


9180 


CAGCCACTTC 


TTGACCTTCA 


AGATAATATT 


CTCCACTGGT 


TGCTCTATCC 


AACATGCCAA 


9240 


TCGTATTCAT 


CACAGTGGAC 


TTACCAGACC 


CAGATGGTCC 


CATGATCGCT 


ACAAATTCAC 


9300 


CCTCATTCAC 


TTCTAGATTG 


ATATTTTTGA 


GAACCTGCAG 


TTCTTGGTCA 


CCATTACGGT 


9360 


AACTTCTGAA 


GATATTTTTT 


AGACTAATTA 


GTTGCTTCAT 


CAGCCTTCAC 


CTCTTTTCCT 


9420 


TCTTCCAAGG 


AAGATGTTGG 


ATTACTGATG 


ACCTTAGCAC 


CGTTCGTTAA 


ACCAGAAGTG 


9480 


ATTTCTTGAT 


TTTCTGCGTC 


AGCATTTCCC 


AATGAAACCT 


CAACTTTTTT 


AGCCTTTTGT 


9540 


TGTTCATCCA 


CAATCCAGAC 


ATAATTTTTA 


CTATCATCCA 


TTACTAGACT 


GCTAACAGGA 


9600 


ACAAGAATAG 


CCTTAGTTTT 


GCTTTTAACC 


TCAATGTTGA 


CAGAAAAACC 


TTGTTTCAAA 


9660 


TCACCAACC? 


CGCCTGTCAC 


ATCAATAGTA 


TAAGGGTATT 


TAGAACCTGT 


ATTATTCCCG 


9720 
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GCTGCTGGAC TAGCTGCTTC ACCATTGTTT TTAGGATAGT CAGAAATATA GCTTAATTTC 9780 

CCAGTCCAT? TTTTATCAGG ATACACTTTA GAAGTAAACC TTACTTCTTG ACCTACAGAA 9840 

AGGTTGGCTA GATTGTACTC AGACAATTCT CCCTTGACTT GTAAATTTTC ATTGCTGACA 9900 

ATATGAACCA TAACTTCACT CGCCCCTGTT GGAGATTTAG AAACATTGCT ATTGACTTCG 9 960 

ACCACAGTTC CCTCTAGGGT ACTGAGAACA GTTGTTGCAT CCAATTGACT TTGAGCCTTG 10020 

CTTAATTGCG CCGCAGCATC TGCACGCGCA TCACGGGCAT CACCCAATTG AGCGTCAATA 10080 

GAAGCAACAG AATTTCCAGC CACTGGACTT GGGCTTTGCA CCCTTGCATC TTCTCCTCCT 10140 

ACTGGCGCTG GTAACTGTGG AGCCGGAGCT GAAGCGGCTT CATTTCGTGC TTGATTGAGT 10200 

TCATTGATAT GACGATCTGC CCTAGCTACT GCTCGACTAG CTGAATCATA GGCCGCCTGC 102 60 

GCTTCTGAAC TACTGTACTT GACTAAAGCC TGCCCTTCCC TGACCTTATC GCCCACAGAA 10120 

ACAAGGATTT CATCTAAATC ACCCTTACTA GCATCAAAAT AAACATATTG TTCATTTTTT 10380 

GCTCTTACTG TCCCTGACAA TAAAACAGAG GAGGCCACGC TTCCTTCCTT GGCAACAACA 10440 

AGATGAGTAG GCTCATCTTT TAGAGCAGTC TGACAACGTT GTCTAAAGAG TAAAATCCCC 10500 

CCAGCACCCA ATACAACTAC ACTCGCAGCA CCGATTGCTG CATACAGTTG CCACTTTTTA 10560 

GCTTTACCAT TCTTTTTCTT CATAATGAAA CTCCTTTTCT TTTTTACAAT ACTTTGCTAT 10620 

TATACCAAAT TTCCCTCCAG CAAACAATAC AGTTCAGGAT TAAACAATCG TTCGGAATTT 10680 

TGCTTTTCGG 10690 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 8195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

GAGAAAGCGC CCACGTTTCC CCGAAGGGAG AAAGGCGGAC AGGTATCCGC TAAGCGGCCA 60 

GGGTCGGAAC AGGAGAGCGC AACGAGGGAG CTTCCCAGGG GGAAACGCCT CGTATCTTTA 120 

TACTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG 180 

GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG 2 40 

CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT GATTCTGTGG ATAACCGTAT 3 00 

TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGCGAGTC 3 60 
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AGTGAGCGAG GAAGCGGAAG AGCGCCCAAT ACGCAAACCG CCTCTCCCCG CGCGTTGGCC 420 

GATTCATTAA TGCAGCTGGC ACGACAGGTT TCCCGACTGG AAAGCGGGCA GTGAGCGCAA 480 

CGCAATTAAT GTGAGTTAGC TCACTCATTA GGCACCCCAG GCTTTACACT TTATGCTTCC 540 

GGCTCGTATG TTGTGTGGAA TTGTGAGCGG ATAACAATTT CACACAGGAA ACAGCTATGA 600 

CaTGATTACG AATTCGAGCT CGGTACCCGG AAAATCCAGA AAATGCTTGA AAAAAATCCT 660 

AGAAGATGGT ATAATACTAA ATTGTAAGGG TTATCACATA TAACTCAAAA AAAGAAAGAA 720 

CAAAAGGAGA GTCAAACTAT GGCTTCTAAA GATTTCCACG TAGTGGCAGA AACAGGTATT 7 80 

CACGCACGTC CAGCAACATT GTTGGTACAA ACTGCTAGCA AATTTGCTTC AGATATCACT 840 

CTTGAGTACA AAGGTAAATC AGTTAACCTT AAATCAATTA TGGGTGTTAT GAGTCTTGGT 900 

GTTGGCCAAG GTGCTGACGT AACTATCTCA GCTGAAGGTG CAGATGCAGA TGACGCTATC 960 

GCTGCAATCT CAGAAACAAT GGAAAAAGAA GGATTGGCAT AAGGGAAATG ACAGAAATGC 1020 

TTAAAGGAAT CGCAGCATCT GACGGTGTTG CAGTTGCAAA AGCATATCTA CTCGTTCAGC 1080 

CGGATTTGTC ATTTGACACT ATTACAGTCG AAGATACAAA CGCAGAAGAA GCTCGCCTTG 1140 

ATGCCGCTCT ACAGGCATCA CAAGACGAGC TTTCTGTTAT TCGCGAGAAA CCAGTAGCTA 1200 

CGCTCGGTGA AGAAGCAGCT CAAGTTTTTG ATGCTCACTT AATGGTTCTT GCTGACCCAG 1260 

AAATGATCAG CCAAATCAAG CAAACTATCC GTGCGAAGAA AGTGAATGCA GAAGCAGGTC 1320 

TGAAAGAAGT TACAGATATG TTTATCACTA TCTTTGAAGG CATGGAAGAC AACCCATACA 1380 

TGCAAGAACG CGCAGcCGAT wTCCGCGACG TGACAAAACG TGTATTCGCA AACCTTCTTG 1440 

GTAAAAAATT GCCAAACCCA GCTTCTATCA ATCAAGAAGT GATTGTGATT GCGCATCAC7 1500 

TGACTCCTTC AGATACAGCT CAATTGGACA AAAACTTTGT AAAAGCTTTT GTAACCAACA 1560 

TTGGTGGACG TACAAGCCAC TCAGCTATCA TGGCACGTAC ACTTGAAATT GCTGCTGTAT 1620 

TAGGTACAAA TAACATCACT GAAATCGTTA AAGACGGTGA CATCCTTGCT GTTAACGGGA 1680 

TCACTGGAGA AGTGATTATC AACCCAACAG ATGAACAAGC GGCAGAATTT AAACCACCTG 1740 

GTGAAGCCTA TGCGAAACAA AAAGCTGAAT GGGCACTTTT GAAAGATGCT CAAACAGTGA 1800 

CTGCTGACGG TAAACACTTC GAGTTGGCTG CTAATATCGG TACTCCAAAA GACGTTGAAG 1860 

GTGTTAACAA CAACGGTGCA GAAGCTGTTG GACTTTACCG TACAGAGTTC TTGTACATGG 1920 

ATTCTCAAGA CTTCCCAACT GAAGATGAGC AGTATGAAGC ATACAAGGCT GTTCTTGAAG 1980 

GAATGAACGG TAAACCTGTT GTCCT7CGTA CAATGGATAT CGGTGGAGAT AAGGAACTTC 2040 

CTTACTTCGA TATGCCTCAC GAAATGAACC CATTCCTTGG ATTCCGTGCT CTTCGTATCT 2100 

CTATCTCTGA GACTGGAGA7 GCTATGTTCC GCACACAAAT CCGTGCTCTT CTTCGTGCGT 2160 
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CTGTTCACGG 


TCAATTGCGT 


ATCATGTTCC 


CAATGGTTGC 


GCTCTTGAAA 


GAATTCCGTG 


2220 


CAGCGAAAGC 


AGTCTTTCA? 


GAAGAAAAAG 


CAAACCTTCT 


TGCTGAAGCT 


GTTGCAGTTG 


2280 


CGGATAACAT 


CCAAGTTGGT 


ATCATGATCG 


AGATTCCTGC 


AGCCGCTATG 


CTTGCAGACC 


2340 


AATTTGCTAA 


AGAAGTTGAC 


T7CTTCTCAA 


TTGGTACAAA 


CGACTTGATC 


CAATATACAA 


2400 


TGGCAGCAGA 


CCGTATGAAC 


GAACAAGTTT 


CATACCTTTA 


CCAACCATAC 


AACCCATCAA 


2460 


~ TCCTACGCTT 


GATTAACAAT" 


GTGATCAAAG 


CAGCTCACGC 


TGAAGCTAAA 


TCGGCTGGTA 


2520 


TGTGTGGTGA 


GATGGCTGGT 


GACCAACAAG 


CTGTTCCACT 


TCTTGTCGGA 


ATGGGCTTGG 


2580 


ATGAGTTCTC 


TATGTCAGCA 


ACATCTGTAC 


TTCGTACACG 


CAGCTTGATG 


AAGAAACTCG 


2640 


ACACAGCTAA 


GATGGAAGAG 


TACGCAAACC 


GTGCCCTTAC 


AGAATGCTCA 


ACAATCGAAG 


2700 


AAGTTCTTGA 


ACTTCAAAAA 


GAATACGTTA 


ATTTTGATTA 


ATCCAAAAGT 


CCCTGCAACT 


2760 


CAGTTACAGG 


GATTTTTTTG 


ATATTTTAAA 


AAGAATTTTC 


AAGAAAATCT 


TTCTTATAGA 


2820 


AAGTCCAACC 


TTGAAAAAGT 


AGTGGTCAGA 


ACAAAAAATA 


CTTAAATGGT 


TCATAAAATT 


2880 


CTTGACAAGT 


TGGATATTTA 


GGAGTAAACT 


ATTAACCAGT 


TAAGTAATAG 


AGAGGAGTTT 


2940 


CTGCAATTTA 


GAAATGAATT 


GCAACTAGAA 


ATATCAAATA 


GAAAGAGAGT 


TTCGATGAAA 


3000 


ATTAATAAGA 


AATACCTTGT 


TGGTTCTGCG 


GCACTTTCAT 


TTTAAGTGTT 


TGTTCTTACG 


3060 


AGTTGCGACT 


GTATCAAGCT 


AGAACGGTTA 


AGCAAAATAA 


TCCTGTTTCC 


TATATAGATG 


3120 


GAAAACAAGC 


CACGCAAAAA 


ACGGAGAATT 


TGACTCCTGA 


TGAGGTTAGC 


AAGCGTGAAG 


3180 


GAATCAATGC 


TGAGCAAATC 


GTCATCAAGA 


TAACAGACCA 


AGGCTATGTC 


ACTTCACATG 


3240 


GCGACCACTA 


TCATTATTAC 


AATGGTAAGG 


TTCCTTATGA 


CCCTATCATC 


AGTGAAGAAT 


3300 


TACTCATGAA 


AGATCCAAAC 


TATAAGCTAA 


AAGATGAGGA 


TATTGTTAAT 


GAGGTCAAGG 


3360 


GTGGATATGT 


TATCAAGGTA 


GATGGAAAAT 


ACTATGTTTA 


CCTTAAGGAT 


GCTGCCCACG 


3420 


CGGATAACGT 


CCGTACAAAA 


U AWj AAA 1 1. A 


A 1 LOHUivw^ 


XC XACl ACC AT 


AGTCAACATC 


3480 


GTGAAGGTGG 


AACTCCAAGA 


AACGATGGTG 


CTGTTGCCT7 


GGCACGTTCG 


CAAGGACGCT 


3540 


ATACTACAGA 


TGATGGTTAT 


ATCTTTAATG 


CTTCTGATAT 


CATAGAGGAT 


ACTGGTGATG 


3600 


CTTATATCGT 


TCC7CATGGA 


GATCATTACC 


ATTACATTCC 


TAAGAATGAG 


TTATCAGCTA 


3660 


GCGACTTGGC 


TGCTGCAGAA 


GCCTTCCTAT 


CTGGTCGAGG 


AAATCTGTCA 


AATTCAAGAA 


3720 


CCTATCGCCG 


ACAAAATAGC 


GATAACACTT 


CAAGAACAAA 


CTGGGTACCT 


TCTGTAAGCA 


3780 


ATCCAGGAAC 


TACAAATACT 


AACACAAGCA 


ACAACAGCAA 


CACTAACAGT 


CAAGCAAGTC 


3840 


AAAGTAATGA 


CATTGATAGT 


CTCTTGAAAC 


AGCTCTACAA 


ACTGCCTTTG 


AGTCAACGAC 


3900 
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ATGTAGAATC TGATGGCCTT GTCTTTGATC CAGCACAAAT CACAAGTCGA ACAGCTAGAG 3960 

GTCTTGCAGT GCCACACGGA GATCATTACC ACTTCATCCC TTACTCTCAA ATGTCTGAAT 4020 

TGGAAGAACG AATCGCTCGT ATTATTCCCC TTCGTTATCG TTCAAACCAT TGGGTACCAG 4 080 

ATTCAAGGCC AGAACAACCA AGTCCACAAC CGACTCCGGA ACCTAGTCCA GGCCCGCAAC 4140 

CTGCACCAAA TCTTAAAATA GACTCAAATT CTTCTTTGGT TAGTCAGCTG GTACGAAAAG 4 200 

TTGGGGAAGG ATATGTATTC GAAGAAAAGG GCATCTCTCG TTATGTCTTT GCGAAAGATT 42 60 

TACCATCTGA AACTGTTAAA AATCTTGAAA GCAAGTTATC AAAACAAGAG ACTCTTTCAC 4 320 

ACACTTTAAC TGCTAAAAAA GAAAATCTTC CTCCTCGTGA CCAAGAATTT TATGATAAAG 43 80 

CATATAATCT GTTAACTGAG GCTCATAAAG CCTTGTTTGA AAATAAGCGT CGTAATTCTG 4 440 

ATTTCCAAGC CTTAGACAAA TTATTAGAAC CCTTGAATGA TGAATCGACT AATAAAGAAA 4 500 

AATTGGTAGA TGATTTATTG GCATTCCTAC CACCAATTAC CCATCCAGAG CGACTTGGCA 4560 

AACCAAATTC TCAAATTGAG TATACTGAAG ACGAAGTTCG TATTGCTCAA TTAGCTGATA 4620 

AGTATACAAC GTCAGATGGT TACATTTTTG ATCAACATGA TATAATCAGT GATGAAGGAG 4 680 

ATGCATATGT AACGCCTCAT ATGGGCCATA GTCACTGGAT TGCAAAAGAT AGCCTTTCTG 4740 

ATAAGGAAAA AGTTGCAGCT CAAGCCTATA CTAAACAAAA AGGTATCCTA CCTCCATCTC 4800 

CAGACGCAGA TGTTAAAGCA AATCCAACTG GAGATAGTGC AGCAGCTATT TACAATCGTG 4860 

TGAAAGGGGA AAAACGAATT CCACTCGTTC GACTTCCATA TATGGTTGAG CATACAGTTG 4 920 

AGGTTAAAAA CGGTAATTTG ATTATTCCTC ATAAGGATCA TTACCATAAT ATTAAATTTG 4980 

CTTGGTTTGA TGATCACACA TACAAAGCTC CAAATGGCTA TACCTTGGAA GATTTGTTTG 5040 

CGACGATTAA GTACTACGTA GAACACCCTG ACGAACGTCC ACATTCTAAT GATGGATGGG 5100 

GCAATGCCAG TGAGCATGTG TTAGGCAAGA AAGACCACAG TGAAGATCCA AATAAGAACT 5160 

TCAAAGCGGA TGAAGAGCCA GTAGAGGAAA CACCTCCTGA GCCAGAAGTC CCTCAAGTAG 5220 

AGACTGAAAA AGTAGAAGCC CAACTCAAAG AAGCAGAAGT TTTGCTTCCG AAAGTAACGG 5280 

ATTCTAGTCT GAAAGCCAAT GCAACAGAAA CTCTAGCTGG TTTACGAAAT AATTTGACTC 5340 

TTCAAATTAT GGATAACAAT AGTATCATGG CAGAAGCAGA AAAATTACTT GCGTTGTTAA 5400 

AAGGAAGTAA TCCTTCATCT GTAAGTAAGG AAAAAATAAA CTAATGAAAA ATGAAAGTCT 54 60 

CGATAAAGAG GCTTTCATTT TTATTATGTA TATATGTAAA ATTCTTGACA AGCAATATTA 5520 

AAAAGAGTAA ACTATTAACT AGTTAATTAA CCGGTTTATT ACTTTATAGT GAATCAAATA 5580 

TACTTAAGAA AAGAGGAAAG AATGAAAATT AATAAAAAAT ATCTAGCAGG TTCAGTGGCA 5640 

GTCCTTGCCC TAAGTGTTTG TTCCTATGAA CTTGGTCGTC ACCAAGCTGG TCAGGTTAAG 5700 
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AAAGAGTCTA ATCCAGTTkC TTATATAGAT GGTGATCACG CTGGTCAAAA GGCAGAAAAC 5760 

TTGACACCAG ATCAACTCAG TAAGAGGGAG GGGATCAACG CCGAACAAAT CGTCATCAAG 5820 

ATTACCCATC AAGGTTATGT GACCTCTCA? GGAGACCATT ATCATTACTA TAATGGCAAG 5880 

GTCCCTTATG ATGCCATCAT CAGTGAAGAG CTCCTCATGA AAGATCCGAA TTATCACTTC 5940 

AAGGATTCAG ACATTGTCAA TGAAATCAAG GGTGGTTATG TTATCAAGGT ACATGGAAAA 6000 

TACTATGTTT ACCTTAAGGA TGCAGCTCAT GCGGATAATA TTCGGACAAA AGAAGAGATT 6060 

AAACGTCAGA AGCAGGAACA CAGTCATAAT CACGGGGGTG GTTCTAACGA TCAAGCAGTA 6120 

GTTGCAGCCA GAGCCCAAGG ACGCTATACA ACGGATGATG GTTATATCTT CAATGCATCT 6180 

GATATCATTG AGGACACGGG TGATGCTTAT ATCGTTCCTC ACCCCGACCA TTACCATTAC 624 0 

ATTCCTAAGA ATGAGTTATC AGCTAGCGAG TTAGCTGCTG CAGAAGCCTA TTGGAATGGG 6300 

AAGCAGGGAT CTCGTCCTTC TTCAAGTTCT AGTTATAATG CAAATCCAGC TCAACCAAGA 6360 

TTGTCAGAGA ACCACAATCT GACTGTCACT CCAACTTATC ATCAAAATCA AGGGGAAAAC 6420 

ATTTCAAGCC TTTTACGTGA ATTCTATGCT AAACCCTTAT CAGAACGCCA TCTGGAATCT 6480 

GATGGCCTTA TTTTCGACCC AGCGCAAATC ACAACTCGAA CCGCCAGAGG TGTAGCTGTC 6540 

CCTCATGGTA AC CAT T AC C A CTTTATCCCT TATGAACAAA TGTCTGAATT GGAAAAACGA 6600 

ATTGCTCGTA TTATTCCCCT TCGTTATCGT TCAAACCATT GGGTACCAGA TTCAAGACCA 6660 

GAACAACCAA GTCCACAATC GACTCCCGAA CCTAGTCCAA GTCCGCAACC TGCACCAAAT 6720 

CCTCAACCAG CTCCAACCAA TCCAATTGAT GAGAAATTGG TCAAAGAACC TGTTCGAAAA 67 80 

GTAGGCGATG GTTATGTCTT TGAGGAGAAT GGAGTTTCTC GTTATATCCC AGCCAAGGAT 6840 

CTTTCAGCAC AAACAGCAGC AGGCATTGAT AGCAAACTGG CCAAGCAGGA AAGTTTATCT 6900 

CATAAGCTAG GAGCTAAGAA AACTGACCTC CCATCTAGTG ATCGAGAATT TTACAATAAG 6960 

CCTTATGACT TACTAGCAAG AATTCACCAA GATTTACTTG ATAATAAAGG TCGACAAGTT 7020 

GATTTTGAGG CTTTGGATAA CCTGTTGGAA CGACTCAAGG ATGTCyCAAG TGATAAAGTC 7080 

AAGTTAGTGG ATGATATTCT TGCCTTCTTA GCTCCCATTC GTCATCCAGA ACGTTTAGGA 7140 

AAACCAAATG CGCAAATTAC CTACACTGAT GATGAGATTC AAGTAGCCAA GTTGGCAGGC 7200 

AAGTACACAA CAGAAGACGG TTATATCTTT GATCCTCGTG ATATAACCAG TGATGAGGGG 72 60 

GATGCCTATG TAACTCCACA TATGACCCAT AGCCACTGGA TTAAAAAAGA TAGTTTGTCT 73 20 

GAAGCTGAGA GAGCGGCAGC CCACGCTTAT GCTAAAGAGA AAGGTTTGAC CCCTCCTTCG 7 380 

ACAGACCATC AGGATTCAGG AAATACTGAG GCAAAAGGAG CAGAAGCTAT CTACAACCCC 7440 
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GTGAAAGCAC CTAAGAAGGT GCCACTTCAT CGTATGCCTT 
GAAGTCAAAA ACGGTAGTTT AATCATACCT CATTATCACC 
GAGTGGTTTG ACGAAGCCCT TTATGAGGCA CCTAAGGCCT 
GCGACTGTCA AGTACTATGT CGAACATCCA AACGAACGTC 
GGTAACGCTA GCGACCATGT TCGTAAAAAT AAGGTAGACC 
GATAAGGAAC ATGATGAAGT AAGTGAGCCA ACTCACCCTG 
CACGCTGGTT TAAATCCTTC AGCAGATAAT CTTTATAAAC 
ACAGAGGAAG AAGCTGAAGA TACCACAGAT GAGGCTGAAA 
GTTATTAACG CTAAGATAGC AGATGCGGAG GCCTTGCTAG 
ATTAGACAAA ATGCTATGGA GACATTGACT GGTCTAAAAA 
AAAGATAATA ACACTATTTC AGCAGAAGTA GATAGTCTCT 
CAACCGGCTC CTATACAGTA CTAAAATGAA TGGAGCATAT 
CCTGTTACTT CTCTTTTTTA GAAAAACGTA ACAGA 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2004 base pairs 

(B) TYPE: nucleic acid 
(C» STRANDEDNESS : double 
(0) TOPOLOGY: linear 



ACAATCTTCA 
ATTACCATAA 
ATACTCTTGA 
CGCATTCAGA 
AAGACAGTAA 
AATCTGATGA 
CAAGCACTGA 
TTCCTCAAGT 
AAAAAGTAAC 
GTAGTCTTCT 
TGGCTTTGTT 
TTTATGGAGA 



ATATACTGTA 
CATCAAATTT 
GGATCTTTTG 
TAATGGTTTT 
ACCTGATGAA 
AAAAGAGAAT 
TACGGAAGAG 
AGAGAATTCT 
AGATCCTAGT 
TCTCGGAACG 
AAAAGAAAGT 
AGTAACCTTT 



7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8195 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
TTTACTAAAA GGAAAAAAGA ACTGATTTCT CACTCCTTCA TTAATCTTAT 
ATAGGTATGG GTAAACAGGT TGTTGACCTT GGTCAATCTC GACTTCAACG 
CTTCTACGAT TTCTTGAGCG ATTTCATTGG CAAGTTCTTC GCTTCCGTCT 
AGAAGGTTAC GATTTCACTG TCTTCATCCA ACATATCTTT CAAGCTTTCA 
GGTGCATATC AGGGTTTGAC ACAAGAATTT TTCCATCCAC CATACCTAAA 
CATGGATTTC TAAGCCATCG ATCGTTGTAT CACGCACGGC TGTTGTGACG 
CGACATCGCT AAGAGCAGCT GTCATACGCT CTTGGTTTTC TTCAATGGAC 
CAAAGGCAAG AACACTTGTC ATACCTTGAG GAAGAGTGCG AGCCTCTACC 
GTTGCTCCAA AACTTCTGCC CCAGATTGAG CTGCCATCAA GATGTTCTTG 
AGAAGATGAT GTTACGGGCA TTAACCTGTT CAACAGCCT7 GATAAAGTCT 
GGTTCATGGT TTGACCGCCT TCGATAACAT AATCCACCCC TTGAGAACAG 



TCCACACTAA 
TCTTCCAATT 
TCACCTACAT 
GTCAATGT IT 
TTATCGTTTT 
CTTCCGCTAA 
TTGCTTGGAT 
ACTACCGCTG 
TTGTTTGGCA 
TCTGTTGAAG 
AAGATATCTG 



60 
120 
ISO 
240 
300 
360 
420 
480 
540 
600 
660 
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CTAGACCTTT ACCAGCCACC ACAGCAATCA AAGCATACTC TTTTTCTTCA GCCGACTTGA 720 

TAACTTGAGT AGCTTCTTTC TCAACCTGTG CTTCGTGTTG GTTACGCATA TTGTCAACTT 780 

TTACCTTGAC CAAGCTACCA TATTTGAGAC CTTCTTGCAT AACAAGTCCT GGATCTTCTG 840 

TATGAACATG GACTTTGACA ATTTCATCAT CGTTAACAAC AAGGAGAGAA TCTCCAAGCT 900 

CATCCAAGTA GTTACGGAAT TCATCGTAGT CAAAATCTTT AGCATAGGTT GGACCTTCCT 960 

TAAGAGCTAC CATGATTTCA GTACACTAAC CAAACGTGAT GTCCTCAGTC GCTACGTGAC 1020 

CAGCTACAGA CTTATGATGC TCTACATTGA TCATCTCACT CATGTTGGCA GGAGTCGCTA 1080 

CAAAGTCCTC AGATGCAATA TATTCGCCAG TAAGGGCTGA AAGGAAACCT TCGTAGATCA 1140 

AGACCAATCC TTCACCACCT GAGTCCACAA CGCCAACTTC TTTCAATACT GGAAGCATGT 1200 

CTGCTGTTTT AGCTAGAGCT GTTTTAGCAC CTTCCAAGGC TGCGCGCATG ACTTCAACAG 1260 

CGTCATCTGT TTGCTCACCT TTTTTCTTAG CACCGATAGC AGCTCCACGA GAAACTGTTA 1320 

AAATCGTTCC TTCAACAGGT TTCATCACTG CCTTATAGGC AACTTCCACA CCTGATTGGA 13 80 

AGGCCAGAGC CAAGTCTTGA CCTGTTAACT CGTCTTTATC CTTGATAGCT TGGGAAAATC 1440 

CACGGAAAAG CTGAGACGTA ATCACTCCTG AGTTCCCACG CGCACCCATC AAAAGCCCTT 1500 

TGGCAAGAAT GCTCGCTACT TCTCCAACTG TAGAAGCTGG CTTCTCTGCA ACTTCTTTAG 1560 

CACCATTTTC AATGGTCATT CCCATATTTG TCCCAGTATC TCCATCTGGA ACTGGAAAGA 1620 

CGTTTAATGA ATTGACATAT TCAGCTTGCT TATTCAAGCG AGTTGATGCA GCCTGCACCA 1680 

TTTCTTGAAA TAAGCTAGTA GTAATTTTTG ACACGGTTAT TCTCCTACAA CTTTGATATT 174 0 

T7GAATGTAG ACATTTACAG TCTGAGCAGT AATTCCAAGC TGGTTTTCCA ACCTAAAGGC 1800 

AACACGCTCT TGAATGTTTT TTGACACTTC ACTAATCTTT GTTCCGTACC TTAACACGGT 1860 

ATATACATCA ACTGCAATAC TGCCATCTTC GGCTGCCTTT ACGACGACAC CTTTAGAATA 192 0 

ATTTTCCTTA CCTAGCAGGG CTTGGAAATT ATCTTTGAGG GCATTTTTAC TAGCCATACC 1980 

GACCACACCA GAAATCTCAG TTGC 2004 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
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CCGCCTTGGC 


CTGTTCGCCC 


ATTAAAGCGG 


CACCACAGCT 


GGGTTCAGAA 


CGTCGTGAGA 


60 


CAGTTCGGTC 


CCTATCCGTC 


GCGCGCGTAG 


GAAATTTGAG 


AGGATCTGCT 


CCTAGTACGA 


120 


GAGGACCACA 


GTGGACTTAC 


CGCTGGTGTA 


CCAGTTGTCT 


TGCCAAAGGC 


ATCGCTGGGT 


180 


AGCTATGTAG 


GGAAGGGATA 


AACCCTGAAA 


GCATCTAAGT 


GTGAAACCCA 


CCTCAAGATG 


240 


AGATTTCCCA 


TG ATT AT AT A 


TCAGTAAGAG 


CCCTGAGAGA 


TGATCAGGTA 


GATAGCTTAG 


300 


AAGTGGAAGT 


GTGGCGACAC 


ATGTAGCGGA 


CTAATACTAA 


TAGCTCGAGG 


ACTTATCCAA 


360 


AGTAACTGAG 


AATATGAAAG 


CGAACGGTTT 


TCTTAAATTG 


AATAGATATT 


CAATTTTGAG 


420 


TAGGTATTAC 


TCACAGTTAA 


GTGACGATAG 


CCTAGCAGAT 


ACACCTGTAC 


CCATGCCGAA 


480 


CACAGAAGTT 


AACCCCTAGA 


ACGCCGGAAG 


TAGTTGGGGG 


TTGCCCCCTG 


TGAGATAGGG 


540 


AAGTCGCTTA 


GCTCTAGGGA 


GTTTAGCTCA 


GCTGGGAGAG 


CATCTGCCTT 


ACAAGCAGAG 


600 


GGTCAGCGGT 


TCGATCCCGT 


TAACTCCCAT 


TTTAGCGGGT 


GTAGTTTACT 


CCTAAAACTA 


660 


CAGCCTTCCA 


AGCTGTTGTC 


GCGAGTTCGA 


TTCTCCTCAC 


CCGCTTTGAA 


CTTTGTTCTT 


720 


TGTACCAACT 


TTTTGACTTG 


GCCCCGTAGC 


TCAGGTGGTT 


AGAGCCCACG 


CCTCATAAGC 


780 


GTCAGGTCGG 


TGGTTCGAGT 


CCACTCCTGC 


CCATAGTGTT 


TAGTCCATTA 


CTAGGGGATT 


840 


GGAATATTAT 


CTGTTCACTA 


AGAGGACACG 


GGCTTGTTCC 


CGTATAAACT 


ATTTTGGAGG 


900 


ATTACCCAAG 


TCCGGCTGAA 


GGGAACGGTC 


TTGAAAACCG 


TCAGGCGTGT 


AAAAGCGTGC 


960 


GTGGGTTCGA 


ATCCCACATC 


CTCCTTTTAT 


ATTAACGCGG 


GATGGAGCAG 


CTCGGTAGCT 


1020 


CGTCGGGCTC 


ATAACCCGAA 


GGTCGTAGGT 


TCAAATCCTG 


CTCCCGCAAT 


AAGGCTCGGT 


1080 


ACCTCACTTG 


CTAGACCAAT 


GGATTGAAGC 


TCCATGTGTC 


GGCGGTTCGA 


TTCCGTCTCC 


1140 


CGCCATTTAT 


ATATTTTGGA 


AGGGTAGCGA 


AGAGGCTAAA 


CGCGGCGGAC 


TGTAAATCCG 


1200 


CTCCTTCGGG 


TTCGGGGGTT 


CGAATCCCTC 


CCCTTCCATT 


TTACGGGCAT 


AGTTTAAAGG 


1260 


TAGAACTAAG 


GTCTCCAAAA 


CCTTCAGTGT 


GGGTTCAATT 


CCTACTGCCC 


GTGTTAATAG 


1320 


AATTATGGCG 


GGTGTGGTGA 


AGTGGTTAAC 


ACACCAGATT 


GTGGCTCTGC 


CATGCGTGGG 


1380 


TTCGATCCCC 


ATCACTCGCC 


TATTTTATAT 


TCGCGTATAG 


CCAAGCGGTA 


AGGCAAGGGA 


1440 


CTTTGACTCC 


CTCATGCGTT 


GGTTCGAATC 


CAGCTACCCC 


AG TT ACT ATT 


TGCCGGCGTG 


1500 


GCGGAATTGG 


CAGACGCGCT 


GGACTCAAAA 


TCCAGTGTCC 


GCAAGGACGT 


GCCGGTTCGA 


1560 


CCCCGGCCGC 


CCGTATAGTA 


TAGTGTTAGG 


AACGTTGTTA 


TTCTTCGTTC 


CTTTTTTATA 


1620 


TTATTTTTGG 


TATAATTATA 


GTTATTCAAA 


TTTTATT7AG 


ATTAAGAAAG 


TGTAGGGGAG 


1680 


TATCTCTTGT 


TCTATCGATT 


TATTAAAACA 


, TCGGTATTTG 


AAAAATATTA 


AAGAAAATCC 


1740 


TGAATTGTTT 


GTCGGAATTG 


AGTTGGAGTA 


> TCCTGTTGCA 


AGTTTAGAAG 


; CGGATGCTAC 


1800 
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AGATGTTGAA GTTATGAAGG ATCTATTTCA TTATTTAGTT TCTACTTTGG ATCTCACCGT I860 

AGCAAAGGTA GATGATTTTG GCAATCTGAT CCAGTTAGTA GATCCGATAA CTCAGGATCC 1920 

TATTTTATTT GAAGTTTCCT ATACAACGAT TGAGTTTGCA TTTGGTAAGG CTGAAACGAT 1980 

TCAAGAGGTC GAAAATCGTT TCAATAATTA TATGAATGTA ATTCAGAGAA AGTTAGCTCA 2040 

ATCAAATCAT GCTATTGTTG GCTGTGGTAT CCATCCCAAC TGGGATAAAA ATGAGAATTG 2100 

7CCAGTGCCT TATCCACGCT ATCAGATGTT GATGGATTAT TTGAATTTGA GTAGAAATAT 2160 

TATTAAATCA GATTTACATC ATTTCCCTGA ATATGGTACT TTTATCTGTG GGAGC CAGGT 2 220 

TCAGCTGGAT ATTTCAAAAA CCAACTACTT ACGGGTGATT AATGCTTTTA CTCAAATTGA 2280 

AGCGGCTAAG GCTTATTTAT TTCCAAACTC TGAATTTTCG GGTGCGGATT GGGATACGAA 2 340 

AATTTCAAGG GATATTTTCT GGGAAGAATC TATGCATCGT ATCTATCCAG ACAATGTTGG 2400 

GGTCAATGCT AGACTCCTTA ATGATGAAAC TGATTTTTTT GACTATCTAA ATCATTCTGC 24 60 

GATTTTTACT GCGGAACGTG ATGGGCAGAC CTATTATTTT TATCCTATTC AGGCTGGGGA 2520 

CTATTTGGCT ACGTCCGAAA TCCAAGCATT TGCTCTGAAT GGGGATGAGG TTATTATTTA 25 80 

CCCCCAAGAG AAGGATTTTG AAACTCATCG TAGTTACCAG TACCAAGATT TAACGACTCG 2640 

AGGAACAGTT GAGTTTCGTA GTGTGTGTAC ACACCCACTT GATAGGACTT TTCCTTCTGC 2700 

AGCTTTTCAC TTGGGATTAT TGGTTAATTT AGACAAGTTA GAAGCTTACT TAGAAACAGC 2760 

ACCTTTCTTT AAAGTATTTG GTTATGATTA CAAGTCTTTA AGGAGACAAT TTTCTAAGAA 2820 

AAATCTTACA GATGAGGAAG AAACTACGAT TATTGAATTT TCCAAAGACT TACTCCTACT 2 880 

AGCTGAGGAG GGACTACTGG TGAGAAATAA GGAAGAAATG ACCTATTTAC AGCCTTTCAG 2940 

AGAAGAATTG AGCCTATAAT TTCTCTTATA AAGGGAGAAT TTTCTGAAAA ATCATGATAT 3000 

AATGGACGAG ACTATACATA AAGGATAGAG AGTAATGACA TTAGTTTATC AATCAACGCG 3060 

TGATGCCAAC AATACAGTAA CTGCCAGCCA AGCAATTTTG CAAGCTTTGG CGACGGACCC 3120 

CGGTTTGTTT ACACCGGATA CTTATCCAAA GGTAGATTTG AACTTTGACA AATTGAAAGA 3180 

TGCTTCTTAC CAGGAAGTTG CTAAGCTAGT TTPGTCAGCA TTTTTAGATG ACTTTACAGT 3 240 

TGAGGAGTTG GACTACTGTA TCAACAATGC CTACGATAGC AAATTTGATA CTCCAGCTAT 3 300 

TGCACCATTA GTGAAATTAG ATGGGCAATA CAATTTGGAA CTTTTCCATG GTTCAACGAT 3 360 

TGCCTTTAAG GATATGGCCT TGTCTATTTT GCCATACTTT ATGACGACTG CTCCTAAGAA 3420 

ACATGGTTTG GAGAACAAGA TTGTTATCTT GACAGCGACA TCTGGTGACA CGGGGAAAGC 3480 

TGCTATGGCG GGGTTTGCGA ATGTGCCTGG TACTGAGATT ATCGTCTTTT ATCCAAAGGA 3540 
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TGGTGTCAGC AAGATTCAAG AGTTACAAAT GACCACTCAG ACTGGCGACA ATACTCATGT 3600 

TATTCCTATT GATGGTAACT TTGACGATGC GCAAACAAAT GTGAAGCACA TGTTTAACGA 3660 

CGTGGCTCTT CGTCAAAAAT TGACTACCAA CAAGTTGCAA TTTTCATCAG CTAACTCTAT 3720 

CAACATTGCT CCTCTGGTCC CACAAATTGT TTATTATGTT TATGCTTACG CTCAATTGGT 3780 

TAAGACTGGT GAAATTGTAG CTGGTGAAAA GGTTAACTTC ACAGTACCAA CAGGAAACTT 3 840 

TGGAAATATC TTGGCTGCCT TTTATGCCAA ACAAATTGGT- TTGCCAGTTG GTAAATTAAT 3 900 

CTGTGCTTCA AATGACAACA ATGTTTTGAC AGACTTCTTT AAAACACGTG TCTATGACAA 3960 

AAAACGTGAG TTTAAGGTAA CAACCAGCCC ATCTATGGAT ATCTTGGTAT CTTCAAACTT 4020 

GGACCCCTTG ATTTTCCATC TTTTGGGAAA TAATGCTGAA AAGACAACTG AACTTATGAA 4080 

TGCCTTGAAC ACGCAAGGAC AATATAAGTT CACAGACTTT GATGCAGAGA TTTTGGACCT 4140 

CTTTGCAGCT GAATATGCGA CTGAGGAAGA AACGGCAGCA GAGATCAAGC GTGTTTGTGA 4200 

GTTAGATTCT TATATCGAGG ACCCTCATAC AGCTGTTGCT TCACCAGTTT ATAAAAAATA 4260 

CCAATCGGCC ACTGGAGATG TAACTAAGAC AGTGATTGCT TCAACAGCTA GTCCATACAA 4 320 

GTTCCCAGTA GTTGCAGTAG AAGCTGTAAC TGGAAAAGCA GGTTTAACAC ACTTTGAAGC 43 80 

CTTGGCTCAA TTACATGAAA TCTCAGGCGT TGCAGTGCCA CCAGCACTTG ATGGGCTTGA 4440 

AATAGCTCCA ATTCGTCACA AGACAACAGT GGCAGCTGCT GACATGCAAG CAGCGGTTGA 4500 

GGCTTATTTA GGACTTTAAG ACAGAGGGAG CAAACTCGCT TGGGAAACCA ACTGAGTTTC 4 560 

TTTTCATCAG GAGGAGAGAT TCTTTAAGAA AAATAAAGAC ATTCTTAATA TTCCATTGCC 4 620 

AGCTATGGGT GAAAACTTTT TGCAGATGCT AATGGGAATG GTGGACAGTT ATTTGCTTGC 4 680 

TCATTTAGGA TTGATAGCTA TTTCAGGGGT TTCAGTAGCT GGTAATATTA TCACCATTTA 4740 

TCAGGCGATT TTCATCGCTC TGGGAGCTGC TATTTCCAGT GTTATTTCAA AAAGCATAGG 4800 

GCAGAAAGAC CAGTCGAAGT TGGCCTATCA TGTGACTGAG GCGTTGAAGA TTACCTTACT 4860 

ATTAACTTTC CTTTTAGGAT TTTTGTCCAT CTTCGCTGGG AAAGAGATGA TAGGACTTYT 4 920 

GGGGACGGAG AGCGATGTAG CTGAGAGTGG TGGACTGTAT CTATCTTTGG TACGCCGATC 4980 

GATTGTTCTC TTAGGTTTAA TGACTAGTCT AGGAGCCTTG ATTCGTGCAA CGCATAATCC 5040 

ACGTCTGCCT CTCTATGTTA GTTTTTTATC CAATGCCTTG AATATTCTTT TTTCAAGTCT 5100 

AGCTATTTTT GTTCTGGATA TGGGGATAGC TGGTGTTGCT TGGGGGACAA TTGTGTCTCG 5160 

TTTGGTTGGT CTTGTGATTT TCTGGTCACA ATTAAAACTG CCTTATGGGA AGCCAACTTT 5220 

TGGTTTAGAT AAGGAACTGT TGACCTTGGC TTTACCAGCA GCTGGAGAGC GACTTATGAT 5280 

GAGGGCTGGA GATGTAGTGA TCATTGCCTT GGTCCTTTCT TTTGGGACGG AGGCAGTTGC 5340 
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TGGCAATGCA ATCGGAGAAG TCTTGACCCA GTTTAACTAT ATGCCTGCCT TTGGCGTCCC 54 00 

TACGGCAACG GTCATGCTGT TGGCCCGAGC AGTTGGAGAG CATCATTGGA AAAGAGTTCC 54 60 

TAGTTTGAGT AAACAAACCT TTTGGCTTTC TCTGTTCCTC ATGTTGCCCC TGTCCTTTAG 5520 

TATATATGTC TTCCCTCTAC CATTAACTCA TCTCTATACG ACTGATTCTC TAGCGGTGGA 5580 

GGCTAGTGTT CTAGTGACAC TGTTTTCACT ACTTGGGACC CCTATGACGA CAGGAACAGT 5640 

CATCTATACG GCAGTCTGGC AGGGATTAGG AAATGCACGC CTCCCTTTTT ATGCGACAAG 5700 

TATAGGAATG TGGTGTATCC GCATTGGGAC AGGATATCTG ATGGGGATTG TGCTTGGTTG 5760 

GGGCTTGCCT GGTATTTGGG CAGGGTCTCT CTTGGATAAT GGTTTTCGCT GGTTATTTCT 5820 

ACGCTATCGT TACCAGCGCT ATATGAGCTT GAAAGGATAG GAAATGCAAA AAACAGCTTT 5880 

TATTTGGGAT TTAGACGGGA CTTTATTGGA CTCTTACGAA GCGATTTTAT CAGGGATTGA 5940 

GGAGACTTTT GCTCAGTTTT CTATTCCTTA TGATAAGGAC AAGGTGAGAG AGTTTATCTT 6000 

CAAGTATTCG GTGCAAGATT TGCTTGTGCG GGTGGCAGAA GATAGAAATC TCCATGTTGA 6060 

GGTGCTAAAT CAGGTGCGTG CCCAGAGTCT GGCTGAGAAG AATGCTCAGG TAGTTTTCAT 6120 

GCCAGGTGCG CGTGAGGTGC TAGCTTGGGC AGACGAATCA GGAATTCAGC AGTTTATATA 6180 

TACTCATAAG GGGAACAACG CTTTTACCAT TCTCAAGGAC TTGGGGCTGG AATCCTATTT 6240 

TACAGAGATT TTAACCAGTC AGAGTCCCTT TGTGCGGAAG CCAAGTCCAG AAGCGGCTAC 6300 
CTATCTGCTA GATAAGTATC AGTTGAATTC TGATAATACT TATTATATAG GGGATCGGAC ■ 6 360 

TCTGGATGTG GAATTTGCCC AGAATAGTGG GATTCAAAGC ATCAACTTTT TACAGTCTAC 6420 

TTATGAAGGG AATCACAGGA TTCAAGCGTT AGCAGATATT TCCCCTATTT TTGAGACTAA 6480 

GTGATAAAAA GATTGTCTCA GTTTTCTGAC AGAGACCTAA CAAACTATTT CAAGTAACCT 6540 

AGTTTGTTAC AAGGAATAGA CAGTTCTGTT AAATAGGCCC GAGAGGGCTT TTTTTCTACA 6600 

TTTTTTGTGT TATGATAGAC AGGTACTCAT TTGAAAGCAA TTTGAAAGAA TCAAGAAAAG 6660 

AATGTTATTA GCGTCAACAG TAGCCTTGTC ATTTGCCCCA GTATTGGCAA CTCAAGCAGA 6720 

AGAAGTTCTT TGGACTGCAC GTAGTGTTGA GCAAATCCAA AACGATTTGA CTAAAACGGA 6780 

CAACAAAACA AGTTATACCG TACAGTATGG TGATACTTTG AGCACCATTG CAGAAGCCTT 684 0 

GGGTGTAGAT GTCACAGTGC TTGCGAATCT GAACAAAATC ACTAATATGG ACTTGATTTT 6900 

CCCAGAAACT GTTTTGACAA CGACTGTCAA TGAAGCAGAA GAAGTAACAC AAGTTGAAAT 6960 

CCAAACACCT CAAGCAGACT CTAGTGAAGA AGTGACAACT GCGACAGCAG ATTTGACCAC 7020 

TAATCAAGTG ACCCTTGATG ATCAAACTGT TCAGGTTGCA GACCTTTCTC AACCAATTGC 7080 
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AGAAGTTACA AAGACAGTGA TTGCTTCTGA AGAAGTGGCA CCATCTACGG GCACTTCTGT 7140 

CCCAGAGGAG CAAACGACCG AAACAACTCC CCCAGTTGAA GAAGCAACTC CTCAGGAAAC 7200 

GACTCCAGCT GAGAAGCAGG AAACACAAGC AAGCCCTCAA GCTGCATCAG CAGTGGAAGT 7 260 

AACTACAACA AGTTCAGAAG CAAAAGAAGT AGCATCATCA AATGGAGCTA CAGCAGCAGT 7320 

TTCTACTTAT CAACCAGAAG AGACGAAAAT AATTTCAACA ACTTACGAGG CTCCAGCTGC 7 380 

GCCCGATTAT GCTGGACTTG CAGTAGCAAA ATCTGAAAAT GCACGTCTTC AACCACAAAC 7440 

AGCTGCCTTT AAAGAAGAAA TTGCTAACTT GTTTGGCATT ACATCCTTTA GTGGTTATCC 7500 

TCCAGGAGAC AGTGGAGATC ACGGAAAAGG TTTCGCTATC GACTTTATGG TACCAGAACG 7560 

TTCAGAATTA GGGGATAAGA TTGCGGAATA TGCTATTCAA AATATGGCCA GCCCTGGCAT 7620 

TAGTTACATC ATCTGGAAAC AACGTTTCTA TGCTCCATTC GATAGCAAAT ATGGGCCAGC 7680 

TAACACTTGG AACCCAATGC CAGACCGTGG TAGTGTGACA GAAAATCACT ATGATCACGT 7740 

TCACGTTTCA ATGAATGGAT AAACCCGACT TGATAACATC ATTTTGACGA ATGAGATCTA 7 800 

GCTTTCGTGA TGGAAAGCGA TTCTCGTTCC TTTTTTCTTT GTCATACTCT TCGAAAATCT 7860 

CTTCAAACCA CGTCAGTTTT ATCTGAAACT TCAAAGCTGT GCTTTGAGCA ACCTGCGACT 7920 

AGCTTCCTAG TTTGCTTTTT GATTTTCATT GAGTATCAAT TTGAATGGAA AATGGAAAGT 7980 

TATCATCTTG TAATGAGTTA AGCAACATTC TTGCAATCTA TTTTACTTTA TATCACAATT 8040 

AATTAGTCAA ATATTGATAA ATCAATAAAA AGAGAGGGGA AGAAATGCTA GAGATTCAAG 8100 

ATTTACTGTA TCAACTCCGC TTGTCTGAGC AAGCGAGTAC GCAATTGTTT GAAAAAAGGC 8160 

TTGGGATTAG TTTGACACGG TATCAGATTT TACTGTTTTT GCTGGAGCAT TCTCCTTGTA 8220 

ACCAAATGGC GGTTCAGGAG CGTTTGAAAA TTGATCAGGC TGCTTTGACA CGGCATTTCA 8280 

AAATTTTGGA AACGGAAGGT TTGCTGGACC GTCATCGTAA TCCTGAAAAT CAGCGGGAAG 8340 

TGTTCGTAGA GGCTGCGAAG TATGCCAAGC AGCAGTTAGT GGTGAATCCC CCTCTGCAAC 8400 

ATATCAGGGT TAAGGAAGAG ATAGAAAGTA TCTTAACAGA GTTTGAGAGA ACAGAACTCA 8460 

GCCCTTTATT AAATAAATTG GTTTTGGGTA TTGAAAATAT AGAAATTTAA CGAGAAATAG 8520 

ATGTCAATTA TTTTAACAAC GATCGTTGCT TTGGAGCATT TTTACATTTT TTATTTGGAA 8580 

AGTATTGCCA CGCAATCAGA TGCGACTAGT CGTGTATTTA ATATGGAAAA GGAAGAATTG 8640 

GCTCATCCGT CAGTAAGTTC ATTGTTCAAA AATCAAGGAA TTTATAAGGC TCTGCTAGGA 87C0 

GTCTTTCTCT TGTATGTCAT TTATTTCTCA CAGAATTTAG AAATTGTGAC TATTTTTGTC 8760 

TTATTTGTGA TTGGTGCTGC GACTTACGGC TCTTTAACAG CGGATAAAAA AATTATTTTG 8820 

AAACAAGGTG GATCAGCTAT TTTGGCCTTG ATTAGTATTT TACTCTTTAA ATACACTTGA 8880 
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AGGTCGATTC 


TAATCTCGCT 


AATCCTTTTT 


AATCCAGAAT 


AACGGAAATA 


TGTTATACTT 


8940 


GTT TTTAAGA 


AAAAAGTCTC 


ATTGAATTGG 


TTTTGAGGAG 


TTAGAAATGA 


AAGTATTACT 


9000 


p A P APGTTTT 


gagppptttg 


GAGGGGAAAA 


GGGCAATCCA 


GCTTTGGAGG 


CCATTAAAGG 


9060 


l 1 1 /\V-V—/\VJ\_ 1 


onAA i i vj 


GTGCTGAGGT 


CCGTTGGCTA 


GAGGTGCCGA 


CAGTTTTTCA 


9120 


PA A A TPTPPT 


V. /WW* Ini 1 vjVj 


AAGAAGAGAT 


GAATCGTTAT 


CAACCTGACT 


TTGTCCTTTG 


9130 


lUvjVJV_/Vrt 




GAACTAGTTT 


GACACCTCAA 


Cv^GTGACCA 


TTAATCAAGA 


9240 


■ v-vj/\ 4 1 VJV. 


ATTTPTPATA 


APGAAGATAA 


TCAACCGATT 


GACCGTCCCA 


TTCGCCCAGA 


93C0 


I VJVJ I vjL. 1 1 tu 


vjL_v. 1 nV. 1 > In 


PTAP.TTTGCC 


GATTAAAGCG 


ATGGTTCAAG 


CTATAAAAAA 


9360 




P P P PPPTPTP 


TTTCCAATAC 


GGCAGGGACT 


TTTGTCTGCA 


GCCATTTGAT 


9420 


pt a tp & fir: rr 


PTPT A'f^'l "f'PP 


TAGAAAAGAA 


ATCTCCATAT 


GTTAAGGCAG 


GTTTTATGCA 


9430 


1A1 1 v.v» I Ini 


ATPATGP.AAP 


AGGTGGTGAA 


CAGACCGACT 


ACTCCAGCTA 


TGAGTTTAGT 


9540 




PPAPP.GATAG 


AACCAGCAAT 


CGGCGCTATA 


ATAGAACATG 


GAGATCAGGA 


9600 


AtJTL AAGTTG 


GT AGG LVjIj AG 


A A APTPATTG 

AAACIV.A1 I VJ 


ATAGAAAAAA 


GCTTGAGGGG 


AAAAACCTTC 


9660 


AAGv, ill 1 GG 




ppp AATACTG 


CTCGGTAAAA 


CATAATTTTA 


GTGCATTGGA 


9720 


TATAAGGTAG 


/— » /— ♦TW^ A A A A A 
GAG I wAAAAA 


P***APPAATCP 


CAAAGGTAAT 


CCAATTCAGG 


AAGTACCAAG 


9780 


L» AAG AAG v_ 1\j 


T A A ATPT &PP 
1 AAA 1 V_ 1 


APAAAGTGCT 
nCAAnu i vjv» a 


GGAACTTGTA 


GCCCTTCATA 


AAGGAACGGC 


9840 


TAGTTTTTAG 


P AiTPPTPTT 
* 1 v_vj i v» 1 * 


GGTCGGACCT 


GTCCTAGGTC 


TAG ACT AT AA 


CAGAGAAGAA 


9900 


ai rv_v_Av.v_TG 


I GAA i nylji-n 


TAATAPTCTG 


GAATATAGAG 


GATATTTCCT 


ACAATGATCA 


9960 




t*PP A APA A AP 


TAGAGTCCAA 

1 /\\J/\VJ * \>V.AA 


ACACCATGAC 


GAAACGCTCG 


GTTTCAACTC 


10020 


A 4 GAGAG A i v. 


IAGA1 I 1vjvj/\ 


A A PTACCAT 


GTAGGGTGAC 


GAATTTTTTG 


GCTAAAAAGC 


10080 


* AC iAi AAAA 


P AflPAPPTAA 


ATPPPAAGTA 


AATTAGGCAT 


ACTCCATAAA 


AAGAGATAGA 


10140 


AACGTTTGAG 


AACTAGGGTC 


AAAAAGGTTT 


GAGAAAAGCG 


CTCCTCATCA 


AAG AG AG CT A 




GGCTGTTTTT 


TACAGATGGC 


TCCGTTTTAC 


AATCTTTCAT 


GAGTGTCAGT 


GTTGCATAGA 


10260 


CCCAACTGGT 


CAAAAGAATA 


GTCCCGATAA 


AGGAGACTAG 


TAGAGGAAAG 


AGGTAGGTTT 


10320 


GAAGTATTTG 


GCCAAGTATG 


CTGAAAAATG 


GCTGTTCTAA 


AACAGTCCCG 


TGGATCCGAG 


10330 


ATAAGGGATT 


AAGAAAACCA 


GATAAGATGA 


CCAGCATACT 


GGGAAGGATA 


TAGAGGAGAA 


10440 


AGAGACGGGG 


GGTGTCAGCC 


TGAAAATGTT 


TTGACTCCTG 


ACGAATTGTT 


TTTAAATCAA 


10500 


TTTTTGGATA 


GTTCATTCTC 


TTATTATACC 


ATAGTTCTTA 


TACATAGTTC 


GTGACAGTTC 


10560 


CTACTTTTTT 


TGATAAAATC 


ATACAGTGTG 


TCCTTGGCCA 


CACTGTATGA 


ACTGGGACTG 


10620 
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TCTTTCCCAG CTTCGGAGGT AAAAAATGTC AGATTCACCA ATCAAATATC GTTTGATTAA 10680 

GAAAGAAAAA CACACAGGAG CTCGTCTGGG ACAAATCATC ACTCCCCACG GTACCTTTCC 10740 

GACACCTATG TTTATGCCAG TTGGGACACA AGCCACTGTC AAAACTCAGT CACCTGAAGA 10800 

ATTGAAGGAG ATGGGTTCGG GAATTATCCT ATCAAACACC TATCATCTCT GGCTTCCCCC 10860 

TGGAGATGAA CTCATTGCAC GCGCTGGTGG TCTCCACAAG TTCATGAATT GGGACCAGCC 10920 

TATCTTGACA GATAGTCGTG GTTTTCAGGT TTATTCTTTA GCAGATAGCC GTAATATCAC 10980 

AGAAGAAGGA GTAACCTTTA AAAATCATCT AAATGGTTCT AAGATGTTCC TATCCCCAGA 11040 

AAAAGCCATC TCTATTCAGA ATAATCTGGG TTCAGACATC ATGATGTCCT TTGATGAATG 11100 

TCCTCAGTTT TATCAACCTT ATGACTACGT TAAGAAATCG ATCGAGCGTA CCAGCCGTTG 11160 

GGCTGAGCGT GGTTTGAAGG CTCACCGTCG TCCACATGAC CAAGGTTTGT TTGGAATTGT 11220 

GCAAGGTGCA GGATTTGAAG ACCTTCGCCG CCAATCAGCT CATGATCTTG TCAGCATGGA 11280 

TTTCTCAGGC TACTCTATCG GTGGTTTGGC AGTGGGAGAA ACCCATGAAG AGATGAATGC 113 40 

GGTCTTGGAC TTTACAACTC AACTGCTGCC TGAAAATAAA CCTCGTTATC TGATGGGTGT 11400 

GGGAGCGCCA GATAGCTTGA TCGATGGGGT CATTCGTGGG GTGGATATGT TTCACTGTCT 11460 

CTTACCGACT CGAATTGCTC GTAACGGGAC TTGTATGACC AGTCAAGGAC GTTTGGTTGT 11S20 

GAAAAATGCC CAGTTTGCTC AGGACTTTAC GCCACTGGAT CCTGAGTGTG ATTGCTACAC 11580 

ATGTAATAAC TATACACGCG CTTACCTTCG TCACCTGCTC AAGGCTGATG AAACCTTTGG 11640 

TATCCGCTTG ACTAGCTACC ACAATCTTTA CTTCTTGCTT AACCTGATGA AGCAACTGCG 11700 

ACAAGCCATC ATGGATGACA ATCTCTTGGA ATTCCGTGAG TATTTTGTGG AAAAATATGG 11760 

CTATAATAAG TCAGGACGTA ATTTCTAAAA TGGAATTGAT ATAAAAAAAT CCTAAGTTTT 11820 

CTCTTAGGAT TTTTCTTCTT TTTTTGATAG AATAAAGTGT ACAATGAAAG GAACAATAAA 11880 

CTCGTATGCG CATTAAATGG TTTTCCTCGA TTAGG 11915 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

GAGAGGGCAA CAGTTCTATC GCTTCAAATT TTTTCTTCCT TTGCAGATAT TCAAGAATCG 60 

CGAGTTTTTC TATAGTATTC GGCAGATTTA TTACAGCCAA GCATCTCAAA AATACGGACA 120 
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CCATCCTCCA TCTTTTTCTC GCCTTCCTTG ACTCTACCTT GCTTGCTATC AAGGAGACCT 130 

TCTCCCCACA GATAAACAAT TCGGAAATAG GTCTCATTTT CCTTGTAGAA ATGCTCTTCG 2 40 

ATAACACGTT TAAAATAATA GGCATTGGTA AATTCTTCAC ACTCAATACT AGCTAAAAAG 300 

CCATTCAATA GTATAGTATG AAAAAGGTTT CGATTGCCAG ACATTTCCAT TAGAAAATCA 360 

GATTTACGTA CCATTTCTCG TACATATCTA GTAAAAAGAG AAACAGATAA AAATGGAGAA 420 

CTGACTGAAA ATAAATTGAG TTCATAGATT CCCCAGATCT CGGTAGAAAA CAAATAATCA 4 80 

TGAAGGACTT TTCCTTCCTC TGCTGTTAAG TCTACCCTTT CAT CT AT GC? CTTCATATAA 540 

GACTTGATAA TAATGGCATT TAGAATATGT TTCTGTTTGT TGTGAGAATG GGCATGCTTT 600 

TATACTCCCT GCGATATAAG TCCTCAAGAG GTGCTATATT CTTTCGTTCC AAGACATCTG 660 

TAATTTCTTT TCTCAACTCA GAATCTGTAT CATACTGGAA ACCTCTTCCC AGAAAGAGGA 720 

TCTCCTCCAC ACTCGCAGAT ATATTTTCCA GAGCAAATAG AAACTTTTCC ACCGAAAGCT 780 

CACTCTGACC TGTTTCAAAA CGGGACAACA TAGACGGCGA AAATTGTCCT CCGGTTGCTT 840 

GTCTCAGTGA GATATTTCTT GACTCTCGTA ATTGTCTAAA GACTTTTCCA ATCTGCTCCA 900 

TAGACTTCCC CTTGATTCCG TATTTTCTTC ATTTTATCAT ATTTTTCAGA AAATTCATCA 960 

AAAACTTGCC AAATTGTCAG AATTATGAGA AAATAGAGGA TATTTATCAC GTGGAGGGAC 1020 

TGCTATGAGA GACGATATCA AAATCAATCA CCGTGCTTTG GCCTTGCAAG ACCAAATTAT 1080 

CGAAAAACTA GAGAAAGTTT TTGATACAGA TGTGGAATTG GATGTTTACA ATCTAGGTCT 1140 

GATTTATCAA ATCAATCTGG ATGAAACGGG GCTCTGCAAG ATTGTCATGA CCTTCACCGA 1200 

TACTGCCTGT GATTGCGCCG AAAGCCTGCC TATTGAAATC GTGGCAGGTC TGAAACAAAT 12 60 

CGAGGGTATC AAAGATATCA AGGTTGAAGT TACCTGGTCG CCTGCTTGGA AAATCACACG 1320 

AATCAGTCGC TATGGCCGTA TTCCCCTTGG ACTACCACCT CGTTAAGCAG ACCAATCACT 1380 

TTTAAAGATG AAAATCAAAG GGCAAACTAG AAAACTAGCC GCAGGTTGCT CAAAACACTG 14 40 

TTTTGAAGTT ATGGATAGAA CTGACGAAGT CAGCTCAAAA CACTGTTTTG AGGTTGTGGA 1500 

TAGAACTGAC GAAGTCAgCT CAAAACACTG TTTTGAGGTT GTGGATAGAA CTGACGAAGT 1560 

CAGCCCAAAA CACTGTTTTG AGGTTGTGGA TAGAACTGAC GAAGTCAGTA ACCATACCTA 1620 

CGGCAAGGCG ACGTTGACGT GATTTGAAGA GATTTTCGAG TATCAGTTTA TTTTTTACCT 1680 

GACTTGTCCA TATTCCAGAA GTCTGTCACG GCTCCGCGTG AAGCAGATGA TACGATGTGG 1740 

GCATATTTAC CGAGGACACC ACGGCTGTAA AGTGGTGGCA AGGTTGTTTC TGCCTTGCGT 1800 

TTTTCAAGTT CTTCTTCGGA TACGGCCATA GAAATTTCTT TGGTATCTTG GTCAACCGTA 1860 
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ACGATATCGC CGGTACGGAG ATAGGCAATT GGTCCACCAT CCTGAGCTTC AGGAGCGATA 1920 

TGTCCAACAA CCAGACCATA AGTACCACCA GAGAAACGTC CGTCCGTCAA GAGGGCCACC 1980 

TTATCTCCCT GACCTTTACC AACAATCATT GAAGAAAGTG ATAGCATCTC AGGCATACCA 2040 

GGACCACCTT TAGGTCCAAC AAAACGAACA AC G ACT AC AT CGCCATCAAC GATTTCATCT 2100 

GTCAGAACGG CCTGAATCGC ATCTTCTTCT GAGTCAAAGA CCTTAGCTGG CCCAACGTGA 2160 

CGACGCACTT TAACACCTGA TACCTTGGCA ACTGCACCGT CAGGAGCAAG GTTCCCGTTC 2220 

" AAGATGAT AA GCGGACCATC CGCACGTTTT GGATTTTCAA GTGGCATGAT AACTTTTTGG 2280 

CCTGGAGTCA AGTCTGCAAA GTCAGCCAAG TTTTCAGCTA CAGTCTTACC AGTACATGTG 2340 

ATGCGATCTC CGTGAAGGAA ACCATTTGCC AACAAATACT TCATAACCGC AGGGACACCA 2 400 

CCGACTTCGT AGAGGTCTTG GAAGACATAC TGACCAGATG GTTTCAAGTC GGCCAAGTGA 2460 

GGCACACGTT CTTGAATCGT ATTGAAGTCC TCAAGTGACA AGTCAACATT TGCGGCATGG 2520 

GCAATGGCGA GCAAGTGAAG AGTGGCGTTT GTAGAACCAC CGAGACCCAT CGTTACAGTG 2580 

ATAGCATCTT CAAAGGCTTC ACGAGTCAAG ATATCTGATG GTTTGAGACC AAGTTCCAAC 2640 

ATCTTAACAA CAGCACGTCC TGCTGCTTCG ATATCTTCTT TCTTATCAGC TCATTCAGCT 2700 

GGGTGAGAGG ATGACCCTGG CAAACTCATC CCTAGAACTT CGATAGCAGT TGCCATGGTA 2760 

TTAGCAGTAT ACATACCACC ACAACCACCA GGGCCAGGGC AGGCATTACA TTCAAGACGT 2820 

TTCACGTCCT CAGCTGTCAT GTCACCGTGG TTCCATTTTC CGATACCTTC AAAGACAGAA 2880 

ACCAAGTCGA TATCTTTACC ATCAAGATTT CCCGGTCCAA TAGTTCCACC ATAGGCGAAA 2940 

ATAGCTGGGA TAT CC AT ATT AGCAATAGCA ATCATAGATC CAGGCATGTT CTTGTCACAG 3000 

CCACCCATAG CGACGAAGGC ATCCACGTTG TGACCACTCA TAGCCGCCTC GATGGAGTCC 3060 

GCGATGATGT CACGAGATGT TAGAGAGAAA CGCATACCAG GCGTTCCCAT AGCGATCCCG 3120 

TCCGCTACGG TAATGGTTCC AAACTGTACA GGCCAAGCGC CTGCAGATTT GACACCTTCT 3180 

TTAGCCAGTT TCCCGAAATC ATGCAAGTGA ATGTTACATG GTGTATTTTC CGCCCAAGTC 3240 

GAAATCACTC CCACAATCGA TGTTTCAAAG TCCTTATCTG TCATACCACT CGCACGAAGC 3 300 

ATAGCACGGT TAGGTGATTT AACCATGCTG TCATAAATGC TACTGCGGTG ACGTTTATCT 3 360 

AATTCAGTCA TCTTATCCCT CCCATTTCAG TTTTTACTAT TATAGCACAA TTTTCGCATG 3420 

AAGAACAGAA TAAAATTCTT GAATTTTCAG AAAATTCTAT ACACATGTGA AATATTTAAA 3480 

ATTAAAAACA ACAAAGCGGA TTACTGCACT TTCTGATGAC CAGAATATGC TTTTTAATCC 3540 

GCTTTCTTTA AATAACGTAC TGTAATTTTT ACAGAAATTC TTTCAAATAA GTGTATTTAA 3 600 

CATCTATCTT GC ATT AT AAA TTTCTAGAAC CTTCTCTTTT ATATTCGATT CACTCAAACC 3660 
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ATACTCATTA AGAAGATAAT CCATTTTCCC TACTTGACCG AATCTTTCTT GAACACCCAT 3720 

CCGATGAATT TTTGTTATTC CATCATCAGA GAATAATTCA CATAAAGCAC TGCCAATTCC 3780 

ACCTATCTGA TTGTGGTTTT CT AC ACT AAA TATAGTTTTT CCACTTAACA TTGTTTTTAT 3340 

CTGTTCTGGT ATCGGTTTGA TTCTAAATAA ATCTATCACA CCTACTGAAT AACCTAATTT 3 900 

AGACAGTTCA TCTGCAACTC GAATACTTGG AGCAACCATT ATGCCAGAAG CAACGATTAC 3960 

AAGATCTTCA "CCATGCCTTA ACTCAATGTA GCCTTTAGAA AAATCTTCTC CACCTTGATA 4020 

CACAGGAACT GGAGCTTTTC TAATTGTTCG AATATATTTT AGTCCTTTTA AGTCTAATGT 4080 

CTCGTTCAAT ATTTCACGAA ATTGGATATC ATCAGTTGCT TCGAAAATGA TTGATTTAGG 414 0 

AATTAAACGT AACAATCCAA TTTCTTCAAA TGGCATATGT GTTCCACCAT TCATCTCTGC 4200 

CGTTACTCCT GCATCTGATC CAATCACAGT GGCATCCAAT TGTGCGTATC CAAGAGAAAT 4260 

AAATAATTGA TCAAATACTC TTCGTGAAGC AAAAGGACCA AATCTATGAA GATAAGGTCT 4320 

AAACCCCTGA ATAGACAAGC CTGCTGCAAG GCCGACCATT TCTGCTTCCA TAATCCCAAC 4 380 

ATTCACATAA CGGTCTCCAA AGTCCTTTTC AAGATTATTA GTAGCCATCG AACTTGACAA 4440 

ATCGGCTTCT AAGACTACTA TATCAGAATC ACTTTGATTA GCCTCTAAAA GGAAGTCTCT 4 500 

ATATACATGC CGTAATTCTT TCGTACTTCT CATCATTCTG TTTCCTCCAA TTCCTGACTT 4560 

AATCTTTCTA CAACTGAAGT TAACATTTGT TTCTCCTCTA CAGTAGGGCG AAGATGATGA 4620 

TTGGATTTCA TTTCTTCCAG CTCTTGAACC CCTTGACCTT TAATAGTATC TAATACAATG 4680 

CACTTAGGTG ATCAATTATT TGACTGTTTT AATTGGACAA TCCCTTCATA AATTTCTCTA 4740 

ATATCTGAAC CCTTGACCCT AATGGATTCA AATCCAAATC CTGAAAATTT TTCTACGAAA 4 800 

TCACCTGGAT TACAAATATC CTTTGTAAAA CCATCTAATT GTTTTTTCTT ATCATCAACA 4 8 60 

AATACAATTA AGTTGGATAA CTCTTGATGA GAAGCAAACT GTATAGCCTC CCAACATTGT 4 920 

CCCTCATTTA ACTCACCATC TCCAACAATA GCGTAAGTAT AAAAGGGACT CTTTCTTATT 4 980 

CTCTGACCAT ATGCAAGTCC AGTTGCAACA CTAATTCCTT GTCCTAAAGA GCCCGTTGTC S040 

ATATC7ATGC CTGGCGTTAG ATTTCTATCA GGATGAGACG GTAATTTGGT TCCATTTGTA 5100 

TTTAAAGAAT ATAAGAATTC TTTGTCAAAG AAACCATTCA AATAGAGTGT ACTGTATAGA 5160 

GCTGGTCCTC CGTGACCTTT TGATAATATC AAATAATCTC TATCTCGTGC TGCAAATATT 5220 

TCTGGAGTCA TTGGCATTAT TTCACCATAA AGCACCGCTA AAACTTCTAC GATAGACAGA 5280 

CTTCCTCCGT AATGTCCGAA TCCAAGATGA TTCAATGTTC TAAGAGTATT T AATCGG ATG . 534 0 

TTAGTCGCAA ATTTTCTTAA CCCATCTTCT CTATTTTTAC TTAAAATCAT CCCTTATTCC 5400 
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TCCGTTGCAG ATGGCTTTTT AATAAAGGAT ACTCCAAACA TAACTGCTAG AATAAGAACA 54 60 

AGACCAATCA CAATGCCTGC TTGTGAGCCA AATTGATTTA ACATTCCTAA AATAATTCCT 5520 

GATAGACCAA AATCTGCATC TGAGAAAGTT GATCCTTGGA AACCAAGTCC TCCCAAAACT 5580 

CGCATTAAAA AGACTGGAAG AAAACTGATT AAAATACCTT GTAAAAATGC TCCAATAGTG 5640 

GCTCCACGAA CACCACCAGA TGCATTCCCA ATCACACCTC CAGTCGCTCC ACAGAAGAAA 5700 

TGAGGCACAA CACCTGGTAA GATAACAACC GTTCCTGAAG CAATCATAAT TACCATACTT 57 60 

ACTAAACCAC CAACAAAACT AGAGATAAAT CCAATTAGAA CTGCATTGGG TCCATAAGTA 5820 

TAAACAATCG GACAATCCAA AGCAGGTTTT GAATTAGGTA CAAGACGCTC TGAAATACCT 5880 

TTAAAGGCTC GAACAATTTC GCCCAAAATA AGGCGAACAC CTCCTAAAAT AACAAATACC 5940 

CCTGCTGCAA ATTGACCTGC TAATTGTAAA GCATAAACTA GACCACTTGT ACCACTACTG 6000 

ATTTCTTTTT CTATATATTC TGACCCTGCA AAGATAGCTA CAATAATGTA AATAACTGCC 6060 

ATGGATAAAC TAATACTAAC AGTACTATCA CGTAAAAAAG CTAAACTCTT TGGAAATTTA 6120 

ATGTCCTCTG TTGATTTTGA TTTCTCACCC ATAAGGCTAC CAGTAAAACC ACTCAACCAA 6180 

TATCCCAAAG AACTCAAATG ACCTAAAGCT ACCTTGTCAT TTCCAGTTAA TTGAACCATA 6240 

TATTTTTGCA CAAATGCTCG GGAAATACTC ATAATAATAC CGAGTGCTAA TCCTCCTAGT 6300 

AAGATGAGAG GCAACCTAGT AAAGCCAGCA ACTGATAAAA TGACCGCAAT CATACATGCC 6260 

ATATATAGAG TGTGGTGCCC TGTTAAAAAA ATATATTTAA ATCGAGTAAA ACGACCGATT 6420 

AAGATATTGA ACACCATGCC TGCAAACATA ATCATTGCAG TAGCTGAGCC ATATGTTCTT 6480 

AAAGCT AC AG CTACAATTGC TTCATTATTC GGCACAACGC CAGATAAATG AAAAGCATCC 6540 

TCAAACATGG TACCAAATGG ATTTAAAGAA TTTTGTACAA TTCCTGCACC ACCAGATACA 6600 

ACTAAGAAAC CAACAAAGGT CTTAATTCCA CCTTTAATAA TATCAGGTAA TTTCTTCTTC 6660 

TGAAGAACTA ATCCTAAGAT TGCAATTAAA GCTACTAAAA TAGCTGGTGT ACTAACAATA 67 20 

TCCAATATGA ACTTCATCAT GACGCTAGCC TCCTATATAA GTCCTTTTTC TTCACAAA^T 6780 

TTACTAATTA ATTCTCGTAG TTCATCCATA TCAATAATAC TATTTAAGAT ACGAACATCT 6840 

CCAAGATGAC TAGCTGAATC AGCTAGATCA CGACCAACAA TCCAAATATC AGCTGCATTT 6900 

GGATCTGCTC CACCTAAATC ATAATGTTCA ACTTCTACAT CCGAAACATT CAAATCACTC 6960 

AATACAGATT CAATATTCAT CTGTACCATA AAACTTGAAC CTAATCCTCA ACCACAAGCT 7020 

GTACCAATTT TTAACATTAT CTAATCCTCC TGTTTAATTA TCATTTTAAT GTCATCATAG 7080 

TTTTTTGATG ATATTAAAGT TTGAACATGA TTTTTATCTC TTAAAATTGT TGTTAAATGT 7140 

GACAAAGCCT TTAAATCAC? CTCATTATCA ATCCCTCCAA TACAAATCAA CAATCTTACC 7 200 
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TCTTGTTCTG 


GATTATCCAA 


/Oh a m ft M 1 ft t ^%/^ 

TAAATAAATC 


uGiicnCv. A 






7260 


ATTTCATTCA 


CACCTTCATC 


TGGCCGAGCG 


IajAOwAAI iU» 


PT APTPPPTT 


rrrtAAATTA 


7320 


ATAAAAGGTC 


CAAACTCTTC 


TAcr^rTTGA 


ATCATTGCCT 




PTC & P.TT ATP 


7380 


TTATCTTGAT 


CCAAAAGCGG 


TTTAGCTGCT 


AAACGAATCG 


I V-V- i 1 V.L.A 


TCCTAATTTT 


7440 


TGCGAACTAA 


CCTGATAGGT 


TTCTTTGGTA 


ATAAGTTGTT 


LTAuLAL rviO 


I AL.AA i 1 1 


7 ^nn 


TTTCTATCAT 


TTTTTTGGTA 


AAGATAATTC 


TTTAACGCCA 


ATCTTAATTC 


CAATTCTTGT 


' jOU 


GTAATAATTC 


CATATCTTTT 


GACAATATTC 


AGGATTTGTT 


CAATCTL AAA 


A I L 1 A 1 Ai- 


1 OtU 


TCTAAATTCC 


GAAAATCTTT 


TAACACTAGT 


TCTACTAGTT 


GTATTGCTTG 


<. 1 V. 1 IV. AO 1 L. 


7 680 


ATCATAACCG 


AAACTAGATA 


ATTTGGCTTT 


TCTGTCTCCA 


CCTTTATGGT 


» » » » » » * /""*/"■ 
AuAAAAAALL 


77 40 


AT AT CAT ACT 


CACTACTAGC 


TTTCACCTGT 


AAATCATCAA 


TCTTTGAGGT 




7 800 


TCAATTTGAG 


GAAATAATGC 


TAATAGATTC 


TCTTTTAACA 


TCAATGAAGA 


apappa 


7 860 


TTAGGACAAA 


TGATTGCTGC 


TTTATACCAT 


TTTTGAGGCA 


AAGTATCTGC 


TTTCTTTAAA 


7Q ?n 


TAACCTCCGA 


AATGGATAAC 


AAAATATGCT 


GTTTCACTAT 


CAGGTATGGG 


A iTt» V\- AA i A 


7980 


GCGTCCATCA 


AGGGCATCAA 


AGAATCTTTG 


ACTAATTCAA 


ATAAATCAGG 


A 1 AA lul 1 v- 1 


8040 


TTAACATGCA 


ATACATATTC 


ATTTGAACTA GGTAGGCCGA ACTTTAATCT 


ATAuTAAuLL 


ft 1 no 


CGTATAAGGT 


GGCGGCGAAG 


ATTTTCTCTC 


AATCCTTCCC 


TTTGTTTAAA 


A IVTAAUAAA 


8160 


GAAATATCTT 


CCATTCTACT 


TATAATAGCC 


TCTGTTAATT 


GATTAAACTA 


& APPPP.AP.PA 


8220 


ACATCTACTT 


CACCTTCAAA 


GCAACTTGAT 


AATAAAACGG 


TGATATAGCG 


imk atp atpp 

A I AA 1 V.A1 H. 


82 80 


TCAGAAAACA 


CCGTATCTAT 


AATTCCCAAA 


TCAACCACTG 


TATCCAATAA 


AA 1 AO 1 w L 1 


834 0 


ATATCTTGAA 


TAACAGGAGA 


TACTAATGTC 


TCTGAAAGAC 


ATACTCTTTC 


AAPATCPPTT 


8400 


TGATACCTAC 


ACAGAATGAA 


TACTAAACCG 


AAAACCTAAA 


CTTTTAATTG 


ATT A AP A AT A 


8460 


GGTACTAGCT 


GTAGCTTCTC 


ATAATAATCT 


TTAACTACCT 


GATCAATCAA 


ATCATAAGTT 


8520 


AATGAATACC 


CCCAACTGGA 


TAAAACATAA 


TCCAAACCCC 


AAATCCCTAT 


GGAGGATTCC 


8580 


AGCAACTCAC 


TAACCATTTG 


AAAAGCTAAG 


CGGTGCTTAT 


TCCACTCTGA 


ACCGTGTAAA 


8640 


GTATAACCTT 


TTGCTCTACT 


GTACCCTAGC 


TCCAAATCAT 


TATCTAACAT 


AATCTTTCTT 


8700 


AATGATTGAA 


TATCAGATAA 


GGTTGTATTC 


TTACTTACTT 


TCAAAAAGTC 


TTGGTAATGA 


8760 


CTATTCGATA 


TAAAATCTAA 


TCGGCAAAAA 


GTGTAAAGAT 


AGATTAAAGC 


TAAGCGAGTC 


8820 


GACTTTGGTA 


AAACCAATTC 


ATCCGACTTA 


ATAATATCTG 


TCAAAGACTG 


CTTCGTACGA 


8880 


TTTGATAAAC 


TATAGCGACC 


TTGCTTTTTA 


TCCAGCACTA 


TCCCTTTATT 


AGCTAGATAA 


8940 
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GGCACTAAAT AATCTATTCC TTCTTTGACT TCCTTTATAG GTAAGCTCAC CTTAACAGAT 9000 
AATTCATATA ACGATAGCTC ACAATGATCC ATCAAAGTCA TCAAAATAAC TACTGCTCTA 9060 
TAATCAAAC 9069 
(2) INFORMATION FOR SEQ ID NO: 98: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8654 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 



CGAGACAACA 


AGATGAAGAA 


AAATTTGCCC 


TATCGTTTGT 


GGCGCTTGCA 


AGTCTAGCAC 


60 


TTCTTGCAGC 


CTCTCGAGAA 


GTGAAGTCTG 


GAGCAGTCAA 


CACTGCTGGT 


AACTCAGTAG 


120 


AGGAAAAGAC 


AATTAAAATC 


GGGTTTAACT 


TTGAAGAATC 


AGGTTCTTTA 


GCTGCATACG 


180 


GAACAGCTGA 


ACAAAAAGGT 


GCCCAATTGG 


CTGTTGATGA 


AATCAATGCC 


GCACTGGTAT 


240 


CGATGGAAAA 


CAAATCGAAG 


TAGTCGATAA 


AGATAATAAG 


TCTGAAACAG 


CTGAGGCTGC 


300 


TTCAGTTACA 


ACTAACCTTG 


TAACCCAATC 


TAAAGTATCA 


GC AGTCGTAG 


GACCTGCGAC 


360 


ATCTGGTGCG 


ACTGCAGCTG 


CGGTAGCGAA 


CGCTACAAAA 


GCAGGTGTTC 


CATTGATCTC 


420 


ACCAAGTGCG 


ACTCAAGATG 


GATTGACTAA 


AGGTCAAGAT 


TACCTCTTTA 


TTGGAACTTT 


480 


CCAAGATAGC 


TTCCAAGGAA 


AAATTATCTC 


AAACTATCTT 


TCTGAAAAAT 


TAAATGCTAA 


540 


GAAAGTTGTT 


CTTTACACTG 


ACAATCCCAG 


TGACTATCCT 


AAAGGCATTG 


CAAAATCTTT 


600 


CCGCGAGTCA 


TACAAGGGTG 


AAATCGTTGC 


AGATGAAACT 


TTCGTAGCAG 


GTGACACAGA 


660 


CTTCCAACCA 


GCCCTTACAA 


AAATGAAAGG 


GAAAGACTTT 


GATGCTATCG 


TTGTTCCTGG 


720 


TTACTATAAT 


GAGGCTGGTA 


AAATTGTAAA 


CCAAGCGCCT 


GGCATGGCAA 


TTGACAAACC 


780 


AATCGTTGGT 


GGTGATGGAT 


TCAACGGTGA 


GGAGTTTGTA 


CAACAAGCAA 


CTGCTGAAAA 


840 


AGCATCAAAC 


ATCTACTTTA 


TCTCAGGCTT 


CTCAACTACT 


CTAGAAGTTT 


CAGCTAAAGC 


900 


TAAAGCCTTC 


CTTGACGCTT 


ACCGTGCTAA 


GTACAATGAA 


GAGCCTTCAA 


CATTTGCAGC 


960 


CTTGGCTTAT 


GATTCACTTC 


ACCTTGTAGC 


AAACGCAGCA 


AAAGGTGCTA 


AAAATTCAGG 


1020 


TGAAATCAAG 


AATAACCTTG 


CTAAAACAAA 


AGATTTTGAA 


GGTGTAACTG 


CTCAAACAAG 


1080 


CTTCGATGCA 


GACCACAACA 


CAGTCAAAAC 


TGCTTACATC 


ATGACCATGA 


ACAATGGTAA 


1140 


AGTTGAAGCA 


GCAGAAGTTC 


TAAAACCATA 


ATAGAAAAAT 


GTTGAAATAG 


GGAATGAGCC 


1200 


TTTGACTCAC 


TCCCTGTTTC 


GATATTTAAT 


ACTCTTCGAA 


AATCTCTTCA 


AACTGCCTCA 


1260 
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ACCTCCCCTT GGATTATATA TGTGACTGAC TTCGTCAGTC TTATCTACAA CCTCAAAGCA 1320 
GTGCTTTGAG CAACCTGCGG CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATAA 1380 
GAACCTATCA AAAAGTGAGG GAAAACCCTC GGAATTATAA ATAGAAAGAG TGAATCTTAT 144 0 

GCTCCAACAA CTCGTAAATG GTTTGATTCT AGGTAGTGTT TACGCGCTGT TACCCCTACG 1500 
ATATACCATG GTTTACGGAA TTATCAAGCT CATCAACTTC GCCCATGGTG ATATTTATAT 1560 
GATGGGAGCC TTTATCGGTT ATTTCTTGAT CAATTCTTTC CAAATCAATT TCTTTCTACC 1620 
GCTTATTGTA GCTATGCTAG CCACAGCTA7 TCTTGGTGTC GTGATTGAGT TTCTTGCTTA 1680 
CCGACCTTTG CCCCACTCTA CTCGTATTGC TGTTTTGATT ACGGCTATTG GGGTTTCTTT 1740 
CCTATTGGAG TATGGAATGG TCTATCTGGT TGGTGCCAAT ACCCGTGCCT TCCCTCAAGC 1800 
GATTCAAACA GTTCGATATG ATTTGGGACC AATTACCTTA ACAAATG7GC AGTTAATGAT I860 
TTTGGCCATT TCCTTGATTT TGATGATTTT GTTACAAGTC ATTGTCCAAA AGACTAAGAT 1920 
GGGGAAAGCC ATGCGTGCAG TATCAGTAGA TAGCGACCCG CCGCAATTGA TGGGGATCAA 19 80 

TGTAAACCGT ACCATTAGCT TTACCTTCCC TTTGGGTTCT GCTCTTGCGC GTGCGGCTGG 2040 

TGTTCTGATT GCTCTTTATT ATAACTCTCT TGAGCCTTTG ATGGGGGTTA CTCCAGGTCT 2100 

TAAATCTTTC GTTGCCGCAG TACTTGGTGG TATCCGAATT ATTCCTGGTG CGGCTCTTGG 2160 

TGGCTTTGTG ATTGGTCTAT TGGAAACCTT TGCGACTGCC TTTGGGATGT CAGATTTCCG 2220 

TGATGCCATT GTTTATGGAA TCTTGTTGTT GATCTTGATT GTCCGCCCAG CTGGTATCCT 2 280 

TGGTAAGAAT GTGAAAGAGA ACGTGTAAAC CATGAAGGAA AATTTAAAAG TTAATATTCT 2 340 

ATGGTTACTC CTTTTGTTAG CTGGCTATAG CTTGATTAGT GTACTGCTTT CAGTCGGAGT 2 400 

ACTTAATCTA TTCTATGTAC ACATTTTACA ACAAATTGGA ATTAATATTA TTTTGGCTGT 2 4 60 

TGGTCTCAAC TTAATCGTTG GTTTTTCAGG ACAATTTTCA CTTGGTCATG CTGGTTTCAT 252 0 

GGCGATTGCT CCCTATCCAC CAGCTATTAT TGGTTCTAAA TCACCAACCT ACGGTGCCTT 2 580 

CTTTGGAGCT ATGCTTGTAG GGGCTTTGCT TTCAGGAGCA GTTGCCTTAC TTGTCGGCAT 2 640 

TCCAACCTTG CGCTTGAAGG GGGACTATCT TGCGGTAGCA ACTCTGGGTG TTTCTGAAAT 2 7 00 

TATCCGTATC TTTATCATCA ATGGTGGAAG CCTTACAAAT GGTGCCGCAG GTATCTTAGC 2 7 60 

GATTCCTAAC TTTACAACTT GGCAAATGGT TTACTTCTTT GTCGTGATTA CAACCATTGC 2820 

AACCTTGAAC TTCTTGCGTA GCCCAATTGG TCGTTCAACC CTCTCTCTTC GTGAAGATGA 2880 

AATCGCTGCT GAGTCAGTTG GGGTTAATAC GACTAAAATT AAAATCATCG CTTTTGTCTT 2940 

TGGTGCCATT ACTGCAAGTA TTGCTGGGTC ACTTCAGGCA GGATTTATCG GGTCTGTTGT 3000 
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ACCCAAAGAT 


TACACCTTCA 


TCAACTCAAT 


CAACGTTTTG 


ATTATTGTTG 


TATTTGGTGG 


3060 


ACTCGCTTCC 


ATTACAGCTG 


CGATTGTTTC 


GGCTATTGTT 


CTCGCAATTT 


TGAATATGCT 


3120 


TCTCCAAGAT 


GTTGCTAGTG 


TGCGTATGAT 


TATTTACGCT 


TTGGCCTTGG 


TATTGGTAAT 


3180 


GATTTTCAGA 


CCAGGTGGAC 


TCCTTGGAAC 


ATGGGAACTG 


AGCCTATCAC 


GTTTCTTTAA 


3240 


AAAATCTAAG 


AAGGAGGAAC 


AAAACTAATG 


GCATTACTTG 


AAGTAAAACA 


GTTAACCAAA 


3300 


CATTTTGGTG 


GTCTAACAGC 


TGTTGGAGAT 


GTGACTCTTG 


AATTGAACGA 


AGGGGAACTG 


3360 


CTTGCATTAA 


TCGGTCCAAA 


CGGAGCTGGG 


AAAACCACCC 


TTTTCAACCT 


TTTGACCGGT 


3420 


GTTTATGAAC 


CAAGCGAGGG 


AACAGTAACC 


CTAGATGGTC 


ACCTTTTGAA 


TGGGAAATCA 


3480 


CCTTATAAGA 


TTGCCTCTTT 


GGC ACTTGG A 


CGTACTTTCC 


AAAATATCCC 


TCTCTTTAAA 


3540 


GATTTAACAG 


TTTTAGATAA 


TGTTTTGATT 


GCTTTTGGAA 


ACCATCACAA 


ACAGCATGTT 


3600 


TTTACTAGTT 


TCTTACGCTT 


ACCAGCTTTT 


TACAAGAGTG 


AAAAAGAATT 


AAAGGCTAAA 


3660 


CCTTTGGAAT 


TGTTGAAAAT 


CTTTGATTTA 


GATGGTGATG 


CACACACTCT 


TGCTAAAAAT 


3720 


CTTTCCTACG 


GACAACAACG 


TCGTTTGGAA 


ATTGTTCGTC 


CCCTTGCTAC 


GGAACCTAAA 


3780 


ATTCTCTTCT 


TAGATGAACC 


AGCAGCAGGT 


ATGAACCCAC 


AGGAAACAGC 


CGAATTCACT 


3840 


GAGTTAATTC 


GTCGTATCAA 


AGATGAGTTT 


AAGATTACAA 


TCATCTTCAT 


TGAACACGAT 


3900 


ATGAATCTGG 


TCATGGAAGT 


AACAGAACCT 


ATCTACGTAC 


TTGAATATGG 


CCGTTTAATC 


3960 


GCTCAAGGAA 


CTCCAGACGA 


AATTAAGACC 


AATAAACGCG 


TTATCGAAGC 


TTATCTAGGA 


4020 


GGTGAAGCCT 


AATGTCTATG 


TTAAAAGTTG 


AAAATCTTTC 


TGTGCATTAC 


GGTATGATCC 


4080 


AAGCAGTTCG 


TGATGTAAGC 


TTTGAAGTTA 


ATGAAGGAGA 


AGTTGTTTCC 


CTTATCGGTG 


4140 


CCAACGGTGC 


AGGTAAGACA 


ACTATTCTTC 


GCACCTTGTC 


AGGTTTGGTT 


CGACCAACTT 


4200 


CAGGAAAGAT 


TGAATTTTTA 


GGTCAAGAAA 


TCCAAAAAAT 


GCCAGCTCAG 


AAAATCGTGG 


4260 


CAAGTGGTCT 


TTCACAAGTT 


CCAGAAGGAC 


GCCACGTCTT 


TCCTGGCTTG 


ACTCTTATCG 


4320 


AAAATCTTCA 


AATGGCAGCT 


TTCTTAAAGA 


AAAATCGTGA 


AGAAAATCAA 


GCTAACTrCA 


4380 


AGAAGCTTTT 


CTCACGCTTT 


CCTCGTCTTG 


AAGAACGGAA 


GAACCAAGAT 


GCAGCCACTC 


4440 


TTTCAGGGGG 


GGAACAACAA 


ATGCTTGCCA 


TGGGACGCCC 


CC7CATGTCA 


ACACCAAAAC 


4500 


TTCTTCTTTT 


AGATCAACCA 


TCAATGGGAC 


TTGCCCCAAT 


CTTTATCCAA 


GAAATTTTTG 


4560 


ATATCATTCA 


AGATATTCAG 


AAGCAAGGAA 


CAACGGTCCT 


CTTGATTGAA 


CAAAATGCCA 


4620 


ATAAAGCACT 


TGCAATCTCT 


GACCGAGGAT 


ATGTACTGGA 


AACAGGGAGA 


ATCGTCCTAT 


4680 


CAGGAACAGG 


AAAAGAACTC 


GCTTCATCAG 


AAGAAGTCAG 


AAAAGCATAT 


CTAGGTGGCT 


4740 


AAAACAATCC 


AGTGCATTGT 


TTTAGTCGGC 


AGATGGAGAT 


TACGAAGTAA 


TCATCAATAT 


4800 
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4920 
4980 
5040 
S100 
5160 



5280 
5340 



749 

AGTCCGGGGG ACCTTTTTAG TCGGTAGATT GAGATTGCAA ACAAATCTGC ATCTACATTG 4860 
AAAGCTTAAT TTCTAATAA? TGAAAAAATC GAATCAAAAA TTTCTTACCT TCATTCACAG 
AGCTCGATTT CAGAGCTCTT TTTGCTAGCT TATTCATACT TTTCTGAATT TCGAAAAAGA 
AATGTAAGCG TTTGATAGAT TTACAAAAAC ATTCTATAAT AGGGATAAGA ATAGAAAAGG 
AGAAGTCTCA TGGCAGTTAA AGATTTTATG ACCCGCAAGG TAGTTTATAT TAGTCCAGAT 
ATAACAGTAT CTCATCCAGC ACATTTGATG AGAGAGCAAG GTTTGCACCG TCTGCCTGTT 
ATCGAAAATG ATCAATTAGT TGGTTTGGTG ACTGAGGGAA CCATTGCACA AGCAAGTCCA 5220 
TCTAAAGCAA CAAGTCTTTC 7ATCTATGAG ATGAATTATC TTCTGAATAA GACAAAAGTA 
AAAGATGTCA TGATTCGCGA TGTTGTCACT GTCTCAGGCT ATGCTAGTCT AGAAGATGCA 
ACTTATCTGA TGTTGAAAAA TAAGATTAGT ATTCTCCCTG TCGTAGATAA CCATCAAGTA 5400 
TACGGAGTTA TTACTGACCG TGACGTTTTC CAAGCCTTTC TTGAAATTGC AGGTTATGGC 54 60 

GAAGAAGGGA TTCGTGTACG CTTTGTTACA GAACATGAAG TTGGTGTTCT TGGAAAAATT 552 0 

GTTTCTTTGA TTGTAGAAGA AAATTTGAAT ATCTCCCATA CAGTCAATAT TCCGCGTAAG 
GATGGTAAGG TGATTATCGA AGTGCAAATC GATGGATCAA TTGATTTACC AGCCTTGAAA 
GAAAAATTTG AAGCAAATGG TATTCAAGTG GAAGAAATCG CTCGCACTTC ACCAAAAGTC 
TTGTAAGAAG GGAAGCCCAA AGGCTTCTTT TTTCATGAAA AGGGGATTAG AGCAAAAGAT 5760 
GGAAAGAAAT GATAAAATAT GCTATAATGA AATAATGTAA AAAAGGAGTA TTTATGGACA 
TTTCAGTAAT TCGTCAGAAA ATTGACGCAA ATCGTGAAAA ATTAGCTTCT TTCAGGGGGT 
CTCTTTGACC TCGAAGGGCT AGAGGAAGAG ATTGCCATCT TGGAAAACAA GATGACAGAA 
CCTGATTTTT GGAACGATAA TATTGCGGCC CAAAAAACCT CGCAAGAATT AAATGAATTA 
AAAAACACTT ACAATACCTT CCATAAGATG GAAGAGTTGC AGGATGAAGT CGAAATTTTA 
TTGGATTTTT TGGCTGAAGA CGAGTCAGTC CATGATCAAC TGGTAGCGCA GTTAGCCGAA 
CTTGATAAGA TAATGACCAG CTACGAGATG ACTCTACTCT TGTCAGAACC TTATGACCAC 
AACAATGCCA TCTTGGAAAT CCATCCAGGT TCTGGTGGTA CTGAGGCGCA GGACTGGGGT 
GATATGTTGC TTCGTATGTA TACTCGTTAT GGTAATGCTA AAGGCTTTAA AGTGGAAGTG 
TTGCATTACC AAGCAGCTGA TGAGGCTGGT ATTAAGTCGG TAACTTTATC ATTTGAAGGG 
CCTAATGCCT ATGGTCTCCT CAAGTCAGAA ATGGGTGTTC ACCGCTTAGT GCGAATCTCA 
CCATTTGACT CTGCCAAACG TCGCCATACC TCTTTCACAT CTGTAGAAGT GATCCCAGAA 
TTGGATGATA CTATTGAAGT GGAAATCCGT GAAGATGATA TCAAGATGGA TACCTTCCGT 



5580 
5640 
5700 



5820 
5890 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
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TCAGGTCGTC CCCGTGGACA AAACGTCAAT AAGGTTTCAA CAGGTGTACG TTTAACCCAC 6600 

ATTCCAACTG GAA7TGTTGT CCAATCAACA GTACATCCTA CCCAGTATGG AAATAGAGAT 6660 

CGTGCCATGA AGATGTTGCA GGCTAAGCTC TATCAAATGG AGCAAGATAA GAAGGCTGCG 6720 

GACGTAGATT CTCTCAAAGG TGAGAAAAAG GAGATCACTT GGGGAAGCCA AATCCCTTCT 6780 

TATGTCTTCA CGCCTTATAC TATGGTAAAA GATCACCGAA CTAGCTTTGA GGTTGCTCAG 6840 

GTAGATAAGG TTATGGATGG GGACCTAGAT GGTTTTATCG ATGCTTATCT CAAGTGGCGA 6900 

ATTAGCTAAG ATAGAAAGGA ACTCACATGT CAATTATTGA AATGAGAGAT GTCGTTAAAA 69 60 

AATACGACAA CGGAACAACT GCTCTACGCG GTGTTTCGGT TAGCGTTCAA CCGGGGGAAT 7020 

TTCCTTACAT CGTAGGACC7 TCAGGAGCAG GGAAGTCAAC TTTTATTCGT TCTCTGTATC 7080 

GTGAAGTAAA AATCGATAAA GGAAGCCTAT CAGTTGCTGG TTTTAATCTG GTTAAGATCA 7140 

AAAAGAAAGA TCTCCCGCTT CTACGTCGTA GTGTTGGGGT TGTCTTCCAG GATTATAAAT 7200 

TCTTACCAAA GAAAACTGTC TATGAAAATA TTGCTTACGC TATCGAACTA ATCGGGGAAA 7260 

ATCGCCGTAA TATCAAAAGA CGAGTGATGG AACTTTTGGA CTTGGTTGGA TTGAAGCATA 7320 

ACGTTCGTTC TTTCCCAAAT GAACTCTCAG GTGGGGAGCA ACAGCGGATT GCGATTGCGC 7380 

GTGCAATTGT AAATAATCCC AAAGTATTGA TAGCTGATGA GCCAACAGGA AATCTGGATC 7440 

CGGATAATTC ATGGGAAATT ATGAATCTCT TGGAACGGAT TAACyTACAA GGAACAACTA 7500 

TTTTGATGGC GACTCATAAT AGCCAGATTG TAAATACCTT GCGCCACCGT GTCATTGCCA 7560 

TTGAAAATGG CCGTGTCGTT CGTGACGAAT CAAAAGGAGA GTATCGATAC GATGATTAGT 7620 

AGATTTTTTC GCCATTTATT TGAAGCCTTA AAAAGTTTGA AACGAAATGG TTGGATGACA 7 680 

GTAGCTGCTG TCAGTTCAGT CATGATTACT TTGACCTTGG TGGCAATATT TGCATCTGTT 7740 

ATTTTCAATA CAGCGAAACT AGCTACAGAT ATTGAAAATA ATGTCCGTGT AGTAGTTTAT 7800 

ATCCGAAAGG ATGTGGAAGA TAATAGTCAG ACAATTGAAA AAGAACGTCA AACTCTTACA 7860 

AATAATGACT ACCACAAGGT ATATGATTCT TTGAAGAACA TGTCTACGGT TAAAAGTGTT 7920 

ACcrrrrcAA gtaaagaaga acaatatgaa aaattaaccg agataatggg agataactgg 7980 

AAAATCTTTG AAGGAGATGC CAATCCTCTC TATCATGCCT ATATTGTAGA GGCAAACACT 904 0 

CCAAATGATG taaaaactat AGCCGAAGAT GCTAAAAAAA TTGAAGGTGT CTCTGAGGTT 8100 

CAAGATCGCC GTGCCAATAC AGAAAGACTC TTCAAGT7AG CTTCATTTAT CCGTGTTTGG 8160 

GGACTAGGGA TTGCTGCTTT GTTAATTTTT ATCGCAGTTT TCTTGATTTC AAATACCATT 8220 

CGTATTACCA TTATTTCCCG CAGTCGCGAA ATTCAAATCA TGCGCTTGGT CGGACCTAAA 8280 

AAC AGITATA TCCGTGGACC GTTCTTGTTA GAAGGAGCCT TTATCGGTTT ATTGGGAGCT 8 340 
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ATCGCACCAT CTGTTTTGGT CTTTATTCTT TATCAAATTG TTTACCAATC TGTCAACAAA 8400 

TCGTTGGTAG GGCAAAATCT ATCCATGATT AGTCCAGATT TATTTAGTCC GTTGATGATT 8460 

GCCCTACTAT TTGTGATTGG GGTTTTCATT GGTTCATTGG GATCAGCAAT ATCCATGCGC 8S20 

CCATTCTTGA AGATTTAGGT AAAATAGCTG CTTTTATGAG GAGATTGTAA AATCTCCTTT 8580 

TTTGCTACAA GAGTTTTTGA AAAGAGATGC GCAGAAGAAA AGAGCTTCCA AAGAAGTCCC 8640 

CCAGAGAAGA CTTC 8654 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19718 base pairs 

(B) TYPE: nucleic acid 

(C) " STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

TGTCGCGTCA AAATCATTAC TATGGCTATG TATAGCCCTT ACTATGACTT GGCTAAACAC 60 

GTTCGCTTTC AAATTTCTAG GCTCAGCCTG AAACAGTCTC CCAGGCTGTT CACTCCCGAA 120 

TCCTAAAATC GTTCTTGATC GCTTTCACAT TCTACAACAT CTTAGCCGTC CTATGAGTCG 180 

TGTGCATGTC CAAATCATGA ATCAGTTTCA TCGAAAATCC CATGAATACA AGGC7ATCAA 240 

GCGCTACTGG AAACTCATTC AACAGGATAG CCGTAAACTG AGTGATAAGC GATTTTATCG 300 

CCCTACTTTT CGCATGCACT TAACAAATAA AGAAATTCTT GACAAGATTT TAAGCTATTC 3 60 

AGAAGACTTG AAACACCACT ATCAGATCTA TCAACTCTTA CTTTTTCACT TTCAGAACAA 420 

AGACCCTGAG AAATTTTTCG GACTCATTGA GGACAATCTG AAGCAGGTTC ATCCTCTTTT 480 

TCAGACTGTC TTTAAAACCT TTCTCAAAGA TAAAGAAAAG ATTATCAACG CCCTTCAACT 540 

ACACTATTCT AATGCCAAAC TGGAAGCGAC CAATAATCTC ATCAAACTTA TCAAGCGCAA 600 

TGCCTTTGGT TTTCGAAACT TTGAAAACTT CAAAAAACGG ATTTTTATCG CTTTGAACAT 660 

CAAAAAAGAA AGGACGAAAT TTGTCCTTTC TCGAGCTTAG CTGACTTCAA CCCACTACAG 7 20 

TTGACAAAGA GCCTAATTTC CATAAAAATT GACATGGAAA TTATAAAACC ATTACTAGTT 7 80 

TAGTCCTTTT TGATAACGTG CCAATTCGGC TTGGTTCGCC CAAACATAGT GACCTGGACG 840 

GATTTCTACC ATAGATGGCT TATCAGTCTC ATAGTCGTGT TGACTTGGAT CGTAAACCTT 900 

CAAGACCTTC TTACGTTCCA AGATTGGATC TGGGATTGCT ACCGCTGAAA GCAAGGCTTG 960 

AGTATATGGG TGAATTGCAT TGTTAAACAA TTC7TCTCTT TCTGCAACCT CTACAATAAC 1020 
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ACCCTTGTAA ATAACTGCGA TACGATCTGA AATAAAGCGA ACAACCGACA ACTCATGGGC 1080 

GATGAAGAGA TAGGTCAGGC CGAGCTCTTT TTGGAATTTT TTGAGCAAGT TCAAGACTTG 1140 

GGCACGTACA GAAACGTCCA AGGCTGAAAT TCGCTCATCT GCAATAACAA AGTCTGGTTG 1200 

CATGACCAAG GCACGGGCAA TACCGATACG TTGACGTTGA CCGCCTGAGA ATTCATGACG 1260 

GTAACGAGTC AAGTGCTCAG CAAGAAGACC TACTTCACGG ATAATATTTT GAACTTTCTC 1320 

TTTACGTTCT TCTTCATCCT TAAATAAACG GTGATTGTAA AGACCTTCAG AAATAATATA 1380 

ATCAACAGTC GCACGTTCAT TCAAACTTGC GGCAGGGTCT TGGAAAATCA TCTGGATTCG 1440 

ACGAATCAAT TCCGCAGCTT CTTCACGCGA TTTCTTACCA TTAATCTTTT G AC CATC AAA 1500 

AATGATATCT CCATTACTTC TATCATTTAG ACCGATGATA GCACGACCAA TACTTCTTTT 1560 

CCCACTACCG GACTCACCTA CAAGCGAGAA AGTTTCTCCC TTGTTGATAA AGAAGTTAGC 1620 

ATTTTTAACC GCGACAAACT TCTTACTTCC TTCACCGAAG GAAATTTCTA AATCTTTGAT 1680 

TTCTACTAAT TTTTCAGACA TTTCCTTCCT CCTAGTCAGC CAGATGGGCA AATCCCATTT 1740 

TTTCACGGAT CTTATCATGG AGATTTGCAA TCACAGCTGG TTTTTCTACT TTCGCAGCAT 1800 

CCTCATCAAG AAGCCAAGTT TTAGCCCAAT GTGTCTCTGA TACTGAGAAT TGAGGAGCTT 1860 

TTTGTTCGAA GTCAATCTGC ATTGCGTAGT CAGAACGCAA GGCAAAAGCA TCCCCTTTCA 1920 

GGTCAGTATA AAGTGACGGA GGTGTTCCTG GGATTGAGTA AAGATCCCCT TTATCATCAG 1980 

CAAGCTGAGG CAAGCTAGAC AACAGACTCC ATGTATATGG ATGGCGAGGG TCATAGAAGA 2040 

CTTCCTCAAC CGTTCCATAC TCAACGATT7 CTCCTGCATA CATAACCGCT ACCTTATCCG 2100 

CAATACTTGC CACCACACCA AGGTCGTGGG TAATAAAGAT TCTTGTGAAA TGATACTCGT 2160 

TTTGTAAAGA TTTTAGCAAA TCAATAATCT GACCTTGAAT AGTTACATCC AAGGCAGTTC 2220 

TTGGCTCATC ACAGATCAAG ACATCAGGTC GGCAGGCAAG CGCAATAGCA ATAACGATAC 2280 

GTTGACGCAT TCCTCCAGAA TATTGGAATG GGTATTCATT AAAACGTCTA TCTGCGTCTG 2 340 

GAATGCCAAC CTTATTCATG TAGTCAATGG CCAATTCTTT CGCTTCTTTA GCTGTTTl'TC 2400 

CTTGCTGTTT TACAATAACT TCTGTAATCT GACTACCAAT TGTTTTAATG GGGTCCAAAC 2460 

TAGTCATTGG GTCCTGGAAG ATAGTCGCAA TCTTAGCACC ACGAATTTGT TCCCAATCCT 2520 

TGTGAGAAGA TAAAGCTGTC AAGTCCTGAC CACGGTAGTC AATACTACCT TGGGCAATAC 2580 

GACCATTTTC TTCGAGCATA CCTGTGAAGG TCTTTGTCAA AACAGATTTA CCTGATCCTG 2640 

ACTCACCTAC CAAGGCTAAT ACTTCTCCTT CGACTAGTTC AAGGGAAACG CCGCGAATGG 2700 

CTGTCAATAC TTTGTCACGA ACGTCAAATT CCACGACAAT ATCGCGAGCA CTCAAAATTA 2760 

CATTTTTTTC TTTTGTCATT TCTACTCCTA TCTATGTGTA CGTGGATCAC TAGCATCCGC 2820 
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TAAGTTTTGA 


CCAACTACGA 


AAAGGGACAA GGATACCAAG ACAAGCGTTG TCAATGGAAT 


2330 


CCAGAACAAG 


TAAGCATTGG 


TTGTTACGTT 


TTGTGAATAA 


TCCGAAATCA 


> AACGACCCAA 


2940 


ACTTGGCACT 


GTAATCGGTA 


ATCCAAGACC 


GAAGAAAGAC 


AAGAAGGCTT 


CCTATCAGAT 


3000 


AAAGCTTGGA 


AGCATTTGAG 


TCATGGTTGT 


CACAATAACA 


GATACCAATT 


GAGGCATGAT 


JwwW 


ATTTTTGGCA 


ACAATCTTCA 


AGGTTGGTGT 


TCCCAAAGTA 


CGTGACGCCA 


AGTTGTATTC 




CAAGTCACGA 


TAGCGCAAGA 


TTTGCACACG 


GATCATGAAG 


CCAATACCAA 


TCCATGTTGT 


J13U 


TACGCTCATG 


GCAAAAATCA 


GATTCCAGAA 


TCCAGCTCCC 


ATTGAGTAAG 


TCAAGACAAT 




AACAATCAAA 


AGAGGTGGGA 


TGTTTGAGAT 


GACGTTGTAA 


ACTTCCATCA 


TGACACGGTC 


3 3 00 


AACTGATTTT 


GAAATACCCC 


AAATACCACC 


GACAAAAACA 


CCGATAACCA 


AGTTAATCAC 


3 360 


TGTCGCAATC 


ACAGAAATGA 


GGATGGAGTT 


ACGAGCTCCG 


AACCAGACAC 


CGTCAAAGAG 


3420 


CGATTTACCG 


TTACTGTCAG 


TACCGAACCA 


ATGCTCCGCA 


TTTGGCTTGA 


TATAACGAAC 


3480 


ACTAAAGTCG 


TTTACCTTGC 


TGACATCATT 


GAAATCAAAC 


TTAGAAAACA 


TTGGGTAGAT 


3540 


GAAACTTATC 


AAAATGATGG 


CTACCAAGAT 


TCCCAACATG 


ACTACAGTTG 


ATTTTTTCTT 


t finn 

JDUU 


CATAAATTGT 


TTAAACACTG 


ATTTCCAGTA 


ACAATATCC7 


GGCGCATCAA 


TAGTTTCAGA 


3 660 


GGCAAAATCC 


TCACGTTTTA 


CAAACTGAAA 


TTTTTCTTTA 


TCGATTGTAG 


ACATTATTTG 


3720 


CCTCCTTTCT 


CAGTCAATTT 


AATACGTGGG 


TCAATAATAG 


TCATCCAAAT 


ATCTCCCAAA 


3780 


AGACGTGAGA 


ACATAGAAAT 


ACATGTAAAG 


ATGAAGACAA 


GACCAACGAC 


CATAGAGTTA 


3840 


TTAGATGCTT 


TTACAGAGTC 


AATCAACATT 


TTACCCATAC 


CTCCGAAGCC 


GAAGACTGTT 


3900 


TCAGTAAGGG 


TTGCACCACC 


GATAACCCCA 


ATAATGGCAG 


CAGGAATTCC 


TGAAACCAGC 


3960 


GGAACCATGG 


CATTTTTAAA 


GATGTGTTTG 


TTTGAAATTT 


CTTTTTCAGA 


CAAACCTTTT 


4020 


GCACGAGCGA 


AACGAACAAA 


GTCTTGAGAT 


TGCAAGTCAA 


TCATGTAACG 


ACGAATCCAA 


4080 


ATGGCTGTAC 


CAGGAGCACC 


CAACAAACCA 


AGGATGACTG 


CTGGTAAAAC 


GTAAGAACGC 


4140 


CAATCTCCAG 


CTCCCAAGAT 


AGGGAATGAA 


TCTGGAAGGG 


CAATAGATGA 


TCCAATCAAT 


4200 


CCAACGATGT 


AAACCAAGGC 


AATCGTTGGA 


AGAGCAAGCA 


AGAAGGTCAA 


AGCCCCTGTT 


4260 


GAGAGGCTAT 


CAATCCAAGT 


GTTCTTGAAA 


CGAGCCATGG 


CTGAACCAAG 


TGCCACGGCA 


4320 


AGAGCATAGG 


CAACAACCAA 


ACCAATCAAA 


CCAGTAATAG 


CAGAGCTGAC 


AATCATAGAT 


4380 


GGATATTGGT 


AATTACTTTC 


AGTCGCTGTA 


TAAGGATCAT 


CTTTCCCATA 


GCTAGCTACT 


4440 


TCACGAGAGT 


CAGCCTGACT 


AGGTGACTTG 


TAGG7TCTTG 


AGTAAATATT 


TACAGAAGAC 


4500 


GTTTTCTTAC 


CTGTTGGGAA 


CTGAACTTGG 


GCAGTTTTGG 


TTTGTCCTTG 


ACCTTGAGTA 


4560 
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ATAACCTGAA 


GAACTGGTGT 


ATTAGCATAG 


GTTGGGTAAG 


AGTCACCTAA 


ATTCAAGTTC 


4620 


ACAAAGTTTT 


GATGAACAAA 


TGGGAACTGA 


CTGTTAAAGT 


ACAAGAGATA 


TTTATGTTTA 


4680 


GTTCCTGAAC 


CGACCAATGA 


CCATCCGATA 


GCTGGATCAT 


TTTCAAAACC 


AAGGTAGCGT 


4740 


TTCAAGTCTG 


GATTTTCAGG 


GTCTTGGATT 


TTATTTGTAT 


GGTCAATGTC 


AATCAAGTTA 


. 4800 


GCATAGAAGT 


GAAAAACACG 


TTCAAAAATT 


GGAATTTCAC 


GAGTAGCATA 


GAATTGACCA 


4860 


CTTTCAGTAA 


ATTCTCCCAA 


AGTCCAACCA 


TGACCTAATT 


GATTGATGTA 


CTTTTCATAA 


4920 


ATAGCTTTAT 


TGGTCGCATT 


TGCTTCTACT 


GTTACAGAAG 


AATCCATGCT 


ACTTGCCTTT 


4980 


TCTTGCAACT 


CTTTAGTATC 


GTAATACTCA 


ATGTAGCCCA 


TACGCTCAAA 


CACAGTATTT 


5040 


TCATAGTTAT 


CACGTTTATC 


AGCCGTTGTC 


GCAATTTTAT 


TATAGTTAGG 


ATCCTGCTTG 


5100 


AAAATCAATT 


TTCGAGGAAC 


CAAGGTATAG 


ATAATCGTGT 


AGGTCAAAGT 


CGTTACTAAG 


5160 


AAAATCGAAA 


CCAATCACCG 


CAAAACACGC 


ATAAAAATAT 


ATTTTTTCAT 


ATTATTTCCT 


5220 


TTAAAAATCC 


CAAAAGAACC 


TTCTCCTCAT 


GGAGAGAAAC 


TTCTATTAGA 


AATTATTTAC 


5280 


TTCACATGAC 


TTGCCAATTC 


TTTTTGAGCT 


TTCTCATTTG 


ATTCAGCTTT 


TTCTTTCAAC 


5340 


CATTTTTCAC 


GAGCTTTTTC 


ATACTCTTCC 


TTAGTCACCA 


CTTTATCTTG 


TGATTTCAAA 


5400 


TATTTGAAGT 


AAACATCTGA 


CCCCTTAGAG 


CCTGTTTGCG 


CAGAAGCTCC 


AGTAAATGGA 


5460 


ACAATTCGTG 


AAAGCACTGG 


TGCTGCACCA 


GAAGAAGCCA 


TAGCAGGAAT 


AAAGAGTGAA 


5520 


CTATCTGTCA 


ACCATGCTTG 


AGCCGCTGCA 


TATTTTTCAT 


AACGGACATT 


CAAGTCGCTT 


5580 


GTCrCTCTGG 


CAGCTTCATC 


AACTAATTTA 


TCGTATTCTT 


TCAAACCAAC 


TTGAACTACT 


5640 


GAAGGCCTAT 


TTGGATTATC 


AAATCCTAAA 


TATGTTTTTG 


TAGTTTCACT 


GCTAGTTGTT 


5700 


TTTAAAATAT 


CCAGGTAAGT 


AGATGGGTCT 


TGATAGTCTG 


GCCCCCATGA 


AACTCCTCCT 


5760 


GATACATCCC 


AATCCTCAGA 


TGAAGCATTG 


GCAGCATAGT 


AAGTAATATT 


AAGGAATTCA 


5820 


7CACTTGTCA 


TTTGTTGAAT 


ATCAACAACG 


ACATTTTCAA 


CACCAAGAAC 


TGTTTCTACA 


5880 


GATTGTTTAA 


AGGACTGAAT 


ACGAGATATG 


TAGTLTrrTG 


ATGCTTGGTC 


TACTGGAACG 


5940 


TCCAGATGAA 


TAGGAAACTG 


AACGCCGTCT 


GCTTCTAAAG 


CTTTCTTAGC 


TTTCGCAAAC 


6000 


TCTGCCTTGG 


CCTTGTCAGC 


ATTGAATAAA 


CCATCCTGCC 


CATCAGCTAA 


ATTCACACCT 


6060 


TTCCACTCAT 


CACCATAAGC 


AGGAAGTTGA 


GCAGCGACTA 


AATCACCAAA 


GGTCTTCTCA 


6120 


CCAGCTCAAA 


CAAAGTCTGG 


TTTTACAAAT 


AAATTACGAA 


CTGCTAAAGC 


TGCTCCATCT 


6180 


TTACCATTGA 


TTTGAGCTGA 


GTAAGCTGAG 


CCATCAAGAG 


CAAAATTCAA 


GGCTTGACGG 


6240 


AAATCTTTGT 


TAAGCAATGC 


CTTCTTAGTA 


GCTACTTTCT 


CTGAATCTGT 


AGTTTTAGAA 


6300 


GTATACTTGT 


AACTTTGGCG 


ATCAATATTC 


ACACCCAGAC 


CAGCAATCCC 


AGAGCCTGAT 


6360 
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TGTGTGTAAT 


AGATATTGTC 


CTTGTATTCT 


TCTGCAACCT 


TAGAATAGTT 


GGACCTGGTA 


6420 


GGGTAAAGAC 


GCGCATAACT 


ATAAGCTCCA 


CTAGTGAAGT 


TACGCTCTAG 


CGACTCCTGA 


6480 


TCTGATCCAT 


CATAGTAAGC 


TAGATTCATA 


GTATCTAGGT 


GGACATTTTC 


TTTATCCCAA 


6540 


TATTGCTCAT 


TTTTTACAAA 


CTCTACAGAA 


GATTTTGCAG 


TCAACCCTTT 


CAACAAGAAT 


6600 


GGACCATTAT 


AAAGCAAGGA 


TGTCGGATCT 


GTTGGTTTAG 


CAAAATCGCT 


TCCTTTTGAT 


6660 


GTTTCGAATT 


-CTTCATTCAG AGGCCAGAAA 


ATAGAATAGC 


TCAACTTAGA 


GTTCCAGAAC 


6720 


CCTTCACGCT 


GGTTCAAAGT 


GTATTGTAAC 


GTATAATCAT 


CAACCGCCTT 


GACACCAACT 


6780 


GTTGAAAAAT 


CTGTTGAAGT 


TCCTGATAGA 


TAATCTGCCA 


AGCCTTTAAC 


CGAATTTTCA 


6840 


GCTAAATACA 


TAGCTTCTGA 


TTTTTTATCT 


GCTGCGTGTT 


TTAAACCGTT 


CACGAAATCT 


6900 


TTAGCCGTCA 


CCTCTGCATA 


TTCTTCTCCA 


TCAGAGGTAA 


ACCATTTAAC 


CCCTTTACGA 


6960 


ATCTTATAAG 


TGTAGGTCAA 


ACCATCCTTA 


GAGACTTCCC 


AATCCTCTCC 


AACTCCAGGA 


7020 


GCAAGATTAC 


CGTAATTATC 


GTTAGTGAAT 


AAACCATCAA 


TCCCATTTGA 


AGTCACTACT 


7080 


GTTGTACTAT 


TTTTACTTGA 


AATCAGGTAG 


TCCAAGGTTT 


CTGGGTCTGC 


TCTATAAACA 


7140 


TAGCCATAAG 


CTTTAGCCCC 


TGATGAATCA 


GATGATTTTG 


AAGAACTGCA 


TCCTGCAAGT 


7200 


ACACCTGCTG 


CTAATAAAAC 


AAGACCTGCT 


CTAGCAAATA 


CACGATTTTT 


TTTCATTTTC 


7260 


TACTCCTCTG 


TTTATGTGAA 


TTATAGATTG 


ACAACCATTA 


TATCACATTA 


TCCATTAAAA 


7320 


ATCAAACAAA 


TTTTCAGAAT 


ATTTAGGCTT 


GTTGGCACAA 


ATTTTTCATT 


TTTTTTGAAT 


7380 


ATATGATTCA 


AATTGTCCTT 


CGAAGTGTCA 


AACACTACAG 


TGAAAATAGC 


AAATTTGACG 


7440 


CAGAAACTTT 


GGACTTTAGG 


AAGACATACA 


GTAAAATGAA 


ATACGGACGG 


AACAATGTGA 


7500 


TTTTCGAATT 


CAAATTAAAT 


TATAACAATA 


TTGTAGAAGT 


ATCATTCTAG 


TATTCAAGAT 


7560 


TCAGTTTACT 


ATGTCTTTTC 


ACACCAACCT 


TATCCCGAAT 


TCAATTACTT 


TTGTGATTTA 


7620 


TAT AT AT Aft A 


TTAAGACTAT 


CTTTTATACT 


TTAAAATTTC 


l UUL. 1 AL.U 1 r 


A rLL At. * A TA 


/ ooU 


TGCTCCTCGC 


TATCACGTTT 


CTATTCATAG 


CCTACGATTT 


CACTATTGCT 


TTCTCTGACA 


7740 


ATTCTTATTT 


CCTGCGTCAG 


ACTTAAAACG 


ATCTATCCCC 


AGACCATTTT 


AATCCGCTAC 


7800 


CTCACGATAG 


TCAGGCTTGG 


GGAGCGCTAT 


TGTATTCACC 


GGTAGTGGAG 


CCCTACAGAG 


7860 


GACTTACACC 


TCAGATGCAC 


GACATGCCCA 


TCGTATAAAA 


AATCTCCTAC 


CCAAGGTAGA 


7920 


AGATTTCAAA 


CTTATAAAAC 


TTAATCCGTC 


ATGTCCGATA 


CCAACATTCG 


ATGCTCCAAT 


7980 


CGAATACTCC 


ACATAACTAG 


CAACAAAATA 


AAGCCTGACT 


GAATCCAGAA 


GAGAGCCAAG 


8040 


TCAAAAATTC 


CGTGCACAGC 


AACCACTGTA 


AGGAAAGATA 


GATAAAGGCC 


GATAATCGGA 


3100 
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CCTTTCCCCG 
ACTAATAAAA 
GCATGTTCAT 
TCACCCCAAA 
TCTTCCATAG 
CTCAAACCAA 
GTAAAGAGAT 
AACTTCAAAC 
AACTTGGTCG 
TAATAATTAG 
GGTGAAAGAA 
GCAGCTGACA 
GATAAAATCG 
ACAAGACCCA 
AGCAGCATCG 
AAACTGATAA 
AGGATAATTC 
CCTATTGATT 
TATCCTATAT 
AAAACCCTAT 
TTACCCGATA 
TATTTTGACG 
CTAGATCAAA 
CAGGTCGTTG 
GACCTTCTTT 
AGTAAATCTG 
CATCATCATC 
TATACTCGGC 
AAGCATAGTC 
CATGGGTATC 



ACTCCTGACT 
TAGTCCCCAC 
CATAAGGAGC 
AAGGATTTTG 
AAGAGTCTAA 
TCGCGAAGAC 
AGATAATTGC 
CAAAGAGATT 
TTGTAAACAG 
GATTAAAGAA 
AAGCATAGTT 
AGACACTACC 
ACTCATAGTG 
TCCAATTTTG 
GATGCTCCCC 
TATATAAACA 
CCAATAAAAT 
TCAAAATGAA 
TCTACCACTT 
TTTATCCACC 
AGTCTGATTT 
TTTGATTACA 
GGCTTCTGAA 
CACTTCTTTT 
AGTTTCTTCT 
AGGAATACCT 
TCCAAGCGCT 
ACTAGAGTAC 
AATCTCCTCA 
TAGCGTCGTA 



CATATCCATC 
AATTCCGTAA 
ATGTATCCGA 
CTTAAACAAG 
AGTACCCATT 
CCCAATACTA 
TCCAGCGATA 
AACAAAGCCT 
ATAGAAACCA 
GGTCACTTCT 
AAATTTCTTC 
AAACAAGACA 
CAAAAAGAAA 
TGCAAGAATG 
CATTTTCTGA 
GAGTACAACT 
CAGCTCTTTA 
TCCTTTCTCT 
TTTTAGCAAT 
ATATCAAGGC 
TGCAAATCAA 
ATGCTGTGAG 
CGGTTACGGA 
GCTATTTCCT 
AATAATTTCA 
GGGGCAAAAG 
GAATAGTAGG 
TTACGTTTGA 
TCGGTCAGGA 
AATTGCTTCA 



756 
ATCAAGCGAA 



CTCAGAATCG 
GCATAAGACT 
GCCATCCCAG 
CGAACTCCCA 
AGCCAAAAGG 
ATAGCAGGAA 
GCAATCACAC 
ATCATAATAC 
GCCCGGTTCT 
ACAATTTGGA 
AACTCCAAAA 
ATAGTAAATA 
GATATAACAG 
AGAATACTTT 
ACAAAAAGAT 
CTAGACAACC 
CCAAATCAGC 
TTGAAAACAA 
TTCAAAATGA 
TTTCTACTAC 
CTCTTGGTGT 
AAGAAAATAG 
CGTTGCTATA 
AGTCATTCTT 
CTTGAATTAG 
TTGAATTGAT 
CATTGGCTCC 
TATCCTTGAC 
TCGGGCTCAT 



CAGGAGCAAC 
TATCAATATA 
TCATATAGGT 
CATCCCAGAT 
AATCACTAGA 
CCTTCCAGTT 
AGGCAGTTCG 
AGAATACTTT 
AGAAACAACA 
GATGCCACAC 
AATGTTCTAA 
TCAATCGAAA 
GAAACATTCC 
TACTATAGCT 
TCATGTCTCC 
AAAAGGGTAA 
CCTTCAGCTT 
TGATTCAGAT 
AGGAAACGTT 
TACTTCAACT 
TGCTGTTACG 
CTCTATCTCA 
ATTGAGAAGG 
GTAATGACGA 
GCCTGCTAAT 
ACGAGCGAGA 
TTGGTAGATA 
AACCTTATAG 
ATCTACTACT 
CTTTAACCAC 



AGAAGACAAA 
AAGACTCTGC 
CAATGGCCCT 
AGAAATGCGT 
AAAGAGGAAA 
TTTAATAGTC 
ATTTTGAGTA 
CAACCAATTC 
AATAATTCCA 
CTGCATATTG 
ACTGGCAAAA 
GAATTTATGG 
TACTGAAGCC 
AAGAAAAACA 
TGTAAAAATC 
AAAGATACTC 
TTCAATAAAG 
AATAGTAAGC 
TTCCAAAATA 
CCATTCTCAA 
GACTTATCTT 
GTAGTCCCTT 
GCCTTCACAA 
TTAATATTTC 
AGACCCACAT 
AAATACTTGA 
TCTAAGTTGT 
AGTTCATTTG 
CCAATCCCAT 
TTAGCCAAAC 



8160 

8220 

8280 

8340 

8400 

8460 

8520 

3580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 
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GCTCTGTTCT GGAACTGTAA AGAGTATAAA GTGTCACCAT TGGAAGAGCA AAATCATAAA 9 960 

CATAG7AATC ATGCTCTGCT ATTTTAAACT GAATCGAATA GTCTTCATCA ATCTCAGGTA 10020 

AAAGCTCTCT CCCATACTCA GCAGCGATAT CTCGAACTTT GTCCAATAAA TCCCAAATAT 10080 

CTGCTTCCAC AAAGAAATCA TTAGTATCCA ATTTCTTCAC TGCATAAGCA AAGGCATCTA 10140 

GACGAATCAA ATCACACCCA TTACTTGCCA AGTCCTGAAT GCTCTTACGG ATAAATTCCA 10200 

~ TAGTTACTTC TTTGGTCACA TCAAGATCAA TCTGCTCCTC ACCAAAGGTA TTCCACAAAT 10260 

GTTCCACTGA ACCATCTTCA AACACAATCT CTTGCTTTGG TGCACGATCC TTACGCTTGT 10320 

AAATTAAATC TACATCAGAC TG7GTCGGAC GGTTTTCTGG CCAAAACTTA TCCCAGTTTA 103 80 

AAAAGAGAGC TTTAAATTCA CTGGCTTCAT GTTTTTCTTG ATAGTCCTTA TAATACTTGG 10440 

ATTGACGAGA AATATGATTA ATCATAAAAT CAAACATAAG ATAATATTTC TCACCTAAAC 10500 

GCTTCACATC CTCCCAATCA CCAAAAGCTG AGTCCACTTC GTCGTAGTCA ACTGGCGCAA 10560 

ATCCACGATC AACTCTTGAT GGGAAAAATG GTAAAAGGTG AACTCCTCCA ATACCATCTC 10620 

CAAAATGCTC TTCCAAATTA TCATATAAGT CTTTAAGATT ATTTCCAAGG CTATCAGAAT 10680 

AGGTAATCAA CATGGTTTTA TTTTGAATTG GCATCATTAC TCTCCTTTTT CTAATTGAAG 10740 

CCAAGTCTCA TATGATCTGG CTTCATAAAT AAAATTCATT TTAAATCTCT ATTTATCATC 10800 

AAACTCGTAC TAATATAGAC TGTGATAAAC AAAGTACTAC TTTCTTGTTT TCTGCATAGA 10860 

ATTATCAACA AGCTAAACTC TTCCTCTGTG TCAAAGACTA TAGATTCCAT GAGCTCTTCT 10920 

TATACTCTTC GAAAATCTCT TCAAACCACG TCAGCTTCAC CTTGCCGTAG GTATGGTTAC 10980 

TGACTTCGTC AGTTTCATCC ACAACCTCAA AACAGTGTTT TCAGCAACCT GCGGCTAGCT 11040 

TCCTAGTTTG CTCTTTGATT TTCATTGAGT ATTACTTCAC TGCCCCGTTG CTCATTCCTG 11100 

AAATGATATG GCGTTGGAAG AAGAGATAGA CAATCCTGAT ACTGATAATG CCGACCACGT 11160 

AAGAGGCAAA GCTTGCTCCG TAGTCGTTGA AATATTGGCC TGCGTAGTTG TATTGGAACA 11220 

AAGGCAGAGT CCACATTTTG GAATCCCGGT TCAAGACAAG GACTCCCAAC ATGAAGTCAT 11280 

TCCAGAACCA AAGGGCATTG ATGATCATGG TTGTCGCATG CATCGGTTTC ATCATTGGGA 1134 0 

AGATGATGCG GAAATAGGTT GTAAATTGAT TAGCCCCATC GATCTCTGCT GCTTCATCCA 11400 

GACTTTCTGG AATCGAGATT TTGATATAGC CAACATAGAG AAAGAGGGTC TGTGGAATCG 114 60 

CATAGGTCAA GTAGAGCAAG ATCAAACCAA ACGTATTAGC CAAACCGAGT TTACTCATCA 11520 

TAACCGTAAT CGGAATCATG ATGACTTGGA AAGGTACGAA GATTCCGAGG ATTAAGAGGG 11580 

TATACATGAT GGTAAAGGCT TTTCTTTTAC TCATATTGCG AGCGATGGAG TAGGCTGCCA 11640 



WO 98/18931 



PCT/US97/19588 



758 



TAGGGATAAA 


GATCATTACT 


GCAAGTAAAG 


ACAAGACAGT 


GATGACGACA 


GAGTTCCAAT 


11700 


AATAGCCTCC 


AATCCCATCA 


GCTAAGAGAC 


GGCTAAAGTT 


GTCCCATGTG 


AAGTTGGTTG 


11760 


GAAAGCCAAA 


GAAATTATCT 


ACAATATCCT 


TAGTCGGTTT 


GAAGGAACTA 


AAGAGGGTAG 


11820 


CAAGGAGCGG 


CACTAAAATC 


AGAACCGATC 


CTAGAATCAA 


TAGAATGTAT 


TTGCCAATCA 


11880 


GGGCTTTTCT 


TTCATCTTGT 


TTCATCATGC 


TTCTCCTCTT 


AAATTTCAAA 


TTTCTTAGAT 


11940 


ACTCTCAATT 


GGATGATCGA 


AATCACTACA 


ATTAAGAAGA 


ACAAGATTAC 


GGCAATGGCA 


12000 


TTGGCATAAC 


CGAATTGGTT 


GTTTTTAAAG 


GCATAGTTAT 


AAACCAAGAG 


CCCAAGTGAG 


12060 


GTTGTGGCAT 


TGTTTGGACC 


ACCACCGGTC 


ATGGCAAAGA 


CTTGGTCAAA 


GGCAGTCAGC 


12120 


CCACCTTTTA 


GGGCTAGGAT 


AAAGACCATA 


GAGACACTTC 


GTAGCAAGTA 


AGGCAATTCA 


12180 


ATGTTCCAGA 


AAACTTGCTT 


GCTAGTCGCA 


CCATCAATCC 


TTGCTGCCTC 


TCTAATCTCA 


12240 


GTTGGAATAG 


ATTGCAAACC 


AGCTAGGAAG 


ATGATGATGG 


GCATAGCCAC 


CCCTTGCCAA 


12300 


AGAAGGACAA 


AGACAGCCGC 


AAAGATTGCT 


CCCCACTTAG 


TCCCTAAAAG 


ACTGGTTTGG 


12360 


AAAAATTCAA 


TATGAAGGGC 


ATTTCCAATC 


GCTGGAAGAC 


CGTAGTTGAA 


GACTTGCTTG 


12420 


AAGATCAAAG 


CCACTGTCAA 


ACCAGATAAA 


ACAGCTGGGA 


AGAAGAACCA 


AGCACGGAAG 


12480 


AAGGTTTGGC 


CTTTGATTTT 


AGAATTCAAG 


ACACGCGCAA 


TGAAGATCCC 


GAGTGCAATC 


12540 


TCACCAACCA 


CCATGGCAAT 


CGCAATGATT 


GCGGTAAAGC 


CAATCGCATT 


CATGAATTTT 


12600 


GGATCCATGA 


AGAGGAGCTT 


AAAGTTGTTT 


AAGCCAACAA 


ATTTGTAGTT 


ATAACTCAAT 


12660 


CCTGTCCAGT 


TGGTAAAACT 


GTAAAAGGCT 


CCTTGAAACA 


TCGGCACATA 


GAAGAAAATT 


12720 


GCTTGTAACA 


AGAGGGGGAT 


GACCACAAAA 


CCCCATGCCC 


AATATTTTTG 


TAATACTTTT 


12780 


TTCATAGTCT 


CTCTACTCCT 


AATCCACATC 


CGCTTTCATC 


GGGTTAAAGA 


AGGCATTCAA 


12840 


ATCATTGACC 


ATGCCTTGTT 


TATCACCGGT 


CAAGACATAG 


TTCATGGTCA 


AGGTATGGAA 


12900 


GTCTGCTTCA 


CTCGTCCAGT 


ATTGTTGCAA 


CCAGACCAAG 


TGACGATCCG 


TAAAGGCATA 


12960 


TTCGGTCATA 


CCAGCAAGCG 


GTGAATCTTC 


TCCTGCTTGT 


TTGACCCCTT 


CGATCGCTGT 


13020 


TGGAGATCCG 


TCCACATCGT 


AGTATTTTTG 


CATCACTTCT 


GGACGGGTCA 


TATATTCCAC 


13080 


AAAGGCATTG 


GCTTCTTTTG 


GATGTTTGGT 


GGTGGCTGAG 


ATAGACCATG 


CCAAGTCTCC 


13140 


CGCACCAACG 


GTTAAGCTTT 


GTCCTTTTTC 


TTTTCCTGGA 


ATCATGAAGG 


TCCCAATCTT 


13200 


AAAGTTCGGT 


TTTTGTTCAT 


TAATCGCTGT 


GATCGCCCAA 


GACCCATTTG 


GTCTCATGAG 


13260 


GACATCCCCA 


CGTGCGAAGG 


CTCCGATAAC 


ATCGGTATAG 


CCAGCACCTT 


CCCAGTTCTT 


13320 


TTGCTTAGAT 


CCATTGATGC 


GAAGGATGTC 


CATGACCTTG 


ATATCATCTT 


TCATAATCGG 


13380 


ATCCGACAAT 


TTAATGGCAT 


TTGGTTGACA 


ATAACGAAGG 


TATTGATTTG 


CTTCTTTTCC 


13440 
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TCCACCTCTT GCTGTCGCAA AGGCTAATTG ATTGTAACCA TTGAGTGTCC AAGCATCTGC 13 500 

ACCTGCAATT CCAAATGGTG TTTGTCCTTT AGCAACGATA TCTTTGACTA ACTGTTCAAA 13 560 

TTCATCCCAG GTTTCAGGAA CCTTCAAGCC CAGTTCTTCG AATTTATCTT TGTTGTAGTA 13 620 

AATTCCATAA GCATTACCTG TAAAAGGAAC GTTGTAAACT TTTTCGTTTA CAGCATATTT 13 680 

TTCAGCGTAG CCATTTTTCA CGCCTTTCAG GTAGTCTTTG TTGCTCAAAT CTTCAAAAAC 13740 

ACCTGCTTTT GCCCATTCTT GCAGTTCGAT GGACTGTGGG TAAATATTGA CCACATCAGG 13800 

CACATCTCCT GCGAGAACGC GTGTCTTCAA TACTTCACCA GCATTTCGTA CATTGACGAC 13860 

TTTGACCTTG ATCTTAGGGT TTTCCTTCTC AAAATCACGA GTGATTTCTT CCAAGGTTTT 13 920 

GGTCATTTCT TTTTTCTGGT TGAAATACTC GATGGTCACT GTGCCATCCG CAGATTTACC 13 980 

ATAGTTGGAG CAAGCGCCGA GCCCAAACAA AGCTAAACCT GTAGTTGCAA GAAGTCCGAT 14040 

TTTTTTATAC CATTCCATTA GAAAGCCTCC TTTATAAATT TATACACCCT TATTGAACTG 14100 

CACCCCAAAA GTTAGACAGA ATAAATCTAA CTTTTGGGGT CAGTACATAT CATACTTTTC 14160 

TAAAAATATA CTGTCTACTC AAAAAATCTC CTTGGGATAA GATAACAGTT AAGCCCGCAT 14220 

ACATTAGTTC TGCACCTGAG TAAACTTCCC CATTTTCCTG TAATTTATAT ACTCCCTCTT 14280 

CATCCAAATC TTTTAATTTT AAAGTTGTTT CCATGGTCTC TACAACAGAT AAAACGCGAA 14340 

CGTAGGTTAC AATCGTTTGA TTTCCGTAAT TAAATTGTAC AGCTGCTTCA TTGGATACAG 14400 

TATCAGGATT AATTAGTCTA TACTGCTCTC CTAACTGAAC TACTGGTCGT AATTCTTTAT 14460 

ACAAGTTCAC CTGATTAGCA ATCGTAGCTT TCTCTTCATC TGATAAATTT GTCAAATCAA 14520 

GTTCATAGCC CAAATTTCCC ATCATTGCTA CAAGGCCACG TGTTTCTAAT GGTGTCATTC 14580 

GTCCCATCTG ATGATTCGGT ACTGCTGACA CATGAGCCCC CATAGAAATG CTTGGATAGA 14640 

GATAGGATGA ACCGTATTGA ATTGGTAAAC GTGCAATGGC ATCAGTATTA TCACTACCCC 147 00 

AGACTTGTGC GAAATAGCGC ATCATACCAA GATCATTTCG TCCACCACCA CCAGAGCACG 14760 

ACTCAAAGAG AATATGGCTG TGCTTCTCTG TCAGATAAGA AACGAGTTCA TAAAGCCCCA 14820 

CCATGTACTG ATGAGATTCC ATCTGTGTCT CTAGATAAGT TAATCCATTC CCTAGCTTAG 14880 

TGATATTGCG GTTCATATCC CATTTAATGT AATCAATATC ATGATAAAAT ACGAGTTGAT 14940 

CTAAGACACT TTTCAAGTAT TCTACTACCT GAGGATTGGC AAGATTAAGT ACTAATTGAT 15000 

TCCGAGAATA AGTATGCTCA TAGCCAGGAA CCTGAATAGC CCAGTCAGGA TGTTGACGAT 15060 

ACAAATCACT ATCTACAGAA ATCATTTCGG GTTCTAACCA AAGTCCAAAC TGCAAACCTC 15120 

TTTCATGGAT AGCTGAAATC AGACTTTCTA GACTTCCACC CAGTTTTTCC TCATTAACAA 15130 
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CCCAATCACC 


TAAAGCACGA 


TTATCATCAA 


AACGATTGCC 


AAACCAACCA 


TCATCTAATA 


15240 


CAAAAAGTTC 


AATCCCAACT 


TTCTTAGCTT 


CATC7CCTAA 


CTCTAACAGT 


TTTTCTCTCT 


15300 


GAAAGTCAAA 


GTAAGTAGCT 


TCCCAGTTAT 


TGATTAGAAT 


TGGACGTTCT 


TTTTTAGAAA 


15360 


ATTCACTTAG 


CATAATGTGC 


TTCAGTACAA 


AATTCTGACT 


TTCATGACTA 


ATACCAGTTA 


15420 


ATCCCTGATC 


TGAATGAGTC 


ACTAAAGCTA 


CCCGTGTTTC 


AAAGTATTCC 


TCAGGAGCTA 


15480 


ACTTCCAAGA 


AAAGTTTTCT 


GGATTAATGC 


CAATAGCCAC 


CCGAACTTCA 


TTCAATTGAT 


15540 


TTTTTTGAAC 


AAAAGCTTCA 


AACTTGCCAC 


TATACATTAG 


TTGAATAGCA 


AACACATTCC 


15600 


CAGCATCCTC 


TGTGACTCCT 


TCTTCGCATA 


GTAGAAGAGC 


TGGTGTTTGA 


GCATGACCAG 


15660 


AAGCACCTCG 


GTTTGAACTA 


ATCGAAAAGA 


TTCCTTGTTC 


TACCTGTTGA 


CGTCTAACAG 


15720 


TCTTTTCACG 


AGCATAAGCA 


CCCTGCAGAG 


TTACTATTTC 


GTAATCTGCA 


GCTGGAAAAT 


15780 


CAGCCATAAA 


AGAAAAATCT 


TTATGGATGA 


CAACTTCCTG 


ATT ACT ATT A 


TTATCTAATT 


15840 


TACTGTAGCT 


AGCAATAGTC 


GCATCATTAT 


TAAAAGTAGT 


ATAATACAAA 


GTCAGACTAA 


15900 


GTTGAGCCTT 


AGAATCTTCT 


AACATTAAGA 


CAAGAGTCTC 


TGTATCGTCC 


ATGCTATGTG 


15960 


GAGAAGCTAA 


GCCCTGTGGA 


CCATTCTGAC 


CTTTTAAAAT 


CTTTGCTTCT 


ACAAATCGAA 


16020 


AGTCTGTTAC 


TTCAGTTACA 


CTATGCTGAA 


CCTGTATGGT 


TGGTTTCCTA 


AAATCTCCTA 


16080 


AGCCATGTTG 


TCCAAAAATC 


TGTCGCTGAG 


TATCTAAACT 


AAAGGTTCGA 


TTAGTAGCCG 


16140 


TTGGATTTCC 


TGAAAAGGCA 


TCGTCTCGTT 


CATAAACACT 


ATTCGAACCT 


TTATAGTTCT 


16200 


TAATAGTCTT 


TCCTAAATGT 


TTCAAAAGTA 


AGTAGCCATT 


TCGATTTTCA 


ATAATCAAAC 


16260 


TTAGATTTTT 


ACTCTCAACA 


TAAAATAGAT 


TATTCTCTAT 


CCTAACTCCC 


ATTTACTTCA 


16320 


CCTCATCACT 


TTATTGATTA 


TATTTTATCA CCTGAAATCG 


CTTTCCAAAA 


TAGAAAAATG 


16380 


TCTCAAGAAT 


ATGGTAAAAT 


GTTAGGTAGG 


AGGTAGCACA 


TGTTAGTTTT 


TTCAGAATAC 


16440 


CAGACTGGAA 


CAATCGACCT 


TGCCCTAAGC 


TTTTATGGAT 


ATGAGGAATG 


CACACCTAAT 


16500 


TACTCTTTTG 


GTCCAGCCAT 


TCGTGATACA 


TACCTTCTAC 


ATTACATTAC 


TAAAGGAO.A 


16560 


GGAAAATTTC 


ATTACAAGGG 


TAAAATTGTT 


GATTTAAAAG 


AAGGAGATTT 


CTTTCTATTA 


.6620 


AAACCAGAGG 


AACTAACCTT 


TTATCAAGCA 


CATAGTAAAG 


AACCTTGGGC 


CTACTACTCG 


16680 


TTAGGAATCA 


CTGGAGGGAA 


AGCCCCTGAT 


TATTTTGCTC 


TTTCCCAAAT 


TTCTGATCAA 


16740 


TCCTATCTCA 


TCCAATCTGA 


AACTTGTCAT 


ACCCAGACTA 


CTGCAAAACT 


CATCTCAGAC 


16800 


ATTGTCCGCT 


TCGCTCACAT 


TACAAAATCA 


AGTGAATTAG 


CTCAACTCCA 


TATCATGGGA 


16860 


CAACTTCATG 


AACTGATGTT 


TCATCTGGGA 


ACTATTGCTC 


CCAATCAGAA 


AAAAAAGAAT 


16920 


ATTTCATCAA 


CCCACCAACT 


CTATCTTGAA 


TGCAAACGAT 


TAATTGATAG 


CCACTATCCT 


16980 
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CAATCACTTA CAATTCAAGA TTTAGCAAAA GAACTATCCG TTCACAGAAG CTACTTATCA 17040 

AGCGTATTCA AAGAATTTAA TACCTTATCA CCCAAACAAT ACCTACTCTA CGTTCGAATG 17100 

CACCGAGCTA GACAACTTCT CGAAAATACC CAACAGTCCA TCAAGGTAAT TGCATACTCG 17160 

GTAGGTTTTT CAGATCCACT CCATTTTTCG AAAGCTTATA AACAATACTT TAATCAGACT 17220 

CCAAGTCATA CAAGAAAAGA ATACTCTCAA TACCAACTAG TAAGAAAGGC AACATTATGA 17280 

AATCCTACCA AGCTGTCTAC CAAATCCTAT CTAAAGAAAC CGACTATATC AGCGGAGAAA 173 40 

AAATCGCAGA AAAACTATCC CTAAGCCGAA CAGCAATTTG GAAACCCATC AAGCGACTAG 17400 

AACAAGAAGG CATTGAAATT GATAGTA7CA AAAATAGAGG ATATAAACTG ATGAATGGTG 17460 

ACCTTATTCT TCCAGAGATT CTAGAAGAAA ATCTTCCAAT TAAAGTCAGC TTTAAACCCG 17520 

AAACAAAATC AACACAACTA GATGCAAAAG AAGCAATTGA TTTAGGCCAT GAAGCAAATA 17580 

CCCTCTATCT AGCTTCCTAT CAAACAGCAG GCCGAGGCCG TTTTCAACGT TCCTTCTACT 17640 

CACCACAAGG TGGTATTTAT ATGACACTCC ATCTTAAACC AAATCTCCCC TATGACAAAT 17700 

TACCATCCTA CACACTACTT GTAGCTGGAG CTGTCTACAA AGCCATTAAG AACCTAACTT 17760 

TAATAGATCT CGACATAAAA TGGGTCAATG ATATCTATCT AAACAATCAT AAAATTGGAG 17820 

GAATCCTTAC TGAAGCAATG ACCTCTGTAG AAACTCCCTT AGTCACAGAT ATCATTATTG 17880 

GAGTAGGTAT CAATTTCACT ATTAAAGACT TCCCTCAGGA ATTAAAAGAA AAAGCTGCCA 17940 

GCTTATTTAA ACCTACAGCT CCTATAACAA GGAATGAATT GATCATAGAA ATCTGGCGTG 18000 

CTTTCTTCGA AACACCAGCA GAAGAGCTAT TATACCTATA CAAAAAACAG TCATTCATTC 18060 

TAGGAAAAGA AGTCACTTTC ACACTAGAGC AAAAAGACTA CAAGGGACTT GCTAAAGACA 18120 

TCTCAGAAAA TGGAAAACTT TTAGTTCAAT GTGATAACGG AAAAGAAATC TGGCTAAATA 18180 

GTGGCGAAAT TTCTCTCAAT AGTTGGAAGT AAAATAACAC AATTATAATA TAAACGATAT 18240 

AAAAATAACT TCAGATTAGT AATTCAATTA AGTTTTACGG ATCTCAAGTT TTATTGGCTC 13300 

TAAAAATAAA AAAGAGAGTT ACAGACTCTC ATTAAAACGG AGAATAAGGG ATTCGAACCC 183 60 

TTGCGCCAGT TACCCGACCT AACGATTTAG CAAACCGTCC TCTTCAGCCT CTTGAGTAAT 1842 0 

TCTCCAATTA ATGGGCACGA GTGCACTCGA ACCACCGACC TCACGCTTAT CAGGCGTGCG 13480 

CTCTAACCAC CTGAGCTACG CGCCCAAGTT AAAAAACTTG GTAATTTGAA CAAAGTTCAA 18540 

AGCGGGTGAC GAGAATCGAA CTCGCGACAA CAGCTTGCAA GGCTGTAGTT TTACCACTAA 18600 

ACTACACCCG CATAAATACT ATCAATAAAA TGGCGCGAGA CGGAATCGAA CCGCCGACAC 18660 

ATGGAGCTTC AATCCATTGC TCTACCAACT GAGCTACCGA GCCTTATTGC GGGAGCAGGA 18720 
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TTTCAACCTA CGACCTTCGG GTTATGAGCC CGACGAGCTA CCGAGCTCCT CCATCCCGCG 18780 

TTAATAATAT AAAAGGAGGA TGTGGGATTC GAACCCACGC ACGCTTTTAC ACGCCTGACG 13840 

GTTTTCAAGA CCGTTCCCTT CAGCCGGACT TGGGTAATCC TCCAATATTC AAATGGACCT 18900 

TGTAGGACTT GAACCTACGA CCACTCGGTT ATGAGCCGAG AGCTCTAACC AGCTGAGCTA 18960 

AAGGTCCGAC AAGATCATTA TAGCGGCGAA GGGGATCGAA CCCCCGACCT CCCGGGTATG 19020 

AACCGGACGC TCTAGCCAGC TGAGCTACAC CGCCATGAAT CGGGAAGACA GGATTCGAAC 19080 

CTGCGACACC TTGGTCCCAA ACCAAGTACT CTACCAAGCT GAGCTACTTC CCGAGTTAAA 1914 0 

TAGAAAAATG CACCCTAGAG GAGTCGAACC TCTAACCGCC TGATTCGTAG TCAGGTACTC 19200 

TATCCAGTTC AGCTAAGGGT GCTCCATATT ATGCCGAGGA CCGGAATCGA ACCGGTACCA 192 60 

TCGTTACCAA TCGCAGGATT TTAAGTCCTG TGCGTCTGCC AGTTCCGCCA CCCCGGCCTC 193 20 

TCTAAGCGAA CGACGGGATT CGAACCCGCG ACCCCCACCT TGGCAAGGTG GTGTTCTACC 19380 

ACTGAACTAC GTTCGCACTG TTTTCTTCTA TCTAAAAATG CCGGCTACAT GACTTGAACA 19440 

CGCGACCCTC TGATTACAAA TCAGATGCTC TACCAACTGA GCTAAGCCGG CTCATTTGTT 19500 

ATATCTTAAT GCGGGTTAAG GGACTTGAAC CCCCACGCCG TTAAGCGCCA GATCCTAAAT 19560 

CTGGTGCGTC TGCCAATTCC GCCAAACCCG CATATATGAC CCGTACTGGG CTCGAACCAG 19620 

TGACCCATTG ATTAAAAGTC AATTGCTCTA CCAACTGAGC TAACGAGTCT AAAATAACTT 19680 

CCCTTACCTT AAACGGTCCG ACGGAATCGA CCCGGTAC 19718 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4117 base pairs 
(8) TYPE: nucleic acid 
tC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
CCGTGGAAAA GTCTGGATAG TGAATGGTCT TCACACAATC ACCTGAAAGA AGCCTGAGAA 60 
TAATTATGGA GAGTAGCATT CTGAGAGGTG TTACCAGAAC CATATGACAG AGCTGTTTGA 120 
AGAGGGAATA TTGAGGAGAA AAATCCTGAC CCTACCACTT GGAGTTGGAA AGAGCTGACT 180 
GTTAGATCAT GGTTTATTAT CCACAACCTG TGGATAACTT TGTGAATAAC AGAAGTTGCT 240 
AAAGAAGGAG ATATATAACG ATGAAGAAAA TCAAACCGCA TGGACCGTTA CCAAGTCAGA 300 
CTCAGCTAGC TTATCTGGGA GATGAACTAG CACCTTTTAT CCACTTCGGT CCTAATACCT 360 
TTTATGACCA AGAATGGGGG ACTGGACAGG AGGATCCTGA GCGCTTTAAC CCGAGTCAGT 42 0 
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TGGATGCGCG 


TCAGTGGGTT 


CGTGTGCTCA 


AGGAAACGGG 


CTTCAAAAAG 


TTGATTTTGG 


480 


TGGTCAAGCA 


CCACGATGGC 


TTTGTCCTTT 


ATCCGACAGC 


TCACACAGAT 


TATTCGGTTA 


540 


AGGTCAGTCC 


TTGGAGGAGA 


GGAAAGGGCG 


ACTTCCTCCT 


TGAAGTATCC 


CAAGCTGCCA 


600 


CAGAGTTTGA 


TATGGATATG 


GGGGTCTACC 


TCTCACCGTG 


GGATGCCCAT 


AGTCCCCTCT 


660 


ATCATGTGGA 


CCGAGAAGCG 


GACTACAATG 


CCTATTATCT 


GGCTCAGTTG 


AAGGAAATCT 


720 


TATCAAATCC 


TAACTATGGG 


AATGCTGGTA 


AGTTCGCTGA 


GGTTTGGATG 


GATGGTGCCA 


780 


GAGGAGAGGG 


CGCGCAAAAG 


GTTAATTATG 


AATTTGAAAA 


ATGGTTTGAA 


ACCATTCGTG 


840 


ACCTGCAGGG 


CGATTGCTTG 


ATTTTTTCAA 


CAGAAGGCAC 


CAGTATCCGC 


TGCATTGGCA 


900 


ATGAACGAGG 


GTATGCAGGT 


GATCCACTGT 


GGCAAAAGGT 


GAATCCTGAT 


AAACTAGGAA 


960 


CAGAAGCAGA 


GCTGAACTAT 


CTTCAGCACG 


GGGATCCCTC 


GGGCACGATT 


TTTTCAATCG 


1020 


GAGAGGCAGA 


TGTTTCCATC 


CGTCCAGGCT 


GGTTCTACCA 


TGAGGATCAG 


GATCCTAAGT 


1080 


C^CTCGAGGA 


GTTGGTCGAA 


ATCTACTTTC 


ACTCAGTAGG 


GCGAGGAACT 


CCACTCTTCC 


1140 


TTAATATTCC 


GCCGAATCAA 


GCTGGGCTCT 


TTGATGCAAA 


GGATATTGAA 


CGACTTTATG 


1200 


AATTTGCGAC 


CTATCGCAAT 


GAGCTCTATA 


AAGAAGATTT 


GGCTCTGGGA 


GCTGAGGTAT 


1260 




TCTTTCCGCA 


GACTTTGCTT 


GTCGCCATTT 


GACAGACGGC 


CTTGAGACCA 


1320 




AAGCGATGCA 


GACTTGCCCA 


TCCAGTTAGA 


ACTCGACTTA 


CGTTCTCCTA 


1380 


AJLAPTTTTGA 


TGTAATTGAG 


TTAAGAGAAG 


ATTTGAAGCT 


AGGGCAACGA 


ATCGCTGCTT 


1440 


TTCATGTGCA 


AGTAGAGGTG 


GATGGTGTCT 


GGCAGGAGTT 


TGGTTCGGGT 


CATACTGTTG 


1500 


GTTACAAACG 


TCTCTTACGA 


GGAGCAGT7G 


TTGACGCACA 


GAAGATACGT 


GTAGTCATTA 


1560 


CAGAATCACA 


GGCTTTGCCT 


TTGTTGACCA 


AGATTTCCCT 


TTATAAAACT 


CCTGGATTAT 


1620 


CAAAAAAAGA 


AGTTGTTCAG 


GAACTAGCAT 


TTGCAGAAAA 


AAGCCTAGCT 


GTGGCAAACC 


1680 


GAGAAAATGC 


CTATTTTACA 


GTTAAGCGCA 


GAGAATtiTAU 


TGGTCCTTTA 


CAACCTAAGA 


1740 


TTTCGATTCA 


ACCGGGGACA 


GGTGTCCATG 


GTGTCGCCTA 


TCAGGATGAG 


ATTCAAGTCC 


1800 


TTGCGTTTCA 


AACTGGTGAG 


ACTGAAAAAA 


GTCTGACGCT 


ACCAACCTTC 


TATTTCGCAG 


1860 


GAGATAAAAC 


CTTGGATTTC 


TATCTGAACC 


TAACCGTGGA 


TCCTCAGCTT 


GTGGATCAAC 


1920 


TTCAAGTCCA 


AGTTTCATAA 


AAGAAGAACC 


TTTGCGCGAT 


GCAAAGGTTC 


TTTTGGTTAT 


1980 


TAGTGACTTG 


GTAACCAGCT 


GAGGGTGAAA 


GTTAGTTGTT 


CAGCTTTTAA GAGCTCTTGG 


2040 


TGTTGAATAG 


TTGATACGAG 


TGTTTTGTCC 


AGTCGGCATT 


CTTTGACAAA 


GTTAAAATGG 


2100 


TTGTGGTTTT 


GTTTAGTATG 


GATATCCAGC 


CATTTATCTT 


CTTTAGCGAG 


GTAGACTCGT 


2160 
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AGATGGTCAA 


AGAGAGGGAT 


TCCGAGGTCA 


TAGCTTGGTT 


TTCCTCGACA 


GGTTGCATAA 


2220 


AATCCGAGAG 


CTGACCAGAT 


GTACCAAGCA 


GAGAGACTAC 


CATTCTCTTC 


ATCTCCAGGA 


2280 


TAGGCTTCCC 


AACTTGGGTG 


AAAAGCTTTC 


TGACGGAGCG 


TCTTGATAAG 


AAGGGCAGTG 


2340 


TAGTCAGGGT 


AATCGCTGTA 


ACGGAAGAGA 


TAAGGAATGT 


GGAAACTAGG 


CTGGTTGGAA 


2400 


ATGGCTATTT 


GTCCAAAAGG 


AGCAGTAGCC 


ATCTCGCTCA 


TTTCGTGAAT 


TTCGTAACCA 


2460 


TAGCCTGTTG 


TTTCAAAGAG 


GGGAGCATCT 


TGACAGGCTT 


TCAAAAGATA 


GTTGCTAAAG 


2520 


GTTTCTTTTC 


CACCCATCAG 


TTGGATTAAG 


CCAGGGATGT 


CGTGGAGAAC 


CCCTAAAGTA 


2580 


GCTTGAATGG 


CAGAGCATTC 


AGCGTAGTCT 


CGCCCCCAAC 


TATAAGGAGA 


GAAGTCAGGG 


2640 


TGAAAGTTTC 


CTTGATTGTC 


TCGTGCTCGC 


ATGTAACCTC 


TCTCAGCGTC 


AAATAGCTGG 


270C 


CGGTAATTTT 


GTGAAGCAGC 


CTTGTAGGTT 


TCAGCGATTT 


CTATGTTCTC 


TAGTTTTTTG 


2760 


CCACACCTGG 


CGATACAAAA 


GTCACTATAG 


GCATAGTCTA 


GAGTATGGCT 


AACACTTTCG 


2820 


TGGTGGTCGG 


TAGAGAGGTA 


ACCTAGTTCT 


TGGTATTGGG 


CTAGTCCGTG 


GCGGCCATTG 


2880 


ATGCCGAGAG 


GGTCGGCTTT 


GCTGGCTGTT 


TCGAGCATGG 


CTTGGAAGAG 


TTCTCCTTCT 


2940 


AGGTCGGGGG 


TCATGTCCTT 


GCAGGCGCTA 


TCTGCGATAA 


TACCGTCTAA 


AAGTGTACCT 


3000 


GGCATCATAC 


CCCGTTCATC 


TGGAGCCAGC 


CATTTTGCAA 


GGAAACCAGT 


ATCGCGGTAG 


3060 


CTATTGAGGA 


AACCTTCTAA 


AAAGCGTTGA 


TAGTGCTCCG 


GTATGATAAG 


GGCAAAGAGG 


3120 


GGGAAGGTGG 


TGCGGAAGGT 


ATCCCAGAAA 


CCATTGTTGC 


TAAAGAGGAC 


ACCAGGCTTG 


3180 


ACAGTACCAG 


TAGCCAGATC 


CATGTGGATG 


GCTTCCCCTG 


ATTCATTAAT 


CTCATAAAAA 


3240 


GTCTGTGGGA 


AGAGGAAGAG 


TCTGTAGAGG 


CAGTGGTCAA 


AGAAGGTTCG 


GTCAGCCTCT 


3300 


CCTGTCTCTA 


TAATGTCAAA 


ACGATGGAGG 


AGATTTTCCC 


AATCCACTTG 


GGCACTTCAT 


3360 


TTACAGCTAT 


CAAAATCTTC 


TTGAGGTAGA 


TTGATTAGAG 


CTTGAGAAGG 


AGAGATGAAA 


3420 


GAAGTGGCTA 


GTTGCATCTC 


GGTTTGACTA 


CTTGCTAAGT 


CAATTCGCCA 


GTCTCCAGCT 


3480 


TCTTGGCTGA 


TAGCAAGAAT 


ATCCGTGTTC 


ATTTGCAGGG 


CAGTGAACAT 


CGTTAGCGAA 


3540 


TTTTTGTTAG 


TTTCAGTTTT 


ACCTTCTTGT 


CGCAGGGCAA 


GAGTCCGCTT 


ATCTACTTGC . 


3600 


TCTACTGTCA 


GTTCATCTGC 


TGCGTGAAGA 


TAGAGGGAGA 


GGGCTTTGCC 


TTGCTTTTGA 


3660 


TTCAAACGAA 


TAGAAGCACC 


ATAGCAAGTC 


GGTGTGAGCT 


GGGTTTCAAT 


CTGATAACGC 


3720 


AGAGAAAAGA 


GCTTCAAATA 


GTGAGGCTGG 


AAGCAAGCTT 


TATCTATATC 


ATAAGAAGAC 


3780 


TGGCGGTGAA 


AGAGGCTGTC 


TCCCCCCAGT 


TGACTGGTGA 


CAGGTGTCAG 


AAGGAGCCAA 


3840 


GAGTAGTCCC 


CAATCCAAGG 


ACTGGGCTGG 


TGAGTTAATC 


GAATCCCCTG 


AAAGATAGGC 


3900 


AGATGTGGAT 


CAAAAAACCA 


AGATCCATCC 


TGGTCACTGG 


TCTGCGGCAC 


AAAGTAATTC 


3960 
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ATCCCAAAAG GCACGCCTGT GTATGGCAGG G7ATTTCCCC GAGAAAAGGC ATGCTTGTTG 4020 
GTAGTTCCAA AACGGGTATC GATGGTATCA AGTAGTGGTT TCATAGTCTT TCCTTTAGCT 4080 
GTTTTTCTAC ATTATATCAG TAATAGAGGG CCTTTAG 4117 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i} SEQUENCE CHARACTERISTICS : 

(A) LENGTH: -2-72-7- base- pairs . 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 



CTGGTTCAAT 


TATTATTCAC 


TCTAAGTAGT 


CATATGTTCT 


TTATTTATGT 


IjAU 1*111 At. 


ou 


CTTTTAAAGG 


ATCTTGTTAG 


ATGGGAtjAAL» 


O n l V AAAA(j 


I O At. ALi A 1 tj A 


L nn 1 ALnAun 


i. £ u 


AAAGTTCGTT 


TATTAGTAGC 


<_"lTi'T'l~l'AGC 


ATTtjTU AT ALi 


Lit. i At. a ret. 1 


GAGTTCTTTC 




TTTATTAGCC 


TGTATCATTT 


GTGGCAAGAA 


GCGCTTAGAG 


GATTATTATG 


AAATCAAGAG 


240 


TAAAGGAAAC 


GAGTATGGAT 


AAAATTGTGG 


TTCAAGGTGG 


CGATAATCGT 


CTGGTAGGAA 


300 


GCGTGACGAT 


CGAGGGAGCA 


AAAAATGCAG 


TCTTACCCTT 


GTTGGCAGCG 


ACTATTCTAG 


360 


CAAGTGAAGG 


AAAGACCGTC 


TTGCAGAATG 


TTCCGATTTT 


GTCGGATGTC 


TTTATTATGA 


420 


ATCAGGTAGT 


TGGTGGTTTG 


AATGCCAAGG 


TTGACTTTGA 


TCAGGAAGCT 


CATCTTGTCA 


480 


AGGTGGATGC 


TACTGGCGAC 


ATCACTCAGG 


AAGCCCCTTA 


CAAGTATGTC 


AGCAAGATGC 


540 


GCGCCTCCAT 


CGTTGTATTA 


GGGCCAATCC 


TTGCCCGTGT 


GCGTCATGCC 


AAGGTATCCA 


600 


TGCCAGGTGG 


TTGTACGATT 


GGTAGCCGTC 


CTATTGATCT 


TCATTTGAAA 


GGTCTGCAAG 


660 


CTATGGGGGT 


TAAGATTAGT 


CAGACAGCTG 


GTTACATCGA 


AGCCAAGGCA 


GAACGCTTGC 


720 


ATCGTGCTCA 


TATCTATATG 


GACTTTCCAA 


GTGTTCGTGC 


AACGCAGAAC 


TTGATGATGG 


780 


CAGCGACTCT 


GCCTGATGGG 


GTGACAGTGA 


TTGAGAATGC 


TGCGCGTGAG 


CCTGAGATTG 


840 


TTGACTTAGC 


CATTCTCCTT 


AATGAAATGG 


GACCCAACGT 


CAAAGGTGCT 


GGTACAGAGA 


900 


CTATAACCAT 


TACTGGTGTT 


GAGAAACTTC 


ATGCTACGAC 


TCACAATGTA 


GTCCAAGACC 


960 


GTATCGAAGC 


AGGAACCTTT 


ATGGTAGCTG 


CTGCCATGAC 


TGGTGGTGAT 


GTCTTGATTC 


1020 


GAGACGCTGT 


CTGGGAGCAC 


AACCGTCCCT 


TGATTGCCAA 


GTTACTTGAA 


ATGGGTGTTG 


1080 


AAGTAATTGA 


AGAAGACGAA 


GGAATTCGTG 


TTCGTTCTCA 


ACTAGAAAAT 


CTAAAAGCTG 


U40 


TTCATCTGAA 


AACCTTGCCC 


CACCCAGGAT 


TTCCAACAGA 


TATGCAGGCT 


CAATTTACAG 


L200 
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CCTTCATGAC AGTTGCAAAA CGCGAATCAA CCATCGTCGA GACAGTTTTC GAAAATCGTT 
TCCAACACCT AGAAGAGATC CGCCCCATCC GCTTGCATTC TGAGATTATC CGTGATACAG 
CTCGTATTGT TGGTGGACAG CCTTTGCAGG GAGCAGAAGT TCTTTCAACT GACCTTCGTG 
CCAGTGCGGC CTTGATTTTC ACAGGTTTGG TAGCACAGGG AGAAACTGTC GTCGGTAAAT 
TGGTTCACTT GGATAGAGGT TACTACGGTT TCCATGAGAA GTTGGCGCAG CTAGGTGCTA 
AGATTCAGCG GATTGAGGCA AGTGATGAAG ATGAATAAGA AATCAAGCTA CGTAGTCAAG 
CGTTTACTTT TAGTCATCAT AGTACTGATT TTAGGTACTC TGGCTCTAGG AATCGGTTTA 
ATGGTAGGTT ATGGAATCTT GGGCAACGCT CAAGATCCAT GGGCTATCCT GTCTCCAGCA 
AAATGGCAGG AATTGATTCA TAAATTTACA GGAAATTAGG CTGGAGAACC AGCCTTTTTC 
TAAAGATAAG GAGAAATATG AACAAAAAAA CAACACACAC ACTAATCGGA CTGCTAGTGT 
TATTGCTTTT GTCTACAGGG AGCTATTATA TCAAGCAGAT GCCGTCGGCA CCTAATAGTC 
CCAAAACCAA TCTTACTCAG AAAAAACAAG CGTCTGAAGC TCCTAGTCAA GCATTGGCAG 
AGACTCTCTT AACAGACGCA CTCAAGAGTC AAATAAAGCG GAGTCTGGAG TGGAATGGCT 
CAGGTGCTTT TATCGTCAAT GGTAATAAAA CAAATCTAGA TGCCAAGGTT TCAAGTAAGC 
CCTACGCTGA CAATAAAACA AAGACACTGG CCAAGGAAAC TGTTCCAACC GTACCTAATG 
CCCTCTTGTC TAAGGCCACT CGTCAGTACA AGAATCGTAA AGAAACTGCG AATGGTTCAA 
CTTCTTGGAC TCCTCCAGCT TGGCATCAGG TCAAGAATCT AAAGGGCTCT TATACCCATG 
CAGTCGATAG ACGTCATTTC TTAGGCTATG CCTTAATCGG TGGTTTGGAT GCTTTTGATG 
CCTCAACAAG CAATCCTAAA AACATTGCTC TTCAGACAGC CTGGGCAAAT CAGGCACAAG 
CCGAGTATTC GACTGGTCAA AACTACTATG AAAGCAAGGT GCGTAAACCC TTGGACCAAA 
ACAAGCGTGT CCGTTACCGT GTAACCCTTT ACTACGCTTC AAACGACGAT TTAGTTCCCT 
CAGCTTCACA CATTGAACCC AAGTCTTCGG ATGGAGAATT GGAATTCAAT GTTCTAGTTC 
CCAATGTTCA AAAGGGACTT CAACTGGATT ACCGAACTGG AGAAGTAACT GTAACTCAGT 
AAAAGATACG CCTACACTCC TATGTCACTT ATCGATGTAG GAGTTCTTTT TACTAGTTTA 
AGCAGGACTA AGACAGGTAC TAAGACAAAA TAGCAACTTC TAAAACTAAC TTCCAGTTTT 
GGGAGAGAGA TGGAAGTTAC TTTGAGA 
(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

I860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 
2700 
2727 
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{xi> SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

TTTTTTGTAG ATTTAAGTGG GGTGCAATTC CTAAAAAATA AAAAACAATT TTTGAAAATT 60 

ATGTTAGCAG GAATTGCTTC AAATTCGATT TTATCACTTA CAGGTTTACT TGTTTTATTG 120 

TTCACATCGT ATAAATTGCT TGGACTCTTA TTTTTTATCA TTAACTTAGG TATGATTTTT 180 

ATTAATTCAA TTCCTTTTTT TCAGTATCAT AGTGGTATTA TTTTAAGATA CTTGAATTCT 24 0 

AACAATAATA ACTTGAATTT TCAATATATA GTTCAACTTT TAATACCATT TGTTATTATT 300 

TATTTTCCTT TGAGTCAACT ATTACAGTTT TTGACACCCA ATATTATTGT TCGTAGTATA 3 60 

GGAGGGGTGG TTGTTTCTAT ACTGCTTTCT AT ATT AT AT A TGATAGGAAG GACGAAATAT 42 0 

GTTCTACGTA AATAGTTATG TTTTTGCTTA TAAAAAAGAA GGTATAATGT ATTTACGTGG 4 80 

TCGGAGTATG CGGGAAATAG CTATAGAACC TCAAATTTCC CAAGAATTTA TCAACGATCT 540 

ATTTAATAGT TGTAAGGAAC TATTAGAGAT AGAAGAAGTA TTAGGCAGTA AACTAACATT 600 

TGAACTATAA ATGAACAAAT TTTAATTTCG GATGAGATAG ATATTGATAG TAGATATTCT 660 

AGAACTAAAG GTTACTATTC GTTATTTTAT AATGAAGAGT ATAATAAAAT ACAGAATAAA 7 20 

ACAGTATTAG TATTAGGAGC AGGAGTCTTA GGATGTTATA TATCTCTAAG TCTAAGTATG 780 

TATGGAGTGA GGAAACTTAT TGTCGCTGAT TACGATATAA TAGAACCATC AAATTTAAAT 840 

AGGCAAATTC TTTATACAGA GTCGGATGTT GGTAAGGAGA AGATTAATGT TCTTTCTGAA 900 

AAAATACACA AGTATAATTC AGATGTTCAG GTAGTACCTA TTTCTATTAA AGTTTCTTCA 960 

GTAGAAGAAT TAGAAAAAAT TGTTGCGGAA TATGGGACTA TAGATTTTAT CGTTAAAGCA 1020 

ATTGATACGC CCATTGATAT TATAAAAATT GTCAATCAAT TTGCTGTATC GCATAAGATA 1080 

TCCTACATAT CAGGAGGGTT TAATGGATGC TATCTTATTA TTGATAATAT ATATATCCCT 1140 

ACCATCGGTT CTTCCTTTCC TTGTCGGAAT ATAAACAAAG ATATAAATAA GTACACTTTA 1200 

TCTGATAAGA CAAAGTGGCC GACTACACCA GAGATGCCTG CTATTTTGGG AGGGATAATG 1260 

ACTAATTTAA TAATTAAAAT ATTTCTGGGA TGTTATAATG AAATCCTAAT AGATAACGCT 13 20 

TACGTTTATA ATATGAGAAA TCATGCTCTA AGTCAAGAAA AATATGTTCT GGAAAACGGA 13 80 

GAATGTCCAA TTTCTAAAAA AATAATAAAG TGAAAGATAA CAATATTACA GCGAAAACAT 1440 

TTATTCGTTC AGTTTGTTTT TGCTTATTAT CAGGAGGAGT AGCTTTTTTA TCTGCTATTG 1500 

CGCAGTTCAC TGTTATAGAA ACACAATTAA TAGTATTGTT CTTGGCTATT ATTTTTGCTA 1560 

TATATTATGC TTACTACAAT AAAAATATTC AAACATCATT GGAAAATATA GTATGGCTTT 1620 
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TTTCATCCTT TCACATTTTA TTTTTGCTTG TTAATTTTAG AACATTTATT CAGTTACCAG 1680 

TGGATATTTT TATTGGTATG ATAATATTTT TAATGCTGTG GATATTTATT ATGTTAGGTA 17 40 

TAGTGTGTCT TAGTTATTAT ATAACTTTAT TATTTAGCAA GGAGGCTTAG TATGTTTAAA 1800 

AAAATAGGTA TAATGAGCAT TTGCATATAT ATAATTATTT TATACTGCTT CAGAATGTAT 1860 

CGTATTATCA ATAATATTGA AACAATCTTG CTAACGGTTA TATGCTTAAT GTTATTGTTT 1920 

TTTTTAAGAC GTTTATTTGA TAAAGATAAG TAAATAGATG TTAAGTAAAA ATGTAGAATA 1980 

TAAACGAGGT GCAATGAGTA TGATTGAAGT TAGCCATTTA TCAAAAAGTT TTGGTGATAA 2040 

AATAGCTTTA AATAATATAA GCTTCACTGT TAAAGAAGGT TAGATTTTTG GATTTTTAGA 2100 

ACCATCTGGT TCTGGAAAGA CCACAACGAT TAATATTCTG ACTGGGCAGT TCCTTGCCCA 2160 

TAAAGGACAA TCTATTATTT TGGGACAAAA ATCTCAAAAT TTAACAAGCG GTGAATTAAA 2220 

GAGAATTGCA TTGGTTAGCG ATACAAGTGG ATTTTATGAG AAAATGTCTC TCTATAACAA 2280 

TCTTCTTTTT TATAGTAAAT TTTATAATAT TAGTAAATCA CGTGTTGATA ATTTGTTAAA 2340 

GCGAGTAGGA TTATATGATA GTCGCAAGAT GGTAGCAGGA AAATTATCCA CTGGAATGAG 2400 

GCAACGAATG CTTTTAGCAC GAGCTCTTAT CAACAACCCC GCTGTACTCT TTCTGGATGA 24 60 

ACCGACCTCA GGTCTAGATC CCACAACTTC TCGAACAATT CATGAGTTAA TTTTAGAATT 2 520 

GAAAACAGCA GGGACAACGA TTTTTCTAAC GACTCATGAT ATGAATGAAG CAACTCTTTT 2580 

ATGTGATTAT GTTGCCTTAT TAAATAAAGG GAAATTAGTT GAGCAAGGAG CTCCTTCTGA 2640 

ACTCATTCAA AGATATAATA AAGATAAAAA GATTAAGGTT ACAGATTATA ATGGGAATCA 2700 

GATAACTTTT GATTTTACAT CACTAGAACA GGTATCTCAG ACTGATCTGG AAAATATTTT 27 60 

TTCAATTCAT TCATGTGAGC CTACTTTAGA AGATATTTTT ATCACATTAA CAGGAGGAAA 2820 

GCTAAATGCT TAAACGGTTT CTGGCTTTGG TATGGTTGCG TTGTCAAATC ATCCTTTCCA 2880 

ATAAGAGTAT TTTATTGCAA GTTTTAGTGC CTTTTGCTTT CACATATTTT TATAAATATC 2940 

TTATGGAAAC ACAGGGGAAG GTCAACGATC AACAGGCATT AGTTCTTTTG ATGATGTGTT 3000 

TACCTT7TTC TT TTTCTTTG GCTGTTGGAA GTCCTATAAC TATTATCTTG TCTGAAGAAA 3060 

AAGAAAAGTA CAATTTACAA ACTCTTCTGT TGAGTGGTGT TAAAGGCTCC GAATACATTT 3120 

TATCAACTAT GTTTCTTCCT TTTTTGCTAA CTTTTGTGAT TATGGGAACT ACTCCTCTTA 3180 

TTTTAGGAGT TACAATTGTA CATACTTTTA ATTATATTAC AATCGTTCTT CTAACCTCTT 3240 

TATCCATCAT 7TTATTCTAT TTATTGATAG GTTTAACCGC GAAGAGCCAA GTAGTAGCTC 3 300 

AGGTTATCAG TCTTCCTGCT ATGATTTTAG ' r rCiCTTTCTT ACCGATGCTA TCTGGTTTGG 3 3 60 

ATAAGACAGT TCCGAAGATA ACAGATTATA CTTTTATCGC ACTATTTACT AAGTTTTTCA 3420 
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CAAAATGCGA GGAATTTTCA TGGAATAAAA CTCTAATTCC TAATCTAACA CTACTTATTT 3480 

GGATTGTTCT TCTATTAACT TTAATTACGA TAACTATTAG GAAAAAGAAA ATTTCTTAAT 3540 

TGAGTTATTT TAATGATTAT AAACACAAGT GGGAACGAAA AAATGAACTG ATCTTTTTGA 3600 

CAGCAATTCT ACAGAATAGT CTTATTGCTA TATTTTGATT TGAGTGTACG AAAAAAGAAA 3660 

AATAACAATA GTGCTCATAC TAATTGCAGA AGTTTTGGGT GATAAGATAA CTGATAAATT 3720 

GCAA7AAAAA ATGC AACATT TTTAAATCTC CTCT ATAAG7 GCTTCAAAAA GTGCTTCAAA 3780 

ACCTGTCTTG TAATCCAAGT ATTTTTGGGG ACGGTGATTA ATAAGCTAGC AAAGCATCAT 3840 

TAAGGATTTT TTCGGTAATT GTTGCCAAAT CGGTTTAAGA AAATACTCAC GAAGAAGTCC 3900 

ATTCCCATTC TCATTACTTC CCCTTTGCCA AGATGAATAG GCATCCGCAA AATAAAACAG 3960 

AATTCCCATT TGTTCAATTA AAGGGTAACA AGCAAACTCT TTTTCTCTGT CCGAAGTGAA 4020 

AGTCTTTAAC TATTCTTTTG GAAAGAGTCT TGTCAGGTCT TCAATAGCAG TCAACATGGA 4080 

TTTAGCTGTT TTTACTTGAC AAGTGCTAGT AGAAATAATA GAATAGTAAA AAACCTTTAA 4140 

AGCAGTCCAG AGAGGCAGCT AAGGTTAGAC GGTGAAAGGG TGGAGACTAC CCATTTTTCC 4200 

TGGAACCTTG CTGTTGGCAG CTTCCTTTTT TCGTGGCTTC TGTTGGCCAG ACTCTCTCAC 4260 

TAGTAAACGT AAAAGGAGAA ACCTATGCGA GAACATCGTC CAATCATTCC TCTTGATTTT 4320 

CCTAGTTTTG AGGCGGTCAA GGAATTTTTA GCTCTTTTCC CAGCAGAACA AAGCCTTTAT 4380 

CTCAAGGTAG GGATGGAGCT TTATTACGCA GCGGCGCCTG AGATTGTCTC CTACTTAAAA 4440 

GGTTTGGGTC ATAGTGTCTT TTTCGATCTC AAACTTCATG ACATTCCTAA TACAGTCAAG 4500 

TCAGCCATGA AGATCTTGTC TCAGCTTGGT GTCGATATGA CTAATCTCCA TGCGGCTGGT 4S60 

CGTGTAGAGA TGATGAAGGC GGCGCGTGAA GCTCTTGCGA GTCAAGCCAA ATTGATCGCT 4620 

GTAACTCAGC TCACATCAAC GTCAGAAGCT CACATGCAGG ACTTTCAAAA TATCCAAACC 4680 

AGTCTGCAAG AGTCTGTGAT TCACTATCCC AAGAAGACAG CTGAAGCTGG CTTGGATGGT 4740 

GTTGTTTGCT CGGCTCAGGA AGTACAAGTC ATCAAGCAGG CTACCAATCC AGATTTTATC 4800 

TGTCTGACAC CAGGGATTCG TCCAGCTGGT GTTGCAGTTG GAGATCAAAA ACGAGTCATG 4860 

ACACCTGCTG ATGCCTATCA AATCGGCAGT CACTATATCG TAGTGGGACG TCCCATTACC 4 920 

CAAGCTGAGG ATCCTGTTGC AGCTTATCAT GCCATCAAGG ATGAATGGAC ACAGGACTGG 4980 

AATTAAAGAA CTAGATTAGA AAAATAAAAG GAGAATACCA TGACACTTGC TAAAGATATC 5040 

GCTAGCCACC TCTTGAAAAT CCAAGCCGTT TACCTCAAAC CAGAGGAACC CTTCACTTGG 5100 

GCATCTGCTA TCAACTCACC GATTTACACT GATAATCGTG TGACACTAGC CTATCCAGAA .5160 
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ACTCGTACCC TAATTGAAAA TGGTTTTGTG GAAGCTATCA AAGAAGCCTT TCCTGAAGTA 5220 

GAAGTGATTG CAGGAACTGC AACAGCAGGG ATTCCACACG GAGCCATTAT TGCTGATAAG 52 80 

ATGGACTTGC CTTTTGCCTA CATCCGTAGT AAACCAAAAG ACCACGGAGC TGGTAATCAA 5340 

ATCGAAGGTC GCGTAGCTCA AGGTCAAAAA ATGGTAGTGG TTGAAGACCT TATTTCAACG 5400 

GGTGGTTCAG TTCTTGAAGC TGTAGCAGCA GCCAAGCGAG AAGGAGCAGA TGTACTTGCA 5460 

GTTGTAGCGA TTTTCAGCTA CCAATTGCCA AAAGCAGATA AGAACTTTGC AGATGCTGGT 5520 

GTTAAACTTG TGACGCTTTC AAACTATAGC GAGCTTATCC ATCTAGCCCA AGAAGAAGGT 5580 

7ACATCACGC CAGAGGGCCT TCATCTTCTA AAACGCTTTA AAGAAGACCA AGAAAATTGG 5640 

CAAGAAGGTT AGGTCACTAA GATAAAGAGA GACGACGCTA CCGAGTCTCT TTTACCATTT 5700 

TATTTAAAAT ATGACAG 571.7 
(2) INFORMATION FOR SEQ ID NO: 103: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5558 base pairs 

( B ) TYPE: nucleic acid 

(C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

CCTCCACTTT CTAAAATGAA ATCTTCCCAC CTGGATCAAG CCCTTCATCA GCATTTTTCA 60 

GAAGAAGAAT TAGCTCCTCA CTTTCATGTC CTTCTATGGA CTTTTTTTAC AATGGCATTG 120 

CTATCACACC CAATACCTAT CTAACCGCCT GGTTCGTAAA CTTTATTGCA GCTCTTCCTC 180 

TAAATTTCCT AATTGTTGAA CCAATTGCCC GTTTTATACT AAGTTCTTTT CAGAAACCAT 240 

TTACTGGGCA AGAAGTTGAA GATTTTCAAG ATGATGATGA AATCCCAACT ATTATCTAAG 300 

CCAGTTCTGT AAACTACTAA TATTTGAAAT CCACTTCCTT TTAGGGTGCA ATGGTTATAA 360 

ATGAATTTTT GAGAGGATCA GAATGAAAAA ACTAGCAACC CTTCTTTTAC TCTCTACTv;T 420 

AGCCCTAGCT GGGTGTAGCA GCGTCCAACG CAGTCTGCGT GGTGATGATT ATGTTGATTC 480 

CAGTCTTGCT GCTGAAGAAA GTTCCAAAGT AGCTGCCCAA TCTGCCAAGG AGTTAAACGA 540 

TGCTTTAACA AACGAAAACG CCAATTTCCC ACAACTATCT AAGGAAGTTG CTGAAGATGA 600 

AGCCGAAGTG ATTTTCCACA CAAGCCAAGG TGATATTCGC ATTAAACTCT TCCCTAAACT 660 

CGCTCCTCTA CCCCTTGAAA ATTTCCTCAC TCACGCCAAA GAAGGCTACT ATAACGGTAT 720 

TACCTTCCAC CGTGTCATCG ATGGCTTTAT GGTCCAAACT GGAGATCCAA AAGGGGACGG 78C 

TACAGGTGGT CAGTCCATCT GGCATGACAA GGATAAGACT AAAGACAAAG GAACTGGTTT 840 
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CAAGAACGAG ATTACTCCTT 
TGGTCAACCA AACACCAATG 
TTCTAAACTC CCTACAAGCA 
AAACCCTAGT CTAGATGGCA 
TGTGGATAAG ATTGCTAAGG 
AATCGACAGC ATCGAAGTGG 
CAGTATCCAC ATTCGGTACT 
TCCCATATTT GGTCTATCCA 
GATCCCCTAT ATTCTTTGAG 
TTGGTCGGCT CGTCCAGCAC 
ACCTTGGCTT GCTCTCCCCC 
AAACCACAAC GGGCAAGGGC 
CAGACAGCTT CAAGAGGAGT 
AGTTCTAAAT AATCTCCACG 
CTCTTCAAGA GAGTTGTTTT 
CGTTCGAAGG TAAGATTTAA 
TTGGCTTGGA AGATAAAGCG 
GGTTTCTCAC TTTGGAGTTC 
GCCATATTAC GAGTTGCAAC 
ATCTCTTTCT GCTGGCGTTC 
TCTTGGAACT GGTAGTAGTC 
ACAATATTAA TAACGTCATT 
TCATAGTTTT GGAGATAGCG 
GGCTCGTCCA ACAGCAAGAT 
GTTCTTTGCC CACCTGACAA 
AGAGCACGCG CTACTTCG7C 
CGCTCTTGAA GTTCTCCTAC 
ATTTTCATAT AGAGGTCATT 
CGGAGAACAT CACGCACCGA 
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ATTTCTATAA CATCCGTGGT GCTCTTGCTA TGGCTAATAC 900 

GCAGCCAGTT CTTCATCAAC CAAAACTCTA CAGATACCTC 960 

AGTATCCACA GAAAATTATT GAAGCCTACA AAGAACGTGG 1020 

AACACCCAGT CTTTGGTCAA GTGATTGACG GTATGGATGT 1080 

CCGAAAAAGA TGAAAAAGAC AAGCCAACTA CTGCTATCAC 1140 

TGAAAGACTA CGATTTTAAA TCTTAAAAAC CAAAAAAATA 1200 

GTATTTCTTT TACTCTCAT7 CTTAAGTTAA ATTATTAAAA 12 50 

GCCTTCATAA AAGTCTGGCT CGTGGCAGAC CATAAGGATA 1320 

AGCGCGTTTG AGCTCATCCT TTGCATCCAC ATCCAAATGG 1380 

TAAAACGTTG TTTTCACGAT TCATCAAGAG ACAGAAACGA 1440 

TGATAATACT TGAATCTGGC TTTCAATATG TTTCCTTGTC 1500 

TGCACGGACT TCTGCTTGAT TAAGGGCAGG AAAGGCATTC 1560 

TTGGCGATTA CCGCCTTCTA CTTCCTGCTC AAAATAACCA 1620 

CTCCACTTCC CCAGCGATTG GCGAGATAAT GCCCAAGAGA 1680 

TCCAATACCA TTAGCACCAA TAATCGCAAC CTTTTGATTG 1740 

AGGCTTAGTA AGAGGACGGT CGTAACCAAT TTGCAACTTC 1800 

CCCTGGTGTA CGAGCTGGTT TGAAATCAAA GGATGGTTTT 1860 

GATAATATCC ATCTTATCCA ATTTCTTTTG ACGAGACATA 1920 

ACGGGCTTTA TTACGAGCCA CAAAGTCCTT GAGGTCTGCA 1980 

GTAGGCTGCC TCTAGCTGAG ATTTCTTCAT AGCATAAACT 2040 

ACCAGAGTAA CGCGTCAGCT GTTCATTTTC CACATGATAG 2100 

GAGGAATGGA ATATCGTGCG AAATGAGAAC AAAGGCATTC 2160 

CTTGAGCCAA TCAATATGCT CAGCATCCAA GTACTTGGTC 2220 

ATCAGGCTTT TCAAGGAGAA GTTTTGCCAA AAGCACCTTG 2280 

AGAAGTTACA TCCGTATCCA TGCCAAAGTC CATAACACCA 2 340 

AATCTTAGCA TCCAAGGTAT AGAAATCACG ACTCTCCAGA 2400 

TTCTTCCATG AGAGCATCAA CATCCGCGCC GTCTTCAGCC 2460 

GATACGAGCT TCAGCTTTGA AAAGCTCATC AAAAGCCGTA 2520 

CTGTCTTTCA GCAAGCACAC AGTGCTGATC CAAGTAACCA 2580 
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GCCGTCACAT ATTTGGACCA CTCAACCTTT CCTTCATCTG GCAGCATTTT ACCACTCACG 
ATACTCATAA AGGTTCATTT TCCTTCACCA TTGGCACCGA CCAGGCCGAT ATGTTCTCCC 
TTGAGGAGAC GGAAGGACAC ATCTTCAAAA ATTGCACGCT CACCAAAACC GTGACTCAGA 
TTTTTAACTT CTAAAATACT CATTTTAATT CCTTACCTTG TTTTTATGTA ATCGTTTATA 
AAGGAGCCAA GCCAGATAGC CACCCAAAGT GTTGGTCCAC AAATCATCAA TCTCAAAGAC 
GCGATTGAAA TCAAAGAAAA AGTCCAAGAT TAATTGCGTA CACTCGATTC CAACACTCAC 
AAGAAAACTA AAAAGAAGGA CCTTTTTTGT TTTCCGCAAA TTTGGAAATA GATAAAGGAG 
TTGGAAAATC AGAGGAAAAA ACAAGAAGAC ATTGAGGATA TTTTGTAAAA AAATCCAACA 
TAATTGTCCA ATGTCACTCA CTTCCCCCAG TTTCCAGAGA GAATTGAAAG GAGTCAAAAG 
AAAAACCAGG CGTCCAAGAT GCTGAATACC TGGAGTTCCC ACTCCCACGG TAGATTGTTC 
TTGAGGAGTA AAGCAAAAAC AGACAATGCA AATGCTATAG AAAATGACTC CCCAGACCAA 
AATATGATTA TAAGTCTTCT TCATCATTAA GGATTTACCG CTGCGACTGC CTTCTGGCGG 
TCACGTTTCA TTGTGTTAGA GCCCAATTGT CCACAAGCTG CGTCAATATC TGTACCATGC 
TCTTGACGAA CCACACAGTT GACCCCTTTT TTCTTAAGCG TATCATAGAA AGCCAACACG 
CACTCTTTGG GACTACGGCT ATATTGGTCA TGCTCACTAA CTGGGTTATA AGGAATCAAG 
TTTACATAAG ACAATTTCTT GATGTTCTTG AGCAATTCAG TCAATTCCAA GGCTTGTTCT 
ACACCGTCGT TGACTTCATT AAGCATGATA TATTCAAAGG TTACACGACG GTTTGTTGTC 
TCAATGTAGT ATTCAATACC ACCAAAGAGT TTTTCAATCG GAAAGGCACG CTTAATCTTC 
ATGATACTTG AACGAAGTTC ATTGTTAGGT GCGTGAAGAG ACACGGCAAG ATTGACCTGA 
ACCCCTTCAT CAGCAAAGTC ACGAATTTTA TGAGCCAAAC CTGAGGTTGA AACCGTGATG 
TGACGAGCAC CGATAGCCAT TCCTTTATCA TCATTGATAG TACGAAAGAA ATTCAAGACA 
TTGTTGTAAT TATCAAAGGG CTCACCGATT CCCATGACAA CGATATGGCT GATGCGTTCA 
TCCTGACCAC GCTCATCAAA GTATTTCTGA ACCAGCATGA TTTGCGCTAC GATTTCACCG 
TTATTGAGGT CACGTTGCTT CTTAATCAAA CCAGACGCAC AGAAGGTACA ACCGATATTA 
CAGCCGACCT GAGTGGTCAC ACAGACAGAT AAACCATAGT GTTGACGCAT GAGTACAGTC 
TCAATTAACA TACCGTCGGG CAATTCAAAG AGATATTTGA CTGTACCATC AGCAGACTCT ' 
TGCACAATAC GTTGTTTCAA GGGATTGACC ACAAACTGGT CATTGAGCTT AGCAATCAAA 
TCCTTGGAAA GGTTGGTCAT TTCTTCAAAT GACTGCACAC GTTTACGGTA GAGCCATTCC 
CAGATTTGAT CTGCACGGAA TTTCTTTTCT CCCTGCTCCA A7ACCCATTC CTGCATGGTT 
TGATGTACCA AACTATGAAT TGAGGGTTTC ATTTCTTCTC CTTATTCTCT ACTCACTTCT 4380 
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3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 



WO 98/18931 



PCT/US97/19588 



773 

GACCAATGAC AAAATGACGT TGTCCCTTGT CGTCTTTCTG ACGACGTCTA TTTTTCTTAT 4 440 

CTCCATTCGA CTTTCGTTTA CTTTCAC7CC GTTTCTTTCC TTTTCTACAA GGTGTTTCTT 4 500 

C7TCCGTCTT ACGCATTTTC TTCTCAAATC ATGCTCGCTT AGGGGCTTCA TTTTCTAAGA 4 560 

CAAAATAGGC ACAACCATAA CTACAATACT CTAAAAGGTA GTCTTGTAAA CGACTGATTT 4 620 

TTTCAAGTTT TTCTTCTGTT CGGTCATCCT TGTAAAAACC TCGTACGCGA ACCTGTTCGT 4 630 

TGCTCCAGTC CCCCACGATA TAATCAAACT TGGTTAATAC TTCTGAAAAA CCCTGATTAA 4 740 

AAGTCGTCAC ATCAAAGGCA TCCTTGATAT TTTCAACCAA GGAAAAACCT ATCCCTTCCG 4 300 

T7TCGACCTT GTCCCCGTGT AAATGGAACT CCGGACCAGG AAACTTGTTA TAGTTGTATA 4 360 

ATTCAGGTGC AATTTCTTTT CGCATAGATA TCCTTTTTTC ACGATTACTT AATACTTTAT 4 92C 

TCTACCATAA TTTCTAGCAG TTAGCACGTT TCTCATAAAA ATGAAAAAAG TCTGACGATT 4 980 

TTGTCAGACC AGAATCTTAT AACCTAAAAA GACAAGAACA ATTCTTCCCT CCAACTATCA 5040 

TTATTTACCA GCTGCGTACA ATTCATCTAC TTTATTCCAG TTCATTACTC AAAAGAAAGC 5 ZOO 

TTTGATGTAG TCAGGACGCA CGTTGCGGTA TTTCACCTAC TAAGCATGTT CCCAAACGTC 5160 

CAAGCCCAAG ATTGGTTTTT TACCTTCTGA GATTGGTGTG TCTTGGTTTG CTCTTGAAGT 5220 

CACTTCAAGT TTCCCTTCTT TGTTGACAAC CAACCATGCC CAACCTGAAC CAAAACGAGT 52 80 

TGTTGCTGCT GCAGTGAAGG CTGCTTGGAA TTCTTCAAAT GAACCAAATG TTGCATCGAT 5340 

TGCTCCTGCC AGTTCTGCTG AAGGAGCTGT TTTCTCGGGA GTCATCAATT CCCAGAAAAC 5400 

AGCGTGGTTC AAGTGTCCGC CACCATTGTT GATAACTGCT TGACGCATAT CACCTGGGAT 54 60 

AGATTCTACA TCAGCAAGCA AGGCTTCAAG GTCTTCACCG ATTTCACGGT GTTTTTCTAA 5520 

ACCTGCATTG GCATTGTTGA CATAACTTTC ATGGTGTT 555a 
(2) INFORMATION FOR SEQ ID NO: 104: 

fi) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 6735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 104: 

GGAATTGTAA ATATCATATT GTTTTTGCAC CCAAATATCC TCGTCAAATC ATTTATGGCA 60 

GATACAAAGC TAGTATCGGA AGAATCATAC GTGACTTATG TGAGCGTAAG GGTGTAATAA 120 

TCCATGAAGC GAATCCTTCT TCAGACCATA TTCACATGCT TATCAGTATT CCTCCGAAAC 180 
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TTACTGTTTC GTCCTTTATG GGCTATTTAA AGGGCAAGAG CAGTTTGATG ATTTTTGATA 240 

AGCATGCGAA TTTAAAATAC AAATATGGCA ATCGCAAGTT TTCGTCTAGA GGCTATTATG 300 

TAGATACGGT AGGCCGTAAT CAGAAAGTGA TACCTGAATA TATTCAGAAT CAATTACAAG 3 60 

AAGACAGAGT AGCAGACCAG CTCACCTTAT TCGAGTCAGT AGATCCGTTT ACTGGCGAAA 4 20 

TAAATAAGAG GAACTAACTA AGGTGCTTTA GCACCTCCTC GGCAAAGTGG TGCGCGAGGA 480 

AGCTATTTCG GTCGGCCTTT GGCCCTGGCC GGTAGAAGCG GCTTATAGCC GCAGAACAAA 54 0 

CCACCAGTTC ACACTGGTGG TTTTGATTTA AAAAACTTGA TACATAAAAA TAAAAGTCTA 600 

TATAAAGGAT GGTAAAATTC CTCTTCTCCG ATTTGGACAA TATCCTAAAT AGTTACAATA 660 

TATGGTCTAT ACTTTTTCTT AGGAGAAAGC TAGATGTACA GACGTTTGAG AGATTTGAGG 7 20 

GAGGATCATG ATCTGCCCCA AAAGCAAATA GCTACAATAC TTTCGTTTAC AAATTCAGCT 7 80 

TATGCCAAAA TTGAACGGCG TGAGCATGCG TTGACGGCTG ATGTATTGGT TAAACTC7CA 840 

CATTTCTATC ACGTCAGTAC AGACTATTTA TTGGGATTAA CTGATTTTCC TGATAAAATT 900 

CGCTTTAGAA AATAATCTCC TCAATTTCAT ACAGTTTGAA AATCAGTGAG ATTTTTTATT 960 

TGCCCTTTGA CAACTGAATA GCCTAAAATG GTACTTTCCT CATTTGTGGA GCAAATTTGA 1020 

ATGGCTCGCC ATGATAAGAG CGATTTTAAA ATCATCAATA AAATAGAGCC ATACTTTATA 1080 

TGCCATGATA CAAATGATAT ACAATGATAC TTCTGACCGT TCAGCCTCCC AACGTAAAAG 1140 

AGCAGCAAGT GAAATTCTTA TGATGACTTC ATCAGTCATG CCACGTTGAA TGTGTGAGTT 1200 

TGTTAGATAA ACCCAATTAA TCCTCAAAAG GTTCCCCGAA CCTTTTCAGT TCTACAGACG 12 60 

CATCACGTGG AGTGTGTAAG CTTGTTGCTA AAAGCGTAAA AACCTTCCAA CGAAAGGAAT 13 20 

AATAGACTTT CTGCGAAACA AAAATATAAT ACAATAAAAC TATGAATGAT GAAGCAAGTA 1380 

AACAATTGAG CGATAGCCGT TTCAAGATCC TTGTAGGTGT TCAGCGCACG ACTTTTGAAG 1440 

AGATGTTAGC TGTGTTAAAA ACAGCTTATC AACGTAAACG CGCAAAAGGT GGACGAAAAA 1500 

GCAAATTAAG CCTACACCAT CTCCTTATGG TAACTATTCA ATACATGCGA GAATAGACCA 1560 

CTTATGAACA AATTGCGGCT GATTTTGGCA TTCACGAAAG CAACTTAATC CCTCCGAGTC ;.620 

AATGGGTTGA AGCAACTCTT ATTCAAAATG GTTTTACGAT TTCAAATTCT GCCTTAATTC 1680 

TGTAAAAACA GTAAAATTCG AAGGATTCTA AGGTAAGAGT TTTTTTCTTT CTGAAAAAAT 1740 

GGTATAATAG CAATCAAAAC TAGAAAATAA AACGGAATTT GGAACAGATT 7CTCTCTATC 180C 

CT ACT AG ACT GCTGA7ACTA TGAAGATTAG TAAGAGCCAC TTATTAAATT ATTCCATCTT 1860 

GATTCCCTAC TTGCTTTTA7 CTATTTTGGG CTTGATTGTC GTCTATTCGA CCACCAGTGC 1920 

TATTTTAATT GAAGAAGGCA AGAGCGCCTT GCAGTTGGTT CGAAACCAAG GAATCTTTTG 198 0 
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GATTGTTAGT TTGATACTGA TTGCCTTAAT TTATAAATTG AG ACT AG ATT TTTTGAGAAA 2040 

TGAGCGACTA ATCATTTTAG TTATATTAAT AGAAATGCTT TTATTGTTCT TCGCTCGTTT 2100 

TATTGGTATT TCCGTAAACG GGGCATACGG TTGGATTTCG GTTGCAGGAA TAACTATTCA 2160 

GCCACCTGAG TACTTAAAAA TCATTATTAT TTGGTATTTA GCTCACCGAT TCTCCAAACA 2220 

GCAAGAAGAA ATAGCTACTT ATGATTTTCA AGTTTTGACT CAAAATCAAT GGCTTCCCCG 2280 

TGCTTTTAAT GATTGGCGAT TCGTTCTCCT AGTTCTGATT GGAAGTTTGG GAATTTTCCC 2340 

TGATTTAGGA AATGCGACTA TTTTAGTCTT GGTTTCCTTG ATTATGTATA CAGTTAGTGG 2400 

AATCGCTTAT CGCTGGTTTT CAACCATTCT GGCGCTCGTA TCTGCCCCTT CTGTCTTTGT 24 60 

CTTGACCACT ATCAGCCTAA TCGGTGTTGA GACCTTTTCA AAAATTCCAG TATTCGGCTA 2520 

TGTAGCCAAG CGCTTTAGTG CCTTTTTTAA TCCTTTTGCC GATCGTGCTG ATGCAGGTCA 2580 

CCAGTTAGCT AATTCTTATT TTGCCATGGT CAATGGCGGT TGGTTTGGTC TAGCTCTTGG 264 0 

AAACTCCATT GAAAAACGAG GTTATTTGCC AGAAGCTCAT ACAGACTTTG TCTTTTCTAT 2700 

CGTGATTGAA GAATTTGGCT TTGTTGGTGC CAGTCTTATT TTAGCTCTCT TGTTTTTCAT 27 60 

CATTTTCCGG ATTATCTTGG TCGGTATCCG AGCGGAGAAT CCTTTCAATG CCATGGTTGC 2 820 

ACTCGGTGTC GGAGGGATGA TGTTGGTTCA CGTATTTGTC AATATCGGAG GGATTTCGGG 2880 

CTTGATTCCA TCTACAGGAG TGACTTTCCC CTTCTTATCC CAGGGTGGAA ATAGTCTTCT 2 940 

AGTCTTATCA GTGGCAGTAG CCTTTGTCTT AAATATTGAT GCCAGTGAAA AACGCGCTAA 3000 

ATTGTACCGA GAATTGGAAA ATCAACCAAT GAACCTTCTC TTGAAGTAGG ATAAAGAAAG 3060 

GATAGTTTAT GTCTCTTCAA AAATTAGAAA ATT AT ACT AA TAAAAGTGTT CTGCAACAAC 3120 

AAGTCTTGAT TCTAACAGAA TTACTGGAAG ATATTACTAA AAATATGCTT GCCCCACAGA 3180 

CCTTTGAAAA AATAATACAG TTGAAAGAAT TATCAACGCA GGAAGATTAT CAAGGTCTAA 3 240 

ACCGTCTAGT GACTAGCTTA TCAAATGATG AAATGCTCTA TATTTCACGC TATTTCTCTA 3 300 

TCTTGCCTCT TTTGATTAAT ATTTCAGAGG ATGTGGATTT AGCTTATGAA ATCAATCATC 3 3 60 

AAAATAATAT TGATCAGGAC TATTTAGGTA AATTATCTAC AACGATTAAA TTCGTACCAG 34 20 

AAAAGGAAAA TGCCGTTGAG ATCCTAGAAC ACTTGAATGT TGTCCCTGTT TTGACAGCCC 3480 

ATCCAACACA AGTGCAACGC AAAAGTATGT TGGATTTAAC AAATCATATT CATAGTCTTT 3540 

TGCGTAAATA CCGTGATGTT AAGTTGGGGT TGATCAATAA AGATAAATGG TACAATGATT 3 600 

TGCGTCGTTA CATCGAAATT ATCATGCAGA CAGACATGAT TCGTGAGAAA AAATTAAAAG 3 660 

TGACTAACGA AATCACGAAT GCTATGGAAT ATTATAACAG CTCCTTTTTG AAAGCTGTAC 37 20 
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CTCATTTGAC GACGGAGTAT AAGCGCTTAG CGCAAGCCCA TGGTCTGAAT TTAAAACACG 3780 

CTAAACCAAT CACCATGGGT ATGTGGATAG GTCGTGACCG TGATGGAAAT CCATTTGTTA 3 840 

CAGCAAAGAC CTTGAAGCAG TCTGCACTCA CTCAGTGTGA AGTCATCATG AACTACTATG 3 900 

ATAAAAAGAT TTACCAACTT TATCGTGAAT TTTCTCTTTC AACTAGCATT GTCAACGTCA 3 960 

CCAAGCAAGT CAGAGAAATG GCTCGTCAAT CCAAGGATAA CTCGATTTAC CGCGAAAAAG 4020 

AGCTTTACCG TCGTGCCTTG TTTGATATTC AATCAAAAAT TCAGGCAACT AAAACCTATC 4 080 

TGATTGAGGA TGAAGAAGTT GGGACTCGTT ATGAAACCGC -CAATGATTTC TACAAGGATT 4140 

TGATTGCCAT TCGACATTCT CT AC TAG AAA ATAAGGGCGA GTCCTTGATT TCAGGTGATT 4200 

TTGTGGAATT ATTGCAGGCA GTAGAGATAT TTGGTTTTTA CTTAGCATCA ATTGATATGC 4 2 60 

GACAAGACTC TAGCGTCTAT GAAGCCTGTG TGGCAGAACT CTTGAAATCA GCAGGAATTC 4320 

ATTCTCGTTA TACCGACTTG AGCGAAGAAG AAAAGTCTGA CCTTCTCTTG AAAGAATTAG 4 3 80 

AAGAAGATCC CCGAATTCTT TCTGCGACTC ACGCAGAAAA ATCAGAATTA TTAGCAAAAG 4440 

AATTACCTAT TTTTAAGACG CCTCCTCTTT TCAAAGATAA GTTCGGAGAT GATGTCATCC 4500 

GTCAGACCAT CATTTCACAT GCAACCAGCC TTTCTGATAT GCTAGAATTA GCTATTCTGT 4 560 

TAAAAGAAGT AGGACTGGTG GATACGGAAA GGGCGCGTGT TCAGATTGTT CCCCTTTTTG 4620 

AAACAATTGA AGACTTCGAT CATTCAGAGG AAACAATGAG AAAATATCTT TCTCTTAGCC 4680 

TTGCCAAAAA ATGGATTGAC TCACGAAATA ACTACCAAGA AATCATGCTT GGCTACTCTG 4740 

ACAGTAATAA AGATGGCGGT TACTTGTCAT CATGTTGGAC CCTCTACAAG GCTCAACAAC 4800 

AATTGACTGC TATTGGAGAT GAATTTGGCG TTAAGGTTAC CTTCTTCCAT GG7CG7CGTG 4 860 

GTACTGTCGG TCGTGGTGGT GGGCCAACCT ATGAAGCCAT TACATCTCAA CCGCTCAAGT 4 920 

CTATCAAGGA TCCTATCCGC TTGACGGAGC AGGGTGAAGT AATTGGGAAT AAATACGGTA 4 980 

ACAAAGACGC CGCTTACTAT AACCTTGAAA TGCTAGTATC GCCAGCTATT AACCGTATGA 5040 

TTACTCAGAA GAAGAGCGAT ACCAATACCC CAAATCGTTA TGAAACCATT ATGGATCAAG 5 LOO 

TAGTGGACCG TAGTTACGAT ATCTACCGTG ATTTGCTCTT TGGTAATGAG CATTTCTATG -5160 

ATTATTTCTT CGAGTCAAGT CCAATCAAGG CTATTTCAAG TTTTAATATT GGTTCTCGTC 5220 

CAGCCGCTCG TAACACTATT ACTGAAATCG GTGGTTTGCG TGCCATCCCT TGGGTATTCT 5280 

CATGGTCACA GAGTCGTGTT ATGTTCCCTG GATGGTACGG GGTTGGTTCA AGCTTCAAGG 5 340 

AATTTATCAA TAAAAATCCA GACAATATTG CTATCTTACG AGATATGTAC CAAAATTGGC 5400 

CTTTCTTCCA ATCGCTTCTT TCAAATGTTG ATATGGTTTT GTCAAAATCA AATATGAATA 54 60 

TTGCTTTTGA ATATGCTAAA CTTTGTGAAG ACGACCAAGT TAAGGCCATC TATGAGACTA 5 520 
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TTTTAAATCA ATGGCAACTT ACTAAGAACG TTATCTTGCC TATTCAACGA CATGACGAAC 5580 

TCTTAGCTGA CAATCCATAT CTAAAAGCTA GTCTGGATTA CCGTATGCCT TACTTTAATA 5640 

TTCTCAACTA TATTCAGTTG CAGTTGATTA AACGCCAACG TCGTGGAGAA TTGTCCAGTG 5700 

ATCAAGAACG ATTGATTCAT ATCACCATCA ACGGAATTGC GACAGGATTG CGTAATTCAG 5760 

GTTGATAATT TTCAAGAGTG AATGCTAAAA GTGAATATCA AAAAAATTCT AATAGACTAT 5820 

TTAGAGAGTC TGTGGTAGCT GAAAACAGAT AAGTCGCAAT GATGAAAATT GGGCTG AATG 5 94 0 

CTATTTAGAA TTTGAAATTA TAAAAATTCG GTAAGCACAC CTTACAGTGC ATCTCGTTAT 6000 

TCCCACACTG AGCGATAGGG AAATTCCCTA TAATTGAGGT GGTACCGCGC ATCGACGTCC 6060 

TCACACAAGT TTTTTGTGTG AGGATTTTTT TGATGGAGGT TAGTATGGAA AGAAAACGAT 6120 

GGCGTCCCTT GTTTAGATAA GTGAAATATG TTAAAGGAAA TAAAAAGGAG AAACAGAATG 6180 

AAAAATAAAC CTTTAATTGC AATTATTGCT GCATTAGCAG TCTTAGTAGC AGGAAGCTTG 624 0 

ATTTATTCTT CAATGAATAA ATCAGAAGCT CAGAATAATA AGGATGAGAA CAAAATAACC 6300 

AACATTGGTG TGCTTCAATT TGTGAGCCAT CCATCCCTTG ATTTGATTTA TAAAGGGATC 6 3 60 

CAAGATGGAC TTGCAGAAGA AGGATATAAA CATCATCAAG TTAAAATTGA TTTTATGAAC 64 2 0 

TCAGAAGCTG ACCAAAGTAA GGTTGCGACA ATGAGTAAAC AATTGCTTGC AAATGGGAAT 64 80 

GACCTTGTGG TTGGTATCGC AACACCAGCA GCCCAAGGGT TGGCTAGTGC AACAAAAGAC 6540 

CTACCGGTTA TCATGGCCGC T ATT AC AG AC CCAATTGGTG CTAACTTGGT TAAACATTTG 6600 

AAAAAACCAG GTGGCAACGT TACAGGGGTA TCTGACCACA ATCCAGCTCA ACAACAACTT 66 60 

GAACTCATCA AGGCTCTGAC ACCGAATGTG AAAACAATCG CAGCTCTTTA CTCAAGTAGC 672 0 

GAAGACAATT CAAAA 6735 

(2} INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS : 

fA) LENGTH: 6516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
CTAGAGGATC CCAGCAGGTA AATTGGCTTC ACCTGCCAAA AAAGTTGCCC TCCTTGAACG 
CAGCAAGGCT ATGTACGGTG GAACTTGTAT CAACATTGGT TGTATCCCAA CTAAAACCTT 



60 
120 
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GCTAGTTGCT 


GCTGAAAAGG 


ACTTGTCTTT 


TGAAGAAGTC 


ATTGCTACTA 


AAAACACGAT 


180 


CACTGGTCGC 


CTCAACCGTA 


AAAACTATGC 


GACTGTTGCT 


GGTACAGGCG 


TAGATATCTT 


240 


TGATGCGCAA 


GCTCACTTCC 


TTTCAAATAA 


AGTCATCGAA 


ATCCAAGCTG 


GTGATGAAAA 


300 


GAAAGAACTG 


ACTGCTGAAA 


CAATCGTCAT 


CAACACTGGT 


GCTGTTTCAA 


ACGTCTTGCC 


360 


AATCCCTGGA 


CTTGCTACAA 


GCAAAAACAT 


CTTTGACTCA 


ACAGGTATCC 


AAAGCTTGGA 


420 


CAAATTACCT 


GAAAAACTTG 


GAATCCTTGG 


TGGCGGAAAT 


ATCCGTCTTG 


AATTTGCCGG 


430 


CCTTTACAAC 


AAACTTGGAA 


GCAAGGTCAC 


AGTCCTAGAT 


GCCTTGGATA 


CATTCCTACC 


540 


TCGTGCAGAA 


CCTTCCATCG 


CAGCTCTTGC 


TAAACAATAC 


ATGGAAGAAG 


ATGGCATTGA 


600 


ATTGCTTCAA 


AATATCCATA 


CTACTGAAAT 


CAAAAACCAT 


GGTGACCAAG 


TGCTTGTCGT 


660 


AACTGAAGAC 


GAAACTTACC 


GTTTCGACGC 


CCTTCTCTAC 


GCAACTGCAC 


GCAAACCAAA 


720 


TGTAGAACCA 


CTTCAACTTG 


AAAATACAGA 


TATTGAACTA 


ACTGAACGTG 


GTGCTATTAA 


780 


AG TAG AC AAA 


CACTGTCAAA 


CAAACGTTCC 


TGGTGTCTTT 


GCACTTGGAG 


ATGTCAACGG 


340 


TGGCCTTCAA 


TTTACTTACA 


TTTCACTTGA 


TGACTTCCCT 


CTTCTTTACA 


GCTACCTTGC 


900 


TGGAGATGCC 


AGCTATACAC 


TTGAAGACCG 


TCTCAATGTG 


CCAAATACTA 


TGTTCATCAC 


960 


ACCTGCACTT 


TCACAAGTTG 


GTTTGACTGA 


AACCCAAGCA 


GCTGATTTGA 


AACTTCCATA 


1020 


CGCTGTTAAG 


GAAATCCCCG 


TTCCAGCAAT 


GCCTCGTGGT 


CACGTAAATG 


GAGACCTTCG 


1080 


CGGTGCCTTC 


AAAGCTGTTG 


TCAATACTGA 


AACAAAAGAA 


ATTCTTGGAG 


CAAGCATCTT 


1140 


CTCAGAAGGT 


TCTCAAGAAA 


TCATCAACAT 


CATCACTCTT 


GCTATGGACA 


ACAAGATTCC 


1200 


TTACACTTAC 


TTCACAAAAC 


AAATCTTCAC 


TCACCCAACC 


TTGGCTGAGA 


ACTTGAATGA 


1260 


CTTGTTTGCG 


ATTTAAGTTG 


AGATTTAATC 


GTATCGAACA 


GCCCTCTTTG 


CCCTCTTTTT 


1320 


ACTTCTGCGG 


AATCTCAAAT 


CTGTCTTTCT 


CCTCTTTTAT 


GATATAATAG 


AAACATGAAC 


1380 


TTAAAAACTA 


CTTTGGGCCT 


TCTTGCTGGG 


CGTTCTTCCC 


ACTTCGTTTT 


AAGCCGTCTT 


1440 


GGACGTGGAA 


GTACCCTCCC 


AGGGAAAGTC 


GCCCTTCAAT 


TTGATAAAGA 


TATTTTACAA 


1500 


AACCTAGCTA 


AGAACTACGA 


GATTGTCGTT 


GTCACTGGAA 


CAAATGGAAA 


AACCCTGACA 


1560 


ACTGCCCTCA 


CTGTCGGCAT 


TTTAAAAGAG 


CTTTATGGTC 


AAGTTCTAAC 


CAACCCAAGC 


1620 


GGTGCCAACA 


TGATTACAGG 


GATTGCAACA 


ACCTTCCTAA 


CAGCCAAATC 


TTCTAAAACT 


1680 


GGGAAAAATA 


TTGCCGTCCT 


CGAAATTGAC 


GAAGCCAGTC 


TATCTCCTAT 


CTGTGACTAT 


1740 


ATCCAGCCTA 


GTCTTTTTGT 


CATTACTAAT 


ATCTTCCGTG 


ACCAGATGGA 


CCGTTTCGGT 


1800 


GAAATCTATA 


CTACCTATAA 


CATGATATTG 


GATGCCATTC 


GGAAAGTTCC 


AACTGCTACT 


1860 


GTTCTCCTTA 


ACGGAGACAG 


TCCACTTTTC 


TACAAGCCAA 


CTATTCCAAA 


CCCTATAGAG 


1920 
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TATTTTGGTT TTGACTTGGA AAAGGGACCA GCCCAACTGG CTCACTACAA TACCGAAGGG 1980 

ATTCTCTGTC CTGACTGCCA AGGCA7CCTC AAATATGACC ATAATACCTA TGCAAACTTG 2040 

GGTGCCTATA TCTGTGAAGG TTGTGGATGT AAACGTCCTG ATCTCGACTA TCGTTTGACA 2100 

AAACTGGTTG AGTTGACCAA CAATCGCTCT CGCTTTGTCA TAGACGGCCA AGAATACGGT 2160 

ATCCAAATCG GCGGGCTCTA TAATATCTAT AACGCCCTAG CTGCTGTGGC CATCGCCCGT 2220 

TTCCTAGGTG CCGATTCGCA ACTCATCAAA CAGGGATTTG ACAAGAGCCG TGCTGTCTTT 22 80 

GGACGCCAAG AAACC7TTCA TATCGGTGAC AAGGAATGTA CCCTTCTCTT GATTAAAAAT 2 340 

CCACTCGGTG CAACCCAAGC TATCGAAATG ATCAAACTAG CACCTTATCC ATTTAGCCTA 2 400 

TCTGTCCTCC TTAATGCCAA CTATGCAGAT GGAATTGACA CTAGCTGGAT CTGGGATGCA 2 460 

GACTTTGAAC AAATCACTGA CATGGACATT CCTGAAATCA ACGCTGGCGG TGTTCGTCAT 2520 

TCTGAAATCG CTCGTCGCCT CCGAGTGACT GGCTATCCAG CTGAGAAAAT CACTGAAACG 2580 

AGTAATCTGG AGCAAGTTCT CAAGACCATT GAGAATCAAG ACTGCAAGCA TGCCTATATT 2 640 

CTGGCAACTT ATACTGCCAT GCTGGAATTT CGTGAACTGC TGGCTAGTCG TCAGATTGTT 2700 

AG AAAGG AG A TGAACTAATG GTTTATACTT CACTTTCCTC AAAAGATGGC AATTACCCCT 27 60 

ATCAGCT CAA CATTCCCCAC CTCTACGGAA ATCTCATGAA TACtACGGGG ACAATGGAAA 2820 

CATCCTCATG CTCAAGTATG TGGCTGAAAA ACTGGGAGCC CATGTGACCC TTCACATCGT 2880 

TTCTCTCCAT GATGACTTTG ATGAAAATCA CTACGACATC CCCTTTTTCG GTGGTGGTCA 2940 

AGACTTTGAA CAAAGTATCA TTGCAGACGA CCTACCTCCT AAAAAAGAGA GCATTGACAA 3000 

CTACATCCAA AACGACGGTG TAGTTCTGGC TATCTGCGCT GGTTTCCAAC TATTGGGTCA 3 060 

ATATTATGTT GAAGCTTCAG GAAAACGTAT CGAAGGGCTA GGGGTCATGG G AC ACT AC AC 3120 

GCTCAACCAG ACCAATAACC CTTTTATCGG TCACATCAAG ATTCACAATG AAGATTTCGA 3180 

TGAAACCTAC TATGGATTTG AAAATCACCA AGGTCGTACC TTCCTCTCTG ATGACCAAAA 3 240 

ACCGCTGGGA CACGTTGTCT ATGGAAATGG AAACAACGAA GAAAAGGTCG GTGAAGGGGT 3 3 CO 

TCATTATAAG AATGTCTTTG GTTCCTACTT CCACGCGCCT ATCCTCTCTC GTAATGCCAA 3 3 60 

TCTGGCTTAT CGCCTAGTTA CTACTCCCCT CAAGAAGAAA TATGGTCAGG ACATCCAACT 3420 

CCCTCCCTAT GACCACATTC TCAGCCAAGA AATCGCTGAA GAGTACAGTG ACGTCAAAAG 3480 

CAAGGCTGAC TTTTCTTAAA C AAAGG AAAA TGATATCAAA GAACTCCGTT ATCTTGTCGG 3 540 

AGTTTTTTCT CTTTTCTTTT ACCCTTCTCC CTTGCATTTT CTCTCATTTT TTGCCAAAAT 3 600 

AGAGGGGTAG AAAGAAGGTA GCATATGTCT AAATTACAAC AAATCCTAAC ATATCTTGAA .3 660 
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TCAGAAAAAC TAGACGTCGC TCTCGTATCT GACCCCGTCA CAATCAATTA CCTCACTGGT 3 7 20 

TTTTACACTG ATCCCCATGA ACGCCAAATG TTCCTCTTTG TCCTAGCAGA TCAGGAACCT 3780 

CTCCTCTTTG TCCCAGCTCT TGAAGTAGAA CGTGCAAGTA GCACCGTTTC CTTCCCAGTA 3840 

GTGGGCTATG TCGATTCTGA AAATCCATGG CAAAAAATCA AACATGCTCT TCCACAACTT 3900 

GACTTCAAAC GTGTCGCTGT TGAGTTTGAC AATCTCATCT TGACCAAATA CCATGGTTTG 3960 

AAAACAGTTT TTGAGACTGC TGAGTTTGAC AACCTCACTC CTCGTATCCA ACGCATGCGC 4020 

CTCATCAAAT CAGCTGATGA AGTGCAAAAA ATGATGGTTG CAGGTCTTTA TGCTGACAAG 4080 

GCTGTTCATG TTGGTTTTGA CAATATTTCT CTTGATAAGA CTGAGACAGA TATCATCGCA 4140 

C AAATCG AC T TTGCCATGAA ACGTGAAGGT TATGAAATGA GCTTTGATAC CATGGTCTTG 4 200 

ACTGGTGATA ATGCTGCGAA TCCACACGGC ATTCCAGCAG CTAATAAGGT TGAAAATGAT 4260 

GCTCTTCTCC TCTTTGACCT GGGTGTTCTG GTCAATGGCT ATGCGTCAGA TATGACTCGT 4320 

ACAGTCGCTG TCGGCAAACC AGACCAATTC AAGAAAGATA TTTACAACTT GACTCTTGAA 4 380 

GCCCAACAAG CTGCTCTTGA CTTTATCAAG CCAGGTG7GA CTGCTCATGA AGTGGACCGC 4440 

GCTGCCCGTG AGGTCATCGA AAAACCTCCT TATGGTGAGT ACTTCAACCA CCGTCTCGGG 4 500 

CATGGTATCG GTATGGATGT CCATGAATTC CCATCTATCA TGGAAGGAAA CGACATGGTC 4560 

ATCGAAGAAG GCATGTGCTT CTCTGTTGAA CCAGGTATCT ATATCCCTGG TAAAGTCGGT 4620 

GTTCGTATTG AAGACTGCCG TGTTGTTACC AAGGATCGC? TCAACCTCTT TACAAGCACC 4680 

AGCAAAGATT TGCTTTATTT TGATTAAACT ATATAGCCCC TATGCTTTCC TTTCAAAATA 4740 

TCTAGGGGCT ATTTTATTGT CATTTTTCTG CTATTATCCT AAAGAAATTG GCTGCAATAA 4800 

TCTAACCCTA ACTGTCTGGA ATGATAACGA GGGTGCTCTC CGCTTTTATC AAACACAAGG 4860 

GATGAAACCC CAAGAAACAA CAATGGAAAT GATAATTCAT TAAGAAGTCA TCTATCAAAA 4920 

GATGTTAGAA AAAGTTCAAT TTCACTACAA AATGAGGAAA ATCTCCCCAC AATAAAACGC 4 980 

ATAGTATCAG GTATTGTGTA CTGACCCCAA ACAGTTAGAC AATTAATTTA TCCGAAGGAT 504 0 

TTAGTTCTGT ACTGCACAGG ACTAAGTCCT TTTAGTTTTA CCTTAATTCG TTTGTTGTTG .5100 

TAGTAATCAA TATAGTCTAT AATGACTTGT TCCAATTCGT TAAGTGATTT AAATGTTTTC 5160 

TCATAGCCAT AAAACATTTC GGATTTTAAA ATGCCAAAGA AAGATTCCAT CATACCGTTG 52 20 

TCTTGGCTGT TTCCCTTGCG TG AC AT AG AT GCTTGAATTC CCTTATTCTC TAGGAACCGA 52 80 

TGATAAGAAT CCTCTTCCTA TTGCCAGCCT TGGTCACTAT GGAGAATCGT ATTCTCGTAG 5 340 

TGCTTCTCT? TGAATGCCTG TTCCAACATT GTTTGTACTT ATTCTAAATT AGGCGAACAA 5400 

GAAAGATTAA AAGCAATAAT TTCGCTGTTA AAGCCATCTA AAACTGGTGA TAAGTAAAGC 5460 
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TTTTGAGTAC TTGCTGGAAT GGCAAATTCA GTCACATCTG TGTAGCACTT TTCCATTGTT 5 520 

TTAGAGCCTT CAAATTGGGC TTGAATGAGA TTCTCTGCCT TCTTACCAAC GTCTCCTTTA 5580 

TGAGAAGAAT ATTTTCCTTT CTTTCGCATT TTAGCTTGTA AATTGAGTAC TTTCATCAAG 5640 

CCTTGAACTC TTTTATGATT TACCAGATAA CCACCATTTC TTAGTTCTAA ATGAACCCGG 5700 

CGATAAGCAT AATTTCCCTT GTGTTCGATA AAGATGGATT GAATTTCACT TTTAAGCTCT 57 60 

TGGTCTTTAT CTGTTTTCTC TAGCTGTTTC AAGTGATAGT AGTAGGTCCA AC GAG CT ACT 5320 

TTAATGGCTT CTAGAAGAAG ATCTAACGAA AACTCAGTCA TTAATTCTTG AACAATTTCT 5380 

GTCTTTCTTC TTTCTCTTTT TCCTCCTTCA ATCGCACTTC TCTTAACTTT TTTAGGATGG 594 0 

CATTCTCCGC TCTCACGTAC TCTCCCTCTT GTTTTCTCAA CAATAGTATA CCCGTTTTTC 6000 

CTGTATTGTG CTAGCCAGTT AAGAAGTATC GTACGACTTG GGAGACCGTA TTCAAGAGAA 6060 

ACTCTATCTT TAGTCCAGCC TTCATGTCAG ACTTTATTAA CCCCAATTAT TCACCCCAAA 6120 

TCTAAAAACC ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC 6180 

CAATGGGTGT TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA 62 40 

AAACACAAAA AGAAAGGAAA CTCACATGAA CAGTTTACCA AATCATCACT TCCAAAACAA 6300 

GTCTTTTTAC CAACTATCTT TCCATGGACG TCATTTAACC CAGTATGGTG GTCTTATCTT 6 3 60 

TTTTCAGGAA CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC 64 20 

GAATGAmCAA CGCCGCTACT GTCGTTATTC GGATTCAGAT ATCCwTGTCC AGTTCCTCTT 6480 

TCAACTGTTA ACAGGTTATG GAACGGAATA TGCTTG 65 Lb 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

TTTTCAACCC ATATCGTGGC TCCTGAATAC TACTTACTGA CAACTATGCT ATCAGAGACT 60 

TCTCTACTTG TTTTCTATAT CATTTTCATC CATAGAAAAC AACTCATCCA CTTGGGACAT 120 

ATCTTTAGCT ATACTGTTCG ATACTCTCTC TTTTCACTTT CCTTTGTAGC AATTTATTTC 180 

CTGATTAATT TCGTGTATCC TGTAGATATC CTCATTAATT TGCCATTTTT GATTAATACT 2 40 

GGTTTGATTG TCTTGCTATC AGCTATCTCT TATATTAGTC TACTTGTCTT CACAAAAGAT 3 00 
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AGCATTTTCT 


ATGAATTTTT 


AAACCATGTC 


CTAGCCTTAA 


AAAATAAATT 


TAAAAAATCA 


360 


TAGGAGTTTA 


AAATGAAACA 


ACTAACCGTT 


GAAGATGCCA 


AACAAATTGA 


ATTACAAATT 


420 


TTGGATTATA 


TTGATACTCT 


CTGTAAAAAG 


CACAATATCA 


ACTATATTAT 


TAACTACGGT 


480 


ACTCTGATTG 


GGGCGGTTCG 


ACATGAGGGC 


TTTATCCCTT 


GGGACGACGA 


TATTGATCTG 


540 


TCCATGCCTA 


GAGAAGACTA 


CCAACGATTT 


ATTAACATTT 


TTCAAAAGGA 


AAAAAGCAAG 


600 


TATAAGCTCC 


TATCCTTAGA 


AACTGATAAG 


AACTACTTTA 


ACAACTTTAT 


CAAGATAACC 


660 


GACAGTACGA 


CTAAAATTAT 


TGATACTCGA 


AATACAAAAA 


CCTATGAGTC 


TGGTATCTTT 


720 


ATCGATATTT 


TCCCTATAGA 


TCGCTTTGAT 


CATCCTAAGG 


TCATTGATAC 


TTGTTATAAA 


780 


CTGGAAAGCT 


TCAAACTGCT 


GTCTTTCAGT 


AAACATAAAA 


ATATTGTCTA 


TAACCATAGC 


940 


CTTTTAAAAG 


ATTGGATACG 


AACAGCCTTC 


TGGTTACTCC 


TTCGACCGGT 


TTCTCCTCCT 


900 


TATTTTGCAA 


ATAAAATCGA 


GAAAGAAATT 


CAAAAATATA 


GTCGTGAAAA 


TGGGCAATAT 


960 


ATGGCTTTTA 


TCCCTTCAAA 


ATTTAACGAA 


AAGGAAGTCT 


TCCCAAGTGG 


TACCTTTGAT 


1020 


AAAACAATCG 


ATTTACCCTT 


TGAGAATTTA 


AGCCTTCCTG 


CACCTCAAAA 


ATTTGATACT 


1080 


ATTTTGACAC 


AATTTTATGG 


AGATTATATG 


ACCCTACCAC 


CAGAAGAAAA 


ACGCTTCTAC 


1140 


AGTCATGAAT 


TTCACGCTTA 


TAAATTGGAG 


GATTAGGATC 


CAATATTTAC 


AAAAAAAAGA 


1200 


AATTAAAGAA 


ATTCAACTAG 


CCCTGCTGGA 


CTATATTGAT 


GAGACTTGTA 


AGAAACATGA 


1260 


TATTCCTTAT 


TTTCTCAGTT 


ATGGAACCAT 


CCTTGGACCC 


ATCCCCCACA 


AAGGTATGAT 


1320 


TCCTTGGGAT 


GATGATATTG 


ATATTTCCCT 


TTATCGTGAG 


GATTATGAGC 


GTTTACTGAA 


1380 


GATTATTGAA 


GAAGAAAATC 


ACCCTCGCTA 


CAAGCTTCTT 


TCCTACGATA 


CATCTTCTTG 


1440 


GTACTTCCAT 


AATTTCGCAT 


CGATTTTGGA 


CACTTCTACT 


GTTATAGAAG 


ACCATGTTAA 


1500 


GTACAAGCGT 


CATGATACCA 


GCCTTTTCAT 


CGATGTCTTC 


CCAATTGATC 


GATTTACAGA 


1560 


CTTGAGCATT 


GTCGACAAGA 


GCTATAAGTA 


TGTGGCTCTT 


CGTCAACTAG 


CTTATATCAA 


1620 


AAAATCACGA 


CCAGTTCACG 


GTGATAGCAA 


ACTAAAAGAT 


TTTCTTAGAT 


TATGTAGCTG 


1680 


GTACGCTCTC 


CGATTTGTCA 


ATCCTCGCTA 


CTTTTACAAG 


AAAATTGATC 


AACTACTCAA 


1740 


AAATGCTGTA 


ACCAACACTC 


CTCAATATGA 


AGGAGGACTT 


GGGATCGGTA 


AGGAAGGGAT 


1800 


GAAAGAAATC 


TTCCCAGTTG 


ATACCTTTAA 


AGAACTGATT 


TTAACTGAGT 


TTGAGGGCCG 


1860 


TATGTTGCCT 


GTTCCCAAAA 


AATATGACCA 


ATTTTTAACC 


CAGATGTATG 


GCGATTATAT 


1920 


GACACCACCA 


TCAAAAGAAA 


TGCAAGAGTG 


GTATAGTCAT 


AGCATTAAAG 


CTTATCGCAA 


1980 


AAACTCATTC 


AGGGGGATTA 


TACAAACTAC 


TAAGATAGAC 


GTTATTCAAA 


AACATAATTT 


2040 


TAGTAGAAAA 


TCAAATACAT 


ATTCCCACAA 


TAAAACGCAT 


CATATCAACG 


TTTTTGAAAA 


2100 
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ACCTTGATAT GATGCGTTTT ATAATTTTAA AGACTTTTTT CTATAGTAGA TTGAAATAAG 2160 

ATGCGAACAA ATCAATTAGA AAATTCAAAT TAATTTATAG AAATATTTTA GTATTCCTGT 222C 

GTACTGTTCT AAATTCAGTC TGCTATATCT TATTTTTCTA TTTAAATCGC TTCTGTAACA 2280 

AAGCTACGAC T7TCAACTAC CTTAAGCATG GCATTAGCTG TATCTAGCGC TGTGAAGAGG 2340 

GGCACCCCGT GTTCAATCGC TGAACGACGA ATTTGCTCAC CATCTTCGTC AGCAGTTCCT 2400 

TTTGTTCCTA CTGTGTTAAT GATAGCTTGA ATTCTTCCTT TGCGTACAAA ACTTGGGATA 24 60 

TCCTTATCGT CATCACCAAT CTTACCAACA GGTTGGGCTT CCAAGCCATG ACTAGCAAAG 2 52 0 

AAGGCTGCTG TCCCTTCTGT CGCAAGGATT CCATAACCAA TGTTTTGGAA ACGACGAGCC 2580 

AAGTTCAACG CTTCTTCTTT GGCATCATCA GCGATGGTAA AGACGACATT ACCAAAAGTT 2640 

GGCAAGTGTA GATAAGAAGC TTCAAAGGCT TTATAGAGAG CTTTTTCCAA AGTAGCATCA 2700 

GAACCCATAA CTTCACCTGT TGACTTCATT TCAGGACCGA GCAAGCTGTC TACCTTAGC? 27 60 

AGTTTGGTAA AGGAGAAGAC ACGTGCCTTG ATATGAACAC CCGTGCTTTC AGCCTAAAC7 2820 

CCATTTTGGT AGCCAAGTTC TGATAAACTT TGACCAAGAA TGAGTTTGGT CGCTACTTGA 2 880 

GCCATAGGAA TATTCGTTAC CTTAGATAGG AATGGAACAG TACGGCTGGC ACGTGCATTG 294 0 

ACCTCAATAA CGTAGACTTT TTCATCCTTG ATAACAAACT GGATGTTCAT CATTCCAACC 30 00 

CAGTGAAGAC CCATTCCTAA GCGTTTGGTG TAGTCTGCGA TGGTCTCCTG AACCTTTTGC 3060 

GACAAGGTTT GTGGTGGGTA AACAGCCATT GAGTCACCTG AGTGGACACC AGCACGTTCG 3120 

ATATGCTCCA TGATACCAGG AATGAGTACA TTTTTACCAT CTGAAATGGC ATCAACTTCG 3180 

CACTCTTCCC CAACGATATA AGAGTCGACA AGAACTGGGT GGTCTGGACT ACCCTTAACA 32 40 

GCAG7TCGCA TGTAAGAACG AAGGTCTTCT TCGTTTTCAA CGATT7CCAT GGCACGTCCA 3 300 

CCAAGTACAT AAGATGGGCG GACAAGAACT GGGAAGCCAA TCTTGCGAGC TGCAAGACCT 3 3 60 

GCTTCTTCTT CATTGGTAGC CCTTTCTCC7 GGTGGCTGTG GAATATCCAA TTCTTTGAGA 3 4 20 

GCTTGCTCGA AGAGGTCACG GTCTTCGGCA CGATCTAGCT CAGCAACCTG TGTACCAAGC 3 480 

A7GGTCACAC CTGCTTTTGC CAATGGCTCC GCAAGGTTGA TGGCTGTTTG ACCACCGAAC 3540 

TGAACGATAA CTCCCTTTGG TTGTTCCAAG TCAATGACGT TCATAACATC TTCGAATGTC 3 600 

AATGGCTCAA AGTAAAGCTT ATCTGATACA GAGAAGTCTG TTGAAACGGT CTCTGGGTTT 3 660 

GAGTTCATGA TGATAGCTTC ATAACCAGCT CCCTGGATAG CCTTAACAGA GTGAACGGTT 3720 

GCGTAGTCAA ACTCAACCCC TTGACCGATA CGGATTGCAC CTGAACCTAG GACAAGTACA 3 7 90 

CATTCTTTAT CAGA7CTCA7 AGATTCATTT TCCCAACCAT AGGTTGAATA GAAATATGGC 3 84 0 
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GTTTCGGAGT CGAACTCTGC CGCACAAGTG TCTACCATCT TATAAACTGG AACAATCTTG 3900 

TTTTCCAAGC GAAG7TGGCG AACTTTATCA TCAGTCGTTC CCCAGAGTTC AGCAATCTTA 3950 

CGGTCTGAAA AACCATTAAG TTTGGCTCTT TTCAAAACTT CTAAATCTTC TGGATGAGCA 4020 

CCCAATTCTT GCTCAATTTC AAAGATATGC AAGAGTTTAT CAAGATAGAA GATATCAATT 4 080 

TTTGTAAGCT CTGCAATTTC TTCAGGTGTG TAGCCACGAC GAATGGCTTC TGATACGTAG 4 140 

AAGAGACGGT CATCTTCCCC TTTGACAACC TTTTCAATCA AGGCATCATC AGAAACTGCT 4 200 

GCAAGTTCAG GTATTTCATT GTGGTGCACC CCAATTTCAA GGGAGCGGCA GGCCTTGAGA 4 260 

AGAGATTCCT CGATGTTACG ACCCATTGCC ATGACTTCTC CAGTCGCCTT CATTTGTGTA 4 320 

CCGAGACGGC GTTCACCCTT TTCAAACTTG TCAAATCGGA AACGTGGAAT CTTAGCAACT 4 380 

ACGTAGTCAA GGGCTGGTTC AAACATGGCA TAGGTTGAAC CTGTAACTGG GTTTATAACC 4440 

TCATCCAAGG TCAAACCTAC TGCAATCTTG GCAGCCAACT TAGCAATCGG ATATCCTGTC 4500 

GCTTTAGAAG CAAGGGCTGA CGAACGTGAT ACACGACGGT TTACTTCGAT AACATAATAC 4560 

TTGAAGCTGT TAGGATCAAG AGCTAGCTGA ACATTACATC CACCTTCAAT CTTGAGGCCA 4 620 

CGAATAATGC TCAAGCTCGC ATCACGAAGC ATTTGGTTTT CATAGTCTGA CATGGTTTGC 4680 

GCAGGGGCAA ATACAATGGA ATCCCCTGTG TGAATCCCAA CTGGGTCAAA GTTTTCCATG 4740 

TTACAAACAA CCAAGGCATT GTCAGCTGAG TCACGCATCA CTTCGTATTC AATTTCCTTG 4800 

AAACCGGCAA TCGAACGCTC AATCAAACAT TGGGTAACAG GTGACAATTT CAAACCATTT 4 860 

TCAGTGATTT CACGCAATTC TTTCTCGTTG GCACACATAC CACCACCAGT ACCACCAAGG 4 920 

GTAAAGGCTG GACGAACGAT GACTGGGTAG CCAATTGTCG CTGCAAAGGC AACTGCTTCT 4 980 

TCTACTGTGT TAACAATTTC AGATTCTGGA ATGGCTTCTT CAAGCTCTTC CATCAATTGT 5040 

TTAAAGAGGT CACGGTCCTC CGCTTGGTCA ATGGCAGATA ATTTGGTACC CAGAAGTTCA 5100 

ACGCCAAGCT CGTCTAGGAT ACCATTTTTA GATAATTCCA TGGCCATGTT CAGACCTCTC 5160 

TGACCACCGA GTGTTGGTAG CAAGGCATCT GGACCTTCCT TACGAAGAAT ACGTGTCACA 5 220 

AACTCAAGTG TAATCGGTTC AATGTAAACC TTGTCAGCAA TTTCCTTGTC CGTCATGATG 5280 

GTTGCAGGAT TTGAGTTAAC CAAAACAACC TCATAACCT7 CCTCTTTCAA CGACAAGCAA 5 340 

GCCTGAGTCC CAGCGTAGTC AAACTCAGCA GCCTGACCAA TAATAATCGG ACCAGAACCA 5400 

ATCACCATAA TTTTTTGAAT ATCAGTACGT TTAGGCATAT ATAAGATATT AAGGGTGTCA 54 60 

AGCGGACAAA GCTAAAATAG GAGTTATGAC GAAGAACTGT CAGTTCTAGG AATAACTATC 55 20 

TTTTTAGCAC CGTCCGTAGC CCGTATTCAG TTCAGCAAAT ACGGAGCACC CTTCTCCTTT 5580 

CTATTCGTCG CCTCTCAGGG CGACATTAAA TAAGATACAA AGGACGAATA GAAAGCGATT 5640 
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CAAGCTCTGG CTAGGGTGTT GGCCCGAACC ATCTCCACCA TTGATGATGG AAGTCGTAAT 7440 

CCTTGGACTA GCAATCAATT CTCTATAGTA GTCGACCTCT GGATGGCGAA TC AC AC AG AC 7 500 

ATCCACTCCT AAAGCAGACA CAGTCAAAAT GGTGTCATAA AGTGTCTCAC CCTTATTAAC 7560 

CGAGCTAGTC TTCACATCAA AGTCAAGTCG TTCCAATCCA AGTTTAATCT CTCCGACTTC 7620 

AAAGGACTTA TGTGTCCGTG TAGAATCCTC AAAGAAGAGA TTGGAAACAA TCGGATGGTC 7680 

TTCATAGGGA AGCTGGGCTC CATTTTTAAA CTCAATTCCT CCCTTGATCA ATTTCATTAC 77 40 

TTGATCGACA GTGAGGTCTT CCATGGACAC CACATGGTTC AATGCTTGTT GATTTTCTGA 7 800 

CATGGCTACT CCTTTAACTT TCTAAGCTTC TTCAGTAATC AGAACTCTGT CTTGGTCATC 7860 

AAGTTCTGTC ATCTCTACGA TGATTTCTTC AGAACGACTG GTTGGGATAT TTTTTCCAAC 7920 

GTAATCTGGA CGGATTGGCA ATTCTCTATG TCCACGATCG ACTAGAACTG CTAAACTCAC 7980 

ACGCGCAGGA CGACCATGAC CCACAATATT ATCAATAGCA GCACGGATGG TACGACCTGT 8040 

ATAGAGCACA TCATCCACCA AGATAACTTC GCGGTCTGTC ACATCGACAG AAACCAAAGA 8100 

AGTATCTTCT CCACTTTTAA CATCATCACG GAAACGTTTA GTATCCAATT CCACAACAGG 8160 

AACTGAAAGA TTTTCTAACT GCTTCAAACG TTCTTGGATT CGGTGGGCAA TAAAGACACC 8220 

ACGAGTTTTA ATACCAGCCA AGACGATCTT ATTCAAATCT TTGTTGCGTT CGATAATCTC 82 80 

ATAAGTAATA CCCCTAATCC CTCGTTTGAC GGTCAATTCG TCTACAACTT CTTTTGTTTT 8340 

CATGACAAAC CTCCAAAAAG AAAAGTCTCC TTAAACAACC AGACTTGAAA TTTATAGCCA 8400 

ACCGAGCCCT ACTGCACACA CTATAGACTT CACCCTTCTA CTTTATCGCG CTCCTTGCCT 8460 

GCCTCACGGG ACAGGTTTAA AGGAATATTT AGTTATCATT TACTATAGCA CAAAGCATGC 85 20 

TTAAAATCAA GCAAAAAGTT TCAATGTAGC ATCTTACAAA TTGCTAAAAT CATATAATTG 8S80 

TGGCTACTGG TCACACTCTG GATTTTTTGG ATGGCAAATG GCTCTTCCAA AATAAATCAT 8640 

GGCCTGATGG GCAGCTAACC ACTGCTCAGG CGGCAACATA TCCATGACCC GCTTTTCCAC 87 00 

CTCAAGTGGC GTCGCTCATT TTTTGACAAT ATCGTGGTGT TTGCAAATAC GCTCCACATG 8760 

AGTATCCACT GCAAAGGCTG GAATTCCAAA TCCTACACTC ATGACAACAT TGGCTGTCTT 8820 

GCCACCAACA CCTGCCAAAC TCTCCAATTC TTCACGTGTC TGAGGGACTT GACCATCAAA 8880 

ATCGTCTAGT AACTGTTGGG C AC AT TTTTT AAGGAATTTA GCTTTATTCC GATACAATCC 8940 

CAAGCGAGAA AT ATGTG AAG CAATCTCACT CTCTGTCCCT ACAGACATAG CTTGGGGTGT 9000 

TGGAAACGCA ACAAAGAGAC CTGGTGTGCC CTTATTTACC CCTGCATCTG TCGTCTGGGC 9060 

TCATAACATG ACCGCAACCA GGAGTTCAAA ATGATTCGTA AAATCAAGAC TAGGCTTGGC 9120 

ATCTGGGAAG AGGGCAATGA TTTCTTCTAG CACCTTTCGT GCTCGTTTTT TTGACAAGAC 9180 
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CATTATTCAT CTCCGTCAAA 7AGTCCTTGT AAGCCAGCAA AAGGACTGTT TTCTTCTTTC 92 40 

TTTACTCCT? TTTGACCTTG GTATTCTTCC TCTGTCATGA TTTGCCAGTC ATTTCCTGAG 9300 

ATAAATCCTT GACCAGCTTC TTCTTCACCC GTCAAGACCT TGATAGGAAT GTTTAGCAGC 93 60 

ATATTGTCTG ATACACTCTC AGCAAGGTCA AGCTCCCCAT TTTCGATGGG CAAGACCAAG 94 20 

TCATCATCTA AAACTTCTTG ATCTAGCTGG TTAGTTGCGC CTTCCATGAA AACTTCCGTG 9480 

ACTGGATAAG ATTCAACTAA CTCAACTGGC TCCATACTGC GACTCGACGC AAGAACAATG 9540 

GTATAAGATA CTTGATAATC TAAGAAATAC ATACGGTCTT CATATTGTAC TTTCCCAACT 9600 

GCAAGGATAT CTTTTACATC TAAAATTTCT TGATTACGTG CACGCAGGTC ATCAACTAAA 9660 

TCTAACGTTT GTTCAAAGTT CAAACCTTCA GACTGCTTAC GAATTTCTTG AATATTTAAT 9720 

TTCATACTTC CTCCATAAAG ATTTACTCTC TTGATTATAC CATGAAAAGG C7ACAAATCA 9780 

GCACACCAAA CTTTGTAATT AAAATTCAAA ATTTTAACAT ATTTACTATG ATAGTTTTAT 9840 

TTTTTAGTCC TATACTATAG GGAAAGAGTA CATCAGATCA AGGAGGATGC TCACATGGAA 9900 

GACAAGAAAC TCATTCAACT CCTATCCAAG TTAAATAAAA GCTACCAAAA CTGTAAACAG 9960 

GGTACGGCAG ATGATATTCG ACTACAAGAG CTGCTAAACA CTACTATGCA AGAGCTCAAA 10020 

AAAACGGAAC AGTTGAACAA CAGTATCTTA ATTGATCTTG AGAAATTTTA CCAACCTACC 10080 

AGTCTTCTGA TTGGACTGGG TAGCCTAAAA CTAAACGATC AAGCACGCAC TGCTTGCCGA 10140 

AACTATGATA AATTCCATTA CGATCATGTC AAACACGTAC TAAGTCTCTA TGGACCTGTT 10200 

TTTGAATTTT AGAGCATAGA ATTTCCACTT TTCTGTTGAC AAAATTTCCT TAAAGGTATA 10260 

ATATAAAGAT ACTAATACTC GGAGGTAAGG GAGACATGAA CAACTAAGTC TATCAAATAA 10320 

AGAACCTTTA TTTAGTAGAT CTTGTTTTTG TCTCTTTTTG TGTGCTCTTT TATGCTCTTT 10 3 80 

TTCTGGCATG TTAATAGAGT TTTTTTGACA TAGACTTTCC GCTCTACTAC GTAAACTAGA 10440 

GCTTT7TG7T ATGCACTATG AACATTCTAG AAAGGGAAAT CATATGATAA AAATCAATCA 10500 

TCTAACCATC ACACAAAACA AAGATTTACG AGATCTTGTA TCTGACCTAA CCATGACCAT 10560 

CCAAGACGGG GAAAAGGTTG CTATTATTGG TGAAGAAGGA AATGGCAAAT CAACCTTACT 10620 

TAAAATTTTA ATGGGGGAAG CTTTGTCTGA TTTCACTATC AAGGGAAACA TCCAATCTGA 10680 

CTATCAGTCA CTGGCCTACA TTCCTCAAAA AGTCCCTGAG GACCTAAAAA AGAAAACTTT 10740 

ACACGACTAC TTCTTTTTAG ATTCTATTGA TTTAGACTAC AGTATCCTCT A7CCTTTCGC 10800 

GGAGGAATTG CATTTTGATA GCAATCGTTT CGCAAGTGAC CAAGAGATTG GCAATCTATC 10360 

AGGGGGCGAA GCTTTGAAAA TTCAGCTTAT CCATGAGTTA GCCAAACCCT TTGAGATTCT 10920 
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ATTTTTAGAT GAACCTTCAA ATGACCTAGA CCTTGAGACA GTTGATTCGC TAAAAGGCCA 10980 

GATTCAAAAG ACCAGGCAAA CCGTTATTTT CATTTCCCAT CATGAAGACT TTCTTTCTGA 11040 

AACGGCAGAC ACTATTGTTC ACTTGCGACT GGTCAAACAC CGTAAAGAAG CGGAAACGCT 11100 

ACT AG AG CAT TTAGACTATG ATAGCTATAG TGAGCAGAGA AAGGCTAATT TTGCCAAACA 11160 

AAGTCAGCAA GCTGCTAACA ACCAAAGAGC CTACGATAAA ACCATGGAAA AACATCGGAG 11220 

AGTTAAGCAA AATGTAGAAA CTGCGCTTCG AGCTACCAAA GATAGTACTG CCGGTCGCCT 11280 

ATTGGCTAAA AAGATGAAAA CTGTCCTCTC ACAAGAAAAA CGCTACGAAA AGGCAGCTCA 11340 

GTCCATGACT CAAAAGCCAC TTGAAGAGGA ACAAATCCAA CTTTTCTTTT CAGACATCCA 11400 

ACCATTACCA GCTTCTAAAG TCTTAGTCCA ACTGGAAAAA GAAAATTTGT CCATTGACGA 11460 

CCGAGTTTTG GTTCAAAAAC TACAACTAAC TGTCCGTGGC CAAGAAAAAA TCGGTATTAT 11520 

CGGGCCAAAT GGTGTTGGGA AATCAACTCT GTTAGCCAAG TTACAGACAC TTCTGAATGA 11S80 

TAAAAGAGAG ATTTCACTTG GTTTTATGCC ACAAGATTAC CACAAAAAAC TGCAATTGGA 11640 

TTTATCCCCA ATAGCCTATC TCAGTAAAAC TGGGGAAAAA GAGGAACTAC AGAAAATCCA 117 00 

ATCTCACCTA GCTAGTCTCA ATTTCAGTTA TCCAGAAATG CAGCATCAAA TTCGCTCCTT 11760 

ATCTGGCGGA CAACAGGGAA AACTCCTGCT TTTGGATTTA CTCCTGCGCA AACCAAACTT 11820 

TCTCCTGCTG GATGAACCCA CACGAAACTT TTCTCCCACT TCTCAACCCC AAATCAGAAA 11880 

ACTCTTTGCT ACCTATCCAG GCGGTCTCAT CACTGTTTCG CATGACCCTC GTTTCTTAAA 119<0 

AGAAGTCTGC TCGATCATC7 ATCGCATGAC AGAACACGGT TTGAAGCTAG TTAATTTAGA 12000 

AGATTTATAA ATTTGCAACA TAGCAAAAAT CCAGAGACGA CCTCTGGATT CTTTTACATC 12060 

TGTTTTAAAC GTTCAATCCG TTCTCAGATA GGTGGGTGGG TATAAAAGAG TTTTTGGAAC 12120 

CCCCCACCTT TCTTAGGATC ATTGATATAA AGGGCACTGC TAGCATCATC GACGTGGCGA 12180 

CTCATAGGTT TGCTATTGTC CAACTTATCT AGGGCATTAA TCATTCCCTG GGGATTGCGA 12240 

GTCAGCTCGA CACTAGATGC ATCTGCCAGA AATTCCCTCT GACGAGAAAT AGCGAGCTGA 12300 

ACCAAGGTTG CAGCGAGAGG TGCCAGTACA ATAGCTAGTA GGGAAACCAC TAGCATAATG 12360 

ATTTCAACAC CATTTCCATC TCGGTCATCA TCACTTCGTC TGCGACCTGC TCCACCCCAC 12420 

C AC AT CAT AC GACCTCCCAT ACTAGAAAGC ATGGTGATAG CACTAGCAAG GGCAACTGCA 12480 

ATAGTCGAAA TACGGATATC ATAATTACGA ATATGACTGA C7TCATGTCC CATAACAGCT 12540 

TCTAGTTCTT CACGATTCAT GATAGCTAGT AGACCTGAAG TCGCAGCAAC AGCCGCATTT 12 600 

TGAGGATTAG AACCTGTCGC AAAGGCATTT AAGGCTGGAT CATCAATGAT GAAAACACGG 12 660 

GGCATAGGAA TCTGAGCGAC CAGAGCCATA TCTTCCACTA CATGGTAGAG GTCTGGTGCC 12720 
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CTATACTATC CTAAGCAAAG GTTTTTAATC TCATCCCGTG AGGTGACGAA GACGCAGAAA 14 520 

TATTTAAAAC TCTTTAAAAT CTAAATTTTA AAGAAGTCTT ACTCTGAGGG CCTATTGCTG 14 580 

TAAAATAATC GGCTCTTTTT TGATGCCCAA AAGTGAGGTT TATATGAAAC AAGAATCAAC 14640 

TGTTGATTTG TTAC 14 654 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6405 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

AGAAAAATCT GCTTTACACA AAATAAAAAT AATAGGAGAA AATCTATGTC AGATTTGAAA 60 

AAATACGAAG GTCTCATTCC AGCCTTCTAC GCATGTTATG ATGATCAAGG AGAAGTAAGC 120 

CC AG AACGT A CGCCTGCCTT GGTTCAATAC TTCATTGATA AAGGTGTTCA AGGTCTTTAT 180 

GTCAATGGTT CTTCTGGTGA ATGTATCTAC CAAAGCGTTG AAGATCGCAA GTTGATTTTG 240 

GAAGAAGTCA TGGCGGTAGC AAAGGTAAAT TGACCATTAT TGCCCATGTT GCTTGCAATA 300 

ATACTAAAGA TAGTATGGAA CTTGCTCCCC ATGCTGAAAG CTTGGGAGTA GATGCTATTG 3 60 

CAACGATTCC ACCAATTTAT TTCCGCTTGC CAGAATACTC ACTTGCCAAA TACTGGAACG 4 20 

ATATCAGTTC TGCAGCTCCA AACACAGACT ACCTGATTTA CAACATTCCT CAATTGGCAG 480 

GGGTTGCTTT GACTCCAAGC CTTTACACAG AAATGTTGAA AAATCCTCGT GTTATCGGTG 540 

TGAAGAACTC TTCTATGCCA GTTCAAGATA TCCAAACCTT TGTCAGCCTT GGTGGAGAAG 600 

ACCATATCGT CTTTAATGGT CCTGATGAGC AGTTCCTAGG AGGACCCCTC ATGGGGGCTA 660 

GGGCTGGTAT CGGTGGTACT TATGGTGCTA TGCCAGAACT CTTCTTGAAA CTCAATCAGT 720 

TGATTGCGGA TAACGACCTA GAAACAGCGC GTGAATTGCA GTATGCTATC AACGCAATCA 7 SO 

TTGGTAAACT CACTTCTGCT CATGGAAATA TGTACGGTCT CATCAAAGAA GTCTTGAAAA 840 

TCAATGAAGG CTTGAATATT GGATCTGTTC GTTCACCATT GACACCAGTG ACTGAAGAAG 900 

ATCGTCCAGT TGTAGAAGCG GCTGCTGCCT TGATTCGTGA AACCAAGGAG CGCTTCCTCT 960 

AATCTAAAAG GAGGTATTTA TGACATATTA CGTTGCAATT GATATCGGTG GAACCAACAT 1020 

CAAGTATGGT TTGGTTGATC AAGAGGGGCA ACTTCTTGAA TCGCATGAAA TGCCAACTGA 1080 

GGCGCATAAG GGTGGACCTC ATATCTTACA AAAGACCAAA GATATCCTAG CTAGTTATTT 1140 

AGAAAAAGGC CCAGTAGCAG GTGTTGCCAT ATCTTCTGCT GGGATGGTGG ATCCGGATAA 1200 
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GGGTGAGATT TTCTATCCTG GGCCGCAAAT CCCTAACTAC GCAGGCACCC AGTTCAAAAA 12SO 

GGAAATCGAA GAAAGCTTTA CTATTCCTTG 7GAGATTGAA AATGATGTCA ACTGTGCAGG 13 20 

TCTTGC7GAG GCAGTATCTG GTTCAGGCAA GGGAGCAAGT GTGACACTTT GCTTGACCAT 13 80 

TGGAACCGGT ATCGGTGGTT GCTTGATTAT GGATAGGAAA GTCTTCCATG GTTTTAGCAA 14 40 

TTCAGCCTGT GAAGTCGGGT ATATGCATAT GCAGGATGGA GCTTTTCAAG ACTTGGCTTC 1500 

TACAACAGCT TTAGTGAAAT ATGTAGCTGA ACCCCATGGA GAAGATGTTG ATCAGTGGAA 15 60 

TGGCCGTAGA ATTTTCAAAG AAGCCACTGA AGGAAACAAA ATCTGCATGG AAGGTATTGA 16 20 

CCGTATGGTT GACTATCTAG GAAAACCTCT GGCAAATATT TGCTACGTTG CCAATCCACA 16 80 

ACTGCTTATT CTTGGTGG7G GTATCATGGG GCAAGAGGCT ATCCTCAAAC CTAAGATCCG 17 40 

TACAGCCTTG AAAGACCCTT TGGTACCAAG TTTAGCACAA AAAACACGAT TAGAATTTGC 1800 

CCATCACCAA AATACACCAG GGATGTTGGG TGCATATTAT CATTTTAAGA CAAAACAATC 18 60 

CTAGTTTGGC TCAGCCAAAC TAGGATTTTC TTACACGTTT TTCTCTACGA TAGCCGTTGA 19 20 

GTTTTTTATT TTCCCAGTAG CTATTAAAGA TTTTTTCCTT GCTTTCGCGA TTGATTTCCA 1980 

AAAAGTACGC ATAAATCAAA TCGATAAAGA AGAGCATAGG AAGTTGAGCG GATATTCGTT 2040 

GGATATAGGA GGGTTGGCTG TGGGTGGCTA CAAGAACAGT CTCTGTATAG GTCTGGCTAT 2100 

CTTTATTGGC AACACTTGTA AAGAGTACAG TCTTTGCCCC CATCTCCTTA GCATCTAATA 2160 

GACTATCTAA AATAGAAGGA GTTGAGCCTG AAAGTGAGAA GCCCAGTACT AGACAATTTT 2220 

CATCCATGAT GCTGGTTGTC CAGGCAAAGC CCTCTTGGTC TGTCAAAGCT TCGCAGACCA 2280 

CACC7AGTCG CATAAAACGT AATTTCATTT CACGGGCGAC GAGGCCACAA CTCCCTCTTC 2 34 0 

CAAACAAGTA GATACGCTCA GCATCTTCGA TTAGCTGGGC AATTCGTTCT ACTTCGATTT 2400 

CGTCAATCAA GTCTTGTGTT TGTTCCCTCA 7ATTGCTATA ACTTCTCACG ACTCGTTTGG 2 4 60 

TCAGTGGACT GTGCTTGGAG ACTTGGTTGG CTTGATTTTC TGCCTGATGT TGGTATTGGA 2 5 20 

AAATAAATTC TCGGTAGCCA GTAAAGCCAC ACTTTTTAGC AAAGCGGGTC AAAGCAGCTT 2 5 80 

GAGAAATATG TAATTTTTGG GTGACTTGTT GAGAAGATAA ATCATC7GTA ATCGTTTCAG 2 640 

CTTGCAAAAA ATAGCGAGCG ATTTCTTGTT CTAGGTCTGT CATTTCTTCA AAATGTGAAT 2 700 

CAATGATAGT TGCGATATCT GGTTTGTCCA TAGGGAAAGC TCCTTTACAT GAGTCATACT 27 60 

GGAAGACTAG ATCAGAGAAT AGTCACACTT CATTATAACA CATAATATAA GGATAGATAA 2820 

ATAAAAACGC ATCTCTGTTT TAAAAACGAA AAAATCGAAA AACCTTCTCT CTTTTCCATA 2880 

ATTTTCTAC7 CAAATTGTGG TACAATTAAG ACTAACATTT TAAGTTAGAA ATGAGACTGA 294 0 
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TTTGTATGAG 


AAAATTTAAC 


AGCCATTCGA 


TTCCGATTCG 


GCTTAATTTA 


TTGTTTTCAA 


3COO 


TCCTCATTTT 


ACTCTTTATG 


ACCATTATTG 


GTCGTTTGTT 


GTATATGCAG 


GTTTTGAACA 


3060 


AGGATTTTTA 


CGAAAAAAAG 


CTAGCTTCAG 


CTAGTCAGAC 


CAACATTACA 


AGCAGTTCAG 


3120 


CCCGTGGGGA 


AATTTATGAT 


GCTAGTGGAA 


AACCTTTGGT 


AGAAAATACG 


TTAAAGCAGG 


3180 


TTGTTTCCTT 


TACGCGTAGC 


AATAAAATGA 


CGGCTACAGA 


CTTAAAAGAA 


ACAGCTAAAA 


3240 


AGTTACTGAC 


TTATGTGAGC 


ATCAGTTCTC 


CAAATTTGAC 


AGAACGCCAG 


CTGGCGGATT 


3300 


ACTATTTGGC 


TGATCCTGAA 


ATCTATAAAA 


AAATAGTGGA 


AGCTCTCCCA 


AGTGAGAAAC 


3360 


GCTTGGATTC 


AGA7GCCAAT 


CGTCTATCCG 


AATCAGAACT 


GTATAACAAT 


GCGGTCGATA 


3420 


GTCTACAAAC 


GAGTCAACTA 


AACTATACAG 


AGGATGAAAA 


GAAAGAAATC 


TATCTTTTTA 


3480 


GTCAGTTAAA 


TGCTGTTGGA 


AACTTTGCGA 


CAGGAACCAT 


TGCGACAGAT 


CCTCTAAATG 


3540 


ATTCTCAGGT 


GGCTGTTATT 


GCCTCTATTT 


CAAAGGAGAT 


CCCTGCCATT 


AGTATTTCTA 


3600 


CTTCTTGGGA 


TAGAAAGGTT 


TTGGAAACTT 


CCCTTTCTTC 


TATAGTTCCC 


AGTGTATCCA 


3660 


GTGAAAAAGC 


TGGTCTCCCA 


GCGGAAGAAG 


CAGAAGCCTA 


TCTTAAAAAA 


GGCTATTCTC 


3720 


TAAATGACCG 


TGTAGGAACC 


TCCTATTTGG 


AAAAGCAATA 


TGAAGAGACC 


TTACAAGGAA 


3780 


AACGCTCGGT 


AAAAGAAATC 


CATCTGGATA 


AATATGGCAA 


TATGGAAAGC 


GTGGATACAA 


3840 


TTGAGGAAGG 


TAGTAAGGGA 


AACAATATCA 


AACTGACCAT 


TGATTTGGCT 


TTCCAAGATA 


3900 


GCGTGGATGC 


TTTACTGAAA 


AGTTATTTCA 


ATTCTGAGCT 


AGAAAATGGT 


GGAGCCAACT 


3960 


ATTCTGAAGG 


TGTCTATGCA 


GTCGCCCTTA 


ACCCAAAAAC 


AGGTGCGGTT 


TTGTCTATGT 


4020 


CAGGGATTAA 


ACATGACTTG 


AAAACGGGAG 


AGTTCACGCC 


TGATTCCTTG 


GGAACGGTAA 


4080 


CCAATCTCTT 


TGTTCCAGGT 


TCGGTTGTCA 


AGGCGGCGAC 


CATCAGCTCA 


CGTTGGGAAA 


4140 


ATGGAGTCTT 


GTCAGGAAAC 


CAGACCTTGA 


CAGACCAGTC 


CATTGTCTTC 


CAAGGTTCAG 


4200 


C7CCCATCAA 


TTCTTGGTAT 


ACTCAGGCTT 


ACGGTTCATT 


CCCTATCACA 


GCGGTCCAAG 


4260 


CTCTCGAGTA 


TTCATCAAAT 


ACCTATATGG 


TCCAAACACC 


CTTAGGTCTT 


ATGGGGCAAA 


4320 


CCTATCAACC 


CAATATGTTT 


GTCGGCACCA 


GCAATCTAGA 


GTCTGCTATG 


CAGAAACTGC 


4380 


GTTCAACCTT 


TGGCGAATAT 


GGCTTGGGTA 


CTGCGACAGG 


AATTGACCTA 


CCAGATGAAT 


4440 


C7ACTGGATT 


TGT7CCCAAA 


CAGTATACCT 


TTGCTAATTA 


CATTACTAAT 


GCCTTTGGGC 


4500 


AGTTTGATAA 


CTATACGCCG 


ATGCAGTTGG 


CTCAGTATG7 


AGCAACTATT 


GCAAATAATG 


4560 


GTGTTCGTGT 


GGCTCCTCGT 


ATTGTTGAAG 


CCATTTATGG 


TAATAATGAT 


AAGGGAGGAC 


4620 


TGGGTGACTT 


GATTCAGCAA 


CTGCAACCGA 


CAGAGATGAA 


TAAGGTCAAT 


ATATCCGACT 


4680 


CCGA7ATGAC 


CATCTTGCAC 


CAAGGTTTTT 


ATCAGGTTGC 


CCATGCTACT 


AGTGGATTGA 


4740 
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CAACTGGACG TGCCTTT7CA AATGGTGCCT TGGTATCCAT TAGCGGAAAA AC AG GT AC AG 4 800 

CCGAAAGCTA TGTGGCAGAT GGTCAGCAAG CAACCAATAC CAATGCGGTG GCCTATCCCC 48 60 

CATCTGATAA TCCCCAAATC CCTCTCCCAC TGCTCTTTCC TCATAATACC AATCTAACAA 4 920 

ATGGTGTAGG ACCTTCCATT GCGCGTGACA TTATCAATCT GTATCAAAAA TACCATCCAA 4 980 

TGAATTAGAA AGGAAATTAT GCTTTATCCA ACACCTATTG CCAAGTTGAT TGACAGTTAT 504 0 

TCTAAGTTAC CAGGTATCGG GATTAAGACG GCTACGCGTC TGGCC7TTTA TACGATTGGG 5100 

ATGTCTGCTG ATGATGTCAA TGAATTTGCA AAAAATCTCC TTTCTGCTAA GAGAGAATTG 5160 

ACATATTGTT CTATTTGTGG ACGTTTCACA GACGACGATC CTTGTTCTAT CTGTACTGAT 5220 

CCCACTCGTC ACCAGACAAC AATTTTAGTT CTTGAGGATA GTAGAGATGT GGCAGCCATG 52 80 

GAAAATATCC AAGAATACCA TGCACTCTAT CATCTCCTTC ATGGCCTCAT TTCTCCTATG 5 3 40 

AATGGTATCA GTCCGGACGA TATCAATCTC AAGAGCCTTA TGACTCGTCT TATGCATAGT 54 00 

GAGGTTTCAG AAGTGATTGT GGCGACTAAT GCTACAGCGG ATGGTGAAGC GACTTCCATG 54 60 

TATCTTTCAC GTTTGCTCAA GCCGGCTGGT ATCAAGGTTA CGCCTCTAGC ACGACGTCTC 55 20 

GCTGTGGGAG CGCACA7TCA GTATGCGGAC GAAGTGACAC TCTTACGAGC CATTGAAAAT 55 80 

CGGACAGAGT TGTAAGTGTA GGCAAATTTA CGAACTCCAT TCATTTATAA AAAATCAAAG 5640 

AGGCTGAAAA TCGTTCCTAT CGGCCTCTTT TTGTATAGTG TGATGACTAG GCTCAGGTTC 57 00 

AAGTTTTAAA AAACCAAGCA AATATGATAT ACTAAAGAGC CAGTATTCTA GTAGAATTAG 57 60 

GACAAATAAT ATGAAACAAA CGATTATTCT TTTATATGGT GGACGGAGTG CGGAACGCGA 5 82C 

AGTCTCTGTC CTTTCAGCTG AGAGTGTCAT GCCTGCGGTC GATTACGACC GTTTCACAGT 58 80 

CAAGACTTTC TTTATCAGTC AGTCAGGTGA CTTTATCAAA ACACACCAA? TTAGTCATGC 5940 

TCCGGGGCAA GAAGACCGTC TCATGACCAA TGAAACCATT GATTGGGATA AGAAAGTTGC 6000 

ACCAAGTGCT ATCTACGAAG AAGGTGCAGT GGTCTTTCCA GTCCTTCACG GGCCAATGGG 6060 

AGAAGATGGC TCTGTTCAAG GATTCTTGGA AGTTTTGAAA ATGCCTTACG TTGGTTGCAA 6120 

CATTTTGTCA TCAAGTCTTC CCATGGATAA AATCACGACT AAGCCTGTTC TGGAATCTGC 6180 

TGGTATTGCC CAAGTTCCTT ATGTGGCTAT CGTTGAAGGC GATGATGTGA CTCCTAAAAT 62 40 

CCCTGAAGTG GAAGAAAAAT TGGCTTATCC AGTCTTCACT AAGCCGTCAA ACATGGGGTC 6 300 

TAGTCTCCGT ATTTCTAAGT CTGAAAACCA AGAAGAACTC CGTCAAGCCT TAAAACTTGC 6 3 60 

CTTCCGATAT GACAGCCGTG TCTTGGTTCA GCAAGGAGTG AATGC 6405 
(2) INFORMATION FOR 3EQ ID NO: 108: 
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(i) SEQUENCE CHARACTERISTICS : 

(A» LENGTH: U309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

CCAGCTCCGG TACCGGGATT TTAAGGAGTT TGATATGTAT AACCTATTAT TAACCATTTT 60 

ATTAGTATTA TCTGTTGTGA TTGTGATTGC AATTTTCATG CAACCAACCA AAAACCAATC 120 

CAGCAATGTA TTTGATCCCA GTTCAGGTGA TTTGTTTGAA CCCAGTAAAG CTCGCGGTTT 180 

TGAAGCTGTA ATGCAGCGTT TGACAGGGAT TTTAGTCTTT TTCTGGCTAG CCATTGCCTT 2 40 

AGCATTGACG GTATTATCAA GTAGATAAGA AAATAATGGG CAGGACTACG TCTTTGCCTC 300 

TTTTTATTTT TAAAGGATGT TTCAGAAGGT TTTACAGTAA AAGAAAATTA AAAAATCTAG 3 60 

AAAGAAAATA TGAAAGATAG AATAAAAGAA TATTTACAAG ACAAGGGAAA GGTGACTGTT 4 20 

AATGATTTGG CTCAGCCTTT GGGAAAAGAC AGTTCCAAGG ATTTTCGTGA GTTCATTAAA 480 

ACCTTGTCCT TAATCGAAAG AAAGCACCAA ATTCCTTTTG AAGAAGATGG TAGTCTGACA 540 

TTAGAAATTA AGAAAAAACA TGAGATTACC CTCAAGGGGA TTTTTCATGC CCATAAAAAT 600 

GGCTTTGGCT TTGTTAGTCT GGAAGGCGAG GAGGACGACC TTTTTGTAGG GAAAAATGAT 660 

GTCAACTATG CTATTGATGG TGATACCGTC GAGGTAGTGA TTAAGAAAGT CGCTGACCGC 720 

AATAAGGGAA CAGCAGCAGA AGCCAAAATT ATTGATATCC T AG AAC AC AG TTTGACAACA 780 

GTTGTCGGGC AAATCGTTCT GGATCAGGAA AAACCTAACT ATGCTGGCTA TATTCGTTCA 840 

AAAAATCAGA AAATCACTCA ACCCATTTAT GTTAAGAAAC CAGCCCTAAA ATTAGAAGGA 900 

ACAGAAGTTC TCAAAGTCTT TATCGATAAA TACCCAACCA AGAAACATGA TTTCTTTGTC 960 

GCGAGTGTTC TCGATGTAGT GGGACACTCA ACGGATGTCG GAATTGATGT TCTTGAGGTC 1020 

TTGCAATCAA TGGACATTGT ATCCGAGTTT CCAGAAGCTC TTGTTAACGA ACCACAAACT 1080 

GTGCCTGATG CTCCGTCTCA AAAGGATATG GAAGGTCGTC TGGATCTAAG AGATGAAATT 1140 

ACCTTTACCA TTGACGGTGC GGATGCCAAG GACTTGGACG ATCCAGTGCA TATCAAGGCT 12 00 

CTGAAAAATG GCAATCTGGA GTTTGGGGTT CACATCGCAG ATGTTTCTTA TTATGTGACC 12 60 

GAGCGGTCTG CCCTTGACAA GGAAGCCCTT AACCGTGCGA CTTCTGTTTA CGTGACAGAC 13 20 

CG AGTGGTGC CAATGCTTCC AGAACGACTA TCAAATGGCA TCTGCTCTCT CAATCCCCAA 1380 

GTTGACCGCC TGACCCAGTC TGCTATTATG GAGATTGATA AACATGGTCG TGTGGTCAAC 1440 

TATACCATTA CACAAACAGT TATCAAGACC AGTTTTCGTA TCACCTATAG CGATGTCAAT 1500 
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CATATCCTAG CTGGCGATGA AGAAAAGAGA AAAGAATATC ATAAAATTGT ATCAAGTATC 1560 

GAACTCATGG CCAAGCTTCA TGAAACTTTA GAAAACATGC GTGTGAAACG TGGAGCTCTC 1620 

AATTTTGATA CCAATGAAGC GAAGATTTTA GTGGATAAAC AAGGTAAGCC TGTTGATATC 1680 

GTTCTTCGGC AGCGTGGTAT TGCCGAGCGG ATGATTGAGT CTTTTATCTT GATGGCTAAT 1740 

GAAACAGTTG CCGAACATTT CAGCAAGTTG GATTTGCCTT TTATCTATCG AATTCACGAG 1800 

GAGCCTAAGG CTGAAAAGGT TCAGAAGTTT ATTGATTATG CTTCGAGTTT TGGCTTGCGC 1860 

ATTTATGGAA CTGCCAGTGA GATTAGTCAG GAGGCACTTC AAGACATCAT GCGTGCTGTT 19 20 

GAGGGAGAAC CTTATGCAGA TGTATTGTCC ATGATGCTTC TTCGCTCTAT GCAGCAGGCT 1980 

CCTTATTCGG AGCACAATCA CGGCCACTAT GGACTAGCTG CTGACTATTA TACTCACTTT 2040 

ACCAGTCCAA TTCGTCCTTA TCCAGACCTT CTTGTTCACC GTATGATTCG GGATTACGGC 2100 

CGTTCTAAGG AAATAGCAGA GCATTTTGAA CAAGTGATTC CAGACATTGC GACCCAGTCT 2160 

TCCAACCGTG AACGTCGTGC CATAGAAGCT GAGCGTGAAG TCGAAGCCAT GAAAAAGCCT 2 220 

GAGTATATGG AACAATACGT GGGTGAAGAG TATGATGCAG TTGTATCAAG TATTGTCAAA 2280 

TTCGCTCTCT TTGTCGAATT GCCAAACACA GTTGAAGCCT TGATTCACAT CACTAATCTG 2 340 

CCTGAATTTT ATCATTTCAA TGAGCGTGAT TTGACTCTTC GTGGAGAAAA ATCAGGTATC 2400 

ACTTTCCGAG TGGGTCAGCA GATCCGTATC CGTGTTGAAA GAGCGGATAA AATGACTGGA 2 460 

GACATTCATT TTTCATTCGT ACCTAGTCAG TTTGATGTGA TTGAAAAAGG CTTGAAACAG 2 520 

TCTAGTCGTA GTGGCAGAGG GCGTGATTCA AATCCTCGTT CGGATAAGAA GGAAGACAAG 2580 

AGAAAATCAG GACGCTCAAA TGATAAGCGT AAGCATTCAC AAAAAGACAA CAAGAAAAAA 2 640 

GGAAAGAAAC CTTTTTACAA GGAAGTAGCT AAGAAAGGAG CCAAGCATGG CAAAGGGCGA 2700 

GGGAAAGGTC CTCCCACAAA ATAAAAAGGC ACGCCACGAC TATACAATCG TAGATACGC7 2 760 

AGAGGCAGGG A7CGTCCTGA CTGGAACTGA AATCAAGAGT CTACGACCTC CTCGAATTAA 2 820 

TCTCAAGGAT GGCTTTGCTC AAGTGAAAAA TGGAGAAGTT TGGCTGAGCA ATGTTCATAT 2 880 

CGCGCCTTAC GAAGAGGGCA ATATCTGGAA CCAGGAACCA GAACGTCGTC GTAAACTCCT 2 94 0 

GCTCCATAAA AAGCAAATTC AAAAATTGGA ACAAGAGATC AAAGGGACAG GAATCACCT? 3 000 

AGTTCCCCTT AAGGTCTATA TAAAAGATGG CTACGCTAAG CTTCTTTTAG GACT7GCCAA 3 060 

AGGGAAGCAT CACTATGACA AACGGGAGTC TATCAAACGT CGTGAGCAAA ATCGAGATAT 3120 

CGCGCGTGTC ATGAAAGCTG TTAATCAGCG ATAAAAAGAG GAATTGAAAA TGGAAAAATT 3130 

AGTTGCCTAT AAACGCATGC CTTTGTGGAA TAAACAAACA ATGCCTGAAG CTGTTCACCA 3 240 
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AAAGCACAAT ACAAAACTTG GGACTTGGGG CAAAATTACT GTCTTGAAGG GAGCTCTCAA 3 300 

GTTTATTGAA TTGACAGAAG AAGGGGAAGT TCTAGCTGAA CACCTCTTTG AAGCAGGGGC 3 360 

ACACAATCCA ATGGCCCAAC CTCAAGCCTG GCACCGAGTG GAAGCTGCCA CAGATGATGT 3 420 

GGAATGCTAC TTGGAATTTT ATTGTAAACC TGAGGATTAT TTTGCTAAAA AATACAATAC 3480 

CAATCCTGTT CATTCAGAGG TCCTAGAGGC CATGCAGACA GTGAAACAAG GGAAAGCTTT 3 540 

CCATTTGCGT TGTGGTCAGG GGCGTAATTC TCTTTTTCTA GCCCAGCAAG ATTTTGATGT 3 600 

CACGGCTCTA GATCAAAATG GACTAGCTCT TGAAATCTTG CAAAGCATTG TGGAGCAGGA 3660 

AGATTTGGAC ATGCCTGTTG GCCTTTACGA TATCAATTCA GCTAGCATTG AACAAGAATA 3720 

TCATTTTATC GTTTCAACAG TTGTTCTCAT GTTTCTACAA GCGGACCGCA TTCCAGCTAT 3 780 

TATTCAAAAT ATGCAGGAGA AAACCAGTGT TGGTGGTTAC AACCTTATCG TTTGTGCCAT 3840 

GGACACGGAG GATTATCCTT GCTCGGTTAA CTTCCCATTC ACCTTTAAAG AAGGAGAACT 3 900 

GGCAGACTAT TACAAGGATT GGGAATTGGT TAAGTACAAT GAAAATCCAG GCCATTTGCA 3960 

CCGTCGCGAT GAGAATGGCA ATCCTATTCA ACTACGCTTT GCGACCTTAC TAGCTAAGAA 4020 

AATCAAGTAA ACACACATGA AGATTAGGAA TTTTCCTCAT CTTTTTTCTT TTTTACGAAT 4080 

GATATAGAAA AGGAGGGAAT TCATGTTTGT TGCGAGAGAT GCTAGGGGAG AATTGGTAAA 4140 

TGTCTTAGAG GATAAACTTG AGAAGCAAGC ATACACCTGC CCAGCTTGTG GAGGCCAGCT 4200 

CCATTTGCGT CAAGGACCAA GTGTACGCAC GCATTTTGCC CATAAATCCT TAAAAGACTG 4260 

TGATTTTTTC TTTGAAAATG AAAGTCCAGA ACACCTGGCC AATAAGGAAT CCCTCTATCA 4320 

CTGGTTGAAA AAAGAGACAA AGGTTCAATT AGAGTACCCG CTTTCAGAAC TT AAAC AG AT 43 80 

TGCGGATGTA TTTGTAAATG GCAATCTAGC TCTAGAAGTT CAGTGTAGTC CCTTGCCTCA 4440 

GAAAGTCCTT AAAGAGCGAA GTGAGGGCTA TCGTAGTCAG GGTTACCAAG TACTCTGGTT 4 500 

GCTGGGTCAA AAACTGTGGC TCAAGGAGCG TTTGACTCGT CTACAGCAAG G TTTTCTTTA 4560 

TTTCAGTCAA AACATGGGCT TTTATGTTTG GGAATTAGAC AAGGAAAAAC AAGTTTTAAG 4 620 

ACTCAAATAC CTGATTTACC AGGATCTCCG CGGTAAACTC CATTATCAAA TCAAGGAATT 4 680 

TTCCTATGGT CAAGCTAGTT TATTGGAAAT ATTGCGTCTT CCCTATAAGA GACAAAAAAT 4740 

ATCTCATTTT ACAGTTTCTG AGGACAAGGA CATCTGTCGC TATATCCGGC AACAACTTTA 4800 

TTATCAAAAT CTCTTTTGGA TGAAAGAACA AGCAGAAGCC TATCAAAAGG GAGAAAATAT 4 860 

CCTGACTTAT GGACTGAAAG AATGGTATCC ACAAATTCGA CCAATAGTGG GCAAATTTTT 4 920 

CCAGATTGAA CAAGACTTGA CTAGCTATTA TCAGCACTTT TATACCTATT ACCAAAAAAA 4980 

TCCTCAAAAT GATTGGCAAA AGCTTTATCC ACCAGCCTTT TATCAGCAAT ATTTCTTGAA 5040 
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AAATATGGTA GAATACAAAG GATGGAGGAA TCTAATGGTA TTACAAAGAA ATGAAATAAA 5100 

TGAAAAAGAT ACATGGGATC TATCAACGAT CTACCCAACT CACCACGCTT GGGAAGAAGC 5160 

CTTAAAAGAT TTAACAGAAC AATTGGAGAC ACTAGCCCAG TATGAAGGCC ATCTCTTGGA 5220 

TAGTGCGGAT AACCTACTAG AAATCACTGA ATTTTCTCTT GAAATGGAAC GCCAGATAGA 52 80 

GAAGCTTTAC GCTTATGCTC ATATGAAGAA TGACCAGGAT ACACGTGAAG C7AAGTATCA 5 340 

AGAGTACTAT GCCAAGGCCA TGACACTCTA CACCCAGTTA GACCAAGCCT TTTCATTCTA 54 00 

TGAGCCTGAA TTTATGGAGA TTAGCGAAAA GCAGTATGCT CACTTTTTAG AAGCTCAACC 54 6C 

AAAGCTGCAG GTTTATCAAC ACTATTTTCA CAAGCTTTTG CAAGGCAAGG ATCACCTTC7 ' 5 520 

TTCACAACGT GAAGAAGAAT TATTGGCTGG AGCTGGAGAA ATCTTTGGTT CAGCAAGTGA 5580 

AACCTTCGCT ATCTTGGACA ATGCGGATAT TGTGTTCCCT TATGTCCTAG ACGATGATGG 564 0 

TAAAGAAGTT CACCTATCTC ATGGGACTTA CACACGTTTG ATGGAGTCTA AAAAACGTGA 57 00 

GGTTCGCCGT GGTGCCTATC AAGCTCTTTA TGCGACTTAC GAACAATTCC AACACACCTA 57 60 

TGCCAAAACC TTGCAAACCA ATGTTAAGGT GCAAAATTAC CGTGCTAAAG TTCCTAACTA 5820 

CAAGACTCCT CGTCATGCAG CCCTCGCAGC GAATTTTGTT CCACAAAGTG TTTATGACAA 5880 

TTTGGTAGCA GCAGTTCGCA AGCATTTGCC ACTCTTACAT CGCTATCTTG AGCTTCGTTC 5940 

AAAAATCTTG GGGATTTCAG ATCTCAAGAT GTACGATGTC TACACACCGC TTTCATCTGT 6000 

TGAATACAGT TTTACCTACC AAGAAGCCTT GAAAAAAGCA GAAGATCCTT TGGCACTCTT 6060 

GGGTGAGGAT TACTTGAGCC GTCTTAAACC TGCCTTCAGC GAGCGTTGGA TTCATGTT7A 6120 

CGAAAATCAA GGCAAGCGTT CAGGTGCCTA CTCTCGTGGT TCTTATGATA CCAATGCCTT 6180 

TATGCTTCTC AACTGGCAAG ACAATCTGGA CAATCTCTTT ACTCTTGTTC ATGAAACAGG 6240 

TCACAGTATG CATTCAACCT ATACTCGTGA AACTCAGCCT TATGTTTACG GGGATTACTC 6 3 00 

TA7CTTTTTG GCTGAGATTG CCTCAACTAC CAATGAAAAT ATCTTGACGG AGAAATTATT 6360 

GGAAGAAGTG GAAGACGACG CAACACGCTT TGCTATTCTC AATAACTTCC TAGATGGTTT 64 20 

CCGTGGAACA CTTTTCCGCC AAACTCAATT TGCTGAGTTT GAACACGCCA TTCACCAAGC 6480 

AGATCAAAAT GGGGAGGTCT TGACAAGCGA TTTCCTAAAT AAACTCTACG CAGACTTGAA 654 0 

CCAAGAGTAT TATGGTTTGA GTAAGGAAGA CAATCCTGAA ATCCAATACG AGTGGGCTCG 6600 

CATTCCACAC TTCTACTATA ACTACTATGT ATATCAATAT TCAACTGCCT TTGCGGCCGC 6660 

CTCACCCTTG GCTGAAAAAA TTGTCCATGG TAGTCAAGAA GACCGTGACC GCTATATCGA 6720 

CTACCTCAAG GCAGGTAAGT CGGACTATCC ACTTAATGTC ATGAGAAAAG CTGGTCTTCA 67 80 
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TATGGAGAAG GAAGACTACC TCAACGATGC CTTTGCAGTC TTTGAACGCC GTTTAAATGA 6840 

GTTTGAAGCC CTTGTTGAAA AATTAGGATT GCCATAAAAT GGTTGAATCG TATAGTAAGA 6 900 

ATGCTAACCA TAACATGCGT CGTCCTCTCC TCAAAGAAGA AATTGTAGAC TTGATGCGTC 6960 

AGCGTCAAAA GCAGGTCACA GGTTTCTTGA AAGAATTGGA AGACTTTGCC CGCAAGGAAA 7020 

ATATTCCTAT TATTCCCCAT GAAACGGTTG CTTATTTC LG TTT T CTTATG GAAACCATGC 7080 

AGCCTAAAAA TATTCTGGAA ATTGGGACGG CTATCGGTTT TTCAGCTCTC TTGATGGCTG 7140 

AACATGCGCC AAATGCTAAG ATTACAACTA TTGATCGTAA TCCAGAAATG ATTGGTTTTG 7 200 

CCAAGGAAAA TTTTGCCCAG TTTGACAGTC GCAAGCAAAT CACTCTCCTA GAGGGAGATG 7260 

CGGTCGATGT CTTATCTACA CTGACAGAGT CTTATGATTT CGTC7TTATG GATTCTGCCA 7 320 

AGTCTAAATA CATCGTCTTT CTGCCAGAAA TCCTCAAACA TTTGGAAGTT GGTGGTGTGG 7 380 

TTGTCTTGGA TGATATTTTT CAAGGTGGTG ATGTTGCCAA GGATATTATG GAAGTCCGTC 7440 

GTGGTCAGCG AACCATTTAT CGAGGCCTTC AAAAATTATT TGATGCAACC TTAGACAATC 7500 

CAGAACTCAC CGCAACATTA CTGCCTTTAG GAGATGGTAT TCTCATGCTT CGTAAAAATG 7560 

TAGCAGATGT TCAACTGTCT GAAAGCGAAT GATTTTCAGA AAAATTTAAG AAAAAATAGT 7620 

AAAATAGATA GAGTAACACT TATCTCAAAG GAGTAGACAT GAAGAAAAAA TTATTGGCAG 7 680 

GTGCCATCAC ACTATTATCA GTAGCAACTT TAGCAGCTTG TTCGAAAGGG TCAGAAGGTG 7740 

CAGACCTTAT CAGCATGAAA GGGGATGTCA TTACAGAACA TCAATTTTAT GAGCAAGTGA 7800 

AAAGCAACCC TTCAGCCCAA CAAGTCTTGT TAAATATGAC CATCCAAAAA GTTTTTGAAA 7860 

AACAATATCG CTCAGAGCTT GATGATAAAG AGGTTGATGA TACTATTGCC GAAGAAAAAA 7920 

AACAATATGG CGAAAACTAC CAACGTGTCT TCTCACAAGC AGGTATGACT CTTGAAACAC 7980 

GTAAAGCTCA AATTCGTACA AGTAAATTAG TTGAGTTGGC AGTTAAGAAG GTAGCAGAAG 8040 

CTGAATTGAC AGATGAAGCC TATAAGAAAG CCTTTGATCA GTACACTCCA GATGTAACGG 8100 

CTCAAATCAT CCGTCTTAAT AATGAAGATA AGGCCAAAGA AGTTCTCGAA AAAGCCAAGG 8160 

CAGAAGGTGC TGATTTTCCT CAATTAGCCA AAGATAATTC AACTGATGAA AAAACAAAAG 8220 

AAAATGGTGG AGAAATTACC TTTGATTCTG CTTCAACAGA AGTACCTGAG CAAGTCAAAA 82 90 

AAGCCGCTTT CGCTTTAGAT GTGGATGGTG TTTCTGATGT GATTACAGCA ACTGGCACAC 83 40 

AAGCCTACAG TAGCCAATAT TACATTGTAA AACTCACTAA GAAAACAGAA AAATCATCTA 8400 

ATATTGATGA CTACAAAGAA AAATTAAAAA CTGTTATCTT GACTCAAAAA CAAAATGATT 84 60 

CAACATTTGT TCAAAGCATT ATCGGAAAAG AATTGCAAGC AGCCAATATC AAGGTTAAGG 8520 

ACCAAGCCTT CCAAAATATC TTTACCCAAT ATATCGGTGG TGGAGATTCA AGCTCAAGCA 8580 
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GTAGTACATC AAACGAATAG TCCAAATCAA TGAGTCAGGG AAAAAACTCG ACTTCAGGAA 8 640 

AAAATCAAGC AAACATTCCC ACAATAAAAC GCATAGTACA AGGTTTCTAC TGCCCCCCAA 8700 

AAAGTTAGAC AATTAATTTA TCCGAAGGAT TTAGTTCTGT ATTGCACAGA GCTAAGTCCT 8760 

TTTAGTTTTA TCTTAATTCT CTTATTGTTG TAATAATCAA TATAGTCTAT AATGGCTCGT 8820 

TCCAATTGAT TAAGTGATTT AAATGTTTTC TCATAGCCAT AAAACATTTC GGATTTTAAA 8880 

ATCCCAAAGA AAGATTCCAT CCTACCGTTG TCTTGGCTGT TGCCCTTACG TGACATGGAT 8940 

GCTTGAATTC CCTTACTCTC TAGGAAGCGA TGATAAGAAT CGTGTTGATA TTGCCAGCCT 9000 

TGGTCACTAT GGAGAATCGT ATTCTCGTAG TCCTTCTCTT TGAATGCCTG TTCCAACATT 9060 

AACGATCAAT CAATTTAATC ATGTACCTAA CATTAGAATT GTTTATCCCA AATTTATTTG 9120 

AAAGCTTCTC TAAGCTATAT CCTTGTTTTC TAAGTTCATA GATCTGAACT TTATCATCAT 9180 

AAGTTAATTT CATAATAAAA ACACCCCAAA AGTTAGATTT TTTCTGTCTA ACTTTTGGGG 9240 

TGTAGTTCAT GTACACCTGA TATGATGCGT TTTATAATTT TAAAGACTTT TTGACCAGCC 9 300 

TCATTTTTTT AACTTGATAC TCAGTGAAAA GCAAAGATTA AACTAGGAAG CTAGCTGTAG 9360 

GCTGCTCAAA GAACAGCTTT GACGTTGTAG ATAAAACTTG TGAGGTCACC AACATATATA 9420 

ATGTGAAGCT GACGTGGTTT GAATAGATTT TAGAAGAGTA TGAGTCTGGA AGTTTTAATG 9480 

GATAATGCAA GATTCCATAG AATGGGTAAG CTAGAGTTCT TATGTGAAGA GTTTGGGCAT 9540 

AAACTTTTAC CTTTTCCTCC CTACTCATCT TAGTATAGAA AAGTGAATCT GAAATAGTAC 9600 

ATAACTGCTT CTAAAACATT CTTATAAATT GATTTAAATT CTCAAATCAT ATTATTCAGT 9660 

TCTTATTTCA TTTTGTTCTA CAATCCTGTT GAGAAGACAC GTGTTCATAT CAAAAAGGTA 9720 

TTGGCAAGTT GCAATACCTT TTTACGAGGC TCTGTTGTCT TATTTTTGTT TCAACTGACT 9780 

ATATCTCCTA TGGTTCTAGT TCAGAAGGCT AGGCTATAAT TATGATTGAT AAGAAGTATC 9840 

ATTCCAAGTA TTGGGACTGA ATGTTTCAAA ATCATGGGTT TCTATAATGG TCAGGCTGGC 9900 

ATTTCCTAGA CCGCCATCTT TACGAAGAAG TGGTTCTTTA TAGCCTAGGA GAGTACGAAG 99 60 

ACTGGCAGTA AGATTGGCGC CGTGTCCGAC AATTAGAATA CGTTCAGCTG GACTATCTTT 10020 

TAATGATT7G ATAAATTGGA TGGTCCGTTG ACTTGTACTA TAGAGGGATT CGGCTCCGAA 10080 

CATTCGAGTG TCAAATTGAG CAAGATTTGA ACGAAAAGCC TCGATTTGTT GCGGGTAAAT 1014 0 

AGCTTCCAAG GTTGCAATTT TCAAACCTTC TAACTTCCCA AGTTGCCATT CACGGAGATT 10200 

AGGAACGATT TCTAAAGAAC AGGGGGTATA GAGTTGACTT TGGATAATCT CAGCAGATTT 102 60 

GACCGCTCGA GGTAAATCAC TTGAATAAAT CTGATCAAAA GGAATTTCCT TGAGATACTG 10320 
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ACCAAGTCCT TTTAGCCTTT CAATGGATTC AGGAAGAAGA GGAGAATCAC CACTAGCACC 
TTCAAAACCA CCTTCTTGGT TCCAGAGGGT ACGACCCTGG CGGACAAAGT ACAGTTTCAT 
TACTTGATGT CCTCCAAAAT ATCTACAAAG TCTGCCTTTA CAAAGCTAGC CAAGTCTTGT 
GGCGCGACGA TAATGCTGTG TCCGACTTCG CCTGCAGAGA CAATCATTTG ATCCAAATCT 
AGAGCAATTT TATCGATAAA AATGGCATAA TTCTGTTTCT GACGAATTCC GACAGGATTA 
TTGCCTCCAT GAATCTAACC AGTTGTTTTT TCTAAGTCCT TTTGTGGAAT CATGCTCACT 
TTTTTATTGC CAGAAATTTT AGCTAGTTTC TTTTCAGACA AGTGCTGAGT GATAGGGACA 
ATTCCGATAA TCGGTCCGGT CTTGTCTCCC AAAAGCGCCA AGGTTTTGAA AATCTGATCT 
CGTTCATAAC CTTGAGGAAG CTCTCCTTCT AGGGCATTGA TTTGAATCCC CTGATGAGGG 
ATAGCTGCTT TAGATAGGAT TTGTTCCACC AATGTTTTTT TGATTTTAAC TTTTTTTGCC 
ATTATTTATA TTTATCCTCC AATTGACTCA TCCAAATACC AAGCCAGATT CCCACCGCAA 
AGAAGAAGGC CATCATGACA TAACCGACAA GTGAAAGTCC TGTGTATTGG ATACTTTCAG 
CGTTTCCTGC ATTTGGAATT AACATCAAAA GGGTACTTGA TACGACGATA CCCATCATGA 
AATGATAGAC GAACTGTTTA CGGAGTTCTT CTAGTTCTCC GTCCGTCCAA GCGTAGGCCA 
CTTCTTCTTT CTTGCCTTTA CCTTTGGACA TCTTGTAAAG AGGTGGGAGG GCAATATAGA 
CATGACCTGC CTCGACTAGC GGACGCATGT AACGGTAGAA AAATGTCAAG AGCAAGGTCT 
GGATATGGGC ACCGTCGGTA TCCGCATCG 
(2) INFORMATION rOR SEQ ID NO: 109: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D> TOPOLOGY: linear 



10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11309 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
CCATAGTCTA ACAAGTCTTT GTAAAGGTTT ATCCCTGATT CATGTAAAGA TTGTGTAAAG 
AATCAAAAAA AGCCACTTTT GAAAAATGGC TGCTCCTAAA AATAGCTTTA AAAATTATTA 
GTCCTGTGCG AAAGATTGGT TAGGAAGAAA AATCGTGAAG CAACTGCCTC TGCCAAGCTG 
ACTCGTCACC GTGACTTGGC CACCTAATAA TTGACTGAGT TCTTTGACAA TGGCAAGGCC 
AAGACCAGTG CCACCAGTTT GTCTGCTTCG ACCTTTATTA ACTCGGTAAA AACGTTCAAA 
AATACGATCC TGCTCTAATT GACTAATACC AATCCCTGTA TCTGATACAG AAATCTTAAT 
CCCTTCGTTC ACCTTTTGGG TCTTGACCTC AATTTTTCCC CCTTGTTCAG TGTAACGGAT 



60 
120 
180 
240 
300 
360 
420 
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GGCATTGGAT AAAAGATTCA GTAACATTTC GCAAAGTAAT TGACTATCTG ATACGAGGGT 480 
GACATCATCT GGCACCTGCA CCTTTAGCTG TAAATCCTTC TTCTTGACC7 GAGGTTGCAA 540 
GCTTTGAGTC AAATCCTGTA CAAATTCTGC CAAAGAAAGG GTCCTCCATT GTATAGGCAT 600 
TTGTTGAGCC TTAGATAAGG TAAGAAGATG CTCAACAATA TGCTCAAGAC GCAAACTTTC 660 
TTTGTAAATA ATGTCTAGAA AGTCATCCTT CAGCGCTTCT TCTTCAGCTG ACATCCCCTT 720 
AATGGTTTCA GCAAAGCCCT TAATCGAAGT AACTGGTGTC CTCAATTCAT GGGACGCATT 780 
TGAGACAAAG GCTAAATTTA ACTTTTCATA AGTTCTAATC GTTGTTAAAT CATATAGCAA 840 
G AC GAG C AC A GCTTCCACAG ATTGGGTGGG GCTAAAAACG GGAACTGCTG TCACTTCTAA 900 
AATCAAGTCA CCCTCATCAA ACCCACTTAC TTCTTGTTTT AACCTTGTTT TTTGATCAAA 960 

GGCTTGGTGA ACTAAATTCC GAATATCCAT CCGTTTGACG TCATCAAGTG AACTTATGTC 1020 

GCCGTCCACA TCGGGAAAAT AATGAGGCAC AGAGCGACTG GATAATAACA TCTGACCTTG 1080 

AGCGGAAACT AAAAACGTCC CCATGGTTAG GTGCGACAGA AGAACCTCCA TTGTTTCGCC 1140 

TAGATCCTTG TATTGCTGAT CCTGTTGGGA GACTTTGGTT TTTAGGCCAG ACACATACTG 1200 

AGCCAAAGAC TTTAAGTCTT CTTGCCCTTT TTCTAAAAAG TATTCACTAC TGGTCAAGAG 1260 

AGCTTGGTGC AAGGTCTCAA AAGCAACTTC CCATTTCCAA AGGCAAAAGA GCCAGTAGCC 13 20 

ACCTAGTCCC AAAGAAAGGG CTAGAAGAAA GAGACCGATG CCTTTACTGA TCCAAGTTAA 1380 

TGCCATCCCT GCAATCAGAA TGAGGCTAAC ACTTAGATTG ACT AG CC AAA ATTGAAGGTA 1440 

GCGTTTCATC TATAACTCCT TGAACTTATA ACCATAACCC CGAATGGTTC GAATAAATTG 1500 

ACGCGCTTTA GGATTGTCTT CAATTTTTTC CCTCAACTTA CCAATATGAA CGTCCACCAA 1560 

ACGTGTTTCC TGCCCAAAGT CATACCCCCA GATACCTTCC AAAAGACGCT CTCTAG7CAG 1620 

TGTCATGTTG GGATCTTTCA TAAGATAGAG CAAGACTTCA AATTCTTTTG GGGTCAAACT 1680 
CAGTAACTTA TTCCCCTTGT AGACTTCATG ACGCTCAGGG TATACTTTCA AGGTCCCAAA . 17 40 

TAGCCAAGAA TCGTCAGCGA TATTATCTGA ATCATCTCCT TCTTCTTCTC CTTTAGTTCG 1300 

CCTGAGCACA CCCTTGACAC GCGCCAGCAA TTCTCTAGGG CTAAAAGGCT TGGTCAGGTA 1860 

GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT TCGCAGAAAC 1920 

CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT CCATGCCATC 1980 

TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT CTGCCAAAGC 2040 

TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC TTAAATGGTA 2100 

GTCAAGCAAT TTC AG AATGT GTTCTTCATC ATCCACTAAT AAGACTTGTT TTGTCATCTA 2160 
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TTATCTCCTA TTGGTAACAT TATAACACAA TTATCAGAAA TCCTAACATT CCTAAATCAC 22 20 

ATTAAATTTG CCTATCAAGA CTAGTATCTG GTCAAACCCT CAATCATCTC CTTGTGCTCT 2280 

GGATAGGTCG CCAGTAGATC TACCCTTTCA AATAATTCAA AATCCTCAAA TTCAAAACCA 2 340 

GGAGCAACAA GACAAGAAAC CAGAGCATCA TCCTTATCAA CTGTTGATCC CCAAATAGTG 2 400 

CCCTTAGGAA CACAGTAGTG AAGTTGTTGC CCTTTGGATA TGTCCAGGCC TAAAGTGACT 2460 

GCTTCGTAGT GACCATCTGC TGTAATCATG TGAACAGTAA GTGGGGATCC TGCATGAAAA 2520 

TACCAGATTT CATCTGCTGT CAATCGGTGA AAATGTGAAG GATTCGTTTC TTCTAATAAG 2580 

AAATAAATAC TGGTATAAAG CGCCCTTCCC TTACCAGCAA GGTTTATAGT GTCTGAAGCT 2 640 

TTTTTTGTTT GTCTAAAATA GCCACCTTCA ATATGGGGAG CTAACTCTAG AGTTCTTATC 2700 

AAGTCTTCTT TATCCGTCGG AGCCAATGGG TTGAAGTAAC T C TT G TTCAA AGTGGTTTTA 2 760 

CGATTTCAAG AACTCCTCTC AGTTCTGAGG ACACGGTAAT GATTGATGCG ACGGAAGTAC 2 820 

AAATCAATCG CCCTAAAAAA AGAATTAGCG AATGATTCTG GTAAAAAAAA TGCCACGCTA 2880 

TGAAGGCTCA AGCGATTGTC ACAAGTCAAG GGAGAATTGT TTCTTTGGAT ATCGCTGTGA 2940 

ACTATTGTCA TGATATGAAG TTGTTCAAAA TGAGTCGCAG AAATATCGGA CAAGCTGGTA 3000 

AAATCTTGGC TGACAGTGGT TATCAAGGGC TCATCAAGAT ATATCCTCAA GCACAAACTC 3 060 

CACGTAAATC CAGCAAACTC AAGCCACTAA CAGTTGAAGA TAAAGCCTAT AACCATGCGC 3120 

TATCCAAGGA GAGAAGCAAG GTTGAGAACA TCTTTGCCAA AGTAAAAACG TTTAAAATGA 3180 

TTTCAACAAC CTATCGAAAT CATCGTAAAC ACTTCGGATT ACGAATGAAT TTGATTGCTG 3240 

GCATTATCAA TCATGAACTA GGATTCTAGT TTTGCAGGAA GTCTATTATT TCCTTAGGTG 3 300 

AATTAGTGAA GCGTTTAGGC AAGTGTCTCT GGTTACGACG TCATGGACTC TAAATCGATT 3360 

ATATTTAGGG GTCATGACTA GTGAAGCAGT TAGCTAGTTC GCATATAAGC GGCTAGCGTC 3420 

TAACAATTAG GAACTTTAG7 TCCAATAACT TTAAGATTAC GACGTTTTAG GACATAAATC 3 480 

GATCATATTT ATCTCCTAAA ACTAGTGAAG CCCCTACCCA AAGTCCGAAT AGGATTTNJGC 3540 

CTTAGTTACT TAGATTGCTT TGCAATCAAG TAACTT7CGC GATTTACATC TTCTCTGGCC 3 600 

CTTCTACTCC AAGCAAGCGA AGGGCTTCTT TGAGAACGAC TGCGGTTGCG TAGCTGAGGG 3 660 

CTAGACGGCT GTCGCGTTCT GGGCTTTCAT CCAAGATACG TGTATGTGCA TAGTATTTGT 372 0 

TAAAGCATTG AGCCAGGCTA ATTGCAAATT TAGCAATCAT AG AAGGTT C A AAGTTATCTG 3780 

CCGCACGGTT GATAATACGT GGGAAGTCTT GAATGAGTTT AATGATTTCC CACCTTTCAC 3 840 

TATCATTCAA GCTATAGTTG CCAGCTGTTT CTGGTTTCAA ATCGGCTTTG CGTAAGATAG 3900 

ATTGGATACG AGCGTAGGCA TATTGAACGT AAGGTCCAGT TTCACCCTCG AAGGATACCA 3 960 
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TAGCCTCTAG GTCGAAGTCC TATCCATTTG TACGGTCGGT TTTGAGGTCA TAGAATTTAA 4020 

TGGCTCCAAT CCCAACAGCA TGTGCTACTT GGTCTTTGTT TTCTACTTCA GGATTTTTAG 4080 

CCTCGATTTG GACCTTGGCA CGGCTAACAG CCTCTGCAAC AGTAGGCTCT AGCAAGATGA 4140 

CATTCCCTTT ACGAGTAGAG AGTTTCTTCC CTTCTTTTGT AACCAAACCA AAAGGAACGT 4200 

GAGTAATGTC GTCACTCCAG TCGTAGCCCA TCTCTTCCAA GACAGCTTTG AGCTGTTTAA 4260 

AGTGGGCAGA TTGTTCTTGA CCAACGACAT AGATAGATTT AGCAAATTGC TATTCGTTTT 4 3 20 

TACGGTAGAG GGCTGCAGCC AAGTCACGTG TG AT AT AG AG AGTTGCACCA TCACACTTCT 4380 

TGATGAGGGC TCCATCTTCA ATT C CAT ATT TCTCAAGATT CACAACTTGG GCACCTTCTG 4440 

ATTCAAGAAG TAGTCCTTTT TCAGAAAGAA TGTCTACAAC TGCATCCATC TTATCATTGT 4 5 00 

AGAAGGCTTC TCCGTTATAG CTGTCAAATT CAACCTTCAA TTCATTGTAA AGGCGGTTAA 4 560 

ATTCCACTAA ACTTTCATCG CGGAACCATT GCCAAAGAGC GAGAGCTTCC TCATCTCCAT 4 620 

TTTCAAGTTT ACGGAACCAT TCGCGCGCTT CTTCATCCAA GCTAGGGTCA TTTTCAGCTT 4680 

CAGCGTTGAT GCGGACATAG AGTTTAAGGA GTTCATCGAT TGCATCACCT TTTACAGCTT 4740 

CTTCGTCGCC CCATTTTTTG TAGGCAACAA TCAACATCCC AAATTGTTTA CCCCAGTCTC 4800 

CCAAATGCTT GACCTTGACC GTTTGATAAC CGATTTTTTG GAAAATATGT GACAAGCTAT 4860 

CTCCGATAAC AGTTGAACGC AGGTCGCCAA TAGAAAATGG TTTAGCGATA TTCGGACTAG 4 920 

ACATGTCGAT AACAACATTT TCTTGTTTAC CAATATTTTG GTCAGCATAG TGTTCTTTTT 4980 

CAGTGGTAAC AGCTTGCAAT ACTTGAGCAG AAATGGCAGA TTTATCAAGG AAAAAGTTAA S04 0 

CGTAAGGTCC TGTTGCGACA ACTTTTTCAA AGGCTTGGCT GTTCATTTTT TCAGCCAGTT 5100 

CAGCCGCAAT CATTTGTGGT GCTTTACGTT CGACTTTTGC AAGAGAAAAA GCAGGGAAAG 5160 

CAATGTCTCC CATTTCTGAG TTTTTACGGG TTTCCAGTAA CTTTAAAATA GCCTCTTGGT 5220 

CCAGGCTATC AATGATGCTA GATAATTCGC TAGCAATCAA TTCTTTTGTA TTCATTAAGA 5280 

CCTCCTTTTT GGACTTTTCT ACTATTTTAT CACAATTTTA AAGAAAGAAG AAAAAATTTT 5340 

TGAAATCTCC TGTTTTTTTG GTATAATATG GTTATAAATA TAGTTATAAA TATGCACGCA 5400 

ACAGCATTTT ATGAGAAAAA GAGATCGTCA TCAGTTAATA AAAAAAATGA TTACTGAGGA 5460 

GAAATTAAGT ACACAAAAAG AAATTCAAGA TCGGTTGGAG GCGCACAATG TTTGTGTGAC 5520 

GCAGACAACC TTGTCTCGTG ATTTGCGG 5548 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3132 base pairs 
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<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

TACCCCGTAG TCTTAGCAGA CACATCTAGC TCTGAACATG CTTTAAACAT CTCTGATAAA 60 

GAAAAAGTAG CAGAAAATAA AGAGAAACAT GAAAATATCC ATAGTGCTAT GGAAACTTCA 120 • 

CAGGATTTTA AAGAGAAGAA AACAGCAGTC ATTAAGGAAA AAGAAGTTGT TAGTAAAAAT 180 

CCTGTGATAG ACAATAACAC TAGCAATGAA GAAGCAAAAA TCAAAGAAGA AAATTCCAAT 240 

AAATCCCAAG GAGATTATAC GGACTCATTT GTGAATAAAA ACACAGAAAA TCCCAAAAAA 300 

GAAGATAAAG TTGTCTATAT TGCTGAATTT AAAGATAAAG AATCTGGAGA AAAAGCAATC 360 

AAGGAACTAT CCAGTCTTAA GAATACAAAA GTTTTATATA CTTATGATAG AATTTTTAAC 420 

GGTAGTGCCA TAGAAACAAC TCCAGATAAC TTGGACAAAA TTAAACAAAT AGAAGGTATT 480 

TCATCGGTTG AAAGGGCACA AAAAGTCCAA CCCATGATGA ATCATGCCAG AAAGGAAATT 540 

GGAGTTGAGG AAGCTATTGA TTACCTAAAG TCTATCAATG CTCCGTTTGG GAAAAATTTT 600 

GATGGTAGAG GTATGGTCAT TTCAAATATC GATACTGGAA CAGATTATAG ACATAAGGCT 660 

ATGAGAATCG ATGATGATGC CAAAGCCTCA ATGAGATTTA AAAAAGAACA CTTAAAAGGC 720 

ACTGATAAAA ATTATTGGTT GAGTGATAAA ATCCCTCATG CGTTCAATTA TTATAATGGT 7 80 

GGCAAAATCA CTGTAGAAAA ATATGATGAT GGAAGGGATT ATTTTGACCC ACATGGGATG 840 

CATATTGCAG GGATTCTTGC TGGAAATGAT ACTGAACAAG ACATCAAAAA CTTTAACGGC 900 

ATAGATGGAA TTGCACCTAA TGCACAAATT TTCTCTTACA AAATGTATTC TGACGCAGGA 960 

TCTGGGTTTG CGGGTGATGA AACAATGTTT CATGCTATTG AACATTCTAT CAAACACAAC 1020 

GTTGATGTTG TTTCGGTATC ATCTGGTTTT ACAGGAACAG GTCTTGTAGG TGAGAAATAT 1080 

TGGCAAGCTA TTCGGGCATT AAGAAAAGCA GGCATTCCAA TGGTTGTCGC TACGGGTAAC 1140 

TATGCGACTT CTGCTTCAAG TTCTTCATGG GATTTAGTAG CAAATAATCA TCTGAAAATG 1200 

ACCGACACTG GAAA7GTAAC ACGAACTGCA GCACATGAAG ATGCGATAGC GGTCGCTTCT 12 60 

GCTAAAAATC AAACAGTTGA GTTTGATAAA GTTAACATAG GTGGAGAAAG TTTTAAATAC 13 20 

AGAAATATAG GGGCCTTTTT CGATAAGAGT AAAATCACAA CAAATGAAGA TGGAACAAAA 13 80 

GCTCCTAGTA AATTAAAATT TGTATATATA GGCAAGGGGC AAGACCAAGA TTTCATACGT 144 0 

TTGGATCTTA GGGGCAAAAT TGCAGTAATG GATAGAATTT ATACAAAGGA TTTAAAAAAT 1500 

GCTTTTAAAA AAGCTATGGA TAAGGGTGCA CGCGCCATTA TCGTTGTAAA TACTGTAAAT 1560 
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TACTACAATA 


CAGATAATTG 


GACAGAGCTT 


CCAGCTATGG 


GATATCAACC 


' GG ATGAAGGT 


1620 


ACTAAAAGTC 


AAGTGTTTTC 


AATTTCAGGA 


CATGATGCTG 


TAAAGCTATG 


GAACATGATT 


1680 


AATCCTGATA 


AAAAAACTGA 


AGTCAAAAGA 


AATAATAAAG 


AAGATTTTAA 


AGATAAATTG 


1740 


GAGCAATACT 


ATCCAATTGA 


TATGGAAAGT 


TTTAATTCCA 


ACAAACCGAA 


TGTAGGTGAC 


1800 


GAAAAAGAGA 


TTGACTTTAA 


GTTTGCACCT 


GACACAGACA 


AAGAACTCTA 


TAAAGAAGAT 


1860 


ATCATCCTTC 


CAGCAGGATC 


TACATCTTCG 


GGGCCAAGAA 


TAGATTTACT 


TTTAAAACCC 


1920 


GATGTTTCAG 


CACCTGGTAA 


AAATATTAAA 


TCCACGCTTA 


ATCTTATTAA 


TGGCAAATCA 


1980 


ACTTATGGCT 


ATATGTCAGG 


AACTAGTATG 


GCGACTCCAA 


TCGTGGCAGC 


TTCTACTGTT 


2040 


TTGATTAGAC 


CGAAATTAAA 


GGAAATGCTT 


GAAAGACCTG 


TATTGAAAAA 


TCTTAACGGA 


2100 


GATGACAAAA 


TAGATCTTAC 


AAGTCTTACA 


AAAATTCCCC 


TACAAAATAC 


TGCGCGACCT 


2160 


ATGATGGATG 


CAACTTCTTG 


GAAAGAAAAA 


AGTCAATACT 


TTGCATCACC 


TAGACAACAG 


2220 


GGACCAGGCC 


TAATTAATGT 


GGCCAATGCT 


TTGAGAAATG 


AAGTTGTAGC 


AACTTTCAAA 


2280 


AACACTGATT 


CTAAAGCTTT 


GGTAAACTCA 


TATGGTTCCA 


TTTCTCTTAA 


AGAAATAAAA 


2340 


GGTGATAAAA 


AATACTTTAC 


AATCAAGCTT 


CACAATACAT 


CAAACAGACC 


TTTGACTTTT 


2400 


AAAGTTTCAG 


CATCACCCAT 


AACTACAGAT 


TCTCTAACTG 


ACAGATTAAA 


ACTTGATGAA 


2460 


ACATATAAAG 


ATGAAAAATC 


TCCAGATGGT 


AAGCAAATTG 


TTCCAGAAAT 


TCACCCAGAA 


2520 


AAAGTCAAAG 


GAGCAAATAT 


CACATTTGAG 


CATGATACTT 


TCACTATAGG 


CGCAAATTCT 


2580 


AGCTTTGATT 


TGAATGCGGT 


TATAAATGTT 


GGAGAGGCCA 


AAAACAAAAA 


TAAATTTGTA 


2640 


GAATCATTTA 


TTCATTTTGA 


GTCAGTGGAA 


GCGATGGAAG 


CTCTAAACTC 


CAGCGGCAAC 


2700 


AAAATAAACT 


TCCAACCTTC 


TTTGTCGATC 


CCTCTAATGG 


GATTTGCTGG 


GAATTGGAAC 


2760 


CACGAACCAA 


TCCTTGATAA 


ATGGGCTTGG 


GAAGAAGGGT 


uAAOATCAAA 


AACACTGGGA 


2820 


GGTTATGATG 


ATGATGGTAA 


ACCGAAAATT 


CCAGCAACCT 


TAAATAAGGG 


AATTGGTGGA 


2880 


GAACATGGTA 


TAGATAAATT 


TAATCCAGCA 


GGAGTTATAC 


AAAATAGAAA 


AGATAAAAAT 


2940 


ACAACATCCC 


TGGATCAAAA 


TCCAGAATTA 


TTTCCTTTCA 


ATAACGAAGG 


GATCAACGCT 


3000 


CCATCATCAA 


GTGGTTCTAA 


GATTGCTAAC 


ATTTATCCTT 


TAGATTCAAA 


TGGAAATCCT 


3060 


CAAGATGCTC 


AACTTGAAAG 


AGGATTAACA 


CCTTCTCCAC 


TTGTATTAAG 


AAGTGCAGAA 


3120 


GAAGGATTGA 


TT 










3132 


(2) INFORMATION FOR SEQ ID NO : 111: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14672 base pairs 
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(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(0) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

CGAGATTTCT TTAAATGAAC TACGTGAAAT CTACCCATCA TCCAGATCTG GATATTCTCT 60 

CCTATCTATA AGTAAAGTTT TAGGAGATTT TAATATAAGT TCTCATGCTT TTAAAGCTTC 120 

GGTAAGAGAT TTAAAACCGC TCAGTTTCCC ACTCATTTGC TTCTCGGAGA CTTCTCATTT 180 

TATTATTCTT GAAAAAA7TA GTAAAAACAA GTTTTATATT TTAGATCCTG CAAAAGGCAG 240 

GCAGAGAATG TCAATAAGTG AATTTGAAAG GCATTATTCA AATATCATTT TAACATTTAA 300 

AAAGTTAGAT AGCTTTATGT CTCGTAAAGA TAATAAGAAG TCGCCTGTTT TAAAGTATTT 360 

TTTTAAGTAT AGGAATAAGC TAGGGATTTT ATTTTTTGTA ACAGCATTAT TGTATGTAAT 420 

ACAATCATTA GTACCTATAG CTAATAGATA CATAATTGAC ACGAATTTCA AGGACGATTC 480 

GTATTCGTCT AGAATGTTAT TTACTATATT ATTTATATTT ACTGTTTCAT TCTCACTAAT 540 

GTATTTATTA AGACAGATAT ATGTTGCATC CTTAAAATAT ATAATGGATA AAGAGATTAG 600 

CTATCATTTT ATGAAACATT TGATATATTT ACCTTACAGT TTTTATCAAA AACGTACTTT 660 

AGGGGATATA CTTTTTAGAG CTAACTCTAT TGTTTATATA AGAGAAATAC TATCAAATAA 7 20 

TTTTATAGCA GCTATACTTG ATTTGTTAAT GATTGTGGTT TATGCTGTGG TTTTATTTAC 780 

CTTTTCTAAG TACATGGTAA T C TTTTTAAT ATCACTAACT CTAGCTCTAT CTATTGTAAT 840 

G7ATCCAATC ATAAAAATCT CAAAAAATTT AATTGATAAA AATATAAAAG AAAAGGTTAA 900 

TGTTCAAAAT ATTACTTCCG AAGTAATTTC TAAAAATAGT GATATTAAGC TAACTGGAGA 9 60 

AGAGGAATTT TGGATTAACA AATGGGATAA TTTTAATACA AAACAGCTCA TCATAGGTCG 1020 

AAAACTTGAT ATACATTTAT CAATTGTTAG TAGTATAACG AATGTTTTAC AAATTATTCT 1080 

CCCTGTTTTG ACCCTTATTG TAGGTGTAAA TATAAAAACA TTCGAACAAT TGACCTTAviG 1140 

ACAAATTGTA GCAATAACTA CAGTCTCACC ATACTTTATT TCTCCTATAA TTTCTTTAAG 1200 

TGATAACTAT ATACAATTAA TGTTATTAAA GGGATATTTT TTAAGAATAG AGGATGTGTT 1260 

TAATACTAAA TCCGAATTAA TTCCAGAAAG AGTCAGTCAA GATATAAAAT TTGATAAAAA 1320 

AATAGAATTA AAAGATATTT GGTATAAATA TGGATTATTT GATGATTATG TTTTGAAAGG 1380 

AATAAATGTT ACTATTAAAA AAGGAGAAAC TGTTGCTATT GTTGGAGAA7 CAGGTTCAGG 144 0 

TAAGAGTACA TTAGCTAAAA TTTTATTAGG TTTATTAGAA CCTAATATTG GTTCAATAGA 1500 

AGTTGATGGA GTAGAAAAAG AAGAAATTGG TCAAACATTG TATAGAAAGA TTTTTGGAGC 1560 
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AGTGTTACAA 


AATTCAACCC 


TAAGTTATGG 


TACCTTAAGA 


GAGAATTTGA 


CATTTGGACA 


1620 


CTTTG7TTCA 


GATGAAGAAT 


TAATGACAAA 


TCTAAATTCA 


ATTGGTCTTA 


GCAATGTAGT 


1680 


TAAATCTTTA 


CCTCTTGGAT 


TAGAGACAAT 


CATCGCTGAA 


GAAGGTAATA 


ACTT' P TCTGC 


174C 


AGGGCAGCAG 


CAAATGATAC 


TTTTAGCTCG 


TTGTCTTTTG 


TCGAAACCTT 


CGGTAGTTGT 


1 800 


TTTGGACGAA 


GCAACAAGTA 


GTTTAGATAA 


TTTATCTCAA 


CAAATTACAA 


CTTCTTAfTT 

^4 • V- I 4 ** V* 4 i 


1860 


AAGTGAAATC 


GGTACCACTA 


AGATTTTAAT 


TGCCCATCGA 


CTAGATACTA 


TCAAGTCTGC 


192 0 


AGATAAGATC 


TTAGTAATGC 


ATAATGGTGA 


AATTGTAGAG 


ATTGGGACCC 


ATAGAGAACT 


1980 


TCTTGAACTA 


GGAGGCATTT 


ATAAGCAATT 


GTATTCAAAT 


AATTAGTTTT 


TGATTAAAAG 


2040 


GGTAAATTTA 


TGAAGATTAT 


GAAAAAAAAA 


TATTGGACTT 


TAGCGATATT 


ATTCTTTTCT 

*» 4 4 4 4 4 4 *»J 4 




TTGTTCAATA 


ATTCTCTTAC 


TCCTCAAGAA 


ATACCTAAAA 


ATCTTGATGG 


CAATATAACT 


~> i fin 


CACACTCAGA 


CTAGCGAAAG 


TTTTTCTGAA 


TCTGATGAAA 


AACAGGTTGA 


CT ATTCT AA T 

V* 4 *» 4 4 \* 4 4 


2220 
c & « y 


AAAAATCAAG 


AAGAAGTAGA 


CCAAAATAAA 


TTTCGTATTC 


AAATCGATAA 


GACAGAATTA 


2280 


TTTGTAACAA 


CAGATAAACA 


TTTAGAAAAA 


AACTGTTGTA 


AATTGGAACT 


TGAACCACAA 


2340 


ATAAATAACG 


ATATTGTTAA 


CTCTGAAAGT 


AATAATTTAC 


TAGGCGAAGA 


TAATTTAGAT 


2400 


AATAAAATTA 


AGGAAAATGT 


TTCTCATCTA 


GATAATAGAG 


GAGGAAATAT 


AGAGCATGAC 


2460 


AAAGATAACT 


TAGAATCGTC 


GATTGTAACA 


AAATATGAAT 


GGGATATAGA 


TAAAGTTACT 


2520 


GGTGGAGGCG 


AAAGTTATAA 


ATT AT ATT CT 


AAAAGTAATT 


CTAAAGTTTC 


AATTGCTATT 


2580 


TTAGATTCAG 


GAGTCGATTT 


ACAAAATACT 


GGATTACTGA 


AAAATCTTTC 


AAATCACTCA 


2640 


AAAAACTATG 


TCCCCAATAA 


AGGATATTTA 


GGAAAAGAGG 


AGGGAGAGGA 


AGGAATAATA 


2700 


TCAGATATTC 


AAGATAGATT 


AGGTCATGGT 


ACGCCTGTTG 


TAGCTCAAAT 


TGTAGGGGAT 


2760 


GACAATATTA 


ATGGAGTAAA 


TCCTCACGTT 


AATATTAACG 


TCTATAGAAT 


ATTTCGTAAG 


2820 


TCGTCAGCTA 


GTCCAGATTG 


GATTGTAAAA 


GCAATTTTTG 


ATGCTGTAGA 


TGATCGCAAT 


2880 


G AT ATT AT C A 


ATCTTAGTAC 


TGGACAATAT 


TTAATGATTG 


ATGGAGAATA 


TGAGGACGGA 


2940 


ACAAATGATT 


TTGAAACATT 


TTTGAAGTAT 


AAAAAGCCTA 


TTGATTACGC 


GAATCAAAAA 


3000 


GGAGTAATTA 


TAGTAGCTGC 


ATTAGGGAAT 


GACTCCCTAA 


ATGTATCAAA 


TCAGTCAGAT 


3060 


TTATTGAAAC 


TTATTAGTTC 


ACGCAAAAAA 


GTAAGAAAAC 


CACGATTAGT 


AGTTGATGTT 


3120 


CCAAGTTATT 


TCTCATCTAC 


AATTTCGGTC 


GGAGGCATAG 


ATCGCTTAGG 


TAATTTATCA 


3180 


GATTTTAGCA 


ATAAAGGGGA 


TTCTCATCCA 


ATATATGCGC 


CTGCAGGCTC 


AACATTATCT 


3240 


CTTTCAGAAT 


TAGGACTTAA 


TAACTTTATT 


AATGCAGAAA 


AATATAAAGA 


ACATTCGATT 


3300 
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TTTTCGGCAA CACTAGGAGG ATATACGTAT CTTTATGGAA ACTCATTTGC TGCTCCTAAA 3 360 

GTTTCTGGTG CGATTGCAAT GATTATTGAT AAATACAAAT TAAAAGATCA GCCCTATAAT 3 420 

TATATGTTTG TAAAAAAATT CTGGAAGAAA CATTACCAGT AAAAAATGGT ATAAAAGTGT 3480 

TAAATATACC AAACGTATTG AGATATGATT TGAATATGTT ACAATTAGAA TATAAAAATG 3540 

AACAAAGTTG GGATAGTTTC ATAGATAATG TTAATTTAAT TGAGTTGGAA GAGAGAATTC 3 600 

AAACTACTAT TGGAATTAAA CAAATAAACA CACACAATAT TATTACTATT GCCCGAGAAG 3660 

GGTACTCTCA AAATTATTTA CCTAACACTT CAGAAAATAC ATATAATTCA TTACAAGTCA 3 720 

GTTTAGTTGG AGT ATT ACTA CTTTTTATAA GTATGGTAAA TATTTTATGG GCTAAAAAAA 37 80 

GTAAATGAAA ATAAAATTTG GAGCCCTCTG AAAAAGTAAG TCCTACAGTT CAACTAAAAT 3 840 

GAGTCAAAAG ATGAATCACC TTGATGTAGG GGAGTTTGTC TTATTGCTGC CTGAACACCT 3900 

CCGTTCAGAG GAAGAACATT ATAAATCTGT TTTTGAAGAC GACTTAACCA GTCGCATATC 3960 

TAGTCAAGAT GAACGACAGC AAATGACTGC TACGGTAGGT TATTTAGAAT CAGGTCAGGA 4020 

TCGTTTTGTG TATAATACGA CCCCTATTTC TTACCAGCAG TTTTTCAAAG ATCCAATCAT 4080 

CATTGTTATA ACACCCCAAT CAACTGCTCC ACAGTCCATT • TTGTTTTCGA TAGACGCAGT 4140 

ACAGAACTAC GTTCTCTTTA ATCAATTGTC TGATGCCCAG GAGCTTATCC AGAGACAAGG 4200 

CATTGAAAAT TGGGTCTCAG AAATGCAAAC AGGTTACCAC AACTACATCA CATTATTGGA 4260 

TAATATCCAG AGGGAACGTT GGGTAATGCT AGCAGGAGCT CTGCTTGGGA TTGCAACTTC 4 320 

AATCTTGTTG TTTAACACTA TGAATAGGCT CTACTTTGAA GAATTTAGAC GTGCCATTTT 4380 

TATCAAACGC ATTGCAGGTC TCAGGTTCTT AGAAATCCAT CGCACTTATC TCTTTGCTCA 4440 

ACTGGGTGTG TTTTTACTGG G ATTTGT TGC GAGTGTATTT CTTCAGGTAG AGATAGGAGT 4500 

TGCTTTCTTA GTCTTGTTAC TCTTTACTGG TCTATCTCTT TTACAGTTAC ATGTCCAAAT 4560 

GCAGAAAGAA AACAAGATGT CCATGCTTGT TTTGAAGGGA GGTTAATATG ATTGAACTTA 4620 

AACAGGTGAG TAAATCTTTT GGAGAACGAG AGTTATTTTC GAATCTTTCA ATGACATTTG 4 680 

AGGCTGGAAA AGTCTATGCC TTAATTGGTT CAAGTGGTAG CGGAAAAACA ACCTTGATGA 47 40 

ACATGATTGG GAAATTAGAA CCTTATGATG GGACGATTTT TTACCGAGGT AAAGACTTGG 4800 

CCAATTATAA ATCAAGTGAT TTTTTCCGTC ACGAATTGGG CTACCTCTTC CAGAACTTTG 4860 

GCTTAATTGA AAACCAAAGT ATTGAAGAAA ACCTTAAGCT AGGTCTCATT GGTCAAAAGT 4920 

TGAGTCGGTC GGAACAGCGG TTGAGGCAGA AGCAGGCTTT AGAACAGGTC GGCCTGGTTT 4980 

ATCTTGACCT AGATAAGCGC ATCTTTGAGT TATCGGGCGG AGAATCGCAA CGGGTTGCCT 5040 

TGGCAAAAAT TATCTTAAAG AATCCACCCT TTATTCTGGC AGATGAGCCA ACACCTTCAA 5100 
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TACACCCAGC 


AACCTCTCAG 


TTCATTATCC 


AGATTTTGCT 


ATCTCTTCGA 


GATGATAATA 


5160 


GGCTAATCAT 


TATCGCAACA 


CATAATCCGG 


CAATTTGGGA 


GATCGCTGAT 


GAACTGTTCA 


5220 


CGATGGATCA 


TCTGAAATAA 


AAATCCTTGT 


TTTTAATTGC 


ACGATGAGTT 


ACTGAAATAT 


5280 


TATCATGAAT 


CAAGAATTGG 


AGTTAATTTA 


GAATTGTACT 


TAATTTAGAA 


TTGTACTTTA 


5340 


TTAATATTGA 


GGTAACTTTT 


TCTTGATAAA 


GGAAGAAATA 


ATGGAGAGGA 


AGTTAGAATG 


5400 


AAAAAATTCG 


"ACAATTATAT 


"TATTGAGAAG 


CCTTGCGATT 


CTAATTCAGA 


TAAACTGCAA 


54^60" 


AAAATCTTAA 


TAATTGAAAG 


TTTGGTAGAT 


GATATTTTGC 


AATTTTCTCT 


CAGAATCAAT 


5520 


AATAGTGTAG 


GAGAGATTTT 


CCTCCTACAA 


CCGTTTTAAA 


AGAAAACTAT 


CTTTATTCCA 


5580 


TGTTATTTTG 


AGGAAGATAT 


TGTGAAAGTC 


AAAGATGATG 


ATAAAGTTGA 


GTGGAATTTG 


5640 


TTAGAATTTC 


AAAAATTTAG 


AGCATTTTTG 


GCTTAGTAAT 


CTGTGTTGAA 


GGCTCAAAAC 


5700 


CTATGGTAAA 


AAAGTAGCTT 


TGAAAACGTA 


TTGCCTCCAA 


AGATTTAGTT 


AAATAATGAT 


5760 


TTAACACAAA 


AAGAAATTAT 


TGAAGTTCTG 


GAAAGATCTT 


GTTTCAGTAT 


TGAGAAAAGG 


5820 


TGGGAAAAAC 


TTGCGATTTT 


CACAGAGAAA 


GGAAGAAAAA 


GTATAGAAAT 


ATAGTCAATT 


5880 


GAAACAAGAA 


CAGGATAAAA 


CAACCTTTTG 


TGCCATATTT 


TTCTCCTTTC 


GCTTTACAAT 


5940 


TGGATTGAAC 


ACCTTTATTG 


TATCGCGTTT 


GGAGTTTTTT 


TGGTATAACC 


TTCGACGCAC 


6000 


ACCCGCATAG 


CGGGTGTTTT 


TTTTGTCTCG 


CACCTAACGG 


AGCGAGACAA 


ACTAATACTC 


6060 


ACTTAATCAA 


AAAACGCACC 


ATATCAAAAA 


CTAAAAAGTT 


TGATATCATG 


CGTCATGTCT 


6120 


TAAACTAATT 


GACTATACTT 


TCTATTCAAA 


TGAGCTTTTA 


ACCAATTGAT 


TGAGCCAATC 


6180 


CACTCTTAAA 


ACCAAAGAGC 


AATTTCTCGC 


TTACCTGACT 


CTTCTGAATC 


TGAACCATGT 


6240 


ACAACATTTT 


GGATAATCTC 


ATTTTCTCCA 


GCAGCTTTTG 


CAAAATCACC 


TCGAATAGTG 


6300 


CCTGGTAAAG 


CTTCTTCTGG 


ACGAGTTGCA 


CCCATCATGG 


TCCGCCAAGT 


TTCGATTACT 


6360 


TTGGGACCAG 


AAATGACACC 


CACAAGAACT 


GGACCTGAAG 


TCATGAATTC 


ACGAATCGGT 


6420 


GGGTAAAAAC 


TCTGACCAAC 


CAAGTCCTGA 


TAGTGCTGGT 


CAATCAACTC 


TTCTGAAACC 


6480 


TGTGAACGAA 


ACTCCAATTT 


TTCCATTGTA 


AATCCACGTT 


GTTCGATGCG 


CTTTAACACT 


6540 


TCACCCACTA 


GCCCTCTTTT 


TACACCATCT 


GGTTTGATGA 


TAAAGAATGT 


TTGTTCCATA 


6600 


CCCGTCTCCT 


TTGTCAGCTT 


CTTTCTTTTA 


TTTTACCACA 


TTTCGTGGAA 


AAATGGAGAA 


6660 


AGTTTTCAGA 


AGAGAGAATG 


AGAGAACCCT 


CGGGTTCTCT 


CATTCTCTCT 


TATTCTACTG 


6720 


TTTCTTCCAC 


AGTTTCAACG 


GCAGTATCCA 


CAACTAC7TC 


TGTTGTTTCT 


TCATTTCCTT 


6730 


CTTCCTCTAC 


TGGAGGATTA 


AGGTATTCTT 


CTTCCTTGAC 


ACCATGTGGT 


TCAAGGTTAC 


6840 
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GCTAACGCGC CATACCAGTA CCACCTGGCA TGATCTTACC GATGATAACA TTTTCTTTAA 6900 

GTCCAAGGAG ATGGTCTTTC TTACCACGGA TAGCTGCGTC ACTAAGCACA CGAGTTGTTT 6960 

CCTGGAAGGA AGCCGCTGAC AAGAAACTGT TTGTTTCAAG TGAGGCTTTG GTAATTCCCA 7020 

TAACGACTGG GCGACCTGTC GCTGGAACTC CACCTGCGAT AAGGACATCT TTGTTGGCAT 7080 

CTGTAAAGTC ATTGATATCC ATGAGGGTAC CCATGAGAAG ATCTGTATCA CCTGGATCCA 7140 

TGACACGGAC TTTACGGATC ATTTGACGAA CCATTACCTC GATGTGTTTG TCACCGATTT 7200 

CTACCCCTTG GCTACGGTAA ACTTTTTGTA CTTCACCGAG AAGGTACGTT TCAACTGACA 72 60 

AGACATCACG AACTGCAAGG AGACGTTTTG GTTGGATAGA ACCTTCTGTC AGAGCAGCAC 7 320 

CACGCGCTAC TTGGCCCCCA ACTTCGACAC GCATACGAGC TGTAAATGGA ACGACATATT 7 3 80 

CACCTTCGCC AGTTTCACCC TTAACAAAGA CTTTCTTGGT ACGAGTTGAT GCATCTTCTT 7440 

CGATAGCAGT AACTTGTCCT TTAACCTCTG TAATAACCGC TTCCCCTTTA GGATTGCGGG 7500 

CTTCAAAGAT TTCTTGGACA CGAGGAAGAC CCTGAGTGAT ATCGGTATTT GAGGCAACCC 7560 

CACCTCTGTC GAAGGTACGC ATTGTAAGCT GTGTACCAGG TTCCCCGATA GATTGGGCAG 7 620 

CGATTGTACC AACTGCTTCA CCAACTTCAA CCGCATCACC AGTCGCCAAG TTGATACCGT 7680 

AACAGTGACG GCAGACACCG TGACGAGTGT TACATGTAAA TACAGAACGG ATAGTCACTT 7740 

CTTCCACACC AGCATTGACA ATTTCACGCG CCTTCTCTTC TGTAATCAAT TCATTTGGAC 7800 

CAATAATCAC TGCACCAGTT TCTGCATGTT TAACAGTTTT CTTACTGTAA CGACCGTTGA 7860 

GACGCTCTTC GAGAGACTCG ATCATCTCTT TTCCTTCTGC GATAGAACGG ATCAAGAGAC 7920 

CACGGTCAGT TCCACAGTCG TCCTCACGCA TGATAACGTC TTGGGCAACG TCGACCAAAC 79 80 

GACGAGTCAA GTAACCTGAG TCGGCTGTCT TAAGGGCCGT ATCGGTCATA CCTTTACGAG 8040 

CACCGTGAGT TGAGAAGAAC ATTTCCAATA CCGACAAACC TTCGCGGAAG TTTGAAAGGA 8100 

TTGGCAATTC CATGATACGT CCATTCGGAG CAGCCATCAG ACCACGCATA CCGGCAACCT 8160 

GTGAGAAGTT TGAGATGTTA CCACGGGCTC CAGAGTCCAT CATCATAACG ATTGGGTTCT 8220 

TAGGATCTTG GTTAGCAATC AAGCGTTTCT CAAGTTTTTC ACGGGCAGCA CCCCATTCAG 8280 

CTGTAACAGC ATTGTAACGC TCGTCGTCTG TGATCATACC ACGACGGAAT TCTTTGGTGA 8 340 

TTTGTTCGAC ACGTTTCTGT GATTCTTCAA TGATTTCAGC CTTGTCATCA ACGACTGGGA 8400 

TATCGGCAAT ACCCACTGTC AATCCTGCAA GACTTGAGTG GTGGTAACCG AGGTTCTTCA 8460 

TGCGCTCAAG TAGGGCAGAA GTTTCTGTCG TACGGAAACC TTTGAAGATT TCAGCGATCA 8520 

TATTTCCAAG GTTTTTCTTC TTGAATGGAG GGTTGAGCTC AAGATTGCTG ATAGCTTCCT 8580 

TGATATCTCC ACCAAGTGGC AAGAAGTATT TAGCTGGAAC ACCTTC7CTC AAGTTGGCAT 8640 
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TGTTTGGTTC 
CAACTGTTGT 
CTGTTGCCAT 
CTTCGTCACG 
AGTAGTACTT 
GGTTCAAGAT 
AAAGTGGTAC 
CAAGTGCGTG 
CCAAACGGTG 
CAAGGATATC 
TTTGCACGAT 
CCATTTCACG 
AACGTCCTGA 
CTTTAAGCAT 
CACGACGACC 
GAACGATGAT 
TAACACGGCG 
ACATTCGACG 
TGTTTCCAGA 
TTTGTCCAGT 
CT7GCTTCAA 
CATATTCACG 
GTCTATCCTT 
GAGGGCTCAT 
GAGATACAGG 
TTACTTCAAC 
CACAAGCACA 
GTTCTGGTTT 
ATGAACGGAC 



TTGCAAGTAT GGTAGCCCCT 
AAGCAAGACC TTATGTCTTT 
ACCAACACGT GAGTGGACGT 
GTCTTTGAAG ACCATTCCTT 
ACCCAAAACC ATGTCCTGAG 
GTGCTCAGCA GCTAGCATGA 
GTGGATGGCC ATTTGGTCCC 
CAAGCGAAGA GCCTTACCAT 
AAGGGTCGGT GCGCGGTTCA 
CCAGATACGC TCATCTCCGC 
ATCACGGGCA ACGATTTCAC 
CGGCACACCA CATTCGTACA 
GAAGTCAACA CGTTTACCGA 
GTGGCTCAAT GATTTCAATG 
ATTGTCAATC AAAGCGTCAA 
ACCTGGTGCA TTTAACTCAA 
GTAAAGGTCA TTCAAGTCAG 
AAGATCTGGT GGGATAACCG 
CTTGTAAAAG GCATCCAAAA 
AGCTGTTTTC AATTCTTCTT 
AAGGTCTTGG ATGGCTTCCG 
CAAGCGCTCT CCCTATTCCC 
AGGATCAATC A CC AC AT AAG 
ATCAAGGGTC AAGCCCATAC 
AGCTTTCAAT TCGATATGTC 
CCCACAGCGG TCACAAACAA 
TTCCCAGTCT TTTGTAGCAC 
CAAGGTACGA TAATTGATTG 
TTTACTTGGA GAAGCTAGGG 



811 

C7GCCATCAT ATCGTTGAAG 
GCTCTTCTGT CCAAGGCTTG 
GAACATAACC ATTGCGGTAA 
CACCTTCGCG ACCAGCTTCT 
ATGGAGTAAC TACCGGTTTC 
GGATACGAGC TTCTGCTTGT 
CGTCAAAGTC AGCATTGTAG 
CAATCAAGAC TGGCTCGAAG 
AAAGCACTGG GTGTTCTTTA 
GTTCCACCAA GCGTTTAGCT 
GCATCACAAA TGGTTTAAAG 
TCTTAAGAGT TGCACCAACG 
GCAAGTTTTG ACCGAAGCGT 
GACGGCTACC TCCTCCTCTC 
CTGCTTCTTG AAGCATACGC 
GCAAACGAGC CAAACGGTTG 
ATGAGCCAAA ACGGCCACCA 
GAAGGATGTT AAGAATCATC 
CATCCAAACG ACGGATGGCT 
TGAGTTCAGC AATTTCTTTT 
CACCCATCTT GGCAACAAAT 
GCTCTGTCAT GATAGACTTG 
CCCCAAAGTA GATAACTTCC 
GGCTTGGAAT CCCCTTGAAG 
CCATACGCTC ACGACGAACT 
TTCCTCTGTA ACCAATCCGT 
CAAAGATCAC TTCATCAAAG 
TTTCAGGTTT TTTGACTTCT 
TGATTTGCAT ACTTTTAAAA 



AGAATTTTAC 8700 

TTCACGCTCT 8760 

GCCATAACCG 3820 

TCCATGGTCA 8880 

CCATCTTTCG 8 940 

GCTTCTTCTG 9000 

GCTTCACAGA 9060 

GCTTGGATAC 9120 

ATCACTTCTT 9180 

GCTTTGACGT 9240 

AGTTCAATCG 9 300 

GCCATAACTG 93 60 

CCTTGTTTAC 94 20 

ATTGG A CG AC 9480 

TTCTCATTTT 9540 

TTACGGTTGA 9600 

TCCAACTGCA 9660 

CATTCAGGTT 9720 

TTGACACGCT 9780 

TCAAGATCTA 984 0 

GAACCATAAC 9 900 

TGCTCAAGTG 9960 

TCCAGGGCAC 10020 

TACCAGATGT 10080 

TTCCTACGCG 10140 

TTGTACTTAC 10200 

AGTCCTTCAC 102 60 

CCATAAGACC 103 20 

CGATTTACAT 10 380 
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CAACCACTAT TTCTTCCCTT TCTATTCTAA GTGAACTGCT TATTCTTGTT CAGCAGCTTC 10440 
TTCTGTTGCT TCCGCTTTTG TTGCTTTCTC AGCTTCTTCA CCTTCAAACG CTGCTTTAGC 10500 

CTCTTGGGCT GCTTTTTCGC GGGCTTTTTC AAGGTCATCT ACGTGGATGA CATCTTCGTC 10560 

CATTCCTTCA TCCAAGTCGC GAAGTTCCAC TTCTTGGTCA TCTTCGTCTA GGACACGCAT 10620 

GTCAAGACCA AGAGATTGCA ATTCTTTGAC AAGAACTCGG AAGGATTCTG GAACACCTGG 10680 

TTTTGGAATT CGTTTGCCTT TTGTAATAGC TTCATAGCCT TTCAAACGTC CGTTGATATC 10740 

GTCCGACTTG TAAGTCAAGA TTTCTTGAAG GACATTTGAC GCACCGTACC CTTCAAGAGC 10800 

CCAAACCTCC ATCTCACCGA AACGTTGTCC ACCAAACTGA GCCT7ACCTC CGAGTGGTTG 10860 

TTGGGTAACA GTTGAGTATG GTCCGACTGA ACGCGCGTGC AATTTATCAT CAACCATGTG 10920 

GTGGAGTTTG ATCATGTACA TGACTCCGAC AGAAACACGG TTATCAAACG GTTCACCAGT 10980 

ACGTCCATCG TAAAGGATCG TTTTGGCATC GCTATCCATA CCTGCTTCTT TAACACTTGA 11040 

CCAAAGATCT TCAGAACTTG CTCCATCAAA GACTGGTGTA GCCATCTGAA TACCAAGAGT 11100 

ACGAGCTGCC ATACCAAGCT GAAGCTCCAT AACCTCACCG ATATTCATAC GTGATGGTAC 11160 

CCCAAGTCCG TTCAACATGA TGTCGACTGG AGTTCCGTCT GGAAGGTAAC GCATGTCTTC 11220 

TACAGGAACG ATACGAGAGA CAACCCCTTT GTTTCCGTGA CGTCCGGCCA TTTTATCTCC 11280 

GACCTTAATC TTACCTTTTT GAGCCATGTA AACACGAACC AACATGTTAA CACCTGATTG 11 J 40 

CAACTCATCT CCATTTACAC GTGTAAACA7 CTTAACATCA CGAACGACAC CATCGGCACC 11400 

GTGTGGTACA CGAAGAGAAG TATCACGCAC TTCACGAGAC TTGTCTCCAA AGATAGCGTG 11460 

CAAGAGACGT TCTTCAGCTG AAAGATCTTT CTCACCCTTA GGTGTTACTT TACCTACAAG 11520 

AATATCACCT TCTTTAACCT CAGCACCAAT ACGGATAATC CCCATTTCGT CAAGGTCTTT 11580 

GAGGGCATCT TCACCAACGT TTGGAATTTC GCCAGTGATT TCTTCAGGCC CAAGCTTTGT U640 

ATCGCGCGTT TCTGATTCGT ATTCTTCAAG GTGAACAGAT GTGTAGACAT CGTCCTTCAC 11700 

CAAGCGTTCG CTCATGATAA CGGCATCCTC GAAGTTGTAA CCTTCCCAAG TCATGTAGGC 11760 

AACGATTGGG TTTTGTCCAA GCGCCATTTC TCCATTTTCC ATAGAAGGTC CGTCAGCGAT 11820 

GAAATCGCCT TTTTCAACGA CATCACCAAC TTTTACGAGA CTGCGTTGGT TGTAAGCAGT 11880 

ACCTGAGTTT GAACGACGGA ATTTTTGGAT GTGGTAAACA TCCAATGAAC CATCTTCACG 11940 

ACGAACTTCT ACCTTGTCAG CATCTGCGTA AGTAACTTTA CCATCATACT GAGCAATCAC 12000 

AGCCGCACCA GAATCGTGGG CTGCTTGCTA TTCCATACCA C7ACCAACCT AAGGTGCCTG 12060 

AGGATTAATC AATGGCACAG CCTGACGTTG CATATTGGCT CCCATGAGGG CACGGTTGGA 12120 

GTCATCGTTT TCCAAGAAAG GAATACATGC TGTCGCAACG GCAACTACCT GTTTTGGTGA 12180 
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AACGTCCATG TAGTCAACAA TATTAGCTGG ATACTCTTGG TTGACCCCTT GGTGACGTCC 1224 0 

CATGACAATC TTCTCAGCAA ACCTTCCA7C TTCATTCAGA CGAGAGTTAG CCTGAGCTAC 12 300 

AGTATATTCA TCTTCTTCAT CAGCTGTCAA CCAAACAATT TCGTTCGTGA CAACACCTGT 123 60 

TTCACGGTCA ACCTTACGGT ATGGTGTTTG AACAAAACCA TATTTGTTCA AGTGTCCATA 12 420 

AGATGACAAG TTATTGATCA AACCGATGTT AGGTCCTTCA GGTGTCTCGA TTGGACACAT 12 480 

ACGACCATAG TGAGTGTAGT GCACGTCACG TACTTCATAT CCAGCACGCT CACGAGTCAA~ 12 540 

ACCACCAGGT CCTAAGGCTG ACAAACGGCG TTTGTGAGAC AACTCAGAAA GCGGGTTGTG 12 600 

TTGGTCCATG AACTGTGACA ACTGTGATGA ACCAAAGAAT TCTTTAACTG CAGCTGTTAC 12660 

AGGACGGATA TTGATAATTT GTTGTGGTGT CAAGACTTCA TTCTCCTCAA CAGACATACG 12720 

TTCACGGACA TTACGTTCCA TACGAGAAAG TCCCAAACGT ACTTGGTTCG CAAGCAATTC 12780 

ACCAACCGCA CGGATACGAC GATTTCCAAG GTGCTCGATA TCATCTACAC GGCCAAGTCC 12840 

TTCAGCCAAG TTGAGCAAGT AGCTCATCTC AGCAAGGATA TCTCCAGCAG TCACCGTACG 12900 

AACCTTGTCA TCTGGGTTAG CATTACCAAT GATCGTTACG ACGCGATCTG GATCAGTTGG 12960 

AGCAATAACC TTCAATTTTT GAAGAACAAC ACGCTCAGTC ACAACGGCTC CATCGTTTGG 1302 0 

GATGTAGACA ATCTTGTTCA AGTCGCCATC CAAATCCCTT TCAATGCTTT CAATCACGCT 13080 

ACGAGTCATA ATCGTACCAG CTTCTACCAA GATTTCTCCA GTTTCAGGGT CTACCAATGG 13140 

CTCTGCAATG GTTTGGTTGA GCAAACGTCT TTTAACATTG AGTTTTTTAT TGATTTTGTA 13 200 

ACGACCAACT GCTGCCAAGT CATAACGACG TGGGTCAAAG AAGCGAGCTA CAAGCAAGCT 13260 

ACGTGAGCTT TCAGCCGTCT TAGGCTCACC TGGACGAAGG CGTTCGTAAA TTTCTTTCAA 13 3 20 

GGCTTCGTCT GTACGAGAGT CCATTGGATT CTTGTGGATA TCTTTTTCAA CAGTGTTGCG 13 380 

AACCAATTCG CTGTCACCAA AGATATCAAA GATTTCATCA TCACCTGAGA AACCAAGAGC 1344 0 

ACGAACCAAG GTTGTAAATG CAATCTTACG AGTACGGTCG ATACGAGTGT AGGTGATATC 13500 

TTTTGAGTCG CTTTCAAGTT CCAACCAAGC TCCACGGTTA GGGATAACAC TTGAACCATA 13 560 

GCCCACCTTA CCATTTTTGT CTACTTTGTC GTTAAAGTAA ACACCTGGTG AGCGGACCAA 13620 

CTGAGAAACG ATAATACGTT CACCACCATT GATGATGAAA GTACCCATTT CTGTCATGAT 13 680 

TGGGAAATCA CCAAAGAAAA CTTCTTGGGT CTTGATTTCG CTTGTTTCTT TATTGATCAA 13740 

ACGGAACCTT ACAAAAATTG GTGCTGAGTA GCTAGCATCG TGGATACGAG CTTCTTCTAG 13800 

CGTATATTTT GGTTCCTTGA TTTCATATCC AACAAATTCC AACTCCATTG TCTCTGTGAA 138 60 

GTTTGAAATT GGCAATACAT CTTCAAACAC TTCCTTAAGA CCGTGGTCTA GGAAAGCTTT 13 920 
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GAATGAGTCA GTTTGAATTT CAATCAAATT TGGTAAGTCA AGAACTTCTT TGATTCTTGA 13 980 

AAAACTACGA CGGGTACGAT GTTTCCCCTA TTGAACGTCA TGTCCTGCCA AGATGATTCT 14040 

CCTTTGTAAA TAAGTTCCAA GCCTTGTCAA TCAGGCTTTT CTAATCGTCA TATGGTTGTA 14100 

AACCCCTTAT CACCGTGTCC TCTTGACGAA TTTTCAGAAT CTTTAAGCCT CTGTTACAAA 14160 

TGCTCAAAAT CTTGAAAAAA AGCACAAAAA GAGCAGCTAA ATCTGACTTT TTCAGAAGAT 14220 

TTAACTGCTG TGACCCTTGT CTGGACAATA TTTCAGACAA AACCTACGAC AAATGATTAC 142 80 

C CAT ATT AT A CCCTATTTAG CTAGATTTTT CAAGGGGTTT CACTAGCTTT TTGGTAAATT 14340 

TTTTCCCATA GAAAACTTGG CATCACATTC GAATCACGCT ATGGTACAAA AAACTGAAAA 14400 

AACTATTGAC TGAAAATCAT TTTCAAGGTA TAATAATAAA CGTTAAGGCG GTATAGCCAA 14460 

GTGGTAAGGC ACGGCTCTGC AAAAGCTTGA TCGTCGGTTC AAATCCGTCT ACCGCCTTCT 14S20 

ATAACTTGAT TTATCAGCTT TCAAATGAAC AGAAAGCCCA ATTTGAAGGG CTTTTTTTAT 14580 

TTTCCCTCGA ATAAATACGT ATAACTTTAA AAACTTTTGG ACCGAGTTTC TGCCAGAGTT 14 640 

CTTTCCATGG CATAATTCCC TTTTGAAATC AG 14672 



(2) INFORMATION FOR SEQ ID NO: 112: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7902 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

ACGAGACTAT TCAAGCCCAA ATTgAGTAGC CCAGCAAAGA CTCTATAGAC TGTGATACGT 60 

TTTTCATAGC CATTGGTAAA GAGAATTTGG GAACCAAGAA TGCTATCTAA GGCCAGGATA 120 

ATCGTACGAA AAGCGAAGAG AGAGGTCAAG ATGCCGCCTC CGATATATTT TTCACTACCG 180 

TAAAGTAGGA TGGCATTTGG TCCTAAAACC ATGAGTCCAA AACTCAGTGG AATGATAAAG 240 

AAGTTAAAGA TTCGACTACC TCTATTAACC AGAGAAACAT AGCCTTCTTT GTCTCCTTTC 300 

CCCAGATAGT AACTGAGACG AGGCACACTC ACTCCAATTG CACCTGTTAC AACCCCAGCT 3 60 

ATAACGGTCA CAATTCGCTG AGCTATGGTA TAGTAACTAA CGTTGACATC AATCCCTGTT 420 

TTAACGAGGA AGAGGCGATC TAAAAAAGTG AAGAGCATAT TGGCATTGGC AAAGACTAAC 480 

ATGGCTCTCA GAGGGAGAAA GAGTGGTTTA AAATC ACTT A GGTGAATTTT AACAAGTTTG 540 

ATGTCTCTTT TAATCCAAAA ATAACTAATC AGGTAGTTAA TCAGCGTCGA TAAACTCATC 600 

ACAAGTGTAT AGACAACAAT ATCGTGTTCA TTTTTAACAA ATAAGAAAAT AGAGACCAGC 660 
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ATCAGGATAC 


GGATGAAGCC 


AGTTTTGTAA 


AAGAGAAAAC 


TGTAATTTTC 


CAGAGCTTCA 


lies 


TTCACCCATT 


CGATTGAAAA 


AATCTGGGCA 


ATGAGTTGAA 


TCCCCATAAC 


AAGGTAGACC 


7 80 


TTTTTCACCA 


TTGGATTATC 


AGTAAAGAAG 


AGAGGATAGG 


CTACGATATA 


GACAGCAGTr: 




GTCAAAATCG 


TACAAGCGAT 


GCACAAATAA 


AAAAGACTAG 


AAAAGGTTCT 


GTTAACATPT 




TTTTTGTTAT 


CCTTGACATT 


ACTGATAGCC 


CTTAAACCGT 


AGTTATAGAC 


ACCATAAGTT 




"GCAAAGGGCA" 


~ AGAAAAATG A 


CAAAATAGTG 


TCGACTGAGT 


TGAAGTAACC 




1020 


CGGTCCAAGA 


CACGCGCGAC 


ATAGGTTCCA 


GTTAGGATGG 


GAAAAATAAT 




iUDU 


CCAATTCCCA 


TGTAAGATAG 


AGCATTTAAT 


TTTATACTTT 


TCATTCAATT 


i nv,u i \m\j ill 


l i a n 


TTCATTATAT 


CATAAAGTTA 


GCTAATAAGA 


AATGAAGGGC 


AGTAAGTCAA 


GTAATCACTT 


i J fin 


TGAAGTTTCA 


AATCTTAAGT 


TTTAAGTTTT 


CTTTAAGGAA 


AGTATATTAT 


TCTCAAfiCAf* 


1 £ QU 


TCTAAAATTT 


CGCAGCCATT 


TATTAGTAAT 


TGCTACACAA 


TTCCTAGTCA 


TT ACT Ad AAA 


t inn 


TGGACTAGTT 


TCTTTGAATA 


ATAGAACTGC 


ATAATTCTCC 


TATTCTAGAA 


GCGC, AC*C1 ACC 


X JO u 


AGTATTTCTT 


TTATGATAGG 


ACTAGATTCT 


GGTATAATAG 


AGAGAATAAG 


4 1 t I 1 1 4 n\j I 


1 d j n 


AAGACAAAGG 


AGAAAA7AGA 


TGATTTATGC 


AGGAATTCTT 


GCCGGTGGAA 


CTCICIC AC AfT. 




CATGGGGATC 


AGTAACTTGC 


CAAAACAATT 


TTTAGAGCTA 


GGTGATCGAC 


CTATT^PTfiAT* 




TCATACAATT 


GAAAAATTTG 


TCTTGGAGCC 


AAGTATTGAA 


AAAATTGTAG 


TTfIf»*T*fl"T*rC A 

4 4 ^>VJ 4 w 1 i ^ 


1620 


TGGAGACTGG 


GTTTCTCATG 


CAGAAGATCT 


TGTAGATAAA 


TATCTTCCTC 


TTTATAAGGA 


1QOU 


ACGTATCATC 


ATTACAAAGG 


GTGGTGCTGA 


CCGCAATACA 


AGTATTAAGA 


ACATCATTGA 


1740 


AGCCATTGAT 


GCTTATCGTC 


CGCTTACTCC 


AGAGGATATC 


GTTGTTACCC 

W 4 4 w 4 4 WW 


ACCATTCTGT 

f^VaWn 4 4^4 *J 4 


1 800 


TCGTCCATTT ATTACACTTC 


GCATGATTCA 


GGACAATATC 


CAACTTGCCC 


AAAATCATGA 


1860 


CGCAGTGGAC 


ACAGTGGTAG 


AAGCGGTTGA 


TACTATCGTT 


GAAAGTACCA 


ATGGTCAATT 


1920 


TATTACAGAT 


ATTCCAAATC 


GTGCTCACCT 


TTATCAAGGA 


CAAACACCTC 


AAACATTCCG 


1980 


TTGCAAGGAC 


TTCATGGACC 


TTTATGGATC 


TCTTTCTGAT 


GAAGAGAAGG 


AAATCTTGAC 


2040 


AGATGCATGT 


AAAATCTTTG 


TGATCAAAGG 


AAAAGATGTG 


GCTTTGGCCA 


AACGTGAATA 


2100 


CTCAAATCTG 


AAGATTACAA 


CCGTAACAGA 


TTTGAAGATT 


GCAAAAAGTA 


TGATTGAGAA 


2160 


AGACTAGTAA 


AATGATTAAT 


CAAATTTATC 


AACTAACTAA 


GCCTAAGTTT 


ATCAATGTCA 


2220 


AATATCAGGA 


AGAGGCTATT 


GACCAAGAGA 


ATCATATCCT 


TATCCGTCCC 


AACTACATGG 


2280 


CTGTCTCTCA 


TGCCGATCAG 


CCTTACTATC 


AGGGAAAACG 


TGATCCCAAG 


ATTTTGAATA 


2340 


AAAAGCTTCC 


AATGGCAATG 


ATTCACGAGT 


CATGTGGAAC 


CGTCATTTCT 


GACCCGACCC 


2400 
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GAACCTACCA GGTTGGTCAA AAAGTTCTCA TGATTCCCAA TCAGTCTCCT ATGCAGAGTG 2460 

ATGAAGAATT CTATGAAAAC TACATGACAG GGACCCATTT CTTGTCTAGT GCATTTGATG 2520 

GCTTTATGAG AGAGTTTGTT TCTCTCCCTA AAGATCGTGT GGTGGCTTAT GATGCTATTG 2 580 

AAGATACGGT TGCAGCCATT ACAGAGTTTG TCAGTGTGGG CATGCACGCT ATGAATCCTC 2 640 

TATTGACTCT TGCTCATAGC AAGCGGGAGC GCATCGCCGT TATTGGAGAT CGAAGTTTAG • 2700 

CTTTTGTCGT TGCCAATATT ATCAACTATA CTTTGCCAGA AGCAGAGATT GTGGTTATTG 2760 

GTCGTCATTG GGAAAAGTTG GAACTCTTCT CATTTGCCAA AGAATGCTAT ATTACGGATA 2820 

ATATTCCTGA AGATTTGGCC TTTGACCATG CTTTTGAATG TTGTGGTGGT GATCGTACTG 2 890 

CACCAGCTAT TAATGACTTG ATTCGCTACA TTCGTCCTCA GGGAACGATT CTCATGATGG 2940 

GAGTTAGCGA ATATAAAGTC AATCTCAATA CTCGCGATGC CTTAGAAAAG GGCTTGATTT 3000 

TGGTTGGCTC ATCTCGTTCT GGTCGCATTG ATTTTGAAAA TGCTATCCAA ATGATGGAAC 3060 

TCAAGAAATT TGCCAATCGT CTTAAAAATA TCCTTTATCT AGAAGAACCT GTAAGAGAAA 3120 

TTAAAGATAT TCATCGTGTC TTTGCAACCG ATTTAAACAC AGCCTTTAAA ACAGTGTTTA 3180 

AGTGGGAAGT ATAAGTACTG GAGGTTAATT GTGGAGAAAA TCATTAAACA AAAAATTTCT 3 240 

TCCTTACTTA GTCAAGAAGA GGAAGTCCTC AGTGTTGAAC AACTGGGTCG AATGACCAAT 3 300 

CAAAACTATT TGGCCAAAAC AACAAATAAG CAATACATTG TTAAATTCTT TCGTAAAGCG 3360 

ACAGAAAAGC TTATCAATCG ACAAGATGAA AAGTACAATC TTGAACTACT AAAGGATTTA 3420 

GGCTTAGATG TAAAAAATTA TCTTTTTGAT ATTGAAGCTG GTATCAAAGT AAATGAGTAT 3480 

ATCGAATCTG CGATTACGCT TGATTCAACG TCAATCAAGA CCAAGTTCGA CAAAATTACT 3540 

CCAATATTAC AAACTATTCA TACGTCTGCT AAGGAATTAA GAGGAGAATT TGCTCCTTTT 3 600 

GAAGAAATCA AAAAATACGA ATCCTTGATT GAAGAACAAA TTCCTTATGC CAACTATGAA 3660 

TCTGTTAGAA ATGCAGTCTT CTCCTTAGAG AAAAGACTGG CTGACTTAGG TGTTGACAGA 3720 

AAATCTTGTC ATATCGATTT GGTGCCTGAA AACTTTATCC AATCACCTCA AGGACGACTT 3780 

TATTTCATTG ACTGGGAATA TTCATCAATG AATGATCCAA TGTGGGATTT GGCTGCCCTC 3 840 

TTTTTAGAGT CTGAATTCAC TTCCCAAGAG GAAGAAACTT TCTTATCTCA CTATGACAGT 3900 

GACCAAACAC CGGTTTCTCA TGAAAAGATT GCTATTTATA AAATTTTACA AGATACTATT 39 60 

TGGAGTCTAT GGACTCTC7A TAAGGAAGAG CAAGGTGAAG ATTTTGGTGA CTATGGTGTG 4020 

AATCGTTACC AAAGAGCTAT TAAAGGTTTG GCTTCTTATG GAGGTTCAGA TGAAAAGTAA 4 080 

AAACGGAGTT CCTTTTGGCC TTCTCTCAGG TATTTTCTGG GGCTTGGGTC TAACGGTTAG 4140 

TGCTTATATC TTTTCGATTT TTACAGATTT GTCACCCTTT GTGGTGGCTG CAACTCATGA 4 200 
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TTTTTTGACC ATCTTTATCT TACTAGCTTT TCTCTTGGTA AAAGAAGGGA AAGTTCGCCT 4 2 60 

CTCAATTTTC TTAAATATTC GCAATCTCAG TGTTATCATC GGACCCTTGC TAGCAGGCCC 43 20 

TATCGGTATG CAGGCCAATC TTTATGCAGT TAAGTATATC GGAAGTTCTT 7ACCTTCATC 43 80 

TGTATCGGCT ATTTACCCTG CGATTTCAGT TCTATTGGCT TTCTTCTTTT TGAAGCACAA 4 440 

GATTTCGAAA AATACTGTAT TTGGGATTGT CTTGATTATT GGAGGGATTA TTGCTCAGAC 4500 
CTATAAGGTT- GAACAGGT7A ATTCTTTCTA CATTGGGATT XTTTGTGCTT TG GTTTGTGC " 4 560* 

TATTGCATGG GGAAGTGAGA GTGTTCTTAG CTCTTTTCCC ATGGAAAGTG AATTGAGTGA 4 620 

AATCGAAGCC CTCTTAATCC GTCAAGTAAC TTCGTTCTTG TCCTATCTTG TGATTGTGCT 4 680 

CTTCTCTCAT CAGTCATTTA CTGCAGTAGC CAATCGACAA TTGCTAGGTC TCATGATTGT 4 74 0 

TTTTCCAGCC TTTGATATGA TTTCCTACTT GGCTTATTAT ATCGCTATCA ATCGCTTGCA 4800 

ACCAGCCAAG GCTACAGGCT TGAACGTGAG C7ATGTAGTA TGGACGCTCT TGTTTGCAGT 4 860 

TGTTTTCTTG GGTGCACCGC TAGATATGCT GACCATTATG ACGTCACTTG TCGTCATTGC 4920 

TGGAGTTTAT ATTATTATTA AAGAATAAAG CACATTCCTG TGAAAGCCAT TATCTTAGCA 4 980 

GCGGGATTGG GAACTCGCTT GCGTCCTATG ACTGAAAATA CCCCTAAAGC CTTGGTTCAG 504 0 

GTTAATCAAA AACCTTTGAT TGAGTACCAA ATTGAGTTTC TCAAAGAAAA AGGAATCAAT 5100 

CACATCATCA TCATTGTTGG TTATCTTAAA GAACAATTCG ATTACTTGAA AGAGAAATAC 5160 

GGTGTTCGTC TCGTTTTCAA TGATAAATAC GCTGACTACA ATAACTTTTA CTCTCTCTAT 5220 

CTTGTAAAAG AAGAATTGGC CAACAGCTAT GTTATTCATG CTGACAATTA TCTCTTTAAA 52 80 

AATATGTTCC GCAATGATTT GACACGTTCG ACTTATTTTA GTGTTTATCG TGAAGATTGT 5 340 

ACCAACGAAT GGTTCTTGGT TTATGGAGAT GACTACAAGG TTCAAGACAT TATTGTTGAT 54 00 

AGCAAGGCAG GTCCCATCCT TAGTGGTGTA TCCTTCTGGG ATCCTCCAAC TGCAGAAAAG 54 60 

ATTGTCAGCT TTATCGACAA GGCTTATGTA AGTGGTCAAT TTGTTGATCT CTATTGGCAC 5520 

AATATGGTTA AGGATAATAT CAAAGAGCTA GATGTCTATG TTGAAGAATT AGAAGGCAAT 5580 

AGCATTTATG AGATCGATAG TGTCCAAGAC TATCGTAAAT TAGAACAAAT TCTTAAAAAC 5640 

GAAAATTAAA GATTCCAACA TCTGACAAAA TAGTCGGATG TTTTTTCATT TTTTACGAAC 5700 

TTTTACGAAT AGATAGATGA GTAGAAAAAG AAATGGAGTT ATTTATGAAA ATCACAAACT 57 60 

ATGAAATCTA TAAGTTAAAA AAATCAGGTT TGACCAATCA ACAGATTTTG AAAGTGCTAG 5320 

AATACGGTGA AAATGTTGAT CAGGAGCTTT TGTTGGGTGA TATTGCAGAT ATCTCAGGTT . 5880 

GCCGTAATCC AGCCGTTTT7 ATGGAACGTT ATTTTCAGAT AGACGATCCG CATTTGTCGA 5940 
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AAAATTTCCA 


TCTTTCTCTA 


nwtivwn ft ^* ft ft 

TTTTAGATGA 


CTGTTATCCT 


TGGGATTTGA 


6000 






GTACTTTTAT 


TTTACAAGGG 


AAATCTTGAC 


CTCCTGAAAT 


6060 




AGCGGTCGTG 


GGCAGTCGTG 


CTTGTAGCAA 


ACAGGGAGCT 


AAGTCAGTTG 


6120 


AAAAAGTCAT 


TCAAGGCTTG 


GAAAATGAAC 


TGGTTATTGT 


CAGTGGTCTG 


CCCAAGCGCA 


6180 


TTGACACAGC 


AGCTCATATG 


GCAGCTCTTC 


AGAATGGCGG 


AAAAACCATT 


GCAGTGATTG 


6240 


GAACAGGACT 


GGATGTGTTT 


TATCCTAAAG 


CCAATAAACG 


CTTGCAAGAC 


TACATCGGCA 


6300 


ATGACCATCT 


GGTTCTAAGT 


GAATATGGAC 


CTGGTGAACA 


ACCTCTGAAA 


TTTCATTTTC 


6360 


CTGCCCGTAA 


TCGCATCATT 


GCTGGACTTT 


GTCGTGGTGT 


GATTGTACCA 


GAGGCTAAGA 


6420 




TAGTCTCATT 


ACGTGTGAGC 


GAGCAATGGA 


AGAAGGACGC 


GATGTCTTTG 


6480 


CTATTCCTGG 


TAGCATTTTA 


GATGGACTAT 


CAGACGGTTG 


CCATCATTTG 


ATTCAAGAAG 


6540 


vjAUCAAAATT 


GGTCACCAGT 


GGGCAAGATG 


TTCTTGCGGA 


ATTTGAATTT 


TAAAAATGAC 


6600 


CTAAGCTAGA 


ATTCTAACAA 


AAAATCAATT 


TTAAGAGAAA 


ATGAACCCAA 


CATTTCCATA 


6660 


A I AAAACGCA 


TATTAGCAAG 


TTTTTAACAC 


TTGATAATAT 


GCGTTTTTTC 


TAAGTGGATT 


6720 


AGT AG AG TAG 


AGGATTTTTC 


TCATATAATA 


CTCTTCGAAA 


ATCTCTTCAA 


ACTACGTCAG 


6780 


CTTCCATCTG 


CAACCTCAAA ACAGTATTTT GAGCgaCTtC GTCAGTCTTA TCTACAACCT 


6840 


^ A ft ft. ^ ^ ft 

C AAAG C AGTG 


CTTTGAGCAA 


CCTGTGGCTA 


GCTTCCTAGT 


TTGCGCTTTG 


ATTTTCATTG 


6900 


AGTATAAGGG 


AAAGTATAGT 


GAATTGAAAT 


AAGATGTGAA 


CAACTCTATC 


AGGAAAGTCA 


6960 


AATTAATTTA 


TAGAAATATT 


TTAGCAGCCA 


AGGTCTACTC 


TTATAGATTC 


AATTACACTA 


7020 


TAATTTAGTG 


TAATTGAGAA 


AGGAGAAATG 


ATTGTGATTG 


ATGTTGGCTA 


GGTTATGTTC 


7080 


AATGAi ICCT 


ACCGTCTCAA 


ATCTTGTCAG 


TAAGGAAAAA 


TAAATTCTTC 


AAAAGTACAG 


7140 


ATTACAAGGC 


TTGTTTAAGA 


AAGAATTCAA 


AGACCTTGAC 


AAATAAAAAT 


AAAATGGTTA 


7200 


TTATAAAAAA 


TGGTCTGAAA 


TAGATGATGA 


TACTTTTCGA 


AAATCTCTTC 


AAATACGTCA 


7260 


Cj tTTC AGCTTT 


GCCTTGCTGT 


GTTTTGAGCA 


AGCTACGGTT 


AGCTTCCGAG 


TTTGATTTVC 


7320 


ATTTACTAGA 


AATGAAACTG 


ATGAGAGATA TCAGTAGACA 


TTTGACTCAG 


GATATTATGG 


7380 


AAAATGATAA 


AAAGAGCTCG 


TGAGATTGGC 


ATATCAGACT 


ACTAAAGTAT 


TGAGTTTGTT 


7440 


AGGATTTTAG 


CGACTAGTTA 


GCTGGGAAAG 


CAAGATATTT 


GTGACAAATA 


ATAAACTGTA 


7500 


TTCGTTGATA 


GAATTTAGAA 


ATAAAATATA 


TGAAGAATTA 


GAACTTTCCA 


GAAGTGATTT 


7560 


ACCGATTTTA 


CTATGTGCCA 


TGCTTATCGC 


CTCTATCGGA 


TTAAATATGG 


ATTCGACTCC 


7620 


CGTGATTATT 


GGACCCATGT 


TAATCTCTCC 


TTTGATGACA 


CCTATTCTGG 


GAGTGGGGCT 


7680 


CTCTCTAGCT 


ATATTTGATT 


TTAAATTGTT 


AAGAAAATCT 


TTTAAAATAT 


TAGCTATTCA 


7740 
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AATTCTTGCC AGTCTAATAG CTTCAACACT TTATTTTTAT CTTTCTCCCA TTTCGTATCC 7800 

TAGTTCGGAG ATTGTTGCTA GAACCTCTCC GACTATTTGG GATGTTCTCA TTGCTTTTCT 7860 

AGGACGGATA GCAGGTATCA TTGGTGCTAG GAAAAAAGAG AC 7902 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18627 base pairs 
- TBI TYPE: nucleic" acid 
(C> STRANDEDNESS : double 
{ 0) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

CAACTTGAAA TGGCCAGCTG ATGAGCAATA TCGGTCATAG AAATCTTCTC AATCAACTTT 60 

TGCGCAATTT TTTGGTTGAT AATACGAGGA ATTTGCTCAT TTTTCTTCAC GATAGAAGTT 120 

TCAGCGACCA TCATTTTTGA ACAGTGATAG CACTTGAAAC GACGCTTTCT AAGTAGAATT 180 

CTACTACCCA TACCAGTTCT CTCAAGGTAA GGAATCTTAG ACGGTTTTTG AAAGTCATAT 2 40 

TTCTTCAATT GGTTTCCGCA CTCAGGGCAA GATGGGGCGT CGTAGTCCAG TTTGGCGATG 300 

ATTTCCTTGT GTGTATCTTT ATTGATGATG TCTAAAATCT GGATATTAGG GTCTTTAATG 3 60 

TCTAGTAATT TTGTGATAAA ATGTAATTGT TCCATATGAA TCTTTCTAAT GAGTTGTTTG 420 

CTCGCTTTTC ATTATAGGTC ATATGCGACT TTTTTTCTAC AATAAAATAG GCTCCATAAT 4 80 

ATCTATAAGG GATTTACCCA CTACAAATAT TATAGAGCCA AAAATCCTTT GTTTACTAAA 540 

CAAGGGATTT TTCTTTTGTC TCTGCTCCTT TTTTGATATA ATAGTTCTAT GTTAAAATCA 600 

GAAAAACAAT CACGTTATCA AATCTTAAAT GAAGAATTGT CCTTCCTATT GGAAGGCGAA 660 

ACCAATGTTT TGGCTAATCT TTCCAACGCC AGTGCTCTCA TAAAATCACG TTTTCCTAAT 720 

ACCGTATTTG CAGGCTTTTA TTTGTTCGAT GGAAAGGAAT TGGTTTTAGG CCCCTTCCAA 780 

GGAGGTGTTT CCTGCATCCG TATTGCACTA GGCAAGGGTG TTTGTGGTGA GGCAGCTCAC 840 

TTTCAGGAAA CTGTTATTGT TGGAGATGTG ACGACCTATC TCAACTATAT TTCTTGTGAT 900 

AGTCTAGCTA AAAGTGAAAT TGTGGTGCCG ATGATGAAGA ATGGTCAGTT ACTTGGAGTT 960 

CTGGATCTGG ATTCTTCAGA GATTGAGGAT TACGATGCTA TGGATCCAGA TTATTTGGAA 1020 

CAATTTGTCG CTATTTTGCT TGAAAAGACA GCATGGGACT TTACGATGTT TGAGGAAAAA 1080 

TCTTAATCTA TCAAGCACTT TATCGAAAAT ATACAAGT.CA AAACTTCTCC CAGTTAGTTG 1140 

GTCAAGAAGT TGTGGCTAAG ACTCTTAAAC AAGCGGTGGA GCAAGAGAAA ATAAGTCACG 1200 
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CTTATCTTTT TTCTGGTCCT CGTGGAACCG GAAAAACCAG TGTTGCTAAA ATCTTTGCCA 12 60 

AGGCTATGAA CTGTCCCAAT CAAGTGGGTG GCGAACCTTG CAATAACTGC TATATTTGTC 13 20 

AAGCAGTGAC GGACGGTAGT TTAGAAGATG TCATTGAAAT GGATGCAGCT TCTAATAATG 1380 

GGGTAGATGA AATTCGCGAA ATTCGTGATA AATCTACCTA TGCGCCTAGC CTTGCTCGTT 1440 

ATAAGGTTTA TATCATAGAT GAGGTTCACA TGCTGTCTAC AGGGGCTTTT AATGCCCTCC 1500 

TAAAGACGCT GGAAGAACCA ACACAGAATG TAGTCTTTAT TTTGGCCACT ACTGAATTGC 1560 

ACAAGATTCC TGCTACTATT CTATCCCGTG TGCAACGTTT TGAGTTTAAA TCAATTAAGA 1620 

CACAGGATAT TAAGGAACAT ATTCACTATA TCTTAGAAAA AGAAAATATC AGTTCTGAAC 1680 

CAGAGGCTGT GGAAATCATT GCCAGACGGG CGGAAGGTGG AATGCGGGAC GCCTTGTCTA 17 40 

TTTTGGATCA AGCCCTGAGT TTGACACAGG GAAATGAGCT GACGACTGCT ATCTCTGAAG 1800 

AAATTACTGG CACCATTAGC CTATCAGCCT TGGATGATTA TGTGGCGGCC TTGTCTCAAC 1960 

AGGATGTTCC CAAAGCTTTG TCTTGCTTGA ATCTTCTTTT TGACAATGGT AAGAGCATGA 1920 

CTCGTTTTGT GACCGATCTT TTGCACTATT TAAGAGACTT GTTAATTGTT CAAACAGGGG 1980 

GAGCAAATAC TCATCATAGT TCAGTCTTTG TAGAAAATTT GGCACTTCCT CAAAAAAATC 2040 

TGTTTGAAAT GATTCGCTTA GCAACAGTGA GTTTAGCAGA TATTAAGTCT AGTTTGCAAC 2100 

CCAAGATTTA TGCTGAAATG ATGACCGTCC GTTTGGCGGA AATCAAGTCC GAACCAGCTC 2160 

TATCAGGAGC GGTTGAAAAT GAAATTGCTA CGCTGAGACA GGAACTTGCC CGTCTCAAAC 2220 

AAGAGCTTTC TAATGTAGGT GCGGTTCCTA AACAACTTGC ACCAGCTCCT AGTCCACCAG 2 280 

CTACGGGCAA AACAGTCTAT CGTGTCGATC GCAATAAAGT GCAATCTATC TTACAAGAGC 2 340 

CCGTCGAAAA TCCTGATTTA GCACGTCAAA ATTTAATTCG TTTGCAGAAT GCCTGGGGAG 2 400 

AGGTAATTGA AAGTCTAGGT GGGCCGGACA AGGCTCTGCT AGTTGGTTCT CAACCGGTTG 2460 

CTGCCAATGA ACACCATGCT ATTCTTGCTT TTGAGTCTAA CTTCAATCCT GGTCAAACTA 2520 

TGAAACGAGA CAATCTCAAT ACCATGTTTG GTAATATCCT CAGTCAGGCG GCAGGTTTTT 2 580 

CACCTGAGAT TTTACCTATT TCCATGGAGG AATGGAAAGA AGTTCGCGCA GCCTTTTCAG 2 640 

CCAAAGCCAA ATCTTCTCAA ACTGAAAAAG AACTAGAAGA AAGCCTGATT CCAGAAGGAT 2700 

TTGAATTTTT GGCTGATAAA GTGAAGGTAG AGGAAGACTA AAGAAAGATT TCATGATACA 2760 

ATAACTTTAT GAATAAACAA CAATTTATTA TTATGGCGCT GTTTACAGCT GCTGAGACCT 2820 

ATTTTTTCAA TGAACCCTGG ATCACTGGCC GCTATATTAT GGCAGCCTTT TGGGCAATTT 2880 

TACTCTTTAG AAATTTCCGA GTCAGTTATG TGATGGGCAA AATCGTTGAT GTCATCGATC 2940 

AGCATTTTAA TAGGAAAGAC TAGCCCTCAG CTTCCAGACA AAATCAAAGC CTTTTAGGCT 3000 
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TTTTTTTGTT ATACTAGAAA AGTATATTTA TAGAATTTTT GCTCTATTTC TGGGGAAATC 3060 

ACACGTTTTT CTAGTAAGTA CTGTAAAAGT TTTGAAAAAG AAAGGAACTA TCATGTCAGT 3120 

ATT AG AG AT C AAAGATCTTC ACGTTGACAT TGAAGGAAAA GAAATTTTAA AAGGGGTTAA 3180 

CCTGACCCTG AAAACAGGAG AAATTGCCGC TATCATGGGA CCAAATGGTA CAGGTAAATC 3 240 

GACTCTTTCT GCCGCTATCA TGGGAAATCC AAACTATGAA CTAACTAAAG GTGAAGTTTT 3 300 

GTTTGATGGC CTAAACATCC TTGAGTTGGA AGTGGATGAG CGTCCGCGTA TGGGACTTTT 3 360 

CCTTGCTATG CAATACCCAT CAGAAATCCC TGGAATTACC AATGCTGAGT TTCTTCCTCC 3 4 20 

CGCTATGAAT GCGGGTAAAG AAGATGATGA GAAGATTTCA CTTCGTGAGT TTATTACTAA 3 4 80 

GCTAGATGAA AAAATGGAAT TGCTCAACAT GAAAGAAGAA ATGGCAGAGC GTTACCTCAA 3 540 

CGAACGCTTC TCTGGTGGTG AGAAAAAACG CAATGAAATT CTTCAACTTT TGATGTTGGA 3 600 

GCCAACATTT GCTCTTTTGG ACGAGATTGA CTCAGGTCTT GATATTGACG CTCTTAAAGT 3 660 

TGTGTCTAAA GCTGTCAATG CCATGCGTGG TGAAGGTTTT GGTCCTATGA TCATCACTCA 3 720 

CTACCAACGT CTTTTGAACT ATATCACACC TGATGTGGTA CACGTGATGA TGGAAGGTCG 3 7 80 

TGTTGTCCTT TCTGGTGGTC CAGAATTGCC TGCGCGTTTG GAACGTGAAG GATACGCAAA 3 840 

ATTAGCTGAA GAACTTGGCT ACGACTACAA GGAAGAATTG TAATTCCCTC GTATCTTTTA 3900 

GGAGAAGTAA ATGACTAGAC AAAATATTAA ACTTTTTTCA GAAATGCACG CTGAACCAAG 3 960 

CTCGTTGGCT GATCTCCGTC AAAAAGCTTT TGACAAGATT GACACTTTCC AATTACCAGT 4020 

TATTGACTGT GTCAAATTCC ACCGTTGGAA TCTGGGTGAT GGAACGATTA CAGAAAATGA 4 080 

GCCATCAGCA AATGTTCCAG ATTTCACAGC TTTAGATCAT CACTTGAAGT TGGTGCAAGT 4140 

AGGAACTCAA ACTGTTTTCG AACAAACTCC ACTTGAGTTA GCTCAACAGG GTGTTGTCTT 4200 

CACAGACTTT CACTCAGCTT TAGAAGAAAT TCCAGAGCTG ATCGAAGAAT TCTTCATGTC 42 60 

ATCTGTTAAG TATGATGATG ACAAGTTGGC GGCTTACCAC ACAGCTTACT TTAACAGTGG 4 320 

TGCTGTACTC TATATTCCAG ATAACGTAGA AATCACAGAG CCAATTGAAG GAATTTTCTA 4 380 

CCAAGATAGC GATAGCAATG TGCCGTTTAA CAAGCATATT ATGATTATCG TTGGTAAAAA 44 40 

TTCTAAGATT AGTTATCTGG AGCGTTTAGA GTCACGCGGT GAAGGAAGTG ACAAAGCAAC 4 500 

TGCCAATATC ACAGTGGAAG TGATTGCACG TTCTGGTGCG CAAGTCAAGT TTGCTGCTAT 4 560 

CGACCGTCTA GGTGAAAACG TCACTGCCTA CATTAGCCGT CGTGGTAAAT TAGGCAACGA 4620 

TCCAAGTATT GACTGGGCTA TCGGTGTCAT GAACGAAGGA AATGTCGTTG CTGATTTTGA 4630 

TAGTGACTTG ATTGGTAATG GTAGCCATGC TGACCTCAAC GTTGTAGCTC TTTCAAGTGG 4 740 
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TCGTCAGGTA CAAGGGATTG ATACTCGTGT AACTAACTAT GGCTGCAACT CAATCGGAAA 4800 

CATTCTACAA CATGGGGTTA TCCTTGAAAA AGCAACTTTG ACTTTCAATG GTATCCGCCA 4860 

CATCATCAAG GGTGCTAAGG GAGCAGATCC GCAACAAGAG AGCCGTGTTC TCATGCTTTC 4 920 

AGACCAAGCG CGTTCAGATG CTAACCCAAT TCTTTTGATT GATGAAAATG ACGTAACTGC 4980 

AGGCCATGCA CCCTCTATTG GTCAGGTAGA TCCAGAACAT ATGTACTACC TCATGAGTCG 5040 

TGGCTTGGAT AAGGCAACTG CAGAGCGTTT GCTTCTTCCT GGTTTCCTTG GATCTGTTAT 5100 

CGTCGAGATT CCAGTCAAGG AAGTTCGTGA TGAAATGATT GCAACTATCG AAGAGAAATT 5160 

GTCAAAACGC TAAGGGGCAG CCTATGTTAG ATGTAGAAGC GATTCGCAAG GATTTTCCAA 5220 

TTTTAGATCA GATTGTCAAT GATGAACCTC TGGTCTATCT GGACAATGCT GCGACGACAC 5280 

AAAAACCACT ACTACTTCTC AAAGCTATTA ACACCTACTA TGAGCAGGAC AATGCCAATG 5340 

TTCACCGTGG TGTCCATACC TTAGCGGAAC GAGCGACAGC TTCTTATGAA GCTGCTCGTG 5400 

AAACCATTCG TAAGTTTATT AATGCAGGCT CTACAAAGGA ACTTCTCTTT ACCAGAGGAA 5460 

CGACAACCAG CCTTAACTGG GTCCCACCCT TTGCTGAGGA AATTCTCACT CAGGGAGACC 5520 

AGGTCTTGAT TTCAGTAATG GAACACCATT CTAATATCAT TCCATCGCAG CAAGCTTGTC 5580 

GAAAGACTGG AGCAGAGCTT GTCTATGTCT ATCTTAAAGA CGGTGCCTTG GATATGGAGG 5640 

ATTTGCGAGC TAAATTGACT GATAAGGTTA AATTTGTTTC CCTAGCTCAT GCCTCCAATG 5700 

TTCTTGGTGT GGTCAATCCG ATCAAGGAAA TCACTCAATT AGCCCACCAA CTTGGGGCAA 5760 

TTATGGTAGT GGATGGTGCT CAATCTACAC CTCATATGAA GATTGATGTC CAGGACT7GG 5820 

ATCTGGACTT TTTCGCCTTT TCGGGTCACA AGATCGCTGG TCCGACTGGT ATCCGTCTCC 5880 

TTTACGGCAA AGAAAAGTAT CTTGAGCAAA TGTCTCCAGT AGAATTTGGC GGCGAGATGA 5940 

TTGATTTTGT CTACGAGCAA TTTGCTAGTT GGAAGGAATT GCCTTGGAAA TTTGAGGCTG 6000 

GAACGCCAAA TATGGCACGA GCTATTGGAC TTGCGACTGC AGTTGATTAT CTGGAAAAGA 6060 

TTGGTATGGA TGCCGTTGAA GCTCATGAAC AGGAATTGAT TGCGTACGTC TATCCAAAAC 6120 

TGCAGGCAAT TGAGGGATTG ACCATTTACG GTTCTCACGA TTTGGCTCAA CGTTCGGGTG 'SI 80 

TTATTGCCTT TAACCTAGGT GATCTCCATC CTCACGATCT TGCGACGGCT CTGGATTATG 6240 

AAGGAGTGGC TGTTCGTGCT GGTCACCATT GTGCGCAACC CTTGCTTCAG TATTTGGAAG 6300 

TCCCAGCAAC AGCTCGTGCA AGTTTTTATA TCTACAATAC CAAGGCAGAT TGCGACAAAC 63 60 

TAGTCGATGC CCTACAAAAG ACAAAGGAGT TTTTCAATCG CACTTTCTAA ACTAGATAGC 6420 

CTTTATATGG CAGTGGTAGC AGACCATTCG AAAAATCCAC ATCACCAACG GAAGTTAGAA 64 80 

GATGCTGAGC AAATCAGTCT CAACAATCCG ACTTGTGGGG A7GTCATCAA CCTCTCTGTC 6540 
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AAGTTTGATG CAGAGGACCG TTTGGAACAT ATTGCTTTTC TAAATTCAGG ATGCACGATT 6600 

TCAACTGCTT CTGCTAGTAT GATGACAGAT GCCGTTTTAG GAAAAACCAA ACAAGAAATT 6660 

TTAGAACTGG CGACTATTTT TTCTGAAATG GTTCAAGGGC AAAAAGATGA GCGTCAAGAC 672 0 

CAACTTGGAG ACGCGGCATT CTTGTCAGGT GTTGCCAAAT TCCCTCAAAG AATCAAGTGT 6780 

GCAACCCTAG CTTGGAATGC CCTTAAGAAA ACAATTGAAA ATCAAGAAAA ACAGTAAGAC 6840 

AAGTTTCTTT-'TGTCTTATGA- ATTATTAGAA" ATGAAGAAAG AAAGG ATACT ATGGCTGAAG 6900 

AAAGAGTAGA ACCAAAACCA ATTGACCTTG GTGAATATAA ATTTGGTTTC CATGACGATG 69 60 

TAGAGCCTGT CTTATCGACA GGAAAAGGAC TCAACGAAGG TCTTAT7CCT GAATTATCTG 7020 

CTGCTAAGGG TGAGCCTGAG TGGATGTTGG AGTTCCCTTT GAAGTCTTAT GAAACCTTCA 7080 

AAAAAATGCC CATGCAAACT TGGGGAGCAG ACTTGTCAGA GATTGACTTT GATGACTTAA 7140 

TCTACTACCA AAAACCATCT GACAAACCAG CCCGTTCTTC GGATGATGTA CCTGAAAAGA 7200 

TTAAAGAAAC CTTTGAACGT ATCGGGATTC CAGAAGCTGA ACGTGCTTAT TTAGCAGGGG 7 2 60 

CTTCTGCCCA GTACGAGTCA GAAGTGCTTT ACCACAACAT GAAGGAAGAG TTCCAAAAAT 7320 

TAGGTATTAT CTTTACAGAT ACAGATTCCG CACTCAAGGA ATACCCAGAC TTATTTAAAC 7 3 80 

AATACTTTGC GAAGTTGGTA CCGCCGACAG ATAACAAGTT GGCAGCCCTC AACTCAGCAG 7440 

TATGGTCGGG TGGAACTTTT ATCTACGTGC CAAAAGGTCT CAAGGTAGAT ATTCCACTTC 7 500 

AAACTTATTT CCGTATCAAT AACGAAAATA TAGGTCAGTT CGAACCTACC TTGATTATCG 7 560 

TTGATGAGGG AGCAAGCGTC TACTACGTAG AACGATGTAC AGCACCAACA TATTCAAGCA 7 620 

ATAGCTTACA CGCTGCCATT GTAGAAATTT TTGCTTTCGA CGGAGCTTAT ATGCGTTATA 7680 

CAACTATCCA AAACTGGTCT GATAACGTCT ATAACTTGGT AACAAAGCCT GCTAAGCCTC 77 40 

AAAAGGATGC CACTGTTGAG TGGATTGATG GAAACTTGGG TGCCAAAACG ACTATGAAAT 7800 

ATCCATCTGT TTACCTTGAT GGAGAAGGAG CGCGTGGTAC CATCCTCTCT ATCGCCTTTC 7860 

CTAATGCAGG GCAACACCAA GACACGGGTG CTAAGATGAT TCACAATGCT CCACATACCA 79 20 

GCTCGTCTAT TGTGTCTAAA TCCATCGCTA AAGGTGGAGG AAAGCTTCAC TACCGTGGAC 7 980 

AAGTCACCTT TAACAAGAAC TCTAAGAAAT CTGTTTCCCA CATTGAATGT GATACCATTA 8040 

TCATGGATGA CTTGTCAGCA TC AG AT ACT A TTCCATTTAA TGAAATTCAC AACTCGCAAG 8100 

TGGCTTTGGA ACACGAAGCC AAAGTATCTA AGATTTCAGA AGAGCAATTG TATTATCTCA 8160 

TGAGCCGTGG ATTCTCACAA TCTCAGGCAA CTGAAATGAT TGTCATGGGA TTTCTACAAC 8220 

CCTTTACAAA AGAACTTCCA ATGGAATACG CAGTTGAGCT GAACCGCTTG ATTAGCTATG 8280 
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AAATCGAGCC ATCAGTTGGA TAAAATTTGA TTTTA7ACTC TTCGAAAATC TCTTCAAACC 8 340 

ACGTCAGCAT CCCCTTACCG TATGTATGGT TwCTGAtTCC TCAGTTTCAT CTACAACCTC 8400 

AAAACAGTGT TTTGAGCAAC CGCGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTGAGTATT 84 60 

AGATTTACTC AAAATCAAGG ATTTTGAAGA TGAACTTGTA TCAAAAAATC GCGGTTTAAA 8520 

ATCGCGATTT TTTATAATTT CTCGTTAACA AAGCGGACAA ACTGATTCCA CCAAACTTTT 8580 

AAGAAGAAGG CTTTTTCAAT TTTCTTGTCT GCTACCATTT CCAAACTAGG GCGCTCTGTG 8640 

GTGATGTAAC CTTGACCAAT CAAGTCCTTG TCTTCATAAG TCAAATGGCC AACCACTGTT 8700 

CCAGCTTCAA GTGGTGCTGG GATTGCTTTG GAATCAGGTG TGAATTGAAC AGATTGGGAA 8760 

GATTGATTCC CAACACGTTC GATTAGATAG ATATCCTCTG GAGCCACTGC AGTTACTGTA 8820 

TCTTCTTTTC CATCTTGTAC AGGGGCTTTG CTATCTTGAT AGGCATCGCC TTGTTGAACC 8880 

ATTTTGCGAA GTGTAAATGT AGAAGAAATA TAATCCATTA GGGAAGATGT AGCTGTAAAT 8940 

CGAGCGTAAG GATTATTCTC TTGATGATCT GCATTTAAAA CAACTGTGAT GACTCTCATG 9000 

CCTTTTTCGA CAGTAGTACC AACAAAAGAC TCTCCAGCCT TATCTGTTGT TCCTGTTTTT 9060 

AGCCCATCAA AACCACCACC GTAAGCAGGC ATACCTTCTA ACATGTAGTT CCTTCAAGTG 9120 

ATTGTCATCC CAGCAAAAGT AGAAGAAGGT TTTTTGGTGA TTTCTAAGAC TTGTGGGTAT 9180 

TTTTTGATGA GGTTGCGAGC AACGATAGCG ACATCATAAC CACTAAGCTT ATTTTCCTCA 9240 . 

TCTT T TTTAC AACCTGGGTA AATGTTATCC CCTAGAGTTT CATTGTTAAG ACCTGTCGTA 9300 

TTGACAACAG TGGCATCCTG AATTCCCCAT TCCAAGAGTT TTGCCCGCAT CATATCGACG 9 3 60 

AAATCTTTTT CTGAGCCAGC AATTTTCTCA GCTAGGGCAA TAGCGGCGCT GTTCCCACTA 9420 

GATACCAGAG TTGCTTCAAG CAACTCTTCG ACAGTATAAT TACGGGCCTC CATAGGAATA 9480 

TTACTGGCTT CAGAATTTGT CGTCAATTGA TAAGGATAAT CAGAAATATC TACAGGAGTG 9540 

GACAGGGTAA TACTTCCGTT TTCCAAAGCT TCATAGACCA GATAAACAGT AATCAATTTT 9 600 

GTTATGGAAG CAATTTCCAC AGGTTGCGTT GCATCCTTCT CATAGAGAAT TTTACCAGTA 9660 

TTTGCCTCAA CAGCAATCGC ATGTTTAGCG GCAATGGTAA AATCTTGAGC AACAGCAGTA 9720 

GAAGCACCCC CTAAAAGAGA GACAGTTAAC AAAGTTAAAA ATATTTTTTT CATAGTAGTC 9780 

TTATTCTATC ATAAAGAAAA AAAATATTCT TGCTTTAATA ATTCATCTGT TAAGCTTTTT 9840 

GAAAATATGG TAAAATAAAG TAAGGGAGGT AACTCATCTT TCGTAGAAAT AAATTATTTT 9900 

TTTGGACCAC AGAAATTTTA CTCTTAACCA TCATCTTTTA CCTATGGAGA CAGATGGGGT 99 60 

CTTTGATTAA CCCTTTTGTT AGCGTGCTTA ATACAATTAT GATTCCATTT TTATTAGGGG 10020 

GCT TT TTTTA TTATTTGACA AACCCTATTG TTACTTTCTT AAATAAAGTC TGTAAACTCA 10080 
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ATGTGATTTC 


TTTTTGCATT 


TTTGAGTTAG 


ATAAGGTATA 


ATGATTTTAT 


TGTCTTTTGG 


11520 
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GCATTATGAG 
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11580 
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AAATATAACT 


GCAAAAAATA 
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GCCTAAAAAC 


11640 


CAGCAGGCGT 


CACCCGATTT 
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TGTTCAATGA 


CAGCTCTTAT 


TATTAGCCAG 


11700 


ATACGATTAA 


GCCTTGTCTA 
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GTTGGCAGCT GTTTCGACCT GGGTTCGACT CCCACCGGCT CCATTATTCC TTTGCATTCT 11880 

TTTGCATTCC TTGGTAAAAC GTTGTTAAAT CAACGTTTTT TATTTTTATC TTTCGTATTC 11940 

CTTTGCATTC TTTTGCTAAA AAGGGAGTCA CAAACAGACC CTATTTTAAA AAAGGATAGA 12000 

AAAAAGGATA CAACATTTGT CGCATCCTAA AAATAATCTT TTTTCGACGG AAGAGATGGG 12060 

ATTCGAACCC ACGCACGCTA TTACACGCCT ACCCCGTTTC CAACACGGCC TCTTAAGCCT 12120 

CTTGAGTAAT CTTCCAATAC TTACTCAAAT AGTCTACCAT AAAGGCTCTT ATCTTGCAAT 12180 

AAAAATTCTA GAAATAAGAA AAATGATAGA TTTTGAAAGA AAATGATAAA AAATGCTTGA 12240 

CTTCGAAAGA AAGTATGATA GAATGAATAG TGTAAACGAT AACAGGAGGT GATTCAGTGT 12300 

TAAAAACAGA ACGTAAACAA CTAATTTTAG AGGAGTTAAA TCAACATCAT GTAGTTTCTC 12 360 

TAGAAAAATT ACTTACTTTC CTAGAAACGT CAGAATCAAC GCTTCGAAGA GACTTGGATG 12 420 

AGTTGGAAGC GGAAAACAAG CTTCCTCGTC TGCATGGTGG AGCAGAACTC CCCTACTCCT 12480 

TACAGGAAGA AGAAACCATT CAAGAAAAAT CTGTCAAAAA CCTTCAAGAA AAGAAATTGC 12540 

TGGCTCAGAA AGCAGCCTCT CTCATTAAAG AAAAAGATGT CATCTTTATC GATGCTGGAA 12600 

CAACAACTGC TTTTTTCATT CATGAATTGG TCAATAAGAA TGTTACAGTT GTGACCAACT 12 660 

CCATTCACCA TGCCGCTCAG TTGGTTGAAA AGCAGAwTCC AACTGTCATG GTTGGAGGAA 12"? 20 

AC GTCAAGAC GGCGACAGAT GCTAGTATCG GGGGCGTTGC TCTTAACCAG ATTAACCAAT 12780 

TGCACTTTGA CCGTGCCTTT ATCGGAATAA ATCGTGTTGA CGATGGCTAT TATACGACTC 12840 

CTGATATGGA GGAGGGAGCT GTGAAAAGAG CTATTTTCGA GAATGCCAAG CAGACCTACG 12900 

TCTTGGTGGA TTCCTCAAAA ATTGGACAAA CTTGCTTTGC CAAGGTAGCC CCACTCAAAC 12960 

GCGCTATCGT TATCACTAGT CAAGGGCATG AGCTCTTGCA GGTTATTAAG GAGAAAACGG 13020 

AGGTAATAGA AGTATGATTT ATACAGTCAC ACTCAATCCA TCCATTGACT ATATCGTTCG 13080 

TTTGGACCAA GTCAAAGTTG GTAGTGTCAA TCGTATGGAC AGTGATGATA AGTTTGCTGG 13140 

TGGGAAAGGA ATCAATGTCA GCCGTGTCTT GAAACGTTTG AATATACCAA ATACAGCGAC 13200 

GGGATTTATC GGTGGCTTTA CTGGTAAATT TATCACAGAT ACTTTAGCAG AGGAAGAAAT ;.3 260 

CGAGACACGT TTTGTCCACG TGGCAGAAGA TACTCGTATC AATGTTAAAA TCAAAGCAGA 13 320 

CCAAGAAACA GAAATCAACG GAACGGGTCC AACTGTTCAA TCGGTTCAGC TAGAAGAATT 13 3 80 

GAAACCTATT TTATCTAGTC TGACAGCAGA AGATACAGTT GTCTTTGCAG GTTCAAGTGC 13440 

TAAAAATCTA GGCAATGTTA TCTATAAGGA TTTGATTTCC TTGACGCGCC AGACTGGTGC 13 500 

GCAAGTGGTC TGTGACTTTG AAGGACAGAC CTTAATTGAT AGTTTGCACT ACCAGCCTCT 13560 

TCTTGTAAAA CCAAACAATC ATGAACTTGG AGCGATTTTT GGCGTTAAAC TCGAAAGTTT 13 620 
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CTTATGTCTT 


TGGTACAGGT 


15480 


ACGCTTGCAG 


CAACTGTTTC 


TTCAGGTGGT 


TCTGTAGCCA 


TGGCAGCAGT 


TATGGCTGGA 


15540 


GGAATGGTGC 


CACCACTTGC 


AATCTTTGTC 


GCAACTCTTC 


TTTTCAAAGA 


TAAATTTACT 


15600 


AAGGAAGAAC 


GTAACTCTGG 


TTTGACAAAC 


ATCATCATGG 


GCTTGTCATT 


TATCACTGAG 


15660 


GGAGCGATTC 


CATTTGGTGC 


CGCTGACCCA 


GCTCGTGCGA 


TTCCAAGCTT 


CATCCTTGGT 


15720 


TCAGCAGTAG 


CAGGTGGACT 


CGTTGGTCTT 


ACTGGTATCA 


AACTCATGGC 


GCCACACGGA 


15730 


GGAATCTTCG 


TTATCGCCCT 


TACTTCAAAT 


GCTCTCCTTT 


ACCTCGTTTC 


TGTCTTGGTA 


15840 


GGAGCAATCG 


TAAGTGGTGT 


GGTTTATGGT 


TACCTACGCA 


AACCACAAGC 


ATAAAAAATA 


15900 


GAAAAATGAA 


AAGATTGGAC 


CGTTTGGTCC 


AGTCTTTTTC 


TCTTCCCGAA 


ATGCCTGTCA 


15960 


AATATGGTAT 


AATAGAAGAA 


TGGCAAACAA 


GAATACAAGT 


ACAACAAGAC 


GGAGACCGTC 


16020 


TAAAGCAGAA 


CTGGAAAGAA 


AAGAAGCGAT 


TCAACGAATG 


TTGATTTCCT 


TAGGAATTGC 


16080 


GATTTTATTG 


ATTTTCGCAG 


CCTTCAAATT 


AGGGGCTGCA 


GGTATAACCC 


TTTATAATTT 


16140 


AATTCGCTTG 


CTAGTGGGTA 


GCCTAGCTTA 


TCTGGCGATA 


TTCGGCCTAT 


TAATCTATCT 


16200 


CTTCTTTTTC 


AAGTGGATAC 


GAAAACAGGA 


AGGACTCTTA 


TCTGGCTTTT 


TCACCATATT 


16260 


TGCTGGCTTA 


CTCTTGATTT 


TTGAGGCCTA 


CTTGGTTTGG 


AAATATGGTT 


TGGACAAGTC 


16320 


CGTTCTAAAA 


GGCACCATGG 


CTCAGGTTGT 


GACAGATCTG 


ACTGGTTTTC 


GAACGACTAG 


16380 


CTTTGCTGGA 


GGGGGCTTGA 


TCGGGGTCGC 


TCTTTATATT 


CCAACAGCCT 


TTCTCTTTTC 


16440 


AAATATCGGA 


ACTTACTTTA 


TTGGTTCTAT 


CTTGATTTTA 


GTGGGTTCTC 


TCCTAGTCAG 


16500 


CCCTTGGTCT 


GTTTACGATA 


TTGCTGAATT 


TTTCACTAGA 


GGCTTTGCCA 


AATGGTGGGA 


16S60 


AGGGGACGAG 


CGTCGAAAAG 


AGGAACGCTT 


TGTCAAACAA 


GAAGAAAAAG 


CTCGCCAAAA 


16620 


GGCTG AG AAA 


GAGGCTAGAT 


TAGAACAAGA 


AGAGACTGAA 


AAAGCCTTAC 


TCGATTTGCC 


16680 


TCCTGTTGAT 


ATGGAAACGG 


GTGAAATTCT 


GACAGAGGAA 


GCTGTTCAAA 


ATCTTCCACC 


16740 


TATTCCAGAA 


GAAAAGTGGG 


TGGAACCAGA 


AATCATCCTG 


CCTCAAGCTG 


AACTTAAATT 


16800 


CCCTGAACAG 


GAAGATGACT 


CAGATGACGA 


AGATGTTCAG 


GTCGATTTTT 


CAGCCAAAGA 


16860 


AGCCCTTGAA 


TACAAACTTC 


CAAGCTTACA 


ACTCTTTGCA 


CCAGATAAAC 


CAAAAGATCA 


16920 


GTCTAAAGAG 


AAGAAAATTG 


TCAGAGAAAA 


TATCAAAATC 


TTAGAAGCAA 


CCTTTGCTAG 


16980 


CTTTGGTATT 


AAGGTAACAG 


TTGAACGGGC 


CGAAATTGGG 


CCATCAGTGA 


CCAAGTATGA 


17040 


AGTCAAGCCG 


GCTGTTGGTG 


TAAGGGTCAA 


CCGCATTTCC 


AATCTATCAG 


ATGACCTCGC 


17100 


TCTAGCCTTG 


GCTCCCAAAG 


ATGTCCGGAT 


TGAAGCACCA 


ATCCCTGGGA 


AATCCCTAAT 


17160 
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CGGAATTGAA 


GTGCCCAACT 


CCGATATTGC 


CACTGTATCT 


TTCCGAGAAC 


TATGGGAACA 


17220 


ATCCCAAACG 


AAAGCAGAAA 


ATTTCTTGGA 


AATTCCTTTA 


GGGAAGGCTG 


TTAATGGAAC 


17230 


CGCAAGAGCT 


TTTGACCTTT 


CTAAAATGCC 


CCACTTGCTA 


GTTGCAGGTT 


CAACGGGTTC 


17340 


AGGGAAGTCA 


GTAGCAGTTA 


ACGGCATTAT 


TGCTAGCATT 


CTCATGAAGG 


CGAGACCAGA 


17400 


TCAAGTTAAA 


TTTATGATGG 


TCCATCCCAA 


CATGGTTGAG 


TTATCTGTTT 


ACAATGATAT - 


17460 


TCCCCACCTC 


TTGATTCCAG 


TCCTGACCAA 


TCCACGCAAA GCCACCAAGG 


CTCTCCAAAA 


F7520 


GGTTGTGGAT 


GAAATGGAAA 


ACCGTTATGA 


ACTCTTTGCC 


AAGGTGGGAG 


TTCGGAATAT 


17580 


TGCAGGTTTT 


AATGCCAAGG 


TAGAAGAGTT 


CAATTCCCAG 


TCTGAGTACA 


AGCAAATTCC 


17640 


GCTACCATTC 


ATTGTCGTCA 


TTGTGGATGA 


GTTGGCTGAC 


CTCATGATGG 


TGGCCAGCAA 


17700 


GGAAGTGGAA 


GATGCTATCA 


TCCGTCTTGG 


GCAGAAGGCG 


CGTGCTGCAG 


GTATCCACAT 


17760 


GATTCTTGCA 


ACTCAGCGTC 


CATCTGTTGA 


TGTCATCTCT 


GGTTTGATTA 


AGGCCAATGT 


17820 


TCCATCTCGT 


GTAGCATTTG 


CGGTTTCATC 


AGGAACAGAC 


TCCCGTACGA 


TTTTGGATGA 


17880 


AAATGCAGCA 


GAAAAACTTC 


TTGGTCCAGG 


AGACATGCTC 


TTTAAACCGA 


TTCATGAAAA 


17940 


TCATCCAGTT 


CGTCTCCAAG 


GCTCCTTTAT 


CTCGGATGAC 


GATGTTGAGC 


GCATTGTGAA 


18000 


CTTCATCAAG 


ACTCAGGCAG 


ATGCAGACTA 


CGATGAGAGT 


TTTGATCCAG 


GTGAGGTTTC 


18060 


TGAAAATGAA 


GGAGAATTTT 


CGGATGGAGA 


TCCTCGTGGT 


GATCCGCTTT 


TTGAAGAAGC 


18120. 


TAAGTCTTTG 


GTTATCGAAA 


CACAGAAAGC 


CAGTGCGTCT 


ATGATTCAGC 


GTCGTTTATC 


18180 


AGTTGGATTT 


AACCGTGCGA 


CCCGTCTCAT 


GGAAGAACTG 


GAGATAGCAG 


GTGTCATCGG 


18240 


TCCAGCTGAA 


GGTACCAAAC 


CTCGAAAAGT 


GTTACAACAA 


TAAAAAAATA 


GrTTCTTTrr 


18300 


AAGTTTGGAG 


GG AAGCTATT 


TTAGTGGCTA 


TTCATTGCTT 


TTATTTTCTG 


AAGTTGGCGC 


18360 


ATTGGACTGT 


TTTTCGTTTT 


CAGTAGCAGG 


TTTACTTGAA 


GCAGGAGTAG 


AAGAGTCCTG 


18420 


AGTT.GCTGTT 


TTCTGATCTT 


CTTTTTTCTC 


TTCCTTCACG 


CTAGATTTTG 


GTGTTTCCTC 


18480 


TTGCTGTCTT 


TTTTCTTGAC 


TAGTGTTAGT 


CTCTTTAGTT 


GGACTGGTGT 


TTTCCTTAGG 


18540 


GGATTCCTTT 


TGGATTTCTT 


TGACAATGGT 


TGTCGTCTGG 


CTTGTCGTAC 


GTTCTTTTTT 


18600 


AATATTTTTG 


TTATTATCCA 


AGGCGTT 








18627 


(2) INFORMATION FOR SEQ ID NO: 114: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2560 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D> TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

TAAAATACCT TACCTTCCTT CTGCACCTTC ACCACGTAAG TCATTGAAAT TTAAACATCA 60 

AGATATTACA ATTGAAGAAA CGACTGAAAC AGCTTTTGAA GGAGTTGATA TTGCTCTCTT 120 

TTCAGCAGGT AGTTCTACAT CAGCTAAGTA TGCACCATAC GCAGTAAAAG CTGGCGTGGT 180 

AGTAGTAGAT AATACATCTT ATTTCCGTCA AAATCCAGAT GTTCCTTTGC TTGTTCCAGA 240 

GGTCAATGCT CATGCACTTG ATGCTCACAA CGGAATCATT GCCTGCCCTA ATTGTTCAAC 300 

AATTCAAATG ATCGTGGCTC TTGAGCCGGT TCGCCAAAAA TGGGGCTTGG ACCGTATCAT 360 

TGTTTCAACT TATCAAGCCG TTTCAGGTGC TGGTATGGGA CCAATTCTTC AGACACAACG 420 

TGAACTTCGT GAAGTCTTGA ATGATGGTGT GAAACCACGT GATTTGCATG CGGAAATCTT 480 

GCCTTCAGGT GGTGACAAGA AACATTATCC TATCGCCTTT AACGCTCTTC CACAAATTGA 54 0 

TGTTTTCACT GATAATGATT ACACGTACGA AGAGATGAAG ATGACCAAGG AAACTAAGAA 600 

AATTATGGAA GATGATAGCA TTGCAGTATC TGCAACATGT GTGCGTATTC CAGTCTTGTC 660 

AGCTCACTCT GAGTCTGTTT ATATCGAAAC AAAAGAAGTG GCTCCAATCG AAGAAGTAAA 720 

AGCACCTATC GCAGCCTTCC CAGGTGCTGT TCTTGAAGAT GATGTAGCTC ATCAAATCTA 780 

TCCTCAAGCT ATCAATGCAG TTGGTTCGCG TGATACCTTT GTTCCTCCTA TCCGTAAAGA 840 

CTTGGATGCA GAAAAAGGAA TTCACATGTG GGTTGTTTCA GATAACCTTC TCAAAGGTGC 900 

TGCTTGGAAC TCAGTTCAGA TTGCTGAAAC TCTTCATGAA CGTGGATTGG TTCGTCCAAC 960 

AGCCGAATTG AAATTTGAA7 TAAAATAGTC ATATCGTTTA GGAGTTCAGA TGAACTCCTT 1020 

CTTTGAAATA GAGAGGTGTT TTCGTGTCTT ATCAAGATTT AAAAAAATGT AAAATCATTA 1080 

CAGCCTTTAT TACCCCCTTC CATGAGGATG CTTCCATTAA CTTTGATGCT ATTCCAGCCT 1140 

TGATTGAGCA TTTATTGGCC CATCATACGG ATCGAATTCT TCTCGCAGGA ACGACTGCTG 1200 

AGAGTCCAAC TTTGACCCAC GATGAGGAGT TGGAGTTGTT TGCGGCTGTA CAAAAGGTTvi 12 60 

TCAATGGACG CGTTCCTTTG ATTGCGGGTG TACGT ACT AA TGATACGCGT GACTCTATTG 13 20 

ACTTTGTCAA AGAAGTAGCG GAATTTGGTG GTTTCGCAGC TGGGCTTGCT ATTGTTCCTT 1380 

ACTACAACAA ACCTTCTCAA GAAGGGATGT ATCAGCACTT TAAGACTATT GCAGATGCTT 1440 

CTGACCTACC AATTATTATC TATAACATTC CAGGGCGTGT AGTTGTCGAA TTGACTCCAG 1500 

AAACCATGCT TCGCTTGGCT GACCATCCAA ATATTATCGG TGTCAAAGAA TGTACTAGCT 1560 

TGGCTAATAT GGCTTACTTG ATTGAGCACA AGCCTGAAGA GTTCTTGATT TATACAGGTG 1620 

AGGATGGAGA TGCTTTCCAT GCCATGAACC TTGGGGCGGA TGGGGTTATT TCTGTTGCCT 1680 
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CTCATACAAA TCCGGATCAA A7GCACCACA TGTTTACTGC GATTGCAGAA AGCGATATGA 174 0 

AGAAAGCCGC AGCAATTCAG CGTAAATTCA TTCCTAAGGT TAATGCTCTC TTCTCTTATC 1800 

CAAGTCCTGC TCCAGTTAAG GCAATTCTTA ACTATATGGG ATTTGAAGCT GGACCCACTC I860 

GTCTACCTCT TGTTCCAGCA CCAGAAGAAG ATGCCAAACG CATTATCAAG GTTGTCGTAG 1920 

ATGGCGACTA CGAAGCAACT AAGGCAACTG TAACAGGGGT CTTAAGACCA GATTACTAAT L980 

AAAGACAATA AAATCCGGCT CTTTGTCAAC TGTAGTCCCT TGAAGTCACC TAACCTCGAG 2040 

AAAGGACAAA TTTTGTCCTT TCTTTTTTGA TATTCAGAGC GATAAAAATC CGTTTTTTGA 2100 

AGTTTTCAAA GTTCCGAAAA CCAAACGCAT TGCGCTTGAT AAGTTTGATG AGATTATTGG 2160 

TCGCTTCCAA TTTGGCGTTT GAATAGGGTA GTTGAAGGGT GTTGACGATT TTCTTTTTGT 2220 

CCTTTAGAAA GGTTTTAAAG ACAGTCTGAA AAATAGGATG AACCTGCTTC AGATTGTCCT 2 280 

CAATGAGTCC GAAAAATT7C TCCGGTTCCT TATTCTGAAA GTGAAACAGC AAGACTTGAT 234 0 

ACACCTCATA GTGATGTTTC AAGTTTTGTG AATACCTCAA AAGCTTGTTT AAAATCTCTT 2400 

TATTCGTTAA GTGCATACGA AAAGTAGGAC GATAAAATCG CTTATCACTC AGTTTACGCC 2460 

TATCCTGTTG AATGAGTTTC CAGTAGCGCT TGATAGCCTT GTATTCGGGA TTTTCGATGA 2 520 

AACTGATTCA TGATTTGGAC ACGCACACGA CTCATAGCAC 2 560 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11303 base pairs 
(Bl TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
TATTGGATTT CCCTTGCAAT CACTTTATGG GACAAGCACC CGGCAGCGCA GAGGAAATCA 60 

ACCCCTTCTG TAGCCTACAT TTTCAAACCA CCTTCCCACG TTTTGCCAAG ATTAAGGTCA 120 

ACGGTAAGGA AGCAGACCCT CTCTATGTCT GGTTACAAGA CCAG AAATC C GGCCCACTAG 130 

GAAAACGAGT CGAATGGAAT TTCCCTAACT TTCTCATCGG TCGAGATGGG CAAGTCTTTC 240 

AACGCTTTTC TTCAAAAACA GACCCAAAAC AAATTGAAGA GGCGATACAA ACTCTACTAT 300 

AATTCACAAT CTCACTATGA TTAGGTTTCC TTTAACCTGA TGAATAGTGA GATTTTTTGA 360 

TGGGCTTTGA CTTAAATAGA AAAACACCCC ATGATATGAA ACATGAAGTG TTGTAAAGTC 4 20 

TATGTTGTAG GTCCTTATTT CACAATTTCA ATGTGACCAC TGATAACGAA TACCATACAG 480 
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AATCTTCATA TACACTAAAC AAATGACTTT CTAATTATTT CAATTAGTTT TGGCTAGTAA 54 0 

ATATCATTTC CAACAAACGC CCTCTCAATT CCTTATCCTG ATGATGCAAG ATATTCATTA 600 

AGTCATGAGA GTTTTTCGCA TTCATGAATT GATTTAACAA TCTATCTTTT AATTCATATG 660 

GAACAGAAGC TGTCTTTAGT AGTCTAAAAA CTTCGTCATT TAAAGATGTC CTTTTATTAT 720 

CTTTCCATTC AAATTTAGCT GTATCATTCT TATTTGGCAA TTCAATTATA GACACATTCG 780 

TTCCTTTAAA ATGAATTCTA TGTTTTCTAT TGCTTGGAAC GAT ACT AG AA TCTCCTTGTA 840 

ATGCTAACTC TACCATTCCC ATTTCCCAAT CGATTGATAA TCTTGTTTTA TATCTTTGAC 9C0 

CATTTTGATC TTCAAGCATT TCAAAAGAAT GTTGTTTTCC TGGGAATACA TACCAATCTA 960 

CAACTTCAGG TAAATCAACA CCCATACCTA TCTCAGAACC AACCAAGGGA ATGATTGCAC 1020 

CACTTTTTGC AAACACAGGC GTAGTCGAGA TGTCCCTATA AACACTTAAC TTCACACCAC 1080 

CTGTGTATTT TTTCTCTGAA AAGAAGTCAT ACCATTCACC TTCAGGGAAC CATACATCTA 1140 

CTTTTGCAGA TTGGAATGTC AAATCCATCT TTTCTACAAT GGGAGCCACC ATCAGTTCTG 1200 

TTCCAAAAAA GTATTCGTTT GGAACATTAT AGCTCTCATC ATTCTCTGGA TAGAAATAAT 12 60 

AGATTCGACT GATTAATGGG GCACCTTCCT CATGTGTCTG TACATTCATG GTATATAGAT 1320 

AGGGAATCAT CTGATGTCTC AAACGAAGGT ATTTCTTCAT AATCTTAGAT GTT G TTTCTG 1380 

AAAAAAACCA AGCTTCTTTA CTATTAAAAG GACTTCTAGA ACTATGTAAT CGAGTAATCG 1440 

GACTAAAAAC ACCAAACTGT AGCCATCTAG TTTGTAGCTC TTCCTCATAA TCCCCCAACA 1500 

TATGTCCACC GATATCATGA CTCCACCAAC TATAACCGAT ATTAGATGCT GTCGCTGTAA 1560 

AATAGGGTTG AAATCTTAAG GAATTCCAAC TAATAATAGT ATCCCCTGAA AAACCAACAG 1620 

GGTAGCGGTG ACTACCAGGA CCTGCATATC TTGATAAAAT CAAACCACCT TCTGCATTTT 1680 

TACAACTATC CTGATAGTGA TAATGGTTTA AAAGCCAAAG TGGATCTAGC ATACCTTGTG 1740 

TCCCTTCTTG CCAGTCAATC CACCAAAAAT CTACTCCCTG CTTTTCTAGT TCATAATGAA 1800 

CATCTTTAAA GTAGGCTTCC CTAAAAGAGG GATTAAAAAA ATCAAAAATA GCAGGTTCTT 1860 

CTAGTTCTAC ATTTAACCCC AACCGTTTTG CGATTTGAGG ATAAGCTTCT TCATAAGCCC 1920 

GTATCCCATC AGCAGGATGG ACATTTAAGG AGAGTTTTAG CTTTCTATCA TGAAGTTGTT 1980 

GCAATAACTG TTCTGGATTT GGTATTAAGT TTCTATTCCA ACTATATCCT GTCCAGCCAC 2040 

TTCCAAAGCG AGCTGGAATG TCAGTTATAT GCCAATCCAT ATCTAACACA CCGATAGATA 2100 

ATGGAATTTT CTCTGTTTCA AATCTGTCTA TTAAATCCAA GTATTCATCC GACGTATAAG 2160 

GCCAATATCT ACTCCACCAA TTGCCTAAAG CATATCTTGG CAACAAGGGT GTTGAACCAG 2 220 

TCAAATGGTA AAAATCTCTG ATTGCTCCTC TATAATCATG CCCATAGGCA AAGAAATACA 2280 
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GGTCAATTTG ATTTTCTCTC TCAATATAAC CAGATTGTTC ATCCCAAATA AATCCTTGAG 23 40 

AATCATCCAA TAAGCCTATA CCATTTCGGC TAATAATTCC ATCTTCTAAC GAGATTGCTC 2400 

CATCTGCCTT ATCCAGAGTC CGAGCTGTTC CTTTTAACCT TTCAATAGAT TCACCAAAAT 24 60 

ACCAGCGACT ACCATATACG GCAAAATTTC CTTTTAATTC TATAAATAAA TTTTCGGCGT 2520 

TAAATTCTCC TTTATTAAAG TGCAGATCAA AATAGTCCGT CATAATATCT AGTACGTTTG 2 530 

ATGTCTCGAT ATAATCTAAC GAAATTTGGC CAAAATCTCT ATT AT AG AT A AGTTGTGTCG 264 0 

TTCTATCCTC AAAACTTCCA GTTTGAGAGT ATTCTAACC? TACTAGC7TG TCTGTTAATA 27 00 

CAGAGATTCG ATAAAACTCT CCCTTAAAAA TTTTCAATTT GTTTTCCTCC TTTTATGCTA 27 60 

GCATAAAAAC AGAACGCACC ATTTTTGATG CGTTTTTCAT TATTCTGAAT GCAATGTTCT 2 820 

ATCTGTTATA TCTATGACAA ATAATAGTCA ATTGAAAAAA TGCAGTGGAC AAAATATCTT 2880 

TTAACAAACC AAGAGTTTAT TAAAGAGTTA TCACTTTTCA ACTTTTCTAA GCTTATGCAG 2940 

TTGTGAAACA AACTACTTTT AAACTATTAA CTAAGATAGG ATTGATAAAT AATTTCAAAC 3000 

TCTTACTAGC AATCATACGA TATTCAACCT CACGTGCTTT TTTCCTTCCT GCTTATTTCT 3060 

TAGAACTGAA GAACCCGGAT CGGTATATAA ATTATCCGGA TCAACATAGT CATAAGATTC 3120 

ATAACAGTTG CGCTTCATTA AGTCATCCCC AGAGCAAGAG CTTCATCTCG TAATTTTTCA 3180 

ACATCACTAA CCGTAGGTCG CCATCCTTCA ATCATATTTG TACTTAAAGC ATACCAAACA 32 40 

CTCTTAAAAA CGGATCGGTT TTCAAAACCT ATTCCCATGA TTGTCATCTT TTCTTTATCT 3300 

ATATCTAAGG ACATATGCTA CCTCCTTTAG ATACATTATA CCATGTTTCT CTGTAGCTTT 3 3 60 

TAAAAATTTT ATTTTGTTTG TCATATCTAA CTTTTCAGCA CGCTTATCCT ATTTTATAAG 3420 

CCTCAAACCC AAATATAAAA CGCATTCTTT TTGCTTTTTT ACTATTGTAT CGTATTCTAC 3 4 80 

GATAACATAC TTTACTTTAT T G TTTTTTTA AATAACAGCA GTTCCCTGTT TATCAACTAT 3 540 

TCGAACTACT TTCTATTTTG CTTCATACCC TACATAGCGA AAAAATATGA AAAAGCAGAG 3 600 

AAGAATATCT TAAAAAGACC TCTTCACTGC TAATATTAAC ACTCATTATT TAAACTATAT 3 660 

GGATTCTATC ATCGAGTATA CTTTTTTACT TATTAGATAC CTTGCTCTTC TTTCACCAAT 37 20 

TTTTGATCAT ATACACGGAT GAATGGAAGA TACACTAGGA ATGCTGCAAA TGCACATACT 3 7 80 

AGAGCAACTA ATACAGCTCG AAGATCTGCT GTCCCTAAGA AAGCTCCAAT CCCTACTGGA 3840 

GTTGGCCATG GAACCTGTGC GATAATTGGC TTAATAAAGT TTAGAGAATT CGCTACGTAA 3 900 

TAAATAGTAG CAGTAACCAT TGGTGCTAAA ATAAATGGTA TAGCCAAGGC 7GGATTATAG 3 960 

ATAATAGGTA ATCCAAAAAT TAATGGTTCA TTAATATTAA ATAAGGCTGG AACTACAGAT 4 020 
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GCTCGTCCTA 


TTGCTTTAAG 


CTGTTCAGAT 


TTAGAGGCAA 


AAGCAATATA 


TAAACATAGT 


4060 


CCTAAAGTTG 


CACCAGAACC 


ACCTGCAATT 


ACAAACATAT 


TAGAAAATTC 


ACCTGCAACA 


4140 


GCGAAGTGCC 


CGCCAGCAGC 


ATTTTCAGCC 


ATCTTAGCAA 


GAGCAATTGG 


ACTAACAAAT 


4200 


GCAAAAACAA 


TGTTCGCACC 


GTGGATACCT 


ACAATCCAAA 


GTAGTTGAGT 


CAATAGATAA 


4260 


ATAATCATTA 


AACCAATCCA 


CGAATTAGTC 


AGATTGGATA 


CAAKACCAAA 


TGGAATTGCA 


4320 


ATGACTTTAA 


AAATATCTGT 


TCCCATTGCT 


ACAAGAAGAC 


CGTTGATAAA 


GATAACAACA 


4380 


AATGCAACAA 


CAAATCCCGG 


AACCAAAGCG 


GTAAATCCAC 


GAGAAACTCC 


TTCTGGAACA 


4440 


GCTTCAGGCA 


TTTTAATAAC 


CCAATTATGT 


TTAACACACA 


TACGATAAAT 


AAGAACAGTC 


4500 


ACAATTGCCA 


TAATGATTGC 


GGTAAAAATC 


CCTGTTGTCC 


CAAAACGTGC 


GACTACATTT 


4560 


CCCATTGCCC 


ATCCATCTGC 


AATTACTGCA 


CCTTCTTTTA 


GACTTGTCAC 


AGTCTTCATC 


4620 


ATTCCACCAT 


CAAAAATGAT 


TTGCGGTACT 


GTCATGACAA 


AAGCCATCAA 


GGCAAGCAAG 


4680 


GCACCATTAA 


GAGGATTCAT 


ATTGAGTTCT 


TCTTCCTCTG 


CATAAATTTT 


TGTCAATTCA 


4740 


TATGCAAGTG 


ATAGAACGAA 


ATAAAGAGAT 


AGAGAACCCA 


TAGTCGCATA 


CTTTGCAACC 


4800 


ATGTAAAGTG 


ATGTGAATTT 


ATCAAATGAA 


GCAGAGAAAA 


TATCTGCCAC 


AATTGGCCAA 


4860 


AATGAGAAAG 


CTTGTGGCAA 


AATACTGAAT 


ACCAAAAACA 


TTGATCCTAC 


AATAGTAAAT 


4920 


GGTACAGCAG 


CCATACCTGC 


AGCCGTGATA 


GCACGTACTA 


CTTTAAACTG 


AGCAAGTTTG 


4980 


CCCATTGGTC 


CCATAACATG 


GTTTTCAAGA 


AAACCAAACA 


ACCCGTTTTG 


TTCATCCATA 


5040 


AATAGACCTC 


CTTAATAAAA 


CATAATAATT 


TTTACTTTCT 


AAAGACTAGT 


TTCAAATACA 


5100 


AATTATACTA 


GATCAGGATT 


ATAAACTAAG 


TGAGTTCTTT 


TCCAATTGGA 


CAAATTGTTG 


5160 


ATAAGCCTTA 


TCTGT7CGTT 


TATAAATTTT 


TTTAATTCTT 


CTAATGTCTA 


ACAAACTCAG 


5220 


AACTAAACCT 


AATAGAAGAA 


CTACAAAAAC 


AAATAAACGT 


GCTACTTGGT 


TATTTTCAAA 


5280 


AATCGGAAAA 


ACATTCTTAA 


ACCAACTTGT 


CCAACTTAAA 


ACAAGTAATC 


CTATTGAAAT 


5340 


AAGCATTTGT 


ATTCTAACAA 


ACATTAGTGT 


TATTCCCAAC 


TTTTCTTTCC 


TATTTCCATA 


5400 


AACTTTAAAT 


TGTTCAACAG 


TTGCTAAAAT 


AGAAAATACT 


ATGAGCATAA 


TGGGGAAAAT 


5460 


AATAATAGGC 


GAGGGACTAA 


TAAACTGACT 


CAAAAGCCAA 


TAAATATTCC 


CAAAAAAGAA 


5520 


GAGTGCTATT 


GAATAACGTA 


GAAGAAGATA 


TCGATTGAAA 


AAAGTATTAG 


TTAGAGCCAT 


5580 


CTCTCGACGT 


TGTTGTTCAA 


TCTTTTGTCG 


TTCTTTTTTA 


TCCATATCAT 


TTCCTCCTTA 


5640 


TATAACAACA 


CATATTTAGT 


TAACTTTCTT 


ATAAAGAGCT 


AACATTTCCT 


TTCCTACTTC 


5700 


TAATAATGTC 


ATAGTGGTCA 


TTAAATGATC 


TTGAGCATGT 


ACCATGATAA 


TTTCAATTTT 


5760 


AATTTCCACT 


CCACTTGCGT 


ATTCTTGCAA 


GAGTTTGGTT 


TGTGCATGAT 


GCGCTTCAAG 


5820 
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AATTATCTCA TTTGATTGAT TTAATTTACT TTCTCCATCA TCAAAACTAC CTTCTCTCAT 5880 

TTTTGCAAAT GCTTCATGTA TTTCTGACCT TGCATTTCCC CAATGCAGGA TAATTTCAAA 5940 

TGCTGCAACC TGCAGTTCCT CTTGATTCAT ATAAACCTCC TATTTTATCT TCTCAAATAT 6000 

GTTAATAAAA TCTTCAAAGT TATTGCAAGA TATTAGCTGA TTTTGCAATT CATCATTCTC 6060 

TGTCAGAGAG ACTATCTTTT TAGTCACAGT TGCCAAACCT TCGTTCCCAT ATATTGATGG 6120 

AGATAGAAGA AATACTAGCT GGACATGTGA ~ACTTTGATTA TCCCAGACTA ACCAATCTTT 6180 

ACAAATTGCA ACCGAAACCT TTCCCTCTGT ACCAAAGGGC TGAATAGGAT GCCGAACTGC 62 40 

AATTTTTTCA GAAAAAACAA CTGAACTTAA TTCTTCGCGC TGTTTAATTC CATAAAGTAA 6300 

AGATTGTTCA AACTCATTTG ATTCACCAAC AGATAAACTC TCAACCATCT TTTCAAGTAA 63 60 

ATTTACCTTG TCTGATTCAG TACATATTAA AAAGTTTTCT TTACTAAAAT ACTGTCTAAA 6420 

GCCGTTGTTT TCAAATTTGT TAATCTTTGA TGATTGTACA TAACTAGAAA CTTGCATCTA 6480 

ATCCATACCT TTTCTAATCA TTTCCATCTC ATCACTCTTA AGAAACACAC TAACTTTAAA 6540 

AACTGGGATT TGAAAATATA GATTTGATAA ATCAATAGCT GACACTATAA AATCTATTCC 6600 

TTTAAGTTTT TCTTGATTCA ATTCATAGTA GCCTATTACA TCAACAACTT CTACTCGCTT 6660 

CCCAAACTCC GTTTCCAAAC GATTTCTTAA CATTTGGGCT GCACCAAATC CTGTTGCACA 6720 

AATAGCAAGA ATATTAAACT TAGTACTCTC TTTGCTACGT TCCATAGCAG CTAAAAAGTG 6780 

AAGACTTACA TATGCTACTT CATCATCTGA TATTGTCCAC TCCAAGAACT TGTCCATATT 6840 

TGCAAGAATT TCTCTAGTCA TAAAGAATAT ATCACTATAA TTCTGTTTAA TTTCATCTAC 6900 

CAAAGGGTTA TTTAAGGTAA TCCGGCTTTC TAAACGTACT TGTAGTGTCA TTAGATGAGT 69 60 

TATCAATCCT TCAATTAGTT GGAAATCTGA AGAAAAGTTA TACATATCAT CTAATCCTAA 7020 

ATTCTGAAAT GTTTTAAATA AAGATTTTTT TAAAACTTCT TCAGAAATAT TCTTCTGATT 7080 

TTTTTGACAT TGTTGACTCT TAGCTAACAA ATGCAAAG7A ATGTAGTCTA TTTCCTGAAC 7140 

TGGAAATTCC TGATTTGTTA CTTCTCTTAC TTTAGAAAGA ATTCTTTGGG CAACCTTTCT 7200 

CTCTATTGCA TCATCAGTCA TCTGACAGTC TATATTTTTT ATTTCAAATC CGGATTTTAA 7260 

ACGAATCACA GACAATGCTA TGTGAACTAC TAAATTCTGT AGTACAAAAT CAGATAGTTT 7320 

TAGGTTGGCC TCTTGGCATT CATCCAAAAC AATTCTAGCA AATTCTTCTA ATGGAACAGT 7380 

TTGATCAAAA AAGTTAAATT TTACATAGCA ATGTATTGTT TTAAAAAATT GATTCTCTAG 7440 

GAAATAATTT ATGATAAAAC GTCGTTTATC ACGTTCCTCG CCTGAGACAT AAACTCCTTT 7500 

ATTCGCCCTA CTCTCAATGG ACAAATTATA CTCTGATAAC ATCACTCGTA TCTTTCTGAA 7 560 
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ATCATGAGAT AATGTTGAAC GACTAACGTA AAGTTCATCA GCTAAATCAT CAAAAAGAAC 7620 

TGGAACTTGC TCAAATAATA ATTTATTTAA GATAAATACT AAACGATCAT CACCTTTTGA 7680 

AACCGCAGTT TTCGTATACT CTTCTTCCAG TTCATAAGTT TGTCTAAACT CCTGGTAACC 7740 

GCCTTGATTC TCAAAAAATA TTTGATACCC TTGACCTTGT TTTGAAATCA ACCCGACTCC 7800 

TTGAATAATC ATTGTCTTCT CAATTAATTT CACTACATTA CGGACAGTTC TATCTGAACA 7860 

GGATAAATAT TCTCCCACTT CTTTGCTTGT AACAAAACGT TCCTTATTTT TTATTAAAAA 7 920 

TTGAAGGATA TCTTTCTCTT TAATGTTTAA CACATTCATT CCCTCCTAAA ACGTATGTTT 7980 

TCATATATTG AAGCATATTA TACACTTAAA TCAGTTTATA TCAAACTCAA AACAATTTAT 8040 

CTTAACCTAA ATATTTATTG ACATTTCATG TGTTCATCAA ATATTCTCAA GAATCAAATT 8100 

AGCCATTTTT TCAATTCCCA TTGGAATAGG AATATAGGCT TGAGGAGGTA TTTGTACAAC 8160 

TGGTTTTCCT GCTTTAGAAC CAGCCTCTTC AAATTGCTTA AAGTACATTT TT C TT TC AGG 8220 

ACTGACAAGA TACAAATCAA AAGCTGCTGC TGCGATAGCT TTCCCTCCTT CAGTAGCACT 8280 

AATAGCATCA ACTACAATAT CTTTCCCTTT TCCTTTTAGA AACTC7CTTG TTTTCTGTGC 8340 

CATAAGTGAT GAAGACATTC CTGCTGCACA AATAATTAAA GCTTTTGCCA TAATATTTTC 8400 

TCCTTTTCTT AAATCCAATC AAAGCTGTGC TAAGTTGGCT TATTTGTTAT CTATTTTTAT 84 60 

TATAAAATAA AGCGTTTCCA ATGACAATTC CCTCATTTTC CTAAATGATA TGGAAAAAAA 8520 

TTATTTATAC TTCAATTTAT AAAATAAAAT TATTCCTGAG AGTAGAAATG AAACACTATT 8580 

TGCTAAAATC AAAGGCAAGT CTCCTATACG AATACCATGA GCAAGCCACA ATGCAATACC 8640 

AATAACTTGC ATAACATACA TACCTAGAGC AATAGATCCT GTGTCCTTTG TCTTAACTAC 8700 

ACGAAAAACT TGTGGTAAAA ATGCAAATGT TGTTAAAATT GCTGCAATAC TTCCAATCAT 8760 

ATGTCACCTC AATATGCTAA ACAAACTGAG AATAATCTCA GTTTGTTTAT ACTATTCTAC 8820 

TGATTCACCG TTAGATGAAA TAACTTCCTT ATACCAGCCA AAAGATTTTT TCGGGGAACG 8880 

ATTATAACTT CCCTTCCCAT TATCATCTTT ATCTACATAA ATAAAGCCAT AACGTTTCCG 8940 

CATTTCACCG GTACCAGCTG AAACCAAATC AATACATCCC CATGGAGTAT AACCCATTAA 9000 

ATCAACACCA TCTTCAACTA CAGCCTTTTT CATTTCACGA ATATGGGCAC CTAGATATTC 9060 

AATTCTATAA TCATCATGTA CCATACCATC TGCTGCAACT TGATCTATAG CTCCAAAACC 9120 

ATTTTCAACA ATAAAGAGTG GTAAGTGATA GTGGTCTGTA AACCAATTTA ACCCATAACC 9180 

CAAACCTTCT GGATCAATTT GCCACTCCCA TTCAGAAGCC TTAACATAAT TATTTTTCAC 9240 

TAAATCTTCT GTTTCAAGAT AATCAAAATA AGGATTATTT TCACGATGAG AGTCGATAGC 9300 

AAAGGACATA TAGTAACTGA AACCAATGTA ATCTACAGTC CCACCAAGTA AATCTTCTTT 9 360 
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ATCCTCCGCA GTAAAATCAA CTGAAATACC TTTTCGTTCC CAATACTTGA AAATATGCTC 94 20 

AGGATATTTA CCTAAAACAT CCACATCAGC AAAATAATAA CCCTTCTGCA TAGCTTTCAT 9480 

TGCCATTAAG ATATCCTTAG GATTGCAAGT AACTGGATAA ATTGGACACA TCGCAATCAT 95 40 

ACAACCTATT TGAAAATCTG GATTAATCTC ATGACCAATT TTTACAGCTC CTGCAGAAGC 9600 

AACTAATTCG TAATGTCCTC CTTGATACAT AATTGCTTCT CTATTATCAC CTTCCTCATA 9 660 

-TACAATACGT- GAGTTAGTAA ATCGTGCAAA ATCTTCCTGA TAATTCGCTT GATTATTGAT 9720 

TTCATTGAAA GTCATCCAAT ATTTAACCTT ATCTTTGTAA CGTTTAAATA CGACTTCTGC 9780 

AAAACGAGCA AAGAAATCAA TCAATTTCCT ATTTTTCCAA CCA CC AT ATT CGGTCACTAA 9840 

GTGATAAGGC ATTTCAAAAT GAGATAGAGT GATGACAGGT TCAATACCAT TCTTTAAGCA 9900 

TTCATCAAAA AGATTATCAT AAAACTGTAA TCCTTCTTCA TTCGCCTCTA ACTCATCACC 9 960 

TTTTGGAAAG ATACGTGTCC ATGCAATAGA GGTACGGAAG CACTTGAATC CCATTTCAGC 10020 

AAAAAGTGCT ATATCTTCTT TATAACGGTG ATAAAAATCT ATCGCCTCAT CATTTGCATA 10080 

ATATTTACCC TCTAAAACTC CCAAAGTAAT TTCACGAGCT ACTCCATGAC GACCAGCAGT 10140 

CATAACATCA GCAACACTAA TTCCCTTGCC ACCTTCTTGC CATCCACCTT CAAGTTGATG 10200 

AGCAGCAACA GCACCACCCC ATAAAAATCC ATCTTTAAAA GTAGTCATCT TTTTTCCTCC 102 60 

TGACTTTGAT ACTCTTATTA TAAACCTTAA ACCAAAAGAT GAAAACGCAT TCTTTTTCCT 10320 

TATTGTTAAG GAAAGAACTA ATTTTTAATG GAAATAGAAC AATATCTTCT TGTATTCTCG 103 80 

TAATGATATC TTTACGATTT TCAATACTTT CAAACTACAA AAACTCTCAC AATAATTCTA 10440 

ATTCCCTGTG TCTATAAACG ACTTATCGCT TTCTGCCATC CCAGAATCAT CTTCTATATA 10500 

ACGTTCAACT TGCATCTGCA AGTGATATTT TTTTCTTAAA TCTAAGATTT TCTGCATTGT :0S60 

CTTTGATTGA TAATGTTTAT CTAAAGTTTC TTGATTTATC CACTGATCAA TAAGGAGAAT 10620 

AGTTCCCTCT TTTTCAATTC GTAAAAAATA TTCGTATTTC AACTTACCTT TTTCATTTCT 106 80 

AATTTCTTTA ACAAGGCCAC TATCAAGCAT TTCTCTTGCA AACTTTATTG CACTATCTCC 10740 

ATCACCT7TA TAATATACAT GAATAGTCAA TGTCATCTTA TATCCTCCAA AATCATCCTT 10800 

CAATTTTAAA AAAACAAGTT TAGATGAGGA TCTAAACTTC TTTTTTATGA ACTAATTATC 10860 

TAACGTTTCG CCATTACTTT CAATCACTTC TTTATACCAA TAAAATGATT TTTTCTTATA 10920 

CCGATTTATA GTCAATTGAA ACAAGAGCAG GACAAAAGAG CCTCATAAAA GGTATTGCAA 10980 

CTTGGTAATA CCTTTTTGAG GTCCTTTTTG ATATGAGCCC ATCTTTTCTC AATAGGATTC 11040 

TACTCAGGTG AGTAGCCACG AAGAGGTAAA AGTTTATACC CAAACTCTTC ACACAAGAGT 11100 
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TCTAGCTTCC CCATTCTATG GAATCTTGCA TTATCCATAA TAATAACCGA TGGTGTGGTT 11160 

AATCTTGGTA AGAGAAACTT CTGAAACCAA GCTTCAAAAA ACTCCCTCGT CATCGTCTCT 11220 

TCGTAAGTCA TTGGAGCGAT TAACTCACCA TTTGTTAGAC CTGCAACCAA AGAAATCCTC 11280 

TGATATCTTC TTCCAGATAC TTT 11303 



(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3112 base pairs 

(B) TYPE : nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

CCTTAGATTT CCACTTGCCA GAGGAATTGA TTGCCCAAAC GCCCCTTGAA AAACGTGATG 60 

CCTCCAAACT CCTCATCGTC AACCGTGAGA CAGCAGAAAT GCAAGATAAA CATTTCCACT 120 

CTATTATTGA TATGCTGCAA CCTGGTGATG CCCTTGTCAT GAACGACACC CGAGTTCTCC 180 

CTGCCCGCCT CTATGGTCAA AAAGTGGAGA CAGGAGGTCA TGTGGAACTT CTCCTCCTTA 240 

AGAACACTAG TGGAGACGAG TGGGAAGTTC TCGCTAAACC TGCCAAACGC CTCAAGGTCG 300 

GTACTCGTAT CAGCTTTGGT GATGGCCGCC TCAGCGCTGT CGTTACAGAA GAATTGACCC 360 

ACGGGGGACG CATTGTCCGC TTTGAATACC AAGGAATTTT CCTAGAAGTC TTCGAAAGTC 420 

TGGGAGAAAT GCCTCTGCCA CCTTATATCC ACGAAAAATT AGATGACCGT GAACGTTATC 480 

AAACCGTCTA CGCCAAGGAA AGTGGCTCTG CTGCAGCACC GACTGCTGGT CTTCACTTCA 540 

CCAAAGAACT GCTGGCAGAA ATCCAAGCTA AGGGTGTTCA TCTAGTCTAT CTGACTCTCC 600 

ATGTCGGACT CGGAACCTTT AGACCTGTTT CTGTGGATAA TCTGGACGAA CACGAAATGC 660 

ACTCAGAGTT CTATCAACTT TCTGAGGAAG CTGCTGCCAC CCTTCGCTCT GTCAAAAAAA 720 

ATGGTGGTCG TGTCATCGCT GTCGGAACCA CTTCTATCCG CACCTTCGAA ACTATTGGTT 7 80 

CCAAGTTTGA TGGGCAAATC CAAGCAGATT CTGGTTGGAC CAATATCTTT ATCAAACCTG £40 

GGTATGAGTG GAAGGTCGTG GATGCCTTCT CAACCAACTT CCACCTGCCA AAATCAACTC 900 

TGGTCATGTT GGTTTCTGCC TTTGCAGGCC GTGAATTAGT CTTAGATGCC TACCACCATT 960 

CCATCCAAGA ACACTACCGC TTCTTCAGTT TTGGTGACGC CATGTTTATT TATTGAGAAA 1020 

GAATTTCTCT AAATCTTCTA ATACCAATAA ATCGCTAAGA TATTATTTCA AAGAACATCT 1080 

ACAATTGAAA CTCTAGCTAG CTGTAGAAGA GGCCTAGTAC ATTGAAATTA AAATGCTTCC 1140 

CCCTAGCTTC GAAAATATTG CCATAGATTG CCTTGACTCT CCAAATTGAT TCATCTATAT 1200 
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TTTATTTCAC CTTCCTATAC T77CTTCGC7 GTTTGTAAAT CAAAATGCAA GACACATCAG 1260 

TAGCACCATA TTTGTTACTC TTATCTGTCC TC7CAACAGA C7ATTATGAG TTATTTCAGA 13 2 0 

ATCATTCACT ACTTTGACCC TGACTCTCCT TAGTCTCAAA ATCAAAGACT TATACTCTTC 1J80 

AAAAATCTCT TCAAACCGCG TCAACGTCAC CTTGGATTAT ATATGTGacC TGaCTTCGTC 1440 

AGTTCTATCT ACAACCTCAA AGCAGTACTT TCAGCAACCT GCGACTAGTT TTCTAGTTTG 1500 

CTCTTTGATT TTCATTGAGT ATTAAACAAA AAGTGAACAA ATCTGAATTC TAATGTACAG " 1560 

AAGACTACGC TTGTTCACTT TT77ATAGTC GCTATAAGAT CACC77A7CT ATAGCTTTTT 1620 

ATATATAATT ATATATTCAG ACATACTATT ATCAATTTTG TCGCAGGGAG GAATCTGTTA 1680 

ACCCACCCAT TCACCA77AT CATTGACTCT A7AGCCATCT A7AC77G7A7 7GACCGC7AA 1740 

C7CACCCGAT G7A777ACAT AA7ACCA7TT ACCACCAACT 7GGAACCA77 GATTGAC7T7 1800 

CATAGAACCG 77GCTG7TGA GG7AG7ACCA TGAACTA7TA AC7TG7ACCC AACCTCTTGC 1860 

CATGGAACCA TCAGTAT7A7 AAAAATACCA CA7ACCAT77 7CT7GTTTCC AG7CTGTTCT 1920 

TGGAGCAAC7 GCTTTAGCTG GT7C7AC7GC 7ACA7CTG77 CCTTGGT7AG A7G7AACAGA 1980 

TACAGGATAC GAAGGAA7AG A7GATTGCTC AGGAACAACA ACTTTTTCAC GTTCTCTCGT 2040 

CCCTCTCCTT ATACGTCTTT TTACCATCTC 7T7AGTAATT 7GACGAGAAG 7AGTTTCTTC 2100 

AATTGTTCCA 7CACG77CAT C7ACAGTATA GATTGTAGTA AGAG7AAT7T ACCAATTTCT 2160 

CCTACTTC77 CTACT7C7TG ACT7TTATCA AGACTTCCGC CATCGAGA7A 7TCTG7TTCG 2220 

ATTGGAAT7T C77GGACAAG AACT7GGGGC T7GG7TC7TT 7TTTAACAAC 7CTTGTTTGA 2280 

GAGTC7TT7T 777CAC7TAA AGTACTC7CA G7TACT7GTC CACTCTTTCC ATCTACATTA 2340 

TAAGTTATCG 7TG7AAC7G7 TT7CCCAT7C 7TTCC7AGAC TAATCTCT7G CTCCTGTCC7 2400 

CCAGAAAGCT CAT7GTCTGC 7TCATAT7TA G7AGCAAA7G GAACAAGAAC T7C7TCAACC 2460 

TTGCTTT7AG C7GGAAC77T GATAAC7G7A 7CCG7GGC77 CTTT7CTATC AACAG7AACC 2520 

7GTTCGG7AA CATAACCAGT CTC7GGA77A ACA7CG7AGG 7CC77G7CG7 AG T7 AC AT AG 2 580 

CCATCCTC7C CATCAAT7GT AACAGGAT7T TCAC7ACGG7 C7TTTCTTTC ATCTTTTTCA 2 640 

TAACGAATTC GCGTAC7TGA AAT77TCTTG G77ACTACC7 TAGGTTTAG7 CGCTACTT7T 2700 

ACAATAATAT CCCCATTGTC AGCG7CATCA 7ACTC7A7TC CC7C7TCT7T ATCTCTAGTA 2760 

TCATCTCTGA CATATTGAAT CCCATCAGCA GCATGAACAA AACTTGTATT CAGATTCCTC 2320 

CTAAAAATAA AGTTAGCCCG ATTACCGCAG AACCAAAAAT C7T7CCGAGT TTACGTAT7G 2 880 
CATAGCGCTT A7TAG7A77A GATT77GCCA TTACATCC7A CTTCTAG7AT AGCATCTTTT - 2940 
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CTATCAAACG TTAAACAATA TACGTTATAT ATAAAATAGA CTTAGAATGA TAT ATTG AT T 3000 

ATTGAACTAA CACTTTAACT ATATCGTAAT CAATCTCATA TATAAAGGAT TGCAGACATC 3 060 

TTATCTAAAT ACATGCGAAT ATATTTAGAT ACAAACATTC CAACTTGATA AT 3112 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4327 base pairs 
(8) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

CCCAAAAATC TCTTCAAACC ACGTCAGCTT CCCCTTCCCC TAGTATGGTT ACTGACTTCG 60 

TCAGTTCTAT CCACAACCTC AAAACAGTGT TTTGAGCATC ATGCgGCTAG CTTCTTAGTT 120 

TGCTCTTTGA TTTTCATTGA GTATAAAAAC AGATGAGTTT CTGTTTTCTT TTTATGGACT 180 

ATAAATGTTC AGCTGAAACT ACTTTCAAGG ACATTATTAT ATAAAAGAAT TTTTTGAAAC 24 0 

TAAAATCTAC TATATTACAC TATATTGAAA GCGTTTTAAA AATGAGGTAT AATAAATTTA 300 

CTAACGCTTA TAAAAAGTGA TAGAATCTAT TTTTATGTAT ATTTAAAGAT AGATTGCTGT 3 60 

AAAAATAGTA GTAGCTATGC GAAATAACAG ATAGAGAGAA GGCATTCAAC CTTACAAAAG 420 

GGGAATAATA TGATATTTAA GGCATTCAAG ACAAAAAAGC AGAGAAAAAG ACAAGTTGAA 4 80 

CTACTTTTGA CAGTTTTTTT CGACAGTTTT CTGATTGATT TATTTCTTCA CTTATTTGGG 540 

ATTGTCCCCT TTAAGCTGGA TAAGATTCTG ATTGTGAGCT TGATTATATT TCCCATTATT 600 

TCTACAAGTA TTTATGCTTA TGAAAAGCTA TTTGAAAAAG TGTTCGATAA GGATTGAGCA 660 

GGAAGTATGG TGTAAATAGC ATAGGCTGAT GTCCATCATT TGCTTATAAA GAGATATTTT 720 

AGTTTAATTG CAGCGGTGTC CTGGTAGATA AAC TAG ATTG GCAGGAGTCT GATTGGACAA 780 

AGGAGAGGGG AAAATTGCCA CCAATTTGAG ATAGTTTGTT TAGTTCATTT TTGTCATTTA 840 

AATGAACTGT AGTAAAAGAA AGTTAATAAA AGACAAACTA AGTGCATTTT CTGGAGTAAA 900 

TGTCTTATTT CAGAAATCGG GATATAGATA TAGAGAGGAT CAGTATGAAT CGCAGTGTTC 960 

AAGAACGTAA GTGTCGTTAT AGCATTAGGA AACTATCGGT AGGAGCGGTT TCTATGATTG 1020 

TAGGAGCAGT GGTATTTGGA ACGTCTCCTG TTTTAGCTCA AGAAGGGGCA AGTGAGCAAC 1080 

CTCTGGCAAA TGAAACTCAA CTTTCGGGGG AGAGCTCAAC CCTAACTGAT ACAGAAAAGA 1140 

GCCAGCCTTC TTCAGAGACT GAACTTTCTG GCAATAAGCA AGAACAAGAA AGGAAAGATA 1200 

AGCAAGAAGA AAAAATTCCA AGAGATTACT ATGCACGAGA TTTGGAAAAT GTCGAAACAG 12 60 
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TGATAGAAAA AGAAGATCTT GAAACCAATG CTTCAAATGG TCAGAGAGTT GATTTATCAA 1320 

CTGAACTACA TAAACTAAAG AAACTTGAAA ACGCAACAGT TCACATGGAG TTTAAGCCAC 13 80 

ATGCCAAGGC CCCAGCATTC TATAATCTCT TTTCTGTGTC AAGTGCTACT AAAAAAGATC 1440 

AGTACTTCAC TATGGCAGTT TACAATAATA CTGCTACTCT AGAGGGGCGT GGTTCGGATG 1500 

GGAAACAGTT TTACAATAAT TACAACGATG CACCCTTAAA ACT T AAA CCA GGTCAGTGGA 1560 
ATTCTGTGAC TTTCACAGTT GAAAAACCGA CAGCAGAACT^ ACCT AAAGGC CGAGTGCCCC " ~" 1S20 

TCTACGTAAA CGGGGTATTA TCTCGAACAA GTCTGAGATC TGGCAATTTC ATTAAAGATA 1680 

TGCCACATGT AACGCATGTG CAAATCGGAG CAACCAAGCG TGCCAACAAT ACGGTTTGGG 1740 

GGTCAAATCT ACAGATTCGG AATCTCACTG TGTATAATCG TGCTTTAACA CCAGAACACC 1800 

TACAAAAACG TAGTCAACTT TTTAAACGCT CAGATTTAGA AAAAAAACTA CCTGAAGGAG I860 

CGGCTTTAAC AGAGAAAACG GACATATTCG AAAGCGGGCG TAACGGTAAC CCAAATAAAG 1920 

ATGGAATCAA GAGTTATCGT ATTCCAGCAC TTCTCAAGAC AGATAAAGGA ACTTTGATCG 1980 

CAGGTGCAGA TGAACGCCGT CTCCATTCGA GTCACTGCGC TGATATCGGT ATCGTCATCA 2040 

GACGTAGTGA AGATAATGGT AAAACTTGGG GTCACCGAGT AACCATTACC AACTTACGTG 2100 

AC AAT CCAAA ACCTTCTGAC CCATCGATCG GTTCACCAGT GAATATCGAT ATGGTGTTGG 2160 

TTCAAGATCC TGAAACCAAA CGAATCTTTT CTATCTATGA CATGTTCCCA GAAGGGAAGG 2220 

GAATCTTTGG AATGTCTTCA CAAAAAGAAC AACCCTACAA AAAAATCGAT GGAAAAACCT 2280 

ATCAAATCCT CTACCGTGAA GGAGAAAAGG GAGCTTATAC CATTCGAGAA AATGGTACTG 2 340 

TCTATACACC AGATGGTAAG GCGACAGACT ATCGCGTTGT TGTAGATCCT CTTAAACCAG 2400 

CCTATAGCGA CAAGGGTGAT CTATACAAGG GTCACCAATT ACTAGGAAAT ATCTACTTCA 2460 

CAACAAACAA AACTTCTCCA TTTAGAATTG CCAAGGATAG CTATCTATGG ATGTCCTACA 2 520 

GTGATGACGA CGGGAAGACA TGGTCAGCTC CTCAAGATAT TACTCCGATG GTCAAAGCCG 2580 

ATTGGATGAA ATTCTTGGGT GTAGGTCCTG GAACAGGAAT TGTACTTCGG AATGGGCCTC 2640 

ACAAGGGACG GATTTTGATA CCGCTTTATA CGACTAATAA TGTATCTCAC TTAGATGGCT 2700 

CGCAATCTTC TCGTCTCATC TATTCAGATG ATCATGGAAA AACTTGGCAT GCTGGAGAAG 27 60 

CGGTCAACGA TAACCGTCAG GTAGACGGTC AAAAGATCCA CTCTTCTACG ATGAACAATA 2 820 

GACGTGCGCA AAATACAGAA TCAACGGTGG TACAACTAAA CAATCGAGAT GTTAAACTCT 2880 

TTATGCGTGG TTTGACTGGA GATCTTCAGG TTGCTACAAG TAAAGACGGA GGAGTGACTT 2 940 

GGGAGAAGGA TATCAAACGT TATCCACAGG TTAAACATCT CTATGTTCAA ATGTCTGCTA 3000 
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TCCATACGAT CCACCAAGCA AAAGAATACA TCATCCTCAG TAATGCAGGT GGACCGAAAC 3060 

GTGAAAATGG GATGGTCCAC TTGGCACGTG TCGAAGAAAA TGGTGACTTC ACTTGGCTCA 3120 

AACACAATCC AATTCAAAAA GGAGAGTTTG CCTATAATTC GCTCCAAGAA TTAGGAAATG 3180 

GGGAGTATGG CATCTTGTAT GAACATACTG AAAAAGGACA AAATGCCTAT ACCCTATCAT 3240 

TTAGAAAATT TAATTGGGAA TTTTTGAGCA AAAATCTGAT TTCTCCTACC GAAGCGAACT 3 300 

AGAGAGATGG GCAAAGGAGA GATGGGCAAA GGAGTTATTG CCTTGGAGTT CGACTCAGAA 3 3 60 

GTATTGGTCA ACAAGGCTCC AACCCTTCAA TTGGCAAATG GTAAAACAGC GACTTTCCTA 3 4 20 

ACCCAGTATG ATAGCAAGAC CTTGTTGTTT GCAGTAGATA AGGAAGATAT CGGACAGGAA 34 80 

ATTATTGGTA TAGCTAAAGG AAGCATCGAA AGTATGCATA ATCTTCCTCT AAATCTAGCA 3 540 

GGTGCCAGAG TTCCTGGCGG AGTAAATGGT AGCAAAGCAG CGGTGCATGA AGTTCCAGAA 3600 

TTTACAGGGG GAGTTAATCC TACAGAGCCA GCTGTTCATG AAATCGCAGA GTATAAGGGA 3 660 

TCTCATTCGC TTGTAACTCT TACTACAAAA AAAGATTATA CTTACAAAGC TCCTCTTGCT 3720 

CAGCAGGCAC TTCCTGAAAC AGGAAACAAG CACAGTGACC TCCTAGCTTC ACTAGGACTA 3780 

ACAGCTTTC? TCCTTGGTCT GTTTACGCTA GGGAAAAAGA GAGAACAATA AGAGAAGAAT 3840 

TCTAAACATT TGATTTTGTA AAAATGGCTC TTTGTCAACT GTAGTCGCTT GAAGTCAGCT 3900 

AAGCTCGAGA AAGGACAAAT TTTGTCCTTT CTTTTTTGAT ATTCAGAGCG ATAAAAATCC 3960 

GTTTTTTGAA GTTTTCAAAG TTCCGAAAAC CAAAGGCATT GCGCTTGATA AGTTTGATGA 4020 

GATTATTGGT CGCTTCCAAT TTGGCGTTAG AATAGTGTAG TTGAAGGGCG TTGACGATTT 4080 

TCTCTTTGTC CTTTAGAAAG GTTTTAAAGA CAGTCTGAAA AAGAGGATGA ACCTGCTTTA 4140 

GATTGTCCTC AATGAGTCCG AAAAATTTCT CCGGTTCCTT ATTCTGAAAG TGAAACAGCA 42 00 

AGAGTTGATA GAGCTGATAC TGATGTTTCA AGTCTTGTGA ATAGCTCAAA AGCTTGTTTA 4260 

AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGGCG ATAAAAATGT TTATCGCTGA 4 320 

GTTTACG 43 27 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3521 base pairs 
O) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

CTCTGGCCCT GCCACTCCAA CGTTTTGTCA GGGTGCTTTT 7TCATAAAGG AGTTCTTATG 60 
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TTAGATATCA AACGTATTCG TACACATTTT GAAGCTGTCG CAGAAAAATT AGCTACACGT 
GGTGTAGATG CTGCTGTCTT GAATGAAATG AAAGAAATCG ATGCTAAACG TCGTAACATC 
TTGGTCAAGG TTGAAACTCT CAAAGCAGAA CGTAACACAG TTTCTGCTGA GATTGCCCAA 
GCTAAGCGCA ACAAGGAAAA TACAGATGAC AAGATTGCTG CCATGCAAAA TCTATCTGCT 
GAGGTTAAAG CCTTGGATGC TGAATTGGCA GAAATCGATG CTAAATTGAC AGAATTTACA 360 
ACGACTCTTC CAA^TATCCC AGCTCACAGC GTTCCTGTTG CGCCTGACCA AGACGACAAT 4 20 

GTGGAAGTTC GCCGTTGGGG TACTCCACGC GAGTTTGACT TCGAACCTAA AGCTCACTCC 
GATCTCGGTG AAGACCTTGG TATCCTTGAC TGGGAACGCG GTGG7AAGGT AACAGGCGCT 
CCCTTCCTCT TCTATAAAGG CCTCGGTGCT CGTTTGGAAC GTGCTATCTA CAACTTTATG 
TTGGATGAAC ATGGAAAAGA AGGCTATACT GAAGTCATCA CACCTTACAT AGTCAACCAT 
CATTCTATGT TTGGTACTGG TCAGTATCCA AAATTTAAGG AAGATACTTT TGAACTCAGC 
GATACCAACT TTGTCTTGAT TCCAACTGCT GAAGTTCCTC TGACAAACTA CTACCGTGAT 
GAAATCTTAG ACGGCAAAGA TCTTCCAATC TACTTCACTG CCATGAGTCC CTCATTCCGT 
TCTGAGGCTG GTTCTGCCGG TCGTGATACG CGTGGCTTGA TCCCTTTGCA CCAATTCCAC 
AACCTTGAAA TGGTCAAATT TGCCAAACCA GAAGAATCTT ACGAAGAATT GGAAAAAATG 
ACAGCCAACG CTGAAAACAT TCTTCAAAAA CTCAACCTTC CATACCGTGT CGTTGCTCTC 
TCTACTGGAG ATATGGGCTT CTCAGCTGCG AAGACTTACG ACTTGGAAGT GTGGATTCCA 
GCACAAAACA ATTACCGTGA AATCTCAAGC TCTTCAAACA CACAAGATTT CCAAGCCCGT 1140 
CGTGCCCAAA TCCGTTACCG TGATGAAGCA GATGGCAAGG TGAAACTCCT TCATACCTTG 



120 
190 
240 
300 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



1200 



AACGCTTCTG GACTTGCAGT TGGACGTACA GTGCCTGCAA TTCTTGAAAA TTACCAAAAT 12 60 

GAAGATGGTT CTGTGACCAT CCCAGAAGCA CTTCGTCCAT ACATGGGTGG AGCTGAAGTC 13 20 

ATCAAACCAT AAAAAATAAG CTTTAGCTAT TTCTAGCTAG ACCTTTTTTC GTAACCAAAT 13 80 

CAGATAAGCA CCTAGTACAA AGAATAAAAT ACTTAGGCAT ATAATGGTTT CAGCCAATAC 1440 

CAGGTAATCC AGAAATGGAA GTTTCAAAAT TCCCTGAGCC ATCTTGAGCG ACGTCCCTCT 1500 

GATAATGGTT GGGAAGGTGA GGG CTGAGAA CCCTGGTTGA AAACCTTGTT TTAAAATGTT 1560 

GGGCAGACGA GTTAAAACAA AGAAAAAGAA GGATTGAGAA GCCAAAATCA TGACAATCAA 1620 

GACCCAAGTC GGCAGGCTGG TTCCTCCTAC TCGAACTAGA GAAGCCAAGA GTAGAGAGAA 1680 
AGGAGCACAG TAGATTCCTT CTTGTCCAAG CAAGGCTAGT GGGAGTGGAT GTTTCTTTAA 



1740 



ATCGCTATAA ATAAGGGGAT AGAGATAGAA GGTCAAGAGA AAACCAAAAC TCAAGGTCGC 1800 
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ATAGGCAATT TCGATAATAC CTACCAGAGG ATAGGTCAAG GCAGCCACTG CTATCCCCAC 1860 

ATAGAGAACC GTCCAGCTTG GAGTGGCATG AACCCTCCGC CCTGCACAAG CAAACTTCAT 1920 

GGTAAAACCA GCAATCAAGG TCAAATCCAA GAGAAATGAA AACCACCAAA TCCCTTGTGC 19 80 

TACCAAAGGA AGATAAGAGA ATACGCGAAA GACATAGGTC GATAAAATCA TCCCAGCCAT 204 0 

AGGAAAGGTT GCCATTCCTG ACAAAAGAGG GGGCTTGGTC AATTCTTGCT TGGTTTCTTT 2100 

CCAATTAAAG AGATGCAGAA TTAGAAAGTA AATCCATAAA ACCAAACCAA TC AG ACT AAA 2160 

AAG ATGGG AT AGAACCGGCA ACGTATCTAA AATAAGATTT CCAGCTCCTG CCAAACCTAG 22 20 

CAAACAACCT GAAAATACTA AGGGGAGTTT TTTCATCCTA ACCTCCAATA ATCATGTTAG 2 2 80 

TTTCAGTATA ACATAAAAGC GCTTAAATGA GGATTTAAAA AAACGACTCC GCTTATTTCA 2 340 

GACTTCATTT TACTCAGATA TGAATTAGGC ATAAGGTTGC AATTCTGGAT TAATTGGTGT 2400 

ATTACCTAAC TTGTTGGCAT AGTTACAGAG GATTGCTAGG CTGACACCAA AAACCACATC 24 60 

CAAGGCATTT TGTTGAGTGT AGCCAGCTTC TAAAAACTCA GACAAGGCTT CATCTCCTAC 252 0 

ACGACCCTTG GTATTGATAA CTGCCAAGGT AAACTTAGCT AGGGTATCCA ATTTAGGATC 2580 

TGTTTCAATT GGAGTACGAT TGCGAAGAGC TTGAATCAAG TCATCATTCA TCTGGATTTG 2640 

TTTGATGGAA AAGGCTGTGT GACCTGCGAC ACACAACGCA CAACCATTGG TCACGGCTGC 2 700 

CGTGATTTGC ACCACTTCAC GCTCAACGGG TGTCAGGCTG TTGCGACGGT CCATAGATGA 2 760 

GACAATTTGG TAGGCTTCTA AAACAGTCGG CCCATTGCCC AAGAGACCGA TTAGGTTGCC 2 820 

AATATAGCCA TTGTTGTCTT TTTCTACTGT TTCAAGAATT TCTTTCACTT CTGCTGGTGC 2880 

TGACTCTACT GTATGGATAG TAAATGTTGT CAT AAG AT AC CTCTTTTCTT ATTATTGACA 2940 

CTAATATTAT TGGAAAATCT TATAAAATCC TGATTCCTAA GTTTATCTAA GATAAAGCTT 3000 

TATTCTCTCA TAACATTTTC GTTGTTATAT TAGTTTATCA CACTTCCAAT CACTTGTATA 3060 

ATATATATTA TATATCAGGC TGATAAAAAT TATTTATAGG CAAAAAAATC ACACGAGCTG 3120 

TGTGATTCCA TTATTTGTCA AAATACTTTT TAGTTTCAGC AATAACCACT GGCGACAAGA 3180 

CCAAGAGGGC AATCAAGTTT GGCAGAGCCA TCAAGGCGTT AACGATATCT GCGATAATCC 3 240 

AGACCATATC CAACTCGATA AATCCTCCTA ACAAGACCAT GAGCACAAAA ACCACACGGT 3 300 

AGAGCCAGAT AAAGCGAACC CCAAAGAGGA ACTCAAAACA GCCTTCTCCG TAATAGTTCC 3 3 60 

AACCTAGAAT CGTTGTAAAG GCAAAAAGTA CAAGGAAGAT GGTCAAGAGA GCAGGCCCAA 34 20 

AGTGTGAAAA GTTTGTTGAG AAAGCTGACT GAGTCAAGGC AACCCCATTC AAGTCACCGC 34 80 

TCCAAACTCC AGTTACCAAG ATGGTCAAAC CAGTTAGAGT A 352 1 
(2) INFORMATION FOR SEQ ID NO: 119: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1968 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ TD NO: 


119: 






AACCTGGGCA 


AGCAAGCTAA 


AAGCAATGGG 


ACCTCGAATC 


CTAATGGCAA 


CTGCCGCTGT 


60 


TGGAGGTTCC 


CACATTGTAT 


CCTCAACTCA 


AGCTGGCGG? 


TCTTACGGTT 


GGTCTCTACT 


120 


TCTCTTGGTC 


ATCTTAGCCA 


ATGTCTTTAA 


ATATCCATTT 


TTCCCTTTTG 


GTGCTGAATA 


L80 


CACAGCTGAT 


ACTGGAAAGA 


CTTTGGTTGA 


AGGTTATCCC 


GAAAAAGGAA 


AACTCTATCT 


2 4 0 


CTGGATTTTC 


TTTATCCTCA 


ATGTCTTTTC 


GGCTATGGTC 


AACACGGCTC 


GTGTTGCCAT 


300 


TCTGTGCTCA 


GCTATCATCG 


CCAGTGCCTT 


CCCAATGATT 


GGACTTAGCA 


TTACTCAGTG 


360 


GTCCCTCATT 


CTCGTTGCAA 


TCATTTGGGC 


TATGCTACTC 


TTTGGAGGCT 


ACAAACTTTT 


420 


AGACGGCATG 


GTCAAATGGA 


TTATGTCTGC 


CTTAACCATT 


GCGACTGTTC 


TTGCAGTTAT 


480 


CATTGCGGCG 


GTCAAGCATC 


CAGAATACAG 


TTCTGATTTT 


GTCGAGAAGA 


CACCTTGGCA 


540 


AATGGCACCT 


CTCCCCTTCA 


TCGTCTCCCT 


CCTAGGATGG 


ATGCCGGCTC 


CTATTGAAAT 


600 


TTCAGCCATC 


AATTCACTTT 


GGTCAGCTGA 


AAAGAGAAAG 


ACCGTCAACT 


TTAACACAGA 


660 


ACACGCTCTG 


TTTGACTTTA 


ACACTGCTTA 


TATTGGAACA 


GCTATCCTAG 


CCGTCTTCTT 


720 


ruTGGCACTG 


GGAGCACTGA 


TTCAGTATCC 


TACAGGGCAG 


CCGGTTGAAG 


CTGCTTCAGC 


780 


CAAATACATC 


TCTCAATTCG 


TGGGCATGTA 


TGCCTCTGTT 


CTTGGCGAAT 


GGTCCCGTTA 


840 


CTTGATTACC 


TTTATTGCCT 


TCCTCTGTAT 


CTTTGGAACA 


GTTATAACTG 


TTATCGATGG 


900 


CTATTCTCGC 


GTTAATCAGG 


AATCTCTCCG 


ACTGCTAATC 


AGTCAAAAAG 


AGGACAATCG 


960 


TAAATCTTTG 


AACATCTGGA 


TG AC CATC AC 


TGCTATCATC 


GGTATCGTCA 


TTATCAAGTT 


1020 


CTTCGCTGGT 


CAGGTTTCAA 


CCATGCTCCG 


CTTTGCCATG 


ATTGGCTCTT 


TCCTGACAAC 


1080 


ACCTTTCTTT 


GCTCTTTTGA 


ATTACGCCTT 


GGTAACGCGT 


GAAAACAAAA 


ATCTTCCTTC 


1140 


TTGGCTCAAA 


CACCTTGCCA 


TTGCGGGATT 


GATTTTCCTC 


TTTGCTTCGC 


CATCTTCTTT 


1200 


ATCTACGCAC 


TCGCAATCGG 


AAAAGCAGGG 


TAAGGGACAA 


GCGCGAGATG 


AAGATAAGGT 


1260 


TTCATTTCAA 


GAGAAAATTC 


AGCAAATATT 


TCTATGATAA 


AAAGCATAAG 


AACAAGGTTT 


1320 


TGAAGACCTG 


AACTTATGCT 


TTTTTACGTT 


CTTAAAGACT 


GTTTATACTC 


AAAAAACAGT 


1380 


TGAACAACTT 


CAACCACCTC 


TTATAAGAAC 


TTTATACTAT 


TCGAGAATCT 


CTTCAAACCA 


1440 
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CGTCAGCTCT ATCTGCAACC TCAAAGCTGT GCTTTGAGCA ACCTGCGACT AGCTTCCTAG 1500 

TTTCCTCTTT GATTTTCAT? GAGTATTAAT TCTCCTTTTC CAACTCATAC AAATCTCCGA 1560 

TAATAGCTGC GACATGTTTG ATATCTTCCA GCATGCCTCG CATTTCAAAG TCAGCCAATA 1620 

CAGGGAAGCC AAAGCCTTGA CTGTATTGCT TGCCTGTTAG GCAGTATTGG TTATTAAAGT 1680 

TACGATTTCC TGACCCAACC ACACCAAAAC ACTTACTAGC ATTGTTACCA TAGGCAATAA 174 0 

AATCTCCCAC CCGTGTCGTC AAAATCTCAA CATCTCCGTT ATCCACGCCA TTCCCACCTT 1800 

CGAGATAGGT CGGCAAAAAA GCGACATAGG GATGGTCCAT TTCATAGAAA TTTTTGCCTT 1860 

CCTTGACCAA ATCCTTGATA TGAATCTTTT GAACCTCAAT CCCTTTGTAC TGGGACAAGA 1920 

GATACTCTTT CAAGCGCGTC ACAAAACTTT CAGTGTTGCC ACTCAAGG 1968 
(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 120: 



CCGCATTTTT 


TATCACTAGA 


CTCGAGACAT 


CTTTTGAGTG 


GCTCTTGCTC 


TCTGGTTTAA 


60 


TTTTCTTCCT 


TCCTCAAGGA 


CTCCTGCTAT 


TTCTCTTGGT 


CGTCCGACTC 


AAACATCAAT 


120 


TCGCTGAGAT 


TTATCCTCAA 


ATCAATAAAA 


AGATTCGCTT 


CTACTATTTA 


GGGGTTCTCA 


180 


CCATTGATTT 


TCTATTTTTT 


GTTCTCTTAG 


CCTTCATTAG 


TTCTCAGCGT 


TTTTCATCTC 


240 


TTATGCCAAT 


CATCACTGCT 


TGCCATTCTA 


CTTTTTATTA 


TATGACAGCT 


GACTACCTAA 


300 


GAGAAAACTA 


TCCAGACTTT 


TACGACAAAC 


ACATCTCTTT 


ATGGGAGTGT 


CTCTAAAGAA 


360 


AAGGAGGTTT 


TAGCATGAAA 


AAAATCATCT 


TCATCAAAAC 


CATTCAACTC 


CTTGTCATTG 


420 


ATGGAATCAT 


GCTGGCATTT 


TTGACATTTA 


AAAGGGGCCT 


TACTTGGGAC 


TGGATTTTGA 


480 


TTTATAGCGG 


TTGGCTCATT 


TTCTTTCATC 


CTGTGCTATT 


GACCTATCTT 


TCAAACCAAC 


540 


TTTGTGACCA 


CTTTAGTTAA 


CTCTATTCCC 


AG ATT AG A CC 


GAGATTCTGG 


CGTTTTGCTT 


600 


TACAAATTCT 


CCTATGGGAT 


AGCCTGATGA 


TTCTCTCCTT 


CGTGTCrTTA 


AGTGATATTC 


660 


CACTTTTCCT 


TCAGGGAACT 


CTCCTCATCC 


TAGGACATCT 


CATCCCTTCC 


TATCGCATCT 


720 


GCCAAAGCCT 


GAAAAGAGAC 


TTCCCCCAAG 


CATATCAAGA 


ACCGATTTCT 


TTTTGGAGTA 


780 


TTTTATGATA 


GATGAGAAAG 


ACCAAGCCGA 


CTGGGCTTGC 


TCTTTCTTAT 


CTCTTTTTAG 


840 


TATCTAGGAT 


AATGGTAACA 


GGTCCATTAT 


TAACCAGCTC 


AACCTGCATA 


TCTGCTCCAA 


900 
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AGATGCCTGT CTGAACCGCC ACTTCTTGCG CTAATTTTTG ATTGAAAGCA TCATAGAAGT 960 

CTGATGCCAT ATCAGGTTTA GCTCCCCCTC TAAACgCTGG ACCATTCCCT CTCTTAGTAT 1020 

CCGCAAAGAG GGTAAACTGA GAAATAGAGA GCATTTCTCC TTCAATATCT TTGACAGACA 1080 

GGTTCATCTT GCCTTCTGCG" TCTCAAAAAA TCCGCATATT GACCACTTTT CTCACAGCAT 1140 

AGTCCAAATC TTCCTCTTGG TCCTCTGCTC CAACACCAAC CAGCAATAAA AGTCCCTGAT 1200 

-TGATTTTTCC-CTGAATCTGG-CCTTCTATAC- TCACTTGGCC TTTTTTAACC CGTTGGAT AA " ~"12~60 

TGATTTTCAT AATAGCCTTT CTAGTAAGAG CTAGGACAAC TAGCCGTTGG TCCGTTTGAC 1320 

AGAGTAAACT TCTGGCACAC TCTTAATTTT ATCGACAACC GTGGTCAGTG TAGAGACGTT 1380 

GGCAATACCG AAGgACACAT GGATATTAGC AAACTTCATA TCCTTGGTTG GTTGGGCATT 1440 

GACCGTTCAA ATATTCTTGG TTGTATTTGA AAGAACTTGC AGTACATCGT TCAACAGTCC 1500 

TGTACGGTTC AGACCGTAGA TATCGATATG GGCCATATAC TCCTTATTTG AGCTAGGGTA 1560 

CTGGTCTTCC CATTCCACAT CAAGGAGACG TTCCTCGTAG TTTTCTTGGG CACGCAGGTT 1620 

CATACAGTCC ACACGGTGAA TAGCCACACC ACGACCCTTG GTAATGTAGC CAACAATATC 1680 

GTCACCAGGC ACGCGGTTAC AACACTTAGC AATCCGCACT AGGAGACCAG AAGCACCTTC 1740 

AATAACCACT CCCCCCTCAT GCTTGACCTT GAGGGTTTCT TTATTTTCAA CCTTGACCTC 1800 

GCCACCTTTG ACAAGCTCCT CTGCCTCAGC TTTCGCCTTG GCACGCTCTT CCTCACGGCG 1860 

TTCCTTTTCA GTCAGACGGT TAAAGACCGT AATCGCACCG ATTTCCCCAA AACCAATGCC 1920 - 

CGCAAAGAGG GAGTCTTCTG TCTTGTAACT GGTCTTTTGC AGAACTTGAT CCATGTGGCG 1980 * 

CTTGTCCATA AATTTATTTG CCACATAGCC ATTTTCTTGG AACTGAGCCA TCAGCATCTC 2040 ' 

ACGACCCTTG TTGACAGACA ATTCCTTATC TTGGTTTTTA AAGAACTGCC GAATCTTATT 2100 

GCGCGCCTTG CTAGTCTTCA CCATATTGAG CCAGTCACGG CTAGGTCCAA AGGAGTTCGG 2160 

GTTGCCCATA ATTTCAACCT GATCCCCTGT CTTTAACTTC CTTGTCAGTG GAACCATGCG 2220 

GCCATTGACC TTGGCACCAG TTGCTTTTTC ACCGACCTTG GTATGGATTT CGTAGGCAAA 22 80 

ATCAATCCGT CCTGAATCTT TGGGAAGGGA ACGGACAGCT CCATCTGGGG TAAAAACGTA 2340 

AATCTCCTCA GCCAAATAGT TTTCCTTAAC ACACTCCACA AATTCCTTAG CATCATCAGC 2400 

CTGGTCTTGG AGCTCCATCA TCTCCTTGAT CCAGTTCATT CCAATAGCTG ATTCCTTGCT 24 60 

CTTAACTTCC CCCTTTATAC CTTTCTTATA AGCCCAGTGA GCCGCAACCC CGTACTCAGC 2520 

CACCTCGTGC ATTTCCTTGG TTCGAATCTG GAATTCAATC GGCCCTTTTG GTCCATAAAC 2580 

AGTCGTATGG ATAGACTGAT AACCATTGGC CTTGCGGTTG GCGATATAGT CTTTGAAGCC 264 0 
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ACCTCGCATC CCTTTCCAAA ATTCATGCAC GTAACCAAGC ATGGCATAAA CATCACTTTG 2700 

CGTATCTAAA ATACAACGAA TAGCAATCAG ATCATAGATT TCCTCAAACC GTTTTCTCTT 2760 

GTCCTGCATT TTGCGGAAAA TTGAGTAAAT ATGCTTGGGA CGACCATAAA TCTTCCCTTT 2320 

CAAGTGACGT TCTGTCGTAT ACTCCTCTAA TTTTGTGACT ACCTCATCCA CCAAGGCCTC 2880 

ACGCTCCCTG CGCTTTTCCT TCATCATATG CGTAATCTTG TAAAACTCCG TTGGATTGAG 2940 

ATAACGGAAA GACAAGTCTT CTAATTCCCA TTTGACACTG GAAATCCCCA AACGATGGGC 3000 

AAGCGGGGCA TAGATTTCCA TGGTTTCTTT GGAAATACGC TCCTGCTTGT CTTTTCGAAG 3060 

ATGTTTCAGG GTCCGCATAT TGTGCAAGCG GTCAGACAGT TTGACCAAAA TAACGCGGAT 3120 

GTCCTCAGAC ATGGCCATGA GCATCTTGCC ATGATTTTCC GCTAATTGCT CCTCGATCGA 3180 

TTTGTACTCG ACCTTGCCAA GCTTGGTAAC TCCGTCAACA ATCATCCGCA CATCAGGACC 3 2 40 

AAACTCTCTT TCCAAATCGT CCAAAGTCGC ATCTGTATCT TCCACCACAT CATGCAAGAA 3 300 

TCCACAAGCT ACTG7TACAG CATCCAGCTT TAGCTTAGCT AAAATACCTG CCACTTGGAT 3 3 60 

AGGGTGAATG ATATAAGGCT CGCCTGATTT GCGATATTGA CCACTGTGGC ATTCAACAGC 3420 

ATAGACCAAG GCCTTATGGA CAAAATGAAC ATCCTCTTCC GTTAAATATT CTTTGGTTAA 3480 

AGCGACAACT TCTTCGCCTG TTAAATTCAC TTCTTTCGCC ATCTCTACTC TCCAATTCTT 3540 

CCTACCATTT TATCACTTTT TTAAGAATAT GAAAACTAGA TTCGAACAGA ATAACAAAAA 3600 

AATAATT CAA AATTGCTTGA TAATTCTGAA TTATTGGTCC GTAATATACT ACGAAGTTAG 3 6 60 

ATTTTAAACT TAGGTCATAG AAGGAGAGAT AGAAGAACGG AAACCATATT GTAACCCAAA 3720 

GACTTTCTGA CTTCCCCAAT TCCATTGAAG ATACGAAAGA TAAACGGTGG AACTCGTATC 3780 

ACATACACTG GTACCTTGAC TGGATTTTGG AATTAATACT AAATGAAAAT CAAAGAGCAA 3840 

ACTAGGAAAC TAGCCCCAGC TTACTCAAAG CACCGCTTTG AGGTTGCAGA TAAAGTTGAC 3900 

GCGGTTTGAA GAGATTTTTG AAGAGTATAA AAATCCTCAA GATACTTTCT TCTATCCTTT 3 960 

ACTTTATAAG GAGAATACCT ATGAAAAAAA CTGCTATTTC TATCTTTGCT CTCCTAATCT 4020 

TAGGAGTTTG CTGCCTGTTC CTATTCAGCC AGCAAAGCTA TAAAAAACAG TCGTTCAATA 4 080 

CTATGCTAAC GACCAGAACC TGCCCAGTAG GATAACTTAT AGTGAATATA GCGACAAATG 4140 

AGAAGCCAAC TACGGTAGCA CTCTAAACAT CACGTCTATC AAACAAGCTA ATGACGGAGT 4200 

TTATGCAACC TATGAAGGCC AATTGACACC TTTCCAATAT TGATAAATTG ATAACCAGCC 4 2 60 

TGTCTTCATC TAGTCATGCT GGTTTTTAAG TTCATTTTAA ATCCTTACCT ATTCTCCCTA 4 320 

ACTGTGCTAT ACTTAATTTA TACTCAATGA AAATCAAACA GCAAACTAGA AAGCTAGCCG 4330 

CAGGCTGTTC AAAGCACTGC TTTGAGGTTG CAGATAAAGT TGACGCGGTT TGAAGAGATT 44 4 0 
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TTCGAAGAGT ATTAGTACAT TCTTTGAGAT TGGAGCTAGT ATGAAAATCC ATAAAACCGT 4 500 

CAATCCTGTT GCCTATGAAA ATACCTATTA TCTAGAAGGC GAAAAGCACC TCATCGTCGT 4 560 

CGATCCTGGT AGTCATTGGG AAGCCATTCG TCAGACAATC GAGAAGATCA ACAAACCGAT 46 20 

CTGTGCTATT CTCTTGACCC ACGCCCATTA TGACCATATC ATGAGTCTGG ACTTGGTTCC 4 680 

CGAGACGTTT GGCAATCCTC CTGTCTATAT CGCAGAGAGC GAAGCCAGCT GGCTCTACAC 4 7 40 

TCCTGTCGAT AATCTCTCCG GTCTCCCTCG CCACGATGAT ATGGCACATC TGGTCACAAA 4 800 

ACCTGCAGAA CACACCTTTG TCTTTCACGA AGAATACCAA CTAGAGGAAT TTCGTTTTAA 4 860 

GGTTCTACCG ACCCCAGGGC ACTCTATCGG TGGTGTTTCC CTAGTCTTTC CTGATGCTCA 4 920 

TCTAGTCTTG ACGGGAGATG CTCTATTCCG CGAAACTATC GGACGGACCG ACCTTCCGAC 4 980 

TCCTAGCATG GAGCAACTCC TTCATAGTAT CCAGACCCAA CTCTTCACCC TACCAAACTA 5040 

CGATGTCTAT CCAGGACATG GTCCAGCTAC TACTATCGCT CACGAAAAGG CCTTCAATCC 5100 

CTTTTTCTAG CAAGATGATG ACAATCGAAA TTTAAGTAAA CTATCCAGCA AATCTTTCTA 5160 

TTACAAAAGG CATCCTATCA ACGTTTTCAC ACATGATTGG ATGCCTT T TT TCTGATGACT 5220 

AGATTTTTTG CATTACCAAA TAATCACGCG CTCCTCTGGT GAACGCCACA TTCCGTCTCC 5280 

TTCTTTGACA TCATAGGTTG TAAAGAAATC GTCGAAGTTT GGTACTTGCA CATTGACACG 5340 

GAGTTTGGCT GGTGCGTCCA CATCGACGCT ACCCAAAAGT TTCATAAATT CTGGTCGACC 54 00 

TTTCATGCGC CAGATGCGAC CGAAGTTGTA GAAGAACTCT TCTGCTGAGA AGTCTGCTTC 54 60 

TCTCTTAGCT GCTTCAAGCG CTGCTGCGAT TCCTCCCAAG TCAGCCACGT TTTCTGATAC 5 520 

AGTCAATTTA CCGTTAATGG TTGCTCCATA AGAATCCTGT CCATCAAATT GGTCAATGAC 5580 

TTTTTGTGTT TTCTCCTTCA AGGCAGCATA GTCGCTCTCT GTCCACCAAT CCTTGAGGCT 5640 

ACCATTTTCG TCAAAGGAAG CCCCGTTAGT ATCAAAGGCG TGGGAAATTT CATGGGCAAT 5700 

CACTGCCCCA ATACCACCGT AGTTAGCAGA AGATGACTGA TGCAAGTCAT AGAAAGGCCC 5760 

CTGTAAAATG GCCGCTGGAA AGACAATCAG GTTCTTCTGA GGATTGTAGT ACCCATTGAC 5820 

CATATGAGCA GGCATGCCCC ATTCCTTATA ATCTACAGGC TGGTTCCACT TACTCCAACT 5880 

GTCCTTGATT TCCACACGCG CAAAGGCTAG AGCATTCTCA AAAAGACTGG CAGTTTCATT 5940 

CACTACCTTA TCCTTGTAAC GTGCAGGCAA TTCTTCTGGA TAGCCAATAT AAGGTTTGAT 6000 

CACATTGAGC TTCACGATAG CCTCTTTACA GGTTTCTGGA GTGAGCCAGT CATTCTTAAG 6060 

CAGACGCTCC TTATAAACAT CAATCATGGT TGCCACTTTT TTCTCCACAT CCGCCTTGGC 6120 

TTCTGGAGAG AACTTCTCAC GGGCGTACCA AAGACCCAGG GCTTGCTTGA AAGGTTCTTG 6180 
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TGCTAGATGA TAAGCTGCTT TGACCTTATC TTTTGCCTCT GGAACTCCAG AAAGGGCACG 6240 

GCTGTAGGCA CCAGACAAAA CACGGATATC CTCTGTTAAA TAGCTGGTTC AAAGATTGAC 6300 

AACACTCAAA ATCAAGGTTG CTTTAAGGAG AGACCAGGCT TCCTCACTGT AGAATTGCTC 63 60 

TGCTCCTTGC CAGAAACGTT CCTCGTCTAC AATAACCTTG TCTGGTAATT GCCCAATAAC 6420 

TGCTTTGAAG AAGTCATCCA AAGGTAGGGC ACGCGCGAAT TTCTTGAAAT CTTCGTAAGA 6480 

ATATGGATGA TAGAGTTTAG CATATTCTGA ACTTTCTTCA TTAGACAGCA CCACTGCCGC 6540 

AACTCGGCGG TCCAATTCAA GTCTTTTTTC TAGCAAGTCT TCAATTTCTT CATCAGAGAA 6600 

ATCATAAGCC TTGAGGAGAT TTGCGCTCCT TTCTTTCCAA AGAGTCAACA GCTCTTCGCG 6660 

CTGAGGATGT TCTTCTGCAT AGTAGGTCGT ATCTGGCAAG ATTGTGCTTG GAGCGCTAGC 67 20 

CCATAGAACA TTGATTCTAG CATCCATAAA CTCTGGCCAT ACACCAAAAG GAAGGAAGTT 6780 

TGGTTTTCCT GCAAGCTCAA ACTCTGCTAG TTTAGCTGTA AAATCCGCAA AAGTCTCCAA 6840 

TTCTTGGAAT TCTTTAAGGA GTGGTAAGAC AGGTGTGATA CCGTCAGCTT CTCTCTTGTC 6900 

AAAATCACGA ACTAGGCGGT GGTATTTGAC AAAGTTTTCC AAGATAGCAT CCTCAGGCAC 6960 

TTCTTCACCT GCTAACCACT TGTCTGTTGT CGCCAGCATC AGGTCTTCAA TTTCCTGGTC 7020 

TAAATCAACA AAACCTCCTG TTTGAGACTT ATCTCCTGGG ATTTCAGCTG TCTGTTGCCA 7080 

TTCTCCATTG ATAGCATCAT AAAAATCATC TTGATAACGT GTCATCTTGT TCTCGCTTTC 7140 

ATTTGTATTT GCATTTATCT TAACAAAAAT CG 7172 
(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
CGGGAAGTTA TGCGATCTAG ACTTCGTTCC TGTACAGCTA CTTTCTCACG TGGTCTTGTT 
GTTTGTATGA GTTTGTTTAG AGAGGATCTT TCTATGTCTT TCTTTCTTAT TTTTGTTTTA 
TATGCTTTTC TGATTTCTTA TCTAATTTAT GGTTATTTCA GACTAAAAAG GAAATACCGA 
GTAGATGAAT ACCAAGGTTC TAGGTCTTCA GATTGATTTT TAGCACTCTT GATAAAAGAG 
TGCTAATTTT TTGAGTTTTT G7CTTGACAT TCTCTTCTAA GGGTGTATAA TAGAATCATC 
AGTTAGCACT TGGATGCATT GAGTGCTAAT TGATCAGACA GAGAGGAGTG ATGAGATGGT 3 60 

TACAGAGCGT CAGCAGGATA TTTTAAATCT GATTATTGAC ATCTTTACCA AAACGCACCA 420 



60 
120 
130 
240 
300 
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ACCTCTCGGA TCAAAAGCCT TCCAACAGTC TATTAACTCT AGCAGTGCAA CCATTCGTAA 4 80 

TGACATGGCG GAACTAGAAA AACAAGGGTT GCTTGAGAAG GCTCATACTT CAAGTGGTCG 540 

GATGCCAAGT GTTGCTGGTT TTCAGTACTA TGTGAAACAC TCACTGGATT TTGACCGGC? 600 

GGCTGAAAAT GAGGTATATG AGATTGTCAA AGCCTTTGAT CAGGAATTCT TCAAATTGGA 660 

GGATATTCTG CAAGAGGCTG CTAACTTACT AACAGACCTG AGTGGCTCTA CGGTAGTGGC 720 

ACTGGATGTT GAGCCGACCA GGCAACGTTT GACAGCCTTT GATATCGTTG TTTTGGGGCA 780 

ACATACAGCC TTGGCGGTAT TTACCCTAGA CGAGTCGCGA ACGCTTACTA GTCAGTTTCT 340 

GATTCCAAGG AACTTCTTGC AGGAGGATTT GCTGAAACTG AAGAGCATCA TTCAGGAACG 900 

TTTCCTCCGT CACACCGTTT TAGATATTCA CTACAAGATT CGGACGGACA TTCCGCAGAT 960 

TATCCAGCGT TACTTTACAA CAACGGATAA TGTCATCGAT CTCTTTGAAC ACATCTTTAA 1020 

GGAAATGTTC AACGAAAACA TTGTGATGGC GGGCAAGGTC CATCTCTTGA ATTTTGCCAA 1080 

TCTAGCAGCC TATCAGTTCT TTGACCAACC GCAAAAGGTG GCCTTGGACA TTCCTCAGCG U40 

GTTGCGTGAG GATCAGATGC AAAATGTTCG TGTTGCAGAC GGTCAAGACT CCTGTTTAGC 1200 

TGACCTAGCG GTAATCAGTA GTAAGTTCCT CATTCCTTAT CCGGGAGTTG GAATTCTAGC 12 60 

CATTATCGGT CCAGTTAATC TGGATTACCA ACAGCTAATC AATCAAGTCA ATGTGGTCAA 1320 

CCCTGTTTTG ACCATGAAGT TGACAGATTT TTACCGCTAC CTCAGCAGTA ATCATTACGA 1380 

AGTACATTAA GATTGAAATC ATTAAAGGAG GCGAACATGG CCCAAGATAT AAAAAATGAA 1440 

GAAGTAGAAG AAGTTCAAGA AGAGGAAGTT GTGAAAACAG CTGAAGAAAC AACTCCTGAA 1500 

AACTCTGAGT TGGACTTGGC AAATGAACGT GCAGATCAGT TCGAAAACAA ATATCTTCGC 1560 

GCTCATGCAG AAATGCAAAA TATCCAACGC CGTGCCAATG AAGAACGTCA AAACTTGCAA 1620 

CGTTATCGTA GCCAGGACTT GGCAAAAGCA ATCTTACCAT CTCTTCACAA CCTTGAGCGT 1680 

GCACTTGCAG TTGAAGGTTT CACAGATCA7 GTGAAGAAGG GCTTGGGGAT GGTGCAAGAA 1740 

AGCTTGATTC ACGCTTTGAA AGAAGAAGGA ATTGAAGAAA TCGCAGCAGA TGGCGAATTT 1800 

GACCATAACT ACCATATGGC CATCCAAACT CTCCCAGCAG ACGATGAACA CCCAGTAGAT I860 

ACCATCGCTC AAGTCTTTCA AAAAGGCTAC AAACTCCATG ACCGCATCCT ACGCCCAGCA 1920 

ATGGTAGTGG TGTATAACTA AGATATAAAG CCCGTAAAAA GCTCGCAGTA AAAATAGGAG 1980 

ATTGACGAAG TGTTCGATGA ACACAAGAAA ATCTATCTTT TTTACTCAGA GCTTAGGGCG 204 0 

TGTTCGATTC GGCAATTCTG ACGGTAGCTA AAGCAACTCG TC AG AAAACG GCAATCGCTA 2100 

TGGCGTTTGC CTAGCTTCCT TACTAACTCG TCGTCGAAAT AAAATCGATT TCGACTCCTC 2160 
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GTGTCGCAAT TTACATAATA GAAAACTTGT CCGAAACGAC AATAAACTAT GAAGAAAGAT 2220 

AAAATATGTT TGGCTTTGTA ATAGTGAGCG AAGCGAACCA AACACGATAC TCTTCGCCGT 2280 

GGCGCTATTT GCGCAAATTT TGAGACCTTA GGCTCAAAGT TTAGTCAAAG AGATTGACGA 2340 

AGTCAAGCTC TGACGGCGTC GCCACTGTCG CCACTTAAGA AGAGTATCAA AAAGAAAAAT 2400 

AGAAAATTAA CTAACAAGGA GAAAAACACA TGTCTAAAAT TATCGCTATT GACTTAGGTA 2460 

CAACAAACTC AGCAGTTGCA GTTCTTGAAG GAACTGAAAG CAAAATCATC GCAAACCCAG 2520 

AAGGAAACCG CACAACTCCA TCTGTAGTCT CATTCAAAAA CGCAGAAATC ATCCTTCGTG 2 580 

ATGCTGCAAA ACGTCAAGCA GTTACAAACC CAGATACACT TATCTCTATC AAATCTAAGA 2 640 

TGGGAACTTC TGAAAAAGTT TCTGCAAATG GAAAAGAATA CACTCCACAA GAAATCTCAG 27 00 

CTATGATCCT TCAATACTTG AAAGGCTACG CTGAAGACTA CCTTGGTGAG AAAGTAACCA 2760 

AAGCTGTTAT CACAGTTCCG GCTTACTTCA ACGACGCTCA ACGTCAAGCA ACAAAAGACG 2820 

CTGGTAAAAT TGCTGGTCTT GAAGTAGAAC GTATTGTTAA CGAACCAACT GCAGCAGCTC 2 880 

TTGCTTATGG TTTGGACAAG ACTGACAAAG AAGAAAAAAT CTTGGTATTT GACCTTGGTG 2940 

GTGGTACATT CGACGTCTCT ATCCTTGAAT TGGGTGACGG TCTCTTCGAC CTATTGTCAA 3000 

CTGCAGGGGA CAACAAACTT GGTGGTGACG ACTTTGACCA AAAAATCATT GACCACTTGG 3060 

TAGCAGAATT CAAGAAAGAA AACGGTATCG ACTTGTCTAC TGACAAGATG GCAATGCAAC 3120 

CTTTGAAAGA TGCGGCTGAA AAAGCGAAGA AAGACCTTTC TGCTCTAACT TCAACACAAA 3180 

TCAGCTTGCC ATTTATCACT GCAGCTGAGG CTGGACC7CT TCACTTGGAA ATGACTTTGA 3240 

CTCGTGCGAA ATTTGACGAT TTGACTCGTG ACCTTGTTGA ACGTACAAAA CTTCCACTTC 33 00 

GTCAAGCCCT TTCAGATGCA GGTTTGAGCT TGTCAGAAAT CGACGAAGTT ATC C TT G T T G 3 3 60 

GTGGTTCAAC TCGTATCCCT GCC G TTGTTG AAGCTGTTAA AGCTGAAACT GGTAAAGAAC 34 20 

CAAACAAATC AGTAAACCCT GATGAAGTAG TTGCTATGGG TGCGGCTATC CAAGGTGGTG 34 80 

TCATTACTCC TGATGTCAAG GACGTTGTCC TTCTTGATGT AACGCCATTG TCACTTGGTA 3540 

TCGAAACAAT GCGTGGAGTA TTTACAAAAC TTATCGATCC CAACACTACA ATCCCAACAT 3 600 

CTAAATCACA AGTCTTCTCA ACAGCAGCAG ACAACCAACC AGCCGTTGAT ATCCACGTTC 3 660 

TTCAAGGTGA ACGCCCAATG GCAGCAGATA ACAAGACTCT TGCACCCTTC CAATTGACTG 3720 

ATATCCCAGC TGCACCTCGT GGAATTCCTC AAATCGAAGT AACATTTCAC ATCGACAAGA 3780 

ACGGTATCGT GTCTGTTAAG GCCAAAGACC TTGGAACTCA AAAAGAACAA ACTATTGTCA 3840 

TCCAATCGAA CTCAGGTTTG ACTGACGAAG AAATCGACCG CATGATGAAA GATGCAGAAG 3900 

CAAACCCTGA AGCCGATAAG AAACGTAAAG AAGAACTAGA CC7TCGTAAT GAAGTAGACC 3960 
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AAGCAATCTT TGCGACTGAA AAGACAATCA AGGAAACTCA ACGTAAAGGC TTCGACGCAG 4020 

AACGTGACGC TCCCCAACCT GCCCTTGATG ACCTTAAGAA AGCTCAAGAA CACAACAACT 4080 

TGGACGACAT GAAAACAAAA CTTGAAGCAT TGAACGAAAA AGCTCAAGGA CTTGCTGTTA 4140 

AACTCTACGA ACAAGCCGCA GCAGCGCAAC AAGCTCAAGA AGGAGCAGAA GGCGCACAAG 4 200 

CAACAGGGAA CGCAGGCGAT GACGTCGTAG ACGGACACTT TACGCAAAAG TAAGATGAGT 4260 

GTATTGGATG AAGACTATCT AAAAAATACA CGAAAAGTTT ATAATGATTT TTGTAATCAA 4 3 20 

GCTGATAACT ATAGAACATC AAAAGATTTT ATTGATAATA TTCCAATAGA ATATTTAGCT 4 3 80 

AGATATAGAG AATTATATTA CCTGAACATG ATAGTTGTAT CAAAAATGAT GAAGCGGTAA 444C 

GGAATTTTGT TACCTCAGTA TTGTTGTCTG CATTTGTATC GGCGATGGTA CCGTATCTGA 4500 

CGAACGTTCA GCTTATAT 4 518 



(2) INFORMATION FOR SEQ ID NO: 122: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 8145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

TGCTATTTTC GATTCCCTTG GGCGTTTTGA TTCCCTTTGC CTTGCAAGTC CATTGGAAGC 60 

CCCTCCATTA TCTGATTAAC ATTTACATCT GGGTTATGCG AGGAACCCCC TTACTCTTGC 120 

AACTGATTTT TATCTATTAT GTGCTCCCAA GTATTGGGAT TCGTTTAGAC CGCCTTCCTG 180 

CAGCTATTAT TGCCTTTGTT CTCAACTATG CAGCTTACTT TGCAGAAATT TTCCCTCGGG 240 

GAATTGACAC TATTCCAAGA GGACAGTATG ACGCCGCCAA GGTCTTGAAG TTTAGCCCTT 300 

TTGACAGAGT GCGCTATATT ATCTTGCCCC AAGTCACCAA GATCGTTCTT CCTAGTGTCT 3 60 

TTAATGAAGT TATGAGTTTG GTCAAGGATA CTTCTTTGGT CTATGCTCTC GGAATTTCAG 4 20 

ACCTTATCTT GGCTAGTCGA ACAGCTGCTA ACCGCGATGC TAGTCTAGTT CCTATGTTCT 480 

TGGCAGGAGC CATTTATTTG ATTTTGATTG GGATTGTGAC AATTATTTCC AAAAAAGTTG 540 

AGAAGAAGTA TAGTTATTAT AGATAGGAGG CTGCCATGTT AGAATTACGA AATATCAATA 600 

AAGTCTTTGG AGACAAACAA ATCCTGTCTA ATTTCAGTCT AAGTATTCCT GAAAAGCAAA 660 

TCCTGGCTAT CGTTGGACCT TCTGGTGGAG GTAAGACAAC TCTTTTACGT ATGCTTGCAG 720 

GTCTTGAAAC CATTGATTCA GGGCAAATCT TTTATAATGG ACAACCTTTA GAGCTGGATG 780 



4 
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AATTCCAGAA GCGCAATCTA CTGGGATTTG TCTTCCAAGA TTTTCAACTA TTTCCTCATC 340 

TATCAGTTCT GGAAAATTTG ACTTTATCGC CTGTGAAGAC CATGGGAATG AAGCAGGAAG 900 

AGGCTGAGAA GAAGGCGAGT CGACTCTTGG AACAGTTAGG ACTAGGAGGA CACGCAGAGG 960 

CCTATCCTTT CTCACTATCT GGTGGGCAAA AGCAGCGGGT GGCTTTGGCG CGTGCTATGA 1020 

TGATTGACCC AGAAATCATT GGCTACGATG AACCAACTTC TGCCCTGGAT CCAGAATTAC 1080 

GTTTGGAAGT GGAGAAGCTA ATCTTGCAAA ATAGGGAACT TGGGATGACC CAGATTGTGG 1140 

TTACCCATGA TTTGCAGTTT GCTGAAAATA TCGCAGATGT ATTATTGAAA GTACAACCTA 1200 

AATAGGAGGA AAAATGGATG AAAAAATGGA TGCTTGTATT AGTCAGTCTG A7GACTGCTT 1260 

TGTTCTTAGT AGCTTGTGGG AAAAATTCTA GCGAAACTAG TGGAGATAAT TGGTCAAAGT 13 20 

ACCAGTCTAA CAAGTCTATT ACTATTGGAT TTGATAGTAC TTTTGTTCCA ATGGGATTTG 1380 

CTCAGAAAGA TGGTTCTTAT GCAGGATTTG ATATTGATTT AGCTACAGCT GTTTTTGAAA 1440 

AATACGGAAT CACGGTAAAT TGGCAACCGA TTGATTGGGA TTTGAAAGAA GCTGAATTGA 1500 

CAAAAGGAAC GATTGATCTG ATTTGGAATC GCTATTCCGC TACAGACGAA CGCCGTGAAA 1560 

AGCTGCCTTT CAGTAACTCA TATATGAAGA ATGAGCACCT ATTGGTTACG AAGAAATCAT 1620 

CTGGTATCAC GACTGCAAAG GATATGACTG GAAAGACATT AGGAGCTCAA GCTGGTTCAT 1680 

CTGGTTATGC CGACTTTGAA GCAAATCCAG AAATTTTGAA GAATATTGTC GCTAATAAGG 1740 

AAGCGAATCA ATACCAAACC TTTAATGAAG CCTTGATTGA TTTGAAAAAC GATCGAATTG 1800 

ATGGTCTATT GATTGACCGT GTCTATGCAA ACTATTATT7 AGAAGCAGAA GGTGTTTTAA 1860 

ACGATTATAA TGTCTTTACA GTTGGACTAG AAACAGAAGC TTTTGCGGTT GGAGCCCGTA 1920 

AGGAAGATAC AAACTTGGTT AAGAAGATAA ATGAAGCTTT TTCTAGTCTT TACAAGGACG 1980 

GCAAGTTCCA AGAAATCAGC CAAAAATGGT TTGGAGAAGA TGTAGCAACC AAAGAAGTAA 2040 

AAGAAGGACA GTAAGATAAA ATAGTGGCTG AAACTGCGTT TTGATTAGCA AAACGTAGTT 2100 

TTTTTTGTAA TCTAGGAAAA CGATAATAGC GATTGAATAT CCATAATTGA ATATGGAATA 2160 

GCCCACTGTG ATTTCTAAAA CATTGTTAAA AATTGATTTG ACTTCCAAAA TTAAAATGTT 2220 

CTGTAATGAA ATACTGATGT AACTGTTTTA GGAACAATAA AACGCATAAT ATCAAGGTTT 22 80 

TTGCACCTTA CATTATGCGT TTTTGTGATT TTAAGACTTG TTAGCTGATT TTTTACAATC 2340 

CTGCGAAATC TTTGATTTCT TGTGCTGACA TTGAAGAGTC GCAACGGACG TTGATTTGTC 24 00 

CATCTGTAAT ATGAACAAAA CCTGGTACAG TTGGGATTCC ATAGCGTGAG CGGAATGCTT 24 60 

GCAAATCATT CAGTTCCCTT GGTTCTTCAC TATTCATCAA GTAAATCTGA CCTTTGCTTT 2520 

CAGCTACGAC ACCTGACAAT GTACCTGCAA ATTTACGGCA GTAAGGGCAA GTTTTCCGAC 2580 



WO 98/18931 



PCT/US97/19588 



855 

CGATAAAGAA GCTTGCAGTT TCTTTTTTAT CAAGAGCTTC TTGCGCACGC ACAACTGTAG 2 640 

TGACTTCAAG GTCTTTGATG TTATCTAAAA ATTGTTCCAT GAGATTACCT CCCTTTCATT 2700 

GATAAGTCTA GTATGCCATA AAGTTTCTAA AATTGCTTAG ATTTGATACG AAAAAAGATG 2760 

AGGTTGGTTG GTCTCATCTT TTATAGGTCT TTATTTTACA AATGCATTGA TTTCTGCTTC 2820 

GATGTTAGCA ATCTTAGCTT GTGATTCTTC GTTCCTTTCC CCTACAACTG CAATGTAGAA 2 880 

CTTGATTTTT GGTTCTGTAC CTGAAGCGCC AACGGCAATC CATCAACCCT CACCAAGTCT 2 940 

GTATTTCAAC ACATCACTTG GAGGAGTTGT CAAGTTTGTA ACAGTACCGT CAGCAACAGT 3000 

AGCAGTTTGT GCCTTGAAGT CTTCTACGAC AGTGATAGCT GTTGCGTTCC ATTCTGTTGG 3060 

AGCATTGTTG CGGAATTTAG CCATAATCGC TTTGATTTGT TCAGCACCAT CGACACCTGA 312 0 

AAGAGTAACA GAGATTGTTT TTTCTGCGTA GTAGCCATAT TCTTTATAGA TTTCTTCGAT 3180 

ACCGTCAGCA AGTGTCAAAC CACGAGAACG GTAGTAGGCA CCAAGTTCAC CAACTACAAG 3 240 

AACGGCTTGG ATGGCATCTT TATCACGTAC AAATGGTTTA ATCAAGTAAC CGAAGCTTTC 3 300 

TTCAAATCCC ATCATGTAAG TGTGGTTGTG TTTTTCTTCG AATTCTTGGA TTTTTTCAGC 33 60 

CATAAATTTC AAACCTGTCA AGACGTTGAA CATAGTTGCG CCGTAGCTTT CAGCAATCTT 3 420 

CGTTACCAAC TCAGTTGAAA CGATAGATTT GCAGAGAGCG GCATTTTCAG GAAGAGTTCC 34 80 

AGCGTTTTTG TGAGCTTCCA AGATGTATTT AGCCATGATA GCACCGATTT GGTTACCTGA 3 540 

AAGGTTGAGG TAGCTACCAT CTTTTTGAAG AACTTCAACA CCAACACGCT CAGCGTCTGG 3 600 

GTCAGTTGCG ACAAGAACAT CTGCACCAAC TTGACGACCA AGTTCTTCAG C AAGGG C AAA 3 660 

GGCTGCTTGG CTTTCTGGGT TTGCAGATCT TACAGTTGAA AAGTCTGGGT CAGCAGTTGC 3 7 20 

TTGCGCTTCA ACAACTTGAA CAGAGTCAAA TCCTGCTTGG GCAAGAGCAC GACGAGCCAA 3 780 

CATTTCACCA GTACCATGAA GTGGTGTGTA GACAATCTTC ATGTCTTTAC CAAATTCTTC 3840 

AATCAACCCT GGGTTGATGT TTATGTCCTT AACCTCTTTA ACGTATTCTA TGTCAACAGC 3 900 

TTCGCCGATA ACTTCAATCA AGCCAGAAGC TTTTTCAGTT TCCACATCAG CAACTTCAAC 3 960 

TGCAAATGGG TTTTCGATTG CACGGATATA AGTAGTCAAA GCGTCCGCAT CGTGTGGAGG 4020 

CATTTGTCCA CCGTCTTCAC CGTAAACCTT GTAACCGTTA AATGGAGCAG GGTTGTGGCT 4080 

GGCTGTGACC ATGATACCTG CGAAACAGTT GAGATGACGA ACTCCAAATG ATAGTTCTGG 4140 

AGTCGGACGA AGGCTTTCAA ATACGTAAGA TTTGATGCCG TGTTTAGCAA GAACTGCCGC 4 200 

AGATTCAAAG GCAAACTCAC GTGAGAAGTG ACGGCTATCG TAGGCAATTG CTACACCGCG 4 260 

TTCTTTCTCG TTTCCACCTT TTGACTCAAT CAAACGAGCC AATCCTTCAG TAGCTTGCCC 4 3 20 
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AACAACGTAG 


ATGTTCATAC 


GGTTTGTACC 


AGCACCAACC 


AAGCCACGCA 


TACCTGCAGT 


4380 


ACCAAATTCA 


AGATTTGTAT 


AGAAGGCATC 


TTCCTTAGTT 


TTTTCGTCCA 


TATTTTCCAA 


4440 


ATCTTGACGA 


AGGTAGTCAC 


GAAGCTCCAC 


AAAATCAACC 


CATTTCTGGT 


AATTTTCTTG 


4500 


GTAAGACATT 


CAAATTCTCC 


TTTATTTTTA 


AAACATTTAA 


TCAGTTTAAT 


TATATCATTT 


4560 


TTTTTAGTTT 


TAGTAAAACC 


TTATCTGCTT 


CGAACATCTC 


TTCAAACCAG 


GTCAGATTGA 


4620 


ATTTTGGGGT 


TATATGATCT 


TGAGGCTAGG 


AAAAATTCAA 


TTTCAGTAAA 


AAAAGTAAGT 


4680 


CTTCTCATAA 


CAAAACATTG 


ATATAGTTAC 


TTAGTTTTAA 


ACAAGCATAT 


TATAATAAAG 


4740 


CTATGGCATA 


TAGTACTGAT 


TTTAAACAGC 


GAGCATTAGA 


TTACATCAAA 


GAGGGGCACA 


4800 


GCCATGTCCA 


GGCAGCCAAG 


TTTTTTGGTG 


TTGGCGTCAG 


AACTCT CTTC 


ACCTGGGAAA 


4860 


AGAAAGACGT 


GAACAAGAAC 


ACATAGAGAG 


GAAAAAGCGA 


GTCGTCAAAA 


ACCGAAAGAT 


4920 


TC CTTT AG AG 


GAATTCAAAG 


CCTTTGTAGA 


GGCTCATCCA 


GATGCTTTTT 


TACGGGAAAT 


4980 


TGCGGCACAT 


TTTGATTGTG 


CTGTTCCTTC 


AGTATGGGCA 


GCTTTAAAGC 


AGATTAAGGT 


5040 


CACTTTAAAA AAAGATGACG 


AGCTTTAAGG 


AACAAGACCC 


AGAAAAGTAG 


CCTTATTTCT 


5100 


TAAGAATTTT 


AATAGTTTAA 


AGCACCTAGC 


ACCTGTTTAT 


ATTGATGAAA 


CAGGAATCGA 


5160 


CCGCTATCTC 


TATCGTCCTT 


ATGCAGGGGC 


TCCTAGAGGG 


GACAAAGTCT 


ATGAAAAGAT 


5220 


TAGCGGACGT 


CGTTTTGAGC 


GAACTTCAAT 


TGTTGCAGGA 


CAAGTAGACG 


GAGAGTTTAT 


5280 


AGCTCCCATG 


ATTTACAAGA 


AAAGCATGAC 


AAGCCATTTC 


TTTGTGGAGT 


GGTTCAAAAC 


5340 


GCAACTCCTA 


CCTGCTTTGA 


AGACACCTCA 


TGTTATTGTC 


ATGGGCAATG 


CTGGTTTTCA 


5400 


TCCCAAGAAC 


ATTTTGGATG 


AACTCTGCAT 


CCAAGATAAA 


CACTTTTTCT 


TACCTCTACC 


5460 


ACCTTATTCA 


CCGGATTTGA 


ATCCTATTGA 


GCAAGCTTGG 


GCTATCTTGA 


AAAAGAAAGT 


5520 


GACGGATGTA 


7TAACGGAAG 


TTCCAACTAT 


TTTTGAATGT 


TTGGAATGCT 


TTTTTAAAAC 


5580 


TAGATGACTA 


TAACGGTTCT 


AAAGGAACCT 


ATCGAGTAGT 


CATTAAAACT 


AAGGATACTC 


5640 


CTGGTTAAGA 


GAAGACGGTA 


TACAATCAAA 


CCATTCACCG 


TGTAGCCGAA 


ATCGTTCACA 


5700 


ATGAACACTT 


GTATCAGAAT 


GAAGACTTGT 


ATAAGAAAGG 


TTTGAATGTT 


GAACTTGCGC 


5760 


ACCAACAAAT 


TAAGGGATTT 


TTTGAAGCAG 


AGTTTAAAAA 


TCCTATTAAT 


GGAGTTCTTA 


5820 


ATACTAAAAT 


AAAAAATAGT 


ACATTAAATC 


GTGTAAATAA 


AAAAACTATA 


CACCAGAGCA 


5880 


ACAAAAACTC 


CATCATCAAT 


TTGAAGCAGA 


AGCAACGGAA 


GATGCTAAAA 


AACAAGGCGA 


5940 


TATTGTGTTG 


AATGTTGACC 


AGGATTTCAT 


GAGCATATCT 


AAGTCTAATA 


AAAGTGGTTC 


6000 


AGACTGGAAG 


AAAACTTTCA 


CAGTGAGGAT 


AACCAATAGC 


CTACCAAATG 


ACTTGAATAA 


6060 


TGTCTTGAAA 


CAGGTTGATA 


AAGATACTCC 


TAATACCCCA 


ACTTGGCTAA 


ACTCAGCTCC 


6120 
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TTCTAAAGCT 


AAAGATGATG 


ACAGAGTATA 


TAAACTACTG 


AAGACTCTTA 


TACCAGGAGA 


6180 


AAATTACCTA 


TCATGTTAAG 


CA7AATCAGC 


TAGAAGTAGA 


AACAGATAAA 


TACACATATA 


6240 


CTGCCGCTAG 


AAATGGTAGT 


AAGGAACTTG 


GTATTCAAGA 


GTCAGATATA 


GCAGCAACTC 


6300 


TAAGTGCCGA 


TGAATATAAT 


TCTAATCGCC 


AAACTTTTGA 


GAGAGAATAC 


AAATACAAAA 


6360 


GCAAATGCCC 


TTAATAATGG 


TTGGGCTAGA 


TCTCGTTCTG 


AAGAGTTCAA 


AAAGTTCTCC 


6420 


CACTTTGTAG 


GGGTAGACAA 


AGGGATTCTG 


CCAACGAATG 


TACTGACTGC 


TAAAAAACTA 


6480 


TCTGATAAGA 


TTAGGAAAGA 


AGTGGGCTCT 


GGAGATAGCA 


AACTAGGAAA 


AGGCGGCTAT 


6540 


TTCTCTACTG 


GGGATGTTCT 


ATTAGGAAAA 


CATGTTGTTT 


CTTATACCGT 


ACAAGTATTT 


6600 


TCAGAGAATA 


ATGAAAGAGT 


AGGAGTAAAC 


ACTCAAACTC 


ACCGTGTTCA 


GTATAATCTC 


6660 


CCAATTCTAG 


CTGACTTTTC 


AGTCATCCAA 


GATACTGTGG 


AACCATCACG 


AACCGTTGTT 


6720 


GAAAAAATCA 


TTCCAAAACT 


AAATATTCCC 


GAAGAAGAGA 


AAGGGAAAAT 


AACCGAAGAA 


6780 


ATCAAGAAAA 


AGAAAAAAAC 


CTCAGAATTG 


GCAGAACTAA 


TCTCACAAAA 


TGTGAAAGTT 


6340 


CGCTATGTTG 


ATGAACAAGG 


GCGTTTGCTA 


TCATTGAAAA 


ATGATACTGG 


AATTGGAGAA 


6900 


AAAGAAAGTG 


ACGGAACCTA 


CATTACCAAT 


AAAAAACAAC 


TGATTGCTAC 


CAGCTATAAT 


6960 


GTCACAGATA 


AAAAACTCAG 


TAGCATGACT 


ACTACTGACG 


GAAAATATTA 


TACTTTTAAA 


7020 


GAAGCAGATA 


CAAATTCTGC 


AAGTTTAACT 


GGGAATATTG 


TAAGCGAAGG 


TAGAACAGTG 


7080 


ACCTTAGTTT 


ATAGAGAAAG 


CGAAGCGCCA 


ACCACTGCTA 


CAGTAACAGC 


CAATTACTAT 


7140 


AAAGAAGGTA 


GGCAAGAGAA 


GTTGGTAGAG 


TCTGTTATAA 


AAGCTGATTT 


AGCGATAGGT 


7200 


TCTGAGTATA 


CCACAGAATC 


AAAAACTATT 


GAAGGGAAAA 


CAACAACTGA 


GGACAAAGAA 


7260 


GACCGAGTTA 


TCACAAGGAA 


AACAACATAC 


ACCTTGGTAG 


CAACTCCTGA 


AAATGCGTAC 


7320 


CAGAAGACGC 


TGCAACAGTT 


GACTATTACT 


ACCGTGAGAA 


TGTTCAGGAA 


ACAGTGGTTC 


7380 


CCAAAACAGC 


AACCTCTACT 


GAGACGAAGA 


CTATAACGCG 


TATCATTCAT 


TACGTTGATA 


7440 


AAGTTACGAA 


CCAAAATGTA 


AAAGAAGATG 


TTGTTCAACC 


TGTAACCTTA 


ACCCGTACAA 


7500 


AAACTGACAA 


CAAGGTCACG 


GGAGTTGTAA 


CCTACGGTGA 


ATGGACAACA 


GGAAACTGGC 


7560 


ACGAGGTTAT 


ATCTGGTAAG 


ATTGACAAGT 


ACAAAGATCC 


AGATATTCCA 


ACAGTTGAA1' 


7620 


CACAAGAAGT 


TACGTCAGAC 


TCTACTGATA 


AAGAAATAAC 


GGTAAGGTAT 


GACCGTTTAT 


7680 


CAACACCAGA 


AAAACCAATC 


CCACAACCAA 


ATCCAGAGCA 


TCCAAGTGTT 


CCGACACCAA 


7740 


ACCCAGAACT 


ACCAAATCAA 


GAGACTCCAA 


CACCAGATAA 


ACCAACTCCA 


GAACCAGGTA 


7800 


CTCCAAAAAC 


TGAAACTCCA 


GTGAATCCAG 


ACCCAGAAGT 


TCCCACTTAT 


CAGACAGGTA 


7860 
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AGAGAGAGGA ATTGCCAAAC ACAGGTACAG AAGCTAATGC TACCTTGGCT AGTGCTGGTA 
TCATGACCTT GTTAGCTGGT CTAGGATTAG GATTTTTCAA GAAAAAAGAA GATGAAAAAT 
AATAGATTTT AGAATCTAGG AACCAGGAAA AGCTCACAGA TGTGGGCTTT TTTCCTGGTT 
TTGAGAACGA GGTCTTTCGT AAAGAATAAA AACGCTTACA AGTCTGTTGA ACTGGGAAAC 
TATGAATCCT ATTTTTTTAA AAATATTTCC AGAAATCAGT TGCGG 
(2) INFORMATION FOR SEQ ID NO: 123: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8697 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 123: 



CGGTACCGGG 


AACGATACTT 


AGTCTAATTT 


TGCACCTTTT 


CCATGTATGG 


TAAAGGTTTT 


60 


TCTTTTTTTA 


AAAAGGAAAA 


CGAGAAGAGG 


AGGTTCTTAT 


GAAAGCAAGC 


ATTGCCTTGC 


120 


AAGTTTTACC 


CCTAGTACAG 


GCCATTGATC 


GGATAGCTGT 


TATTGATCAG 


GTCATTGCTT 


180 


ATCTGCAwAC 


TCAAGAACTG 


ACGATGGTAG 


TGACACCATT 


TGAAACGGTC 


TTGGAAGGGG 


240 


AGTTTGATGA 


GCTTATGCGC 


ATTCTAAAAG 


AAGCCCTCGA 


AGTGGCAGGG 


CAGGAGGCAG 


300 


ACAATGTCTT 


TGCCAATGTC 


AAAATAAATG 


TAGGAGAGAT 


TTTAAGTATT 


GATGAGAAAC 


360 


TTGAGAAGTA TACTGAGACG 


ACACATTAGT 


CTATTGCGCT 


TTCTCGCAGT 


ATTGTCAATC 


420 


TCGCACTTAC 


CAGGTTTTCT 


TAAACTTCTC 


CCCAAGTTTA 


TCCTGCCGAC 


ACCTCTTGAA 


480 


ATTCTCCAGC 


CCTTTGTTCG TGACAGAGAA TTTCTCTGGC ACCATAGCTG 


CCCGACCTTG 


540 


AGAGTGGCTT 


TACTGGGGCT 


GATTTTGGGA 


GTTTTGATTG 


CCTGTCTTAT 


GGCTGTGCTC 


600 


ATGGATAGTT 


TGACTTGGCT 


CAATGACCTG 


ATTTACCCTA 


TGATGGTGGT 


CATTCAGACC 


660 


ATTCCGACCA 


TTGCCATAGC 


TCCTATCCTG 


GTCTTGTGGC 


TAGGTTATGG 


GATTTTGCCC 


720 


AACATTCTCT 


TGATTATCTT 


AACGACAACC 


TTTCCCATCA 


TCGTTAGTAT 


TTTGGACGGT 


780 


TTTAGGCATT 


GCGACAAGGA 


TATGCTGACC 


TTGTTTAGTC 


TGATGCGGGC 


CAAGCCTTGG 


840 


CAAATCCTGT 


GGCATTTTAA 


AATCCCAGTT 


AGCCTGCCTT 


ACTTTTATGC 


AGGTCTGAGG 


900 


GTCAGTGTCT 


CCTACGCCTT 


TATCACAACT 


GTGGTATCTG 


AGTGGTTGGG 


AGGTTTTCAA 


960 


GGTCTTGGTG 


TTTATATGAT 


TCAGTCTAAA 


AAACTGTTTC 


ACTA TG AT AC 


CATGTTTGCC 


1020 


ATT ATT ATT C 


TGGTGTCGAT 


TATCAGTCTT 


TTGGGTATGA 


ACCTGCTCGA 


TATCAGTGAA 


1080 


AAATATGTGA 


TTAAATGGAA 


ACCTTCGTAG 


AATTAGAATG 


TTTCTGAAAA 


AGAAAAGAGG 


1140 
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AAATCAAAAT GAAGAAAACA TGGAAAGTCT TTTTAACCCT TGTAACAGCT CTTGTAGCTG 1200 
TTGTCCTTCT GGCCTGTGGT CAAGGAACTG CTTCTAAAGA CAACAAAGAG GCAGAACTTA 12 60 

AGAAGGTTGA CTTTATCCTA GACTGGACAC CAAATACCAA CCACACAGGG CTTTATGTTG 13 20 

CCAAGGAAAA AGGTTATTTC AAAGAAGCTG GACTGGATGT TGATTTGAAA TTGCCACCAG 1380 

AAGAAAGTTC TTCTGACTTG GTTATCAACG GAAAGGCACC ATTTGCAGTG TATTTCCAAG 1440 

ACTACATGGC~~f AACAAATTG "GAAAAAGGAG CAGGAATCAC TCCCGTTCCa" GCTArTGTTG 1500~ 

AACACAATAC ATCAGGAATC ATCTCTCGTA AATCTGATAA TCTAAGCACT CCAAAAGAC7 1560 

TGGTTGGTAA GAAATATGGG ACATGGAATG ACCCAACTGA ACTTGCTATG TTGAAAACCT 1620 

TGGTAGAATC TCAAGGTGGA GACTTTGACA AGGTTGAAAA AGTACCAAAT AACGACTCAA 1680 

ACTCAATCAC ACCGATTGCC AATGGCGTCT TTGATACTGC TTGGATTTAC TACGGTTGCC 1740 

ATGGTATCCT TGCTAAATCT CAAGGTGTAG ATGCTAACTT CATGTACTTG AAAGACTATG 1800 

TCAAGGAGTT TGACTAC7AT TCACCAGTTA TCATCGCAAA CAACGACTAT CTGAAAGATA I860 

ACAAAGAAGA AGCTCGCAAA GTCATCCAAG CCATCAAAAA ACCCTACCAA TATGCCATGG 1920 

AACATCCAGA AGAAGCTGCA GATATTCTCA TCAAGAATGC ACCTGAACTC AAGGAAAAAC 1980 

GTGACTTTGT CATCGAATCT CAAAAATACT TGTCAAAAGA ATACCCAAGC GACAAGGAAA 2040 

AATGGGGTCA ATTTGACGCA GCTCGCTGGA ATGCTTTCTA CAAATGGGAT AAAGAAAATC 2100 

GTATCCTTAA AGAAGACTTG ACAGACAAAG GCTTCACCAA CGAATTTGTG AAATAATGAC 2160 

AGAAATTAGA CTAGAGCACG TCAGTTATCC CTATGGTCAG GACACGATTT TAGAGGATAT 2220 

CAACCTACAG GTGACTTCAG GCGAAGTGGT TTCCATCCTA GGCCCAAGTG GTGTTGGAAA 22 80 

GACCACCCTC TTTAATCTAA TCGCTGGGAT TTTAGAAGTT CAGTCAGGGA GAATTGTCCT 2 340 

TGATGGTGAA GAAAATCCCA AGGGGCGCGT GAGTTATATG TTGCAAAAGG ATCTGCTCTT 2400 

GGAGCACAAG ACGGTGCTTG GAAATATCAT TCTGCCCCTC TTGATTCAAA AGGTGGATAA 2460 

GCCAGAACCT ATTTCCCGAG CGGATAAAAT TCTTGCGACC TTCCAGCTGA CAGCTGTAAG 2 520 

AGACAAGTAT CCTCATGAAC TTAGCGGTCG GATGCGCCAG CGTCTAGCCT TACTCCGGAC 2 580 

CTACCTTTTT GGGCACAAGC TCTTTCTCTT AGATGAGGCC TTTAGCGCCT TGGATGAGAT 2640 

GACAAAGATG CAACTCCACC CTTGGTATCT TGAGATTCAC AAGCAGTTGC AGCTAACAAC 2700 

CCTGATCATC ACGCATAGTA TTGAGGAGGC CCTCAATCTC AGCGACCGTA TCTATATCTT 27 60 

GAAAAATCGC CCTGGGCAGA TTGTTTCAGA AATTAAACTA GATTGGTCTG AAGATGAGGA 2 820 
CAAGGAAGTC CAAAAGATTG CCTACAAACG TCAAATTTTG GCGGAATTAG GCTTAGATAA . 2 880 
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GTAGAAAAAT 


AGGGAGTTGG 


TGAAGATTAT 


CCTTTACCAG 


CGCCCTTTTT 


CTTTTAAAAA 


2940 


TGAGAAAATT 


TCGGTATAAT 


AGTCAAACAA 


CGTCAAGCTT 


TAAAGAGAGA 


GGTGGGTTTG 


3000 


TTATGAGATT 


TAAAAATACA 


TCGGATCATA 


TTGAGGCCTA 


CATCAACGCG 


ATTTTAGATC 


3060 


AATCTGGTAT 


CGTGGAGTTG 


CAACGGAGTC 


AGTTGGCAGA 


TACCTTTCAG 


GTTGTTCCTA 


3120 


GTCAGATTAA 


CTACGTGATC 


AAGACACGCT 


TTACGGAAAG 


TSCAGCCTAC 


TTGGTTGAAA 


3180 


GTAAGCGTGG 


TGGCGGAGGC 


TACATTCGTA 


TAGGACGGAT 


TGAGTTTTCT 


AGTCATCATG 


3240 


AAATGCTCCG 


CGACCTGCTT 


TACTCGATTG 


GTGAGCGAGT 


CAGTCAAGAA 


ATTTATGAGG 


3300 


ATATTCTCCA 


GCTTTTGGTT 


GAGCAGGAAT 


TGATGACCAA 


GCAGGAGATG 


AATTTGCTAG 


3360 


AATCAGTAGC 


TTTGGATCGC 


GTTTTAGGAG 


AAGAAGCTCC 


AGTTGTTCGA 


GCAAACATGC 


3420 


TACGTCAGAT 


CATACAAGAG 


GTAGATAGAA 


AAGGGAAGTA 


AGATGAACTA 


TTCAAAAGCA 


3480 


TTGAATGAAT 


GTATCGAAAG 


TGCCTACATG 


GTTGCTGGAC 


ATTTTGGACC 


TCGTTATCTA 


3540 


GAGTCGTGGC 


ACTTGTTGAT 


TCCCATGTCT 


AATCACACTT 


ATAGTCTAGC 


AGGGGCAACT 


3600 


TTAAATGATT 


ATCCGTATGA 


GATGGACCGT 


TTAGAAGAGG 


TGGCTTTGGA 


ACTGACTGAA 


3660 


ACGGACTATA 


GCCAGGATGA 


AACCTTTACG 


GAATTGCCGT 


TCTCCCCTCG 


TTTGCAGGTT 


3720 


CTTTTTGATG 


AAGCAGAGTA 


TGTAGCGTCA 


GTGGTCCATC 


CTAAGGTACT 


AGGGACAGAG 


3780 


CACGTCCTCT 


ATGCGATTTT 


GCATGATAGC 


AATGCCTTGG 


CGACTCGTAT 


CTTGGAGAGG 


3840 


GCTGGTTTTT 


CTTATGAAGA 


CAAGAAAGAT 


CAGCTCAAGA 


TTGCTGCTCT 


TCGTCCAAAT 


3900 


TTAGAAGAAC 


GGGCAGGCTG 


GACTCGTGAA 


GATCTCAAGG 


CTTTACGCCA 


ACGCCATCCT 


3960 


ACAGTAGCTG 


ACAAGCAAAA 


TTCTATGGCC 


AATATGATGG 


GCATGCCGCA 


GACTCCTAGT 


4020 


GGTGGTCTCG 


AGGATTATAC 


GCATGATTTG 


ACAGAGCAAG 


CGCGTTCTGG 


CAAGTTAGAA 


4080 


CCAGTCATCG 


GTCGGGACAA 


GCAAATCTCA 


CGTATGATTC 


AAATCTTGAG 


CCGGAAGACT 


4140 


AACAACAACC 


CTCTCTTGGT 


TGGGGATGCT 


GGTGTCGGGA 


AAACAGCTCT 


GGCGCTTGGT 


4200 


CTTCCCCAGC 


GTATTGCTAG 


TGGTGACCTG 


CCTGCGGAAA 


TGGCTAAGAT 


GCGCGTGTTA 


4260 


GAACTTGATT 


TGATGAATCT 


CGTTGCAGGG 


ACACGCTTCC 


GTGGTGACTT 


TGAAGAACGC 


4320 


ATGAATAATA 


TCATCAAGCA 


TATTGAAGAA 


GATGGCCAAG 


TCATCCTCTT 


TATCGATGAA 


4380 


CTCCACACCA 


TCATGGGTTC 


TGGTAGCGGG 


ATTGATTCGA 


CTCTGGATGC 


GGCCAATATC 


4440 


TTGAAACCAG 


CCTTGGCGCG 


TGGAACTTTG 


AGAACGGTTG 


GTGCCACTAC 


TCAGGAAGAA 


4500 


TATCAAAAAC 


ATATCGAAAA 


AGATGCGGCA 


CTTTCTCGTC 


GTTTCGCTAA 


AGTGACGATT 


4560 


CAAGAACCAA 


GTGTGGCAGA 


TAGTATGACT 


ATTTTACAAG 


GTTTGAAGGC 


GACTTATGAG 


4620 


AAACATCACC 


GTGTACAAAT 


CACAGATGAA 


GCCGT7GAAA 


CAGCGGTTAA 


GATGGCTCAT 


4680 
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CGTTATTTAA CCAGTCGTCA CTTGCCAGAC TCTGCTATCG ATCTCTTGGA TGAGGCGGCA 4 740 

GCAACAGTGC AAAATAAGGC AAAGCATGTA AAAGCAGACG ATTCAGATTT GAGTCCAGCT 4 800 

GACAAGGCCC TGATGGATGG CAAGTGGAAA CAGGCACCCC AGCTAATCGC AAAAGAAGAG 4 860 

GAAGTACCTG TCTACAAAGA CTTGGTGACA GACTCTGATA TTTTGACCAC CTTGAGTCGC 4 920 

TTGTCAGGAA TCCCAGTTCA AAAACTGACT CAAACGGATG CTAAGAAGTA TTTAAATCTT 4980 

GAAGCAGAAC TCCATAAACG GGTTATCGGT CAAGATCAAG CTGTTTCAAG CATTAGCCGT 5040 

CCCATTCGCC GCAACCAGTC AGGGATTCGC AGTCATAAGC GTCCGATTCG TTCCTTTATG 5100 

TTCCTAGGGC CTACAGGTGT CGGGAAAACT GAATTAGCCA AGGCTCTGGC AGAAGTTCTT 5160 

TTTGACGACG AATCAGCCCT TATCCGCTTT GATATGAGTG ACTATATGGA GAAATTTGCA 5220 

GCTAGTCGTC TCAACCCACC TCCTCCAGGC TATGTAGGAT ATGAAGAAGG TGGGGAGTTG 5280 

ACAGAGAAGG TTCGCAATAA ACCCTATTCC GTTCTCCTCT TTGATGAGGT AGAGAAGGCC 5340 

CACCCAGATA TCTTTAATCT TCTCTTGCAG GTTCTGGATG ACGCTCTCTT GACAGATAGC 5400 

AAGGGACGCA AGGTCGATTT TTCAAATACC ATTATCATTA TGACATCGAA TCTAGGTGCG 5460 

ACTGCCCTTC GTGATGATAA GACTGTTGGT TTTGGGGCTA AGGATATTCG TTTTCACCAG 5520 

GAAAATATGG AAAAACGCAT GTTTGAAGAA CTGAAAAAAG CTTATAGACC GG AATTCATC 5580 

AACCGTATTG ATGAGAACCT GGTCTTCCAT AGCCTATCTA CTGATCATAT GCAGGAAGTG 5640 

GTGAAGATTA TGGTCAAGCC TTTAGTGGCA AGTTTGACTG AAAAAGGCAT TGACTTGAAA 5700 

TTACAAGCTT CAGCTCTGAA ATTCTTACCA AATCAAGGAT ATGACCCAGA GATGGGACCT 57 60 

CGCCCACTTC GCAGAACCCT GCAAACAGAA GTGGAGGACA AGTTGGCAGA ACTTCTTCTC 5820 

AAGGGAGATT TAGTGGCAGG CAGCACACTT AAGATTCCTC TCAAAGCAGG CCAGTTAAAA 5880 

TTTGATATTG CATAAAAGAA TAAAAGTATC AGCATCTGAC CATAAGTCAC AGTGGAGTCA 5940 

AATTCAATGA AAATCAAAGA GCAAACTAGG CAGCTACCCG CAGGTTCCTC AAAACACTGG 6000 

TTTGAGGTTG CAGATAGAGC TGACGTGGTT TGAAGAGATT TTCGAAGAGT ATGAAACTAA 6060 

AACCTATAGC TTCTAAACGA TCCGTGGTTT TCATCATTCA ACACAAAATT CATATGTTTA 6120 

TTACCCTCCC TCGTATTTGT CTTAGAGCGT GTGTAGTAGA AAAAGAGCAG TCTTATCTGA 6180 

AATTTTTATT CTTTCAAAAG AGACCTGTTT CTTTTTTGCA TGTCAAATCC GTTCTAGCTG 6240 

GTATTTGAAA AATCAAACTA ATATTCAATG AAAATCAAAG AACAAACTAG GAAGCTAGCC 6 300 

GCAGGTTGCT CAAAACACTG TTTTGAGGTT GTAGATAGAG CTGACGTGGT TTGAAGAGAT 6360 

TTTCGAAGAG TATAAGCTGC AAGATGAATG ATTTTCTTGT ATTGACGTTG TTCTTCACAA 6420 
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AAAGTAGCGG 


ATAAATGAAA 


TCCATTCCAT 


TATCATAGAT 


GATAGGCTGG 


TAGGAAATTT 


6480 


TCAAATAGCA 


TACAGGAAAT 


AGATGTATGG 


AGTTCTGGTA 


GTAGAAAGGC 


AGAGAGATGA 


6540 


ACATTTTAGT 


TGCAGATGAC 


GAGGAAATGA 


TTAGAGAAGG 


AATTGCAGCA 


TTTCTGACAG 


6600 


AACAGGCTTA 


TCATGTCATT 


ATGGCTAACG 


ATGGACAAGA 


GCTCTTGGAA 


AAATTTCAAG 


6660 


ATCTCCCTAT 


CCATCTCATG 


GTACTGGATT 


7AATGATGCC 


TAGGAAGAGT 


GGTTTTGAAG 


6720 


TGTTAAAAGA 


AATCAATCAA 


AAGCACGATA 


TTCCTGTCAT 


CGTCTTGAGT 


GCTCTGGGAG 


6780 


ATGAAACTAC 


TCAGTCACAG 


GTATTTGATC 


TCTATGCTGA 


TGATCATGTG 


AC AAAAC CTT 


6840 


TTTCTTTGGT 


ACTGCTTGTC 


AAGCGTATTA 


AGGCGCTTAT 


CAGACGTTAC 


TACGTCATAG 


6900 


AGGATCTTTG 


GCGATATCAG 


GATGTAACAG 


TGGATTTTAC 


CTCTTACAAA 


GCACATTATA 


6960 


AAAATGAAGA 


AATTGATCTC 


AAACCAAAGG 


AATTACTGG7 


ACTAAAGTGT 


TTGATTCAGC 


7020 


ATAAAAATCA 


AGTTTTAAGT 


AGAGAGCAGA 


TATTGGAAGA 


AATTTCAAAA 


GATGTAGCTC 


7080 


ATTTACCTTG 


TGATAGGGTC 


GTTGATGTCT 


ATATTCGTAC 


TCTTCGCAAA 


AAATTAGCTT 


7140 


TAGATTGTAT 


CGTGACTCTG 


AAAAATGTTG 


GGTATAAGAT 


TAGCTTATGA 


TAAAAAATCC 


7200 


TAAATTATTA 


ACCAAGTCTT 


TTTTAAGAAG 


TTTTGCAATT 


CTAGGTGGTG 


TTGGTCTAGT 


7260 


CATTCATATA 


GCTATTTATT 


TGACCTTTCC 


TTTTTATTAT 


ATTCAACTGG 


AGGCGGAAAA 


7320 


GTTTAATGAG 


AGCGCAAGAG 


TGTTTACGGA 


GTATTTAAAG 


ACTAAGACAT 


CTGATGAAAT 


7380 


TCCAAGCTTA 


CTCCAGTCTT 


ATTCAAAGTC 


CTTGACCATA 


TCTGCTCACC 


TTAAAAGAGA 


7440 


TATTGTAGAT 


AAGCGCCTCC 


CTCTTGTGCA 


TGACTTGGAT 


ATTAAAGATG 


GAAAGCTATC 


7500 


AAATTATATC 


GTGATGTTAG 


ATATGTCTGT 


TAGTACAGCA 


GATGGTAAAC 


AGGTAACCGT 


7560 


GCAATTTGTT 


CACGGGGTGG 


ATGTCTACAA 


AGAAGCAAAG 


AATATTTTGC 


TTTTGTATCT 


7620 


CCCATATACA 


TTTTTGGTTA 


CAATTGCTTT 


TTCCTTTGTT 


TTTTCTTATT 


TTTATACTAA 


7680 


ACGCTTGCTC 


AATCCTCTTT 


TTTACATTTC 


AGAAGTGACT 


ACTAAAATGC 


AAGATTTGGA 


7740 


TGACAATATT 


CGTTTTGATG 


AAAGTAGGAA 


AGATGAACTT 


GGTGAAGTTG 


GAAAACAGAT 


7800 


TAATGGTATG 


TATGAGCACT 


TGTTGAAGGT 


TATTTATGAG 


TTCGAAAGTC 


GTAATGAGCA 


7860 


AATTGTAAAA 


TTGCAAAATC 


AAAAGGTTTC 


CTTTGTCCGC 


GGAGCATCAC 


ATGAGTTGAA 


7920 


AACCCCTTTA 


GCCAGTCTTA 


GAATTATCCT 


AGAGAATATG 


CAGCATAATA 


TTGGAGATTA 


7980 


CAAAGATCAT 


CCAAAATATA 


TTGCAAAGAG 


TATAAATAAG 


ATTGACCAGA 


TGAGCCACTT 


8040 


ATTAGAAGAA 


GTACTGGAGT 


CTTCTAAATT 


CCAAGAGTGC 


ACAGAGTGTC 


GTGAGACCTT 


8100 


GACTGTTAAG 


CCAGTTTTAG 


TAGATATTTT 


ATCACGTTAT 


CAAGAATTAG 


CTCATTCAAT 


8160 


AGGTGTTACA 


ATTGAAAATC 


AATTGACAGA 


7GCTACCACC 


GTCGTCATGA 


GTCTTAGGCC 


8220 
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ATTGGATAAG CTTTTGACAA ACCTGATTAG TAATGCAATT AAATATTCAG ATAAAAATGG 3280 

GCGTGTAATC ATATCCGAGC AAGATGGCTA TCTCTCTATC AAAAATACAT GTGCGCCTCT 8340 

AAGTGACCAA GAACTAGAAC ATTTATTTGA TATATTCTAT CATTCTCAAA TCGTGACAGA 8400 

TAAGGATGAA ACTTCCCGTT TGGGTCTTTA CATTGTGAAT AATATTTTAG AAAGCTATCA 84 60 

AATGGATTAT AGTTTTCTCC CTTATGAACA CGGTATGGAA TTTAAGATTA GCTTGTAGAC 8520 

AG AT T AGTTT TTTATTAAAG TTCATATAGG GTTAACATAA GTGTGTTATT CTTTGTGTAG 8580 

ATAAAAGAAA GGATACTAAT ATGGTATTAG CGATTATTTT AGTAACATTC TTTATTCGAT 8640 

TGATTTTTTT AAAGCGTTCG ATAGAGAATC AGAAACGAAT CCTTAGCAAT GGCGGGG 8697 
(2) INFORMATION FOR SEQ ID NO: 124: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 



AACCATACAT 


ACGGCAAGGC 


AAACCTGACC 


CGGTTTGAAG 


AGATTTTCGA 


AGAGTATTAG 


60 


TTGCCTTTAA 


AGGCATCCAC 


CATCGTTTGA 


AATTCTTCAT 


TTGAGAGAGT 


AATCCCTTTG 


120 


CCCATTTTAG 


TATGGTCTCG 


ACTCCAAGCA 


CGAATATCAA 


ACTTTGCAGG 


GGCACCATTA 


180 


AAGCTCACAC 


GGTTAATTTC 


CTTGGTCCAA 


CCTTTTTCGT 


TTTCAGAAAG 


AGTCAACAAG 


240 


TGCTCTTCGA 


TTTCAAATGT 


AAATTCTGCC 


ATTTTCTTCT 


CCTTTTTTAG 


TTTCATTAGT 


300 


TTATTCGTAA 


AATCTTGTAG 


ATTTTAGGAA 


AATTTTATAT 


AATATTGATA 


TAAAAGAAGG 


360 


GAGGCCAATA 


TGAGACATAA 


ATTCCAGCAA 


GTTCTAAATA 


AAATACATGA 


TTTTTTAAAT 


420 


GGATATGACC 


AACCTGACCA 


GACTGAAACC 


AACTCCCTTA 


CAGCCACTAT 


TGAAGAGGCT 


480 


ATCCAGAAAC 


AAACCGCTGT 


TCACCTTATC 


TTGTCTGAGA 


CAACCTTTAC 


AGGTGACATC 


540 


ATCAAATATG 


ATCAGCAAGG 


CCAGCAAATT 


ATCGTGAAAA 


ATTTTTCCAA 


AAATGTGAGC 


600 


CGGATTATCC 


GTATAAGCGA 


TATTCAACGC 


CTGCGATTTG 


TCCCCTCAAC 


TGTCCAAACA 


660 


GCCCAAAAAA 


ATAGATTTAA 


GAAAGAGTGA 


GATGTAGTTG 


CTTCATCCCA 


CTCTTTTTTC 


720 


TTAGCGAATT 


TGTTCAAAAT 


CTAAATGAAC 


TGCGATATGA 


TCTCCATAAC 


CACTTCTTTC 


780 


CAAGTCACGT 


TGTAAACGAT 


AGGAAATGTA 


GTGTTCTGCA 


ATGGTAATGT 


AACCTGCCCC 


840 


CAATAAACGA 


TGTTCAACCA 


TAGATTGAAT 


CATACTGATA 


GTCGCACGTT 


CCACCTTGGC 


900 
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TTCTTGTAAA 


TCCAAAACTA 


CCTTCTTAGT 


CACTTGAGCA 


AGATTTTCAC 


GCAAATCATC 


960 


TGTCAAAACA 


TAAACAGTTT 


GGGCTGCCTT 


CAAGATCGCT 


TGCTAAATCT 


TATCTGGATT 


1020 


AAATTCAGCA 


ATTTCGCCAT 


TACGTTTGAT 


TACTTCCATA 


GGTTTCTCCT 


TTATTCTTTG 


1080 


TTTTCTTTGA 


TTTCTGCCAG 


CATTTTTTCT 


TCTTCTACTG 


TCAGTTGATA 


ATGTTCAAGT 


1140 


AAATCCGGTC 


TGCCCTCGTA 


GGTTTTCTTT 


AAACTCTCGT 


ACAATCGCCA 


CTGACGAATC 


1200 


TTTTCATGGT 


GGCCACTCAT 


CAATACATCT 


GGCACGACCA 


TCCCTCGATA 


ATCATAGGGA 


1260 


CGTCTGTACT 


GAGGATATTC 


TAAAAGACCT 


GAAGAAAAAC 


TATCATCTTG 


GTCGCTAGAC 


1320 


TCCTTGCCAA 


TCACTTCTGG 


AATCAGGCGA 


ACTGTAGCAT 


CAATCATGGT 


CATAGCTGCC 


1380 


AATTCTCCAC 


CAGTGAGGAC 


ATAGTCACCT 


AGGGAAATCT 


CATCTCTTAC 


CAAGGTCTTA 


1440 


ATGCGCTCAT 


CATAACCCTC 


ATAGTGCCCA 


CAGATAAAGA 


TTAGCTCTTC 


CTCTTGAGCC 


1500 


AAATCTTCAG 


CATAAGCCTG 


ATCAAACTGC 


TTTCCAGCAG 


GATCAAGCAG 


AATAACGCGC 


1560 


GGATTTTTCT 


TTTCAATAGC 


ATCAAAGGAA 


TCGAAAATAG 


GTTGTGCTCT 


GAGCAACATG 


1620 


CCCTGACCGC 


CTCCGTAGGG 


CTCATCATCT 


ACATGACGGG 


CCTTTTCAGC 


ATTTTCTCGA 


1680 


AAATTATGAT 


ACTGGATATC 


CAAGAGCCCT 


TTTTCTCGAC 


CCTTTCCAAC 


GATTGAGTGC 


1740 


TCCAGTGGAG 


AAAACATCTC 


TGGAAAGAGG 


GTTAAAATAT 


CAATCTTCAT 


CGTCTAACCC 


1800 


TTCTAAGATT 


TCCACATCGA 


CCCGTTTACT 


TGGAATATCA 


ACATTGAGAA 


CCACTGGTGG 


1860 


GATATAAGGT 


AAAAGCAAAT 


CACCTTTGCC 


TTTTCGTTTG 


ACCACCCAGA 


CATCATTAGC 


1920 


ACCTGGTTGC 


AGGATTTCCT 


TGATGGTTCC 


AACCAAGCTA 


TCACCCTCAT 


AGACTTCCAA 


1980 


ACCGATAATC 


TCGTGATAGT 


AAAATTCACC 


ATCGTCTAGG 


TCATTCAAAT 


CTTCCTCAGC 


2040 


GACCTTGAGA 


CTGTATCCCT 


TGTACTTTTC 


GATAGTATTG 


ATATGGTACA 


TATCTTTCAA 


2100 


TTTAATAATG 


TCAAAGTTCT 


TCTGTTTACG 


GTGGCTAGCG 


ATGGTCACTG 


TTTGGACAAA 


2160 


CTGATCTTTT 


TCATCAAACA 


AAACCAGCTC 


AGCTCCTTTT 


TTAAACCGTT 


CTTCTGCAAA 


2220 


ATCCGTCACA 


GACAAGACTC 


GCATCTCCCC 


CTGTAATCCC 


TGCGTATTAA 


CGATTTTCCC 


2280 


AACATTAAAG 


TAGTTCATCT 


TGTCTCCTGT 


AATCTCCTTT 


TTTCCATCTT 


ATTCTAACAA 


2340 


TTCTCGAATA 


ATAGCCGCAA 


TTTTTTCCGA 


TTCTGACCAT 


TGTAAATAAT 


GGTGATTCCC 


2400 


TCCTAAAATG 


AGTTTAGTAT 


TGGAAGTCCA 


ATATTCTGAT 


TCTCTG7ACT 


CTTTTTCTCT 


2460 


ATAAGGCTGA 


CAAAAAACAA 


ATACAGGAAT 


ATGAGCTTCT 


ATAGATACAT 


CCTCAAAATC 


2520 


TTCCTCAGTA 


ATCTCTCCAG 


ATATCTGAAA 


TTCTGGATCT 


TGATTTTCCA 


ACTCTAAGCC 


2580 


TTTTTCTTGC 


ATTAATTCCC 


AGATTTTTTT 


ATTCGTTTCA 


GGACTAAATG 


TTGCTTGAGT 


2640 


TAAGTTCTTA 


AAATAAAGTT 


CAGGACCACA 


CTCGTCAATC 


ACCCTCATCT 


GCTCTTCCAT 


2700 
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TTCTCGATAA GGATTTTCTC AAAAATCAGC AAACATGACT TTTTTAGTTG TCGGTTCAAT 2 760 

TGCTACTAAA GTCTGACGCT TAATTGGTTT C7CGAGTAAT TTGCAAGCTA AAATTCCACT 2820 

CCAACTATCT GCACAAAGTA TATATTCAGA AATTCCTAAT TCT7CAAGTA CTTCATAAAC 2380 

CGCATCTGCA AGATTATCTA GATTTTTTCC AGCTTGGTCA TGAATCGGAC TCCTACCTCT 2 940 

GTTCGGAAAA TCAATTGTCA AATAACCAAT TGTAGGAGGA CCTTTTTCAA GTATAAGTGA 3000 

AAAATTTTCA TAACTTGGTA "cCAAACCTGC TCCGTTTAAA CAAACTAGCA CTTTCTTTTc" 3060 

CTTTTGATAA GTAACAGAGA GGCTACCAAT TTCTGTAGAT ACTTCAAACC TCTTCATAAA 3120 

GAAATCCACT GATTCTATAT AATGAATTAT TAAAAATCCT TATCCTTTAT TTTATCACGT 3180 

TCCAAGCATT TTCTCAAGTT GGAGGAAGGG GACAATATCT CTACTTTCCC TTCAATAATC 3240 

CTTCCAAATT ATGTTTATGT TGGTAATTAA TGGCTGCGGT TTTGTCTTTC TCAAAGACAG 3 300 

TCTTGGTAAG GTCAATATGA TTAATAGCTA CGATTGCGAC GGTGTAGTAA ATGATATCAG 3 360 

CCAGTTCTCT GGCAAGTTCC TCGTTCGAAT CCTATCCCTT CTTTTCCACC AGACCGCCTA 3 420 

TTCAAAACCT CGACTACTTC TCCGACTTCC TCCACTAACT TCATAAAGAG ACCTTCATCA 3480 

GTCCCACACT GCTGTTAATG TTCGATTAAG TAGTCTTGGA ATTGCCTAAA CGTTCAATCT 3 540 

TTTATAGTAT ATTGAAACTA GAATAGTACA CCTTTACTTC TAAAACATTG TTAGAAATCG 3 600 

ATTTGACTCT CCTGATCGAT TTGTCCTGTT CTTGTTTCAT TTTACTATAT CTTCTATTCC 3 660 

ACACAAAAAA GCGAGACATC CGTCCCGCCC TTCTTATTTT TCGTCAATAA CGATTCTTAC 3 720 

TTTTTTGTAT TCAGTTGGGA CAGAGTACAC AATCGTTCTT ATCGCAGAAA TAGTGCGACC 3780 

CTTACCACCC ATTACACGAC CCACATCGCT TTGATCAAGA TTCAAATGAT ATTCCAAAAA 3 840 

TTCTGGTGTA TCCTCAATCT TGATAGTTAA GGCATCTGGT TGTCAAATTA AGGGTTTCAC 3 900 

AATCGCAATA ATCAGATTTT CAATCGTATC CATCTGTCAA CCTACTTTAA ACTTATTTTG 3 960 

AAAATTTAGA ATCGTGGAAT TTTTTCAATA CGCCTTCTTT TGAAAGGATG TTACGTACTG 4020 

TGTCTGAAGG TTGAGCTCCA TTAGCCAACC ATGCAAGAAC GCGCTCTTCT TTCAAAGTTA 4080 

CTTCGTTTTC AGCAACAAGT GGGTTGTAAG TTCCAACTGT TTCGATGAAA CGTCCGTCAC 4140 

GTCCTGAACC TGAATCTGCT ACGTTGATAC GGTAGAAAGG TTTTTTCTTA GAACCCATAC 4 200 

GAGTCAAACG GATTTTAACT GCCATTTTTA AAGTCTCATT TCTTTAATTT TTTATTTCGG 4260 

TGAAATAGCT GAGCTATTTA GCACATGTTC TATTATAGCA GATTTCTGGC ATGTGTC 4 317 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 4881 base pairs 
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( B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 



AATTTATTTG 


ACTGGAAATT 


GTAGAGGCTT 


CTCGAAATTT 


CTTGAATGGT 


TAAAATAAGG 


60 


ACAAGAGAAA 


ACATGGATAT 


CTATATCCTT 


GTGCCAAAAA 


AACCACTGCC 


CTCCCCAGAC 


120 


CAACCTGAGG 


AAAGCAGTGA 


TTCTTATTTT 


AGGAGTTAGG 


AATGAATACA 


CGAAATCAAT 


130 


TTAGCTGATT 


ATTTTTTGTT 


TTTCAAGAAT 


TCATCGTATT 


GTTTTTGCAT 


TTCGTTCAAT 


240 


ACTTTTTCGT 


AGGCACCTTC 


AGATTTCAAT 


TTTTCCATCA 


ATTCTGGAAT 


CGCTTTATCT 


300 


GGGTCTACAG 


TACCAGTGTT 


GATAGCTGTA 


TCAAATTGTT 


GCATTGTGTT 


AGCAATAGCT 


360 


GAGATTTCAG 


ATTTCACATT 


GTCAGTATTG 


AAGATAAATC 


CAACCGCTGG 


AGATTCTTTA 


420 


GCTTCTGCCA 


ATTCTTTCTT 


AGAATTTTCG 


ATTTGTTGGT 


CTGTAACGTT 


TTCGTTGATG 


480 


TAAAGGATCC 


AGTTGTTACC 


AGTGTTCCAT 


CCACCCATGT 


GAGTGTTTCC 


TTTGTAGCCA 


540 


TCAAGAACGC 


GAACACGGTT 


TTCTTTACCT 


TCAATTTTTT 


CCCAGTTCTT 


GCCTTCTGGA 


600 


CCGTAAACAA 


GACCGTTCAA 


GAGTTCTGGC 


TTCGTATTCA 


AGAGCTTCAA 


GATTTCCATT 


660 


GATTTTTCTT 


TGTTCTTAGA 


GTTGTTTGAG 


ATGACAAAGT 


TAGCAACTTG 


TGTTGTTTGG 


720 


TTTTTCTTGA 


TGAAGTTAGT 


AATTGGTTTG 


ATTTGGATAT 


CTTTGTTGGC 


AACACGTCAA 


780 


AGCAAGCTGT 


TACCCTAGTC 


AGCTGGTCCT 


ACTGTTTCTT 


CACGAACGAA 


CCAACTATCT 


840 


TGTTGAAGGT 


CAAAGCAAGT 


ATCGCTTCTT 


GCGACGTCTT 


TTGGAATGTA 


GCCAGCTTCA 


900 


TACAATTTGT 


GAAGAGTCTT 


CAACTC~:CT 


TTGAAACGAG 


CCACTTCGTA 


ACGGTTTACA 


960 


ACTTTAGTAG 


TATCGCCTTC 


AAGGTCCATA 


ACGAATGGAA 


GACCGTTTGC 


TACTGGGTAG 


1020 


TCAAAATTAT 


CAGATGGGAT 


GAAAACTTTA 


CCAATAGCAA 


ATGGTACTAC 


GTCTGGAGCT 


1080 


TTTTCTTTGA 


TTTCTTTCAA 


GACTGGCTCA 


AGAGTTTCGT 


AAGAAGTAAC 


ACCTGAAATA 


1140 


TCGATACCAT 


ATTTAGCAAG 


GAGAGTTCCG 


TTGAAGGCAA 


ACTTTTGAGA 


TGATGCAACG 


1200 


TTGGCTGCAA 


CTGGAACAGC 


GTAAATCTTA 


CCATTTACAG 


TATTACCCTT 


GATGTAAGCT 


1260 


GGGTCAAGTG 


CTTTGTAAAG 


GTCTTTACCT 


TCTTTTTTGT 


ACAATTCTGT 


CAAGTCAGCG 


1320 


TAAGCACCTT 


TTTGACCATT 


TACAATATAG 


TTATCTGCAA 


AGGCAATATC 


ATAGTTTTCA 


1380 


CCAGATGATG 


TGATAACTGA 


CATTTTCTTA 


CCATAGTCAC 


CCCAGCCAAG 


GTATTGGATA 


1440 


TCCAATTTGG 


CACCAACTTT 


TTCTTCAATG 


ATTTTGTTGG 


CATTTGCTAA 


CAATTCATCC 


1500 


AAGTTGTCTG 


GTTTGTCACC 


GATTTGGTAC 


ATTTTGATAA 


CAGGTTTGTC 


ACCTGAATCA 


1560 
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GCACCTTTTT TGCTGTTACC TGTCAAATTT CCACAACCAC CAAGACCTGC AGCCAGAGCG 1620 

ACTACACTAG CAGATGCAAA AGCATATTTT TTCCAGTTTT TCATGATAAA AACTCCTTTT 1680 

TTTATTTTTA AACTTATAAA CAATGTAATG ATCTTATACT CAATAAAAAT CAAAGAGCAA 1740 

ACTAGAAAAC TAGCCGCAGG CTGCTCAAAG CACTGCTTTG AGGTTGTAGA TAAGACTGAC 1800 

GAAGTCAGTT ACATATATCT ACGGCAAGGC GACCTTGACG CGGTTTGAAT TTGATTTTCG 1860 

AAGAGTATTA ACTTCACACA AGGGAAGTTG GGAACTGAGA AATGTTATTT CTCAATAACC 1920 

ACTATTCTTT CACACCACCG ATAGTCAAAC CTTTTACAAA GTAGCGTTGG AAAAATGGAT 1980 

ACAAAATCGC GATTGGAAGG GTTGCAACCA CAACCATGGC CATACGACCT GTTTCTTTCG 2040 

GTAGAGCAAC TCCCAGTTGA CCAATCAAGC CGACCGCTTT GGCAATGTAG TCCATATTTT 2100 

GTTGGATT7G CATGAGCAAA TATTGCAATG GATACAAGTT GTCACTCTTG ATGTAAAGAA 2160 

GCGCGT7CAA CCAGTCATTC CAGAAACCAA GAGCTGTTAA GAGCGTGATC GTTGCGATAC 2220 

CTGGTAGTGA CAATGCCAAA CAGATTTGGA AGAAAATCCG GGCCTCACTG GCACCATCGA 2 280 

TACGAGCCGA TTCTAGAATG CCTTCTGGAA TGGTCTTCTT GAAGAAGGAA CGCATCAAGA 2340 

TCATGTTAAA TGGTGAGAGA AGCATTGGAA CAATCAAGGC CCAAACAGTG TCACCAAGCT 2400 

GAAGTACACG GGTCACCATG ATATAACCTG GTACCAAACC AGCCTTGAAC AACATACTGA 2460 

GAAGGACGAA GATGGTAAAG AATCTGCGAT ACTTAAAGGT TGTCCGTGAA ATACCGTAGG 2520 

CATAGGTTGT TGTGATAAAG ACATTTGTCA ATGTCCCAAC TACGGTTACA AAGACAGAGA 2580 

TGAAGAGGGC TTGTAGGATT TTATCCTTAA ACTGTCCCAA AAACTCAAAA CCCTCTAAGC 2640 

CAAATTGGGA TGGGAAGAAG CTATAGCCGT ATTGGAGGAG GCTTTTCTCG TCTGTCACTG 2700 

AAATAATGAT AACCAATACA AAAGGTAGGA TACAAGAGAG GGCAATCAAA CCCGAAATGA 2760 

TACTGAAGAA GATATCTGCT TTCTTACTGA AGGAGTGAAT GCCGACATTA TCAATTTTTT 2820 

CTTTTTTAAT TTTCTTTTTT GCCATATTCT CCTCCTTTCT AGAACAAAGC TGAGTTTGCA 2880 

7CGACTCGTC TTGCAAGCAA GTTTGATAGG AT AAC CAGAA TCAAACCAAC AACGGATTGG 2940 

TAAAGACCGG CTGCTGCAGC CATACCGATA TCTCCTGTCT GAGTCAAACC ATTAAAGACA 3000 

TATACGTCCA AAACGTTGGT TACATTGTAA AGCTGACCAG CATTGTGTGG GATTTGATAG 3 060 

AAGAGACCGA AGTCTGCGCG GAAGATATTT CCGACTGCAA GGATCGTCAA TACAGTTACA 3120 

AGCCGAGTCA ACTGAGGAAT GGTTACCTTG CGAATACGTT GCCACTTGCT AGCTCCGTCC 3180 

ACTGTCGCTG CTTCGTAGTA GGTTGGATCA ATTCCCATGA TCGTCGCATA GTACATGACA 3240 

CTGCTATATC CAAAGCCTTT CCAAATACCT AGGAAAACTA GGAGATAGGG CCAGATGCCC 3 300 
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AGCTCAGCGT AGAAATTGAC TTCTTTGAGA CCAAGACTTT CCAATAGATG ATTGAACACC 3360 

CCTTTATCAA TATTTAGGAA GGCATCTCTA AAGAAACTGA TGATAACCCA AGACAAGAAG 3420 

TAAGGGAACA ACATAGAAGT TTGAAAAATC TTCACCATTC TCTTAGAACG GAGCTCGCTG 34 80 

AGGATAATGG CAATCCCTAC AGATACAACT AAACCTAGAA AGATAAAGCC AAGATTGTAG 3540 

AGGACAGTAT TTCGTGTGAT AATAAAGGCG TCTCTTGAAC TAAATAAGAA TCTAAAATTA 3600 

TCGAGTCCGA CCCATTTACT ATTTATGATA CTATCTATGA AACCATTACT GGTCATGTGG 3660 

TAGTCTTTGA AGGCAACCAC GTTCCCAAAT ACTGGAATGT AAAAGAATAG AATCAACCAG 3720 

ACTGCCCCTG GCAAAACCAT CAAGAGAAAG ATCCAGTTGT CTCTCAATGT TTTTGAAAAC 3780 

TTTTTCATAA TTTCCTCCCT TTTTATTTTG ATATCCATCT AAAAATTCTT TTTTAGACTT 3840 

TTGATAACGA TTACATTATT AGTATACTCC TATTTCCAGG TTAGGTTAAA CTCCTAATTA 3900 

TAGAAAAAAC TCCACAAATT ATGTAGCAGA TTTAAAACTT TATCACCACT ATCAAACAAA 3960 

TGTCCTAAAT CAATTGTTTA TTTTATCTCT ATTAGCCCAG TGATGGCGTC ACTCTGTTAT 4020 

AAGCATCCAA CAACGGGGTA TACTGAAAAA TCTCCAGACT AGGGAACTCA GCGATAGTTC 4080 

CTAATCTGGA GATTTTTAAT ATGTTATTAG GCGTTTGCTT TCAACTTAGC AATAACCTCT 4140 

TTAAGATTAT CAATCAACTC TGCTGCAGTA TGCTCAGAGC CTTTTTCATC TGCCAAGAAC 4200 

AAAACTGCTT TTTGAAGTTC TTTTTGAGAG TTTTCAAGGA CATCCTTATC TACTGTTTCA 4260 

AGGTTTGAGT CTTTAAGAAG TTTACTTAAT TCCTTCGCTA ATTTCTTCAG TTTGATTTGC 4320 

AGACTCATCT TCTCCTGCTG TTTCTTTGCC CGCTGTTTGT CCTCCATCCT TAGTTGCTGA 43 80 

CTGGCTTTCC TTAATGGACT CTAGGGAAGC AATGGCATCT TTGACTGTTT GCAAGATATC 4440 

ACGTAAACCT TGCTCTGTCA AACTATCATC TGCAAAAGCT TTATTACCCT CTGCCAAAAC 4 500 

CAGACGTGCT GAATCTGTGG TAGGATTCGA TACACCTGTC AATGATCTCA AAAGATTTTC 4 560 

TAACGTTTGA GTCTGCTTAC TAATACTAGA CTAAAATCAA AAAGTATTAT ATAACAGTGA 4620 

TATGAAATCA ACTAAAGAAG AAATCCAAAC CATCAAAACA CTTTTAAAAG ACTCTCGTAC 4680 

AGCTAAATAT CATAAACGCC TTCAAATCCT TCTATTTTCT CTGATGGGCA AATCTTATAA 4740 

AGAGATTATA GAACTTTTAT AGTAGTTTGA AATAAGATGT GAACATCTCT ATCAGGAAAG 4800 

TCAAATTAAT TTATAGAAAT ATTTTAGCAG CCAAGGTGTA CTGTTATAGA TTCAATACAC 4860 

TATACTTGGT GGTTTAGCTC G 488 L 

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13121 base pairs 
(3) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 
{ D) TOPOLOGY : linear 



ixi ) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


126: 






AGGATCCCCG 


GAAAAGGAGA 


CTAAAAATGA 


AGAAAAAATT 


TCTAGCATTT 


TTGCTAATTT 


60 


TATTCCCAAT 


TTTCTCATTA 


GGTATTGCCA 


AAGCAGAAAC 


GATTAAGATT 


GTTTCTGATA 


120 


CCGCCTATGC 


ACCTTTTGAG 


"tttaaagatt 


CAGATCAAAC 


TTATAAAGGA 


ATTGATGTTG 


180 


ACATTATTAA 


CAAAGTCGCT 


GAGATTAAAG 


GCTGGAACAT 


TCACATCTCC 


TATCCTCGAT 


240 


TTGACGCAGC 


AGTCAATCCG 


GTTCAAGCTG 


GGCAAGCCGA 


CGCTATCATG 


GCAGGGATGA 


300 


CAAACACTAA 


AGAACGTGAA 


AAAGTCTTCA 


CCATGTCTGA 


TACTTACTAT 


GATACAAAAG 


360 


TTGTCATTGC 


TACTACAAAG 


TCACACAAAA 


TTAGCAAGTA 


CGACCAATTA 


ACTCGCAAAA 


420 


CCGTTGGTGT 


TAAAAACGGA 


ACTGCCGCTC 


AACGTTTCCT 


TGAAACAATC 


AAAGATAAAT 


460 


ACGGCTTTAC 


TATTAAAACA 


TTTGACACTG 


GTGATTTAAT 


GAACAACAGC 


TTGAGTGCTG 


540 


GTGCCATCGA 


TGCCATGATG 


GATGACAAAC 


CTGTTATCGA 


ATATGCCATT 


AACCAAGGTC 


600 


AAGACCTCCA 


TATTGAAATG 


CATGGTGAAG 


CTGTAGGAAG 


TTTTGCTTTC 


GGTGTGAAAA 


660 


AAGGAAGTAA 


ATACGAGCAC 


CTGGTTACTG 


AATTTAACCA 


AGCCTTCTCT 


GAAATGAAAA 


720 


AAGATGGTAG 


TCTTGATAAA 


ATTATCAAGA 


AATGGACTGC 


TTCATCATCT 


TCAGCAGTGC 


780 


CAACTACAAC 


TACTCTCGCA 


GGATTAAAAG 


CTATTCCTGT 


TAAGGCTAAA 


TATATCATTG 


840 


CCAGCGATTC 


TTCTTTTGCC 


CCTTTTGTTT 


TCCAAAATTC 


AAGCAACCAA 


TACACTGGTA 


900 


TTGATATGGA 


ATTCATTAAG 


GCAATCCCTA 


AAGACCAAGG 


TTTTGAAATT 


GAAATCACCA 


960 




TGATGCTGCT 


ATCAGTGCTG 


TCCAAGCTGG 


TCAAGCCGAT 


GGTATCATCG 


1020 


CTGGTATGTC 


TGTCACAGAT 


GCTCGTAAGG 


CAACTTTTGA 


CTTCTCAGAA 


TC AT ACT AC A 


1080 


CTGCTAATAC 


CATTCTTGGT 


GTCAAAGAAT 


CAAGCAATAT 


TGCTTCTTAT 


GAAGATCTAA 


1140 


AAGGAAAGAC 


AGTCGGTGTT 


AAAAACGGAA 


CTGCTTCTCA 


AACCTTCCTA 


ACAGAAAATC 


1200 


AAAGCAAATA 


CGG CT AC AAA 


ATCAAAACCT 


TTGCTGATGG 


TTCTTCAATG 


TATGACAGTT 


1260 


TAAACACTGG 


TGCCATTGAT 


GCCGTTATGG 


ATGATGAACC 


TGTTCTCAAA 


TATTCTATCA 


1320 


GCCAAGGTCA 


AAAATTGAAA 


ACTCCAATCT 


CTGGAACTCC 


AATCGGTGAA 


ACAGCCTTTG 


1380 


CCGTTAAAAA 


AGGAGCAAAT 


CCAGAACTGA 


TTGAAATGTT 


CAACAACGGA 


CTTGCAAACC 


1440 


TTAAAGCAAA 


CGGTGAA77C 


CAAAAGATTC 


TTGACAAATA 


CCTACCTAGC 


GAATCTTCAA 


1500 


CTGCTTCAAC 


AAGTACTGTT 


GACGAAACAA 


CGCTCTCGCG 


CTTGCTTCAA 


AACAACTACA 


1560 
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AACAACTCCT TAGCGGTCTT GGTATCACTC TTGCTCTAGC TCTTATCTCA TTTGCTATTG 1620 

CCATTGTCAT CGGAATTATC TTCGGTATGT TTAGCGTTAG CCCATACAAA TCTCTTCGCG 1680 

TCATCTCTGA GATTTTCGTT CACGTTATTC GTGGTATTCC ATTGATGATT CTTGCAGCCT 174 0 

TCATCTTCTG GGGAATTCCA AACTTCATCG AGTCTATCAC AGGCCAACAA AGCCCAATTA 1300 

ACGACTTTGT AGCTGGAACC ATTGCCCTCT CACTCAATGC GGCTGCTTAT ATCGCTGAAA 1360 

TCGTTCGTGG TGGTATTCAG GCCGTTCCAG TTGGCCAAAT GGAAGCCAGC CGAAGCTTGG 1920 

GTATCTCTTA TGGAAAAACC ATGCGTAAGA TTATCTTGCC ACAAGCAACT AAATTGATCT 1980 

TGCCAAACTT TGTCAACCAA TTCGTTATCG CTCTTAAAGA TACAACTATC GTATCTGCTA 2040 

TCGGTTTGGT TGAACTCTTC CAAACTGGTA AGATTATCAT TCCTCGTAAC TACCAAAGTT 2100 

TCAAGATGTA TGCAATCCTT GCTATCTTCT ATCTTGTAAT TATCACACTT TTGACTAGAC 2160 

TAGCGAAACG CTTAGAAAAG AGGATTCGTT AATGGCAAAA TTAAAAATTG ATGTAAATGA 2220 

TTTACACAAG CACTATGGAA AAAATGAAGT CCTAAAAGGA ATTACGACTA AGTTCTATGA 22 80 

AGGAGATGTT GTTTGTATCA TCGCTCCTTC AGGTTCTGGT AAGTCAACTT TCCTCCGTAG 23 40 

CCTCAATCTT TTAGAAGAAG TCACTAGCGG TCACATCACT GTGAACGGCT ATGATTTAAC 2400 

TGAAAAAACA ACCAATGTTG ACCACGTCCG TGAAAATATC GGCATGGTAT TCCAACACTT 24 60 

CAACCTCTTC CCTCATATCT CTGTATTGGA CAACATCACC TTTGCTCCTA TTGAGCACAA 2520 

GTTGATGACT AAGGAAGAAG CTGAGGAATT GGGAATGGAG TTCCTTGAAA AGGTTGGACT 2580 

AGCAGATAAA GCTAATGCCA ATCCAGATAG CCTATCAGGT GGTCAAAAAC AACGTGTGGC 2640 

CATCGCTCGT GGCCTAGCAA TGAATCCAGA CATCATGCTC TTCGATGAAC CAACTTCTCC 2700 

CCTTGACCCT GAGATGGTTG GAGACGTACT TAACGTTATC AAGGAATTGG CTGAGCAAGG 27 60 

CATGACCATG ATTATCGTAA CCCATGAGAT GGGATTTGCT CGTCAGGTTG CCAACCGCGT 2820 

TATCTTTACT GCAGATGGCG AGTTCCTTGA AGACGGAACA CCTGACCAAA TCTTTGATAA 2 880 

CCCACAACAC CCTCGTCTGA AAGAGTTCTT AGATAAGGTC TTAAACGTCT AAACTCAAAC 2 940 

TGTAAGGATT TCCTTGCAGT TTTTCTACCT CGTATTGGAA TTTTTGATTT TTCGGAAAAT 3000 

TATGTTAGAA TTAAGTTTAT GAAATGAGGT TTCCTCATAC CTAGCAAGAC TAGGAATAAA 3060 

AATAGAAATT AGGTAGCTAG ATGTCATCTA AGGTTATTGT TACAATTTTC GGTGCGAGTG 3120 

GAGACCTGGC TAAACGCAAG CTCTACCCTT CCCTTTTTAG ACTATATCAA TCCGGCAATC 3180 

TTTCCAAGCA CTTTGCCGTT ATTGGAACTG CCCGTAGACC TTGGAGTAAG GAATATTTTG 3 240 

AATCTCTACT TGTCGAGTCC ATCCTTGATT TGGCAGATAG TACCGAGCAA GCCCAAGAAT 3 3 00 

TTGCTAGCCA CTTCTACTAT CAAAGCCATG ATGTCAATGA TTCGGAACAT TATATTGCTT 3 360 
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TCCGTCAATT ACAAGCTCAG CTTAATGAAA AATACCAAGC TGAACACAAT AAGCTCTTCT 3420 

TCTTGTCTAT GGCACCTCAG TTCTTTGGAA CCATTGCCAA ACACCTCAAA TCTGAAAACA 3430 

TTGTCGATGG CAAAGGTTTT GAGCGCTTGA TCGTTGAAAA ACCATTTGGT ACAGATTACG 3540 

CAACTGCAAG CAACTTGAAT GACGAACTCC TAGCAACATT TGACGAAGAA CAAATTTTCC 3 600 

GTATCGACCA TTATCTTGGT AAGGAAATGA TCCAAAGCAT CTTTGCAGTT CGCTTTGCAA 3 660 

ACTTGATTTT TGAAAACGTT TGGAACAAGG ATTTTATCGA CAATGTTCAA ATTACCTTTG 3720 

CGCAGCCCTT GGGTGTAGAA GAACGTGGTG GCTACTATGA CCAATCCGGT CCCCTCCGTC 3780 

ACATGGTCCA AAACCACACT CTACAACTTC TTTCCCTCCT CGCCATGGAC AAACCAGCAA 3840 

GCTTCACAAA AGACGAGATT CGTGCTGAAA AGATTAAGGT CTTTAAAAAC CTCTATCATC 3900 

CAACTGATGA AGAACTCAAA GAACACTTTA TCCCTGGGCA ATACCGCTCT GGTAAGATTG 3 960 

ATGGCATGAA ATACATCTCT TATCGTAGCC AGCCAAATGT GAATCCAGAA TCAACAACTG 4020 

AAACCTTTAC ATCTGGTGCC TTCTTTGTAG ACAGCGATCG ATTCCGTGGT GTTCCTTTCT 4080 

TTTTCCGTAC AGGTAAACGA CTGACTGAAA AAGGAACTCA TGTCAACATC GTCTTTAAAC 4140 

AAATGGATTC TATCTTTGGA GAACCACTTG CTCCAAATAT TTTGACCATC TATATTCAAC 4200 

CAACAGAAGG CTTCTCTCTT AGCCTAAATG GGAAGCAAGT AGCAGAAGAA TTTAACTTGG 4260 

CTCCTAACTC ACTTGATTAC CGTACACATG CGACTGCAAC TGCTCCTTCT CCAGAACCAT 4 320 

ACGAAAAATT GATTTATGAT CTCCTAAATA ACAACTCAAC TAACTTTAGC CACTGGGATG 4 380 

AAGTTTGTGC GTCATGGAAG TTGATTGACC GTATTGAAAA GCTCTGGGCT GAAAATGCTG 4440 

CCCCACTTCA TCACTATAAA GCTGGAAGCA TGGGACCTCA AGCCAGCTTT GACCTACTTG 4 500 

AAAAATTCGG TGCCAAATGG ACTTGGCAAC CAGATATCAC CTATCGTCAA GATGGTCGCT 4560 

TAGAATAAAA AAATTTCCTG CAACTTTATG CcTTGCAGGA TTTTTGCTTC TG ATT AC ATT 4 620 

AAACCTTCCA AGACACCTTT CATAAAGTTT TCTGAGTTAA ACTCTCCAAT ATCATCGATT 4 680 

TTTTCACCAA AACCAATCAA TTTTACAGGA ATATTGAGTT CTTCACGAAT GGCTAGAACC 4740 

ACACCTCCTC GAGCAGTTCC ATCAATCTTA GTCAAAACAA TTCCCGTTAA AGGTGTGATT 4800 

TTCGAAAATT CTTTGGCCTG TACTAGGGCA TTTTGACCTG TTGATGCATC AAGTGCCAAG 4860 

AAGGTTTCAT GTGCTGCTTC TGGCACAACA CGTTTGATAA TACGACCAAT CTTTTCCAAC 4920 

TCAGCCATAA GGTTATCCTT ATTTTGCAGA CGACCAGCAG TATCAATCAT GAGAATATCG 4 980 

ATACCTTCAG TCACGGCACG TTCCATACCA TCAAAGACCA CGCTGGCTGG ATCACCTTTT 5040 

TCAGGTCCAG TTACTACTCG AACATCTACT CGTCGGCCCC ATTCAGCTAG CTGAGCTACT 5100 
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GCACCCGCAC 


GGAAGGTATC 


TGCTGCAACC 


AGCATGACCT 


TCTTACCAGC TTGTTTGTAG 


5160 


CGGTGGCCTA 


GTTTTCCGAT 


AGAAGTTGTT 


TTCCCAACAC 


CATTCACACC AACAAAGAGC 


5220 


ATAACTGTCA 


AGTTATCTTG 


GAAGTGGATG 


CTTTCATCGT 


AGCTACCATC CTTTTCATAA 


5280 


AGCTCAACCA 


ATTTCTCAAT 


GATGACACGA 


CGAAGTACAT 


CAGGTTTCTT 


GGCATTTTCA 


5340 


AGCTTGGCTT 


CGTAACGTAG 


TTCCTCCGTT 


AAGTTAGAAG 


CGXCTTGCAC 


ACCAACATCA 


5400 


CTCATAATCA 


GCAGTTCTTC 


CAGTTCCTCG 


AAAAATTCTT 


CGTCAACAGA 


GCGGAAGTTA 


5460 


GCAAAGAAGG 


CATTCAAGCG 


GGCACCGAAA 


CCTGTGCGAG 


TTTTCTTAAG 


ACTCCGGTCA 


5520 


TATTTTTCCT 


GAACAGTTTC 


TTCTGTTTGA 


GGAGCTTCTG 


GTTCAAGCAC 


TTCAGAATTA 


5580 


TTTTCTTCTA 


CAGTTCCTTC 


GTGCTCAAGC 


TTCTCTTCCT 


CTCGTAATTC 


TTCTGAGTTT 


5640 


GGTAATTCTT 


CTATTTCTTC 


TTGAGAAACC 


CCTACAGCTG 


GCTCTGAATC 


CTGACTTTCT 


5700 


TCAACTGTGT 


CTTGGATTTC 


CTCTTCTTGG 


AACACAGCTT 


GTTCAACAAT 


TTCAACCTCT 


5760 


GCTTCTTCCT 


CAGAAACTTC 


CTCAACTTCT 


GTGAAGGTAG 


GATCAACATC 


TTCAGACAAA 


5820 


TCAAGATTTT 


CCAGAGCTTC 


TTTTACAACT 


TCTTCGATTT 


TAGGTTCTTC 


TTTTTTTCCG 


5880 


AATAGACGGT 


CAAACAATCC 


CATATCTTAG 


TTCTCCTTTA 


CCACATATTC 


TTCGATAGCC 


5940 


CAGGCGACAG 


CTTCCTCATC 


GTTGGTCATC 


GGCGTCACTA 


CATTTGCGGC 


TGCCTTTACT 


6000 


TCAGGAACAG 


CGTTTTGCAT 


AGCAACACCA 


AGACCTGCCC 


ATTCAATCAT 


AGAGAGGTCA 


6060 


TTGGCCTCGT 


CACCACAAGC 


CATCACTTGA 


CTTTCGTCGA 


TTCCAAGATG 


GCTGATTAGT 


6120 


TTTGCCAAAC 


CTGTTGCTTT 


ATGAACATTC 


TTTGGTGACC 


ATTCTAGCAA 


CATTTCACGT 


6180 


GATTTAAAGA 


TTTCATATTG 


GTCAAACAAT 


TCTGGAGAAA 


TCTTCTGAAT 


GGCTGCATCC 


6240 


AAGGGTTCTT 


GAGCAAAGGC 


AGTCACGCAT 


TTGTTGTAGG 


TCATTTGACT 


AGATAAGTCT 


6300 


TCAAAGTCCA 


CTGGAACAAA 


GGTCAAAGCT 


GGATTGAATT 


TGGCATAAAC 


ACTTTCTTGG 


6360 


TCCGATTGGA 


TTTGATAAAC 


TGTTCCTTCT 


GAGATCGCAT 


CAAGAGGCAG 


TGATAATTTC 


6420 


TCTGTTTCTT 


CATACAAACG 


TGCCACATCA 


TCATATGAAA 


AGACTGTTTT 


ATCAAGGATT 


6480 


TCTCCTGTAT 


TTTTCTGAAC 


TAATCCACCA 


TTAAAAGTAA 


TGGTATACTC 


ATCTTCCTGA 


6540 


CCGTCAGTCC 


CTAACTCATG 


GAGAAAGAAA 


TCCATGGCTT 


TTAAGGGACG 


ACCAGTTGTC 


6600 


AATACGACCT 


TGATACCACG 


ATCACGCGCA 


gCTTGCAAGG 


TTTCCTTGGT 


ACGATCCGTC 


6660 


AGCCTTTTAT 


CAGTAGTCAG 


CAAGGTCCCG 


TCCAAGTCCA 


ATGCAATCAA 


TTTTATATCT 


6720 


GCCATTATAA 


GCCCTCCATA 


TAAGCTATAA 


CCGACCGTTC 


CTTATGGTGA 


CCAATCACAG 


6780 


TCTTTGCTAA 


TTCTAAAATT 


TCAGGTCGTG 


CATTTTCAGG 


ACCTACAGGA 


TGTCCCACAA 


6840 


CCTGCATCAT 


ATCTAAGTCA 


TTAAGATTGT 


CTCCAAAAGC 


CATGACCTGA 


TCCATTGTGA 


6900 
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TACCAAGTTT 


TTTAACTAAT 


TCAACAATGG 


CCACTCCCTT 


ATCGACATAG 


TCCAGAACAA 


6960 


TATCAATGGA 


TTCAAAGCCA 


GTTGTCATGG 


CCTTAACACC 


AGGAACGTTT 


TCGTTTACCC 


7020 


AAGCCTCCCC 


ATCTTCCAGC 


GTTTCTTCTG 


TGAAGTTGGT 


TGTAAATTTG 


AAAATGTCAT 


7080 


CTGTGATATC 


TTCCAAACTC 


GCTACTTTTT 


GGATATTTTC 


ATTATAGTGC 


TGACTCACTT 


7140 


TCAAATAGGT 


CTCATCAACC 


CTATCTAGAA 


CATATGAACC 


CTTCTTACCC 


GTCAAGAGCA 


7200 


-GTTTATTGAT 


ATCTACATAA^GGTGAAGTTT 


"TCAGCTTTTC "AAAAGTTGCC 


"AG AT AAAAGT 


7'2~60" 


CACCAGACAT 


AGTCGCTTCA 


TACAAGTCCT 


GACCTTGATA 


CTCTACCAAA 


CTGCCATTTT 


7320 


CCGCGATGAA 


AATAA7CTCA 


TCACGAACAC 


CAGCAAATAA 


TTTTTCTAGA 


GACAGAAATC 


7380 


CCCGACCCGA 


AGCTACCGCA 


AAGTAAATCC 


CTTTTTCCTT 


GTAGGAAACC 


AAGAGAGACT 


7440 


TGAGACGATC 


CATATCAAAG 


CGTCCATTCC 


CATCTAGGAA 


GGTTCCGTCC 


ATATCCGTTG 


7500 


CTACTAGTTT 


AATTGTCATC 


CTTCAATACT 


TTCTAAATCT 


TTTAACTTAA 


CTGAAACAAT 


7560 


CTTTGAAACA 


CCCGATTCTT 


GCATGGTCAC 


TCCATAGATG 


GAATCAGCCG 


CTGCCATGCT 


7620 


TCCCTTACGG 


TGGGTTACGA 


CGATGAACTG 


GCTGTCCTTG 


TCAAAGCGCT 


TGAGGTAATC 


7680 


CCCAAAACGT 


TTAACATTGG 


CTTCATCCAG 


CGCAGCTTCC 


ACCTCATCCA 


AGATAACAAA 


7740 


TGGAATAGTC 


TTGACACGAA 


TAATGGAGAA 


GAGCAAGGCA 


AGAGCCGATA 


GGGCTTTTTC 


7800 


ACCACCACTC 


ATGAGATTAA 


GAGACTGGAT 


TTTCTTGCCT 


GGTGGTTGGA 


CAGAAATTTC 


7860 


AACCCCAGCT 


GTCAGCAAGT 


CTCCTTCAGT 


CAAAATGAGC 


TCAGCCTGAC 


CTCCACCAAA 


7920 


CATCTGCTTG 


AAGGTCACTT 


TAAAGGACTC 


ACGAATGACC 


TCAAAGGTTG 


ATTTAAAGCG 


7980 


TTCCTTGACC 


TCATCATTCA 


TCTCTGTAAT 


GGTCTCAACG 


AGCAGGTTTT 


TCGCAGACAA 


8040 


AATATCATCA 


CGTTGGCTAT 


TTAGGAAATC 


CAGACGGTTG 


TGAACTTCTT 


CGTACTGTTC 


8100 


AATAGCGTCT 


AAATTGACAG 


GACCCAGTGA 


GCGTATACCC 


TTCTCTAAAT 


CCTTAACTTC 


8160 


TTGCTCTGCC 


AGATTGAGAT 


TTTCCAACTC 


ATCCGCCTTT 


TCTAAAGCTT 


CTGTGTAGCT 


8220 


GATCTGGTAC 


TGGTCTCTTA 


ATTGACTTTG 


TAGATGGCGC 


AAGCGCTCGC 


TAACCTTTTC 


8280 


TTTCTTGGCT 


TCAGCACGAG 


TTTGCTTGCG 


AATCCACTCT 


TCATTCTGCT 


GGCGAGCCTG 


8340 


ATCCAAATGA 


CTAGCAATAT 


CATCCAGTTG 


ACCCTCAA7A 


TCATCCAACT 


CAAAC7GCTT 


8400 


GCGAATCAAA 


CCTTGTTGGA 


GATTTGTTTT 


TTGAGTTTTG 


GATTCTTCCG 


CCTGTTGACT 


8460 


CAGCAATTCT 


GTATCAACCT 


TCTCAAGATT 


ATCAATCTTT 


TCTTGAAGAA 


GGCGCTGGAT 


8520 


TTCCTCTTGT 


TCAAAATCAA 


GATTCTCCAA 


TTCCTTGCCT 


AAGCGTTCAA 


TATCAGCAAC 


8580 


TTCATAACGT 


TTTTGC CCTT 


GCACTTCTGT 


CTTAAGCAAA 


CGAGCTTGCG 


CTAGCTCTTC 


8640 
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CTCCAACTTT 


TGATAGCGTT 


CTTGGATGGC 


ATTTTTGTTA 


CACTTAATCT 


CTTCAATCTC 


8700 


AGCTTCCAGA 


TTTTGCTTGT 


CACTGGAGAT 


TGCAGCAAGA 


CGCTCTTGGC 


AGTTTTCCTT 


8760 


ATCCGCTTGC 


CAATCTCCCT 


CGGAAAGACG 


ATCTATTTCC 


TCTTCTTGGA 


CTTTCCAAAG 


6820 


AGTTTCCAGT 


TCTTCAACTT 


GCTGACTAGT 


TTGCTGATAA 


GCGAGGAACA 


AGCCTTGCTC 


8880 


CTGAATACGT 


GCCTGCTCTC 


CTTGAGATTT 


AATAGCTTCT 


AATGACTCGG 


TCAATCTGGC 


8940 


CATCTCATCT 


TGCAAGGTCT 


TCAAAGTCGC 


CTCTTCTGAA 


CCCAAGCTTG 


CTTCTTCTTC 


9000 


AGCAATTTCT 


TTTTGTAATT 


GCTCCAGTTC 


TGGCTTGATA 


AAAATGCTGT 


TATTCTGGCG 


9060 


ATTGGCACCA 


CCTGCATAAG 


AACCACCTGT 


GCGCAACTCT 


GTCCCATCCA 


ATGTCACCAT 


9i20 


ACGAACCTGA 


TAACGAACTT 


GGCGAGCTGC 


TGCACGCCCA 


TGTTCTACCC 


TATCAAACAT 


9180 


AGCCGTCGTA 


GCTAGCAAGT 


TCTTGAAAAT 


GGCTTCCAGT 


CTAGTATCAA 


AAGTCACCAA 


9240 


CTCATCTGCC 


ATCCCAAGGA 


AACCTGGCCT 


TACAGCGATA 


GCATCTTGGT 


TCTGACTAGA 


9300 


AATCGTACGC 


GCCTTGATAG 


TGGTCAAAGG 


AAGAAAGGTT 


GCACGACCGG 


CTCTGTTCCG 


9160 


TTTAAGGAAG 


TCAATAGCCT 


TGGTTGCCGA 


CTCTTCATCT 


TCTACGATGA 


TATGCTGGCT 


9420 


ACTTGCCCCT 


AAGGCAATCT 


CTAGGGCAGT 


TTGATAATAA 


ACATCAAAGG 


TCAGATGCTC 


9480 


ACTGACTGCA 


CCAATAATCC 


CACCTAGGCG 


ATCTTTTTCT 


TGGAGAACAC 


TCTTAACACC 


9540 


TGCATAAAAG 


TTACTATGAT 


TTCTCAGGAT 


ATTTTCCAAA 


CTTTGAGCTC 


TGGCCTGCTT 


9600 


GTTTTTGAGA 


TTATCCAGAC 


GGTCAAAGAG 


TTGGCTTTGT 


TGAGCTTGAT 


AGGAAGTTTT 


9660 


CTGCTCCTCT 


TGCTCCTTGG 


CAATAGCTTG 


GTAGTCAGCC 


AATAATTTCT 


GAACCTCCTC 


9720 


CTTGGCAGTT 


TCAAGCTCTT 


CCTTTTGCTG 


ACTAGCCTTC 


TCTTTAGCTA 


TAGCTAATTG 


9780 


CTCTTTCAGC 


TTTTCTAGTT 


GATCTGCTTG 


TTTTTGAGAA 


AGCTGACGAC 


TATTTTCCAA 


9840 


CTCATTCTCA 


ATACGGGTCA 


ACTGGTTTGA 


GACATCCGCT 


TCTTCTTGTA 


AAAGAGCTAC 


9900 


AAAGCGTTCA 


CGTAAGAGCT 


CAATCATCTG 


ATCAGGATCG 


TCTGAGAAAG 


CCAGCAATTC 


9960 


AGCTTCTAAA 


CGATTGAGTT 


TTTGATTATT 


TTCCACTAGA 


TTTCCCTCTA 


ACAGAGCTAA 


L0020 


AGAGCTTTCT 


TTATCAGACT 


TTTCTTTGCT 


GAGTGAATTT 


CTCTTATCCT 


CCAAAGCAGC 


10080 


CAAACGGGCT 


TGTGCCTCCT 


GTTGATTCAA 


GGCCACTTGC 


TCGGACTCCA 


GTTTCGATAG 


10140 


CCCTAATTTT 


CTTTCTAAAT 


CACTAATCAG 


ACTAGTCAAG 


TCCATCAAAC 


TGCCTTGGTC 


10200 


TTTGGCCATT 


TCAGCCTGTA 


AATCTTGGCG 


TTGCTTTTTA 


AGAGTTTGAT 


TTTC TTCTTC 


10260 


TAATTTTTCA 


CCCTTTTGGT 


AATAACTCAT 


CAAGAGTTCT 


TGAACCTGAG 


TCAACTCTTC 


10320 


TTCTGTCGAC 


TCTAGTTCAG 


CCTTATTTTC 


CTTGATTTGA 


GCAACCAGAA 


CATCTAAATA 


10380 


AATAGCCTTA 


CGTTGTCCTT 


CCAAGTCTAA 


AAACTTACGG 


GCATTCTCAG 


CTTGCTTCTC 


10440 
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AAGAGGCTTG 


ATTTGATTAT 


CCAACTCGTA 


GATAATGTCC 


TCTAAGCGGT 


CCAGATTATC 


10500 


CTGAGTTTGC 


TGCAGTTTAC 


TCTCGGTTTC 


TTTTCTGCGA 


GTCTTGTATT 


TTAAAACTCC 


10560 


AGCAGCTTCT 


TCAAAAATAG 


CTCGTCGTTC 


CTCAGGCTTG 


GAATTAAAAA 


TCTCCTCAAC 

4 w * WW * W^V\W 


10620 


CTTCCCTTGG 


GAAATAATAG 


AGAAGGAATC 


TCGTCCCAAT 


CCAGTATCCA 




10680 


ATGAATATCA 


CGCAGACGGA 


CTTTCTTGCC 


GTCAATCTTG 


TATTCGCTAT 


CTCCACTACG 
w 4 wwnw i o'bv 


10740 


ATAGACATCG 


CGTTCCACCC 


TGATTTCTTG 


ACCTGCATCC 


TT GAT AAAT C 

4 4 Wll A. ****** 4 \^ 


CH TC XTCl ATT 


10800 


ATCCAGAGTC 


ACAACTACAG 


AAGCATAATT 


GAGCGCTTTG 


CGACTTTCGG 


TTCCAGCAAA 


10860 


GATGATATCC 


GGCATCTTGC 


CCCCACGGAG 


ACTCTTGACA 


CTAGACTCCC 


CCAAAGCCCA 


10920 


ACGCAGACTT 


TCTGTAATAT 


TGGACTTTCC 


AGATCCATTG 


GGTCCAACAA 


CTGCCGTCAC 

w 4 vi w w \j 4 w n w 


luJOU 


ACCTTGGTCA 


AAAACGACCT 


TGGTCTTATC 


AGCAAAAGAC 


TTGAACCCCT 


GAATTTCGAT 

4 4 4 W \Jt\ 4 


1 1040 


TTCCTTTAAA 


TACATGAATC 


CAGCCCCTTC 


TCAACGGCAT 


TTTTGGCAGC 

4 * * * S*\*W*»WW 


TTCCWZPTPT 

* 4 W V* 4 WW 1 W 4 




GCTAATTTCT 


TAGAACGACC 


TTGGCCTTGA 


CCGATGCTCT 


TACCTTCAAC 


AAGAACTTCT 

nnwrv\^ 4 4 w * 


11160 


ACATCAAAAA 


CCTTATCGTG 


AGCAGGCCCT 


GTTTCAGAAA 


TCACCTGATA 


AC G AAT AG CC 


11220 


ACATCACCAT 


TGACCTGAAG 


CAACTCTTGG 


AGATGGGTTT 


TATAGTCTGT 


AATCATCTCA 


112 80 


AACTCGCCTG 


CTTCAACCTT 


AGGAATCATG 


ACTTGATAGA 


T AAATT C CTT 


GACCTTGGCC 


11340 


ACATCCTTAT 


CCAAAAGAAG 


GGCACCAAGA 


AAGGCTTCAA 


AGGCATCACC 


AAGAATGGTG 


11400 


TCACGATTGC 


GACCACCTGA 


TTTTTCTTCC 


CCTTTACCCA 


ACTTGATAAA 


CTGGTCAAAC 

W 4 WW 4 WCW>W 


114 60 


TGGCAATCAC 


GCGCAAAACC 


AGCTAAACTC 


TCCTCACGGA 


CAATCATAGC 


ACGGAGTTTT 


11520 


GATAGGTCAC 


CTTCAGGCTT 


TTTAGGATAT 


TTTTTATATA 


GATATTCTGA 


AATCAATAAC 


11580 


TGTAGAACAG 


CGTCTCCTAA 


AAATTCCAAG 


CGTTCATTGT 


GTGAAATTTT 


TAAGAGGCGG 


11640 


TGCTCATTGG 


CATAACTCGT 


ATGAGTAAAG 


GCAGTTTCCA 




GTCTGCAAAT 


11700 


TCGATTGCAA 


AATGATTCTT 


TAGTACAGTT 


TGTAATTCTT 


TCATACCAAC 


CTCTTTCTAA 


11760 


CTGATAATAG 


TCCTTTTTAT 


TATATCAAAA 


AAAGCCCCCT 


GAGTCACTCT 


AAAACGGGAC 


11820 


TGGAAAGCAT 


TTGGGAATTC 


TTTAGACAGA 


GATTCTCAGT 


TTTAGCGGCA 


AATTTGGGTC 


11880 


AGGATAAAGA 


AAAAAGCCCT 


ATTAAAGGCT 


TTTTAGGATG 


TTTACATCCA 


CCCTGAGGGA 


11940 


ATCGAACCCC 


CATCTCAAGA 


ACCGGAATCT 


TACGTGATAT 


CC ATT AC ACT 


AAGGGTGGAA 


12000 


ACTTGTTTTA 


TTATAACAGA 


AATTTGCTCT 


AATAACAAGT 


TTTTTGGTCA 


AAGACCCCGT 


12060 


CTTAGTGGGA 


AGCATCCCCA 


TTCCAGATGG 


AGTTTTTCAC 


GATCACATAA 


TCAACGTGTT 


12120 


TAAGGTCAGC 


AACCTGACGT 


CCACCTGCAT 


AAGAAATAGC 


ACTTTGAAGG 


TCTTGTTCCA 


12180 
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TCTCACTTAA AGTGTCTTGC AGATGACCTT TAGCAGGAAG CAAGATACGT TTGCCTTCCA 12 240 

CATTTTTGTA AGCACCTTTT TGATATTGTG AGGCTGAACC ATAATATTCT TTGAACTGTT 12300 

CACCATCGAC TTCAATCGTT TTCCCTGGAC TTTCAATGTG TCCTGCAAAG AGCGAACCAA 12 3 60 

TCATGATCAT GCTAGCACCG AAGCGGATAG ACTTAGCAAT ATCACCGTGA GTACGAATTC 12420 

CTCCATCAGC GATAATCGGT TTACGCGCAG CCTTGGCACA CCAGCGTAGA GCAGCCAACT 12480 

GCCAACCACC TGTACCAAAA CCACTCTTAA CCTTGGTGAT ACAAACCTTA CCAGGACCGA 12540 

TTCCGACCTT AGTAGCATCC GCACCAGCAT TTTCCAATTC ACGCACAGCT TCTGGTCTTC 12600 

CCACATTTCC AGCAATGACA AACGTATCTG GCAATTCTTT CTTGATGTGT TGAATCATAG 12660 

AAATCACCCT ATCCGCATGA CCATGAGCAA TATCAATAGT GATATACTCA GGAGTATCAG 1272C 

CCTTGAGCTG GCTAACAAAA TCATACTCAT AATCCTTAAC ACCGACAGAG ATAGAAGCAA 12780 

TGAGCCCTTG ATTGTGCATT CGTTTAATAA AAGGAATGCG TCCTGCCTCA TCAAAACGGT 12840 

GCATAATGTA GAAGTAACCA CCTTTACCCA GTTGCTCTCC TACATTTTCA TCCAAAATCG 12900 

TCTGCATATT CGCTGGCACA ACAGGTAGTT TAAAGGTGTG ATTTCCTAAA GTGACACTTG 12960 

TATCCGCTTC TGCACGGCTT TTAATGACAC ATTTATTTCG AATCAATTGA ATATCTTCGT 13020 

AATCAAAAAT TGGAAATTCA TTTAACATAT CGATGTCTCG TTTCTTTTGT AATCACCTAC 13080 

CTATGCTCTT GCATCACTAC GCCTTTTCCG ACGTTTCCTG G 13121 
(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

CCGAATGCAA TGTTTACGGT TGAACTTGAA AATGGACATC AGATTTTAGC AACAGTTTCT 60 

GGTAAAATTC GTAAAAACTA TATTCGTATT TTAGCGGGAG ATCGTGTTAC TGTCGAAATG 120 

AGTCCATATG ACTTGACACG TGGACGTATC ACTTACCGCT TTAAATAATC GAAAAACTTG 180 

GAGGGATAAG AAATGAAAGT AAGACCATCG CTCAAACCAA TTTGCGAATA CTGTAAAGTT 240 

ATTCGTCGTA ATGGTCGTGT TATGGTAATT TGCCCAGCAA ATCCAAAACA CAAACAACGT 300 

CAAGGATAAG ATAGAAAGGA GAAAACATGG CTCGTATTGC TGGAGTTGAT ATTCCAAATG 360 

ACAAACGCGT AGTAATCTCA TTGACTTATG TTTATGGTAT CGGACTTGCA ACATCTAAGA 4 20 

AAATTTTCGC TGCTGCTGGA ATCTCAGAAG ATGTTCGTGT ACCTCATCTT ACATCAGATC 480 
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AAGAAGATGC TATCCGTCGT GAAGTGGATG CAATCAAAGT TGAAGGTGAC CTTCGTCGTG 540 

AAGTAAACTT GAACATCAAA CGTTTGATGG AAATCGGTTC ATACCGTGGT ATCCGTCACC 600 

GTCGTGGACT TCCTGTCCGT GGACAAAACA CTAAAAACAA CCCCCGCACT CGTAAAGGTA 660 

AAGCTGTTGC GATTGCTGGT AAGAAAAAAT AATATAGGAG GTAAAAGTCT TGGCTAAACC 720 

AACACGTAAA CGTCGTGTGA AAAAGAATAT CCAATCTCGT ATTGCTCATA TTCACGCTAC 780 

ATTTAATAAC ACTATTGTTA TGA^fTACTGA TGTGCATGGT AATGCAATTC CTTGGTCATC 840 

AGCTGGTGCT CTTGGTTTCA AAGGTTCTCG TAAATCTACA CCATTCGCTG CTCAAATGGC 900 

TTCTGAAGCT GCTGCTAAAT CTGCACAAGA ACACGGTCTT AAATCAGTTG AAGTTACTGT 960 

AAAAGGTCCA GGTTCTGGTC G7GACTCAGC TATTCGTCCG CTTGCTGCCG CTGGTCTTGA 1020 

AGTAACAGCA ATTCGTGATG TGACTCCAGT GCCACACAAT GGTGCTCGTC CTCCAAAACG 1080 

TCGCCGTGTA TAATCATCGC ATTACACTGC TTTTCCTTTA AGAGGGAGTA ACTAAATGAT 1140 

CGAGTTTGAA AAACCAAATA TAACAAAAAT TGATGAAAAT AAAGATTATG GCAAGTTTCT 1200 

AATCGAACCA CTTGAACGTG GCTACGGTAC AACTCTTGGT AACTCTCTTC GTCGTGTACT 12 60 

TCTAGCTTCT CTACCAGGAG CAGCTGTGAC ATCTATCAAC ATTGATGGTG TGTTACATGA 1320 

GTTTGACACA GTTCCAGGTG TTCGTGAAGA CGTGATGCAA ATCATTCTGA ACATTAAAGG 1380 

AATTGCAGTG AAATCGTACG TTGAAGACGA AAAAATCATC GAACTGGATG TTGAAGCTCC 1440 

TGCTGAAGTA ACAGCTGGTG ACATTTTGAC AGATAGCGAT ATTGAAATTG TAAATCCAGA 1500 

TCATTATCTC TTTACAATCG GTGAAGGTTC TTCTCTAAAA GCGACTATGA CTGTTAACAG 1560 

TGGTCGTGGA TATGTACCTG CTGATGAAAA TAAAAACGAT AATGCACCAG TTGGAACACT 1620 

TGCTGTAGAT TCTATTTATA CACCAGTTAC AAAAGTCAAC TATCAAGTGG AACCTGCTCG 1680 

TGTAGGTAGC AATGATGGTT TCGACAAATT AACCCTTGAA ATCTTGACAA ATGGAACAAT 1740 

TATTCCACAA GATGCTTTAG CGCTTTCAGC ACGTATTTTG ACAGAACATC TTGATTTGTT 18 00 

TACAAATCTT ACTGAGATTG CTAAGTCAAC TCAAGTCATC AAAGAAGCTG ATACTGAATC 1860 

TGACGACCGT ATTTTAGATC GTACGATTGA GGAACTGGAC TTGTCTGTGC GTTCATACAA 1920 

CTGTTTAAAA CGTGCCGGTA TCAA7ACTGT GCATGATTTG ACAGAAAAAT CTGAAGCAGA 1980 

GATGATGAAA GTACGAAATC TTGGACGCAA GAGTTTGGAA GAAGTGAAAC TCAAACTCAT 2040 

TGATTTGGGT CTTGGATTAA AAGATAAATA AAGGAGGAAT ACATGGCTTA CCGTAAACTA 2100 

GGACGCACTA GCTCACAACG TAAAGCAATG CTTCCCCATT TGACAACTGA CCTTTTGATC 2160 

AACGAATCAA TCGTGACAAC TGAAGCTCGT GCTAAAGAAA TCCGTAAAAC TGTTGAAAAA 2 220 



WO 98/18931 



PCT/US97/19588 



878 



ATCATTACTC 


TAGGTAAACG 


TGGTGATTTG 


CATGCACGTC 


GTCAAGCAGC 


TGCTTTCGTA 


2280 


CGTAATGAAA 


TCGCATCTGA 


AAACTATCAT 


GAAGCAACTG 


ATAAGTACAC 


TTCTACTACA 


2340 


GCACTTCAAA 


AATTGTTCTC 


AGAAATCGCA 


CCTCGTTATG 


CTGAACGTAA 


CCGTGGATAC 


2400 


ACTCGTATCC 


TTAAAACTGA 


ATCACGTCGT 


GGTGATGCAG 


CGCCAATGGC 


GATCATCGAA 


2460 


TTAGTATAAA 


ATCATCAATT 


TTGTTGAGTG 


TTATGATGAT 


GGAGTCTTGT 


GCTCTTAGTC 


2520 


TAGCTCTGGT 


CTACCGCTAG 


GATTTCGGTC 


CTAGCGGGAA 


C ACT CATC AT 


AAGTTGGGAT 


2580 


AGTAGACGCT 


TGTTTACGAA 


ATTGTTTTTT 


TCTTAACAAC 


AACTTCGTAA 


GCAGGCGTTT 


2640 


TTGAGTATTT 


TCGTTAGAAT 


TATGCTATAC 


TATTTGAAAA 


GAATCCTGTT 


TAATGTTAAG 


2700 


GTTTCTTATT 


TTAAGAAGAA 


TTGGAGTTTA 


CTTATGAAAC 


CCATTATAAC 


TGTTGTTGGT 


2760 


AAAGATAAAT 


CTGGAATTGT 


TGCAGGTGTT 


TCTGGTAAAA 


TTGCAGAATT 


AGGATTGAAT 


2820 


ATTGACGATA 


TCTCTCAAAC 


TGTCTTGGAT 


GAATATTTTA 


CGATGATGGC 


TGTTGTATCT 


2880 


AGTGATGAAA 


AGCAAGATTT 


TACCTATCTT 


CGTAATGAAT 


TTGAAGCTTT 


TGGGCAAACT 


2940 


TTGAATGTAA 


AAATCAATAT 


TCAGAGTGCA 


GCGATTTTCG 


AAGCTATGTA 


TAATATCTAG 


3000 


GAGGTCATCA 


TGGATATTAG 


ACAAGTTACT 


GAAACCATCG 


CCATGATTGA 


GGAGCAAAAC 


3060 


TTCGATATTA 


GAACCATTAC 


CATGGGGATT 


TCTCTTTTGG 


ACTGTATCGA 


TCCAGATATC 


3120 


AATCGTGCTG 


CGGAGAAAAT 


CTATCAAAAA 


ATTACGACAA 


AGGCGGCTAA 


TTTAGTAGCT 


3180 


GTTGGTGATG 


AAATTGCCGC 


TGAGTTGGGA 


ATTCCTATCG 


TTAATAAGCG 


TGTATCGGTG 


3240 


ACACCTATTT 


CTCTGATTGG 


GGCAGCGACA 


GATGCGACGG 


ACTACGTGGT 


TCTGGCAAAA 


3300 


GCGCTTGATA 


AGGCTGCGAA 


AGAGATTGGT 


GTCGACTTTA 


TTGGTGGTTT 


TTCTGCCTTA 


3360 


GTACAAAAAG 


GTTATCAAAA 


GGGACATGAG 


ATTCTCATCA 


ATTCCATTCC 


TCGCGCTTTG 


3420 


GCTGAGACGG 


ATAAGGTCTG 


CTCGTCAGTC 


AATATCCGCT 


CAACCAAGTC 


TCCTATTAAT 


3480 


ATGACGGCTC 


TGGCAGATAT 


GGGACGAATT 


ATCAAGGAAA 


CAGCAAATCT 


TTCAGATATG 


3540 


GGAGTGGCCA 


AGTTGGTTGT 


ATTCGCTAAT 


GCTGTTGAGG 


ACAATCCATT 


TATGGCGGGT 


3600 


GCCTTTCATC 


GTGTTGGCGA 


AGCAGATGTT 


ATCATCAATG 


TCGGAGTTTC 


TGGTCCTGGT 


3660 


GTTGTGAAAC 


GTGCTTTGGA 


AAAAGTTCGT 


GGACAGAGCT 


TTGATGTAGT 


AGCCGAAACA 


3720 


GTTAAGAAAA 


CTGCCTTTAA 


AATCACTCGT 


ATCGGTCAAT 


TGGTTGGTCA 


AATCGCCAGT 


3780 


GAGAGACTGG 


GTGTGCAGTT 


TGGTATTGTG 


GACTTGAGTT 


TGGCACCAAC 


CCCTGCGGTT 


3840 


GGAGACTCTG 


TGGCACGTGT 


CCTTGAGGAA 


ATGGGGCTAG 


AAACAGTTGG 


CACGCATGGA 


3900 


ACGACGGCTG 


CCTTGGCCCT 


CTTGAACGAC 


CAAGTTAAAA 


AGGGTGGAGT 


GATGGCCTGC 


3960 


AACCAAGTCG 


GTGGTTTATC 


TGGTGCCTTT 


ATCCCTGTTT 


CTGAGGATGA 


AGGAATGATT 


4020 
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GCTGCAGTGC AAAATGGCTC TCTTAATTTA GAAAAACTAC AAGCTATGAC GGCTATCTGT 4 080 

TCTGTTGGAT TGGATATGAT TGCCATCCCA GAAGATACGC CTCCTGAAAC TATTGCGGCT 4140 
ATGATTGCGG ATGAAGCAGC AATCGCTCTT ATCAACATGA AAACAACAGC TGTTCGTATC 4200 
ATTCCCAAAG GAAAAGAAGG CGATATGATT GAGTTTGGTG GTCTATTAGG AACTGCACCC 4260 
GTTATGAAGG TTAATGGGGC TTCGTCTGTC GACTTCATCT CTCGCCGTGG ACAAATCCCA 4 3 20 

GGACCAATTC~ATAGTTTTAAT AAATT AAG AA~~AAT AGGAGAA " ATTTTAACTTT CT ATTT AAGA ~4~38C~ 
TTAGACGTGT ATACTATAAT CATTAAATAA AGACCTCCTA ATATTATTTG AAACAGATAA 44 40 

CACTGAATTA CTTTGAATTT GATTTTCATC TAATATCTTT ATTTAATGAA CTCCTAAACT 4 500 

TTTTCATAAT AATCTCCTTC AAAAGTCGCC TGTATGGCTG GCTTTTATTT TATCATTCAT 4 560 

GATATAATAG AAGCAAACGG AGGACGGAAA ATGGTAAAAG TACGATTCTA TTTGGTACGT 4620 
CATGGCAAGA CCATGTTTAA CACGATTCCT CGCGCGCAAG GTTGGAGCGA TACTCCCTTA 4680 

ACTGCTGAAG GTGAACGAGG GATTCAAGAG TTAGGAATCG GTTTGCGAGA ATCTGATCTA 47 40 

CAGTTTGAGC GTGCTTATTC GAGTGATTCT GCTCGTACCA TTCAGACCAT GGGAATTATC 4 800 

CTTGAAGAAC TTGGCTTGCA GGGGGAAATC CCTTATCGCA TGGACAACCG TATCAGAGAA 4860 

TGGTGTTTCC GTAGTTTTGA TGGAGCCTAT GATGGCGATC TTTTCATGGG CATTATTCCT 4 920 

CGTATCTTTA ATGTGGACCA CGTTCACCAA TTGTCTTATG CTGAACTGGC TGAGCGCTTG 4 980 

GTAGAGGTCG ATACAGCTGG TTGGGCTGAA GGCTGGGAAA AACTCAGTGG CCGAATCAAG 5040 

GAAGGCTTTG AAATGATTGC AAAAGAAATG GAAGATCAAG GTGGAGGTAA CCCCCTTGTT 5100 

GTCAGCCA7G GAATGACTAT TGGAACCATT GTTTATCTGA TTAATGGCAT GCATCCGCAT 5160 

GGTCTGGATA ATGGTAGCGT GACAATCCTT GAATATGACC ACGGCCAGTT TAGGGTTGAA 5220 

GTTGTCGGTG ACCGTAGTTA CCGAGAGCTA CCACGTGAGA AGATGGAAGA AGGCTCTATT 5280 

TAATCAGTCT AGACTTGCTT GCCATGAGCT AGGGATTTGA TAAGAATATC AAGATAAGAA 53 40 

AAAACAGCCG AGGGCACTCC TTTCGGCTGT TTTTGATGTG GAAAACTAAA GTGTAATGCT 5400 

ATTCCTTTTA GAGATTTTCA TAAACAAGAG CAAGGAACCT ACTGTTAGAA CAGTCAGGAT 54 60 

AGTTGACAAG GTTGCGGCTA CACCGTAATT TCCTCTGAGA ACCTCTGTAT AAATAGCTAC 5 520 

AGTCATTGTT CTTGTTTTGA CATTGTAGAG GAGGATAGAA CTAGAGACTT TTGAAATCAT 5 580 

TGTGACTCAA GATAAGATGG CTCCAGAAAT GATACCAGAT AGCATCATTG GAGTTGTAAT 5640 

CTTAGCAAAG GTATTGAGAC GACTACTTCC TAAGCTTTCA GCACCTTCTT CAATACTTGG 57 00 

TGCTATTTGT TGTAAGCTAG CAACAGATGA GCGAATAGTA TAAGGTAATC TTCTGGCAGA 57 60 
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TAG AG AC AT A ATCAAGATGA AAGCAGTCCC TGTAATCATA AGAAATCCAC TTCCAAATAG 5820 

ACCACTATTC AAGGAAGAAA TGAAGGCAAT CCCTAGAACG GTTCCTGGTA CAATATAAGG 5880 

TACCATACTG AGGCTGTCAA TTAAGTTTGT AAACAAATTC CGTTTTCTAA CGGCTAGGTA 59 40 

GGAGATAAAT GTCGCAAATA GAACAACTAG AACTAAGGCA ATCAAAGGGA TACGAATGGT 6000 

ATTGAAAATA GCAGATCCCA TACGATGGAA AGCTACCTTG TAACTGTTTG GAGAATAACC 6060 

TTTAACAGAT ACCATACCTG ATGTTTTTAG GAAAGAGGTA TAAATTAAGT AGATTTGAGG 6120 

TAAAACAGAG ATAAAGATAA TTCCGTAGAC TGTTGCATAA ATGGCAGCCA TTTTTCCTTT 6180 

TGTAGTTTTT TTAGGCTCAA TTGGATGGAG CAGATTCATG CTGAAACTGT AGCGGTTTGC 62 40 

AATGTGTTTT TGGATAAGGA AAATTGCCAA GGCAATGATA ATCGCCATAA TTCCAAAACC 6300 

AGAATTTCCT CCAACCTCGC T AAT AAATTG GGTATAAATC AGGACAGGGA AAGTCCGATA 63 60 

CCCTTCGCCA ATCAACATAG GCGTTCCAAA GTCTGAGAAT GCTCTCATAA ATACAAGCAA 6420 

GGAGCTGCTA GTAAGGTTGG AACTAGGAGA GGTAAAACAA CCGTTACGAT AGGTTTAAAT 6480 

CCGAAGGACC CCATGCTTTC AGCTGCTTCA AGTAGAGAAT TGTCAATACT GTTCATTGTT 6540 

CCAGCAACAT ATAGAAATAC CAGTGGGAAT AGTTGCAGTG TAAAGACAAG TACAATTCCT 6600 

TTGAATCAAT AAATATCGAT ACCTGGAAGA TAAAGGGCAT TTGTCAAAAA TTTAGTGATC 6660 

ACCTCATTTC GTCCTAGCAA GAGAACCCAG GAGTAGGCTC CTACGAAAGG AGCTGACATG 6720 

GAAGCAATGA TAATCAATAT TTGTAGAAAT TTCTTCCCCT TGAAGTCATA CATAGAGAAG 6780 

AGATAAGCTA ATAGGCTTCC TACAACTAAG GAAGTGATAG TAGCGGTAAT GGAAACCTTC 6840 

AAACTGTTGA CTAGTGTCTC AGAGTAGTAG GCTTTACTAA AGAAAGTGAC AAAATTAGCT 6900 

AGTGAGAATT GTCCTTCATG TATAAGTCCT TGCTTGAGCA CGGTAACGAT AGGATAAACG 6960 

AGAAAGATAG GATAGGTAAG AAAGAGGAAG AAAGAGGAAA CTGTCCAAAT ATTTAGTTTT 7020 

TTACGTTCCA TGGTTGACTC CTTTTATCAG GTTTTGGGAA CCATCTGCAG AAAACATGTT 7080 

TAATTTTTGC GTATTGATTC GTAGACGAAT ACGATTGCCT TTTTGTAGAT CTTCTTCAAA 7140 

AGTTGATTCT TCACTAACTT GAATTTTTGA GGCAAAACCT CTCTCAATGA AATAATCCGT 7200 

ATTTAGTCCA AGATAGACGC TATCTCTAAT AGTTCCTTCA ATATCTCCAG ATTCATCTTT 72 60 

GATAAACTCT TCGGGACGAA TGCTTACATG AATAGCTTGC TCCTCAACCT GATCAAGAGC 73 20 

TGGCATTCGA AGGGCATAGC CATCTGAAAA GACGATATAA GCGCCGTCCC TCCGTTTTTC 7 380 

AAGATTGGCA GGGATAATAT TTGTGCGTCC GATAAAGGTT GCCACAAACT CATTAGCTCG 7440 

TTTATGATAC AGTTCTTTTC GTCGGCCGAT TTGTTGGATC ACCCCATCTT TCATAACAGC 7500 

AATTTGGTCT GAAATAGCCA TGGCTTCTTC TTGGTCGTGG GTTACATAAA CAGTTGTAAT 7560 
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TCCCACTTCG 


1 TGTTGGATTT 


1 CTCGGATGGC 


1 TTCACGCATA 


TCCAAGCGAA 


* GTTTGGCCTC 


' 0 £. U 


CAGATTACTA 


AGTGGCTCGT 


CCATGAGGAG 


, AACAGTTGGA 

nnV-rtv. 1 1 vl\J/> 


TTAACCGCTA 


AGGCGCATGC 




CAAGGTGACA 


CGTTGTTGTT 


GTCCACCACT 


GAGTTTATCG 

vmvj i i in t V.U 


GGCTTTCGAT 


CCGCATATTG 




AGCAATTTGC 


ATGAGTTCAA 


GATACTTGTT 

\Jf\ 1 nv* 4 4 4 4 


VjVj 1 v. 1 vj 1 1 vjM 


ATCAATTCTT 


CTTTTGGAAC 


7800 


CTTCT TTTGC 


ATAAGACCAA 


AAGCAACCTT 


vj i V- 1 V. vjVjM v»/\ 


GTCAAATGTG 


GGAAAATAGC 


7860 


GTAGTTTTGG 


AAAACCATCC 


PGATATTGCG 


111 VJV. 1 Vjvivj 1 


TCCATATTAT 


TGATTTTTGT 


7920 


ATCATCGAAG 


TAAAAT*CTC 


CArrTTCGAT 


A f^TPTTf AAA 


CCTGCAATCA TACGAAGAAC 


7980 


GGTCGTTTTC 


CCACATCCTG 


aacctv* e aag 

r\f*i\j\— i v, v, awj 


AAGrV-TAAAf". 


AGACTTCCTT 


TTGGAATTGT 


8040 


AATGTTCAAA 


• k V_ * V./W 1 rtrt 


GAGGGAP ATP 


Vj 1 vjvj I Mw\ 1 i 


TTTTTGGCGT 


TAATAATTTT 


8100 


GATCTCACTC 


ATAGTGAACC 


4 V» * A 1 4 <*V- i. VJ 


T TT A fl A TTd n 
111 /\un 1 1 VjVj 


ATATCTGTAA 


AGACTTCCTT 


8160 


GT ATTTCTT A 


ACCATATCTC 


A TT T A TT GTT 




TCATAATCTT 


CAGTGAGTGT 


8220 


TTTGATTTTG 


TCAATTGGTT 


T G A T GT TTTG 

1 V-#% 4 VI 4 4 4 1 >m 


vjv. 1 lul L 1 In 


GCATTTTTAC 


GAACAGGACG 


8280 


GTTAGTAGTG 


GTTGTACCAA 


GTCTATGTTG 


T Af^TTfTTf* A 


GAGATAATAA 


AATCGATAAA 


8340 


TTTCTTGGCA 


TTTTfC* A T A*** 


ill* /\vjj\ iii 


TTT A A GOAT A 


CCAGCACTAG 


CAGGTAGGAA 


8400 


GACGGTTCCT 


TCTTTT GG AT 

4 V* 4 4 1 4 1 


AGACTACCTT 


AATGTT AC C*V 
*w 1 vj 1 k Aviv. 1 


CCGTCATTTA 


AGAGTTTAAC 


8460 


TGCTGGATCT 


TCATAAGACA 


GACGAAGAGG 


A TTT^T^P 11 
V.nl L IV. 1 LLH 


TCACCGACTA 


CTTTATAGAC 


8 520 


ACTAGATGAA 


CTTGAACCGA 


TTTTACGATG 


AATAAfTTCTf: 
nJ\ 1 /vtVvj 1 vj 1 vj 


AAAAGATCTT 


TTACATAAGA 


8580 


CCAAGCCTTA 


T A TCTTTGT 

4 w 4 v> 4 4 4 *J 4 


AACCACGTTG 

/\nv»v»#^V»V» 1 1 


/\VjV_ * 4 \J 1 MVJv> 


ATATTTGTTA 


ATTGAGCAAA 


8640 


GGCGCTAGAA 


GAGTTTGCTG 

Wn\J 4 4 4 4 \J 


G GTC AG C ACT 

v? vj 1 wnviwnw 4 


TGGGAT^TTT 


CCTTTTAGTT 


CAGGTTTGAA 


3700 


AAGATCGTTA 


4 o 4 ^ \« 4 1 ^*yn 


TGTTCATCGG 

1 VJ 4 4 Vai^ 4 VJ V» v> 


TTTAGTT A A A 
ill rtvj i i 


TCAGGGTTGA 


CGATTAAAAC 


9760 


ACTACCATCT 


AGTGTATAAG 


GAGTAGAGTA 


GGG AG T^GTG 
vjv>v_nvj i t vj I Vj 


TTTTGATATT 


CTTTGATAAC 




ATTATCATTT 


TCTTTTGAAG 


TATAGTTTTC 


AAAGAGTTCT 


CCGTGGGTAG 


TATATTGTGT 


8880 


ATAAGAACCA 


CCAAAGATAA 


CATCAGCTAC 


AGGAACTTCT 


TTTTCTGACT 


CTAGTTTTTT 


8940 


GAAAACTTCT 


CCAGTACCAG 


CTTGAATCAG 


TTCTACTTTG 


ATACCATATT 


TTTCTTCAAA 


9000 


GGCAGGAATA 


GTTGCTCCAA 


TTAAGCCCTC 


TGAGTTTGGT 


CAATAAACGA 


CTAGCGAACC 


9060 


GCCGTCTCCT 


TTATCACATG 


AACTGTCATC 


GGCAGATTCA 


TTAGAAGAAC 


AAGCAGCATA 


9120 


ATACATCCAT 


TTCTTTTTCA 


TGATGGATAC 


CTCCGTTGTG 


TTATTTAAGT 


TTATTTTAAA 


9180 


ACAATGTAAG 


CGTTTTTAAA 


ACATACAATT 


CTATTCTATA 


GTGTATTGAA 


TCTATAACAG 


9240 


TACACTTTGA 


CTGCTAAAAT 


ATTTCTATAA 


ATTAATTTCA 


CTTTCCTGAT 


AGAGATGTTC 


9300 
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ACATCTTATT TCAATTCACT ATATTAGAGT AAAATTCTCT ACAAAAAGAA GAATAGCCTA 93 60 

TTTTACTATT CTTCTGAGTG ATTTCAATTC CTTTCGGGAA ATATGGAGAT ACTTTTTAAA 9420 

TCCTGACAAA TGCTTGTTTC TTTTTCTAAA TCGGTGATAC TGTATCGGAG AATGCGCGTG 9480 

AGGTCACAAA GGCTGCGATA GAGCTTCTAT GGAGAATTTC TTTTTGGAGA GATTTTTTAA 954 0 

AGGAATGAGA CATCCGCTAC CTCCTTGGAA GGTTTTTG 95 7 g 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 13440 base pairs 
(8) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

CGGGCTCTTG TGACGATTCT TATTTCTATC TGTGTTATCT TTTTGGGAAC TATTTTGGGT 60 

GTTGTCTTGG CTTTTGGGCA ACGTTCAAAG TTTAAACCGC TTGTTTCGTT GGCCAACTTG 120 

TACGTTTGGA TTTTCCGTGG GACACCGATG ATGGTTCAAA TTATGATTGC CTTTGCTCTT 180 

ATGCATATCA ATGCTCCGAC TATTCAGATT GGAATTTTAG GTGTTGATTT TTCGCCTCTG 240 

ATTCCAGGGA TTTTGATTAT CTCTATGAAT AGTGGTGCTT ATGTTTCGCA GACTGTTCGT 300 

GCCGGAATCA ATGCGGTTCC AAAACCTCAC CTAGAAGCGG CTTATTCGCT AGGCATTCGT 360 

CCTAAAAATG CGATGCGTTA TGTGATTTTG CCACAAGCAG TCAAAAATAT CTTGCCACCA 420 

TTGGGGAACG AATTTATCAC CATTATCAAG GACAGCTCCC TCTTATCAGC TATTGGGGTC 4 80 

ATGGAGTTGT GGAATGGGGC TACAACAGTT TCTACAACAA CCTATCTACC TTTAACACCA 540 

CTTTTATTTG CAGCATTTTA CTACTTGATT ATGACCTCTA TTCTGACAGT AGCCTTGAAA 600 

GCTTTTGAAA AACATATGCG ACAAGGAGAT AAGAAATAAT GACAGAAACC TTGATAAAAA 660 

TTGAAAATTT ACATAAATCC TTTGGAAAGA ATGAAGTATT GAAGGGCATC AACCTCGAGA 720 

TTAAAAGAGG AGAAGTTGTC GTTATCATCG GTCCTTCAGG GAGCGGGAAA TCTACCTTGC 7 80 

TTCGCTCTAT GAATTTGTTG GAAGAAGCAA CCAAGGGGAA GGTTATCTTT GAGGGAGTCG 840 

ATATTACGGA CAAGAAGAAT GACCTGTTTG CCATGCGTGA GAAGATGGGC ATGGTTTTTC 900 

AACAATTCAA TCTCTTTCCT AATATGACTG TGATGGAAAA TATCACCTTG TCCCCTATCA 960 

AGACCAAAGG TGACAGTAAG GCCGTTGCAG AGAAAAGAGC TCAGGAACTT TTGGAAAAAG 1020 

TTGGTTTGCC AGATAAGGCA GACGCTTATC CACAGAGTTT GTCAGGTGGC CAGCAACAGC 1080 

GGATTGCCAT CGCGCGTGGG TTGGCTATGG AACCAGATGT TTTCCTCTTT GACGACCCAA 1140 
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CTTCACCCCT ACATCCTCAG ATGGTTGGAG AAGTTCTGGC TGTTATGCAA GATCTAGCCA L2C0 

AGTCAGGAAT GACCATGGTT ATCGTAACAC ATGAGATGGG ATTTGCCCGT GACCTGCCAG 12 60 

ATCGTGTCAT CTTTATGGCA GACCGTGTGC TTGTTGAAGA CCGAACACCT GAGCACATTT 13 20 

TTGAACAAAC CCAAGGACAA AGGACTAAAG ACTTCTTGAG TAAGGTTTTA TAAGTTACCT 1380 

TTGTTTAGCT ATTTGTAGCC AGCTTTAAAC GTTAAAGAGA AGATTAGTCA AAACCTCAAC 14 40 

CAGAGCTTTT TCTTATAGTT TAAAGCTATA GGATTGCCTA GGAAAGAAGT GTTAGAGCTA 1500 

CAT7GTATTT TTTGGTATAA TTAAAGATAT TTGTAAGAAA AGAGAAGTGA TAT G AC AC AG 1560 

ATTATTGATG GGAAAGCTTT AGCGGCCAAA TTGCAGGGCC AGTTGGCTGA AAAGACTGCA 1620 

AAATTAAAGG AAGAAACAGG TCTAGTGCCT GGTTTGGTAG TGATTTTGGT TGGGGACAAT 1680 

CCAGCCAGCC AAGTCTACGT TCGCAACAAG GAGAGGTCAG CCCTTGCGGC TGGTTTCCGT 1740 

AGCGAAGTAG TACGGGTTCC AGAGACCATT ACTCAAGAGG AATTGTTAGA CCTGATTGCT 1800 

AAATACAATC AGGATCCAGC TTGCCATCGG ATTTTGGTTC ACTTCCCATT ACCAAAACAC 1860 

ATTGATGAAG AGGCGGTTCT ATTGGCTATT GACCCAGAAA AGGATGTGGA TCGTTTCCAT 192 0 

CCTCTAAACA TGGGGCGTCT TTGGTCTGGT CATCCAGTCA TGATTCCTTC CACACCGGCA 1980 

GGAATTATGG AAATGTTCCA TGAATATGGG ATTCACTTGG AAGGTAAAAA TGCACTCGTC 2040 

ATCGGTCGAT CCAATA7TGT CGGAAAACCT ATGGCCCAGC TTCTTTTGGC AAAGAATGCA 2100 

ACAGTAACCT TGACTCACTC ACGTACTCAT AATCTTTCCA AGGTGGCTGC AAAAGCAGAT 2160 

ATTCTGGTTG TTGCAATCGG TCGTGCCAAG TTTGTCACTG CTGACTTTGT CAAACCAGGT 2 220 

GCGGTAGTCA TTGACGTTGG GATGAACCGC GATGAAAATG GTAAGCTCTG TGGGGATGTT 2 280 

GATTATGAGG CGGTTGCCCC ACTTGCTAGC CACATTACGC CAGTCCCTGG AGGTGTCGCT 2 340 

CCTATGACCA TTACTATGCT GATGGAGCAA ACCTATCAGG CAGCACTTAG GACATTGGAT 2400 

AGAAAATAAG ATAAAAATTT TCTGAGGAAA GTGTATTTTC TATAGCTATA TCTAAAATGA 2460 

TAGAAATGAA TATTAAATTT TAGAAATAAG TTTATAAAAG GAGGTTTGCG CCTCCTTTTT 252 0 

GTTGTATAAT GGAGTGAGGT GATTAGATGA TTTTAAAAAT TTATAATGGG GAATATAGTT 2 580 

TACAATGGGA TGGAATATAC TACTTAGCAC TAATTGATTA TCCAAATATT CAAGAGTGGG 2 640 

AATTAGAAAA AATTCCTAAA TTTATAGCTT ACGAAAAACT TCATAAACGT CAAACAAGTA 2700 

TTGAGTGTGC TGATTCTTGT TTAAAAAAAG AAATTTTAGA TTACATCTGT CAGCATCCCT 27 60 

TTCTGCCACC ATTTACTCCT ACAGATAAAA GAGTAGCCTC GACTTATGAC CTACATAAGA 2820 

GGTTAGTGAC TTCAGACTAC TGTAGTCATA CTACGACTAT AGATGCAGCG ATTTCTATTT .2880 
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TTAAAACTGG 


TCGTCTTTTA 


TCTGCTGTGA 


AAGCCTTTGG 


GCGAGATGCT 


GAGGAGTTGG 


2940 


TTTTGGATAG 


TCGAAATGCT 


GCATCTGATC 


CGATAGATTA 


TTTTGACTAT 


GTCATGTTAG 


3000 


GGTGGTCAAA 


TACAAGTTCT 


GGTTATCGAT 


TCCCGATGCA 


GCGTTTATTA 


GGTCGAGCTC 


3060 


CTTCAGAGAA 


AGAATTACAA 


GACAAGTTTA 


TTCCTGGAGT 


AAGTTTTCAT 


TTTATCTATA 


3120 


CAGATTTGAT 


TAAAGTTCCT 


GGTTATATTT 


TTGATGGTTA 


CCSTGCTGTA 


AAAATTAAGG 


3180 


ACATGCTTAA 


TTTATTAAGT 


GAGTTGTATA 


TTTGCATTAT 


TCCAACTCAT 


AATAAGAGCC 


3240 


AATTTGAAAA 


TATTATTCCA 


ACCAAAATAC 


AAGATAGGGT 


GTATTATCTT 


GACTATGCTG 


3300 


GAGAAGACTT 


AGAAGAGTGG 


ACTAAGAAAG 


TCTATCAACT 


TGTTTTAAAA 


C AATC AG AT A 


3360 


AAGGATAGTT 


GAGGAAAAAA 


CGATGAAAGT 


G ATT CATC AA 


ACCTTACTAG 


AAAAAGTCAT 


3420 


TATTGAACGT 


TCTTGTACAA 


GTCATAAAGG 


AGACTACGGT 


CGTCTGCTGT 


TGCTTCGTGC 


3480 


GACTTATCCT 


TATGGTGGTG 


CCATCATCAT 


GGCTGCTTTA 


GCAGCTGTAA 


AAACCGGTGC 


3540 


AGGATTGGTA 


ACCGTTGGAA 


CGGACAGGGA 


AAATATCCCT 


GCTCTACACA 


GCCATTTCCC 


3600 


TGAGGCTATG 


GCCTTTTCTC 


TGCAAGATCA 


GTAATTGTTA 


CAAGAGCAAT 


TGGAGAAGGC 


3660 


AGAAGTTCTC 


TTCCTGGGGC 


CTGGTTTACG 


AGACGATACG 


TTTGGAGAAA 


ATCTTGTAAA 


3720 


ACAGGTCTTT 


GCTAGCTTAA 


AAAAGAATCA 


GATTTTGATT 


GTAGATGGAG 


GGGCCTTAAC 


3780 


CATCCTTGCT 


AGGACAAGTT 


TGTTGTTTCC 


ATCTAACCAG 


CTTATCTTAA 


CTCCCCACCA 


3840 


AAAAGAATGG 


GAAAAACTGT 


CTGGTATTGC 


TATTGAAAAG 


CAAAACCAAG 


GTACAACATC 


3900 


TAGTGCCCTG 


ACTTCTTTCC 


CTCAAGGAAC 


AATTTTGGTA 


GAGAAAGGTC 


CAGCTACTCG 


3960 


TATTTGGCAA 


GTTGGCCAGT 


CTGATTATTA 


CCAGTTAAAG 


GTTGGCGGTC 


CCTATCAGGC 


4020 


GACTGGTGGT 


ATCGGTGATA 


CACTGGCTGG 


AATGATTGCA 


GGATTTGCAG 


GCCAATTTCG 


4080 


ACAGGCCAGT 


CTCTACGAAC 


GTGTGGCAGT 


AGCAACCCAT 


CTTCATTCAG 


CCATAGCCCA 


4140 


AGAACTATCT 


CAAGAAAATT 


ATGTGGTCTT 


GCCGACGGAA 


ATTAGTAATT 


GTCTTCCTAA 


4200 


AGTAATGAAA 


AGATATGTCT 


AAAATAGTTA 


GACAAAAAAT 


GTTGATAATT 


TGTATCATTA 


4260 


TTCTTAATTC 


ACAAAAAACG 


AACGTTTAGT 


ATTCTTCTTG 


CTAAGAAACT 


AAATTTGTTC 


4320 


GTTTTTTTAC 


TCTTGTAAAT 


CTATTTTTCT 


TAGAGTTGAT 


TTGGTTTACA 


TCCGTACTTA 


4380 


AATTGATTTG 


TTAGAGCTCT 


ACTTTTATTA 


AAAAAATTCA 


ATTTCAAGGA 


TAAATAAGCA 


4440 


GTATTCTAAA 


GGTACTTTTA 


GATGAAATAA 


AAGCCTTTAC 


ATGGTATAAT 


AGACGTAGCT 


4500 


CTTTAATGGA 


GGTGTTTGAG 


TGGAAAATCT 


GAAGAAAATG 


GCAGGTATCA 


CCCCTGCTGA 


4560 


ATTTATCAAG 


GATGGGATGG 


TTGTAGGGCT 


AGGAACAGGT 


TCTACTGCCT 


ATTATTTTGT 


4620 


CGAAGAAATC 


GGTCGTCGAA 


TCAAGGAAGA 


AGCCTTGCAG 


ATTACAGCTG 


TGACGACTTC 


4680 
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TAGTGTGACC AGTAAACAGG CTGAAGGGCT CAATATCCCG CTCAAGTCTA TTGACCAAGT 4740 

AGACTTTGTC GATGTGACAG TCGACGGGGC GGATGAAGTG GATAGTCACT TTAATGGAAT 4 900 

CAAAGGCGGT GGTGGTGCCC TTCTCATGGA AAAGGTGGTC GCAACACCAT CAAAAGAATA 4 860 

CATTTGGGTG GTGGATGAAA GCAAGCTGGT CGAAAAACTA GGTGCTTTTA AATTGCCAGT 4 9 20 

AGAAGTGCTT CAGTATGGTG CAGAGCAGGT CTTTCCTCAT TTTGAACGAG CTGGCTACAA 4 980 

ACCAAGTTTC CGTGAAAAAG ACGGCCAACG TTTTGTGACC GATATGCAGA ATTTTATCAT 5C4 0 

TGACCTCGCC TTGGATGTCA TTGAAAATCC AATTGCTTTT GGACAAGAAT TGGACCATGT 5100 

CGTTGGTGTT GTGGAGCATG GTTTATTCAA CCAAATGGTG GATAAGGTAA TCGTTGCTGG 5160 

ACGAGATGGA GTTCACATTT CAACTTCAAA AAAAGGAAAA TAGAAGGGGG CATAAGATGT 52 20 

CTAAATTTAA TCGTATTCAT TTGGTGGTAC TGGATTCTGT AGGAATCGGT GCAGCACCAG 52 80 

ATGCTAATAA CTTTGTCAAT GCAGGGGTTC CAGATGGAGC TTCTGACACA CTGGGACACA 5340 

TTTCAAAAAC AGTTCCTTTC AATGTCCCAA ACATGGCTAA AATAGGTCTT GGAAATATTC 5400 

CTCGTCAAAC TCCTCTTAAG ACTGTAGCAG CTGAAAGCAA TCCAACTGGA TATCCAACAA 54 60 

AATTAGAGGA AGTATCTCTT GGTAAGGATA CTATGACTGG ACACTGGGAA ATCATGGGAC 552 0 

TCAACATTAC TGAGCCTTTC GATACTTTCT GGAACGGATT CCCAGAAGAA ATCCTGACAA 5580 

AAATCGAAGA ATTCTCAGGA CGCAAGGTTA TTCGTGAAGC CAACAAACCT TATTCAGGAA 564 0 

CGGCTGTTAT CTATGATTTT GGACCACGTC AGATGGAAAC TGGAGAGTTC ATTATCTATA 5700 

C7TCAGCTGA CCCTGTTTTG CAGATTGCTG CCCACGAAGA CATTATTCCT TTGGATGAAT 5760 

TGTACCGTAT CTGTGAATAC GCTCGTTCGA TTACCCTTGA GCGTCCTGCC CTTCTTGGTC 5820 

GCATCATTGC TCGCCCTTAT GTAGGTGAAC CAGGTAACTT CACTCGTACG GCAAACCGTC 5880 

GTGACTTGGC TGTATCTCCA TTTTTCCCAA CTGTTTTGGA TAAATTGAAT GAGGCTGGTA 594 0 

TCGATACTTA TGCTGTGGGT AAAATCAACG ATATCTTTAA CGGTGCTGGT ATCAACCATG 6000 

ACATGGGTCA CAACAAGTCA AATAGTCATG GAATTGATAC ACTATTGAAG ACTATGGGAC 6060 

TTGCTGAGTT TGAAAAAGGA TTCTCATTCA CAAACCTAGT TGACTTTGAT GCCCTTTACC 6120 

GCCATCGTCG TAATGCTCAC GGTTACCGTG ATTGCTTGCA TGAGTTTGAT GAACGCTTAC 6180 

CTGAAATTAT CGCAGCTATG AGAGAGAATG ACCTTCTCTT GATTACTGCC GACCATGGAA 6240 

ATGACCCAAC GTATGCAGGA ACGGATCACA CTCGGGAATA TATTCCATTG TTGGCCTATA 6300 

GCCCTGCCTT TAAAGGAAAT GGTCTCATTC CAGTAGGACA TTTTGCAGAT ATTTCAGCGA 6 3 60 

CTCTTCCCCA TAACTTTGGT GTGGAAACTG CTATGATTGG GGAAAGTTTC TTAGATAAAT 64 20 
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TGGTATAAGA 


. TGACGCGCTA 


TGCTTTGCTG GTGAGAGGTA 


> TCAATCTTGG 


; TGGTAAGAAT 


6480 


AAGGTCGTCA 


TGGCGGAGCT 


TCGTCAAGAA TTGACAAACT 


1 TGGGACTGGA 


. AAAGCTTGAG 


6540 


AGCTACATCA 


ATAGTGGCAA 


TATTTTCTTT ACTTCGATAG 


ATTCCAAAGC 


CCAATTGGTT 


6600 


GAAAACCTAG 


AGACTTTCTT 


TGCAGTCCAT TATCCATTTA TTCAGAGCTT 


TTCTTTACTG 


6660 


AGTCTAGAGG 


ACTTTGAGGC 


GGAACTTGAA AATCTACCAG 


CTTGGTGGAG 


CAGAGACTTG 


6720 


GCACGAAAAG 


ATTTTCTCTT 


TTACACTGAG GGTTTGGATG 


TGGACCAAGT 


CATCGCGACA 


6780 


GTTGAAAGTT 


TAGAGCTGAA 


AGATGAAGTG CTTTATTTTG 


GAAAACTTGG 


GATTTTCTGG 


6840 


GGGAAATTTT 


CTGAAGAATC 


CTATTCTAAG ACTGCCTATC 


ATAAGTACTT 


GCTGAAGGTG 


6900 


CCTTTCTACC 


GCCACATTAC 


TATTCCTAAT GCTAAAACCT 


TTGACAAAAT 


TGGTCAAATG 


6960 


CTAAAAAAAT 


AATAAAGGAG 


ACACACAATG ACATTTTTAA 


ACAAAATCCA 


TGAAACTGCT 


7020 


ACTTTCCTGA 


AAGAAAACGG 


AATTGCAGCC CCTGAGTTCG 


GTCTAATCCT 


TCGATCAGGA 


7080 


CTTGGACAAT 


TGGCAGAAGA 


AATCGAAAAT CCAGTTGTAG 


TAGACTATGC 


TGAGATTCCA 


7140 


AACTGCCGCC 


GTTCAACAGT 


ACTCGGTCAT GCTGGTAAAT 


TGGTATATGG 


TGAACTGGCA 


7200 


GGTCGCAAGG 


TCTTGGCTCT 


TCAAGGGCCT TTCCATTTCT 


ATGAAGGCAA 


TCCTCTGGAA 


7260 


GTGGTGACTT TCCCAGTTCG TGTCATGAAA GTTCTTGGAT 


GTGAAGCTGT 


TATTGTAACC 


7320 


AATGCAGCTG 


GCGGTATCGG 


ATTTGGTCCT GGTACCTTGA 


TGGCTATCTC 


AGACCATATC 


7380 


AACATGACGG 


GGCAAAATCC 


ATTGATGGGT GAAAACTTGG 


ATGACTTTGG 


CCCACGTTTC 


7440 


CCAGATATGT 


CTAGGGCCTA 


CACACCAGAA TACCGTGCCA 


CTGCCCATGA 


AGTGGCTAAA 


7500 


AAACTTAATA 


TCAAGCTTCA 


TGAAGCTGTC TATATCGGAG 


TTACTGCTCC 


GACTTATGAA 


7560 


ACACCACCAG 


AAATTCGTTC 


CTATAAGACA CTGGGAGCAG 


ATGCAGTTGG 


TATGTCTACG 


7620 


GTTCCTGAAG 


TTATCGTGGC 


AGCCCACTCT GGCTTGAAAG 


TTCTGGGAAT 


TTCATGTATC 


7680 


ACTAACTTTG 


CGGCCGGTTT 


CCAAGAAGAA CTCAATCACG 


AAGAAGTTGT 


AGAAGTGACT 


7740 


GAACCTCTTA 


AAGGTGATTT 


CAAAGGCTTG CTTAAAGCGA 


TTCTTGCTGA 


ATTGTAAGAA 


7800 


AAAACATTTA 


AAAGGGGGAG 


TGCCTCTGTT TTTTCAGGAT 


TGACTGCCTA 


TCCGGATTAA 


7860 


AGAAGAAACA 


GAGGAATACT 


ATGAGCTTCT TCCTGCTCTT 


ATAACTGAAA 


GAAGCGGAAG 


7920 


AATAGGTATG 


TCTGATCTGA 


TAGCCAGCAT TGTGAAAGAC 


AAGATTCTAG 


GATACTAGCA 


7980 


TTAGCTTCCT 


AGCCAAGCAG 


ACTAGTATGA TAAGGAGAGA 


TGAGAATGAA 


TTGACTTTCT 


8040 


GAATTTCTCA 


GTCTTATCAT 


A7ATACCACA ATGAGATTTC 


GCTTGAGTCT 


CCTTGTAAAT 


8100 


AAACGAAAAG 


AAAGATAAGA 


AATAATGAAA ATTGGTCAAC 


GAATTATGCG 


CTTTGGCATA 


8160 


AAAAATTAAG 


TATCGGAGTT 


GTATCTGTTG TAGTCGGCTT 


TGATTTCTAC 


CTCCAGCTGG 


8220 
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AATTTCAGCC AATGAACTAA AGCAAGATC? AACATCTGAA GTGGTAATAG GTGTGCTAGA 9280 

TTCTAAGGAG GAATTGAAAG AGTCAGAAAA TGATGCTCCA AAACTAGAAA CTCCTCTTAG 3340 

AGAGGAGCCA AGACTAGCTC CTCAAACCCT TCCGGAAGCA AGTGAAGTTC TTGAAAACAA 8400 

AAGGGAAGAG TCAAAAGTAG AGATAACATA ACCACCTCAA GCGGATGATA TCCGCAAGGT 8460 

TGTTCGGGAA TTAGCCAAGG ATATAAGTAT TACTAAGTTG TATATGACAG GTCATTCTCT 8520 

TGG'ATGTfAC CTAGCTCAGA TTGCAGCCGT TCAACCTTAC CAAAAATATC CTGATTTTTA 8530 

TAACCATGTA TTGAGGAAAC TGACAACTTT CAGTGCTCCT AAACTCATTA CTTCCAGAAC 8640 

TGTTTGGAAT GCTAAGAATG GTTTCTGGGA TGTTGGTTTG GAAAGTCGTA AATTAGCTGT 8700 

TAGTGGAAAA ATTAAGCATT ATGTGGTTGA TAATGACAAT GTTGTGACTC CCTTGATTCA 87 60 

TAATAATCGT GATATTGTTA CATTTACAGG TAATTCACCC TTTAAACACC GTTCTCGTGG 8820 

CTATTTTGAA AGTCCAATGA ATGATATTCC TAACTTTAAT ATTGGTAAAC AAGCTACCTT 8880 

GGATAAACAT GGTTATCGTG ATCCGAAATT GGATAAAGTG CGATTCTTTA AGAAACAGGC 8940 

TCTGCCTCGA TCTTCTAGTC AACCAAGCGC TGAACCAATC GAAAATATTG CCTCAGGAAA 9000 

ACAGGTTACT CAAAGTTCGA CACCTTTCGG AGGAGATGCT AGAAGAGCTG TCCATGGCAA 9060 

AGTCGATGGT AACTATGGTC ACAATTCTGT CACTCATACA AACTTCCAAT CTAAGCCTTG 9120 

GTGGCAAGTA GATTTGCCTA AAGAAGAAAC CATTCGCCAA ATCAATATTT ACAACCGAAC 9180 

AGACACTGCC CAGGATAGAT TGGCAAACTT TGATGTCATT CTTTTAGACA GTTCTGGTAA 9240 

AGAAATTGAG TGAAAACCTA TAACATCTCC TAAAGATGTG TCAGCACAAA TTACGATTAA 9300 

CCATAAAAAA CCCCGCTATG TTCGGATTGA GCTAGAAGGC TATAATGCCC TCACTCTTGC 9360 

AGAAGTTGAA GTTTTCTGCT TTATAGCTAC GAATGCTGAA ACGGCGACAC AAGTTTCTAA 9420 

GCCAGTTCAA CCAATCAGTC AGACTCCTGT GAAGGATAAA ACATTGACAA TTCAACACAG 9480 

TGCAGCTTAC ATTGCCCGCT ACTCCATAAC TTGGGAAGAA GTTCCACTAG ATAAAGATGG 9540 

AAACCAAGTT GTTCGTAGTC ATTCTTGGGA ACGAAGCGGT CGCAACCAGA CTGCAGGTTT 9600 

TGTCCTCAAC CTCCCAATCA AAGAAAATAT GAGAAATCTG CGAGTTAAGA TTGAGAAAAA 9660 

GACGGGCCTA CTATCGAATA GATGGCAAAC AATCTATGAA AACAGACCAA TTTTAGCTCA 9720 

ACCCCACCGT AAAATTACCC ATTGGGGTAC GACATTGAAT TCCAAGGTGA GTGACGATGA 9780 

TGTCTTGTAA TCTGATGGTA GAATGACAGT TAGTTTGTC7 AGTTTATAAG AAAGTACTAC 9840 

CTGAGCTTGA ATAGGACTCA GGTAGCTCTC TATGAAAGAA CAAAATTAAT ACTCAATGAA 9900 

AATCAAAGAG CAAACTAAGA AACTAGCCGC AGGTTGCTCA AAGCACTGCT TTGAGGTTGT 9960 
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AGATAAGACT 


GACGAAGTCA GTCACATATA TAATCCAAGG 


CGACGTTGAC 


GTGGTTTGAA 


10020 


GAGATTTTCG 




ACAGAAAGGT 


AGACCGCG7G 


TTCTAATTTG 


AACACCAGTA 


10080 


GAAAACTTTT 


L rTAAAAAGAA 


AAACGAAAGG 


ATGGGTAAAC 


TGTATTCGCT 


GAACTGAATA 


10140 


CGGGCGACTC 


I 1 v. 1 AAA 1 


CAAAATTAAG 


AAAGGAATTG 


ACCCCACCCT 


AAAAGTAGTG 


10200 


GGAAAAAGAT 


AG TTGATCTA 


GCGAGCATCG 


CTCACTGCGC 


CCAACTCCTA 


TTTTCCCTTC 


10260 


GCTTTTTGAT 


CGGTTTGGtA 


TCTTTCTCAA 


TATAAAATAT 


AAAATAAAGA 


AAGGTAGAGC 


10320 


GTGTGTTTTG 


AIT i GAALAL 


gAgCGGAAAA 


CTCGGAAAAT 


AGATAATCTG 


ACTGAAAAAT 


10380 


CACGATTTCT 


GGTCAGGTTC 


CTAATtTTCA 


GTCGTTTTCT 


TCTCGCTCTT 


TCTATCATAA 


10440 


ATTATGTCTA 


I LLATAlluC 


TGCTCAGCAG 


GGTGAAATTG 


CTGATAAAAT 


T CTTCrTCCT 


10500 


GGGGATCCTC 


1 I GG 1 GL i AA 


GTTTATTgCG 


GAGAATTTCC 


TTGATGATGC 


TCTTTGTTTT 


10560 


AACGAAGTGC 


G i AAGA * GIT 


TGGTTACACT 


GGTACTTACA 


AGGGTCACTG 


TGTATCTGTC 


10620 


ATGGGAACTG 


GGA I GGGAAT 


GCCATCTATT 


TCGATTTATG 


CGCGTGAGTT 


AATCGTAGAC 


10680 


TACGGTGTGA 


AGAAA rTG A T 


TCGTgTgGGA 


ACTGCAGGTT 


CTTTGAATGA 


AGAGGTTCAT 


10740 


GTTCGTGAAT 


TAGTTTTGGC 


GCAGGCGGCT 


GCAACCAACT 


CAAACATCGT 


TCGTAATGAC 


10800 


TGGCCACAGT 


ACGATTTTCC 


ACAAATTGCT 


AGCTTTGATT 


TGCTTGATAA 


AGCCTACCAT 


10860 


ATCGCCAAAA 


AAG liGG TAT 


GACTACTCAC 


GTTGGGAACG 


TTtTGTCATC 


TGATGTCTTT 


10920 


TACTCAAATT 


* ^•mmm^* & ft ft « 
AGTTTGAAAA 


GAATATCGAG 


CTTCGTAAAT 


GGGGAGTCAA 


GGCTGTGGAA 


10980 


ATGGAAGCAC 


G AG CTGTTTA 


CTATCTTGCT 


GCCCAATACC 


ATGTTGATGC 


GCTAGCTATC 


11040 


ATGACCATCT 


/"""TV" iim ft/^/~*T*»T* 

G i G A I AGCTT 


GGTCAATCCA 


GACGAAGACA 


CAACTGCAGA 


AGAACGTCAA 


11100 


AATACCTTCA 


/"■TV* ft T> JV TV™ 1 ft 

G ITjATATGAT 


GAAGGTTGGT 


TTGGAAACCT 


TGATTGCAGA 


ATAATTATAG 


11160 


CCAAAAAGGG 


GCTCTTTGTC 


AACTGTAGTG 


GGTTGAAAAA 


AAGCTAAGCT 


TGAGAAAGGA 


11220 


CAAATTTCGT 


CCTTTCTTTT 


k 1 GAT A I I CA 


GGGCGATAAA 


AATCCGTTTT 


TTGAAGTTTT 


11280 


CAAAGTTCCG 


AAAACCAAAG 


GCATTGCGCT 


TGATAAGTTT 


GATGAGATTA 


TTGGTCGCTT 


11340 


CCACTTTGGC 


ATTAGAATAG 


TGTAGTTGAA 


GGGCGTTGAC 


GATTTTCTCT 


TTCTTCTTTA 


11400 


GAAAGGTTTT 


AAAGACAGTC 


TGAAAAAGAG 


GATGAACCTG 


CTTCAGATTG 


TCCTCAATGA 


11460 


GTCCGAAAAA 


TTTCTCAGGG 


TCTTTGTTCT 


GAAAGTGAAA 


AAGTAACACT 


TGATAGATCT 


11520 


GATAGTGGTG 


TTTCAAGTCT 


TCTGAATAGC 


TTAAAATCTT 


GTCAACAATT 


TCTTTATTTG 


11580 


TTAAGTGCAT 


GCGAAAAGTA 


GGGCGATAAA 


AACGTTTATC 


GCTsArTTTA 


CGACTATCCT 


11640 


GTTGGATGAG 


TTTCCAGTAA 


CGCTTGATAG 


CCTTGTATTC 


ATGAGATTTT 


CGTTCAAACT 


11700 


GATTCATAAT 


TTGAACACGA 


AAACGACTCA 


TGGCACGGCT 


GAGATGTTGG 


ATAATATGGA 


11760 
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AACGATCTAG AACGATTTTA GCACACGGAA AAAGCTGTTT AGCCAACTCA TAGTAAGGAC 11820 

TAAACATATC CATCGTAATG ATTTTCACTT GACAACGAAC GCCTCTATCC TAGCGAAGAA 11880 

AGTGATTTCG GATGACAGCT TGTGTTCTGC CTTCAAGAAC AGTGATAATA TTAAGATTAT 11940 

CAAAATCTTG CGCAATGAAA CTCATCTTTC CCTTAGTGAA GGCATACTCA TCCCAAGACA 12 000 

TAATCTTTGG AAGCCGAGAA AAATCATGCT CAAAGTGAAA GTCATTGAGC TTGCGAATGA 12060 

CAGTTGAAGT TGAAATGGCC ACCTCATCCG CAATATCAGT CATAGAAATT TTTTCAATTA 12120 

ACTTTTGAGC AATTTTTTGG TTGATGATAC GAGGGATTTG GTGATTTTTC TTTACCAGGG 12130 

GAGTCTCAGC AAC CATC ATT TTTGAAsAGT GATAGCACTT GAAACGGCGT TTTCTAAGGA 12240 

GAATTCTAGA AGGCATACCA GTTGTTTCGA GGTAAGGGAT CTTAGACGGT TTTTGAAAGT 12 300 

CATrTTTCTT CATTAGACTT CCACAATCAG GGCAAGATGG AGCCTCATAA TCCAGCTTAG 12 3 60 

CGATAATTTC TTTGTGGGTA TC CAT ATTG A TGATATCTAG AATCTTGATG TTTGGGTCTT 124 20 

TAATATCGAG CA GTT TTCTG ATAAAATGTA ATTGTTCCAT ATGATTCTTT CTAATGAGTT 12 480 

GTTTTGTCGC TTTTCATTAT AGGTCATATG GGACTTTTTT TCTACACAAA AATAGGCTCC 12 540 

ATAATATCTA TAGTGGATTT ACCCACTACA AATATTATAG AGCCCAAAAA GGAAGCCCTT 12 600 

TATGAATTGT AGGACTTCCT TTTCTTATCC AGAAATTGAT CTAGCTCTCT CTGATTTCGA 12 660 

AGAATAGTGA CTTTATGTGA ATATTCTTGG CAAAGTTTTT GGTAATTTTC TTTTTGACTT 12720 

TTGCGGACGC CCATCCCAAA GAATCCATCT GATAAACTCC CACTCAAAGC GTTCAGGGCA 12780 

ATCTACCGCC ATACTTTCTC TGACTTTTCC ACGGTATTTA AGATAACCCT TAAAGGCTCT 12840 

AAAGAGACAG GTCAATCGCG AAAAATTGAG AAAGATGATT TGGTCAGCTT CTTGCATTCG 12900 

TTCTTGGTAG TAGCACCAAG AATAATTACC ATCGATGACC CAAGCTTTAT GCTTGGTGAG 12960 

AAAGTTTTTT ATCTCGCTTA ACATCCATTC GCAGTCACTG TCTTGCCAAC CAGGTTGAAA 13020 

TTGGAGTGTG TCCATGTGCA GTTTTGGAAT GGAGTAGTAG TTAGATAACT TTTCTGCTAT 13080 

AGTTGACTTA CCAGAACCAG AATATCCGAT AATTGCGATT TTCATTTTCT ACCTTTTCCT 1314 0 

ATTTGGAGAC AAAAAAACAG CCTCTATGGA CTGTTTCTTA TTTAACAAGT TTAGCTGAAA 13 200 

GACGACCTTT ATCGCGGCTT GCTTTGTTTT TGTGAATCAA ACCTTTACTT TCTGCTTTAT 13 260 

CGATAGCTGA GCTAGCAGCA CGGAAAAGTT CTTCAGATGG GTTTGCTTCG AAAGCTTTTA 13 320 

TAGCAGTACG CATAGCTGAT TTTTGAGCTG AGTTCTTTTC GATTCGTCTA ACGTTCAATT 13 380 

CAGCGCGTTT GATAGCTGAT TTAATGTTTG CCAATGGTCT TACCTCCATA TTTACTAACT 134 4C 
(2) INFORMATION FOR SEQ ID NO: 129: 



WO 98/18931 



PCT/US97/19588 



890 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 8512 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 129: 



CCTTTTTTCA 


AAAACTAGAT 


ACTAGTCTAT 


CAAAAGTAGG 


AAAGGGTTTC 


AAGAAAATTG 


60 


ATTGGAAATT 


TTTTGAAAAT 


CATAGAACTA 


TTAGCTAATC 


CCTAGTATTG 


AAAAGACTGG 


120 


ATAGCTTCTT 


TCAGGTCATC 


TTGTAAACTA 


TTTCTCTGGT 


CAAGTTGGAC 


ATAGACTTCC 


180 


ACCAGACAGG 


ATCTAAAGTT 


GGAAAATTTG 


TAAAAATCCT 


CCCTTTCTTC 


TATCGGAAAA 


240 


TCAACACTTT 


TTATCCAAGA 


ACCTACTTGT 


TCTTGCTCCA 


ACTTCCCTTG 


TAAAATAGGT 


300 


TCATAGATCA 


CTCTTCCTAA 


ACGCCAATCC 


TCATCATCTG 


TAAAGCGAAT 


CGACATTCTT 


360 


TTAAATAGTT 


GGCCAAGTAT 


ATCAAATACT 


TCATGAACTC 


TGTTTTTAGG 


AAAGTCTGGA 


420 


TGACAAACCA 


CCTCTGTCAG 


TAAATCGGCT 


CCATGTGCAA 


AACCGTGAAC 


CCAACCATAC 


480 


TGACTTGAGA 


AACCCCTTGT 


ATCCTTTTCT 


TTTGAAAGAT 


ACTGCAAGCC 


TTCATTTAAA 


540 


AGGACATTAC 


GAATTTCTGG 


AGAAGGATTT 


CCCAAATGAT 


CAAACAACCA 


CTGGATTTCT 


600 


TCCTGGTTAT 


AATTTGGTTT 


TTCTTCTGCT 


ATTTTTCTTA 


GTAAATCTTG 


ATACATGGTC 


660 


AATACCTCTA 


CATTTCTAGC 


AACTGTTCAA 


AAAGGCAGTC 


TTAAATGACT 


CAATATTGAA 


720 


TTCTCAATTA 


AATACAATCT 


GATATAAAAT 


GACGTAAATA 


ACTATCAATA 


CCAGTTCTAC 


780 


AGTAAGTTCA 


AATTTAACAT 


CACGACCTTC 


AACGACATTT 


TTGAAAATAG 


CTACAACTAA 


840 


GACAAATAGA 


ATGACGCTTA 


ACAAGCCCAT 


AAACATCATT 


CTAAAAAATT 


TTTCTATTCC 


900 


CCTACTCTCC 


CAACTCAGCA 


CTATAGGAGA 


TAATCTGGTC 


AACTGTGTCA 


GACAAGAATT 


960 


GGATGGTATC 


ACGGAGTGGT 


TTGTCTGTTG 


AAATATCAGC 


ACCGATAATC 


ATGGCTGACT 


1020 


CAAGTGGTGT 


CTTGCTACCA 


CCTGATTTGA 


GGAGATTGAG 


CCAGTCTTCA 


GCTCCAGTTT 


1080 


CAGAATGTTT 


TAGATGAAGG 


TAACCAGCAG 


TCGAGATAAC 


TAGTCCTGCT 


GAGTAAGTGT 


1140 


AACTATACAA 


GCCCATATAG 


TAGTGAGCTT 


GGCGCATCCA 


AGTCAGAGTT 


GCATCATCGT 


1200 


CAATTTCAAT 


AGCATCTCCC 


CAGAAATCCG 


TCAAAACTTC 


CTTCATAATG 


CTGTTGAGCT 


1260 


TGCTTGCTCC 


AAAGGTCTCC 


CCTTCTTCAA 


TCAATGTATA 


CACCTTACGC 


TGGAAGGCGG 


1320 


CTTCCAAGAG 


GTGGGTGATA 


AAGTTATGGA 


AGTAGGTGTC 


TGTCAAGCGA 


TGAGCCAGAG 


1380 


CGAAGCGTTT 


TTGACGTGGG 


TCATTAGACT 


GGTTCTCCAA 


GTAATCACTG 


ACTAGCAATT 


1440 


CATTGAAGGT 


TGACGGTGCT 


TCAACATAGT 


AGGTCGACAT 


ATGGGCATTG 


AAGTAACTTT 


1500 
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GATCATTCTC TGAAAAGATG AATTGACCAG AATGCCCGAT TTCATGAATC AAGGTATAGA 1560 
CATCGCTCAA ACGGCCTGTC CAGCTCATGA GTACATAACG GTGTACGCGA TATGGGTCCG 1620 
CCGCATAACC ACCGGAATCC TTGCCACTGT TAGCAGCAAA GTCCACCCAG CGCTCTTCTT 1680 
GGTAACGAGC AACTTCCTGA CAATATTCTT GCCCCAAAGG TTCTACCGAC TTCATGACCA 1740 
AATCATAGGC ATCGTCAATA GTCACTTCAG GATTCAGGGC GCTGTCCAAG TCCAATTTCG 1800 
AGTCTGC AAA~GGTC ATCTTT TCAAGACCAT " TT ACCTTCCC AAC ATGCTTG AGGTATCTCT 1860 
GAGCGACTGG TGCAAAGTCC TTCATCATGA CGTCAATCTC GCGGTCAAAC ATGACACGGT 1920 

CCACTTCTTC TTCAGCTAGA AGATAGTCAA AGACAGAGTC GTATCCCTTC ATATCAGCCA 19 80 

AGAGTTTTTC AGACTTGACC TGAGCCAGAT AGGCTGCTGC AGCCGTATTT TGGTGCTTAC 2040 

GAAGTCCCTC TGAGAAGGAA CGGAAGGATT TCTCACGAAC CTCAGCATCC TCATGGTTTT 2100 

GGTAGAAATT CTCATAGGTC ACAAAGCTGT TTTTGTAGGT CTTGCCATGG GCTTCAAAGT 2160 

CAGCCATTTC AAAATCCCCA GCTCCCATCT TAGTATAAAT GTCCTGCCCA CTGTAGAAAA 2220 

CTTCACCGAG ATTTGTCAAG GCCTTCTCCA CATCTGCCCC TAAGTAGTGG GCTTTTTTGA 22 80 

TTTTAGCCTG ACGAATGGCA GCTGTTAAAT GTGGCAATTT ACCCAAACGG TCCAAGACTT 2 340 

CCTCATCTGC TGCCACCAAG GCATCGTCAA AGAAGGTCAA GGCTACGCTG GCATCTGTTT 2400 

CAAATTCCAT CCCAGCTTGG GCAATATTGG CAAATTCGTC ATTGCTATAG TCCGTCGTCT 2460 

GAGGCATAAA ACCATAGTTG CCAATATCGC TCATCTGAAT GTAGATCTGT TCCAATTCCG 2 520 

CAAAGGCCTT CTCGAAATCC TCAAAAGTCT GAAGATTGCC CTTGTAATCA CGGCTAAACT 2580 

GGTTGATGTC TTCGCGAGCT TTCTCGATTG CACGCAAGAA ATCCTCACGG TCTTGCTATA 2 640 

GGGCTGTTAA GTCCCAGAGT TCCTTCTCTG GAAATTCTGA ACGGTGTTTT TCTTCCATTT 2700 

TCTTCCTCTT ATTTCTCTAA TTCTACTAAA ACACTAAGGG CTGATAAAGC CTAAAGCCC7 27 60 

GCTGTTTCTG CTCGCAAAAT. ACGAGGACCT AGGCCTGCCA AAACCGCTCC TTTAGCTTCA 2820 

AAACTTTCGA TTTCTGCAGG TGAGAGACCG CCTTCTGGAC CAAAGATAAA GAGCAGTTTG 2880 

GCTCCTCTTT CAAGACCAGT GACTGCTTGC AGAAGCGCAG CCCCTTCTCC TTCTTTACCT 2940 

GATTCTTCAT AGGCTACTAT GATAGAGTCA AACTGGTCCA GCTGAGCTAG AAAATCTGCT 3000 

TTTTTCTCGA AAAGTTTAAT ACTTGGTACA ATATTACGCT TGCTTTGCTC GGCTGCTCCA 3060 

AGGGCAATTT TTTCTAGTTT TTCAACTTTT TTACCCAATT TCTTGCCATC CCACTTGGCA 3120 

ACTGACCAGT CTGCAGGAAA GGCCCAGATT TGGCTAGCCC CCAGTTCGGT TACTTTTTGA 3180 

CCGATGAACT CCAGCTTGTC TCCCTTGGGA AATCCAGATG CGATGGTCAC TTGGACTGGT 3240 
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AGTTCCACAT TGTCATTTAA TTCTTGCACC AACTCAAACT GACGATTTTC CATATCCAGC 3 300 

ACGCGCGCCA AGCGCTTGAT GCCATCATCA AAGACTAAGG TAACCTCATC CTCTTCTTTC 3 360 

AAGCGCATAA CCTGAAACAT ATGCTTACTG GTTTCCTTGT CCTCGATAGT GACAGGAGAG 3 420 

ATAGCACTGC CTTTTACAAA ATACTGCTGC ATGCTAGCCT CCAATCACAC CAGAGATATC 3 480 

CTTGGTTTTC TTAAAGACAC AGGTATTCCA TTCCCCTTGA ACCATGTGAG TTTCGAGGAA 3540 

AAATCCAGCT GACTCAGCCG ACTGGCGCAC CATGTCCAAC TTGTCCTTGA TAATGCCACT 3 600 

CATGATCAGC TAGCCTTCAT CCTTTACCAA GCGATAAGCA TCGTCTATTA GATGAATGAG 3 660 

GATATCCGCC AAGATATTAG CCACAATCAC ATCTGCCTCA ATTTCCACAC CCTTAAGCAA 3720 

ATCTCCAGCC GCTACATGGA TATTTTCCAT GCCAGCGTTG AGCTCAATAT TTTCCTGAGC 3 780 

CACACGAACC GCCACATCAT CCAGGTCATA GGCGAAAATT TCTTTAGCCC CCAGAAGCGA 3840 

GCTGGCAATA GAGAGAACCC CTGAACCACT CCCCACATCT AGCACCGTTT CGCCACCACG 3900 

AAGAACCTGT TCCAAGGCAA AAAGGCTCAT CTTGGTAGTT GGGTGGGTTC CAGTACCAAA 3960 

AGCCATGCCA GGATCCAGCT TGATAATCAT TTCCCCCGCA GTCGCCTCAT AGTCTGTCCA 4020 

AGAGGGAACG ATGGTCAAAT CATGAGTGAT ACGAGCAGGT TCATAGTATT TCTTCCAGTT 4080 

GTCTGCCCAG TCTTCCTCAC CCAAGGCAGT CGTACCTATT TTTAACTC7C CCAAATCCAT 4140 

AAAATCTCTC AATTCTGCTA GACGAGCCTG CAAATCCGCC TCAACCACTG TCACATCCAC 4200 

CGTGTCACGG TAGTAGGCTG TCACTACGAT TTCTTCTTGC TGCTCCACCT CTCGGAAAAT 4260 

CTCTCCAAAG CGGTCCACAT TTCCCACATA GTCCATACTG TCTTCGATTC CGACTCCTTC 4 320 

CGCTCCCAGC TCAATCAAGA CATTGCAAAC CAACTCCTCT CCCTCACCCT TCACTGTAAC 4 380 

TTTTAACTCT TGCCATGTTT CCATTATTAA TACCAAGCCC GTAAAACACA AAACCAAAAT 4440 

AGGAAATTCT CTGAAGACGC TTCTGTCTAA GACAACTTTA TCTTTTTGGC ACAGTCTTTA 4 500 

GGGCGGGTTC AGTTTAGAAA TGTAACTGAA CCATCCTTTC TAATCACTTA CTTTTAAATA 4 560 

ATCTTTTAAT CTCTCTTGCA ACTGAGGCAC AACTTGACTG GAACTAAGAA ATTCCTCAAC 4 620 

ATTCATCAGC TGATAGCCCT GTCCTTCA7C TCCGAAGATG ATATTCTCAA ATTGTTCTTG 4 680 

TCTTAGCTGA CCAACCATAA AGACCGATTT CTTGCCTTTA AAAATTACCC TAGGATAAAT 474 0 

CTTGCTCCAA AGCAGACAGT CTTCATCTAA ATGAATTCCC AGTTCCTCAT AAACTTCACG 4800 

CCGAGCGCAT TCAAAAGGGC TTTCGTCCCC TTCACGGCCA CCACCTGGCA GTTCCCACAT 4 860 

ATTGGCCCAG GGAATACTTG CCTTATCATC GCGTAACATA GTCAAAAGCT TATCCCCACA 4 920 

AAACAAAGCA ATCTTCCAAC CTGTGAAATC AGAAATTTCT AGTTCCATCT TCAGTTCCTT 4980 

CTAACATTTC CTTTTCCAGC TCGGCTAACC AGTTTTCATA ATATCTTTTC TCATCCCTCA 5040 
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ACATTCGACT ACTATCCATT TTCTGTCTAG CAATCTTGAG AGCCTTACGA GT7CGATCTA 5100 
CATCTTTCTT CACCTTTAAT TGATACCAGG CTTGTATCAC TTCAACATTG GACAGTTTGA 5160 
GAG AC AG AAA CGATTTGACC TGTCGAATAC TAGCATATTG CTCCGCTTGC TCAAAATCTC 5220 

CTTCCAACAA GGCGATATGA AGCAGGGATA GTTGGGCAAC TGTCTGCATC ATCGGAGTAG 5280 

TTCTCCTCTC AAGTAATGCT TGAAACTGCT GTTTAGCTAC TTCTTCCTTC CCTTCCAAAA 5340 

TGGAAACTTC ACCTTGCATA CCTAATACAC CATCCGCAAA ACTCCCTCGT GCATCCTCAG 5400 

CAACTGCTTG AACAAAGTCT TTCAAATCAT ATTCTTGAGG AGCTAGCAAG GTCTGGGCAG 54 60 

AATGTCTCAA TACCAGGTAG GCGTATTTGG TATTTTCAGG GTGTTGTAGT AATTCCCAAA 5 520 

TTTTTGCTCC ATCGGTGATG TCGACTGGCA AAATGTTATT TAGGAAGAAA GATAAATTAA 5 530 

GAAAAATCCA AGTCCCTGCA AAATACCAGC TTCTTGTCAA AAATCCAAAC AATATCGCCA 5640 

ATAATATCAA GCCGAGATGA ACCATCAAGC CTCCTGAAAG CATCAGGATG ATTCTTTGAT 5700 

CGCTTTCATC CTCTTTTAAA CCAATGTATT GAGCACCAAC ATTTTTCAGA ATGGCTGTTC 57 60 

TACTAAGATG AAACCTGCCT GACTTTTTGG TCAAAATAAA ATGTCCTAAT CCAAAAGCCA 5820 

CCACCCGATA GCCTGTCAAC TAGCCACAAA AAGCA7GACC CAGCTCATGA AGAATAAAGA 5880 

TTAAATACAT GCTTAGAAGA GCGAAGGCAT AACCAAAAGT AAAGGCTAAA ACTGCGGAAT 5940 

ACCCCAACTC TGCAAATGCG ATTGTTCCAC AAGCAAAAGC TAGCATAATA AAGACAACAC 6000 

CTAGCACATA AACCAAATAA GTCCCAATTT TCTTCATAAC ACCTCCAACC AACTCCTAGT 6060 

ATCTTGGATA AGGATAAAAT TCTCCCTTTT CCAAGCCAAT TTTTCCTTCT TCAAACACTT 6120 

CTTGGTTCCA TTCCATGACA AATTCCTCTG CTTCTGGGTC TTCCAAAAAG TCCATGAGGA 6180 

CATCTAGCCC AACCTCAGCA GTATCTTTAA GGAAAAGCGC AAAATAAGCT AAAAATTCAC 6240 

GGGAAAATCC TTTTTTAGGC AGGTAAGGAA TAACAGTCAA ATAGTCTTCC TCATTGACTG 6 300 

TTGACTTGGC AGGATTGTAG AAAAGGACCG CTTCCTCAAA AAGAATGTCA TCTCATGAAA 63 60 

CCTCTCCGTC TTCATCCACC ATCTCCACAC CGCAGCATTT TGCGCTTCCA ATAGAAAACT 6420 

CACTTCTACC GCATGGTTGC GTTTGTCCCA GCTAATCTCA AAGTCAAAGG GAAAGTTCTT 6480 

GTCCAACTCT TCCTCTAAAA TATCTAAAAA TCCGTATGTT GCCATTTTGT CCTCTTTCTA 6540 

TGCGACTCTT TAATCGCCCC GATTGCTCGG AAATATGCTA AAATAGATAC TACCATCTTA 6600 

CCACAAAATT ATTTTATGTC CTAATTATAC CATATTACCT CATTTAAACC CTTGGTATCA 6 660 

GTCATTTTCT TAAAAGTCTG ATTTCTTCAT TTCTCATAAA AATCAATATA AAAAGCCCTC 6720 

GAAAGCGCTA ATAAATCTAT AAAATCAATA GGCGAGTAAC TAGCACAAGT GGACGTGCTT 6780 
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TTTTATTGAC 


TATTACCACG 


ATACCACGCT 


TAATCTTAGG 


CTTGAACTTT 


CTTATCTGCA 


6840 


ATAGCGTCTG 


TCAAAGTCTG 


AGAAAAGTTA 


ACCCCCA7TT 


CTCGTCCCAA 


CTTATCTGCC 


6900 


CATTTTGGTA 


TGGTCAAAGT 


CTTTTTAATG 


GGTTCCTGAC 


TTCCTAGGTA 


TTCTGATACA 


6960 


TCAACAGATA 


CCATAGAAAT 


AAAAGATTTA 


TCAAGGTCAT 


AGGTTGACAC 


GAAATCTTCA 


7020 


TCATCTTTAA 


AAGGATCATT 


ATCAATTAAA 


GACAAGCTAT 


TGATATCTGA 


TGGCTGAGGT 


7C80 


AACTCTCCAT 


CACTCTCTAT 


CAAATCTGCA 


ACAGTTATCC 


CTAGCCACTC 


CGACCCCATA 


7140 


CCCAAAGCC? 


CAGAAATCCC 


CTCTCCTTGT 


GTAGCTGAGT 


ATTCAAAATC 


TGGGAAATGG 


7200 


ACAAAATAAG 


TCGCTTCTGT 


TCCCTCTGTG 


TCGTCATAAT 


AAAATAAAGC 


TGGATACGTA 


7260 


ACTAACATTT 


CACTACCTCC 


ATATCAAAAA 


GCAGGGACTG 


AATTTTACAA 


CCCACCTTCC 


7320 


TTTCTTATCC 


CTCTTTCAGT 


GTACTTATTC 


AGCTCACCAT 


GAAGGATTGT 


GATAGGTCTT 


7380 


TCCCCTTGCT 


TTTCCATTTT 


AATATGGGAG 


CCTTTACCGC 


CTCTAGTCTT 


TATCCAACCA 


7440 


TGGGCCGTAA 


GGAGTTTAAC 


CATCTCTTTT 


TGTGTCATAG 


GCATAGCGCT 


TTTACCTCCT 


7500 


GACAACACCA 


TTATAACACG 


TGTTACACGT 


ATTGTAAAGG 


AGTGATACTT 


ATTATTCTAT 


7560 


TATACATAAA 


AGCCCCTAGA 


TGTGGTTCTA 


AGGGAAGCCA 


ATTTATTCAT 


ACCTATTTT T 


7620 


CTAATGAGTA 


GTAAAAACTG 


CTTCTTTATC 


GAGCAATTCA 


TCATCTGTAT 


ACTCAATTCT 


7680 


AAAAGTATCT 


CGATCTAAGA 


CAGATTGAGG 


CGGAGTTGAA 


TGAATCATAG 


GAACACTGCG 


7740 


TACT CT AT AT 


TTTTTATCTC 


CAATTTTTAC 


AAACTGATAC 


TCTTCCAAAA 


TCAAATTCAA 


7800 


ACCACGTCAA 


CGTCGCCTTA 


CCCTACTCAA 


GTACACcCTG 


CGGCTAGTTT 


CCTAGTTTGC 


7860 


TCTTTGATTT 


TCATTGAGTA 


TGATTAACTC 


TCAAGTCTTC 


GAAATCAGGA 


TTTTCAACAG 


7920 


TTATTACAAG 


GAGGCGATTT 


ACTACTTCAA 


AAACATCAAT 


TATTCTATTT 


TTCATATTTT 


7980 


TTCAACCCAT 


TATTAGAATG 


AACTTCTTGC 


TAAGCAAAAT 


CAAGTTTAGA 


TTTAATGTTT 


3040 


TCGTACAAAT 


CTAAAATCTC 


TTTTGGAGTA 


TCTTCCCGGA 


AGAAAAGTTT 


TCTTTTCCCT 


8100 


GAAATAACTT 


GATCACTAAG 


AATCCAATGA 


CGAATTTGTT 


TTGTAAAAAT 


CAAAATTTCC 


8160 


TGACTTGGTA 


CTTCCATCAT 


TTCCATTGCT 


TATCACCTCT 


CTTTTCATTA 


TAGTTCATAC 


8220 


AATGACATTC 


AGCAATATTA 


TTTCTCAAGT 


CAGCACTTCC 


ACTTCTTTAG 


GCTCAACTAT 


8280 


CCTATTTTGA 


GCTTTAAGGA 


AAATCAAATC 


TCTCATGCTG 


ATACCTCTCC 


TCATTAAATT 


8340 


AAATAGTAAA 


AAACATTCTA 


TCTCACTCCC 


TGATTATTAC 


AAAACCATTG 


AAATATCACA 


8400 


ACTAATAGGC 


TAGAATGGAC 


ATAGTAAGAT 


ATAGTAGATG 


AGTCATTC7A 


CTCAAATCCA 


8460 


CGTTAGAAAG 


GACTCCTATG 


CCAGACAATC 


TCGCCGTTCG 


CATGCGCCCn 


GG 


8512 


(2) INFORMATION FOR SEQ ID NO: 130: 
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(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

CTCGTTTCAA- GGTTGAGTCT""CTTGCAAAT^ C TTGTTCGCGT TCCTCCTTTT GCCAAGGCAT 60 

CTCTCCCATG GTTGGTGCcA GCCATTGTTG GAATCTTGCT CTCATTGGTT CTACCAAACA 12 0 

AGCAAGAAAG CGATGTTTTT GAAATGGAAT AATCACTTAA ATCACTTTTG TAGCCAAGTC 180 

TACAGGAGTG ATTJcTCTTTT TTTATCCGAT GATAAATGTG TTATAATAGG TAGCGAAAGA 2 40 

GGTGAAGAAA TGAATCAAAC AGTAGAATAT ATCAAAGAAC TGACAGCCAT TGCGtCGCCA 300 

ACACCCTTTA CTCGTGAGAT TGCGGACTAT TTAGTCAAGA CTCTAGAAGG TTTTGGTTAC 360 

CAGCCGGTTC GCACATCCAA GGGCGGTGTC AATGTAACTA TTAAAGGTCA AAATGATGAG 420 

CAACATCGCT ATGTGACTGC CCATGTAGAT ACGCTTGGTG CTATTGTCCG TGCTGTCAAA 480 

CCAGACGGCC GTCTCAAAAT GGACCGTATC GGTGGCTTTC CTTGGAACAT GATTGAAGGA 540 

GAAAACTGTA CCATTCATGT GGCTAGCACA GGTGAAAAAG TATCAGGAAC CATCCTCATC 600 

CACCAAACTT CTTGCCATGT CTATAAGGAT GCAGGAACTG CAGAACGCAC GCAAGACAAT 660 

ATGGAAGTGC GTTTCGACGC CAAAGTAACT AGTGAAAAAG AAACTCGTGC TCTTGGCATT 720 

GAGGTCGGTG ATTTTATCAG TTTTGACCCA CGAACTGTCG TGACAGAGAC ACGTTTTATC 780 

AAGTCTCGCC ATTTGGATG A CAAGGTCAGT GCGCCGATTT TGCTCAATCT CCTTCGCATT 840 

TATAAGGAAG AGAAGATTGA ATTGCCCGTA ACAACTCATT TTGCTTTTTC AGTCTTTGAA 900 

GAAGTGGGAC ACCGTCCAAA CTCTAACATT CCTGCTCAGG TAGTAGAATA TCTGGCTGTG 960 

GATATGGGAG CCATGGGAGA TGACCAGCAA ACAGACGAAT ATACAGTGTC TATCTGTGTC 1020 

AAGGATGCTT CTGGACCTTA TCACTATGAC TTCCGTCAAC ATTTGGTGGC TTTGGCGAAA 1080 

GAGCAAGATA TTCCATTTAA GCTGGATATC TATCCATTTT ATGGTTCGGA CGCTTCAGCG 1140 

GCTATGTCTG CAGGGGCAGA AGTCAAACAC GCCCTTCTCG GTGCTGGTAT AGAGTCTAGC 1200 

CATTCCTATG AGCGTACCCA TATTGACTCG GTGATCGCAA CAGAACGAAT GGTCGATGCT 12 60 

TATCTTAAGA GCACGTTGGT GGACTAATAT GTGCCTTATT TGTCAGAGAA TTGACCTCAT 1320 

CAAGAAGGAA GAAAATCCTT ACTTTGTCAA AGAGTTGGAA ACAGGCTATC TTCTGCTTGC 13 80 

AGACCACCAG TATTTTGAAG GCTATAGTCT CTTTCTAGCC AAGGAGCATG TCAGCGAATT 1440 
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GCACCATTTG AAAAAGGAGA CAAGACTCCG TTTTCTAGAA GAAATGAGTT TAGTCCAAGA 1500 

CGCAGTTGCC AAGGCCTTTG CTGCTGAGAA AATGAATATC GAACTGCTAG GAAATGGCGA 1560 

TGCTCATCTT CATTGGCATC TGTTTCCACC ACGGACAGGT GATATGAATG GTCATGGTCT 1620 

CAAGGGTCGT GGACCAGTCT GGTGGGTTCC CTTTGAAGAA ATGACAGCAG AAACCTGCCA 1680 

AGCAAAACCG GATCAGATTA AAAGATTAGT CAAACGTTTA TCGTCAGAAG TAGATAAACT 1740 

ATTAGAAATA AAGGAGTAGA AATGAAGAAA AGATACCTAG TCTTGACAGC TTTGCTAGCC 1800 

TTGAGTCTAG CAGCTTGTTC ACAAGAAAAA ACAAAAAATG AAGATGGAGA AACTAAGACA 18 60 

GAACAGACAG CCAAAGCTGA TGGAACAGTC GGTAGTAAGT CTCAAGGAGC TGCCCAGAAG 1920 

AAAGCAGAAG TGGTCAATAA AGGTGATTAC TACAGCATTC AAGGGAAATA CGATGAAATC 1980 

ATCGTAGCCA ACAAACACTA TCCATTGTCT AAAGACTATA ATCCAGGGGA AAATCCAACA 2040 

GCCAAGGCAG AGTTGGTCAA ACTCATCAAA GCGATGCAAG AGGCAGGTTT CCCTATTAGT 2100 

GATCATTACA GTGGTTTTAG AAGTTATGAA ACTCAGACCA AGCTCTATCA AGATTATGTC 2160 

AACCAAGATG GAAAGGCAGC AGCTGACCGT TACTCTGCCC GTCCTGGCTA TAGCGAACAC 2220 

CAGACAGGCT TCGCCTTTGA TGTGATTGGG ACTGATGGTG ATTTGGTGAC ACAAGAAAAA 2280 

GCAGCCCAAT GGCTCTTGCA TCATGCAGCT CATTATGGCT TTGTTGTCCG TTATCTCAAA 23 40 

GGCAAGGAAA AGGAAACAGG CTATATGGCT GAAGAATGGC ACCTGCGTTA TGTAGGAAAA 2400 

GAAGCTAAAG AAATTGCTGC AAGTGGTCTC AGTTTGGAAG AATACTATGG CTTTGAAGGC 24 60 

GG AG ACT A CG TCGATTAATA CTCTTCGAAA ATCTCTTCAA ACCACGTCAG CGTCGCCTTA 2520 

CCTACTGACT GCGTCGGTTC TATTCACAAC CTCAAAACAG TGTTTTGAGT cCATTCGTCA 2580 

GTTTTATCTG CAACCTCAAA GCTGTACTTT GAGCAstGCG GCTAGCTTCC TAGTTTGCTC 2640 

TTTGATTTTC ATTGAGTACA AAAAGTAAAC TTTTCTCTTG CAATTCCAGA TAAATAGTGT 2700 

ATAATGGATG GGTATGTGAA AAACATACTT GTGGGAGGTA AAAATCTCTA ATTACCGCCA 2760 

AAACCACAAA GGAGGATTTA AAAATGGCTA AAAAAGTCGA AAAACTTGTA AAATTGCAAA 2820 

TCCCTGCTGG TAAAGCTACA CCAGCTCCAC CGGTTGGACC TGCTCTTGG 2869 
(2) INFORMATION FOR SEQ ID NO: 131: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6186 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
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CTGAATCCCT TATAGGAGTC CAGTAACTTT TTAGCCTCTA CTTTGCCTTC ATAGGCAGCT 
TCAACATCAT TAAAAAAAGA ArCCACTGAA GCAAGTTCTT CAGTGCTCCA CCACAAATCT 
AGTGGGTAAC TATACTGTTT GTTCATTAAC TAATACCAGC TCTCATTCTT GCTTCTTTTA 
GTTCTTGCTT ACGATAACTA CGAGGGAGAA AAGCACGAAT CTCATCTTCA TTAAAACCCA 
TTTGCATACG CTTGGCATCA ATAATAATTG GACGACGCAA AAGACTAGGA TACTCCTCAA 
TCAAATGAAG CAATTCCGAT ACCGAAATAC TCTCTACATC AATATTCAAT - TTTTGAAAAA 
TTTTTGAACG AGTTGAAATG ATGTCATCAG TACCATTTTC GGTCAAGGAA AGGATGTGTT 
GCAATTCTTT TCTTGTTAAA GGACTGGTCA TAATATTGTG TTCCACAAAG GGAACTTATG 
TTTTTCTAAC CAGGCCTTAG CCTTACGACA TGATGTACAG CTCGGTGATA GAAATAGTGT 
AATCATGCTT TTCTCTTCTT ATCTATACTT TGCTACTTCT ATTATACAAA AAAATAAAGC 
GCTTGACTAG GGATTTTTAG AAAAAAAGCC TATTTTTTCA AGAAAAATAG GCTTTTTGCG 
AACGATTGAC ACAATTGGAT TTGGTTAATT CACTCTTAAC GATGGTTTTA AACGATATAT 
ATTTTTATAT ATGTAAATTA AAAACATCTT TCCTTTCACT TCCTACGACT TTTCAGATAC 
AGATAGCCAA AGAAGTTTTC ATAGAGGCCA AAAAAGAGGA GGAAGCCATG AAGAAACAAG 
GTCTCTGGCA AAATCATAAT AACAGGATCC TTGGCTGGAT CAAAAACCCA GGTATCATCT 
CCCACAAAGA GAATTTGATG GAAAACAGTA AAGAATTGGT CAAAACCAAT CAAAACTCCC 
CCAAGTCCAA TCATCACAGG TAAGACTACT AGAGCCAGGA CACTTTTTCG ATAAAGACAC 
AAAAAGTCCT TTTTCACAAT CCTATTGACA AAGACATAGA AACTTGCCAG TGTCACTAGA 
GCTACTAGCT GAACCAAATG AAAGAGATTC TTGACCACTG CGAAATGGTG CAGACCAGCT 
GCTGACGAAC GAAAATCAGG CATCTGTAAG ACCTGACTAA AAGGATTCGT CAGATAATTC 
ATCAAGATAT GAAAATTGTA TTGAATGCTT TCTGGTTTTA GATAGACTCG ATTCGTTAAG 
TTTAGCCACT GAATCTCCAT AGGATAGAAA ATCCAAGCCA GATAAATGGT CAGAAGGATG 
GAGAGGGAGA GGAGAAAGAG CATAGAGCCC CAAAAGATCA ATTTAGTTTT CATCAAAATC 
CCACTCCGCA AGGCTAGAAA CCACATGTGT CGGTGCGATT GGCAGGCCAG CTACTTCTTC 
TGCCTTAGTA AAACCTGTCG TCACCAAGAG CGTTGGAATG CCATTGTCAA TCCCAGCCCG 
AATATCAGTC AAATAATTGT CCCCAACCAT GATTAACTCT TCACGTTCCA AACCTAAGTG 
CTCAACCGCC TTGTCCATAA TGATGGCATT TGGTTTTCCG ATATAAACCG GCTTCACTCG l6 20 
TGTCGCTACT TCAAGCAGCG TAATCAGTGA CCCAGCACCT GGCAAAAGAC CGCGTTCCGT 
CGGCATGTTG AGGTCAGGAT TGGTTCCGAT AAAATGGGCA CCCTTTTGAA TAGCAAGACT 



60 
120 
130 
240 
300 
360 
4 20 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
132C 
1380 
1440 
1500 
1560 



1680 
1740 
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TGCTCTGGCA AATTTTTCAT AGTCCACTTG CCAATCCAGA CCAACTACCA CGTAGGCAGG 1300 

TTTTTCCTTC TCTTCCACAT AACCAGCCGC CTTGATGGCT TCCTTGAGTC CTGCTTCTCC 1860 

GACGACATAG ACGGTCTTTT CAAGCCCCAA ATCATTCATA TAGTCGATGG TTGCCAAAGT 1920 

CGCTGTGTAG ACAGTCGATA GGGGCGTATC GATATTAAAA TTCTGAGCCA ACATCTCCTT 1980 

AACACTCTCT GGAGTGCGGG TTGTATTGTT GGTTACAAAG AGATAGGGAA TGTCCCGCTT 2040 

TTGCAATTCA TGAACAAAAG TCTCTCCAGC AGGGATTCGG TCTTTCCCCT TATAAATGGT 2100 

TCCGTCTAAA TCAATTAAAT AGCCTTTATA TTTCATCTAT TTCTCCCTAA CCCTTTTTTA 2150 

TTTCTTGCCA AGTAATGATT GCTTGGGCAT TGATAACCCC ATCACTTGTA ATTTCATCCT 2 220 

TGCTTTCCAG TCCAGTCCGT TCAACAGCCG ATGTAATCAC CCCACCTGGT CGAACTTCCT 2290 

TGACATACTT GAGGTTGATT TTCTTGGGAA TATAGTGGGT CAAAAAATCC GCTCCCATGA 2340 

CCTCAAAAAT CCAGTCCAAC TATTTACTGT TATTGACATG ACCATTCATA TCCAAGTCGT 2400 

AAAAACGAAC ATCGTAATCC TTGCTGATCG GTTCTTCCAA GCACTCATAC TTCGGTCCAC 2460 

GGATAAGTTT TTTATCAAAA TCAGACTGGT AAGGAGCCAC AATCTCAGGT TCAACAACAT 2520 

GGACTTTTCG ACTGTCGCGG TCCATGAGAA CAAAGCTCGC CATCATGTGG ATGACCTCCT 2580 

GCTCCGCTTC ATTATAAATA GTAAAGCGAC GCTAGCAAAA AAGTCGATTG TAGCTCAAGG 2640 

CTTCCGTTTC GATGGTAATT TCTTCCGCAA AACGAGCCAA ACGAACCACC TCAATATCAT 2700 

ATTCTACGAT AATCCAGACC AGATTATATT CTTCCAAAAT GGCCTTATCA CTAACTCCCA 27 60 

GTTCAATCGA CTGCATCCCT GAAACTTGCA GTGACAGCAA AATCACATCT GGAAGTTTGA 2820 

TATGACCGTT CATATCAGCC ATATCAAAAG GAATTTTCAT TTTCATTTGA TAAGTTAAGC 2880 

CCATGATCCT ACTCCAAAAT AAATCGTTCT GCTACAGTAT CTCCCAAAAA GAGACCTCTC 2940 

TTTGTCATGC GAACGTGCTC ACCCTCAATC TGCATGAGGC CTTGTTCAAC CAAATCTCTG 3000 

ACAATTTCTC CATAAAGTCC AGCAAAAGAC TGTCCAAATT TTTCCTCAAA TCGCGCCATG 3060 

GAAACCCCGG ATTTCTTGCG GAGTCCCAAG AACATTTCTT CTTCCATTTG CTCCTTTTGA 3120 

CTCAGGTGAT CTTCTGTAAT ACAAGCATTG CCTTCCTCAA CCGCACTGAG ATAATGACGA 3180 

ATGGGACCAT GATTTTTATA GCGTACTCCA TTGACATAAC CAGATGCCCC TGCACCAATA 3240 

CCATAGTATT CAGCATTGTC CCAGTACATG AGATTATGAC GACTTTCAAA ACCGCGTTTG 3 300 

GACAAATTAG AAATCTCATA ATGCTCAAAA CCCCCTCGCT CCAGCTCTGC AATGATGTAC 3 360 

TCAAACATCT CCCCTTCTAG TTCCTCCTTA GGCAGAGGCA ATTTCCCACG TCGCATCCGG 3420 

TTCATAAAGA CCGTATGGTT TTCTAAAATC AAACTATACA AACTCATGTC GGGAATATCC 3480 

AATCCAATGG CTTTAGCCAC ATTTTCCTTT ACTTGCTCCA TGGTCTGACC AGGCAGAGCA 3540 
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TAAATCAAAT CAATCCACAT ATTGTCAAAA CCAGCCAGTT TCAGGCGATC GATATTTTCA 3 600 

TAAATATCCT TCTCCAAATG ACTCCGCCCA ATCTTTTTCA ACATCTTATC ATCAAAGGTC 3660 
TCGACACCTA GCGAAACACG ATTGACAGCC GAATTTTTCA AAACAGCTAT CTTATCCGCA 372 0 

TCCAAATCGC CTGGATTGGC TTCAATGGTC AACTCTTCCA AGACAGACAA ATCCAAGTTT 3780 
TTAG7CAAGC CATTCAGTAA CACCTCCAGT TGCGGAGCCG ACAGGGCTGT CGgTGTTCCA 3 8 40 

CCACCGATAT AAAGGGTTGA CAACTTTTCA ATATCATAAG AACGAAACTC TTCCAGCAGA 3 900 

TGCTCTAAAT AGCTGTCGAC TGGCTGATTT TTGATGAAGA CCTTTGAAAA ATCACAATAA 3 960 

TAACAAATCT GGGTACAAAA TGCGATGTGC ACATAGGCTG ACGTTGGTTT TTTCTGCATA 4020 

GTAATTATTA TACCACAAAG ACTAGATTCC AGATAAAAAT CACCATCCCC AGATACATAG 4 080 

TCCGTCCGGA GATGGTGATG CTTTAT7CTT CTGTTATATC AATCACAATC TCTTCTGAGT 4140 

CATCAAGAGC TTCGGCTTTT TCTTGCCATT GCTCCTTGAG ATTATTTAAT TGATTTTTTG 4200 

ATGCTTCTGT CGCTTGAAAA GCATAGGATT TAGTTTGAGC AAGTATACTG TCCACAGTGA 4260 

TTTCACCTGA CTCAACCTCT TCTTTTGTTT TCAGAACAAA ATCTGTACCC TGCTCCTTAA 43 20 

CTTCTGTCAG TTTTTCACAG ACTTGCTCCT TGGCATACTC CGGATCTTCT CTCAAATCAT 4 3 80 

CTAGAAAATC TTGAGCCTGA CTGCAAACTT GTTTGCCCTT ATCACTTCTT AAAAACAAGG 4440 

CAAGAGCTGC ACCTCAAACG GTTCCTAAAA GGATTGAGGA TAATTTACCC ATAAGGATTC 4500 

TCCTTTTTTA TTTTTTGAAA AATTTACTTG CAAGACGAAG AGCTCACACA CTTGCACCAG 4 5 60 

TCTTGAGTGT TTTTGAACCA GCTCATCAAG CTTTCT7GCT CAAGACACGC GCATGGTCAT 4 620 

TGAGGTCTGA AACAGATAGA GATAAATCTG CAACAGCACT GAAGAGTGGA TCAATCGTAC 4 680 

CCACCTTGAC ATTGATATCA TCTGCCAAGA CATTGACCTT AGCCAACAAC TCATTGGTGT 4740 

GATGCAAGGT CACATCCACA TCTGAAGTCA AGGTTTTAAT CGTCTTTTCT GTTTCATCGA 4800 

TGACACGACC AAGCTTTTGT ACAGTAATGA TCAGATAGAC CAAAAAGACA ATCAAAGCTA 4 8 60 

GGGCAACAAG AATATATGCA ACTTCTAACA TTTAGTTTTC CTCCTCTGTA ATATAGTAAG 4 92 0 

GGGCCTTCTT TCGATTTTGA TAAATAACGA TCATTATACC GAGACCGATA AGGACAACTG 4 980 

ACAGCCATTG GGACACTCGA AAGCCGAAGA ACATGACACT ATCTGTTCGC A7ACCTTCGA 5040 

TAACCATACG ACCGAAACCA TACCAAATCA AGTAAAAGGC CGTGATATGA CCTCGTCTGA 5100 

GACTCTTCCA TTTCCCTCTA AAAATCAGAA TCAAGGCAAA GCCAAGCAGA TTCCATAGAG 5160 

ACTCATAAAG GAAAG7CGGT TGACGGTACC TCCCCTCAAT ATACATCTGG TCACGGATAA 5220 

AGCCAGGTAG ATAATCCAGA TTATCCACTG TTCCACCATA AGCTTCTTGG TTAAAGAAAT 5280 
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TACCCCAACG CCCCAAACTT TGAGCAATCA TAACGCTAGG CGCCGCAATA TCTAGAAAAT S3 40 

CCCAAGTATT GATGAGTTTA CGGTCAGCAA AGATATAGAG CACAAGAGCC CCAGTTATCA 5400 

AACCACCGTA AATGGCCAAA CCACCATTCC AAATCGCAAA AATCTCTCCT AAATTCTGAC 5460 

TATAGTAATC AAATCGCAAA ATAACATAGT AGAGACGAGC TCCTAAAATA GCCAAGGGAA S520 

AGGCTACTAA GATAAAATCT AAAATATCGT CTGGTATGAT CTTCTTTCTA GGTGCTTCTT 5580 

TCATGGTCAA ATAAACCGCA AGAATCAAGC CTGTCACAAT ACATAAGGCA TACCAACGAA 5640 

TGGCTAGGGG TCCTAGTTGA ATAGCAATTC GATCAAGCAT TTTGCACCTC ATTTCGAGCG 5700 

ATTAG ACTTG TCAGTCGTTC GTCGAACAAA CGGGTCGCAT CAAAGCCCAT TTCCTTGGCA 5760 

CCATAATTCA TGGCAGCTGC CTCAATCACA ACAGAGATAT TACGACCTGT TTTAACTGGA 5820 

ATACGAATAC GAGGAATGtA CGCCAGAAAC TTCAACTTCC TCTGCATTAT TTCCAAGACG 5880 

ATCAAAGGTC TTATGCGTAT CGTAATTTTC CAAATAGACA GCAAGCTGAA CCTGTGAAGA 5940 

ATCCTTGACA GCACTCGCAC CGTAGAGACT CATAACATCG ATAATACCAA CCCCACGAAT 6000 

TTCAATCAAG TGTTTCAAAA TTTCAGCTGG TTCACCCCAG AGAGTAATCT CATCCTTGGC 6060 

AAAGATATCG ACACGGTCAT CGGCTACCAA ACGGTGACCA CGTTTGACAA GCTCAAGACC 6120 

TGTCTCCCTC TTACCAATTC CACTATCTCC CTGAATCAAC ACGCCCATCC CATAAATATC 6180 

CATCAA 6186 
(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

GAAAATCACA ACCCTTTTTG CAAAATTTTT GAGATTATTT TCACAAACTT GATTTTTCAA 60 

AGTATACTCA ATAAAAATTA AAAAAATCCA CTACGTCAAG GCGAGGCTAA TGTGGTTTGA 120 

AGAAATTTTC GAAGAGCGTG AATGAGTATC ATCTATAGTA AAATAAAAAA ACTGAACAAT 180 

TTGCTTGGGG ACAGCCAAAC CAATTTCTCA CAATGTTTCA GAAACAAGGG TGTGCTATTC 2 40 

CAATTTCAGC CTACTATAAC TGTCATAGAT TGCTGAAACA AAGTCTAGGT AAAAGTCTTC 300 

ATAATAAAAA GACCTCCTAT CAAGTGTTCA AAAACTTTGA TAGGAGGTCT TGTTTTGTGA 3 60 

AAATATTTAT CAAATTTTCT ATACAAGTGA GCTGTTAGCC AGGTTCTTTC TATTCTTTCA 4 20 

ATTTCAATGA ATGGATTTTT TACTAATACT CATAACTGGG AATTTGTCTG TGTAAAAATA 480 
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540 

600 



780 
840 
900 
960 
1020 



901 

GCGAGATAGA TGGTATTTAT AAAACACTCA AGACAGCTAG ACTAATATCA TTTAAAACAT 
TATCTTCTTT TGAGCGACTG TTGGTTACCA ACATAGCTAA ATTTCCTGCA TTTTCAAATT 

GATAGGCTTC TGATTTAGCA TTCACAACCA CCAAGAGCTG TTCTTTGCCG TGAACTTCAT 6 60 

AGATAAGCTA GCCGCTATGT TCAATCGCAG AATGCACAAA GACATCATGG TAAATTTCAT 720 
CATAGCTAGA GTAAGAAAAG GCACCAGTTT TTGTCTTCAA TCGGATGACT TGACGGATAA 
ACTCAATACT GTCTTGACGC TCATTAATCA ACTTCCAGTT CACTTGGTTC ACACTGTCAG 
GAG C ATT AT A GCTATTCATC GCACGCTCTC TATCATCATG GGTCAACTCA CCATTTTCAC 
CAGTCGCAAC CAGTTTGGTA CGACCAAATT CTTGACCGAT TTCCATAAAG CCCATCCCCT 
GCATGAGCAG ATTCATGGCT GTGGCAGTTT CGACCTTGCG CATGATTTGC TCTGAACTTT 

GGTCTGGATG AAGGGTTGCC AATAAATCGT GAAGATTGTA ATTGTCATGG GCTTCTACAT 1080 

AGTTAAGCAC CTGATTTGGA TGTGTATAGC TTCCTAATTC ACGACTTCCT AGGATTGCTT 1140 

TAGCTAGAAT TGGCTCTGTC CCAGCACCAC TGACAAAACC TGACTTGATA GCACCATAAA 1200 

CTTCTCCCCC TTTGACAGCA TCGCGCTGAT TGTCATTAAA GAAACCAATA TTTGGCATCT 1260 

GGTAGGCATT GTCCTTCTTG GCCTTATCAT AAGGGGCAAG ACCTGTTCCC ATATCCCATC 1320 

CTTCTCCATA GAGGATAATG TTGGACTCGA TTTCATCCAA GCTTTGACGA ATCATCTGCA 13 80 

TCCTCTTCAC ATCATGAATC CCCATCAAGT CAAAACGGAA GCCGTCAATA TTATATTCCT 1440 

GCACCCAGTA TAGAAGAGAA TCAATCATAT ACTTGCGAAA CATTTCGTGT TCACTGGCTG 1500 

TTTCATTTCC AACACCCGTT CCATTCTCGA AGGTACCATC TGGATTCATA CGATAATACT 15 60 

AATCAGGGAC TGTTGTTTGG AATGGTGCAT CAACAACTGA GAAGGTATCG TTATAGACTA 162 0 

CATCCATAAT GACTCCAATA CCCGCATCGT GATAAGCTTG AACCATCACC TTCAAATCAC 1680 

GAATGACCTG AGCTGGATCA TCTGGATTAG TTCAAAAAC7 AGTTTCTGGC GCGTTATAGT 1740 

TTTGTGGATC ATAACCCCAG TTGTAGGTTA CATTTCCATC CTCATCGTAT TCTTTATCAC 1800 

GGTCTGCAAT TGGTTGCAAT TGAACATAAT TGTAGCCCAG CTTCTTGATG TAATCAAAAG I860 

CACTTGACTG CCCGTATTGG TTAACTGTTC CAGCCTGAGC AGCACCCAAG AAAGTTCCTC 1920 

GAAGATGTTC ATCTACACCC GATGTAGGTG ATTTAGTCAA ATCACGAATG TGCATTTCAC 19 80 

AGATAACTGC CTTACATGGA TTTTCCAAGC GCCAAGTAGC CTCCGAACCG TGCTTAACCT 2040 

CGAAGTTTTC AACTTGCTTT TCTACATGGC TCAGAATACC TGAACGTTTG CCATCAGGGC 2100 

TGGTCGCGAT TGTATAAGGA TCACGTGTCA CTCTTTGCTC ATGAGGGAAT TGGACTTGAT 2160 

ACTGATAAGT CTTACCTACC AAATCTTCTT CAACATCCAA ACTCCAGACA CCGATTGTAT 2220 
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TCTCCTTATG 


ATTATAAGAG 


1 TACCTATTGC CTCTTTTCAT 


' CTCAAAAGTC 


: TTCCAAACGG 


2280 


GTGCATCATT 


ACCAGCTCAT 


TCATAAACGA CAACTTCCAC 


' TTCTGTCGCT 


GTAGGTGACC 


2340 


AGAGAGAAAA 


ATGACCCTCA 


TTGTCCTCTA CACGGCAACC 


CAATTCTCCT 


TGGTAACCCC 


2400 


AATGATGATC 


AAAACTAGCA 


CTGTTAATGG CCTTATCAAA 


CCCAAAAGGA 


TTTTGATTTT 


2460 


TATAGAAAGG 


ACTGGCAATA . GCAGGATTTT CAGAGTAATA 


AATCCTATCA 


TCGCCTTCCA 


2520 


AAATCCAGAC 


CTCTGTTAAT 


AGCGGATAGT GATTAAAACG 


GATAGAATAT 


TCTTTACTAG 


2580 


TTTGACCTGT 


ATGAACCACA 


AAATTCAAGC TTTCTATAAC 


ATCTGAACTT 


GGGTGTTCAA 


2640 


AGCTAAATAA 


AGCTCCAAAA 


TAATCTTCTT TGTAGGTTAG 


CAAATCAATT 


CGTTGATCCT 


2700 


CACTTTTTAC 


AAAGGAGCAA 


CTGTCATATT CTCCATTCTT 


ACGATGGTAA 


TGAATGCGCA 


2760 


TAGGGTAGTT 


ATACATTTTT 


TATTTTTCCT TTTTACTTTG 


TTTCTATTTC 


ACTAATAAAT 


2820 


TTTTGTCAAT 


CTCGTCTCAA 


TTAACAGACA TAGTCATATT 


CTCTAAACTC 


TGTTTTTAAA 


2880 


CGATCCATTA 


CAAACTTTCT 


AGCCATGCCT CATCTCTGAC 


CTGGATACCA 


AGTTCTTGTG 


2940 


CTTTTTGCAG 


TTTACTTCCA 


GCGTCTGCAC CTACCACCAC 


GAGGTCGGTC 


TTTTTAGAAA 


3000 


TACTACCTGT 


CACTTTGGCA 


CCCAGACTTT CGAGTTTACT 


TTTAGCTTCT 


GAGCGCTTGA 


3060 


GTCGTTCCAA 


TTTTCCTGTC 


AATACCACGG TCAAACCTGA 


CAAGGCCGCA 


TCCGCTACTA 


3120 


CCGTCTGTCC 


TTTATAGTCC 


AGATTGACCC CAGTTTCTTT 


CAATTCTCTG 


AGCAGAATTT 


3180 


CAGAGCCTTC 


TGTCGCAAAA 


TAAGTCTGAA GACTTTTGGC 


AATCACGCCA 


CCTAGACTTT 


3240 


CAATACTAGC 


CACTTCCTCT 


GAATCTGCCT GAGACAGATT 


TTCAATTGAA 


TGGAAATATT 


3300 


GAAGTAAAAG 


CTGACTAACC 


TTGCTTCCGA CATGACGAAT 


TCCCAAACCA 


AATAAGAGCT 


3360 


TCTCGGCAGA 


ATTTTCCTTT 


GATGCTTGGA TAGCCTGATA 


CAGTTTAGCA 


GCGGACTTTT 


3420 


CCTTAACTCC 


CTCTAAAAGG 


AGCAAATCCT CTTCTTGCAA 


ACGATAAATA 


TCCCCCACAT 


3480 


CCTTCACTAA 


ATTAGCAGCA 


AAAAGCTTCT CAACAATACA 


TGGACCAAGG 


CCTGTAATAT 


3540 


TC AT AG CATC 


ACGAGAAGCA 


AAGTGAATCA AGCCTTCCAT 


GATTTGAGCA 


CGGCAACGCG 


3600 


GATTGATACA 


ACGTAGGGCC 


ACTTCATCTT CAAAGTGCAA 


CAAGTCAGAG 


TTACAACTTC 


3660 


GACAGTTTGT 


AGGGATATCT 


AGTTTTTCTT CAGAAACCCG 


TTTGGACTCT 


ACCACACGTA 


3720 


AAACGGCAGG 


GATGATGTCA 


CCAGCCTTAT ATACAATGAC 


CGTATCGTCT 


TTTCGGATAT 


3730 


CTTTTTCAGC 


AATATAATCT 


ACATTGTGCA CGGTCGCACG 


CCTAACAGTC 


GTACCGGCAA 


3840 


GTTGTACTGG 


TGTTAGATTA 


GCAGTTGGAG TTACAACACC 


GCTACGGCCA 


ACTGTCCAGT 


3900 


CAACTGATAA 


GAGTTGAGCT 


TCTTTTTCTT CGGCAGGGAA 


CTTGTAGGCT 


ACTGCCCACT 


3960 


TTGGAGCCTT 


AACTGTAAAA 


CCAAGTTCTT CTTGACTTGC 


TAGGTCGTTG 


ACCTTGATTA 


4020 
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CCACTCCATC 
TCCAGATTTC 
GTTCTAGGTA 
AGACAAACGT 
CTGCTCCCGC 
TAACTTGGTC 
CTTCTGGCAA 
CAATTGAACC 
AG AT AG AT AA 
GAACACGCGC 
GAGGATACTG 
TCGGACTCTC 
AAAGGCGGTC 
CATAGCGATT 
ATAAAACAAG 
TTTCAAACTC 
TAAGAAATAT 
TGGCAAGACT 
TGTACTCAGT 
GTGAATATTA 
AGTTGGAAGT 
AAGATTATAA 
AGCCCCCATA 
GGGAAGTAGA 
GAAGAAAATT 
ATCTCTCCCA 
CACTAGAACA 
TGCAAACTGT 
CAGCTCTAAG 



AATATCGTAA 
ATCTATGTTT 
CTTCAAACCC 
TGCAAGATTA 
A7TACGACGA 
AAAGGAAGCG 
AGTCAAAGCG 
ATCTCCACGT 
GCCATCGATT 
ATCAAAAGCA 
ATGACTGTAT 
TCCTAGCACT 
ATACTCACTG 
GAGCAAAGCG 
CCCTCCTCAC 
AAGCAACCTT 
AAGGCTACAA 
GTCATAAATC 
ACTCTTCCAA 
AAAATCCTCA 
GGTAATCCCA 
ATATTAGCTT 
GTTAAAATAC 
AATTCAAAAG 
TCTTGCTGAT 
GTAAAAGCCT 
AAAGCAATGA 
AAAACAAGGA 
CGAGAATTAT 



903 

GGCAGATTTT CCCGTTCCTG 
TCAGCCAAGA TTCGCTTAGG 
TTTTCTTGGC TATCACGAGT 
CGCTTGGCAA CTACTGCTGT 
TTAGCAAATT CAGGCTCTCC 
CGTGGCATGT AACATTCCCC 
ATGTCCTTAA CACGCTTGAG 
GTTACCCCAG CAACCAAAAT 
TTCAGCTCAC AAATATAGGT 
TCTAGCTCCT CACATGAAAA 
TTTTCAAAAC CATCTAAAAC 
TGCTCTGGAT AAGCAGTTTC 
TCTGAAACCG AGGGATTATC 
ACTAACTCAT TCATTCTTTT 
AAACGAGAAG GGCGGAAAAA 
ATATCAATTT TTCAAAATCA 
CTCCAAGTCC AATAATCAAG 
CTTTTGCAAT AGCCATAAAT 
GAAATTCGCT CTCAACCTTG 
TAAACAATTC ACAAACTAAA 
TCATAAAAAC TCCGACACCT 
TAATTTTACT AGCTAGAACA 
TTGCATAGGC TCCTTCTCAC 
CTGCAAAAAA GAAATTAACG 
CCCAGATATA GTGTAACCCA 
TTTTCTCTTG AACTTTTGCT 
AAAAAGTCAG CGAGTCTAGC 
AGGAAAGAAC AGGAGAGCTA 
AGATCACAAT CTCATCTTTC 



TCCTACTTCT TGGATAAAAT 4080 

ATTGACCACA AAACCTAGTT 4140 

TCAAGGCCTC GCTTCTTGAT 4 200 

ATCCAACTGA CGCAGAGTTC 42 60 

ATTTTCTTGC CGCGCTTGGT 4 3 20 

ACGAACTGTG ATATCTAGTT 4 380 

GTTTTCTGTG ATATTTTCAC 44 40 

CCCCTTTTCA TAAGTCAGCG 4 500 

CGGATGAGCC ACTTCCTTAC 4 560 

AGCATCCTCC AAACTATAAA 4620 

CTTGCCACCA ACACGATGAG 4 680 

TAACTCCACC AACTCACGGT 4 740 

GCTGGTATAG TACTCAGTCC 4 800 

ATTCATAAGA CCATTTTACC 4 860 

ACACTTAGTT TGAAATTATT 4 920 

GTTCGAACAT ATCCGAGACC 4 980 

AAAGAATAAA GATGGACACT 5040 

AGAATAGCTA AGGTAAAAAT 5100 

GTTTGTACTT GAGTAAAAAA 5160 

TTTCCACAAA AGGAAAGAAA 52 20 

GTCAAAGCCA GTAAAATCAA 5 280 

GCCCCAATGA TGGAACCAAT 53 40 

CCGTAAAGCT GATTCGAAAA 5400 

CTGGAACCTA CCAGCAAAAG 54 60 

TCCTTGATAT CTACAAAAAT 5 520 

TCCTCTTTTG GAAGGAAAGC 5580 

AGTAGCGTCA TATGGAGACT 564 0 

ACACCTACAA CCTGCAAAAC 5700 

TCCACCACTT CAGTTATGAT 57 SO 
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AGCTTTATTG GCTGTGCGAG AAAAGGCAAA AGCAATAGCC TGCACAATGT TAGCAACAAT 5820 

CAAAGCGCCA ATCATCCAGC TATCATTCCT TATGAAAGAA ATAGCCAGAC AAAGAATCCC 5880 

ACAAACAAGA TCTGCCGTCA TTAAAATCTT ACCACGAGAA AAACGGTCTG AAATAACTCC 5940 

GCCAAAGGGA TTGACGAGAA TAGATGTGAC GAGCTCAGAA ATCTGATACA TTCCTAAAAC 6000 

TGTCTGTCCT ATAGTCCCCA TAGAAGCCAA CC AG AC ACT A TTTCCATAAT CATAGAGCAT 6060 

ATTTCCCATT 7TATTCATAG CCCCACGGCT AATCAACTGC ACTGCATAGC GATTCATATT 6120 

AAAGCTCCTC TCAAATTTTG AAACTATTGT ATCAAAACCG AAAGCAGCTT TTTATTTTTT 6180 

CCCTTATTTG GGAAAATTAA CTTTTGACAA ATTTTTCGTA GTCTTCCTGA TAATAGGCTA 62 4 0 

CTTGCTCTGG AAGACCTAAC ACATCAAAAA TATGCATGGC CTCTTGCATC TGCTTACAGC 6300 

CTTCTTTACA CTGTCCTTTT TGATATAAGG CAAAACCTTT TAAATAATGG AAAACATTAC 6 360 

GCTCATAAAG CTTAATACCT TTGTCAATAA TCTTCTC7CT ATAAGCCTCA AAATAGTTGG 6420 

CATTATAAAA AGAAGAATGC TCTAAACAAT GCTGGTAACA ATTCAGGGCC AAAATCAACA 64 80 

CTAATCTCTT ATGGCGACTA ATCTCTTGGT AAAATTCCTC CCTCTCCATA ACTTCTCTAC 6540 

CAATCCCAGT GACATAGTCT ACATCGTAGA AACTATAGAG GTTACCGAAA AGAATCAACT 6600 

CATACATGGT CCATTCTTCT GTTTTGAAGA GATAATCTGC TACCTTACCC AAATCATCCT 6660 

GCTTCATATC ATAACTCGCA TCTCTTTGAC AAATCAGACC TTGTAGCAAA ATCCACTTCA 6720 

GCTCAAAATA AAGGGGAGTC GTCGAACTCT TAGACTTTTC AAGTTGTTCT CTTTGAAGCT 6780 

TTTGAAAACC TGCAATATCG TTTGAATAGT AAAGTGGGAT AATCTGTGCC ATCATAGACA 6840 

CATGTTCATG ATTATGAAAA TTCCTTGCCT TATCCATGAA ATTTTCGATT GTTACATGAA 6 900 

TGTTATCCAA AATCTCAAAG AAACGGGAGA CTGCCAGGTC AGACTCCCCA AGCTCAAAGC 6960 

GAGATAACTG ACAGGTAGAG CAGGATTCGC CTGCTGCTTC CTTTAAACAA TAATTTCCAC 7020 

TTGTTCGAAA TTCACGAAAT ACTTTTCCAA GATGTTCCAT CTTTACACCT CCTCTGATAA 7080 

TTCTTCCCAC TCAAGCATAG CTTCTTCCTG ACGATGGCTG ATTTTGTCCA GCTCAGCCTG 7140 

TAATTCCATG AGTTTGTCGC CATCGTTTGT TTCCAACATT TGTTCAGAAA TGGCTTGGC7 7200 

TTGACTTTCT AGCTCTTCAA TTTCAGCTTC TAGACTTTCG ATTTG7CCCA TGAGTTTCCG 72 60 

AACTTCTTTT TGACTTTCTT TCTGGGCCTG ATAGTCATTG ACTGGACTTG CTTCCTTTGC 7 32 0 

TTGATTGCTA GTTGAAGCTT CCTCAGTCTG ACTCATTTCT GCTGTTGCTT TCT7CTCAAC 7 380 

ATAGTAGTCG TAATCTCCAA GGTAGAGAGT TGAACCATTC TCAGACAAT7 CCAAAACATG 7440 

AG7TCCCACA CGATTGATAA AGTAACGATC ATGACTGACA AACAGCAAGG TTCCATCAAA 7500 

GTCAATCAAG GCATTTTCTA GCACTTCCTT ACTATCAATA TCCAAGTGGT 7CGTCGGCTC 7 560 
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ATCCAGAATC AAAAACTTAT TCTTTTCCAT AGACAATTTA CCTAAAAGCA AACGAGCTTT 7620 

TTCGCCACCA GATAGCATGC CGACTGATTT TTTAACATCA TCTCCTGACA AAAGGAAGGC 7 680 

TCCAAGACGG TTGCGGATTT CAACTTCTGG TGTCAGTTTG AAATCATTCC AGAGTTCATC 7740 

CAGCACCGTA TTACTTGGTG TCAGCTTGCT TTGGGTTTGG TCATAGTAAC CAACCTCAAC 7 800 

ATTAGCGCCA AAGCGCTTTT CTCCCTTGAT AAAAGGAATC TGGTCCACAA TAGACTTGAT 7 8.6,0 

AAAGGTTGAC TTGCCGATAC CATTTGGACC AACGATAGCG ACAGCATTCA TCTTACGAAG 7920 

ATCTAGGTTA ATCGGTTGTG ACAAGACTTC CCCGTCATAG CCAACACCTC CATTTTCAAC 7980 

AGTCAAAACA ACATTGCCCG ACGTTTTTTC AGACTGGAAG GTCATGTTGG CTGATTTCTT 804 0 

CCCAGCTTCA GGCTTGTCCA AACGTTCCAT TTTTTCCAGT TGTTTACGGC GAGATTCAGC 8100 

ACGTTTAGTC GTTGAAGCAC GAACTAGATT GCGATTGACA AAGTCTTCCA GAGCAGCGAT 8160 

TTCCTTCTGT TGCTTTTCAT AGTTTTTTGC CTCAGTAACT AGCTTTTGCT CCTTCAATTC 8220 

GACAAAACGA GAGTAATTCC CCACATAGCG ATCCAAGGAA TCCTTCGTCA AATCTAGCGT 82 80 

AATTGTCGCA ACCTTGTCCA AGAAATAACG GTCGTGGCTG ACGATAATGA GGGCACCCCT 8340 

ATAGTTTACC AACTAATTCT CTAGCCAGGC GATGGTTTCA ATATCCAAGT GGTTAGTTGG 8400 

CTCGTCCAAG ACCAAGAGAT TGGGCTTTTC AAGGAGCATT TTGGCAAGTG CCAAACGAGT 84 60 

ATTTTGACCA CCAGAAAGCT CAGCAATTTT CATCTGCCAC ATAGACTCGT CAAACTTGAA 8520 

TCCATTCAAA ATCGCTCGAA TATCAGCTTC ATAGGTAAAG CCACCTCCTT GGCGAAAATT 8580 

CTCAGATAAG CGGTCATAAT CTGACATCAG TTTATCCAAA TCCTCACCAG ACTTTTCACC 8640 

CATCTCCACC TCCATCTGAC GCAGTTGTCT CTCCGTCCGA CGCAAATCAT 7AAAGACATC 87 00 

AAGCATTTCA TCGTAGATGG TATTTTCAGA CTCAAAACGG CTATCTTGGG CTAGGTAAGA 87 60 

CAGAGAAATA TCTTTTTTCT TATTGATTTC TCCGCTAGTT GGCTCCTCTT CTCCAACTAA 8820 

AATCTTCAAA AGAGTACACT TACCTCCACC ATTTTTCCCA ACAAGAGCAA TCCCATCTCG 8880 

TTCATCAACC TGCAGCTTGA TATTATCGAA AAGAACCTCT CCTGCAAAAG AACGTTCAAT 8940 

TTTATTAGCT TGTAAAATAA TCATACAAGT AGTATAGCAT GTTTCCCTAA GGCATTCAAG 9000 

ATAATCGTAA GTCTTTTAGT ACAACTTTTA TAACATAAAA TAAACTAAAT TATGTATATT 9060 

TTATATTAGA TTACTTCACT ATCTTGTTGG ATTTTCTAAC CAGCTAATCT TGTTTCAAAT 9120 

AGTTATCGCA CAAGTCTATT ATTTAATTCT TTT CATC ATT TACGTACGTA TAGCAGATTG 9180 

AAATAAGATG AGAACAAATC GATTGGGAAA GTAAAATTAA TTTCTATAAA TGTTTTAGCA 92 40 

ATTGTTTCGT ACTATTTTAG ATTCAGTCTA CTATATACAA TATTTTCGGA ACATTCAACT 9 3 00 



WO 98/18931 



PCT/US97/19588 

t 



906 

TTTTAACTCT ATTTATTACT AGATTTCATA ATTAAAAAAC CTACTCACCA AGCTAGAAAG 



9360 



CTTGATACAA TAGG C TTTTT AAAGACTGAT TATTTAACAC CGTCTTTAAG AGCTTTACCA 



9420 



GCTTTGAATG CTGGTACTTT AGAAGCTGCA ATTGTCATTT CTTTACCAGT TTCTCGGTTG 



9480 



CGACCTTTAC GTTCTGCGCG CTCACGAACT TCAAACTTAC CAAAACCGAT CAATTGAACT 



9540 



T 



9541 



(2) INFORMATION FOR SEQ ID NO: 133: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 3: 

TTGACTATCC TATCATGCTT TCTAACGTCT ACTCAAGAAA ATCATTTTCA AGTTTTCACA 60 

CCTTTCTCAA AAAAGTTAAA AAATTTTCTC AAAAACGCTT GACTCTCACC TAAGGCGAAG 120 

GGTTATACTA TCATTGTAAG GAGGAAATCA TGTACCATAT AAAAGAAGCT GCGCAGCTTT 180 

CAGGTGTCTC TGTCAAGACC CTGCATCACT ATCACAAGAT AGGACTCTTG GTCCCCTTAA 240 

AGTCGGAAAA CGGCTATCGA ACCTACAGTC AAGAGGATTT GGAACGCCTT CAGGTCATTC 300 

TTTACTACAA ATATCTAGGC TTTTCTTTAG AGAAAATAGC AGAGCTGTTA AAGGAAGAAA 360 

GCACACATTT ATTGCCCCAT TTGACTAGGC AGTTGGACTA TCTAACTCGC GAAAGGCAAC 420 

ATCTGGATAC CTTCATTTCC ACCTTGCAAA AAACTATTCA AGAACAAAAA GGAGAAAGAA 480 

AAATGACCAT TGAGGAAAAA TTCACGGGAT TTAGCTATCA AGACAATCAA AAATACCACC 540 

AAGAAGCGGT AGAGAAATAT GGTCAAGAAG TCATGGGACA ACCGCTCGAA CGCCAAAAAG 600 

GTCACGAAGA CGAGGCTACG GCCGCCTTTA ACCAAGTCTT TCAAACTTTG GCACAAAATC 660 

TTCAAGTTGG TTTACCTGCA ACAGCAACCG AAAACCAGGA GCAAGCAGCC AAGCTCTTGC 720 

AAGCCATTCG CACTTATGGA TTTGACTGCT CTATTGAGGT ATTCGCTCAT ATCGGTAAAG 780 

GTTACGTCTA CAACCCAGAG TTTAAGGAAA ACATTGACAA GTTTGGTTCT GAAACAGCCC 840 

AGTACACGTC AGATGCCATT GCGGTTTACG TTCAGACAAA TGCAGAATAA ATAGGCTAGG 900 

AATTTCCTAG CCTATTTTTT ACTTCAAATC ATAAAGCCAG TCGTCACCGT TTTTGTAGTA 960 

AAAGAATTCA CTGAGATCTT CTTCTAGAAA CACACGAAGC ATATCAGACA TATCATCCGT 1020 

TGCAAGTTTT AGATGAGAAA GATTTTCAAA GTCCTCCCAC CAAACTTTCC CTTCGTCTGA 1080 

AGACTGGAGT TCACCAGTAA AGTGTTCTGT CTTGTAAAAA AGGACGACAT AACGATAATC 1140 
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CTTGTCGTCA TACCAGTTTT TGATACCACA GACTTGGCGT TTGGAAATGA TCAGACCAGT 1200 

TTCTTCTTTC ACTTCACGAA 7G AC AG CATC GACAAAGGAT TCGCCACGTT CAACATGACC 1260 

ACCAGGAAAA GTAATGCCAG ACCAGTCGGG ATTAACTCCC TCTTGGACCA GGACCTTATC 1320 

TCCGTTTTTA ATCATACACA TGTTAACAAA TTCGACTCCC TCTCTTCTGT TCATTCTTCA 1280 

CAACCTTTAA TCTTTAATCA TAATGCAGAC 7TCCCGCCAC CCAGCCGGTA CAGAGGGCAG . 1440 

AAGTGATCTT AAAGCCACCC CTGTGGGCAT TGATATCCAT AACTTCGCCT GCAAAGTGGA ISOQ 

GGCCAGGTAC CACCTTACTT TCAAGGGTTT TAGGATTGAT TTCCTTGAGA CTGACTCCAC 1560 

CCTTGGTAAC AAAGGACTTT GCAAGGGACA TTTTTCCAGT TACAGGAAT7 TTAAGTTCTT 1620 

TAATGGACTG GACAAGTTGT TCTCGTTCCT TTTCAGTCAG TTGTTTGACT TTTTCAGGAT 1630 
ATCCTTGTAC AAAAAATTCG CCCAAGCCTT CTGGTAACAA GGTTTTTAAA GCGTTTTTCA " 1740 

AGCATTTTTC CCGATTTTCT TCTAGAAATG TAACCAAGTC CTTCTCAGAA AGTTGAGGCA 1800 

AAACATCGAG TGAGAGAACC TCCCCACCTT TGACAAAGCT AGACATGCGT AGGGCAGCAG I860 

GACCTGACAA ACCAAAGTGG GTAAAGAGTA AATCATGAGT CATGACATGC TTACCATAAC 1920 

TTAGGGTCAC ATCCTCCAGA CAAATACCTT GTAAGGCTTT ATGTCGAAAA TCTGTTAATA 1980 

AAGCACTTTC AGCAGCCTCA AGATCGGTGA TGGTATGCTT AAAATGGCGA GCAATCTCGT 2 040 

CACCAAAACC AGTCCAACCA GTCGAAGGAT AAGACTTACC ACCTCTTGTG ACAATGACTT 2100 

TCTCACAAGT GAAGGTTTGA TCCGCTGACT TAAGGACAAA CTGGTCATCT ACTTTTTTAA 2160 

CAGAAACGAT TTCTATTTGA GTAGCAACTT GACCACCTAG TTCGGTGATT TTCTTTTCCA 2220 

AAGCTTCGAT AATAGTCCGA GACTTGTCAC TGGCTGGAAA GACCCCTCCG TGGTCTTCGA 2 280 

CCTTAAGTTT AACACCATTT TCTGTAAAAA AGTTGATGA7 GTCATGATTA TCGAACTGGG 2 340 

ACAAAACACT GTAAAGAAAG CGTCCGTTTC CAGGAATTCC AGCTACCAGG TTCTC7AACC 2400 

TACCATTGTT GGTCACATTG CAACGTCCCC CACCAGTCCC AGCTAATTTT 7TTCCAACT T 2 4 60 

TCCGATTTTT TTCGATGAGG AGGCTTTTCT G7CCATAAAA GCTACTGCAA ATCCTAGCCA 2520 

TCATACCAGC AGGTCCCCCA CCGATGACAA TAGTATCAAA ATGTTTCATA GCTCTATTGT 2 580 

ACCACAAAAA AACAAGACAT GATGGTCACC TCTTGTCAAG AATCCAAT7A ATCAATTTCA 2640 

TAGCCCATCA GCAAACCGCC CTCTTCTGCA TAGAAACTGC AGAGACCAGA GGTTGGTAGA 2 700 

ATTTTAATAT CCGCTTG7CG GAAGGTTTCA CGGATTCGCT CTGAGAGCTG TTGACAACAT 2760 

TTTTCGTTAT TGCGTTGGGC CATGACAATA CGGCCACCAG CATA7CCAGC TTTTACTAAC 2 82C 

TCATCATAGG CACCTTGAAC TGATTTCTTT GATCCCCT7G C77TTTGTAG CAATTCGAGA 2 880 
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GTCCCAGTTT CACTAGCTTT TCCGACCATA CGAATGTTGA GAAGGCCAAC GACCGTACCG 2940 

ATAAGCTTGC TCAAACGGCC GTTCTTCACC AAGTTATCGA CTTTGGCTAG CACAAAGAGC 3000 

AACTTAGTTT TTTCTTGATA GGCGGTGATA CCTTCAACCA CTTCTTCAAA AGACAAGCCC 3060 

TGGTCAATCA AGTCATTCAA TTTTTCTACG AGTAGGTCAA CTTCACCACC AGCAGATAAA 3120 

CTATCAATCA CATGAATCTT AGTGTCAGGA TGGTCTTCCA GATAAATATT CTTTGCTAGT 3180 

TGAGCACTAT TGTGACTGCC AGAAAGGGTA CCTGTGATGG TTACTAGGAA AATCTTTTTG 3240 

GCACCTTCAA ATGCTCGCAA ATAGTCATCT GGGCTTGGAC AAGCCGATTT TGAAGCTTCT 3 300 

GCAGTTGCAT ACATGGTTTC CATCATTTGG TCAATATCGA GACTCCCGTC ATCAACAAAG 3 3 60 

ACCTGATCAG CTACTTGAAT GGTTAAGGGG ACACTTACAA AGGTTCTCT7 AATAGCTGGT 342 0 

GTTGCCAGTT GACGATAATC ACAACCAGAG TCAGCAATAA TCTTCCAAGT CATAGAAATT 34 80 

CTCCATCTTT GTCAGGAACG AT 3502 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12665 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EON ESS : double 
( Dl TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

CGATTGATTT TTTTAAAGCG TTCGATAGAG AATGAGAAAC GAATCCTTAG CAATGGCGGG 60 

AAAGAATTTG GAGTTCACAA TACAAAACGA TTAACTATGG CTCATATTGT TTTTTATCTC 12 0 

TCTTGCTTGG TTGAGGCAAT GGTGCACAAG ACA A TTTTTG ATGGCATGGG CATGGTTGGT 180 

TTAGTCTTGC TTATTTTTTC TATGCTGATG TTGATGTTGC TGATTCACTT GTTGGGAGAT 240 

ATTTGGACAG TGAAGCTTAT GCTTGTCAAT AATCACAAAT ATGTAGATCA TATCTTGTTT 3 00 

AGGACAGTAA AACACCCTAA TTACTTTTTA AATATTCTTC CTGAGTTCAT TGGCTTGACC 3 60 

TTGTTGAGTC ATGCTTATGT GACTTTTGTT TTAGTTTTTC CAGTTTATGC AGTTATTTTG 42 0 

TATCGACGAA TAGCTGAAGA GGAAAAGCTA TTACATGAAG TTATAATCCC AAATGGAAGC 4 80 

ATAAAGAGAT AAATACAAAA TTCGATTTAT ATACAGTTCA TATTGAAGTG ATATAGTAAG 540 

GTTAAAGAAA AAATATAGAA GGAAATAAAC ATGTTTGCAT CAAAAAGCGA AAGAAAAGTA 600 

CATTATTCAA TTCGTAAATT TAGTGTTGGA GTAGCTAGTG 7ACTTGTTGC CAGTCTTGTT 660 

ATGGGAAGTG TGGTTCATGC GACAGAGAAC GAGGGAGCTA CCCAAGTACC CACTTCTTCT 72 0 

AATAGGGCAA ATGAAAGTCA GGCAGAACAA GG AG AACAAC CTAAAAAACT CGATTCAGAA 7 30 
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CGAGATAAGG 


CAAGGAAAGA 


GGTCGAGGAA 


TATGTAAAAA 


AAATAGTGGG 


TGAGAGCTAT 


840 


GCAAAATCAA 


CTAAAAAGCG 


ACATACAATT 


ACTGTAGCTC 


TAGTTAACGA 


GTTGAACAAC 


900 


ATTAAGAACG 


AGTATTTGAA 


TAAAATAGTT 


GAATCAACCT 


CAGAAAGCCA 


ACTACAGATA 


960 


CTGATGATGG 


AGAGTCGATC 


AAAAGTAGAT 


GAAGCTGTGT 


CTAAGTTTGA 


AAAGGACTCA 


L020 


TCTTCTTCGT 


CAAGTTCAGA 


CTCTTCCACT 


AAACCGGAAG 


CTTCAGATAC 


AGCGAAGCCA 


1080 


AACAAGCCGA 


CAGAACCAGG 


AGAAAAGGTA 


GCAGAAGCTA 


AGAAGAAGGT 


TGAAGAAGCT 


1140 


GAGAAAAAAG 


CCAAGGATCA 


AAAAGAAGAA 


GATCGTCGTA 


ACTACCCAAC 


CATTACTTAC 


1200 


AAAACGCTTG 


AACTTGAAAT 


TGCTGAGTCC 


GATGTGGAAG 


TTAAAAAAGC 


GGAGCTTGAA 


12 60 


CTAGTAAAAG 


TGAAAGCTAA 


CGAACCTCGA 


GACGAGCAAA 


AAATTAAGCA 


AGCAGAAGCG 




GAAGTTGAGA 


GTAAACAAGC 


TGAGGCTACA 


AGGTTAAAAA 


AAATCAAGAC 


AGATCGTGAA 


1380 


GAAGCAGAAG 


AAGAAGCTAA 


ACGAAGAGCA 


GATGCTAAAG 


AGCAAGGTAA 


ACCAAAGCGG 


1440 


CGCGCAAAAC 


GAGCAGTTCC 


TGGAGACCTA 


GCAACACCTG 


ATAAAAAAGA 


AAATGATGCG 


1500 


AAGTCTTCAG 


ATTCTAGCGT 


AGGTGAAGAA 


ACTCTTCCAA 


GCCCATCCCT 


GAAACCAGAA 


1560 


AAAAAGGTAG 


CAGAAGCTGA 


GAAGAAGGTT 


GAAGAAGCTA 


AGAAAAAAGC 


CGAGGATCAA 


1 620 


AAAGAAGAAG 


ATCGCCGTAA 


CTACCCAACC 


AATACTTACA 


AAACGCTTGA 


ACTTGAAATT 


1680 


GCTGAGTCCG 


ATGTGGAAGT 


TAAAAAAGCG 


GAGCTTGAAC 


TAGTAAAAGA 


GGAAGCTAAG 


1740 


GAACCTCGAA 


ACGAGGAAAA 


AGTTAAGCAA 


GCAAAAGCGG 


AACTTGACAG 


TAAAAAAGCT 


1 800 


GAGGCTACAA 


GGTTAGAAAA 


AATCAAGACA 


GATCGTAAAA 


AAGCAGAAGA 


AGAAGCTAAA 


1960 


CGAAAAGCAG 


CAGAAGAAGA 


TAAAGTTAAA 


GAAAAACCAG 


CTGAACAACC 


ACAACCAGCG 


1920 


CCGCCTCCAA 


AAGCAGAAAA 


ACCAGCTCCA 


GCTCCAAAAC 


CAGAGAATCC 


AGCTGAACAA 


1980 


CCAAAAGCAG 


AAAAACCAGC 


TGATCAACAA 


GCTGAAGAAG 


ACTATGCTCG 


TAGATCAGAA 


2040 




ATLULnXiAL 


T C AAC AGC AA 


^^^FT* ft * * ft » 

CCGCCAAAAA 


CTGAAAAACC 


AGCACAACCA 


2100 


TCTACTCCAA 


AAACAGGCTG 


GAAACAAGAA 


AACGGTATGT 


GGTACTTCTA 


CAATACTGAT 


2160 


GGTTCAATGG 


CGACAGGATG 


GCTCCAAAAC 


AATGGCTCAT 


GGTACTACCT 


CAACAGCAAT 


2220 


GGCGCTATGG 


CGACAGGATG 


GCTCCAAAAC 


AATGGTTCAT 


CGTACTATCT 


AAACGCTAAT 


2280 


GGTTCAATGG 


CAACAGGATG 


GCTCCAAAAC 


AATGGTTCAT 


GGTACTACCT 


AAACGCTAAT 


2340 


GGTTCAATGG 


CGACAGGATG 


GCTCCAATAC 


AATGGCTCAT 


GGTACTACCT 


AAACGCTAAT 


2400 


GGTTCAATGG 


CGACAGGATG 


GCTCCAATAC 


AATGGCTCAT 


GGTACTACCT 


AAACGCTAAT 


2460 


GGTGATATGG 


CG AC AGGTTG 


GGTGAAAGAT 


CGAGATACCT 


CGTACTATCT 


TGAAGCATCA 


2520 
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GGTGCTATGA AAGCAAGCCA ATGGTTCAAA GTATCAGATA AATGGTACTA TGTCAATGGC 2580 

TCAGGTGCCC TTGCAGTCAA CACAACTGTA GATGGCTATG GAGTCAATGC CAATGGTGAA 2640 

TGGGTAAACT AAACCTAATA TAACTAGTTA ATACTGACTT CCTGTAAGAA CTCTTTAAAG 27 00 

TATTCCCTAC AAATACCATA TCCTTTCAGT AGATAATATA CCCTTGTAGG AAGTTTAGAT 27 60 

TAAAAAATAA CTCTGTAATC TCTAGCCGCA TTTATAGCGC TAGAGACTAC GGACTTTTTT 2 820 

TGATGAGGAA AGAATGGCGG CATTCAAGAG GCTCTTTAAG AGAGTTACGG GTTTTAAACT 2880 

ATTAAGCCTT CTCCAATTGC AAGAGGGTTT CAATCTCTGC CACGGTGCTG GCTTGCGAAA 2940 

TGGCTCCACG GAGTTTGGCA GCGCCAGATG TTCCACGGAG ATAGTGAGGA GCGAGACCGC 3000 

GGAATTCACG AACTGCGACG TTTTCTCCTT TGAGGTTAAT CAATCGTTTC AAGTGTTCCT 3060 

AGGCGATCTT CATCTTGTCT TCAAAGGTCA AATCAGGTAG GATTTCTCCT GTTTCAAAGT 3120 

AATGGTTGAT TTGGTTGAAG AGGTAAGGAT TTCCCATGGC AGCTCGGCCA ATCATGACTC 3180 

CGTCAGCACC AACTTCTTCG ATGCGTTGCT TGGCTTCTTG GACAGTACCG ATATCACCGT 3 240 

TGGCGATGAA TGGAATCTTG GTTAGAGCTT GGGCAACCTT GTAAAGGGTC TCAAGGTCTG 3 300 

CGTGGCCAGT ATACATTTGT TCACGGGTAC GGCCATGCAT GGCGAGGGCA GAAACACCTC 3 3 60 

CAGCTTCAGC AGCGAGAGCA TTTTCTACTG CAAGAGATGG GTCCGCCCAG CCGGTACCCA 34 20 

TTTTGACACT AAGTGGGATA TCAAGGACAG ACTGGACCTT GTTGATGATG GAGTAAATCT 34 80 

TGTCTGGATC CTTGAGCCAC ATAGCACCAG CTTCGTTCTT CACGATTTTG TTGACAGGGC 3 540 

AGCCCATGTT GATATCGACG ATATCGCTCT TGGTGTTTTC TTGGATGAAT TCTGCTGCGC 3600 

GTGCTAGGCT GTCTTCATCG CTACCAAAAA GTTGGATAGA GACAGGGTTT TCGCCCTCAT 3 660 

CGATATGAAG CATGTGCAGG GTTTTTTCGT TGTTCTATTG GATTCCCTTG TCAGAGACCA 3 7 20 

TTTCCATTAC AACGAGTCCA GCTCCGAGCT CCTTTGCCAT AGTACGAAAG GCTGAGTTGG 37 80 

TCACGCCAGC CATAGGCGCT AAAACGGTAC GATTGGGAAT CTCAATATTG CCAATCATAA 3 840 

AAGGTGTATT AAGATTTGTC ACGAATGAGT TCCTCCAGGT CCTTTTCATC AAAGTTCTAA 3900 

GTAGTTTGGC AGAATTGACA AGTGATTTCT GCCCCGTGGT CTTCCTCTTT CATTTCCTGT 39 60 

AAGTCTGAGC TTGGAAGGCT GGCAAGAGCG TTCATAAAGC CTTCATCCCT ACAGTCACAT 4 020 

TGGAAACGGA TTTCTTCTTC AGAAAGACGC TTGTAGGCTT CGTCCCCGTA GATAGCCTTG 4 080 

AGGAGGGCTT CGATATGGTC GTCGCTTTCG AGAAGAGTAG AGATAGCTGG CATTTCTTGG 4140 

ATGCGTTTTT CAAAGCGAGC AATCTCTTCT TTCTTGGCTC CTGGCAAGAC TTGAACTAGG 4 200 

AAACCACCTG CAACCTTGAC CTTGTCTTCC TCGTCCAAAA GGACATTGAG CCCCACCCCT 4 2 60 

GAAGGCGTTT GTTCCCTTTC AGTAAGGTAA AAGGCAAGGT CTTCACCGAT TTCTCCAGAG 4 3 2C 
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CCAGAGAGAA 


AACAGGAGAT 


TGCCAATTTA 


GAGGAAATCT 


GCCCTCCAAC 


5460 


TTTGTCTCCG 


GGGCAGAAAT 


TGGACTTGGT 


TCTCTGCGCT 


CACATCAGTC 


AACAACCTCT 


5520 


TUGGGTCAAT 


CCTTGTCACT 


TTCAAGTCAT 


AGAAGAGAAA 


ATGCGAGGCT 


ATTTTGACAA 


5580 


TATCTTTTAT 


CTTCGTTTGC 


TGAGAAAGGT 


TCCCTCCTTT 


TTTGCTGGGC 


AACATATTCC 


5640 


ACTAGGAGTT 


GAGGATGGTG 


AGATGATGAT 


ATTCTTCTCT 


TTTCTCCTAT 


CTCATCGCAT 


5700 


TCTTCCTCTT 


CATACTATGG 


AGTATATTCT 


TGGTTTTGGA 


GGGCAGTTGG 


CAGATTTACT 


5760 


GACGCAATTG 


ATTCAAGAAA 


TGAAGAAGGA 


GGAACTATTG 


GGGGATTATA 


CAGAGGACCA 


5820 


TGTCACCTAT 


GAACTCAGTC 


AGCTTTGTGC 


TCAAGTCTAT 


CTCTATAAGG 


GCTATATTTT 


5880 


ACAGGATCGC 


TACAAGTACC 


AGTTAGAGAA 


TCGTCATCCA 


TATTTACTGA 


TGGAACATGA 


5940 


TTTTAAACAC 


ACAGCAGAGG 


AGATTTTTCA 


TGCTCTACCT 


GCTTTTCAAC 


AGGGGACAGA 


6000 


TTTAGATAAG 


AAGATTCTCT 


GGGAATGGCT 


CCAGTTAATC 


CAATATATGG 


CTGAAAACGG 


6060 
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TGGCCAGCAT 


ATGCGGATTG 


GTCTGGATTT 


GACATCTGGT 


TTTCTTGTCT 


TTTCAAGGAT 


6120 


GGCAGCCATT 


TTGAAACGGT 


ATTTGGAATA 


CAATCCTTTT 


ATTACCATTG 


AAGCTTATGA 


6180 


CCCTAGTCGG 


CATTATGATT 


TGCTGGTTAC 


CAATAACCCG 


ATTCATAAGA 


AGGAACAGAC 


6240 


ACCAGTCTAT 


TATTTAAAAA 


ATGACTTGGA 


TATGGAGGAT 


TTGGTAGCGA 


TTCGCCAGTT 


6300 


ATTATTCACT 


TAAAAGGCTT 


GGTTAATCCA 


GGTCTTTTTT 


GTGAAATTCA 


CACAATCTCC 


6360 


TCACATTTTT 


TTAAAAATTA 


AAAAAAGTTG 


ATAAACAAGA 


AAGCGCTTTA 


TTTTGTATAC 


6420 


TAGTAAGTGT 


AAAGAGGAAA 


CACCTCAAGA 


TCTTTATCAG 


GAGGACAGTA 


CATGTCACAA 


6480 


GAAAAATACA 


TCATGGCCAT 


TGACCAGGGA 


ACTACAAGTT 


CTCGTGCCAT 


CATTTTCAAC 


6540 


AAAAAAGGGG 


AAAAGGTTAG 


CTCGAGTCAA 


AAAGAGTTTA 


CCCAGATTTT 


CCCTCAGGCA 


6600 


GGTTGGGTTG 


AGCACAATGC 


CAATGAAATT 


TCGAACTCTG 


TTCAGTCAGT 


TATTGCGGGT 


6660 


GCTTTCATCG 


AAACTGGTGT 


CAAGCCAAAT 


CAAATCGAGG 


CAATCGGGAT 


TACCAACCAA 


6720 


CGTGAAACAA 


CCCTTCTCTC 


GGATAAGAAA 


ACAGGACTTC 


CTATCTACAA 


TGCTATCGTT 


6780 


TGGCAGTCAC 


GCCAGACAGC 


ACCTTTGGCT 


GAGCAACTAA 


AAAGCCAACG 


TTATGTGGAA 


6840 


AAATTCCATG 


AAAAGACTGG 


TTTGATTATT 


GATGCTTACT 


TCTCTGCTAC 


CAAGGTTCGT 


6900 


TGGATTTTGG 


ATCATGTAGA 


AGGTGCTCAA 


GAGCGAGCAG 


AAAAAGGGGA 


ATTGCTCTTT 


6960 


GGTACTATCG 


ATACTTGGTT 


GGTTTGGAAA 


TTGACTGACG 


GTGCGGCTCA 


CGTGACTGAC 


7020 


TACTCAAATG 


CAGCTCCTAC 


CATGCTTTAT 


AACATTAAAG 


AACTCAAATG 


GGATGATGAG 


7080 


ATTTTGGAAA 


TCCTTAACAT 


TCCCAACgCT 


ATACTTCCAG 


AAGTTCCTTC 


TAACTCCGAA 


7140 


ATCTACGGCA 


AGACAGCTCC 


ATTCCATTTC 


TACGGTGGAG 


AGGTGCCAAT 


CTCAGCTATG 


7200 


GCTGGGGACC 


AACAAGCAGC 


CCTCTTTGGA 


CAGTTGGCTT 


TTGAGCCAGC 


TATGGTTAAG 


7260 


AATACTTATG 


GAACAGGCTC 


TTTCATCATC 


ATGAATACTG 


GGGAAGAGAT 


GCAGTTCTCT 


7320 


GAAAACAACC 


TCTTGACAAC 


CATTGGTTAC 


GGAATCAACG 


GTAAGCTTTA 


TTATGCCTTG 


7380 


CAACGTTCTA 


TCTTCATCGC 


AGGAAGTGCT 


ATTCAGTGGC 


TTCGTGACGG 


TCTTCGCATG 


7440 


GTTGAAAATT 


CACCAGAATC 


TGAAAAATAC 


GCTCGTGATT 


CTCACAACAA 


CGATGAAGTT 


7500 


TATGTCGTTC 


CAGCCTTTAC 


AGGTCTAGGC 


GCTCCATACT 


GG AACCAAAA 


TGCTCGTGGT 


7560 


TCCGTCTTTG 


GTTTCACTCG 


TGGAACAAGC 


AAAGAAGACT 


TTATCAAGGC 


GACTTTGCAA 


7620 


TCTATTGCTT 


ATCAAGTGCG 


TG AT AT CATC 


GACACCATCC 


AAGTGGATAC 


TCAGACCGCC 


7680 


ATTCAAGTAC 


TGAAGGTGGA 


TGGTGGTGCA 


GCCATGAACA 


ACTTCCTCAT 


GCAGTTCCAG 


7740 


GCGGATATTT 


TAGGCATTGA 


CATTGCACGT 


GCTAAAAACC 


TGGAAACAAC 


AGCTCTAGGA 


7800 


GCGGCCTTCC 


TAGCAGGTTT 


GTCAGTAGGG 


TACTGGAAAG 


ACTTGGACGA 


GTTGAAACTC 


7860 
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TTGAACGAGA CAGGAGAACT CTTTGAGCCA TCTATGAACG AATCTCGCAA GGAACAACTC 7920 
TACAAGGCC7 GGAAGAAGGC TGTGAAAGCA ACTCAAGTCT TTCCGCAAGT AGACGACTAA 7980 
TACTGGCACA ATAAAGCGAT TTATTTAGAA AGTGTGTAAA TATGGAATTT TCAAAGAAAA 8040 
CACGTGAATT GTCAATTAAA AAAATGCAGG AACGTACCCT GGACCTCTTG ATTATCCGTC 8100 
GAGGAATCAC AGCAGCTGGT GTAGCCTTGC AGGCCGCAGC TAGCGGTCTT CAGACTGGTT 8160 
TGATTGAAAT CCAAGACTTT GCAGAAGGAA CATCTAGTCG TTCAACAAAA TTGGTTCACG 822 0 

GACCACTTCG TTACCTCAAA CAATTTGACG TAGAAGTGGT CTCAGATACG CTTTCTGAAC 3230 
GTGCAGTGGT TCAACAAATC GCTCCACACA TTCCAAAATC AGATCCAATG CTCTTACCAG 8 340 

TTTACGATCA AGATGGAGCA ACCTTTAGCC TCTTCCGTCT TAAAGTAGCC ATGGACTTGT 8400 

ACGACCTCTT GGCAGGTGTT AGCAACACAC CAGCTGCGAA CAAGGTTTTG AGCAAGGATC 8460 

AAGTCTTGGA ACGCCAGCCA AACTTGAAGA AGGAAGGCTT GGTAGGAGGT GGAGTGTATC 8520 

TTGACTTCCG TAACAACGAT GCGCGTCTCG TGATTGAAAA CATCAAACGT GCCAACCAAC 8580 

ACGCTGCCCT CATTGCCAAC CACGTGAAGG CAGAAGGCTT CCTCTTTGAC GAAAGTGGCA 8640 

AGATTACAGG TGTTGTAGCT CGTGATCTCT TG AC AG A CCA AGTCTTTGAA ATCAAGGCCC 8700 

GTCTGGTTAT TAATACAACA GGTCCTTGGA GTGATAAAGT ACGTAATTTG TCTAATAAGG 87 60 

GAACGCAATT CTCACAAATG CGCCCAACTA AGGGAGTTCA CTTGGTAGTA GATTCAAGCA 8820 

AAATCAAGGT TTCACAGCCA GTTTACTTCG ACACAGGTTT GGGTGACGGT CGTATGGTCT 9880 

TTGTTCTCCC ACGTGAAAAC AAGACTTACT TTGGTACAAC TGATACAGAC TACACAGGTG 8940 

ATTTGGAGCA TCCAAAAGTA ACTCAAGAAG ATGTAGATTA TCTACTTGGC ATTGTCAACA 9000 

ACCGCTTCCC AGAATCCAAC ATCACCATTG ATGATATCGA AACCAGCTGG GCAGGTCTTC 9060 

GTCCATTGAT TGCAGGGAAC AGTGCCTCTG ACTATAATGG TGGAAATAAC GGTACCATCA 9120 

GTGATGAAAG CTTTGACAAC TTGATTGCGA CTGTTGAATC TTATCTCTCC AAAGAAAAAA 9180 

CACGTGAAGA TGTTGAGTCT GCTGTCAGCA AGCTTGAAAG TAGCACATCT GAGAAACATT 92 40 

TGGATCCATC TGCAGTTTCT CGTCGGTCTA GCTTGGACCG TGATGACAAT GGTCTCTTGA 9300 

CTCTTGCTGG TGGTAAAATC ACAGACTACC GTAAGATCCC TGAAGGAGCT ATGGAGCGCG 9 3 60 

TGGTTGACAT CCTCAAAGCA GAATTTGACC GTAGCTTTAA ATTGATCAAT TCTAAAACTT 9420 

ACCC7GTTTC AGGTGGAGAA TTGAACCCAG CAAATGTGGA TTCAGAAATC GAAGCCTTTG 94 80 

CGCAACTTGG AGTATCACGT GGTTTGGATA GCAAGGAAGC TCACTATCTG GCAAATCTTT 9540 

ACGGTTCAAA TGCACCGAAA GTCTTTGCAC TTGCTCACAG CTTGGAACAA GCGCCAGGAC 9600 
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TCAGCTTGGC AGATACTTTG TCCCTTCACT ATGCAATGCC CAATGAGTTG ACTCTTAGCC 9660 

CAGTTGACTT CCTTCTTCGT CGTACCAATC ACATGC7CTT TATGCGTCA7 AGCTTGGATA 9720 

GTATCGTTGA GCCAATTTTG GATCAAATGG GACGATTCTA TGACTGGACA GAAGAAGAAA 9 7 SO 

AAGCAACTTA CCGTGCTGAT GTCGAAGCAG CTCTCGCTAA CAACGATTTA GCAGAATTAA 9840 

AAAATTAAGA AAAAATAAAA GAGGTGGAGG GCAGCATTCC TTGTCGCCCC TCCCTTCTTT 9900 

TTAATGGAGA CAGAAAGATG ATGAATGAAT TATTTGGAGA ATTTCTAGGG ACTTTAATCC 9960 

TGATTCTTCT AGGAAATGGT GTTGTTGCAG GTGTGGTTCT TCCTAAAACC AAGAGCAATA 10020 

GCTCAGGTTG GATTGTGATT ACTATGGCTT GGGGGATTGC AGTTGCGGTT GCAGTCTTTG 10080 

TATCTGGCAA GCTCAGTCCA GCTTATTTAA ACCCAGCTGT GACCATCGGT CTGCCCTTAA 10 140 

AAGGTGGTTT GCCTTGGGCT TCCGTTTTGC CTTATATCTT AGCCCAGTTC GCAGGGGCCA 10200 

TGCTGGGTCA GATTTTGGTT TGGTTGCAAT TCAAACCTCA CTATGAGGCA GAAGAAAATG 10260 

CAGGCAATAT CCTGGCAACC TTCAGTACTG GACCAGCCAT CAAGGATACT GTATCAAACT 10320 

TGATTAGCGA AATCCTTGGA ACTTTTGTTT TGGTGTTGAC AATCTTTGCT TTGGGTCTTT 10380 

ACGACTTTCA GGCAGGTATC GGAACCTTTG CAGTGGGAAC TTTGATTGTC CGTATCGCTC 10440 

TATCACTAGG TGGGACAACA GGTTATGCCT TGAACCCAGC TCGTGACCTT GGACCTCGTA 10500 

TCATGCACAG CATCTTGCCA ATTCCAAACA AGGGAGACGG AGACTGGTCT TACGCTTGGA 10560 

TTCCTGTTGT AGGCCCTGTT ATCGGAGCAG CCTTGGCAGT GCTTGTATTC TCACTTTTCT 10620 

AGTTTATACT CTTCGAAAAT CAAATTCAAA CCACGTCAGC GTCGCCTTAC CGTACTCAAG 10680 

TACAGCTTCC GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAT TAGAAAACAA 10740 

TTATGTTGAT AGAGCTTGGG CAAGAGCCCA ATTTCAGCAA AAAATGAAGT AAATCTTCTC 10800 

ATAATAAAAC GCATCATATC AAGCACGAAA ATTCCACGAG GTCAACTACA CTCAGAAACC 10860 

TGAACAACAA GCCAAAACGC CCAAAAAAGG CGGCAAAAAG CAAGCACCTG CAACCAACCT 10920 

GCCGAAATGG TCAAATCCTG ATTATGTCAA CGAATTAGAC CCAAAAATCG TTGATATCCT 10980 

AGTACAATTT CACAAGTCAC AAGGCACTTT GGAAACTCCC GAGGCGCAAG CAGAAATCGC 11040 

CCAAAAACGT GAAGAAATCG AGCAAAGGAG ACCTGAGCTT GAGGGTAAAA AACAAGAGCT 11100 

TTTGAACCGC TTGAACAAAT AGAGTTTCGC AAGTATTATG CTTACAAATT ACTTGAGCAA 11160 

TTAACTAAAA TATAAACCCT GCCTTTATAT CTAGGCAGGG TTTATATTTT AGAAATTCAC 11220 

GTAGGTTGTT ACGGTTTTTA CATACCCAGT ATAGTTTCAG TTTCTATAGT ATTCAGTGAT 11280 

AAACTTCCAT TTTCTTTGAG CAACATGGAT ATAAG7ACTT GTTATGTAGT ATGGATATCG 113 40 

GCTTTGTGAA TCCAAGTAAG ACTGATAAGC TTCTATACCA AAATATGCTC CACCAATTAT 11400 
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TGCACCCCAT GGACCCCCCA ATAAAGCACC TATCCTACCA ATCATATAAC TGATTCCAGC 114 60 

ACCAGTCATG AAGTTAGCGA ATGTGTTACC TTGTTTATTC CCATGTATTG TGTTGACGTA 11520 

ATTCCAAACA TTAGGATCGT ATGATCTAAA AGATATATTT AGGTCGATTT CATTCTTTTG 11580 

ATAAGCCATA TAAAATGCCC CATTGATATA GACGCCCTCA CCACCTCGTT CAATAGTCTC 11640 

TACACTTCCA TCTGGATTGA CAACCTCAAG AACTTCATCG CTTAAAATAT TTACTTGCGT 11700 

ATCTCCGAAC CGCACTGATG AGCCATTCTC AAACTGAGCC TCACCAGATA CAACTTTAGA 11760 

GTTTGCCGAT AAGCTATCAT CAGCAAAAAC AAACAAGCGA CGGGGAAATG CTAGACATAC 11820 

AGAAAACAGA CATAACTAGC AAACACATGC ATTTAAACAT CTTAGACATA ACGGAAACTC 11880 

CTTTGTATTT TTGATTTTTT TCAACTTTTA TTATACAATA AAACCAAATA AAAAGAAAGC 11940 

GGTAACAATA TGCTTAATGC GAAAATTTTT TATATATTTT TATGTTTGAT CGTTATCGAA 12000 

ACTACAGGCT TGTTGTTGTT GAAAAGAGGT CTCGAAATGG GTTATTTAGA CACAGAAGCT 12060 

ATTATCCTCG CAGTTTTTTC ATTTGCTTTT TACAACCTAT GTTCATTCGC TTGGGTCTGC 12120 

TCTACAATAA AAAACAATAA AAAATAAATA GACGTATTTT CAAAAAAAAC maAATGCATA 12180 

TTTATATTAG CAAAACGACG ATTTAAATCG TCGTTTTTTT GTAGTACGAC CGGCATGTCG 12240 

TATATCTGAG GTGTAAGTCC TCAGCCTGAC TATCGTGAGG TAGCAGGGAG AGGAAGGGAT 12300 

AGCGAAATCG TGCCTCTACC AACAGGAACG TGATAGTAAG GCGTATATAG CGGATAAGGA 12 3 60 

GGCTTCAAAC TCTAAAGTCC AAAAAGGTAG TCGTAACCTA TATGTGTAAA TCACGAGAGT 12 420 

AATTGAATTC GGACTAAGGT TTGTGTGAAA AAGATAAATC TTTCTAGAGT CTAAAGACTC 124 80 

TGCGTCAGAT TTCCTATTTT CACTGTAACC TTTTAACGTC CTCATATCTT GTATAAACGA 12540 

GGAAAGATGT ACGACTTATC CCGTGAGGTT TCATCAGCGT GAAAGCGTAG TAACAACGAA 12600 

TCATGAGAAG TCAGCCGAGC CCATAGTAGT GACCAAACTT CCGTAATGGA AGTGGAGCGA 12 660 

AGGGG 12 665 

(2) INFORMATION FOR SEQ ID NO: 135: 

ii) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 5305 base pairs 
(8) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

CGCTAATCAC TACAATCATT TTATTGTACT TTTTCACTCT CAAGAAAAGC AAGAAGTATT 60 
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CATTTTAGTT 


TCATTTAGTA 


TTATTTTGCA 


TACCTAAAAT 


ACAGTAAAAA 


ATCAGTCA7C 


120 


TTCGTATGCT 


CCTGCTTTCA 


CTATTCAACA 


CGTTTTTGAC 


TTATACTAGG 


CTCATTTCCA 


130 


AAAGCATTAT 


ATAATAGTGA 


TATGAAACCA 


ACTAAACTAA 


ACAAGAAATA 


TAAGCAATAA 


240 


AAATTCGTTT 


AAAAGATCTT 


ACTAAAGCTA 


ATACTAAATA 


AAAATAAAAG 


AGTAAACTAG 


300 


GAAGTTTATT 


TCAAACAACC 


TAAAATACTG 


ATTTTCGGCT 


GAAGATAATA 


CTGGAGTGCA 


360 


AATTAATGGG 


GTTATAATAA 


ATAGCTGATA 


GCTTGTCTTG 


GTTTTGGATT 


TTTTAAGAGT 


420 


AGATGAGTAT 


TAAAACTATA 


AGGAGGACGA 


AGGTGGCTAA 


AAATTTAAAA 


TTAAAATTAG 


480 


CTCGGGTACA 


GCGTGATTTA 


ACACAAGGTC 


AACTGGCAGA 


GGCTGTCGGG 


GTCACACGCC 


540 


AGACTATTGG 


TTTAATAGAG 


GCGGGAAAAT 


ACAATCCCAG 


TCTCTCGCTC 


TGCCAGTCTA 


600 


TTTGCAGATG 


TTTAGGGAAA 


ACCCTAGACC 


AACTATTTTG 


GGAGGAAGAA 


CATCAAAAA7 


660 


AGATTTTATT 


ATTCTCAATT 


ACTAGACGAA 


AGAGAAGAAC 


AACTGTTCAA 


TAAAGCGGGC 


720 


TCTGAAAGTT 


TCTATATCTG 


CATTGCTTTG 


TCCCTCCTAT 


CTTATATCAT 


TTCACTATTA 


780 


GCACCAAGCC 


TTTTTAATTC 


TAATATGCTG 


CTAATCCTTA 


TCATCATAGG 


GACATTTTAC 


S40 


TTTTTCAATC 


GTGCCCGTTA 


TCTGGGAGTG 


ACCTACTATG 


GTCGTTTTCA 


TTTTACGATT 


900 


TTGGGTTGTT 


TTTTCCTAAC 


CTTGGCTATT 


ACGGCTCTTT 


TGATGTTGCA 


CAATTATCAA 


960 


TTCAACATAG 


AAATTTATCA 


GCACAATCCT 


TTGAATTTTA 


AATACCTGTC 


TGCTTGGGTC 


1020 


ATTACTTATA 


TCATTTACCT 


TCCGTGGATC 


TTTATTGGCA 


ATCTTGGTCT 


TAAGAGCTAT 


1080 


GGCGAATGGG 


CTCAGAAAAA 


ATTTGAACAA 


GATATGGATG 


AATTGGAGAG 


TGGAGAATAG 


1140 


CTTCTTACTC 


TTTTCTCAAT 


CCAGCTAAAA 


TGTGATATAA 


TAGTACTAAT 


TTATTGGAAT 


1200 


ACATGAAAGT 


TCTTGAAAAT 


TTTCATGGGT 


TTCTAGCTAA 


GGAAGTAGGA 


AAAGTATGTA 


1260 


TCCAGATGAT 


AGTTTGACAT 


TGCACACGGA 


CTTGTACCAG 


ATCAACATGA 


TCCAGCTTTA 


1320 


CTTTGACCAA 


GGGATTCACA 


ATAAGAAGGC 


GCTCTTTGAG 


GTGTATTTCC 


GCCAACAGCC 


1380 


kill i\J\\jJ\£\\- 


1 r\l VjV» VJVJ 


TTTTTGCAGG 


TTTAGAAAGA 


ATTGTGAACT 


ATCTTGAAGA 


1440 


CTTGCGTTTT 


TCAGATAGTG 


ATATAGCCTA 


TTTGGAGTCG 


CTTGGTTATC 


ATGGGGCGTT 


1500 


CTTGGATTAC 


CTTCGCAATT 


TCAAGTTGGA 


GTTGACCGTT 


CGTTCTGCCC 


AAGAAGGGGA 


1560 


TTTGGTTTTT 


GCTAATGAAC 


CGATTGTGCA 


GGTGGAAGGA 


CCTCTAGCCC 


AATGTCAGTT 


1620 


GGTCGAAACG 


GCTCTTTTGA 


ACATCGTCAA 


CTACCAGACT 


TTGGTGGCGA 


CGAAGGCAGC 


1680 


TCGTATTCGT 


TCGGTTATCG 


AAGATGAACC 


CTTGATGGAG 


TTTGGGACAC 


GTCGGGCTCA 


1740 


AGAAATGGAT 


GCGGCCATCT 


GGGGAACACG 


CGCAGCTGTG 


ATTGGTGGCG 


CCAATGGAAC 


1800 


CAGCAACGTG 


CGTGCGGGTA 


AGCTCTTTGA 


CATTCCTGTT 


TTGGGAACCC 


ATGCCCATGC 


1860 
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CTTGGTACAG GTTTATGGCA ATGACTATGA AGCTTTCAAG GCTTACGCTG CGACCCACAA 1920 

AAATTGTGTC TTTCTTGTGG ATACCTATGA CACCCTTCGC ATCGGTGTAC CAGCTGCCAT 1980 

TCAGCTCGCG CGTGACCTGG GTGATCAGAT TAACTTTATG GGTGTGCGGA TTGACTCTGG 2040 

GGATATTCCC TACATTTCTA AGAAAGTCCG TCAGCAACTG GATGAGGCTG GATTTACAGA 2100 

GGCTAAGATT TATGCTTCTA ATGATCTAGA TGAAAATACC ATCCTTAACC TCAAGATGCA 2160 

AAAGGCCAAG ATTGATCTCT GGGGTGTGGG TACCAAGCTG ATTACAGCCT ATGACCAGCC 22 20 

GGCTCTTGGG GCGGTTTACA AGATTGTTGC AATCGAAGAT CAAACTCCTC AGATGCGCAA 22 80 

TACGATTAAG CTGTCTAATA A7CCTGAAAA AGTTTCTACG CCACCTAAGA AGCAGGTGTG 2340 

GCGCATTACC AGTCGTGAAA AAGGCAAGTC AGAAGGCGAC TATATCACTT ATCATCCTGT 24 00 

GGATATTAGC GACATGACAG AAATCAAGAT GTTCCATCCC ACCTATACAT ACATCAAGAA 2460 

GACGGTTCGT AATTTTGATC CCCTTCCTCT CTTGGTGGAT ATCTTCAAAG AAGGAATATT 2520 

ACTTTACAAC TTGCCTAGTT TGACTGACAT TCAGGATTAT GCCCGTAAGG AATTTGACAA 2580 

GTTGTGGGAT GAGTATAAGC GTGTGCTCAA TCCGCAGCAC TATCCAGTGG ATTTGCCGCG 2640 

TGATGTATGG CAAGATAAGA TGGACTTGAT TGATAAGATG CGCAAGGAAG CCCTTGGTGA 2700 

AGGAGAAGAA GAATGAGTTT GCAAGAAACG ATTATCCAAG AGCTGGGTCT CAAACCAGTG 2760 

ATTGATGCCC AGGAAGAAAT CCGTCGTTCT ATTGATTTCT TAAAAAGATA TCTGAAAAAA 2 820 

CATCCCTTCC TAAAAACCTT TGTACTAGGG ATTTCTGGGG GACAAGACTC AACCTTGGCA 2880 

GGACGTTTGG CGCAATTAGC TATGGAAGAA CTGCCAGCTG AAACGGGACA CGATAGCTAC 2940 

AAATTTATCG CTGTCCGCCT GCCATACGGA GTGCAAGCTG ATGAAGCAGA TGCTCAAAAA 3 000 

GCCCTAGCCT TCATCCAGCC AGATGTCAGC TTGCTTGTGA ATATCAAGGA ATCAGCTCAT 3 060 

GCCATGACAG CTGCAGTTGA AGCGACAGGT AGTCCTGTTT CAGACTTCAA CAAGGGGAAT 312 0 

ATCAAGGCAC GT7CCCGTAT GATTGCTCAG TATGCCCTTG CTGCTTCCCA TAGCGGAGCG 3180 

GTCATTGGAA CAGACCACGC CGCGGAAAAT ATCACAGGTT TCTTTACCAA GTTTGCTGAC 32 40 

GGCGGTGCGG ATATTCTCCC TCTTTACCGC CTCAATAAAC GCCAAGGAAA ACAGCTCTTG 3 300 

CAGAAACTTG GCGCAGAGCC AGCCCTTTAT GAAAAAATCC CAACGGCAGA CCTAGAAGAA 3 360 

GATAAACCAG GCCTAGCTGA CGAAGTCGCA CTTGGAGTCA CCTACGCAGA GATTGACGAC 3420 

TACCTAGAAG GCAAAACAAT CAGCCCAGAA GCTCAAGCGA CCATTGAAAA CTGGTGGCAC 34 80 

AAAGGCCAAC ACAAACGCCA CTTACCCATC ACCGTATTTG ATGACTTTTG GGAGTAAAAA 3 540 

GGTCCGGGGG ACCTTTTTAG CTTCTTGCCC TGAAATTAAA AAGCAAGAAA AACCTCCACT 3 600 
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GGAGGTTTTC 


AGCCTCTCAT 


CTTGAAATAA 


GAAAGTGAGA 


GAAGGTCTGG 


GGGATCTTGA 


3660 


ACCCCGAGTT 


TAGAAATAAG 


AAAATGAGGC 


AGATTCAGTA 


ACTCGAAGAG 


TTCGATTTCA 


3720 


TCGTCTTACC 


CCTGCAACGA 


TGACTAGGTT 


TGAAAAAGCT 


TGCTAGAGCG 


CATTTCAAAC 


3780 


CAGGCAGCAA 


CTGCGTCAAG 


AAATTAGAAG 


ACAAACTCGT 


TTTCTAGCTG 


TTACTGAGTT 


3840 


CAGCCTTTTT 


ACTACGAGTA 


TAGAAATAAG 


GAAGTGAGGT 


AGCATCATGA 


AATCTATCGG 


3900 


TACCCAAATA 


TTACAGACAG 


AACGTTTGAT 


TTTAAGAAGA 


TTTGTGGAGA 


GTGATGCAGA 


3960 


AGCCATGTTT 


CAAAATTGGG 


CTTCATCCGC 


TGAGAATCTG 


ACCTATGTTA 


CCTGGGATCC 


4020 


CCATCCTGAT 


GTCGAAATCA 


CTCGAAACTC 


GATTTGCAAT 


TGCGTTGCTT 


CCTATACTAA 


4080 


TCTCAACTAT 


TATAAATGGG 


CCATTTGTCT 


AAAAGAAAAC 


CCAGAGCAAG 


TAATAGGAGA 


4140 


TATCAGCATT 


GTTAAGATAG 


ACGAGGCTGA 


TTTAAGCTGT 


GAAATTGGCT 


ATGTGTTAGG 


4200 


CAAGGCTTAC 


TGCGGAAATG 


GTATGATGAC 


AGAGACTTTG 


AAAGCTATCT 


TCGACTTTTG 


4260 


TTTTACTCAA 


GCAGGTTTTC 


AAAAGGTCAG 


AGCACGTTAT 


GCCAGTCTCA 


ACCCAGCTTC 


4320 


AGGTCGTGTC 


ATGGAAAAGG 


CTGGAATGTC 


CTATCTACAA 


ACCATTCTTA 


ATGGTGTAGA 


4380 


GAGAAAAGGC 


TATCTTCCGG 


ATCTTATTTA 


TTATGGTATA 


AGTAGGGAAG 


AATGTTGAAT 


4440 


TCTATTTTCT 


GTTTCTATCG 


AAGTCAACTA 


TTTATTGTAA 


ATATAATAAT 


TAGCATTCCA 


4500 


AGTTTATTTG 


AAACTTTAAA 


ATACCATATT 


GATTAGTACA 


AGACAGATGT 


TCTACTTCCT 


4560 


TCTTTAATCT 


GGTTTAGTGT 


TAGTTAAAAA 


ATCGCTTTAA 


GCTTGTAACT 


AAGAGGGAGC 


4620 


TAATCGACTA 


GATTCTCCAG 


CCGAACAGGT 


GGTAATGTAC 


TTTTTATACT 


GTAATCCTAG 


4680 


CTGTTGTTAA 


ATTTAAAATA 


GAATCCTCTA 


TCGAGTTAGG 


GAATTAAATT 


CAACCAATTT 


4740 


TATTCATGTT 


TTTTCTATCA 


AATTATCTAA 


TATTAAAATA 


GTCTCATTCT 


GATGAGAAAA 


4800 


CTATTCCCAA 


ATCATTCATA 


CCTCTCTCAA 


CTAGATGTAA 


CTTACAAAAC 


CCCTGACCTC 


4860 


ATGAGCCACT 


TTCTTCCTCC 


TCATGAGGTC 


AGTTTTACTT 


TCTGCTGTTC 


CAGTATCGTT 


4920 


TTTCCTCGCT 


AGATTTCCTC 


AAAAGGGCAG 


ACTCCTCCCT 


TGGTGCGTCA 


CACGATTTTT 


4980 


TCATCTCGAC 


TGTTCTTTAA 


TGCATCATTA 


ACGACGCTTT 


TCTTCTAGGT 


CGTTCATAAG 


5040 


GAACAGGAAG 


ATTCAGGTTG 


ACTTTTCTAA 


TCCTAGAATA 


AAGTGCTGAA 


AACAATTCGG 


5100 


AATAGGCATA 


GAGACTAGAC 


AATTTGAGGA 


GCTGCTTGCG 


TCCTGTTCGA 


ACACATTTTC 


5160 


CCACCACGTG 


AAGAAAAAGA 


TGGCGGAAGC 


GTTTGATTGT 


TAAACTTTGG 


AAGTCACCTC 


5220 


CAGCTAGATG 


TTTGAGAAAA 


AGATAGAGAT 


TGTAGGCGAT 


ACAGCTCATC 


ATCATACGAA 


5280 


CTTCGTTTTT 


GATTAAGGTT 


GAACT 








5305 


(2) INFORMATION FOR SEQ ID NO: 13 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A» LENGTH: 3964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 



{xi ) 


SEQUENCE DESCRIPTION: 


CPA TH MO* 


1 ic . 

1 JO . 






TGGCAGCTCG 


TCGTCGTAAA 


GGACGCAAAG 


ttt TonpTcr* 

1 1 1 luuL 1 \J^w 


A I AATv-LTAAA 


f*f* ft ft iwn^m ft m 

CGAATTCTAT 


- 
60 


CAAAAATCAG 


TAGGAACTCG 


AGTCTACTGA 


TTTTTATTTT 


TGTAAAAAAG 


TTCAGTAGAT 


120 


GCAAATGGAT 


TCGGAAGCGA 


TGTTACACTA 


VjA I I lj/w\U I 


8. ^ ft. ft. ft ft /** 

AGAATAGTAC 


ACCTCTGTTT 


130 


CTAAAACATT 


GTTAGAAATC 


GATTTGACTG 


• \-w 1 \»r% 1 LuA 


TTTG TC CTGT 


TATTATTTTA 


240 


TTTTACTATA 


AAGTTGAAGT 


AGGTGGAGAT 




L AATCGTCTT 


TAAAGATGGT 


300 


TCAGCTATTA 


CAATTCCAGG 


AAATCAATTG 




A i WV_ AAAACiC 


ft^ft ft^ft^ft^^ 
ACAAGATAGC 


360 


ACTAAACTGA 


CTGCTGAAAA 


ATCAACTGTT 




/^•"Pf* ft ft m * /*»«T' 

L. 1 LAAAGAGT 


AG ATGT AAAA 


420 


GATATAACTC 


ATTTAACAGA 


TGAAGAAAAA 


u i i t\J\\j<j lib 


v_ i a i n I ALA 


AGCAAATGGT 


480. 


TCAGCATTAG 


ACGGAGCGAC 


AATCAATGTA 


CCTfZCZ A f! A f* 
vjW. l WiOAVjA * vj 


r ^ » tr* Aft/* 
L» TAL. AOCAAC 


AATCACATTC 


540 


CCAGATGCTT 


CAGTAGTGAC 


GATTCTAGGA 


AAnwn i Al» Alj 


iTmfif* ft ft ft ft 17*^* 


TCCGAAAGGT 


600 


GAATCTGTAA 


CTCAAGAAGC 


TACACCAGAG 


1 Al nnuL 1 AL» 


ft ft ft ft T* ft ft /"»('* 

AAAA TACACC 


AG GTGG AG AT 


660 


AAGGGAGGCA 


ATACTGCAAG 


CTCAGATGCT 


AATr;r*nAATr: 


AAVjL»L.Vj<j IULi 


T AGCC AGGCG 


720 


GGTGGATCAG 


CTCACACAGG 


TTCACAAAAC 


TCAGCTCAAT 


CACAACCTTC 


TAAGCAATTA 


780 


GCTACTGAAA 


AAGAATCAGC 


TAAAAATGCC 


ATTGAAAAAG 


CAGCCAAGGA 


CAAGCAGGAT 


840 


GAAATCAAAG 


GCGCACCGCT 


TTCTGATAAA 


GAAAAAGCAG 


AACTTTTACC 


AAGAGTGGAA 


900 


GCAGAAAAAC 


AAGCAGCTCT 


CAAAGACATT 


GAAAATGCGA 


AAACTATGGA 


AGATGTGAAC 


960 


GAAGCAGAAA 


CGATTGGAGT 


GCAAGCCATT 


GCCATGGTTA 


CAGTTCCTAA 


GAGACCAGTG 


1020 


GCTCCTAATG 


CTGCTCCTAA 


GACAACAAGT 


GCACCGCAAG 


CAACTGCAGG 


AACAATGCAA 


1080 


GATGTTACCT 


ACCAGTCACC 


TGCTGGCAAA 


CAATTACCTA 


ACACAGGTTC 


AGCATCAAGT 


1140 


GCAGCACTTG 


CTAGTCTTGC 


TCTAGTGGTG 


GCAACAAGTG 


GTTTTGCTTT 


GCTAGGAAGA 


1200 


AAGACTAGAC 


GTAGAAAATA 


GAACAGCTAG 


AAAATTCTAT 


TCTCTACTTA 


AAGTTAGATT 


1260 


ATAAGGGGGA 


TTTTGAGAAG 


TCATCAATCC 


TAGTGATGGG 


TGAGAAAAGT 


GAGAACCCAA 


1320 


GATAATCACA 


TACTTTAGCT 


GAATAGGAAT 


ATTCTATCAA 


TGTAGCCAAT 


CTCTTCTGTC 


1380 


TCTAACTGTG 


GAATAGGAGA 


TGGGCAATAT 


CGGATAGAAA 


AG A T AG C AG A 


ATAGCTCTCT 


1440 
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ATTGAAGAGA GGAGGGGAAA CCGAAAAATT AGGTGCCCCT CCTCTTTTTT GGTATAATAG 1500 

AAGATAGAAA ACGAGGTTAG AAGAGATGAT TTTTGATACA CATACACACT TGAATGTAGA 15 60 

AGAATTTGCA GGTCGTGAGG CAGAAGAAAT TGCCTTGGCT GCTGAGATGG GTGTGACACA 1620 

GATGAATATT GTTCGTTTTG ATAAACCGAC GATTGAGCAT GCCTTGGAGT TGGTAGATGA 1680 

GTATGAGCAG CTCTATGCGA CTATTGGTTG GCATCCTACA GAAGCTGGTA CTTATACAGA 1740 

GGAAGTTGAG GCTTACTTGT TGGATAAGTT AAAACATTCC AAGGTTGTGG CTTTAGGTGA 1800 

AATTGGCTTA GATTACCATT GGATGACAGC GCCCAAAGAG GTGCAGGACC AGGTTTTTCG 1860 

CCGTCAGATT CAGCTATCTA AGGACTTGGA TTTGCCTTTT GTTGTCCATA CCCGTGATGC 1920 

GCTGGAAGAT ACCTATGAGA TTATCAAGAG TGAGGGCGTT GGTCCTCGTG GTGGTATCAT 1980 

GCATTCATTT TCAGGGACGC TTGAGTGGGC AGAGAAGTTT GTGGATCTTG GTATGACCAT 2040 

TTCCTTCTCA GG AGTGGTG A C TTT T AAGAA GGCAACTGAC CTCCAAGAAG CAGCTAAAGA 2100 

GTTACCTTTG GACAAGATGT TGGTGGAAAC AGATGCGCCT TACTTAGCAC CTGTACCCAA 2160 

GCGTGGTCGT GAAAATAAAA CAGCCTATAC TCGCTATGTG GTCGACTTTA TCGCTGACTT 2220 

GCGTGGTATG ACGACAGAAG AGCTGGCCGT AGCAACGACT GCAAATGCAG AACGAATTTT 2280 

TGGACTGGAC AGCAAGTAAT GAAAGAGAAA ATTTCTCAAG TTATCGTGGT TCAAGGGCGT 2 340 

GATGATACGG TCAATCTCAA ACGTTATTTC GATGTGGAGA CCTATGAGAC TCGAGGTTCT 2400 

GCCATCAATG CTCAGGATAT ACAGCGGATT CAGCGCCTGC ACCAACGTCA TGGAGTCATT 24 60 

GTCTTTACAG ACCCAGATTT TAATGGGGAA CGGATTCGGC GCATGATCAT GATGGTCATT 2520 

CCAACACTTC ACCATCCCTT TCTCAAGCGA GATGAAGCTG TTCCCAAGTC CAAGACCAAG 2580 

GGGCCTTCTC TGGGAATTGA GCATGCCAGC TATGAAGACC TGAAAACGGC TCTAGCTCAA 2640 

GTGACAGAAC AATTTGAACA TGAGAGTCAG TTTGACATTA GTCGTAGCGA TTTGATTCGC 2700 

CTTGGTTTTC TAGCAGGGGC AGACAGCCGT AAGCCTAGAG AATATCTCGG AGAGACTCTC 27 60 

CGAATCGCCT ATTCCAACGG CAAGCAACTC CTCAAACGCC TAGAGTTGTT TCGGGTTACT 2820 

7TCGCAGAAC TGGAAGAAGC TATGAAATCT TATGAGTAGG AAAGATGTAG CCCTTACAA? 2880 

TTTTTAAGTT TCACAGTATT TTTCGAAGCA GGTAGAAGAG GAGCCGTCTG ATGTTAATTG 2940 

GTCAAAAAAT TAAAGAGATT CGGATAGAAA AAGGAATTAG TCGTCCAGAT TTTTGTGGAG 3000 

ATGAGCAAGA ACTGACAGTT CGTCAACTGT CGCGAATTGA AAGTGGAGCT TCGCAACCGA 3060 

GTTTGCCCAA GTTAGACTAT ATTGCTCGCC GGCTAGGAGT TCCAGTTTAT AGCCTTATGC 3120 

CGGATTTTTC AGCTCTTCCT TCTGCTTATT TAGAATTGAA ATACCAGATT TTACGTGAAC 3180 

CAATCTATGG TAAAGAAGAG GAGTACGATA AGAAGGAAGC GTGTTTGGAA GAGATTTATA 3240 
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3600 
3660 
3720 



3960 
3964 



921 

AAACATACTT TGATAATCTT CCTAAACAAG AACAATTAGC ATGTGAAGTA TTGCAGGCGT 3 300 

GTTTGGATAC TTCTACAACT AGAAGGCCTG AATATGCAGA GTTAATACTT GAGGAACATA 3 360 

TGCCTCAGAT TATAGAAAAA GAACCTTATT CAATAAATGA TATCTTGTTG ATTCGTTTGT 3 420 

TTTTTTATCA AATGCTCATT AGAAAAGATC TTCCCAAATT TATAAATCAA ATCGAAAAGC 3480 

TAATGCTCTT. TCTTTTGGAA CAGAAGAAGG TAACTCAAAT AGAGAATTAC TTTATAATTA 3 540 
GAGATACTCT TATTTCAGGA ATGTGTTGTC TTGAAAAGGT AGGAGTAACT GATTGTTTTA 
ATGATTATCT ATCGTGTTTA CAAGAAATTA TCCATAAAAC TCAAGATTAT CAAAAGAAAC 
CTCTTGTATT TATGTTTTTG TGGAAGCAAC CATTAAGAGA AGAAAGAGAT TTTAGTTTAC 

CTGAATCATT TTATCAGTCT TCTAAAACAT TTGCGCAGCT AATTGGAGAT GAATTTCTAG 3780 

TAAAGAAATT CACAGAGCAA TGGCAAGAGG ATGTCAAAAA ATATTTATAA ACATAGTGAA 3840 

TCAGTGACAA AGATCTCCTT GTCCTCGTAT CAAAACAGTT CTAAAGTTCG TCTTTAGGGA 3900 

TGTrrrrrrA gatataagct aaaaatgaca cgaaatggtt acattttaag gacattgatg 

TCCG 

(2) INFORMATION FOR SEQ ID NO: 13 7: 

<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12666 base pairs 
{B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

TGAGACCGTT ATTTGTATTA GGGAAATGGG TATCTATTTT TAATGCTGTG CCGATTTTGA 60 

TTGTTTCTAT TATTCAAACC AAAAGCTTGT CAGGTATTGG AGCAGGATTG TTTAATCTAT 120 

ATAACATTTC ATCTTATATA GGTGATTTAG TTAGTTTCAC TCGATTGATG GCATTAGGAT 180 

TATCTGGAGC AAGTATAGCA TCAGCTTTCA ATTTAATTCT TGGTTTGTTT CCGGGAATAT 240 

TGGCTAAACT GACAATTGGA TTAGTATTAT TCATTCTTTT ACATGCCATC AATATTTTTC 300 

TATCGTTACT ATCAGGATAT GTTCATGGAG CACGTCTGAT ATTTGTTGAA TTTTTTGGTA 3 60 

AGTTTTATGA GGGTCCACCA AAACCATTTC AACCTTTGAA GGCTTCTGAG AAATATATTA 420 

AGGTTATTAC AAAGAATTAA TGGAGGATAT ATATAATGGA ACATTTAGCA ACTTATTTTT 480 

CAACCTATGG AGGAGCTTTC TTCCCTGCAT TGGGAATTGT ATTGGCGGTT GGATTAAGCG 54 0 

GTATGGGGTC TGCTTATGGA GTTGGTAAGG CTGGGCAATC TGCCGCAGCT TTACTGAAAG 600 
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AACAGCCTGA AAACTTTCCC TCAGCTTTGA TATTGCAATT ATTGCCCGGA ACACAACGAT 660 

TATATCCTTT TGTTATTGGA ATTTTAATTT GGTTGCAATT AACTCCAGAA CTTCCTTTAG 720 

AAAAAGGCGT TGCTTATTTC TTTGTAGCTC TTCCAATTGC TATTGTAGGA TACTTTTCAG 780 

CTAAGCATCA AGGAAATGTA GCAGTAGCGG GAATGCAAAT CTTGGCTAAA AGACCAAAAG 840 

AATTCATGAA GGGAGCAATT TTAGCTGCCA TGGTAGAAAC CTATGCAATT CTTGCTTTTG 900 

TCGTATCATT CATTTTGACC CTTCGTGTAT AAGAAATAAA TTTGCAATTC AAAGGAGGTG 960 

TCTAAATCAG CAATTTAGAA AACTTACGAG AGTCTGTTAT TGAACAAGCT CATGAAAAAG 1020 

GGCGTATGAA ATTATTGGAT TCCAAAAAGA AGATTGATGA TGAATTTGAA ATGCAAAAGT 1080 

CGCTCATTAT AAACAAAAAA GAAGCTGAAC ATGAACGAAA GTTAAAAGAA TTGCAACAGA 1140 

AATATCAAAT AATTTTTCAA CAATTAAAAA ATAAGGAACG CCAATCAACG TTAGTATCAA 1200 

AACAGAAAAT ATTAAAACAA CTTTTTCAAT CTGCTTTACT AGAAATGGAA TCTTGGAGTG 1260 

CAGATAAAGA AATGGAGTTC ATCTATCGAA TTCTGGAACG ATATTCACAA CAAGAGGTCA 1320 

TAGTAACCTT TGGGGAACGG ACTTTAGCTA AATTCAATTT GGAACAATTA GAGAAATTGA 1380 

AATTCTCTTT TCCAAATTAT TTATTTACTC AACAACCTAT CTCAAATCAA TCAGGCTTAC 1440 

TTATTTCAAT AGGTAAAATT GATGATAACT ATTTGTATAA AACATTAATT GCATCGATTT 1500 

CTAAGGAAGA AAGTTCAAGT ATCGCAAATC AAATTTTTAT CAATTAAGGA TGAAATTGGT 1560 

TAATCCTTCT TAGAAATTTG GAGTATTCCA ATAAAATTAG AAAGGTATTT TATGGATACT 1620 

AATCTTTTTT CAAAAATAAA TACGACGATT TCGGTAAAAG AAAACGATTT TATTACAGAA 1680 

GAAAAATTTC AAAAAATTAT ACAATCCAAA GATACGGAGA CATTGGCATT TATCTTAGAA 174 0 

TCAACTCCCT ATCATTTATC GATTGACATC TTAGAAGATC CTAGTCAGAC AGAGATTTCG 1800 

CTAATGACAA AATTAGTCAA TGATTATAGA TGGGCCTATG CTGAAAGTCC GTCTGATATA 1860 

ATTGTGACTT TATTTGCTTT ACGATATCTT TATCATAATA TCAAAGTTTT ATTAAAATCT 19 20 

AAGGCGGCAA TTAAGAAAGA TTTTTCTAAA TTATTAATTC CAATAGGGAT TTTTGATATA 1980 

GAAAGTTTAA AACATTTAGT TTCTTCCTTA CATTCAGATA CACTTCCTGA TTTTATGGTT 2040 

CGTGAAGTAG AATCAATTTG GAATGAGTAT GAAACTTTTA ATAATATTCG TGTACTTGAT 2100 

GTCGGAGCTG ATCTAGCATA TTTTAAACAT CTGAAACTTT TATCTAATGA GTTAGATGAC 2160 

GTACTGTCTC AGGTTATTGT CGAAATGATT CACTTTTATA AT ATT ATT AC TGTAAAACGT 2220 

GGTTTATCTC AAAATAAGAG TCATGGGGAT ATTTTACAAT TACTTTCAGA TCAAGGAAGT 2280 

ATTTCTGCTA AAGAATTTAT AT AC ATT GT A GAAAATCAAG AAATATTTGT GTGGTTCAAT 2 340 

AAAATAAATC CAACC7TAGA TTCAATCTTT TCAACTTATG AATTGAAGAT GCAGGACGCA 2400 
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ACAATTTCAT 


■ CTTCTCACTT 


1 AGAATTTTTA 


. TGTGATTTAC 


TATTGTATAA 


i AACTTTAGAT 


2460 


CAAGGAAGCT 


ACAATGTACA 


GGGGCCGTTA 


GTTCTTGCTA 


GATATTTATT 


1 GGCATGTGAG 


2520 


TTTGAAGTAA 


AGAATCTCAG 


AATGATCATA 


TCAGCTCTTC 


AAAATACAAT 


• v-'w'w 1 X. 1 \J*\J\ 


"> s Art 


TCAATAAAAG 


AAAGGATACG 


CCCACATTAT 


GGAAGCTAAT 


AAGTATAAAA 


1 1 vJVJ^./\ 1 J\f\ 1 


9 a d n 


TGGTAGCCGT 


GAT ATT AT TT 


TACCATTTAG 


CATGATT GGG 


TTTGATATAT 


TTCCTGCCTA 


9 1 nn 


CCAAGAACAA 


GAAGCTATAA 


ATACACTAAG 


AAAATTAGCT 


CAATCTGATT 


ATGGTGTCAT 


t / OU 


TTATATCAC7 


GAAGACATTG 


CTTCAATGAT 


ATTAGATACA 


ATTCGCCATT 


ATGATTCCCA 


CO £.\J 


AGTTGTGCCT 


GCTATTATTT 


TATTACCGAC 


TCATAAACAA 


GGTTTAAATT 


TAGGATTAAA 




ACGTATAGAG 


GATAATGTAG 


AGAAAGCAGT 


AGGACACAAT 


ATTTTATAAT 


AATGTACAAA 


294 0 


ATTGTCTGTA 


ATATTATTCT 


ATAATTTTTG 


GACTTAGTAA 


GGAGAATAAC 


TTTGACTCAA 


1 ts f\ n 
JUUU 


GGGAAGATTA 


TAAAAGTATC 


GGGACCTCTA 


GTTATTGCAT 


CAGGTATGCA 


GGAGGCTAAT 


1 rt & rt 


ATTCAAGATA 


TTTGCCGTGT 


AGGTAAGCTA 


GGGTTAATCG 


GTCAAATTAT TGAAATGAGA 




AGAGATCAGG 


CATCTATCCA 


AGTCTATGAA 


GAAACATCTG 


GTCTTGGTCC 


GGGAGAACCT 


JlOv 


GTTGTTACAA 


CTGGAGAACC 


TCTCTCGGTT 


CAATTAGGGC 


CAGGATTGAT 


TTCTCAAATG 




TTTGATGGCA 


TACAACGCCC 


ATTAGATCGA 


TTTAAATTGC 


CTACTCATAA 


TGATTTTCTA 


J JUL) 


GTTCGTGGGG 


TAGAAGTTCC 


AAGTTTGCAT 


AGAGATATTA 


AGTGGCATTT 


TCATTCCACT 


J JOU 


ATAGCAATTG 


GTCAAAAAGT 


GAGTACGGGT 


GATATTCTTG 


GAACTGTCAA 


GGAAACCGAG 




GTAGTTAATC 


ATAAAATTAT 


GGTTCCTTAT 


GG ACT ATCTG 


GAGAAGTCGT 


TTCTATTGCA 


i j on 

JIOU 


TCTGGCGATT 


TTACAATTGA 


TGAAGTTGTA 


TATGAAATAA 


AAAAATTGGA 


CGGTAGTTTC 




TATAAAGGAA 


CGCTTATGCA 


AAAATGGCCT 


GTCCGCAAGG 


CGCGTCCTGT 


TTCTAAACGT 


JOUU 


TTAATTCCAC 


AAGAACCATT 


AATCACAGGT 


CAACGAGTTA 


TTGATGCATT 


CTTTCCAGTA 


3660 


ACCAAAGGGG 


GAGCTCCAGC 


AGTTCCTGGA 


CCGTTTGGAC 


CAGGAAAGAC 


AGTTGTACA* 


3720 


CACCAAGTAG 


CTAAATTTCC 


CAATGTTGAT 


ATTGTTATTT 


ATGTCGGTTG 


TGGAGAACGT 


3780 


GGAAATGAAA 


TGACGGATGT 


ACTGAATGAG 


TTTCCTGAGT 


TGATTGACCC 


TAATACCGGA 


3840 


CAATCAATTA 


TGCAACGGAC 


AGTTCTGATT 


GCTAATACTT 


CAAATATGCC 


TGTTGCTGCT 


3900 


CGTGAGGCTT 


CAATTTATAC 


AGGAATTACC 


ATGGCTGACT 


ATTTTCGTGA 


TATCGCCTAC 


3960 


TCTGTCGCCA 


TTATGGCTCA 


TTCAACTTCA 


CGTTGGGCAG 


AAGCGCTACG 


TGAAATGTCA 


4020 


GGACGTCTAG 


AAGAAATGCC 


TGGTGATGAG 


GGTTATCCTG 


CTTATCTCCG 


AAGTCGTATC 


4080 


GCTGAATATT 


ATGAAAGAGC 


AGGACGTTCT 


CAGGTTCTAG 


GGCTTCCAGA 


ACGTGAAGGA 


4140 
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ACGATTACTG CTATTGGAGC TGTATCGCCA CCTGGTGGAG ATATTTCAGA AC CAGTT ACT 4 200 

CAAAACACTT TACGGATTGT GAAAGTTTTT TGGGGGCTTC ATGCTCCGTT GGCACAGCGA 42 60 

CGTCATTTTC CTGCAATTAA CTGGCTTACA TCTTATTCAC TATATAAACA CAGTGTGGGC 4 320 

ACTTATATAG ATGGTAAAGA GAAGACAGAT TGGAATAGTA AAATAACTCG TGCGATGAAC 4380 

TACTTACAAC GGGAATCTAG TTTAGAGGAA ATTGTTCGTC TTGTTCGAAT TGATTCTCTG 4440 

TCTGATAATG AACGACTAAC GATGGAAATT GCTAAACAAA TTCGAGAAGA TTATTTGCAA 4500 

CAGAACGCTT TTGATTCGGT AGATACATTC ACTTCCTTTC CAAAACAAGA AGCAATGCTA 4560 

AGTAATATTC TCACTTTTGC TGATCAGGCA AATCATGCTT TAGAGTTGGG TTCTTACTTT 4 620 

ACAGAGATTA TGGAAGGTAC CGTGG CAGTT CGAGACCGTA TGGCGAGAAG TAAATATCTT 4 680 

TCAGAAGATA GATTAGATGA AATCAAAATT ATATCAAATG AGATTACACA TCAAATTCAT 4740 

TTGATATTAG AAACAGGAGG TCTATAAATG AGTGTTATAA AAGAATACAG AACTGCTAGT 4800 

GAAGTTGTTG GGCCTCTTAT GATTGTTGAA CAAGTAAATA ATGTGTCTTA CAATGAGTTA 4 860 

GTTGAAATTC AACTTCATAA TGGAGAAATT CCTCGTGGAC AAGTTTTAGA GATCCACGAA 4920 

GATAAAGCAA TGGTTCAGCT TTTTGAAGGA TCTAGTGGAA TAAATTTACA AAAGTCTAAA 4980 

ATTCGTTTTG CTGGTCATGC ATTAGAATTG GCTGTATCTG AGGATATGGT TGGTCGTATT 504 0 

TTTAATCGGA TGGGAAAACC AATTGATGGT GGACCAGATT TAATTCCAGA GAAATATTTA 5100 

GATATTGATG GTCAAGCTAT TAATCCTGTA TCTAGAGATT ATCCAGATGA ATTTATTCAG 5160 

ACAGGGATCT CCTCTATTGA TCATTTGAAT ACTCTTGTAC GTGGTCAAAA ATTACCAGTA 52 20 

TTTTCAGGTT CGGGCTTACC TCATAATGAA TTAGCTGCTC AGATAGCAAG ACAAGCGACT 5280 

GTTTTAAATT CTGATGAAAA TTTTGCGCTT GTATTTGCAG CAATGGGTAT TACTTTTGAA 5340 

GAAGCTGAGT TTTTTATGGA AGAACTCAGA AAAACAGGAG CCATCGATCG TTCGGTTTTA 5400 

TTTATGAACT TGGCAAATGA TCCTGCAATT GAGCGTATTG CAACTCCCCG CATTGCTTTA 54 60 

ACTCCGGCAG AGTATCTAGC TTTTGAAAAA GATATGCACG TTCTAGTTAT CATGACGGAT 5520 

ATGACTAACT ATTGTGAAGC GTTACGTGAA GTCTCCGCAG CTCCCCGTCA AGTTCCAGGG 5 580 

AGACGAGGCT ATCCGGGATA TTTATATACA AATTTATCAA CTCTATACGA AAGGGCTGGT 5640 

CCCTTAGTTG GTAAAAAAGG TTCGGTGACA CAGATTCCTA TTTTAACAAT GCCAGAAGAT 5700 

GACATAACAC ATCCAATTCC TGATTTAACT GGATACATTA CTGAAGGGCA AATTATTTTG 57 60 

TCGCATGAGT TGTATAATCA AGCTTATCGT CCACCAATCA ATGTTTTACC TTCTCTCTCT 5820 

CCATTAAAAG ATAAGGGATC TGGAGAAGGT AAAACTCGTG GAGATCATGC TCCAACTATG 5880 

AATCAACTGT TTGCAGCCTA TGCCCAAGGG AAAAAGGTTG AAGAGTTAGC AGTAGTATTA 5 94 0 
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GGACAATCGG CTTTATCTGA TGTAGATAAA TTGTATGTGA GGTTTACAAA GCGTTTTGAA 6000 
GAAGAGTACA TAAACCAAGG ATTTTATAAA AATCGAAATA TAGAAGATAC GTTGAATCTT 6060 
GGGTGGGAAT TACTATCAAT TCTTCCTAGA ACAGAGTTAA AACGTATCAA AGATGATTTG 6120 
CTTGATAAAT ACTTACCTTT GGTAGAAGTT TAATCCGGAA ATGGAGTGAT TATCTATGGT 6180 

ACGTTTGAAT GTAAAACCAA CTCGTATGGA ATTGAATAAC TTAAAGCAAC GTTTGACAAC 6240 

AGCTGAACGT GGACATAAGT TATTAAAGGA TAAAAGAGAT GAATTGATGA GGCGATTTAT 6300 

TTCTTTGATT CGTGAGAATA ATCAACTTCG GAAAGAAGTG GAAAGTTATC TAATTCATAA 63 60 

TCTAAAATCC TTTGCAGTTG CTAAATCATT AAAGAATTCT CAAATGGTGG AGGAATTATT 64 20 

TTCAATTCCA TCGAAAGAAA TTGAATTATT TGTTGAGAAA GAAAATATCA TGAGTGTAAC 6480 

AGTTCCTAGA ATGCATATGA ATATTACTTC TCAAAATGAG AACACTGAAT ACAGCTATTT 6540 

ATCTTCTAAT AGTGAAATGG ATGATGTATT TGCTACAATG AATAGTTTAA TTTATAAATT 6600 

ACTAAGACTG GCAGAAGTTG AAAAAACGTG TCAGTTAATG GCTGATGAAA TAGAAAAAAC 6660 

ACGTAGACGT CTAAATGGTT TAGAATACTC GATTATTCCA AACTTGTCGG AAACTATTCA 6720 

TTATATAGAA TTGAAACTAG AGGAGGCAGA AAGACCCAAT TTAGTTCGTA TTATGAAAGT 67 80 

GAAGTAGATC CTTTATTTAG ATTATTAATT AGATCAACAA ATATCAGCTT GGATAAGGCT 6840 

TTAAGCCTTT CTAACCTTTT TTTATTGACA GTATCAGGAT ATCTTTTTCA AAATTTTGGT 6900 

TTGTTAGATA ATGAAAATGT TTCTACTAAT CTAGATTTAG GATTAGTAAA TCGTAAATCT 6960 

AATTATATAG AAAGTAAGCG CG7CATAACA AGGTATCTAT CATTCATCGA CCTCCTCCTC 7020 

TATACTATTA GTAAAGTAAA ACTATTGGAG GATATTTTAA TGCCACAACC TATTGTTCCT 7080 

GTAGAGATTC CACAATCTCG TCCTTTTGAT TCTAAAAAGA GAAATGATAT TCTGCTTAAA 7140 

ATTCGTATTG GCAACCTTGA AGTAAGTTTT TTTCAATCTC TCAATCTCGA AATGGTAGAA 7200 

CAGCTTTTGG ATAAGGTGTT GGTCTATGAC AATTCATCTA TCTAGCCTAG GGGAGGTCTA 7 2 60 

TCTCCTGTGT GGGAAAACTG ATATCAGACA AGGAATCGAT TCACTGGCTT ATCTGCTTAA 73 20 

AACCCACTTT GAATTGGATC CTTTCTCCGC TCAAGTCTTT CTCTTTTGTG GTGGACGTAA 7 380 

AGACCGCTTT AAAGTCCTTT ACTGGGATGG TCAAGGATTT TGGCTACTAT ATAAACGCTT 7440 

TGAGAACGGC AGATTGATTT GGCTAAGTAC AGAAAAGGAT GTCAAAGCTC TCACACCAGA 7500 

ACAAGTAGAC TGGCTTATGA AGGGCTTTTC TATCACTCCA AAAATATAGT AGATTGAAAC 7560 

TAGAATAGTA CACCTCTGCT TCTAAAACAT TGTTAGAAAT CGATTTTACT GTCCTGATCG 7 620 

ATTTGTCCTG TTCTTATTTC ATTTTACTAT AAATCCATCA GAAAGTCGTG ATTTCTATTG 7 680 
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AAATGAGGAC TTTCTTTTTA TACTCATCTC CTTTCAAAAA GCATTCTAGT CCATCTCCGA 7740 

TTAACGATGG ACTTTATCAC CTCCTTCTCC AGTCCTTGTA TAACATCTTC GAGTTGATTC 7800 

ATGACATCTT CCAAAGTTTA AAAGGCTTTA TTCTTAAATC CACGTTTACG AATCTCTTTC 7860 

CACACTTCTT CAATGGGGTT CATCTCTGGT GTGTATGGAG GAATAAATGC AAAGCCAATA 7920 

TTAGTCGGAA TCTTTAAGGT ACTTGATTTA TGCCATATAG CATTGTCCAT AACGAGTAAA 7980 

AGATAATCAT CTGGATAAGC TTGTGAAATC TCCTATTCCT AAAGCCCCTT TAGCGCATAA 8040 

CTTTGGCTCA GCTTCTATTA TCGCTCACAC CATCCATCAG AAGTTTAATC TGAAGGTACC 8100 

CAATTATCGC CAAGAAGAAG ATTGGCCTAG GATGGGTTTA CCAATCACAC GTAAGGAAAT 8160 

CTCTAATTGG CATATCAAGG CGAGTCAATA CTATTTGGAG CCCCTTTATA ACCTCTTCCG 822 0 

AGAGAGACTA TTGACTCAGC CCTTACTTCA TCCGGATGAA ACTTCTTATA GGGTGCTAGA 8280 

GAGTGATAGT CAGCTGACTT ACTATTGGAC TTTTTTGTCA GGTAAAGCAG AGAAACAAGG 8340 

GATTACGCTT TACCACCATG ATCAGTGTCG AAGTGGTTCA GTAGTACAAG AATTCCTAGG 8400 

AGATTATTCT GGCTATGTGC ATTGTGATAT TTTGCGGCAG TAACTTAGGA CTTTAGTCCT 8460 

CTAGTTCTGC CTATGCGATA GCAGTCCAAG GTTTAGGAGC AAGGCGACCC TAAGCTTGGT 8520 

AAACTTCCAA CCGCTCGTCT GCTTATCGTC AACTGGAAGA AGCTGAACTT GTTGGATGTT 8580 

GGGCGCATGT GAGAAGGAAG TTTTTTGAAG CGCCCCCCCA AGCAAGCGGA TAAATCATCC 8640 

TTAGGACCTA AAGGTTTAGC TTATTGTGAT CAGTTATTTT CCTTGGAAAG ACACTGGGAG 8700 

GCTTTGCCAG CTGATGAACG ACTACAGAAA CCTCAAGAAC ATCTCCAGCC CTTAATGGAA 87 60 

GACTTCTTTG CTTAGTGCCG GCGTCAGTCA GTTTTAGCAG GTTCAAAACT AGGAAGGGCA 8820 

ATTGAATACA GCCTCAAGTA TGAAGAAACC TTTAACACCA TTTTGAAAGA CGGACATCTG 8880 

GTCCTTTCCA ATAATCTAGC TGAACGCGCC ATTAAATCAT TGGTTATGGG ACGGAGTAAA 8940 

AGAGTCCAGT GGACTCTTTT AGCCTAAGCT CACTTTAAAA AAGCGAGGGT GGTTATTTTC 9000 

TCAAAGTTTT GAAGGAGCTA AAGCAAGAGC TATTATTATG AGTTTGTTGG AAACAGCTAA 9060 

ACGTCATCAA TTAAATAGCG AGAAATATCT ATCCTATCTT CTAGAATGTC TTCCAAACGA 9120 

GGAAACTCTC GTAAACAAAG AGGTTTTAGA GGCTTATTTA CCATGGACTA AAGTTGTACA 9180 

AGAAAAGTGC AAATAAGAAA TCTCCAGATT AGGAACTATC CGTGAGTTCT CCAGTCTGGA 9240 

GATTTTTCAA TAGACTTCCT GCGAAACAAA ATATGGTATA ATAGTTCTAT GAATGATGAA 9 300 

GCAAGTAAAC AACTAACCGA TGCACGATTT AAGCGTCTTG TTGGTGTTCA ACGCACGACT 9 360 

TTTGAAGAGA TGTTAGCTGT ATTAAAAACA GCTTATCAAC TTAAACACGC AAAAGGTGGA 9420 

CGAAAACCTA AATTAAGTCT AGAAGACCTT CTTATGGCCA CTCTTCAATA TGTGCCAGAA 9480 
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TATCGAACTT ATGAACAAAT TGCGGCTGTT TTTGGTATTC ACGAAAGCAA CTTAATCCGT 9540 

CGGAGCCAAT GGGTTGAAGT AACTCTTCTT CAAAGTGGTG TTACGATTTC AAGAACTCCT 9600 

CTCAGTTCTG AGGACACGGT AATGATTGAT GCGACGGAAG TAAAAATCAA TCGCCCTAAA 9660 

AAAAGAATTA GCCAATTATT CTGGTAAAAA GAAATTTCAC GCTATGAAGG CTCAACCGAT 9720 

TGTCACAAGT CAAGGGAGAA TTGTTTCTTT GGATATCACT GTGAACTATT GTCATGATAT 9780 

~ ~CAACTTGTTC~A^^GTC~^^ 99 7 0 

TGGTTATCAA GGGCTCATGA AGATATATCC TCAAGCACAA ACTTCACGTA AATCCAGCAA 9900 

ACTCAAACCG CTAACAATTG AAGATAAAGT CTATAACCAT GCCCTATCTA AGGAGAGAAG 9960 

C AAGGTTG AG AACATCTTTG CCAAAGTAAA AACGTTTAAA ATGATTTCAA CAACCTATCG 10020 

AAATCATCTA AACGCTTCGG ATTACGAATG AATTTGATTG CTGGTATTAT CAATCATGAA 10080 

CTAGGATTCT AGTTTTGCAG GAAGTCTATT ATCAAAAATA CCATCAAGAT TATATAAGAT 10140 

TGATACAGGA AAAGTTTTAT TTGATGGTGT AAATATTAAT CAAATAGATA AAAAAATATT 10200 

AAGTCAAAAT TTACGAGTAG TTCCACAGGA TTCATTTTTA TTGAACCGAA GTATTCTTGA 10260 

TAATATAACT TTAAAGCACG AAGTTACTTC ACAAAAGATA GAGGAAGTTT GTAAAGCAGT 10320 

TCAAATCTAT GATGAAATCA TGCCTATGCC GATCAAATTT AATACTATCA TCTCAGAGAT 10380 

GGGGTCAAAT ATTTCAGGTG GGCAAAGGCA ACGGATAGCA CTGGCACGTG CATTAATAAA 10440 

TAATCCTAGT ATTGTAATTT TAGATGAAGC AACTAGTGCA TTAGACACTA TTAATGAGGA 10500 

AAGAATAACA AAGTATATAC AAAGTCAGGG CTGTACTCAA ATAATTGTAG CTCATAGATT 10560 

GTCAACGATT AAGGATGCGG ATGTTATTTT TGTAATGAAA GGTGGTAAGA TTGTTGAGTC 10620 

AGGAAATCAT AACTACTTAA TGGATCTTCC TGGAGAGTAC TACAGCTTAT ATACAAAAAG 10680 

GAAATGAGGT GTAAAGAAAA TGAAGAAAGA AAATGAATAT GTAATTTTAA CAACAGCCTC 10740 

ACTAGGGGTG ATGATTGGAA TAGTGTTTGC AATTTTTTTA GATTTTCCAG TTGAATATGG 10800 

TATTTCTTTA GGCTTGTTGA ATGGAATAGT ATTGGGTTCG CTGATTGTTT ACAAAAACAA 10860 

TAAGAATTAA CCATAATTTT TTGCTGTAAA CTAAGGAGTA GAGATGGCTA TAGTTGAAAT 10920 

TATAAATCTA ACAAAAAGCT TTAAAGATAT TGAAGTTATT CATAACACTT AAATAATAGA 10980 

GCAACTACAG TAGTAGC7TA AAAACATGAT TAAATCGCTA TTCTTACGAC TAGCGGTTTT 11040 

TCTTTTTGTT TAATACTCTT TGAAAATCTC TTCAAACCAC GTCAGCTTTG CTTTACCGTA 11100 

CTCAAGTACA GCCTGCGGCT CGCTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATAAAA 11160 

AGGGTCAAGT AAGTATAGTA AATTGAAATA AGATATGAAC AAATCGATTA GAAAAGTCAA 11220 
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ATTAATTTCT AGAAATATGT TAGAAATTGG TTTGAATTCC GCAATCAATT TGTTCAGTTT 11280 

TTATTTCATT TCATTTTATT TAATTAGATT TTCCAATTTT TTAATTCAAG CTAAAAATCC 1134 0 

CCAATCGTAG TGATTGAGGA TTGAGTAAAT AAATCTTAAA CAATACCTTG TGCAATCATG 114 00 

GCATTTGCTA CATTTTCAAA GGCAGCAATG TTAGCTCCTG CAAGGTAGTC TTTATCAAGA 114 60 

CCGTATGTTT CTGAAGTCGT TTTACCTGTG TTGAAGATGT TTGTCATGAT GTCTTTGAGA 11520 

CGGCCATCAA CTTCTTCACG AGTCCATGAG AGGCGAAGAC TGTTTTGGCT CATTTCAAGA 11580 

GCTGAAACGG CTACACCACC AGCGTTGGCA GCTTTTGCAG GTCCGTAGAA GAT AC CAT TT 11640 

TCTTTGTAAA CTTTGATGGC ATCAAGGTCG CTCGGCATGT TGGCACCTTC AGATACACAG 11700 

ATAACGCCTT GAGCAACCAA ACGTTTAGCT GCTTCACCGT TGATTTCGTT TTGAGTGGCA 11760 

CATGGAAGAG CAATGTCATA GTTTCCAGCG TAACTCCATA CAGTACCTTC GTGGTAGGTT 11820 

GCAGTTGCTT TTTCAGCTGC ATACTCAGTC AAACGAGCAC GACGTTTTTC TTTAACATCA 11880 

ACCAAAAGAT CGAAGTCGAT ACCATTTTCA TCGATGACAT AACCATTTGA GTCAGAAACA 11940 

GAAATAACAG TTGCACCGAG TTCAGTTGCT TTTTGAAGAG CATATTGAGC AACGTTACCA 12000 

GAACCTGAAA TAACGACTTT CTTACCAGCA AAGCTGTTAC CGTTAGCTTT GAGCATTTCT 12060 

TCAGTATAGT AAACCAAACC GTAACCAGTT GCTTCTGGAC GAATCAAGCT ACCACCAAAT 12120 

CCAAGAGGTT TACCAGTCAA GACACCAGCA TCAAATTGGT TAAGACGTTT GTATTGACCG 12180 

TAAAGGTAAC CAATTTCACC TCCACCAACA CCGATATCAC CAGCAGGTAC GTCAAGTGAT 12240 

GGTCCGATGT GTTTTTGCAA TTCAGTCATG AAGCTTTGGC AGAAGCGCAT CACTTCAGCA 12300 

TCTGTTTTAC CTTTAGGATC GAAGTCTGAT CCACCTTTAC CTCCACCGAT AGGAAGTCCA 123 60 

GTCAAGACAT TTTTAAAGAT TTGTTCAAAT CCGAGGAATT TCAAGATCCC TTGGTTTACA 124 20 

GTTGGGTGGA AACGAAGTCC ACCTTTGTAT GGTCCAACAG CTGAGTTGAA TTGAACACGG 124 80 

TAACCACGGT TTACTTGAAT TTTTCCATCA CGGTCAACCC AAGGAACACG GAAAGAAACC 12540 

ACGCGCTCAG GCTCAGTAAT ACGTGCCAAG ATATTTTCTT CGATATACTC AGGGTGTTTT 12 600 

TCAAATACAG GTTCTAAAGT GTTGAAAAAT TCTTCAACAG CTTCGAGGAA TTCAGCCTCG 12 660 

TGCCGG 12 666 
(2) INFORMATION FOR SCO ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 083 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


138 : 






AGCAACTGTT 


GTGAACCAAT 


TCCGATAAAT 


' TCCAAGAATT 


' GGTTAATAGA 


i GCTATTTTOA 
' vjLLnl I 1 1 Ljrt 


o u 


CCAAAAATCC 


CGATAAAAGC 


ATAGGCTTTA 


. AGGAGCAAAT 


' TGATCCAGGT 


AGPAAGflATA 


i z u 


ATCAGCATGA 


GCCAGACTTG 


ACGGTGTTTG 


AGACGGGTCA 




V-vj I LUoA 1 AM 


i a a 
Lou 


CTGATAAGCA 


GTGCCACAAA 


CGTCACAATG 


CCTGCATAAA 


vjv»rtL i unu 1 1 


LiAAAl 1 LA 1 1 


*) A A 


TTAAGATAGG 


TCAAGTTTTG 


TGACGCAAAG 


TAAGATTTGT 

* *W»^J*» 4 4 4 \J 4 


A At^PT^tP^^ a a 


AlI l?AAl I Uj 


iUO 


CCTTCGATGT 


TGAAAAAGGA 


TTGACCGAAA 


ATCAAGACCA 




a af* a 
1 AlAAAL?A(jL 


J bu 


GCAATCCAAA 


GCATGTAGGG 


TACTACAAAG 


AGTTTAGAGC 


1 Iwl i 1 11.1 L 


CATCTCTTTC 


4 20 


CTCCTCGATT 


GCATTGATCA 


AACCTGCTTC 


TTGCTCTTCG 


ATTTPT A C fZT 


At ILL 1 LAA 1 


480 


ACGAGCATCG 


AACTCTTCTT 


CGGTTTCATT 


GAGACGCATG 


ATfVTftfiATfyP 
a i Li l Ij*ja lol 


CTTCTGGTTC 


54 0 


AAAGTCCAGA 


CCGATTTCCT 


CACCCACGAT 


AGCCTTACGG 


GTTCACTGGA 


ILMlLLnl IL 


£. ft rt 
OUU 


ATTTCCAAGT 


TCGTCATAGG 


CGATAATTTC 


ATAATGAACT 


CPAfWIAA A A 


vjl 1 t»tjwiTATt. 


660 


GACCTTAACT 


TCGAGCTTGC 


CTTCTTCAGG 


AAGGGTAATG 


LVjL/W\\j ILL 1 


L 1 LnjALLaAA 1 




AACGACCTCA 


ACAGGTTCAT 


TTGCCTTCAT 


CCCACCATCA 


ACCGC*TTrAA 

nLLL>L 4 1 (.An 


/MjLLj ill L>LL 


t a ft 


GTTAAATTCG 


ACCAAGTAGT 


CCTCAATCAT 


GGTACCTGGC 


AAGATGTTTC5 

n^>u^\ 4 ^ 4 4 4 VJ 


APTPPPfVlAT 
nV. 1 LLLLuA 1 


04l* 


AAAGGTGGCA 


ACAAAGTGGT 


TGATTGGCTC 


ATCGTAGATG 


TPCAPAfifWt 

1 LLrtvftljVArtj 


Trrr an err* 

i l LLAuAL I L* 


an a 


GACAATCTCG 


CCATCATTCA 


TAACGAAAAT 


CCAGTCACTC 


A TCfIC A Art ag 


L 1 1 L X . LL 1 Vj 


o £ n 


ATCGTGAGTG 


ACAAAGACAA 


AGGTAATGCC 


CAATCGTTGT 


TGTAA.TTfAr 


PPAATTPPTA 
L»L rW\ i i L w 1 A 




CTGCATGTCT 


GTTCTCAATT 


TCAAGTCCAG 


CGCTGATAAA 


GGPTPG TCP A 
w^. 1 LLi i v_L A 


ALAALjALLAl 


1080 


ACGGGGTTGG 


TTGATGATAG 


CACGGGCGAT 


GGCCACACGC 


TGACGTTGTC 


p*** pp ag a a a p. 


1 1 ACi 


TTTGCGGATG 


GAACGTTTTT 


CATAACCTTC 


CAACTGAACC 


ATCTTGAGAA 


PTTPPP. fT A P 
L 1 1 LLVJL 1 AL 


1 

i £UU 


ACGCTGCTCG 


ATTTCTTTCT 


TATCAATTTT 


ACGCAAGCGA 


AGTGGAAAGG 


CAACATTTTC 


1260 


AAACACATTC 


ATATGTGGGA 


ACAAGGCATA 


GGATTGGAAG 


ACGGTATCTA 


CGTCGCGCTT 


1320 


GTTGGTTGGA 


ATATCATTGA 


TACGAACACC 


GTCTAGCATG 


ATATCTCCTG 


TCGTCGCATC 


13B0 


CAGTAAACCT 


GCAATAATGT 


TTAGGATAGT 


TGATTTCCCC 


GAACCAGATG 


CACCTAGAAG 


1440 


GGTGTAGAAT 


TTCCCTTCTT 


CCAACTCAAA 


GTTGATGTCT 


TTGAGAACCT 


TGGTGTTGCT 


1500 


GTCTTCAAAA 


ACTTTAGAGA 


CGTTTTTGAA 


TTCGATAATT 


GGCTTTTTCA 


ATTGGCATAA 


1560 


ATTCCTTCTT 


TTTCATAGAT 


TAACCGATCG 


GGGCTCTGTC 


AGGTCCCCAC 


TACCTCTTGC 


1620 


AGGGAGTAAA 


ACCACCTGCA 


TACATCTTCG 


CTACCGATAG 


GCTTTCACCC 


AAGATCCGGA 


1680 
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CTTCTCTTTC AAGCGTAATA CCTCAGTCTT CCTTGACTTT TTCGATAACC CATTGGATCA L7 40 

AGTCCTCCTA GTCTTTCGCC GTTCCATCTG CGACATTGAT CATAAATCCT GCATGCTTTT 1300 

CTGACACTTC TACGCCACCG ATACGATAGC CTTTCAAGCC AGCTTCTGAA ATTAACTGAC I8 60 

CTGCAAAATG CCCGACTGGA CGCTTAAAGA CCCAGCCACA AGATGGGTAT 7CCAAAGGTT 1920 

GCTTGAGTTC ACGTAGGTGC GTCAAGCGGT CCATTTCCTG CTTGATAACC TGATGGCTTC . 1980 

CTGGAGCTAG GGCAAATTTA ACTGACAAGA CAACTGCACC AGACTCCTGA ATAGCTGAAT 204 0 

GACGGTAACC AAAAGCCAAG TCTTTAGCAG ACAGGGTTTC GATTTCTCCA TCCTTGGTCA 2100 

AGACCTTACA AGACTGCAAG ATGTGAGCAA TCTCGCCACC ATAGGCACCC GCATTCATAA 2150 

AGACAGCACC GCCAACGCTT CCTGGAATAC CACAAGCAAA CTCAAACCCA GTTAAACTAT 2220 

GACGGAGGGC AATGCGAGTT GTTTCAATCA AGTTACCCCC AGCTTCTGCT TCAATGGTAT 2 2 80 

AGCCATCAAC AGAAACCTTA TTGAGCTTGT CACACAAGAT GACAAATCCA CGAATCCCAC 2 340 

CATCACGAAC CATGATATTC CTTGCATTGC CAAGAACCAT CCAAGGGATA TTTTCTTGGT 2 400 

TGGCAAATTT CACAACGCGA CCCAACTCAA AACGATTTCG TGGAAAGACC AAATAATCAG 2460 

CCTCTCCACC TACTTTTGTA TAACTATAGC TATGCAAGGG TTCCTTAAAA CGGATATCAA 2520 

TTCCTTCTAA GATTTCAAGC ATTTTTTCTC TTACAGACAT GTCACTCTTC CTTTTACAAA 2580 

ATTCATTCCA TTATACCATT TTTAGAGACA TTTGACGACC ATAAAAATAC CTTGTTTGGA 2640 

TTTTGCATAA GAAAAAGAGC TTCCCCCCTT TTTATGATTT TTTACAAAAC ATTTCCTTGC 2 700 

TTCCATAGGC GACCAGAACG AGCTCCAGTG CTACAATCAC TTCAACCAAG ACTGGATTTG 2 7 60 

TCAACCAGCC TACTTGGAAA AGAGATGGTG CCAGATCAAA GAAGGCATGC AACCCATACG 2 820 

CTGCTAGGAG ATAAATCCAT TTCTTCTGGC CAACACCTTC GTAAACCCAA ACTGTCAAAA 2830 

GTAATTGGAA ACCAAGCGCC AAGATTCGCT CAAAACCAAG CAAATAAATC TGCCAGACCG 2940 

AAAGTGACTG AATGCTTTTT AACATATTTT CAGACAGTAA TTGCA PAACC TGTCCATTC: *>:00 

GAGTTTGAAC TGCCGAAAGA ACAATGTAAA GATTGAGTAA ACTAGTAAGC CCTAGAAAAA 3060 

TCAACTCCAA GCCACCATGC CCC 308 3 

(2) INFORMATION FOR SEQ 10 NO: 139: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 15363 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
( O) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
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CCGGAGGATA 


TTC A Cf* A fT A 


<wV>/UW\uLAu 


GgGGAAAATC 


/** A A A TV* A It 

GAAATCAACC 


AATAGTAGCC 


60 


TACTGCGACA 


V- i VjO i v. AAV- 1 


C A f~T A TPTr a 
v-AC lAiLI GA 


TGLT I GATAA 


T AATGC AAAA 


AACCTTTTAA 


120 




T , /" , T» A <r»/^ a f^fT* 

1 V- I ATwAUWT 


CTTTCCACCA 


CTTTTTCATG 


TCATACTCCT 


TCACTTATAA 


180 


TCTTATACTC 


A A r H/ p * X A A A 

AATGAAAATC 


AAAGAGCAAA 


CTAGAAAGCT 


AGCCGCAAGC 


TGCTCAAAAC 


240 


A<» i Vj 1 I i i GA 


GGTTGTAGAT 


AAGACTGACG 


AAGTCGATCA 


CATACATACG 


GTAAGGCGAC 


300 


Gv. * G ACGTGG 


iTwf»m^ ft ft ^ m ^ ft 

TTTG AAG AG A 


TTTTCGAAGA 


GTATTAACTA 


ATTTCTTCTT 


ACCAATTCCA 


360 


V-LA Inl LA 1 A 


CGG TAGGGT A 


TTGGCAGCTT 


CCTTCAAGGA 


ATAGTTCTCT 


AACTTATTTA 


420 


LA I I i lUi GG 


TAATTTCTTG 


GCATACTTAG 


TCGT AATCAA 


TCGTTTTTCT 


TCGTATTCGA 


430 


AAA Tf* A A 
AAA 1 LAAL LT 


GCGCTCCAGA 


TAATAGCCTC 


TCAGCATTTC 


ATCGATATTG 


TTGGGTTTGA 


540 




AACCCGTTC G 


ACAAAGGCAC 


CACTGCTGAT 


AATAGCTGTT 


TCTCCAAGAC 


600 




A T A A A A /T» A 

L AT AAAACT A 


ATC AAAG AGC 


^ffi ^ ^^*ift & a ft a 

GTCTGTAGAC 


TCCCTTCAGG 


TTTTCCAAAC 


660 


III UaA I AAT 


CATCTCTGTA 


TTGGCAAGAT 


AGAGCTCTGC 


AATTTGGTCA 


TAATCAAGAG 


720 


L aLvjOAuAC G 


GCTTTGCTCC 


TTGTTCTTCC 


AGCTACGGAA 


GGTCTTTCCG 


AGAGTAAAAA 


780 


v.1 I v. A 1\jAA(j 


A A A A A ^S" a *n 

GAGAAAACG 4 


A A A ft fYt^^ffl^* ft 

AAAATCCTCA 


AGGAAACAAG 


AAAATAATAG 


GTCAGTCTTG 


340 


AG GG AAGTTT 


ACGATTGATT 


CCTTGTTCTA 


f9k ft iMIMMI/^ ft A 

TATTTTTCAG 


ATAACGTTGG 


TAAACTCGGT 


900 


AAGCACGATT 


GCTAATGTTC 


CCCTCTTCAT 


AGGCCTGTTC 


CAAACCATCA 


CTTTCAATAC 


960 


* AAG AATCAA 


GAGTTTCAAA 


GCAGCCCAGT 


CTTCTTGATC 


ATCCTGGTTT 


TCTTCCCTTA 


1020 


AAA 4 GAG ATT 


TTCAATACGT 


C CAT GAT AAT 


TGTCAATAGC 


CGCATAGAGG 


GGAAGTTTAT 


1080 


- l v. rGGTG TC 


TTCCAACTCT 


TTTTCCAACT 


CTAGCGTTAC 


TTCATTCAAA 


ATGCCGATAT 


1140 


vrt-A i AAGAT A 


ATCCTTGCTT 


TCTTCCTCTT 


^ 1ft ^ ft ^ ft ft ft v - * 

CATCAGAAAG 


ATGAGGCAAG 


ACCAAGAGAC 


1200 


G 1 G 1 I AAAAA 


G CT AACAAGC 


ft ^ ft a ami a 

GTCACACCTG 


CAACAAGGAA 


AAGCAAAAGA 


GGATACTCCT 


1260 


GTTCTAGATT 


ACTTGGTATC 


AAGAGAATCG 


TAGCAATCGA 


CACCGTTCCC 


TTAACACCTG 


1320 


AAAAGGTCAA 


GAGAAACATG 


TCCTTCATAT 


ACTTATTTAG 


CTTTTTCTTG 


AGGCGTCGGG 


1380 


TTCTATAGGC 


ATAATAGCCA 


TAGATCATAA 


TAAAACGAAT 


GACAAAAAGG 


ACAAAGGTAA 


1440 


GGGCGATAAG 


AG AT AG CAAT 


AAAAGTAGAG 


GATTATAGAT 


TGGATTGGTC 


AAGATAGGTT 


1500 


CTGCTATCAT 


TTCCAACTCC 


ATCCCTAAAA 


TCACAAAGAC 


AGAACCGTTG 


AGCATAAAGG 


1560 


TCACTGTATG 


CCAGACCGTC 


TCGGTCACCG 


TATCCACTTG 


GGCTTCCACG 


ACCGTCATTT 


1620 


TCTTGAAGCG 


ACTTGCCTTT 


AAAATTCCAG 


CAACTACGAC 


GGCAATAATA 


CCTGAAACAT 


1680 


GAACTTCTTC 


TGCCAGAAAG 


AAGGTCACTA 


GAGGCAAACT 


CAATTCTAAT 


AAAAGTTCAC 


1740 
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TCGCAATATC CCTTGCGCGC ACACTTAGCA AGAAGGTATG GAGGAAGCGG TTGGTCATGG 1800 

CTGTTAAAAA TCCAATTAAA AAACCGCCTA GGATTGAAAA GATGAGCGAA CTGCTAGCTT 1860 

GCCCCAGAGA AAAAGCTCCA GTTCTCCAAG CTGTCAAAGC TACCTGAAAA CCCACCAAAC 1920 

CAGAAGCATC ATTCAAGAGT CCTTCGCCCT TAAGAATATT GGACACGCGC TTAGGAAAGC 1980 

TAAAACGCTC CGAAAGAGAG GCAAAGGCCA CCAAGTCCGT AGGACCAAGG GCTGCCCCAA 2040 

CAGCCAAGCA AGCTGCCAAC GGAAGGCTGA ACCAAAGAAG ATGGGCCAAG CCACCCAAAC 2100 

TCAGGGTCGA GATAAAAATC ACTGGAAATA TGAGATAAAC AATGATTCGC CAGTGTTTTA 2160 

AAATAGCCGT AACATCTGCT TCTTCAGCCT CTCGGAAAAG CAAGGGTCCG ATAACCAGTG 2 220 

CCAAAAACAA CTCCGTATTA AGGTGAAAGT CAGTATTGCG TAAAAAGAGA CCAATCACAA 2 2 80 

TTCCCAAAAG AATTTGCACC AAAGGGAGAG GCAAAAAGGG CAGGAGCTTA TTGGTTGTAC 2340 

TTGACACAAT CAAAACCAGT AAAAATAGGA TGAGGTAAAT CAGTAATTCC ACCCACGTCC 2400 

TCCTTAATCT TTTTTACAAC AGGATTCAAA TATCTCCTTC TCCTCTTTGA TTTTTTGGTC 2460 

AATCTTGGAA CAGTCTTTGT GCTCAATTTT TCTCTGGCAC CGTTCCATTT CAAGACCAAC 2520 

TAATTTTTTC TTGATTTTAA GCATTTTTTT GCTCATATGC GCTTGGTCTA GCACGCCCAT 2580 

CCCTCGTTCC TGGTGGGTTG ATTCAACAAA ATTCTGGCGC ATGGCATCCA GCTTTTCGTG 2 640 

TAAGTATTGT TTATCCATCT CTGTATCTCT CTAATTTTTC AATCATCACT AAAAACGGCG 2700 

GGTTGTTCAC TTGGTTTAAA GTTCGGTAAA TGGCAGCTGT GTACTCTTGT TGCTTCAACT 27 60 

G G ATC AC AAA ATCCAAGACA GCATCTC7CT CGAGATCGCC TCCTTCATGA CCATAGTAAA 2820 

TCATAATAGC AATTCGTCCA CCTTTGACAA GTAAGCCACA TAGCTTTTCT AATGCCTCAA 2 880 

TCGTTGTCTG CGCTCGGGTG ATGACAGACT TATCAGCTGC CGCCAAATAG CCCAGATTAA 2940 

AAATCCCTGC CTTAGCTTTT ATCACAAACT GGTCCAGTGT CTCATGCCCT TGCAAGATTA 3000 

ACTGGGCATT TGTCAAGTCA GCCTGATGCA AACGCTCTTG GGTCTTTTCC AAGGCTTGCT 3060 

TCTGAATATC AAAGGCATAG ACTTGCTTGG CTAGCTTGGC TAAAAAAAGC GTGTCATGAC 3120 

CATTTCCCAT AGTCGCATCC ACTACGACAT CCTCTTTTGT CACGACCTCA CCCAAAAAAT 3180 

CATGTGCCAT CTCAAGTGGT CTTTTCATTT TCAAACTCCT GTTTTACAGC CTTGCATCCT 324 0 

TGAACACTTC CACGACGTCG CATCTCCATC TCAATGCTGT TGAGGACTTC CCATTTATTG 3300 

AGGCTCCACA TAGGACCAAG CAGCATATCC CTAGGCGCAT CTCCTGTAAT TCGATGGATG 3 360 

ACGATATGTT TGGGAATAAT TTCCAGTTGG TCACAGATGA CCCTGACATA TTCCTCCTGA 34 20 

CTCATCAATT GTAAACGCCC CTCATGGTAA TCTCGTTGCA TACGAGTATT TGTCATAAGA 34 80 

TGGAGCAAAT GCAGTTTAAT CCCTTGAATA TCGTTATCCC TGACACAACG GCGGACATTT 3 540 
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TCAACCATCA 


TCTCATGGGT 


TTCACCAGGC 


AAACCATTGA 


TCAAATGGGA 


AACAATCTCA 


3600 


ATTTTTGGAT 


ACTTTCTCAA 


ACGCTTGACC 


GTTTCCACCT 


ACAATTCATA 


AGAATGCGCA 


3660 


CGGTTAATCA 


GGTCAGAGGT 


TGCTTCATAA 


GTAGTTTGCA 


AGCCCAATTC 


AACCGTCACA 


3720 


TGCATGCACT 


CCGATAACTC 


AGCCAAATAT 


TCGATGGTTT 


CGTCTGGTAA 


ACAGTCTGGG 


3780 


fGC'GTTCCAA 


TATTGATTCC 

* *\ 4 4 4 4 ^ w 


TACCACACCT 


GCCTCATTGA 


TAGCCTGTTC 


ATAACGCTCT 


3840 


rCAATAACTT 




ATGGGTGTTG 


GTAAAATTTT 


GAAAATAAAC 


CAGATACTTC 


3900 


C C. A A r* A T C PG 


ggc A Cf^TGCG 


GTGCATAAAG 


TCAATTTCCT 


TATAAAATTG 


CTCACGGATA 


3960 


GGCGrATecG 


GTG CC ACAAT 


GGCATCTCCA 


GAACCAGAAA 


CCGTACAAAA 


AGTACAGCCC 


4020 


CC ATGAGCCA 

V- V- i \J V-j v_ 


CAGTCCCATC 


ACGATTGGGA 


CAATCAAATC 


CCGCATCAAT 


AGGGACTTTA 


4080 


A & A/^Tf "l*l"l"P 

nnn \j iwi l l i 


CTCCAAAGAG 


TTTTCGATAA 


TAATCATTCA 


AGGTATTATA 


AGATTTCATG 


4140 


ACTTTCATTA 


TAACAAAAAT 


CACCCACAAT 


CTCAAAAGCC 


TGACTTTCCT 


ATAAATTCCT 


4200 




TTCCATTAG 


CCTTTTTTTA 


TGATACAATA 


TGGGTATGAT 


TTTAATGAAA 


4260 


TTAGr , ATC" m A 


TTTT ATT ATT 


GATACTGACC 


TTAGTCGTCT 


GCATTATCCT 


AACCAAACTT 


4320 


TTTAGATTAA 


AAAAACTAGG 


ACCAAACTTT 


GCGGATTTGG 


CTTTTCCAGT 


CTTGGTATTT 


4380 


t>/v*j Inl I AV. 1 




TAAAACCTTT 


ACCCATAATT 


TCCTCCCTAG 


ACTGGGGCTA 


4440 




TCCT AGC CAT 


TATTCTCGTC 


TTTTTCTTCC 


TTTTGAAAAA 


ACCCAGCTTT 


4500 




AATTTATCAA 


ATTCTT CT GG 


CGTGCAGGAT 


TCTTATTAAC 


CCTTATCATC 


4560 


T AT A** AG AAA 


TG ATTGTT G A 


ATTGTTCTTA 


ATGAAATAGT 


CGAATCCCTA 


AGCATTTTCT 


4620 


AfVlf* 1TTTTT 




CAAAATAGTA 


TAGACAATAA 


CACTATACAA 


TTTTATACAA 


4680 


AG AAAAG AGT 


CTGGGACAAT 


AGTCTCTTAT 


ATCCAAAAAG 


GCAACGGATT 


TGCCGTTGCT 


4740 


TTTTTGGATG 


GTTACGATAG 


TCTTGGTAAA 


ATAGAATTGC 


CCAATAAACC 


ATTTAGAAAG 


4800 


GCTATCCCAT 


CCATATTCAC 


TATAACACAA 


ATCAAACAAC 


TTTAwLm. In 




4860 


CCTTCTTACC 


ACAAGATCAT 


CTCGTTTTTA 


CTATTGAAAA 


AGTGGTGAAT 


ACCTTGGAGG 


4920 


AACGTCACTT 


CTACACCTCC 


TATCATGCCT 


TTGATCGCCC 


GTCTTATCAC 


CCTAAAATGC 


4980 


TTGTATCTAC 


TCTTCTATTT 


GCCTATTCAC 


AAGGGATTTT 


CTCTGGTCGA 


AAAATTGAAA 


5040 


AATCGAAGAG 


TTAGTGACCT 


TAGATTCTTT 


GTTTATTGAC 


AGAACTAAGA 


TTGAAGCCAA 


5100 


TGCCAACAAG 


TATAGTTTTG 


TGTGGAAGAA 


AACGACAGAG 


AAATTCTCCG 


CCAAACTTCA 


5160 


AGAACAGATA 


CAGGTCTATT 


TTCAAGAAGA 


AATCACTCCC 


CTTCTGATTA 


AATATGCCAT 


5220 


GTTTGATAAG 


AAACAAAAGA 


GAGGGTATAA 


AGAGTCAGCT 


AAAAACTTAG 


CGAATTGGCA 


5280 
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CTATAATGAC 


AAGGAGGATA 


GCTACACACA 


TCCTGATGGC 


TGGTATTATC 


GTTTTCACCA 


5340 


TACCAAATAT 


CAGAAAACAC 


AGACAGACTT 


TCAACAAGAA 


ATCAAGGTTT 


ACTACGCCGA 


5400 


CGAACCTGAA 


TCAGCCCCTC 


AAAAGGGACT 


GTATATGAAC 


GAACCCTATC 


AAAACTTGAA 


5460 


AGCTAAAGAA 


TGTCAGGCGC 


TTTTATCTCC 


CCAAGGTAGA 


CAGATTTTCG 


CTCAACGCAA 


5520 


GATTGATGTG 


GAACCTGTCT 


TTGGGCAGAT 


AAAGGCTTCT 


TTGGGTTACA 


AGAGATGTAA 


5580 


TCTGAGAGGG 


AAGCGTCAAG 


TGAGAATTGA 


CATGGGATTG 


GTACTTATGG 


CCAATAACCT 


5640 


CCTAAAATAT 


AGTAAAATGA 


AATAAGAACA 


GGACAAATCG 


ATAAGGACAA 


TCAAATCGAT 


5700 


TTCTAACAAT 


GTTTTACAAG 


TAAAAGTGTA 


CTATTCTAGT 


TTCAATCTAC 


TATACAATAA 


5760 


GAGAATGACT 


CAAAATTAAA 


AAGCTAGAGT 


TCCACAATTG 


GAAATATCTA 


GCTTTTTTGT 


5820 


GGTTGAGAAC 


TATTTTGTCT 


CAGGCTCTTT 


ATCTTCTATT 


TACGACAAGA 


CTTTTTCTTT 


5880 


GGTCTTTAAT 


GATAAAGAAG 


GTATCAAAAT 


TTCTAGTCTT 


CTTTTTTACC 


TTTAGTAACT 


5940 


ACTAATCCTG 


CACTCAAACC 


TAGAAGAGTT 


AAACCTGCTG 


CTACTGCTGC 


TTGGCTTGCC 


6000 


GCACTACCTG 


TACTTGGTAA 


CTGGGCTTTA 


TTAGTTTGAC 


TAGCTTCACT 


TGAATCAATT 


6060 


GGTTTTGTAT 


CTGCTTTTTC 


TGACACTTGT 


GGTTTTTTAG 


CTTCTTGAGC 


TACTGGTTTG 


6L20 


GTTCCAACCA 


ACACGATGCG 


GTCTGTCGGA 


ACTTCTACCA 


CTTCACGGAG 


TTTTTCTTCC 


6130 


TTACTTCCAT 


CAGGATTAAT 


CGCTGTAAAG 


ATACGTTCTT 


TTCCAACTTT 


TCCTTCTTGT 


6240 


TCTACACGAG 


TTTCACCTAG 


ATACAGTGTT 


GAATCTTTTT 


TCTCAACTGT 


CTTGTATGCC 


6300 


AAATCTTTTT 


CAACAAATTC 


GATTTTTGGA 


AGATCTTCTT 


GTACAGCAGC 


AACTGTCTTC 


6360 


TCAGAAACTG 


GTTTTTCCTT 


ACTCAAGTGG 


ATACGGTATT 


CCTTGACTTG 


TTTTCCACTT 


6420 


TCTGAAACGA 


GGCGAACAAG 


TACTGGAAAG 


CTATCTTCTC 


CACTATCTAC 


CACAGTTGAA 


6480 


GCTACTTGAT 


TGTTTTCTTC 


AACTGAGACT 


TTTGGCCGTT 


GAC CT1 1 TATA 


GGTAATTTGA 


6540 


TAGTCTTGAC 


GATTTTCAGC 


GAAATCAGCA 


AGTTCTTTTC 


CATCTACAAG 


AATCTTTGAT 


6600 


TGAGTGCTTT 


CTTGAGGCAA 


TTCACTTGGT 


GCAAGGAAGG 


TCATCTCAAT 


CATCGCAACA 


6660 


CCGCTCTTAT 


CTGCTTTACG 


CTCCATACGC 


CATCTCATAG 


CTTTGGCTTT 


GATAGCTTTA 


6720 


AATGTTACGT 


TGATTTCATC 


ACCAGCTGCA 


ATGTCTTTAT 


CCGCACGATA 


AGGAACAGCT 


6780 


TCCCAATTTT 


CTGGATTGTT 


GAATGGATGG 


TCTGCGTCGT 


AGGCTTGGTA 


GTTTGAATAG 


6840 


TAGGTTGGCA 


CTTCAAACTC 


TGGACCGACA 


TAGCGTTCTA 


AAACGAGTTT 


AGATGGTGCA 


6900 


TCCGTACCAC 


TATCTGCAAA 


GAACTGAACT 


TTTCCTTGTG 


TAACAGTCCG 


TTCTACAATC 


6960 


TTACCATTTT 


CACGGAAAAT 


CACACCCGCT 


GATACTTCTG 


GATTACAAGA 


TGGTGTTGGT 


7020 


GACCAGTTTG 


TCCAACGACG 


ATTTTCTGAA 


TGATCTCCGT 


CATTGAGATA 


GTCAACGCGG 


7080 
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TCATGAGAGT 
TCATAGTTAG 
TCACCAACAA 
AGATTTTCCC 
TTACCTGTGA 

AAGCGAGCTT- 

AAGTGCAGAC 
GGCTCACCTT 
GTGACTGCCG 
AAGTTTGCTA 
ACTTGTTTCG 
ACCATGCGAA 
CCAGCCTTGA 
TTAAAGACAT 
TTACCAACAT 
AATCCGACAC 
ACACCGACTG 
TTATTGTAGG 
GAAGCAAAGG 
TCAGTTTGAC 
CCATTAACTG 
ACCTTAGCTG 
GGTGCTTCTG 
GAGACTTTTG 
TGATAGGAGT 
AAACCTTTGT 
GTCACTACAT 
GTGAGTTCAA 
TTATGCAACT 



TTTTGTCAAT 
GGTTATCTGA 
CGTTACTACC 
CTGGAACTTC 
AGTAAACTGG 
GTTTAACCGC- 
GGTATTCTCC 
CACGAACGCT 
CAACTTTTCC 
AGTCTTTGCC 
CAAAGATTTG 
TACGAACAGC 
GTTGAGCAGG 
GGTCCTCATT 
AATACTCAAT 
TTAGATTATC 
AAGCTTCTGG 
AAATGAGCTT 
CAAGTGGCAA 
CAGATACGCG 
TAAAGACACC 
ATGAAACGTG 
CGATTGGAGT 
CACGCGCTTC 
CTAGTTTTTC 
CATACTCAAC 
TTACAGGACG 
CTTGGTCTTT 
CAAGCATTCC 



ATCATTGGTT 
AAGAGTCTCA 
AAGGACACGG 
TTCCCATTCA 
AACCTTAGTC 
AGCAACTGGT 
TAAGATGTCG 
TGGAACGACG 
ATCTACAGAC 
GTCAACTTGG 
TACCTCTGTG 
ATAGGTTTCA 
GGCTTTTAGA 
ACCAACAAAA 
CACATAAGAC 
AACGGAGCCT 
ATTAGTACGA 
GTCATTAACA 
TTCTGAACCG 
AACATCAACT 
TTCCTTAGCG 
ACCATTTGAA 
TGTCACACTG 
AAGGTCAATT 
TTTCGGAATA 
TGTTACTGTT 
GATGGATTGC 
AGCTCCCTCA 
TTTACGAATT 



935 
GCTGAAGCAA 
CCAAGTTTGT 
CCTCGAACAG 
ACTGTCAGCT 
GCCAATTCAA 
TTATGAGAAA 
CCATTTTCAG 
GTAGCGAGAC 
TCAAGGTAGT 
ATTCTTCTTT 
ATAGACGTTC 
ACTTTATCAA 
TTAGTAACTG 
CTAGGGTTTT 
TTCGGTACAC 
TTGCTCAAGA 
TTCCACTTTG 
TTTGAAACTG 
GTCCATTGGT 
TTAGTTGTTA 
TATTGCTCTG 
TCATATGTCC 
ACTTCTTCAA 
CCTTCAACTT 
GCTTGCCAAG 
GCTGGAAGAC 
GCAATCTTCT 
TATTCAGCGT 
GCGACTTCCC 



AGGCCTGGTT 
CTGTCACTCG 
TAAATTGACC 
CTTTTGTTTC 
GTGCTTGACC 
CTAAGCTCTT 
CTTTCGCGAT 
CATTGTTGCT 
AGTCTGTCAA 
GTCCTTGCTT 
CACGCTTGTT 
AGCTAAAGTG 
GTTTCCAGTT 
TAGGAGCTGT 
CAACTCCATG 
TACCTGAATC 
TCCAACGATT 
GGTCGCTTCG 
CAGAAATGTT 
ATTGCGTACC 
GACGAATCGC 
GAACACTTTC 
CTGAAACGAT 
TACCTAGTAC 
TGACTTTATG 
TTGGTTCCTG 
TCTCAGTATT 
TCAGAGTGAC 
CTTCACCACT 



ACTGTTTTCA 
TACAGTGATC 
TGCTTTTGTC 
GTAGCCGTCT 
TACTTGTAGC 
ATCCTTAGTC 
CACACGAACT 
AACACTTGCT 
ATCAGGGTTG 
GGCTGCCGCA 
ATCTGCTTTA 
GTTCATTTCT 
GGCAGAATCA 
TGGGACAGTC 
GTCTTCATGG 
TCCAAACAGA 
CCCTGGTTGC 
ATTTGAGTCT 
TGCACCTTGC 
TTCTAAGCGA 
ATCCCATGCA 
TGCTAATTCT 
ACCTTCTACA 
TTCAAATGTT 
AGTTTTAGGG 
ATGCAAATCT 
GGCTTGGATA 
TGCTCCTGGC 
TGTAGAGAAG 



7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 
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GTTACTTTAT 


1 CACCTGGTAA 


. TACAGCTTGC 


GTTCCATCTT 


GATAGTGAGC 


TCGAACCGAC 


8880 


AATTTGACAG 


TTTGGTCTTC 


TTTGAGACTG 


TCAGCTTTTT 


CCACTTGCAA 


GCTCAAGTGA 


8940 


GCAATTTTTG 


GCGCTTCTTC 


AAGGAATTGA 


ATTGCATAGG 


TTTGAAGAGG 


GCCACCATCT 


9000 


TTAGGCTGAA 


TAAAGATGCT 


CGCACGCATG 


CCGTTTGCTG 


CGCTTGCTTG 


AAGAACTGTA 


9060 


ACAGCTGCAT 


TTTTAGCACT 


TGCTGTGACT 


TCTGGCAACT 


TAGCTCCATA 


AGCAAGAGTG 


9X20 


CGGTATTGCA 


TTGGTTTTTG 


ACTAGTAAGA 


CCTGTTACTG 


CCTCACCACC 


AACCGTTACA 


9180 


GTTGGTACTG 


CAGGTGCCGC 


AGGATTGCCT 


TCTTCTACCA 


CAAGGGTTCC 


ATGAATTGGT 


9240 


TG AC CTTCT A 


AATAACCGGT 


CGCTTGAATA 


CGAGAACCTG 


GAATTGCTAA 


CTTAGCTTTA 


9300 


TCTTCTTCGG 


CAATCTCCCA 


CTTGTCCACT 


TCATACTCTT 


CAACACTTCC 


ATCAATCAAA 


9360 


ACATAGGAAA 


CAGATTTGTC 


TACAGAATTC 


AAGTCAGTAT 


TTGGAGCAAT 


ACGTTTCACA 


9420 


ACTGGTACCT 


CTGATTTAAG 


AGCAATCACT 


TCTACACGAG 


CTTCTACTTC 


TCGTCCGTCA 


9480 


GCCATACCTT 


TCACCGTTAC 


AATACCAGGC 


TTGCTCACAT 


CTACTGAAGA 


CCAGGTTACA 


9540 


GGACCTTCTG 


CACGGCTACC 


ATCACTGTAT 


ACAAACGGAA 


CAGTGGTAGG 


CATTTCAGGT 


9600 


GCCTCTCCAA 


TAATGGTCTG 


TACTTTTGGC 


ACTTCTGTCC 


CCAAAACAGT 


CTTCTCTTGT 


9660 


CCTTCTTTCT 


TACCAGTAAA 


GACAGTGACT 


TGGTTCGATT 


TCAAGAGATC 


AGAGTGCGCA 


9720 


GTCAGGGTGA 


ATTTCCCTGC 


TTGTTCAGTT 


GATTTGACAA 


TGGCAACACC 


TTTACCATTA 


9780 


AATGCTTTAC 


GAATCCAAGA 


ACCATCTGCT 


TGCGCCTTAT 


AGCGTTCACG 


GCTGGCTTGT 


9840 


TCTCCGTTAT 


CTACACCGAC 


CAGTTGACCT 


TGGCCATGCA 


ATTGGAAGCG 


AACCACATTA 


9900 


TTAGCAGTTG 


GAACCACATT 


CCCCTGGCTG 


TCAACAATTT 


CATAGTAGAT 


GTAAGTCAAG 


9960 


TCTTTTCCAT 


CTGCTCCAAT 


CGCATGGTCT 


TCCTTAATAA 


GACGAACTGC 


CGCTGGCTTA 


10020 


CCAGCAGTCG 


TAATCTTATC 


TCGAGCAATT 


TCCTTGCCAG 


ATTCATCACG 


AGCAATTGCT 


10080 


TCCAAGGTAC 


CTGGTTCATA 


GGCAACTTTC 


CATTCAAGAT 


AAAGTTCATT 


AGCATTTGCA 


10140 


CCTTCTTGGT 


AAGTCCGCCC 


ATCGCTGGTT 


TGTTTTTTAT 


TGAAAGTCTT 


AAGACCAAGA 


10200 


GATTTTCCAT 


TCAAGAACAA 


TTCTACACTA 


GAAGCATTCG 


AATAAGCACG 


AACTGGAATC 


10260 


TTACCTTCTG 


AGTCACCTAC 


TTTGGATGCT 


AATTCTTTGT 


TTTCCCAGTT 


CCAGTGAGGA 


10320 


AGAAGGTGTA 


CCATCGGTTT CTTCTTAACA GAAACCCATT GGCTTTGGTA GAGATAGAAG 


10380 


TCATGTTTTG 


GAATGCCGGC 


TGTATCTACG 


ATACCAAAGT 


AAGAGCTCTT 


AACAGGAGTT 


10440 


TGATTTTGGT 


TGTGCCATGG 


TGTAGGTTCA 


CCAATATAGT 


CCGTACCTGT 


CCAGATAAAC 


10500 


TGTCCAGCAT 


AGCCAGCGTT 


GTCACGGTCA 


AAAGTCCATG 


AAGCGGTTGC 


TGTTTTCCCC 


10560 


CAACCCACAC 


GATCATTTCC 


ATAATCTGAC 


TGTTCATAAT 


TACCCTCAGG 


TCCATTGCTA 


10620 
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TGTTTCAATT 
TCTGATCCAT 
GAATAGTTAA 
TTACCGAAAC 
ATAACCTTAA 
ATTTCATTAC 
GTACGTAGGT 
TTTTCAAAGA 
TCTTCCTGAA 
TTGTGGGTTG 
TCTGCTTTAT 
TGGAATTTAA 
CGGTAACCAA 
ATCAATTCGT 
TCTAAAATCG 
ACTAAGCCTG 
TGGTCTTTGT 
TGTTGTTCTT 
TCTGTCACTT 
GGCTGTTTGT 
TTGGTGATAT 
TTGACATAAA 
AGGTCTTTTT 
CCACCTTCAT 
GGTAAATCTA 
TTTGCATTGA 
TGATTCACTT 
GAAGCTTGTG 
TTTGCAGTTA 



CACGTTCAGG GCGATAGTAA 
AAATCAACCA TTTTGGATGC 
ATCCAACAGC ATCGAGTTCA 
GGAATTTATC TGCTCCCATG 
CCAAACGTTT AACAGTTGCT 
CAATTGACCA CATGAAGATA 
CAAAATCAGA CCATTTTTCA 
AACGTCCATA GTCATAAGGT 
CGAGTAAACC TAGTTCTGCT 
TACGGATGGA GTTAACTCCC 
AGTTTTCTTC TGCTCCAAGC 
TACGTTCACC ATTCAAAGAG 
ACAAATCCTT CTTAGCATCA 
ACAAGGCAGG TTTGTCATTT 
CATCTAGGCT TGTTGATTCA 
TTACAGCATG ACCACCTCGT 
CGTCCGTATT GACGATTTTG 
CAAGTTTTGG TGTTAAAATA 
GTAAAGTCAC ATCACGATAG 
TGACTGCATG GACAGCAATC 
CATATGAGAA CTGGTTATAA 
CTTGAGAATC CATGTAGACG 
CATCTAGTTT GAAAGTCTTG 
TTTGTGCAGG AGATTCATGA 
ATTTTTTCCA CGTAGATACG 
GTTTAAAGTA CCAATTTTGA 
CTTCATTTGT TACAGCTTTA 
ATTCTATCCT TGGAGCTTTT 
CGGCTTCATC TTCTTTCTTC 



937 
CTTCCACGTG 
TTAGCTCTAA 
TCACCAATTT 
GTAACATAGC 
AAAGAGTGGG 



TACGGGTAGC TGAAGATGTT 
GGGCTTTGTA ATTATCTTCA 
TCTCATGCCC TCCGCTACCA 
GAGTCTTATC AACATCCTTG 
CATCACCATT AGCTTCACCT 



GCAGGGTTGT TTTTGC CT CT ~ rTCGACCATG 
CCTTTTCGAG CTTCTGGGTG AGTGGCATCT 
TTCTTGCCAC CATACCACCT ATCAAAGGCC 
GCGATTTGCA ACGTTTGCTC ACTAGCAGGG 
ATCTCCTTCA TTTGTTTGAG ACGGCGATAT 
GCCCCATGGT CGTGGTGCAA GGATACTCCA 
AAACCTTCAT TTGGAGTCCA GTGATAGTAA 
ACCAATTGAC CGTCACGCTA AACACGCGTA 
AAAACAGTCC AGAGTTTTGG TCTTTCAACT 
TGTGCTTTTA AGGTACGACT CGCTGTACGA 
TCAACGATTT GATATTCGGC TACAAGTTCA 
CTGGTCACAT GAGTTTCAAC CTTGCCATGT 
GTTGTCCCAT TTTTCTCAAC ATGCACCTTA 
ATACCACTTC CTGAATACCA ACGGCTACTT 
ACATTCTCAC GACCATCTTT TTGAAGGTAT 
CCATTTGGAT AATGCCCCAC TAACTGACCA 
CCATCAAAAG TAAGGCGAAC ATTTTTCTTG 
CGATACCAAG CTTCCCCACC GTTGAGCTGT 
TCGAAATCGT TAAAGATACT CCAGTCATAC 
TCTGCATCAG GTTTAATGGC TTCCTTAGAA 
TTAAAATCCA CTTTCCTGTC TTCAATCATT 
GCATCTTCCT TGAGCGGTTT TTCTTGATTT 
TCTTCCGGTT TAGCAGACAC TTTTTCCTCT 
TCAGATGCAA TAGCCTCAGT TGAACTAGGT 



10680 
10740 
10800 
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11280 

11340 

11400 
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11700 

11760 

11820 

11880 

11940 

12000 
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TCACTTTGTT CTCTCCTTTC AACTATATTT TTAGTTTCCA AAGCTTTATC ACCCTTTTCT 12420 

TCTACTATCA TTTTTTCCTC TTTAGGTTTC TCAGCAGTAT GAGTAATAAG TGTTTCATCC 12480 

GCATAAACTA CAGATTCTCC AGCTATATTT CCTCCTAATA AAACTGCACA AGTCCCAATC 12540 

ATTACTGAGC AAGCTCCCAC AGCAAACTTA CGAATGCTAT AAACTCTTTT CCGATTCCAA 12600 

TGGCCTTTCC CCATAAAACC CTCCTTATAT TATATTTAGT GCAGTTAGCT ACTACCAAAG 12660 

CCCAAGTGGT ATACATGGTA TGACAACCTA CTTTCAACAA TTTACACTCT GCGAAAATCC 12720 

AATTCAAACT TCGTCAGTGT CGCCTTGCCG TAGATATGAT TACTGACTTC GTCAGTTTCA 12780 

TCTACAACCT CAAAACCATG TTTTGAGCTG ACTTCGTCAG TTTCATCTAC AACCTCAAAA 12840 

CCATGTTTTG AGCTGACTTC GTCAGTTTCA TCTACAACCT CAAAACCATG TTTTGAGCTG 12900 

ACTTCGTCAG TCTTATCTAC AACCTCAAAA CTGTGTTTTG AGCAACCTGC GGCTAGCTTC 12960 

CTAGTTTGCT CTTTGATTTT CATTGAGTTT ATATTTTATA GGAGCGCATT ATTTTGCTTT 13020 

TGCTGCGTAC TCTTCGTTAC GTTTGATCAT TTGTTTTCTG TACCAAGCAA AGATACCGAT 13080 

ATAGAATACA AGGAAGACTA CTGCACCAAG GATTGCTTTG ATATCACCAG TTGTAGTGTT 13140 

ACCAATTCTC CAACCAAGAA GTTTTTCGAT TGCTCCTTCA AGAGTAGAGT GAGTAATCAA 13200 

TTGAGTTTGG CTCACACCTT CTGGGAAGGC ACCTACACCT TTAGCAAGTT CTGTTGCAAA 13260 

TGGTGCAATA AGTGTACCTG AAAGAAGGAA GAGTGGCAAC AAGAGTGTTC CGAAGATAAT 13320 

CATACGGAGC AATTTACCAC GAGTTACAAC CAAGAGAGCT GGAGTAACAC CCATAGCGAT 13380 

GATACCTGCA AGTGGCAAGA TACCATTTCC AACTTTTGAA AGAAGCACTC CTTCAATCAA 13440 

CATGATTCGT GCAAGTACGT TGGCACAAGC CCAGATTTCA GCACGACCAG CGATGAATGG 13500 

CCAGTCAAGA CCGATATTGA ATTTACGTCC TTGAAGACGT TTAGTAGCAA CGTTTGTAAT 13 560 

ACCTTCTCAT AGTGGTTCTA CGGCTGCGAT GAACCATGAA CCGATAAGTG AGAAGAGTTC 13 620 

CAAAGATACA CCGGCAGTCA AACCAAGAGA CAACCATCCT TTGATAACAA GACGCCATTT 13 680 

ATCTGCATCT GCAACACCTG CAATTGGATG TGGAGTTCCC ATAATACCGA TAACGATACC 13740 

AAGGATGAAA CCGATGAAGA ATTTAGATCC CCAGAAACCG ATTTTCTTGT TCAATTTAGC 13800 

AGCATCAAAG TCATATTTAT CAAGGCCTGG GAAGAATTTT TCAAAAATCT TATCCAAAAC 13860 

CATGATAACT GGGTTCATCA TGTAGTTCAT GTGAGTTGAT GTCATTGGTG ATGAACTTGG 13 920 

GGCGTTAAGA AGGTCATCAA ATGTAGGTTT CATCAAGTCA GAGTTGATAA TTTTCAACAC 13 980 

ACCGACAAGG ACGATAGCTG CTGTAGCAAT AAAGAGTGAA ACCCCTTGAC TCACACCATT 14040 

GTTATCAGCA TACCATTTAA TCAAGAGACC TGTGATAGAC AAGTGCCAGA TATCAAAGAT 14100 

ATCGACATCA AGTGTATCTG TTTTCTTCAT AGCTAGCATC ACTATGTTGA CAATCAACAT 14160 
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GATGAGCAAG AAGTATACTG TCCAAGCAGA ACCCCAAGTG ATTGTAGCAA GTGGTGCCCA 14 220 

ACCAACGTCG GTAATACTCA ATTGGATACC AGTGTTTTCA ACGAATTTTG CTAGTGATGC 14230 

TGAGAAAGCA GTGTTTAGCA TACCGATGAT AGCACCGATA CCTGTAAGAG CGATGGCAAG 14340 

TTTCATACCA CCTTCAAGCG CTTTGGAGAA TTTCACTCCA AAAAGTAAAG CCAATACTGT 14400 

CAAAATGATT AACATGATGA CAGGTCCACC CATTTCTAAG ATGGGATTGA AAACCTTTCC 14460 

_ GATTACGTCA - AAGATTGCAT"'CCATAACAGT~TCCTCCCTTT~TTGATGTTAT — ATGAATGTTJf 14520' 

ACAAATTAGA ATTAGCTTAA TCCGTGTTCT TTAATAGCTG CTTCAATATT GTCAAATACT 14 580 

GGAGCGCTCA TTGCTGGGAT ACGGAATAAG ATTGGCCCAG CTTCGATAAC TGGGATACCT 14 640 

GGTTCAAAAC CAAGGTCTGT TGCAGCGATT GGTGTAAAGA TATCGTAACC TTTCATAAGG 14700 

TCTTCGTTTA CATCTTTCAC CATGACTGCA TCACAGTGAA CATCATAACC ACGGTTTGAA 14760 

AGTTCTTCTT CTAGAGCACT TTTAATTTGG TGACTTGAGT TAACACCTGC ACCGCAGGCA 14820 

GCAAGAATTT TAATCATTTA GATTTCCTCC GATTTTATTT TTTAATAGAC AAGATTAAGC 14880 

GGTTGCTTCA GCAATGTAAG TATAAAGGGC TTCTGGTTCA GAAATTTTTG ATAGGTCTTC 14940 

AAGATGACCA TTTCCTGTGA AGAAGTCCAT TAACTGAGCA AGAATGTTCG TTTGACTTGA 15000 

ACTTGAATTA TTAATGATAA AGAAGAGTAG GGATACTTCT ACTTCCTTAT CAGGAGCTAT 15060 

CATATTGTGA AAAGTTATTG GTTTTTCTAA TCGAACAACC ACCACTTTCT CAGCTAGATT 15120 

ATGAACAATA TCTCTGTGAG GAATCGCTAC ATTTGGCAAG TCCTTTCCTA GAAATTCCAT 15180 

ATCTAAACCA GTTGGAAATG A C TTTTCACG CGTGATCAAG GCTTCACGAT AAGTTGGAGT 15240 

GACAATTTCT C G TTCTTCCA ATAAAGTTGC AACCTGATCA AAGAGTTGTT CTTGACTATC 15300 

CGCTTCTAAG CAAAACACAA GGTTTTTGTC AAAGAAATAA TCTAATACCA TAAGTTTTTC 15360 

CGG 15363 
(2) INFORMATION FOR SEQ ID NO: 140: 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS ; double 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

TAAGACTATT TAATAGTGGA GTGAAATAGG ATACGAACAA ATTGATTAGG AAAATCAAAT 60 

GAATTTATAG AAATCTTTTA GCAGTTATGT TATCCTATTC TAGTTTCAAA ACGCTATAGA 120 
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AGCAGCATTG 


TGCTAGTCkA GATTCAGTTT 


ACTATACTAA AACGAGTAGC 


TTGAAATCAA 


180 


AAAACCCACC 


CTCACAGGCA GCTTTTATCT 


GTATTATTCA GCTACATTAT 


GCTTTACCTT 


240 


CTGAACCGAA 


TACGTCGATA CGTTCTTCAA 


CCGATGCTTG GATACCTTTT ACACCGTCAG 


300 


CCAAGAATTT 


ACGTGGGTCG AAGAGTTTTT TCTTGTCGTA TTCTGCTTCG TTTGCTTCGT 


360 


AGTCACGAGC 


AAATTTACGA GTTGCGTTAG 


CGAATGCGAT TTGGCATTCT 


GTGTTAACGT 


420 


TAACTTTGGC 


AACACCAAGT TTGATAGCTG 


CTTGGATTTG CTCATCAGGA 


ATACCTGATC 


480 


CACCGTGCAA 


TACGATTGGG AATCCTGGAA 


GAGCTTCTGT CAATTTTTGC 


AAGTGGTCAA 


540 


GGTCAAGACC 


TTCCCAGTTT ACTGGGTAAG 


GACCGTGGAT GTTACCGATA 


CCAGCTGCCA 


600 


AGAAGTCGAT 


ACCAGTTTCA ACCATTGCTT 


TAGCGTCTTC GATTGGAGCC 


AATTCACCTT 


660 


TACCGATGAT 


TCCATCTTCT TCACCACCGA 


TAGTACCAAC TTCAGCTTCT 


ACTGAGATAC 


720 


CTTTAGCGTG 


TGCTTTTTCA ACAACTTCTT 


TAGCCAATTT AAGGTTTTCT 


TCAACTGGAA 


780 


GGTGTGAACC 


GTCAAACATG ATTGAAGTAT 


AACCAACTTC GATACACTCA 


AGTGCATCTT 


840 


CGTAGTGACC 


GTGGTCAAGG TGGATAGCTA 


CTGGTACAGT GATACCCATT 


GATTCAACAA 


900 


GGTTAGCGAT 


CAAGTTGCGA GCAACTTTGT 


AACCACCCAT GTATTTAGCA 


GCACCCATTG 


960 


AAGTTTGGAT 


CAAAACTGGA GCTTTTTTAG 


CTTCTGCTGC GCGCAAGATA 


GCTTGAGTCC 


1020 


ACTCAAGGTT 


GTTTGTUTTA AATCCACCAA CTGCATAACC GTTGTCACGG GCTGCTTCGA 


1080 


CAAATTTTTC 


TGCTGAAACG ATTGCCATTT 


TATCAGGCCT CCTGTATATT 


TTTATGGGTC 


1140 


ATCCCATTTA 


CATTGTTCAT TTTATCACTT 


TTTGCCAAAA AAATCTAGTT 


TTTCCCGCAG 


1200 


TTTCGATTGA 


TTTTCTTCTA ACTCCATCTA 


TGTAAACCCT TTCTCTCCCT 


AGTCTTGGAC 


1260 


GACTTTTGGA 


AAATCTATAA AGAAGGTTAA 


ACTATTCTCC TCCATCTCGA 


AACGATAAGC 


1320 


TAATTTTTCA 


TGTTCTAATA GACTCTTAAC 


CACAAAGAGC CCCATACCAG 


ACCCCTTGAC 


1380 


CTTGCGACTG 


GCATTGTCAG AAAAAGACTG 


GGCTAGTTTT TCTTGTTCCT 


CTGAGCTACA 


1440 


GCTATTTTCG 


ATAAAAAGTT CTCCTTCTCT 


TTCTCCAATT CGAACTAAGC 


CACCTGGAAC 


1500 


AGAGTGCTTA ATGGCATTGC TGATGAGATT AGAAAGAATC AACTTCATAA CTGATGCGTT 


1560 


TAGATAAGCC TGCTGATGGG TCAAACTATT GTCTATCTGG AGCTCTCTPT 


CCTTGGCTAG 


1620 


CAAGGCATAA 


TCTTTGACCA GATTTTGCGT 


CATCTGGAGG AGGTCAATTG 


TTTCCCTATC 


1680 


ATCTCGCAAT 


TCCTGCACAG AAGAGAGGGA 


AAGTATCTGC AGAACATGGT 


GATTGAGTTC 


1740 


ATCCACAATC 


CCCAAGGCAA CTCCCAGATA 


CTGGTCTCTA TCCTTATAAC 


GACCGATATT 


1800 


CTCTCTCATA 


TTTTCGATTA GGATTTTCAA 


ACTAGCCAGC GGTGTTTTCA 


ATTCATGAGA 


1860 


AGCTCCTCGT 


AGGAATTCGA CCTTCATCTT 


CTCCAGCTGG AGAATGCCTT 


CATTCTTTTC 


1920 
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ATGCAAGTCC CCAATAACAG TCAAGAGATC CTCGTAGACC CTATTGATTT GTTCCTTGAG 1980 
ATTACCTATC TCATCCTTAG AATCCACGCG CAATCGCACT TGGGAATCCA CGTCCATCAT 2040 
CCGACGGGTC ACCCGCTTCA TTTCCAAAAT CGCTGCAACA ATAGTCCGAG CCTAGATGTA 
GGCCACCAAA AGCGAAATCA CAAAGGACGC CACCAACGTA TAGGGAAGAA ACTGCAGACT 
GATTTGCTCC GCTTCCTTTT GTAAATCCAT GGAAGCTAGA AACTGGAGAA TCATAGTACC 
ACCGTCTTGC GTTTTCACCT CGCGCTCCTC AATAAAGAGA GAGGTTGTCT GGCGGTCTGT 
GTCCAGAGGA AGACTCTCCT TGACTTCTAA CTTGTCCTCG GTCATCTCAC CTTTGACGGT 
CCCCTTGATA TCACTAGTCT GGGAATACAA GTCTAACACT TGCTCGATAC TCTGCCTATC 
TTTCCCTTCT AGGCACTGGG CAATCCCTCT TGCCTTTTGA CCAATGGTTT CCTCACGATG 
ACTCAGATAA GTCGAAGGAA AAAGAAAATA AATAGCTAAA TGAAGGCAGA TAACCAGAAC 
ACTAAATATC GACAAGGTAT AGATAAATAT CTTTGCAAAT AAACCTGTTC GTTTCATTTT 
CGCTCCAATT TATAACCAAC ATTGCGCACA GTGAGGATAC AATCCAAGTC TAGCTTTTTC 2640 
CGCAATTCCT TGATATAAAC ATCAATAACA CGGTCAAAGG GAACCTCATC TGTCGCTTTC 
CAGACGGCAT CGATAATCTG AGATCGAGTC AAGCCCCGCC CTTCATTTTT CACTAGATAG 
TCCAGAATTT CCAACTCTTT GCCATTGATA GGCACTTCTT GACCTGCGAG GCTTGCACTG 
TAGCTTPCAA AGTCCACCTT GGTATCCTTG TAAGAAAAGA TTCGTCCTGT ATCGTAGTAG 
CGCTTGAAAA TCGCGTCCAC CCTCACTTTT AAAAGGGAGA CGGAGAAACG TTTTTCCAGA 2940 
TAGCCATCTG CCAAAGAGGC AAAGGCACTC ATCTTGTATT CCTCATCTTG AAAAGCTGTC 3000 
AACATCAAGA CAGGAACCTG ACTGGTTTTA CGAATCTCAC CTAGGACTTC TAAGCCGTTG 3060 
AGCTTGGGCA TCTCGATATC CAGTAAAACC AGGGCCACCT CATAGCTACA AAATTGCTCC 3120 
AGAGCTTCCT GACCGTCCGC TGCCTCAATA GTTTCATAGC CACAATCCGT CAAATAATCA 3180 
CTGACCCCCT CACGGATCAT CTCTTCATCT TCTACAATTA AAATTTTCAT ACTTTAACTG 3240 
CTCTCTATTT TTTATTTTTC TTAGAATAAA TACCTACCCT ATTTTCTATT ATAGTCTCTT 3300 
GCTGGCCTTT TGTCTGCAAG CAACTGACCA CTAGATAAAA CGTTGTGAAA TTCCTTTCTC 3360 
ATAAATTCCA TAACTTTACT ATATTATATT TAAGCACTAA AGTACAAAGA AAGCAACTGA 3420 
AAGCAATGAT TTTCACCACT GCTTTCGGAT TTATTTTGAA TTGTTAAATA GCCATTCCTA 
TCCACTATTC TTGAATAGAA ACACAAGATG CAATCTTTAT TCTAGACTCA TTTTTTCAAA 
TTTATTCACC ATCCAGCAAG AGCTCTTTTG GTTGTTTTCT AAGGAGATTG CTTGAAGCAA 3600 
GCGCCATAAC GAGAACCACT AGAACCAAGC CAAGGACAAA AATGATGATA AAGTCTGATG 
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AATGTCTAGG 


< CTCGACAAGG 


1 TCTTGCTAAA GCCATCTACT TCTGCACCAC 


3720 




H. r~ ft r~j~i i~iihu ■ ■ 

AGAGGCTTGA 


GCCGCCTTAC 


TAtKX L\m I ri 


GGCAACACCT 


GAAGTCACAT 


3780 


TGGCAAGGAC 


AGTGTTTCCA 


ATTGCACGGG 


^ ft ^W\^<f* ft. It IMt 

CAGTGTAATT 


AGCTAGGAAG 


TAAGCAGAAA 


3840 


WTAGAGCAGG 


GATAGCAATC 


AAGATAGATT 


CGGTGATGAA 


TTGACCCAAG 


ATACTTGCCT 


3900 


GCTTGAGGCC 


GATAGAGAGG 


AGAATTCCCA 


CTTCCTTGCG 


ACGGGCGTTG 


ATCCAAAGCC 


3960 


TGAGCAAGAG 


GGCAAGGAGG 


AGAACTGAGA 


AGCTCAAGCT 


ACCCCAGAAG 


AGGAGGTTGG 


4020 


CCATCTTUTA 


CATACCAGAC 


ATAGATTGCT 


CAAGAGCTGG 


GTAGTTAGAG 


GAGCTCTTGA 


4080 


CGAGTGTGTA 


CCTCTTCCAG 


TTGATACCAC 


TGATGCCATT 


CAACTCTTTC 


ATAACATCAT 


4140 


CCAAGTTCTT 


GTCTGCTGTT 


ACAAAGAAGG 


TTGCGTCCCC 


ATAAATGGCT 


GTGTCTTCTC 


4200 


TGTATCCATA 


AAGTTTTGCA 


GCAGTGTGAA 


TGTCTGTAAT 


AGCTGTGTTT 


TCGTAAAGTT 


4260 


CTTGTGAGTA 


GGTTACTGCT 


GACTTATTAT 


GACCATCAAA 


CAGTCCCTTG 


ATTGTCACTT 


4320 


CAACTGTTTC 


CTTGGCTCCT 


TTTTCATTAT 


CTGCATCGTA 


GAT ATT AGAG 


TCCAGTTTAA 


4380 


CCTTGTCCCC 


TACTTTCCAG 


CCGTGTTTGG 


CTGCCAAGTC 


CTTGTGCAAG 


AGGATTTTAT 


4440 


CCTTGTCGTC 


GTTCCTTAAG 


TGCTCTCCTT 


CG ACT AGTTT 


ATAAGAACCA 


GAGACAAACT 


4S00 


TGTCTTCTTT 


AGAGGAGTCA 


TTGACACCTG 


TAATCATCAA 


GCTACTTCCA 


AAACGCTTGG 


4560 


CACCATCAGC AGTGAGATTC TTCTTGCTTT CTGGCGTTTC 


AATCAGGTCA 


TATCCAGTCA 


4620 


AATCTCCGAT 


ACCGTTGATA 


CGTTTGACAT 


AAGACTCAAT 


GGCCTTGTTT 


TCGGTGATTT 


4680 


TTTTGATGTC 


TTCACCCTTG 


ATATTCCCAG 


CACCACGAGG 


CGTTCCTTGG 


TTGACGCGAC 


4740 


GATTGATTTG 


CATGGAGAAG 


CTATTGGTGA 


TATTTTTAAA 


GGTCTCCTGA 


GAAGCCTTGG 


4800 


CAGTAGCTCC 


CTTGATTCAC 


AAGCCGACCA 


AACTCAAGCT 


CGCCATGAGG 


AGAATAATCA 


4860 


GGAAGATGAC 


AATCGATTTG 


AAAAACTTCC 


TTGTAACATA 


GGCAAATGCG 


TTGTGTAACA 


4920 


TAGATTCCCT 


TTCTAGATTT 


TGTTTTAATC 


ATTCTATTAA 


AATAAGCTCA 


AATTATTTAC 


4980 


TAGTATTGCG 


CGTTTCAGTC 


AGTTTCTTAT 


CCTTTAATTC 


AAGTGTAATA 


TCTGACGCTT 


S040 


GTGCCACTTC 


TTTACTGTGA 


GTTACGACAA 


TCACACATTT 


ACCTGTTTTC 


TGGGCAAGTG 


5100 


ATTTGAGTAG 


TTCGACAATA 


TCTCCAGCAG 


TTTTAGGATC 


CAGATTTCCT 


GTTGGCTCAT 


5160 


CAGCTAGAAT 


AACTGGAGCT 


TCTGAGACCA 


AACTGCGAGC 


AATGGCAACA 


CGTTGCTGTT 


5220 


GACCACCTGA 


TAACTGGAGA 


ACATTCCGCT 


TGATCTGGCT 


TTCATC CAAA 


CCAAGCTCAA 


5280 


GAAGTGTATT CTTGCTTGCC 


'rrrrTCTTGA 


CCAATCGGAT 


ATTTTCCAGC 


GGAGAAAGAT 


5340 


AATCTATCAA 


GTTATAATTT 


TGAAAGACCA 


GGGAAATATG 


GTGCATGCGA 


TGGTAAGAAT 


5400 


AGCCCTTCTT 


ACGAATATCC 


TCTCCTTGAA 


AAAGGATAGA 


ACCTTCAACA 


GGACTATCTA 


5460 
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GACCAGCAAC TAGGGACAAG AGTGTGGATT TTCCTGCTCC TGACTCCCCA ATAATACTGT 5520 

AAAATTTTCC GCGTTCAAAA TTATAATTGA TCTGATATAG CACTCCTTCA GCAGTATTCT 5580 

TATAACGCTA GGTAACATCT TGTAATTGTA ATAAAGTCAT GATTTCTCCT TCTTAACTAA 5640 

TAGATGATAA AATTTCTTTC GGTGATTTTC TAAATAAGAA TAGGAAACAA AGGGCTACAG 5700 

ATAAGCAACT AAGCAGAACT AGAAAAACAT AGGATTCTGC AAAAGATAAG ATGCTAGTTG 5760 

~ATAAACTGCT~TGCTTTGGCT — AGTGT^ CTGATCTCCA CTTGCTAGTA 5820~ 

GAGTTTGGAG TAGGTAAGTT GTGATTGCGT TTCCTGCAAC AAATGCTGGA AGCAAAGCTC 5880 

CAAGAGATAC CAAAACTACC TCTAAACAGA ATTGTAGGAA GATCGAGCTC TTGCCTTTTC 5940 

CAAGTGCAAG TAAAATCCCC ACTTCATAGA CCCGTTCTCT CAACCAGAGA GACAAAACCA 6000 

CAATTAAC-GC TCCAGCTCCT GCTATCAACA TCCCATAAAG GAAGATGGTC AGGAAGGTTT 6060 

GGAAAGTTGC AACTGAGTCT TTGATTTGTT CAAAAGCCTT GTTTTCCTTT TCGACTTGGT 6120 

AGCCTTGATT TTCCAAGGCC AAGTTTTCTA CCTGCTTCAT GAGTCCGTCC ATTTCCTTAG 6180 

GATTTTCTAC ATAGAAGCGT GCTGCACTGA CTTGAGCTTC ACTATTGCCC AAAAGGGTTT 6240 

GGCTACTTTC ATAGTCTGTA AAGACTTGAT TTTCACTGAA GTCAGAAGAC AAGCCTGTGA 6300 

ATTTCTCTTG TTTTPTACCA GAAAAGATGC CGATAATCTC AAACTCTACT GTTTGTCCTT 6360 

TTCCAGATTC AGACTGACCA GCATCCAAGC CAATCTTGTC ATGAAGCGAA AGACCGTTCT 6420 

TCTTAGCCAA TTCTTCGTGG ATAAGGATTT TCTTGGAATC CCCTTTTTGA AGGTGTCGCC 6480 

CTTCTTTTAG ATTGAAAGCC GAACTGGTAA AGGTTACATC CTTGGATGAA TCCTCAAGAG 6540 

CCGTTAAGCT AACCAAGTTA TTGTCTGCAG CTGATAAATC ATCACCCTCC ACCCTCTGCT 6600 

CGCCAGTCAC TGCTTCCTTG TCTTTTAGTT TTGCGACCGT CTCAAGTTCA GGAGAGACAT 6660 

TTTCCAGCCC CTTAATCTTC CTTACAGATG CTAGGTCTGA CAACTTGAAT GTCTGACCAT 6720 

TCTCTATCTT CTTAATAGAA AAAGATGTAT TGAGTGATTT ATAAAGATTG CTTTCTACG 6780 

TTTTGTTGGA CTTCATCAGA GTCAAACAGG CTGAAATTCC GGCCAATAAG ACCAATAAAA 6840 

TCAGAAATAA AATAAAACTT CTCAGTCGCT TTCTGCTGAC ATAAGCCCAA GATCTTTGGA 6900 

TTGGATTCAT TTGTCACCTC CATATTTGTA AGACTATTAT AAAACCCAAA TATGAAATAT 6960 

TTATGAAATA CGAAAAAAAA ATATCGAGTA GGGGATAATC TCTAGCCCCT CTCACACCAC 7020 

CATACGTGCC GTTCGGCATA CGGCGGTTCA ACTAACTTTT AACGCATGTC GTTCAAGGTA 7080 

ATAATCCAAA CACGAAACCA GTCCACGTTT TTCAAGGACT GGTTTTGATA TAGCACGTTT 7140 

AAGTACCGAC TTCTGAGCTA CTATAGTAGA TTGAAACTAG AATAGTACAC CTCTACTTCT 7200 
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AAAATATTGT TAGAAATCGA TTTGACTGTC CTGAACAATT CGTCCTATTC TTATTTCATT 7260 

TTACTATAAT TCATAGTGCT CGCCCCAGCC AGATACCTTA TCTCCTATCC ATTTAGGAAC 7 320 

CCCTAACTTA AGCAATCCCC ATAATCGTCT CGATTTCTTC TTCCATTGCT TCCAGATAAT 7380 

CACTCGTAGG CGAGTACGCA AGCGCTCATC TATGCTAGTG ACTATACTTT TCATATTTAT 7440 

AATTCATTCC TTTCGTTTCA CTCAAGGCAC AACACAGAAT GAAAAAGTGT TGTGATCTTT 7500 

ATTTTGTTTT ATAATAATAG TGAGAAAACC TATCACTACT ACAAATCACG GGGAGGTGAA 7560 

TAAGTGAGTG GTACAGCCAC TACCTCCCAT ATTTTGTCAC ATCATTTAAC GGTACATAAT 7620 

AAGTTGTACC ATCTGAATAA GTTGCTACAA TATCATTTGC ATGCTCTCCT TCACCTTTAG 7680 

CAAAGGTTGG AGCTCCTGCT GGATGATTTT TATTTGCCTC TTTCAATTTT TCAATAATGC 7740 

CATTTTTTCT GTATCTTTTA TATTATCAGG ATTTTTCACT AAGATTTTGT CTGGATATGT 7800 

CGGTTTAGCA GAAACAATTT TTACTGTTAC TTCTTTTTTA TTCGAAGCAC TTGTCCAGTT 7860 

TCCAGCATTA TCTTTAGCAT TTAATTTTAC AGTAATTCCT GAACTAGGAA CTTCAGTAGC 7920 

AGGTTGATTA TCAACATTAT TCAACTTTAA TTTCAAAAGA GCTGTTGCAT CAGACGTTTT 7980 

ATCAATCGTT ATATATAATG ATGAATTGTT ATTATAAACA GTTCCTTCAT ATTTAGCTCT 8040 

TTGTGAGCTA CTTGAAACAG AACTGAAATT ATACCCACTA CCTCCCTGAT TATCTTCAAT 8100 

GCTTACGTCT AAATGAACTT CCCCACTATT ATTTGGCTTA GCAACAACTG TTATAGTAAA 8160 

ATAACATAAA ATTTGCATAA ATAGATTAGG GAAATCAAAG CAGCTTCTAG GAATGTTTTA 8220 

GCAGTCACAG TGTACTTTCC CAGCATCAAG CCACTATAAC TCTGCACATA AAAATGGAGA 8280 

AGATGCCAAT CCTCTTCTCC AAATATTAAC TTCTTTACAA ACCAACTATA GTTGACAAAG 8340 

AACCTAAAAT CAATTGATAA CACAAGGTCA GGTCGGTCAA CTCTTTCAAC TGAAGCCCTG 8400 

TCAACTCTTC CCATTTATCA ATCTTGTATT GGAGAGAATT GCGGTGCAGA TAGAGTTGCT 84 60 

GGGCTGTTTT AGTGAGAACA GCACTATTTT CCCAAAGAGA GAGAATGATT TCCTGAATCT 8520 

GATCTTGATC CAAAATCATC TGGTGTAGAC ATTCCTTGAT TGGCTTCAAG TCCACGAGTC 8580 

TTTCTCCCAT ACTCCAAAGA TAGACCTGAG AAAAAGTATG AACACCTTGG TGACCCTGAC 8640 

GCCACCATGT CTTGAACAAA TCCCGCTCAG CTTTGATTAA GTCTGATAGG GCTTGATGTC 8700 

CCGTCTGAGA CCAAACCTGA CCCAACATGA TAGAAAGACG AAGTCCAAAG TCATACTCAA 8760 

CCGCTTCAAT CGTATCACTT AAAATATCTC TTACAGAAGT GTATTTGTCT TGTTGAAGCA 8820 

CGAAAACATA ATCCTGAGCT CCGACCTGTA GCACTGTCTG ACAATTCGGA AAAAGACTCC 8880 

GCATCATATC TAGCCAAGAA GCCAGATTTT CCTGCTGAAA ATAAGAAAGA TGGCAATAAA 8940 

CCAACTGAAT CTTTTTAAAA ACTTGCGGTG CCTCTCCCTT GCCCTCAACC AGATAGGAAT 9000 
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ACCAAGGGTT TAGCGAACGA GCCTGCTCCT GCTGGGTCAA AAGGGCAACC AACTCCTTTT 9060 

CACGCTCGCT GAGCCCAGCT TCCTCCAGCA AAATCCACTG CTGAGAAGCT AAAGGGACCG 9120 

TGAGATAGCC CTCTTTCTCT ACTGGTTGGT CTGAAATCCG AGCCTCAGGA AACCAGTCTT 9180 

GTAGTTCTTT TGCCCTCATG TTCTAGCCCT CCACTTTTTG GATGCACCAT GAAACCAAAC 9240 

TCTCAAGACG TTCCAGATTC TCAGTCATAT GGAGATAGCC CATAACCGCT TCAAATCCCG 9300 

— TGGACATA^~ATAAGTCACG AttTC^'mACCOT ~ ~ 936O 

GGCCACGTTT GTAGATTTCT TCTTCTTTTT CCGTTAGGAC CTGCTCCTCC AACATGAGAG 9420 

CAATCAGGCG AGCCTGAGCC TTGGCTGACA CGTACTTAGT TGCTTCTTGA TGGAGTTTAT 9480 

TGGGTTTGGT CATACCTTTG AGGATGAGGT GACGGCGAAT ATACATAGAA TACACCGCAT 9540 

CCCCCTCAAA GGCTAGCGCA ATCCCGTTAA TGAGATTGAC ATCAATCACG TGTCCACCTC 9600 

ACTCCATCCT TGGTATCAAG GAGCTTAATT CCTTGAGTAA CCAATTGGTC ACGGATTTGG 9660 

TCTGCTGTCG CAAACTCACG ATTGGCACGC GCCTCTTGGC GTTTTTGAAT CAAGTCTTCA 9720 

ATCTCTGCAT CCAAAACTTC CTCAACAAAG ACAATTCCAA AAATTTCTAA CATATCTGCA 9780 

AGAGCTTGCT TGACACTTGC ATCATAGTTC CCTGAGTTGA TCCATTTGGC CATTTCAAAG 9840 

ACAACTGTGA TACCGTTGGC AGCATTAAAA TCTTCATCCA TAGCTGCTAC AAACTTATCT 9900 

TTAAAGTTTT GTAACTCTTG GGCATCCACA TTTCCTGTAA ATGGTTGTTC GTAAGTATTC 9960 

TTGAGATACT TGAGATTGGT CTCGGCATCG CGAACTGCCT TTTCCGTGAA GTTGATAGGC 10020 

TTACGGTAGT GCTGGGTCGC AAAGAAGAAA CGAAGTACTT CCCCATCAAG AGTTTTAAGG 10080 

GCATCGTGTA CCGTAATGAA GTTACCCAAG GACTTAGACA TTTTGACATT GTCGATATTG 10140 

ACAAAGCCAT TGTGCATCCA GTAGTTAGCA AAAGCCTTGC CTGTTTTAGC TTCAGACTGG 10200 

GCAATTTCAT TGGTGTGGTG TGGAAACTCT AGGTCAGCTC CACCACCGTG GATATCAATG 10260 

GTATCACCTA AAATCTCTGT CGACATGACT GAACACTCAA TATGCCAACC CGGACGTCCA 10320 

GGTCCCCAAG GACTATCCCA AGAAATCTCA CCTGGTTTGG AAGATTTCCA TAGAGCAAAG 10380 

TCTACAGGAT TTTCCTTACG AGCCGTTTCT TCATCGGTAC GACCTGAAGC ACCTAGCTCC 10440 

AAATCTTCCA AGCTTTTATT AGCCAATTTA GCATAGTTGT GGGATTTTTC TACACGGAAA 10500 

TAGACATCCC CTTGACTCTC ATAGGCAAAG CCTTTCTCGA TCAAGTCTTC CACAAAACGG 10560 

ATGATGTCTG CCATAAACTC CACTACACGC GGATGGCGAG TCGCAGGTTT CACGCCCAAT 10620 

GCCGTCACAT CCTCACGAAA GGCAGCGATG TACTTATCCG CAACCTCCTG AGGCGTGATA 10680 

CCTTCTTCCC TGGCACGGTT GATAATCTTA TCATCCACAT CTGTAAAATT GGAAATATAG 10740 
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GCAACCTTAT 


ACCCACGGTA 


CTCAAAATAG 


CGACGAATCG 


TATCAAAAGC 


TACCGTCGAA 


10800 


CGGGCGTTTC 


CTACGTGGAT 


ATAGTTGTAC 


ACCGTTGGCC 


CACAAACATA 


CATCTTGATC 


10860 


TTGCCCTCCT 


CAATCGGGAC 


AAATTCTCGC 


AAATCACGAG 


ACATGGTGTC 


ATAGATTTTA 


10920 


ATCATAAATC 


ATAATCAGGA 


AAGCTGAAAT 


CCAAGAACAA 


TT AG TTTCAT 


CACTAAAAGT 


10980 


TCAAGTAAAT 


TTCAGTCCGA ATATCPCTAC 


ACTTCCGAAT 


CeeTTGCTCC 


TTTCTCATTC 


11040 


AGATAAACCA 


CCTGAGTCTG 


TTTGACAAAG 


CCAATTTTTT 


CATACAAACG 


TTTGGCACCT 


11100 


ACATTGCTAT 


CTTCCACTGC AATCTGAAAT 


TCCTTGTCAT 


TTTGCTCAAT 


TAGTTGGTTG 


11160 


ACGAGGGATT 


TTGCTAAGTA GCTTCCATAG 


CCTTTTCCAC 


GTTCAGGTTC 


CAATATTGCT 


11220 


AAACCGTAGA 


GGTAATTCGT 


ATTAGTCGAT 


AAATCAACCG 


TACAAGTTCC 


AATAACCTGA 


11280 


CCAGCTTTTA 


ATAAAATATA 


TAGTCGGCTT 


TCTGGATCTT 


TCAGAGCTTC 


AGCGACATAT 


11340 


CTATCCACAA 


CTTCTCTCGA 


TTCATGTTCC 


TCTGAAAATG 


CCTGAAATTT 


TAATTGACTA 


11400 


ATTTGATCCT 


GATACGAACT ATCTGCTAAC 


AAAACTTCAA 


GATGGGAAAC 


ATTTGCTAAC 


11460 


GGATAAGGTC 


TTCTATCCTT 


ACCTAACCAA 


GTTTCTGTCT 


CTTCATCCTC 


GATTAGTCCC 


11520 


CAGTTACTGG 


CAAAGTCAGG 


ATGATTCTCT 


AAAAAAATAC 


GTTCTGTCTG 


AAAAGTGACT 


11580 


GACCGAATGG 


GGAAAGAAGC 


TGTTTCTCTC 


TCAAAACTAG 


TAAACAATGC 


ACGCGCAATC 


11640 


CCCTGACGGC 


GATGACCTGG 


ATGAACCAGT 


ATCGTCACTT 


CTACATCTTG 


GTCATCTGCA 


11700 


TAGACAGTTA 


ATAAACCAAC 


AAGTTCGCCT 


TTTTCATAAT 


AAAGGAAAAA 


GGCGGGCATG 


11760 


TTTGGGTCAA 


AATTAAGCAT 


GTTAGAGAGA 


TAGGGATCGC 


GATAGGTACC 


GTCATAGTTT 


11820 


TGGCAACAGT 


TAATTACTTT 


TTTCGCCTCA 


GATAGCTCCT 


CTTGGCTTAA 


crrcrrrcrT 


11880 


GCTTGAATCA 


TATAGGTATC 


CTCTACAAAC 


CAGACGATCT 


GTGACTGGCA 


TCTTTAGCCT 


11940 


GCTCGAGTTT 


ATTGACATAA 


TACTCTCGTT 


TTTCTTCGAC 


TTCGTGAATG 


ACAGGCTCAT 


12000 


CTTTCTTACC 


ATGAAGACGG 


ACAATCTTGG 


CCGGAATACC 


GACAACCGTC 


ACGTCACTAC 


12060 


GTACATCTGC 


TACGACAACT 


GCTGCAGCAC 


CGACCTTGGC 


ATTTTCACCA 


ATTTCCACAG 


12120 


GCCCGATAAC 


TTGGGCATGG 


GCTGATATGA 


GGGCTCCCTT 


TCGTACAGTC 


GGATGGCGTT 


12180 


TGCCACAGTC 


TTTCCCTGTT 


CCCCCGAGAG 


TCACTCCGTG 


ATAGAGAAGA 


ACGCCTTTTT 


12240 


CAACAATCGC 


TGTCTCTCCA ATCACCAGAC 


CAGAACCATG 


GTCAATAAAA 


ACACCTGAAT 


12300 


CAATCTGGGC 


TCCTGGATGA 


ATCTCAATCT 


GAGTCCAAAA 


GCGCCAAAAC 


TGACTGTACA 


12360 


TACGAGCTAA 


TAGTTTGAAG 


CCGTGCTTCC 


ACAGAAAATG 


CGAGAGACGG 


TGGGCCGCCA 


12420 


AGGCCTTGAC 


ACCTGGATAA GTCAGCAAAA 


CCTCCAAAGT 


GGTGCGGGCC 


GCTGGATCAT 


12480 


TTTCTTTTAC 


AATATCAATG 


GTTTCGCGCC 


ACCACCCCAT 


ACATTTCTCC 


TTTTCTTATT 


12540 
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CTGAATCTTT TGATCTTTCT GTAAATTCTT TCTTAGGTTT GTAATCCTTT TGATGACGTG 12 600 

GGCGGTGAGG GCGCTCAGAC TTTTCACCTT TTTCATCATG CTCAGGTTTT GGCGGACGAG 12660 

GTAGAAGAGC CTTCATAGAG GCATCGATAC GGCCTTTTTC ATCAATTTTG ATAACCTTAA 12720 

CATCAACTTC ATCCCCGATT TCTACCAAAT CCTCTACACG ATTGGTACGA GTCCAAGCCA 12780 

TCTCAGAGAT ATGAACAAGG GCATCTGTCT TATCAAAGAG GTTAACAAAG GCACCAAATT 12840 

-tctcgatacg -aacgactita gcacggt^ 12900 " 

CAGCAATAAT TTCTTTGGCA CGGTTAATAG CATCTTGGTC ACTAGAGTAG ATAGACACAT 12960 

TTCCTTCTTC GTCTATATCA ATCTTAACAC CTGTTTCAGC GATAATCTTG TCGATGGTTT 13020 

CTCCACCCTT ACCGATGACA ATCTTAATCT TGTCCACATC AATCTTGATC GTATCAATTT 13080 

TCGGAGCAGT TGGAGCCAAT TCTGGACGAA CTTCTGGAAT GGTTGCTTCA ATGACATCAA 13140 

GGATTTCAAA ACGCGCTTTC TTGGCTTGAG CAAGAGCCTC CGTCAAGATT TCTGCAGTAA 13200 

TCCCTTGAAT CTTGATATCC ATTTGAAGGG CTGTAATCCC ATCACGAGTA CCTGCAACCT 13260 

TGAAGTCCAT ATCTCCAAAG TGATCTTCCA AACCTTGGAT ATCTGTCAAT ACTGTGTAGT 13320 

TATTTCCATC TGAGATAAGC CCCATAGCAA TACCAGCTAC TGGCGCCTTG ATTGGCACAC 13 380 

CACCAGCCAT AAGGGCAAGA GTTCCCGCAC AGATAGAAGC TTGAGATGAA GAACCGTTTG 13440 

ATTCCAAAAC TTCTGCTACT AGACGGATAG CGTAGGGGAA TTCTTCCAAG CTTGGCAAGA 13500 

CTTGAGCAAG AGCACGCTCA CCAAGGGCAC CGTGACCCAT TTCACGACGA CCTGGCGCAC 13560 

CGTAACGACC TGTTTCCCCT ACAGAATATT GAGGGAAGTT ATAGTGGTGC ATAAAGCGTT 13 620 

TCTTGTACTC TGGATCCAAA CCATCAATGA TTTGAGTTTC TCCCATCGGA CCCAAGGTCA 13680 * 

AGACTGAAAG AGCTTGAGTT TGCCCACGAG TAAAGAGACC TGAACCATGT ACACGAGGAA 13740 

GGAAGTCAAC AACCGCATCC AAAGGACGGA TTTCATCGAC CTTACGACCA TCAGGACCCA 13800 

CCTTGTCTTC TGTAATTAAA CGTCGCACTT CTGCCTGTTC CATTTGTTCC AAGATTTCAG 13860 

CCACATCACG CATAATACGG TCAAATTCTT CGTGGTCCGC ATATTTTTCT TCGTAAACGG 13920 

CAGTCACTTG GTCTTTCACT ACTTGAGTCG CAGCTTCACG GGCCAATTTC TCTTCTACTT 13980 

GAACTGCCTP TTGGAGGTCA CTGTTGTAGG CTGCAATGAT TTCAGCTTGC AATTCAGCAT 14040 

CCACGTGAAG CAATTCCACT TCTGCTTTTT CTTTACCGAC AGCAGCAACG ATTTCTTCTT 14100 

GGAAGGCAAT CAATTCTTTG ACAGCTTCGT GCCCTTTAAG GAGCGCTTCC AACATGATTT 14160 

CTTCTGACAA TTCTTTGGCA CCAGACTCTA CCATGTTGAT AGCGTGCTTG GTTCCAGCTA 14220 

CTGTCAATTC AAGAAGAGAT TGCTCTGCTT GTTCTTGACT TGGCTTGATG ATGATTTGGC 14280 
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CATCTACATA TCCCACTTGT ACCCCAGCAA TTGGTCCGTC AAATGGAATA TCTGAAATAG 14340 

ACAGTGCCAA AGATGAACCA AACATAGCAG CCATTGGTGC AGATGCATTT TCATCATAAG 14400 

AAAGCACTGT ATTGATGACT TGGACTTCAT TACGGAAACC TTCCGCAAAC ATAGGACGAA 14460 

TCGGACGGTC AATCAAACGC GCTGTCAAGG TCGCATCTGT TGAAGGACCT CCTTCACGTT 14520 

TCATAAAGCC ACCAGGAAAC TTCCCAGCCG CATACATTTT TTCTTCGTAG TTGACTTGGA 14580 

GTGGGAAGAA ATCCCCAGTT GCCATTTTCT TAGACATAAC GGCAGCAGTC AAGACAGTTG 14 640 

ACTCACCGTA ACGTACGACA ACAGATCCAT TTGCTTGCTT AGCAACCTGA CCAGTCTCTA 14700 

CAATTAACTC ACGACCCGCA AAAGTCGTTT GAAACACTTG TTTTGCCATT TTAATCCCCT 14760 

TTGGATTGAT GAAATTATAC GCCTTGCCTA CAAAGATCAA GATACCAAGG ACGTCAAAAG 14820 

CAAAGTAAAA ATAGGAAACT GACGAAGTCT TCGATGAAGA CAAGACAGTT TATCTTTTTT 14880 

ACACAGCTTT TCGGCCGTGT TCAATTACAC AAGATA7TTT GGACGGTTCG GCTTGCCGAA 14940 

CATTTCTGTA GAAAAATAGG AAGGTGACGT CGCACTCGAC GAGTGCTAGG AAGCTTATCT 15000 

TTTTTCCTAA GAAATGAGAC CAAAATTCAA GTCATCAAGA TACCAAGCCG TCAAGCAACT 15060 

CAAAGGAAGA TAGGAAATCG AACGACGGAG CGACTACTCC TAGGGAGATT TATCTTTTTC 15120 

CACAGAGTTG TAGGCAAGTT CAGTTTTCAA GATACATCAT TAGAAAGGTT TAATACTAAA 15180 

GTATCTAAAG CTTTCACGCT AATCGCTATC GGGCGATTAG CTAAATGCTT TACTAACTCT 15240 

CTCGTCAAAT AACATCGATT TGACTCACTC GTGTCGTTAA ATCTTACAGT TTAAATGCAT 1S300 

TGTATTATTT AATACCTTCA TCTTTGTATC AAGTACGTAC AGAATTTATT TTATCATATT 15360 

TTTCTTAAAA AGTGAGGTCT TTACCATTAA AAAGGAACCA TTCCCCTCAC CTGAGAAGAA 15420 

TGGTTTGCTT TTATTATCCT AGAGACTGGT GATTAAACAA GGCATGGGTT GCTTGATGGA 15480 

TGTATTTTGC TGTATCAGCA TTATTCATCG TATAGAGATG CACACCGGCA ACATCCTGAG 15540 

TTACCAAGTC CACGATTTGG TCCACTGCAT AGGCAAGTCC TGCTGCTCTG ACCGACTCAG 15600 

GGTCATGCTC ATACTTGTCT AAGATGGCTT TAAATTTGCG TGCAAGATGG ATATTCTCAC 15660 

AAGTCTTCAA GAGTCGGAGA GCCTGATTTC GATTCAGAAT TGGCATAATT CCTGCATGAA 15720 

TGGGAACATC AATCCCAGCC AAGATACACT TGTCCTGAAA ATCATAGAAG CGCTCATTGT 15780 

CAAAGAAGAG CTGAGTTACG AGGCTCGAAC AGCCTGCATC CACTTTCTTC TTAAGATTTT 15840 

GAATATCTGA AATCTGATTT GGCGAATCTG GATGCCCTTC TGGATAGCAA GCTCCAATAA 15900 

TATCAAAGTG AGGGGTTTGT TCCTTGATAA ACTCAATCAA CTCGGTTGCA TACCGGAAAT 15960 

CCTTTTGTGG TTCCACGTCT GGAATAATAT CCCCACGAAG AGCCAAGATT TTCTGCACCC 16020 

CAACTTTGTC CAAGTCAGCA ATAGTTTCAG CAACCTTGTC CTTAGTTAGA TAAATAGCTG 16080 
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GCAAGTGCGC 


AATGGTCGGA 


, ATCGCCAAAT 


CATTTTGGAT 


AAAGTCAGCC 


AAACGAACCG 


16140 


TCGTTTCCTT 


GATATTAAAT 


TTATTATTGC 


TCGCAGTTAC 


ACTGATAAAA 


TGGGGAGCCA 


16200 


ACTCCTGCAT 


ATCCTGCAAG 


GCTGAAATAA 


TGTTATCATT 


ACCCACGGCT GGGTTTGGAG 


16260 


GGAACACTTC 


AAATGAGAGT 


GACGGTGTTT 


GGCGTGACAT 


ATGTAATAAC 


CTTTTCTAGT 


16320 


TGATTTCTTT 


TTGAACAACC 


ACTGTATGGA 


GAGAAATCCA 


ATCTTACAAT 


TTCTCACGCG 


16380 


CAGCTTTAGC 


TGCTTCAACA 


AGGCGGATCA 


AGCTTTCTTT 


TGTTTCTGGG 


ATACCACGTG 


16440 


TTTTCAAACC 


ACAGTCAGGG 


TTGATCCAAA 


CTITOTGCT 


TGGCACTTTA 


GCAAGGATGG 


16500 


CTTCGATTGT 


GTTGTCGATT 


TCGCCTTCAT 


TTGGTACACG 


AGGTGAGTGG 


ATATCGTAAA 


16560 


CCCCAGGTCC 


CACTTCTGTT 


TGGAAGTTTT 


TCGCTTTGAG 


TTCGTCCAAG 


ATTTCAAGGT 


16620 


TTGAACGGTT 


AGCTTCAAAC 


GAAATAACGT 


CTGCATCCAT 


GTTATCGATA 


GCTGGGATGA 


16680 


TATCTGTAAA 


TTCTGAGTAA 


CACATGTGAG 


TGTGGATTTG 


TGTGTCTCGC 


GCTACTGTTG 


16740 


AGTGTACCAA 


GCGGAAGGCA 


GG AATAGCCC 


AGTCAAGGTA 


GTCTTCGTAC 


CAGTCGCTAC 


16800 


GGCGGAGTGG 


CAATTTTTCA 


CGAACACCAG 


CCTCGTCGAT 


TTGGATGATT 


TTCACACCAG 


16860 


CAGCTTCAAG 


GTCAAGTACT 


TCATCCTTGA 


TAGCAAGGGC 


GATTTGGAGA 


GTTGAATCCT 


16920 


TGATAGAGAT 


GTCTTCACGT 


GGGAATGACC 


AGTTAAGGAT 


GGTAACAGGT 


CCAGTCAACA 


16980 


TACCTTTAAC 


AGGTTTGTTT 


GTACGACTTT 


GTGCATAGCT 


AGACCATTTA 


ACAGTGATAG 


17040 


GCTTAAGACG 


AGTGACATCA 


CCCCAGATGA 


TTGGTGGTTT 


TACCCCACGC 


ATACCGTATG 


17100 


ATTGTACCCA 


TCCATTTTTA 


GAGAAGAGGT 


ATCCTGACAA 


GTTTTGACCG 


AAGTACTCAA 


17160 


CCATGTCATT 


ACGCTCAAAT 


TCACCGTGAA 


CAAGGACATC 


AAAGTCAATA 


TCTTCTTGCC 


17220 


ACTTGATCCA 


TTCGTCAATC 


GTTTCAGCAA 


GGAAAGCGTC 


GTACTCTTTT 


TGAGACAATT 


17280 


CACCTTTACG 


GTAAGCCAAA 


CGTTTGGCAC 


GAACTTCTTT 


TGTTTGAGGG AATGAACCAA 


17340 


TCGTTGTTGT 


TGGAAGAGCT 


GGAAGTTTGA 


AAGCTTCTTC 


TTGGATAGCT 


TCACGTTCTG 




CAAAGGCTGG 


CAAACGAGTG 


TAGTCTGCGT 


CTGTCAAGCC 


AGCGATACGC 


GCACGAAGTT 


17460 


CAGCATTTTC 


ACCAACACGC 


TCAGTCGCAA 


AGAGTTCTTT 


GTTGGCTGCA 


AGAGCTTCTG 


17520 


AACCTTGACC 


ATTTCGGATA 


GCATCCAAGT 


CACGGATTTC 


ATCCAATTTT 


TCAACTGCAA 


17580 


AGGCAAAGTG 


GTTCAAGAGT 


GCTGGTTCAA 


ATTCTTCATT 


AGCAGTTGTA 


AATGGCACAT 


17640 


GAAGAAGTGA 


GCAAGAGCTT 


GTCAAAACGA 


TGTTTTCAGC 


TGGAATTTGC 


TCAAGAACAG 


17700 


CCAAGCTCTT 


TTOGTAGTTG 


TTGCGCCAGA 


TGTTTTTACC 


ATTGACAATA 


CCTACATAGA 


17760 


GAGTCTTGTC 


AGCTGGGAAG 


CCACCTTTAA 


CGACTTCAAG 


AGTTTTCTTA 


CCTTCAACAA 


17820 
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AGTCAAGACC 


GATAGCATCT 


ACTGGTAAGT 


TTACAAGGTC 


AGCGTATACC 


TCACGAACAT 


17880 


CACCGAAATA 


AGTTTGAAGC 


AAGACTTCAA 


GACCTTTTTT 


CTCACCCAAG 


AGTTTGTTGT 


17940 


AAAGGTTCAA 


GAAGAGACCT 


TTTTCTTCAG 


CTGTCAAGTC 


TTTTACAAGA 


GCCCCTTCAT 


18000 


CCAATTGGAT 


GCGAGTCGCA 


CCAAGTTCAG 


CCAATTTAGC 


AAAAACTTCT 


TGGTAAGCAG 


18060 


CCACTAAGCT 


ATCTACGAAG 


TCGTCTGCTT 


TCACGCCTTC 


TTCAAAGTCT 


GACAATTGAA 


18120 


GGAAAGTGAA GGGACCTACA AGAACAGGAC 


GAGTGTTCAA 


TCCAAGTTCT 


TTGGCTTCTT 


18180 


GGAACTCATC 


GAAAATCTTG 


TGACCAGCCA 


ATTTTACTTG 


AGTCTCTTTT 


TCAAATTTAG 


18240 


GAACGATGTA 


GTGGTAGTTA 


GTGTTGAACC 


ATTTCTTCAT 


TGGAAGGGCG 


CGAACGTCCC 


13300 


CTTTTTCTCC 


CTGGTAACCA 


CGTCCCAAAG 


CGAAGTAGCG 


CTCAAGGTCA 


GACAAGTCCA 


18360 


AGTTTTGAAC 


GGATGCAGGC 


ACCACGTTGA 


AAAGGAAAGC 


CGCATCTAGG 


AAGTTATCAT 


18420 


AGTGAGAAAA 


GTCATTTGAT 


GGAATTTCAG 


TGATGCCTTT 


TTCTTTGACA 


ATGTTCCAGT 


18480 


GTTTAGCACG 


CAAGTCTTTT 


GCTGCTGCTA 


AAAGTTCTTC 


TTCTGAGATT 


TCTTTTCTAA 


18540 


AGTATTTTTC 


AGTTGTAAAT 


TTTAATTCAC 


GGAATTCGCC 


CAAACGAGGG 


AAACCGATGA 


18600 


TTGTAGTTGA 


CATGATGTGT 


CCTCCAAAAT 


TTGTTGTTGA 


AACTATCTTA 


ACAGAAAAGA 


18660 


AAGCGTCTGT 


ATAATTGTAA 


AAAATTAGGG 


TTTGATATAG TTTGAAACTA 


TATATCTGTT 


18720 


TCGGACAAAA GAAAAAGACT 


TGAAGCAAAC 


GTCTCAAATC 


CTTTGTAATT 


CTTACTTTAC 


18780 


AGCTATATTC 


CAATTAGAAT 


ACTAAAACAT 


GTTATTAGTA 


ATTCTTATAA 


GTGACTATGA 


18840 


CCTGTTATTA 


GAAAAGACTA 


TAACTGATTC 


TAGTCAACTT 


TTTCCCTGTT 


CAAGTGGGAC 


18900 


GATTGCTAGT 


GTCTTTCCTA 


AACTGGCTAG 


GACTTTTAAG 


ACTGTATCCA 


ACTGAGGACT 


18960 


AGTCTTTCCT 


GTCTCCATCC 


TAGCTATGAC 


AGGCTGGCTT 


ATTCCACTGA 


CTTCTTCCAC 


19020 


CTTTTTCTGA 


CTGATTCCTT 


GTTCATACCT 


AGCCTCAATC 


AACTCGCTCA 


TGATAGCCAC 


19080 


TCGCATATCA 


CTTTCAAGGA 


TTTCCTCCTT 


GCTAAACACC 


TCAGATGGAC 


ATCCTTCCAA 


19140 


TTACTCCCAA 


TAGCACTATT 


CTTCATCACT 


TAACCCTCTT 


TTTTTTACGT 


CTATGTATTT 


19200 


TTAAAAAAAT 


GAGCGAATTA 


TGATTCGATA 


GATTGACCAG 


TGGGTTTAAA 


GTTGGTGCTA 


19260 


GCCTATTTCT 


TAAGCGATTT 


TCCTTTTCTA 


GGATAAAGCA 


GTTCCTGCTT 


GCTTAACCCC 


19320 


AATTTTCCAC 


GATGAATCCA 


ATAGTAAATG 


GTTGAAATTC 


CCACGTTAAC 


CCCTTTAGCC 


19380 


ATCACCATCA 


TTTCAGGCGA 


AAATTTTTGG 


TTATGTTTTT 


GGTTATGTAT 


AGTGGAGAAT 


19440 


CTTTTCCTTT 


AGTTTCTTAA 


GACTGTTGAG 


CGTAGTCGGC 


AGAATAAATC 


TCTTTGAAGC 


19500 


GCCCTTTTCC 


AAGACATTGT 


CGGACTGTCC 


CACGCTTGAT 


TTCAGTGTGG 


ATAGTTTGAG 


19560 


GAGCTTTTCC 


AAGTAGAGAG 


GCAATTTCTC 


TATTTGATTT 


TCCTTCTTTT 


TTCCATCGTT 


19620 
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CGATTAAGCG ACGGCTATCG ATTGTCAAAT GTTTCCCTTT TCTAGTATAA TTGTCTTGCA 19680 
TTTCTGTGCC TTTTAATCAT TTCAATCTTA AATTGGACTT TTTTTACTTG GCTTGTACTT 19740 
AATCTATGAG GAAGACAAGA AAAAGAATAT CAATCAAGTA AAGTCACAAA GTCACATTAG 19800 
CTCCGAGCAA CCATTGCAAA TTGAGGTACT CACACAATGA TTAAAACATT TCTCTCTGCC 19860 
CTTTCCCTCA TTCTCTTTTC TATCCCTATC ATAACTTATT CTTTTTTCCC ATCTTCTAAT 19920 
CTTAACATTT GGCTATCTAC CCAACCTATC TTGCCACAGA TTTATCCCTT CCCCTTACCT 19980 
ACTGCAACTA TGGCTGCTAT TTTAAGTTTC TTATTTTTTT TCCTATCTTT TTACAAGAAA 20040 
AATAAACAAA TACGGTTTTA CTCTGGCATT TTGCTCTTAC TATCGCTCAT ATT ACT ATT A 20100 
TTCGGAACAG ATAAAACCCT TTCTTCTGCA TCAAATAAGA CTAAAACCTT AAAATTAGTA 20160 
ACTTGGAACG TCGCTAATCA AATAGAAGCA CAACATATTG AGCGAATTTT TAGCCATTTT 20220 
GACGCCGATA TCGCTATATT CCCTGAACTA GCTACCAATA TCACAGGTGA GCAAGAAAAC 202 80 

CAGAGAATCA AACTATTGTT TCATCAAGTT GGACTTTCTA TGGCCAACTA TGATATTTTC 20340 

ACTTCTCCAC CTACCAATAG TGGAATAGCT CCTGTGACTG TGATTGTCAA GAAAAGTTAT 20400 

GGTTTCTATA CACAAGCTAA AACTTTTCAT ACAACACGGT TCGGGACAAT TGTATTACAT 20460 

TCGACAAAAC AAAATATACC AGATATCATT GCCTTGCATA CTGCGCCTCC TCTGCCAGGT 20520 

TTAATGGAAA TCTGGAAGCA AGACTTAAAC ATCATTCATA ATCAATTGGC TTCAAAATAT 20580 

CCAAAGGCTA TTATTGCAGG TGATTTTAAT GCAACTATCC GTCATGGAGC ACTTGCAAAA 20640 

ATAAGCTCTC ATAGGGACGC ATTAAATGCA CTGCCACCTT TTGAAAGAGG AACTTGGAAT 20700 

AGCCAAAGTC CAAAACTTTT TAATGCAACA ATAGATCATA TTTTATTGCC TAAAAACCAC 20760 

TACTATGTTA AAGATTTAGA CATTGTAAGT TTTCAAAACT CTGATCATAG ATGTATTTTT 20820 

ACAGAAATCA CATTTTAATT ATTTTATATA AAATCACCCC TCTAATGTTC ATAAACTAGA 20880 

GGGGGAATTT GTATCCTACT ATCGTTTAAC GCACTTCTGC ATTGACTTTT TCTTCGAGAC 20940 

ACGCTTGGAT TTTTTCCATA TAGCGTGCGA CTTCTTCCTC CGTTAAGCTG TCTTCTGGAT 21000 

TTTGGAAGGT CAAGCTATAA GCCATTGACT TCATACCAAG TCCCAGTTTT TCACCTGAGA 21060 

AGACGTCAAA GAGTTTGATA TCTGTCAAAC GTTTCACGCC GGCAGCTTGG ATAGCATCTA 21120 

CAACTTCTTG GTGAGTCACT TCTGCCTTGA GGAGAAGGGC AACGTCACGG CTGACTGCTG 21180 

GGAATTTGGT GATTTCCACA AATGGAACAG CAGGTTGGAG CGCCCCTTCG ATGGCTGAAA 21240 

GGTTAAGCTC AGCTACATAC GTTTCTGGAA TATCGTAAGC CTTGGCAGTG ACTGGATGCA 21300 

CTTGGCCAAG GAAACCAAGA ACTTGGTCAC CGAGTGAAAT CACGGCTGTA CGACCTGGAT 21360 
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GAAGGCTAAC 


CATTTCACAT 


GTTGCTGTAT 


AGGTTACTTG 


GAGTCCCAAA 


CGACTAAATA 


21420 


GGCCTTCAAG 


GATTCCCTTA 


GCATAGAAGA 


AATCAACTGG 


AACTGCTGCT 


GTTTGGAAAT 


21480 


CTTTTTCACC 


AACCAAGCCT 


GTCAAGGCAA 


AGGCAAAGCT 


GTTGATCTCA 


TTTGGAAGTT 


21540 


CTTCTTTTGG 


ATTACCTGTT 


TGTTCAAAGA 


CTTTTCCAAT 


CTCATAAAGG 


GCCAAGTTTT 


21600 


TATTCTTACG 


AGCCACGTTG 


TAGGCAACGG 


TATCAAGGAT 


CCCTGAAATC 


ATATTTTGAC 


21660 


GGAGGACTGA 


ACGATCCACA 


GTCATTGGCC 


ACATGAGTTC 


AGTAAGGTTA 


CTTGGTTGAG 


21720 


CTGTGAACTC 


AACTGCTTTT 


TCAGGAGTTG 


TCAGAGCATA 


GGTGATGATT 


TCTGTCAAAC 


21780 


CTGCTCCTTC 


AGCAATGGTA 


CGAACTTGAC 


GGCGGAGTTT 


TTGTATCACA 


GTCAATTCAC 


21840 


CAGCTGTACC 


ATCGTCTTTT 


GGAAGGCTGG 


TTGGCAACCG 


GTCATATCCA 


TAGATACGAG 


21900 


CGATTTCTTC 


AAAGAGATCA 


GCTTCGATTG 


TGATATCCCA 


ACGACGACGT 


GGTACGCTGA 


21960 


CTGTAAACCT 


GTCTGCATTT 


CCAGAAAGAC 


CAAAGCCAAG 


ACGACGGAAG 


ACGTCTTCTA 


22020 


CATCAGCATA 


AGACAGCTCA 


GTTCCGAGGA 


CACGCTTAAC 


ATCAGCAAGG 


GTTGAAGAAA 


22080 


CTTCCACATC 


AGAGGTATCA 


AGCTCACCCG 


CTGAAACGAT 


ACCCTTACGC 


ACCGTCGCGC 


22140 


CTGCAAGCTC 


TGCAATCATG 


CTAGCTGCCG 


CATCAAGGGC 


TTCATTAACT 


GTTCCCACAT 


22200 


TAATTCCTTT 


TTCAAAGCGA 


GAAGATGACT 


CAGAACGAAG 


GTTCAGGCGA 


CCACTTGTCT 


22260 


TACGGATAGA 


TTTGCCATTA 


AAAACAGCAG 


CTTCAAGGAT 


AACACGACTA GATTTTTCAG 


22320 


AAATTTCTGT 


AGCCTGACCA 


CCCATAACAC 


CGGCAAGGGC 


TACTGGTTTG 


TCAGCAACTG 


22380 


TAATCACGAG 


GTCTGTCTCA 


GCCAAGTCTC 


GTTCTTCACC 


GTCCAGGGTC 


ACTAATTTTT 


22440 


CACCATCACG 


CGCTTCACGC 


ACACGGATGT 


CAGTCCCTTC 


AAATGTGTCC 


AAGTCAAAAG 


22500 


CATGCATAGG 


TTGACCAAAC 


TAGAGCAGGA 


TGTAGTTTGT 


CACGTCTACA 


ACGTTATTGA 


22560 


TGGGACGGAT 


GCCTTCGTTC 


ATGAGAAGGT 


TTTGCAACCA 


TTGTGGACTT 


GGTCCGATAG 


22620 


TCACATTCTC 


CAAGATACGA 


GCTGCATAGT 


AAGGCGCCTT 


GTCTGTCTCA 


ATGCTGACAG 


22680 


AAAGGGCATC 


TGCCGCAGCT 


TCATTAGTTT 


CTGTTAGAGT 


AAATTTTTTA 


AAGTTGACTG 


22740 


CCTTGTCATA 


GATGGCTGCC 


ACTTCGTGAG 


CCACTCCACA 


CATAGAAAGG 


GCATCTGCAC 


22800 


GGTTTGGTGT 


GATGGAAAGT 


TCGATGATTT 


CATCATCCAA 


GTCTAGGTAA 


GAAAAGACTT 


22860 


CCTCACCTGG 


CACGGCATCT 


TCAGGCAAGA 


TTTGGATGCC 


ATCTGCGAAT 


TCCTTAGGCA 


22920 


CAACTGAGTC 


AGAAATTCCC 


AATTCACCAA 


GTGAACAGAT 


CATTCCAAGT 


GACTCCAAAC 


22980 


CACGGATTTT 


TCCTTTTTTG 


ATTPTGTAGT 


TATCAGCGAT 


ACGAGCTCCT 


GGAAGAGCCA 


23040 


CCATGACCTT 


GATCCCAGCA 


CGCACATTTG 


GGGCACCACA 


AACGATCTGA 


CGCTCTTCTT 


23100 


CTTCGCCAAC 


GTTAATCTGA 


CAAACATGGA 


GGTGAGTCTC 


TGGCACATCT 


TCGCAAGACA 


23160 
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AGACCTCACC GACGACAATT TTTGAGAGAC CAGCAGCTGG TGATTCGACA CCCTCTACCT 23220 

CCATCCCTCT AGTTGACATT TTTTCAGCCA ACTCTTGTGA TGGCACATCA ATGTCCACCA 23280 

ATTCTTTTAA CCATTTATAA GATACAAGCA TAATTTAGTT CTCCAGAATG ACAGTTGTCA 23340 

CTCTAGTTCT TTTCCTTTCC TATCATTTCA ATAGAAGAAT CCTCTTCTTA CCTTAATTTC 23400 

TTTCTCAGTA ACCAATCCGT ATCTACTTTT TGACCAACCA TAAAATGATG TTGGCTAAAT 23460 

TTTTCAAAAC CATATCGGTT ATAAAACGCT TGAGCTTTTG TATTATCCTC CCAAACACCT 23520 

AGCCAAGCCC AAGAAAAACT ATTTTTTGTA GCAAGTTCAA GTGCGAATTC AAACAGTTGC 23580 

TTACCTAGTC CAAATCCTTG GAATTTTTGT AGCACATAGA GACGTTGAAT TTCAAAAGCG 23640 

TCCTCTAATT CTCTCTCAGT TTGAGCACTT CCCCAGTTGA CTTTGAGAAA ACCAGCTATC 23700 

TCCTCCTCAT GCATAATGAA ATAGGTTTCA GAGTCAGGAT TTCCCAACTC AGTTGACAAA 23760 

GTTTTCAGAC TATAAGCCTC TTCAAAGTAT TCCTGTAACT GCTCTTCCGT ATTATCATAC 23820 

GCAAAGGTTT CACGAAAGGT TTGTTTGGCA ATTTTAGCCA ACACCTCAAC ATCTGCCATT 23880 

TCTACTTTTC TAATCATTAT TTAAACTGTT CTGAGAAGCG GACATCTCCT TGGTAGAATC 2 3940 

CACGGATATC GTTGATTCCA TAACGGAGCA TAGCTACACG CTCTTGTCCA ACACCAAAGG 24000 

CAAAGCCAGA GTATACAGTC GCATCGATAC CACTCATTTC AACGACACGT GGGTGAACCA 24060 

TACCGGCCCC CATAATTTCG ATCCAACCTG TTTTCTTACA TACATTACAG CCTTCTCCAC 2 4120 

CACACTTGAA GCAAGAAACA TCCACCTCAA CAGATGGCTC TGTGAATCGG AAGTAAGATG 24180 

GACGCAAACG AATTTGACGC TCTTCACCAA ACATTTTTTG GACAATCAAC TGAAGCGTTC 24240 

CTTGAAGATC AGCCATAGAG ATATTTTTCC CAACTACCAA GCCTTCGATT TGGTGGAATT 24300 

GGTGACTGTG GGTCGCATCG TCCGTATCGC GACGGAAGAC ACGCCCTGCC GAGATCATCT 24360 

TCAAAGGACC TTTAGAAAAA TCATGGGCAT CCATAGCACG CGCCTGAACT GGAGACGTGT 24420 

GGGTACGGAG CAAGATTTCT TCAGTGATAT AGAAAGTATC CTGCATATCA CGAGCTGGGT 24480 

GGTCTTTTGG AAGGTTCATA CGTTCAAAGT TATAGTAGTC TTGCTCCACT TCAAAACCAT 24540 

CCACGACTTG ATAACCCATA CCGATGAAGA TATCTTCGAT TTCTTCACTG GTTTGTGTCA 24600 

AAACGTGACG GTGACCAGTC GCAACTGGAC GACCTGGAAG CGTCACATCT ATACTCTCGC 24660 

TAGCCAGTTG AGCCGCGACT TTCTTTTCTT CCAAGAGCTT AGCTGTTTCT TCAAAAGCAG 24720 

CAGTCAAGAC ATCACGAGCT TCATTGACGT GTTTCCCGAT GATTGGACGC ATCTCAGCAG 24780 

AAACATCTTT CATCCCTTTG AGGATTTCAG TGAGCGAACC CTTTTTACCA AGGACAGAGA 24840 

CACGCAAATC TTGCATCTCT TTTTCATTTC CAGCAGTAAT CTGCTTCAAG CTAGCCAGCG 24900 
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TTTCTTCGCC 


AAGCGCTTTT 


AATTGTTCTT 


CAATAGTTGA CATATTTCCT 


CCATCAGTCT 


24960 


CTCGTAGATA 


AAAAGAAAAC 


CACATGCCAA 


AAACTCCACT CGGAGCGTTG 


ACACGCGGTA 


25020 


CCATCCGTTT 


TCATCTGACA 


ACTCAGACCT TCATTTCTAA ATCCATGCGC AAGTGAATTC 


25080 


ACCCAGCTTT 


CATATAGAGA 


GCTTGCAGTC ACGGCTCTCC TCCCTGATAT ACTTCCCTTG 


25140 


GGCTACTAGT 


CTTTCAGATT 


CCTATTCAAT 


TACTACTTAG TTTATCAGAT TTTTACCATT 


25200 


CTTGCAAGAC 


CTATCTTACT 


TCTGCTTGTT 


AGCTTATTCT TATCTAAATT 


TATATAAACc 


25260 


TTATCTAAAT 


TAACTATTTA 


TAATTTTTGT 


AACAAAATTA AATTAATTGA 


CACTCCCCTA 


25320 


TAAAATAAAG 


AAGTTTAGAA 


TTTAATGTCT 


TCCAAACTTC TTTATTCCAT 


ATTTAATGAA 


25380 


ATGCCACCTT 


AACCCTGATA 


ATAGCTAGTC 


ATCAATAAAA AACTATTTGA 


ATAAGGATTC 


25440 


TC C ATTTG AT 


TCAATCACTT 


CTTTATACCA 


AGTAAAAGAC ATTTTCTTAT 


ATCGATTTAA 


25500 


TGTACCACTT 


CCATCATCGT 


TTCGATCAAC 


ATAAATGAGA CCGTACCTTT 


TAGAAAGTTG 


25560 


TGCAGTGGAC 


ATAGAAACAC 


AGTCAATACA 


TCCCCAAGAC GTATAGCCCA 


TAATTTCAAC 


25620 


ACCATCCTCT 


AGAGCTTCAG 


CAACTTGCAA 


TAAATGTTCT TTCATATACT 


GAATTCTATA 


25680 


ATCATCTTGG 


ACGGTTAAGT 


TATTAAGTTC 


ATCTTTTATT AGTTGATCTT 


TAGCACCTAA 


25740 


TCCATTTTCT 


ACTATAAATA 


ATGGGATTTG 


ATAACGGTCA TAATATCTAT 


TTAAAATTAT 


25800 


ACGTAGTCCA 


ATTGGATCAA 


TTTGCCATCC 


CCACTCTGAA GACTCTAAAT 


AAGGATTTAC 


25860 


TAAACCACCA 


ATAATATTCC 


CTTCTCCTGA 


ATTATACTGT GTTGGAAGAG 


CAGATTGAGT 


25920 


CACACTCATG 


TAATAGCTAA 


AGGATAAAAA 


ATCTACGGTA TAATTTTTTA 


ATAACTCTGC 


25980 


ATCTTCAGCT 


GCAAACTCTA 


TGTTAATGTC 


ATTTTCCTTA AAATATCTTT 


TTGCATAATT 


26040 


CGGATAATAA 


CCTCTAACAT 


GCACATCTGA 


AAATAGATAA TTTAGATTCT 


CATACTCATG 


26100 


AGTCGCCCAT 


ACATCTTTTG 


GATTTGGAGT 


CATTGGATAA GCTGGCATAG 


CTAATACCAT 


26160 


ACATCCCACC 


TTAAACTCTG 


AATTAATCTC 


ACGAGCAATT TTTGTAACCA 


AACTTGAGCC 


26220 


GACTAATTCA 


TGATGTATAG 


CTTGATATAA 


TTCTTGTTTC GAAAGATTCT 


CCTTAGGTAT 


26280 


ATCTATTCCT CCACTACTAA ATGGTAATTC CAAAACAGAG TTTACTTCGT 


TAAATGTAAG 


26340 


CCAATATTTA 


ACTTTATCTT 


TATACCTTTC 


TAAAACTGTT CGAGCAAATT 


TTTCATAAAA 


26400 


ATGAATCATT 


CTCCTATCAA 


CCCATCCATG 


ATATTTTCTT GCTAAATATA 


ATGGAGTCTC 


26460 


ATAGTGTGAA 


AGAGTTACAA 


GTGGTTCTAT 


CCCGTGAGCA TGTAGTTCAT 


CAAACAATTC 


26520 


ATCATAATAT 


TTCAACCCAG 


CTTCGTTAGG 


TTCTTCCTCA TCTCCTTTTG 


GAAAAATTCT 


26580 


ACTCCATGCA 


ATAGAAGTAC 


GAAAAACATT 


AAAGCCCATT TCAGAAAACA 


AGGATATATC 


26640 


TTCCTTATAT 


TTATGATAAA 


AATCAATACC 


TATCAATTTT AAGTTATCTT 


CTGTAGGATT 


26700 
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TTCTGTTGCT 


TCTCCTAATC 


CACCTTTGGG 


TAACACATCC 


TGAACTGATA 


AGCCCTTACC 


26760 


ATCTTCATTA 


TATGCTCCCT 


CTACTTGATT 


AGCTGCAACA 


GCTCCACCCC 


AAAGAAAATC 


26820 


ATCTGGAAAA 


ATGGTCATAA 


CTTTCCTCCA 


TTATAATATT 


ACCAGTAATT 


CCTTAGAATG 


26880 


CTCGATTGTC 


TGATTATTAG 


GTAATACTAA 


TACATCTAGA 


AAATCATTGG 


TATTCGTTAC 


26940 


AATTACTGGT 


GTAACTGTTT 


CGTAGCCTTT 


AGTCTTGATT 


AAATTCAAGT 


CCATTTCAAA 


27000 


AATCAACtGA 


TTTTTGAAAA 


CTCTGTCTCC 


TTCTTCTACA 


TCACTAATAA 


AACCTTGACC 


27060 


TTTTAGCTCA 


ACAGTATCTA 


ATCCAATATG 


AATTAGTAAC 


TCAACACCCT 


CATCACTCTT 


27120 


CAATCCAATT 


GCGTGCTTAG 


TCGGAAAAAT 


ATTTGTAATT 


TTCCCATCAA 


ATGGTGCATA 


27180 


AACCTTACCT 


TCACTTGGGA 


TAATCGCTAC 


TCCGTCTCCA 


ATTAGTTTAT 


CTGAAAATGT 


27240 


TTTATCCTGG 


ACATCGCTTA 


ACGGAATGAT 


TTCTCCTGAT 


ATAGGAGAAA 


ATATCATTTT 


27300 


TTTATTTGAA 


ACTCCAGCTT 


CAACTTCTAA 


ATTGCTAGAA 


CTCTCTTCTT 


CATCGATTCC 


27360 


AAATATATAA 


GCTAATACAA 


AGGTAATAAC 


AACCGAAATC 


ACCGCCACAA 


TTAAAGCATT 


27420 


TACAATATTT 


GATGGCACAT 


CAGAATAAAT 


AAATTGAGGC 


AACGCTATCA 


AAGATGGGAC 


27480 


AGCAAATAGA 


TATGCTTTAA 


CACTAGTAAG 


ACCTGCAAAT 


AATCCCGCTA 


ATCCACCACC 


27540 


AATCATAGCT 


GCATAAAGCG 


crrrriTATA 


TTTTAAAGTC 


ACACCATATA 


ATGCAGCTTC 


27600 


GCTAATCCCT 


GCAACTAAGG 


CTGAGAAACC 


TGCTGCAAAA 


GCAATTTGTT 


TTGTATTATT 


27660 


ATTTTTACTC 


TTTAATGCAA 


CAGCCATCGA 


AGCAGCCCCT 


TGAGCTAAGT 


TTGACCCTAA 


27720 


CATTGCTGGA 


AGAATTAATA 


CGTCTGGAGT 


AGCAATAGAT 


GCCGCCAAAA 


AAATAGGTGC 


27780 


AAAAGCCCAA 


TGCATTCCAG 


TCATAACAAT 


AAATGGCATA 


ATAGCACCAA 


GAATAGCTAA 


27840 


TGTAAGCCAT 


CCAGCTACAC 


CATACATTTG 


CCCAACTAGA 


TTTGATAATC 


CTTCACCAAC 


27900 


AATTACTCCA 


ATAGGTCCGA 


CTACAACTAA 


GGCAATACAG 


CTTGATACTA 


ATAATACTAG 


27960 


V~Vj i f\yj\*t l 1 Li 


AAAAAACTv-T 


TAGTAATAGC 


TAGTGTTAAT 


TTAGCAATTA 


TTTTTTCAVT 


28020 


ATATTTCATC 


AACCAAACCA 


TAATAAGAAT 


TGGAACGACT 


GATGAACCAT 


AACTAGCTGG 


28080 


TGTCACAGGT 


GCACCAAATA 


AACTAAGAGG 


ATTCCCTGAT 


TGCACCATTT 


GAACAAAATT 


28140 


TGGATGGAGA 


AGTACACCTG 


CTACAGACAT 


AGCTAATGTA 


GATGTTACTT 


TTAATTTTTG 


28200 


TGATGCAGAA 


TAAGCTAATA 


ACAGCGGTAA 


GAAATAATAT 


GGAGCATCCC 


CAAAAAATGT 


28260 


CAAAAAAGCA 


ATAGTCTGAG 


AATCTGATTG 


CAATATACCA 


AGCATTGCTA 


AAATGATTAC 


28320 


CAAGACTTTC 


AACATACCTC 


CCCCTAACAT 


TGCTGGAATG 


ATTGGAGTCA 


TGGAACCAGC 


28380 


GATATACTCA 


ATGATTCTTT 


CTAAAATATT 


CCCTTTGTGC 


CCTTGAACAA 


CTGAATCGGA 


28440 
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TTCAAAATTG CCAAGTTTAA CGAATTCTTT ATAATAATTA GCTACATCAT TACCAAGTAT 28500 

AATTTGATAT TGTCCATTCT TTTTCATAAT AC CT ATT AC A CCTG6TATCT TCTTCACATC 2 8560 

ATCATCATTG ACTAAATTTT CATCTTTTAA TTCTAATCTT AAACGTGTTA CACAATGGGT 28620 

AACTCTATTG ACATTTTTTT CACCTCCAAT TACATCGAGG ATTTTTTCTA CCCTATCTTT 28680 

ATAACTCATG GTATTCTCCT ATTCTATTAA TCTAAATTTT TTGTTAAGCG ACGAATATGA 28740 

GCCATCAAAT AAACTAATTC ACTAGAAGTC AGCAAATAAT TGTACTCCGT TTGTATAAAC 28800 

ATTGCTACCT GTTCACCACA TTCATATTCT CTAGGATATT TATTTTTCAT TAATGCTAAC 28860 

AAGTCTTCAT CATCATCGTC GG 28882 
(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12835 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 



GCCTATGTCT 


TTTTCAAAAA 


AATGCTTGAC 


TTGAGACGGG 


AACTAGGGAA 


GTCTAAAGGC 


60 


GGAAGGCATT 


GATTTATACT 


CTTCGAAAAT 


CTCTTCAAAC 


CACGTCAACG 


TCGCCTTGGA 


120 


TTATATATGT 


AACTGACTTC 


GTCGATGCTT 


ATCTACAACC 


TCAAAGCAGT 


GCTTTGAGCA 


180 


ACTTGCGGCT 


AGTTTCCTAG 


TTTGCTCTTT 


GATTTTCATT 


GAGTATTATA 


TTACTTTCTA 


240 


TTTGTACGAG 


GTGGCTTATG 


AAGATTCCTC 


TCTTAACTTT 


TGCAAGGCAT 


AAATTTGTTT 


300 


ATCTCTTGCT 


TACTTTGCTT 


TTTCTTGCTT 


TGGTTTATCG 


TGATGTTTTG 


ATGACTTATT 


360 


tcttttttga 


TATTCATGCG 


CCCGATCTAG 


CTAAATTCGA 


TGGACAAGCA 


ATTAAAAATG 


420 


ACTTATTAAA 


ATCAGCATTA 


GATTTTCGTA 


TTCTCCAGTT 


CAATCTAGGT 


TTTTATCAAT 


480 


CATTTATTAT 


TCCAATCATC 


ATTGTTTTGC 


TAGGTTTTCA 


ATATATTGAG 


CTGAAAAATA 


540 


AAGTTTTACG 


ATTGAGTATT 


GGAAGAGAAG 


TGAGTTATCA 


ACGGTTAAAA 


AGAAAGTTGA 


600 


CTTTGCAAGT 


TGCAAGTATC 


CCTTGTTTGA 


TATATTTAGT 


GACTGTGCTG 


ATAATTGCAA 


660 


TTATAACCTA 


TTTCTTrGGG 


ACTTTTTCTC 


CTCTTGGATG 


GAATTCTCTA 


TTTTCTGATG 


720 


GAAGTGGTTT 


ACAAAGACTC 


CTAGATGGAG 


AGATAAAAAG 


CTATTTGTTC 


TTTACTTGTG 


780 


TCCTACTAAT 


CGGTATTTTC 


ATCAATGCAA 


TCTATTTTTT 


ACAAATAGTT 


GATTATGTGG 


840 


GGAATGTGAC 


TCGTTCGGCA 


ATCACCTATT 


TGATGTTTCT 


TTGGCTTGGT 


TCTATGCTGC 


900 


TTTATAGTGC 


CTTGCCTTAC 


TATATGGTTC 


CTATGACGAG 


TTTGATGCAA 


GCTAGCTATG 


960 
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GCCATGTAAG TTTGATGAAA CTCTTTACTC CTTATATCCT TTATATTGTC CCTTACATGG 
TGCTTGAAAA ATATGAAGAT AATGTTTAAG AATTTTAACA ATATTTTGCT AAATAGAAAG 
ATTGTTTTAC TACTTCGTAT ACTTCTGA7G ATGATTTTGA TAAACCATCT ATTCTCAACA 
GCGGTTCAAA AGCAGGATGC TGTTATCTTT TTCAAGAGAG AATTGATTTC AATTTTTTCC 
TATAATGACT ATTCTGAAGC GAATTTAGAA ATCCCCAAAC TATTGTTAAA CCTTTCGCTT 
-TTCATGGTAG GATGGCTCTC " TGTCATTTTA CTTGAAAGTG ATTTGGCAGA CXATTACCAT 
CACTTGATTC GCTATCAATC AAGCTCCTTT TTCGATTATA CAAGGAAACG ATTGGTTGTC 
ATTTCTAAAT TTTTTACTCA AGATTTGTTT GTCTGGTTTC TTGGTTTACT TCCTCTAGGA 
ATTCATTTCA AAACAGTCGC ACTTTTCTTT TTACTTGCTC AGTTAATGAT GTTGTACTTA 
CTACTGTCTT ATCTGATAGC ACTGATTAGT GCGGGCGCTG GTTTTTCCTT TTTTCTCTAT 
TTTTTAGCAT TTGTGGGACA AGAATGGATG ATGGATCATA TTGTAACAGT GTATTTAGTA 
CTCTTAAGTT TATTAGTTAT GTTGATTGTT AGTCGCTTGG AAGAGAAATT TAAGAAAGGA 
TAAACGATGA GACTTGAAAT TATAAATGGA CAGAAAATTT ATGGGAAAAG ACCTATTTTA 
AATCAGTTGA ATTTGGTGTT TCAATCAGGA AAAATTTATG GACTTAAAGG TGATAATGGA 
TCTGGCAAGA CGGTTCTTTT AAAGATACTT GCTGGTTATA TTAAGCTTGA CAAAGGAAAA 
GTTCTTCAAG ATGGTAAAGT TTACGGGGTA AAAAATCATT ATATTCAGGA TGCAGGAATT 
TTAATTGAAA AAGTCGAGTT TTTATCTCAT TTATCCCTGA GAGAAAATTT GGAACTGTTA 
AGGTATTTTT CATCTAAAGT TACGGAAAAA AGAATTGCCT ATTGGATTCA ATACTATGAT 
TTACAGGAAT TTGAAGACAT TGAATACCGT CATTTATCCT TAGGAACAAA GCAAAAAATG 
GCCTTGATTC AAGCCTTTAT TTCCTCTCCT TCTATACTCT TTCTCGATGA ACCTATGAAT 
GCTTTGGATG AGAAGAGTGT GAGGTTAACC AAACAGGTCA TTTTATCTTA CCTGAAAAAA 
GAAAATCGTC TGGTTATCCT GACGTCGCAC ATATCGGAAG ATATTTCAGA CCTTTGTACA 
GATGTATTAG TTGTCGAAAA TGGACATATA CAAATGTAAA GGATATACAA TCCTAGGAGA 
TGGCTTATGG CACATCTAAA ATCATTTATT ACACGATATT C CAAGGTTT A TATTGGTTTA 
GTTCTGCTGA TCTGGCTGTC TTTCTTCTTT ATCCCTTGGG ATAAACCACT TCTGGGGATA 
AGGATTGACA TCTTCATCAT ACAGAAAATC TTGCTAGCTT TTGGAATTCT GTCCATTCTC 
ATGGCCTTGC TGTCCAAGAA AGTCAGTCTC TTTGTTTTTG GACTGATTTG CTGTCTTTCT 
CTTTGGATTA ACTTATTTAT CACATTTGCC ATTTTGCCGA TTTTTGGCAA TTAAACAGTC 
ATAAAAGTCG GAGAGGTTAG CTTGAAAACT AACCTCTTTT TCCTTTTCAA AATGGGGATT 
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CTTCCTTGAA 


AATAATCAGT 


AATTGTGCTA 


AAATTAAAGG 


AACATTCTAA 


AATATTCGGA 


2760 


ATTTAAAGTA 


AGGAAAAACA 


TGGCTAATAT 


TTTAAAAACA 


ATTATCGAAA 


ATGATAAAGG 


2320 


AGAAATCCGT 


CCTCTGGAAA 


AGATGGCTGA 


CAAGGTTTTC 


AAATACGAAG 


ACCAAATGGC 


2880 


TGCTTTGACT 


GACGACCAAC 


TAAAAGCAAA 


AACAGTTGAA 


TTTAAGGAAC 


GTTATCAAAA 


2940 


TGGAGAATCA 


CTGGATTCAT 


TGCTTTACGA 


AGCATTTGCG 


GTTGTCCGTG 


AAGGTGCCAA 


3000 


ACGTCTCCTA 


GGTCTCTTCC 


CTTATAACGT 


TCAGGTCATG 


GGGCGGATTG 


TTCTTCACCA 


3060 


TGGTGACCTG 


CCAGAGATGC 


GTACAGGGGA 


AGGGAAAACC 


TTGACTGCGA 


CCATGCCGGT 


3120 


ATACCTCAAT 


GCCCTTTCAG 


GTAAAGGGGT 


TCACGTAGTT 


ACGCTTAATG 


AATACCTGTC 


3180 


AGAACGTGAC 


GCGACTGAGA 


TGGGTGAATT 


GTACTCTTGG 


CTTGGTTTGT 


CAGTAGGGAT 


3240 


TAACTTGGCT 


ACCAAATCTC 


CAATGGAGAA 


AAAAGAAGCC 


TATGAGTGTG 


ATATTACTTA 


3300 


CTCAACTAAC 


TCAGAAATCG 


GATTTGACTA 


CCTTCGTGAC 


AACATGGTCG 


TTCGCGCCGA 


3360 


AAACATGGTA 


CAACGTCCGC 


TTAACTATGC 


CTTGGTCGAT 


GAGGTTGACT 


CTATCTTGAT 


3420 


TGACGAGGCT 


CCTACACCTT 


TGATTGTATC 


AGGTGCCAAT 


GCGGTTGAAA 


CCAG7CAGTT 


3480 


GTATCACATG 


GCAGACCACT 


ATGTAAAATC 


TTTGAACAAA 


GATGACTACA 


TCATCGATGT 


3540 


GCAGTCTAAG 


ACTATTGGTT 


TGTCTGATTC 


AGGGATTGAC 


AGGGCTGAAA 


GCTACTTCAA 


3600 


ACTTGAAAAC 


CTCTATGACA 


TCGAAAACGT 


GGCTTTGACT 


CACTTTATCG 


ATAACGCCCT 


3660 


TCGTGCCAAC 


TACATCATGC 


TTCTCGATAT 


TGACTATGTG 


GTGAGCGAAG 


AGCAAGAAAT 


3720 


CTTGATTGTC 


GACCAATTTA 


CAGGTCGTAC 


CATGGAAGGT 


CGTCGTTATT 


CTGATGGATT 


3780 


GCACCAAGCT 


ATTGAAGCCA 


AAGAAGGTGT GCCAATCCAG 


GATGAAACCA 


AGACATCTGC 


3840 


CTCAATCACG 


TACCAAAACC 


TCTTCCGTAT 


GTACAAGAAA 


TTGTCTGGTA 


TGACGGGTAC 


3900 


AGGTAAGACT 


GAGGAAGAAG 


AATTCCGTGA 


AATCTACAAC 


ATTCGTGTTA 


TTCCAATCCC 


3960 


AACAAACCGT 


CCTGTTCAAC 


GTATTGACCA 


CTCAGACCTT 


CTTTATGCAA 


GTATCGAATC 


4020 


TAACTTTAAA 


GCGGTTGTCG 


AAGACGTTAA 


GGCTCGTTAC 


CAAAAGGGTC 


AACCTGTCTT 


4080 


GGTTGGTACA 


CTAGCGGTTG 


AAACTAGTGA 


CTACATTTCT 


AAGAAATTGG 


TTGCAGCTGG 


4140 


TGTTCCTCAC 


GAAGTCTTGA 


ATGCCAAAAA 


CCACTATAGA 


GAAGCCCAAA 


TCATCATGAA 


4200 


TGCTGGTCAA 


CGTGGTGCCG 


TTACCATCGC 


AACCAACATG 


GCGGGTCGTG 


GTACCCACAT 


4260 


CAAGCTTGGT 


GAAGGTGTTC 


GTGAACTTGG 


AGGACTTTGT 


GTTATTGGTA 


CAGAACGTCA 


4320 


TGAAACTCGT 


CCTATCGATA 


ACCAGCTTCG 


TGGACGTTCA 


GGTCGTCAAG 


GAGATCCAGG 


4380 


TGAGTCACAA 


TTCTACCTAT 


CTCTTGAAGA 


TGATTTGATG 


AAACGTTTTG 


GTTCTGAACC 


4440 


CTTGAAGGGA 


ATCTTTGAAC 


GCTTGAACAT 


GTCTGAAGAG 


GCCATTGAGT 


CTCGCATGTT 


4500 
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GACGCGTCAG GTTGAAGCAG CTCAGAAACC TGTCGAAGGA AATAACTACG ATACCCGTAA 
ACAAGTCCTT CAATACGATG ATGTCATGCG TGAACAACGT CAGATTATCT ATGCTCAACG 
TTACGATGTC ATCACTGCAG ATCGTGACTT GGCACCTGAA ATTCACTCTA TGATCAAACG 
CACGATTGAA CGTGTCGTTG ATGGTCATGC GCGTGCCAAA CAAGATGAAA AACTAGAGGC 
AATTTTGAAC TTTGCTAAGT ACAACTTCCT TCCTGAAGAT TCTATTACGA TGGAAGACTT 
. - GTCAGGCTTG TCTGATAAGG CCATCAAGGA AGACCTTTTC CAACGTTCCT TGAAGGTTTA 
CGATAGTCAG GTTTCAAAAC TACGCGATGA AGAACCACTT AAAGAATTCC AAAAAGTTTT 
GATTCTACGA GTGGTGGATA ACAAGTGGAC AGATCATATC GATGCCCTTG ATCAATTGCG 
TAACGCGGTT GGACTTCGTG GCTATGCTCA GAACAACCCT GTTGTTGAGT ATCAGGCAGA 
AGGTTTCCGT ATGTTTAATG ATATGATTGG TTCGATTGAG TTTGATGTGA CACGCTTGAT 
GATGAAAGCA CAAATTCATG AACAAGAAAG ACCACAGGCA GAACGTCATA TCAGTACAAC 
AGCGACTCGC AATATCGCTG CTCACCAAGC AAGTATGCCA GAAGATTTGG ATTTGAGCCA 
GATTGGACGC AATGAACTTT GCCCATGTGG TTCTGGTAAG AAATTTAAAA ACTGTCACGG 
TAAAAGACAA TAAAATGAGA TAGTTTAGAG GCGGATATCT TGTGAAAAGT AAATTTTTAC 
TGGGTATCCG TTTGCTTTAT AAGGAGATGA GTTATGGTAT TTACAGCAAA AAGCTCTAAA 
ATAAATATAG AAGAAGTTCG TGCCTTGTCA AAATTAGAAG GTCAGGCTTT GGAGAGGAAA 
TCACAGCGAG ATCAAGAGCT AGAAGCCATT ATACGTGGAC AAGACCAGCG AATTCTCTTC 
GTAATCGGGC CATGCTCATC TGACAACGAA GAAGCTGTCC TTGAATACGC TAAGCGTTTG 
GCAGTCCTAC AAGAAGAAGT GGCAGATCGT ATCTTTATGG TTATGCGTGT TTATACTGCC 
AAACCCCCTA CCAACGGAGA TGGCTATAAG GGCTTGATTC ACCAGCCTAA CGCGACAGAA 
GCGCCTAGTC TTATCAATGG AATCAAAGCC GTTCGCCATC TTCACTATCG TGTCATCACA 
GAAACAGGGA TGACAACTGC TGATGAAATG CTTTATCCTC AAAACCTTCC GCTTGTACAT 
GATTTGATTT CTTACATGGC AGTTGGTGCC CGTTCAGTTG AAGACCAGCA ACACCGCTTT 
GTGGCAAGTG GGGCAGGATT TTCTACTGGT TTTAAAAATC CAACCTCTGG AAATCTCAAT 
GTCATGTTTA ATGGGATTTA TGCTGCTCAA AACAAACAAA GTTTCCTTTT CTTAGGAAAA 
CAAGTAGAAA CAACTGGGAA CCCGCTTTCA CACGCTATTC TTCGTGGTCC TCTTAATGAG 
TATGGAAAAA ATATTCCCAA CTACTATTAT GACAATTTAA TTGATACCAT TGCCCAGTAT 
GAGAAAATGG GCTTGGAAAA TCCTTTTATC ATCATTGATA CCAATCATGA CAATTCTGGT 
AAGCAGTATA TTCAACACAT CCGAATTGTC CGCCAGACCT TGATTAACCG TGCTTGGAAT 
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GAAAAAATTA AGCAGTTCGT TCGTGGTTTT ATGATTGAGT CTTATCTGGA AGATGGTCGA 6300 

CAAAATGAGC CAGAAGTATT TGGTAAGTCT ATCACAGACC CTTGCCTGGG TTGGGATAAC 6360 

ACAGAAGCTC TTGTCAGAGA AATTTACAAA ACGTTAGGAG AATAAGATCG CATTTATTGA 6420 

AAAAGGTCAA GAAATCGATA TGGAAGTCAT CAAGGCTGAA ACCCAATTGT CTGCGGAAGC 6480 

CTTGAGACTC AAGGAAAGCC GTGACAGGGA ATTGGCAGAT ATTATTTCAG GGGAAGATGA 6540 

CCCTATTCTC TTGGTGATTG GTCCTTGCTC TTCTGATAAT GAAGAGGCGG TCTTGGAATA 6600 

TGCTCGCCGT TTATCTGCCT TGCAAAAGAA GGTAGCGGAT AAGATTTTCA TGGTCATGCG 6660 

CGTGTATACT GCTAAGCCTC GTACCAATGG AGACGGCTAT AAAGGATTAG TTCACCAGCC 6720 

AGATACTTCT AAGGCTCCAA GCCTGATTAA TGGCTTGCAG GCTGTGCGCC AGTTGCACTA 6780 

CCGCGTGATT ACAGAGACTG GTTTGACAAC GGCACATCAG ATGCTTTATC CGTCAAATCT 6840 

GATCTTGGTG GATGACTTCC TCAGCTACCA TGCCGTTGCA GCTCGTTCTG TGGAAGACCA 6900 

AGAGCACCGC TTTGTGGCTT CTGGGATTGA TGCACCAGTA GGGATGAAAA ATCCAACCTC 6960 

AGGAAATTTG GGTGTTATGT TTAACGCCAT CTATGCTGCT CAAAACAAGC AAACCTTCCT 7020 

TTATCATGGG CAGGAAGTTG AGACATCAGG TAATCCTTTG GCCCATGTTA TCCTCCGTGG 7080 

AGCAGTCAAC GAGTATGGCA ATTATATGCC GAATTACTAC TATGAAAATC TACTCCAAGC 7140 

CATTGAACGC TATGAAACCA TGGGACTTGA AAATCCTTTT ATCCTCATTG ACACCAACCA 7200 

TCATAACTCA GGCAAGCAAT ATATGGAGCA CATTCGAATT GTTCGCCAGA CCTTGCAGAA 7260 

TCGTGATTGG AATGAGAAAA TTAAAAAGAC GGTTCGAGGA TTTATGATTG AATCTTACCT 7320 

AGCAGATGGT CGTCAAAACC AACCAGAGAT CTTTGGTTGC TCTATTACTG ACCCTTGCCT 7380 

AGGTTGGGAA AATACAGAGG CCTTGGTAGA AGAGATTTAT GTTACCTTGA CAAAATAAGT 7440 

GAAAAGGATG GAGTTGGGGA ATCTCAACTC CTTTTGATGA GAATGATAGT TGGACACGGA 7500 

ATTGACATCG AAGAATTGGC TTCGATAGAA AGCGCAGTTA CACGACATGA AGGATTTGCT 7 560 

AAGCGTGTAC TGACCGCTCA GGAAATGGAG CGCTTCACCA GTCTCAAAGG ACGCAGGCAA 7620 

ATAGAATATT TAGCTGGTCG CTGGTCGGCT AAGGAGGCCT TTTCCAAGGC TATGGGAACG 7680 

GGCATTAGCA AGCTCGGTTT TCAGGATTTG GAAGTCTPGA ACAATGAACG TGGGGCGCCT 7740 

TATTTTAGTC AGGCACCATT TTCAGGAAAG ATTTGGCTGT CTATCAGCCA CACCGATCAG 7800 

TTTGTGACAG CCAGTGTCAT TTTGGAGGAA AATCATGAAA GCTAGTCCAC ATAGACCAAC 7860 

CAAGGCTCTG ATTCATCTGG GAGCTATTCG ACAAAATATT CAGCAAATGG GGGCTCATAT 7920 

CCCTCAAGGA ACGCTCAAGT TGGCTGTGGT TAAGGCCAAT GCTTATGGTC ATGGAGCTGT 7980 

TGCCGTTGCC AAGGCAATTC AAGATGATGT TGATGCCTTT TGCGTTTCCA ATATCGATGA 8040 
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ACCCATTGAA 


CTCAGACAAG 


: CTCGACTCAG 


: CAAGCCAATC 


CTCATTTTAG 




8100 


AATCGAAGCT 


GTTGCTCTAG 


CTAAAGAATA 


TGACTTCACC 


TTGACAGTGG 




0 1 ou 


GTGGATTCAA 


GCACTCTTAG 


ATAAGGAAGT 


GGACCTAACT 


GGATTGACAG 






GATTCATTCA GCGATGGCAC GGATTGGTTT TAGAGAGGCA AGTGAGCTTG 






AGATTTGCTC 


CAACAACACG 


GTGTTTGTGT 


TGAAGGAATC 


TTTACCCACT 




8340 


TGATGAGCAA 


TCAGATGACT 


ATTTTAATGC 


CCAGTTAGAA 


CGGTTTAAAA 


CTATTTTAGC 


8400 


TAGTATGAAG 


GAAGTTCCAG 


AGCTGGTTCA 


TGCTAGCAAT 


TCTCCAACGA 


CTCTTTGGCA 


8460 


TGTAGAGACT 


ATTTTCAATG 


CGGTTCGTAT 


GGGAGATGCC 


ATGTATGGCC 




8520 


TGGAGCGGTC 


TTGGATTTGC 


CTTATGATTT 


GATACCGGCC 


TTGACCTTGG 




8580 


GCTTCATGTC 


AAGACAGTTC 


CAGCTGGAGC 


TTGCATGGGC 


TATGGAGCAA 




8640 


GGATAGCGAG 


CAAGTCATCG 


CGACCGTGCC 


AATCGGGTAT 


GCAGATGGAT 






CATGCAAAAT 


TTCTCTGTCT 


TGGTAGATGG 


CCAAGCTTGC 


CCAATTGTCG 




Q "I £ n 


GATGGACCAA 


ATCACTATTC 


GATTGCCTAA 


GCTTTATCCG 


CTAGGAACCA 


AGGTAACCTT 


StflU 


GATTGGCTCC 


AATGGGGATA 


AGGAAATCAC 


TGCAACTCAG 


GTAGCGACCT 


ACCGCGTAAC 


gaa/\ 

oooU 


CATTAACTAT 


GAGGTGGTTT 


GCCTCCTCAG 


CGACCGTATT 


CCGAGAGAAT 


ATTATTAGAA 


8940 


AAGAAAGGAG 


TGCAGCATGA 


ATCTACATCA 


ACCCTTGCAT 


GTCTTGCCTG 


GTGTGGGACC 


9000 


AAAGTCAGCA 


GAAAAATACG 


CCAAACTAGG 


AATTGAAAAC 


TTGCAAGATC 


TCTTGCTCTA 


Qrt£A 

jUOU 


CTTTCCTTTC 


CGTTATGAAG 


ACTTCAAAAC 


CAAGCAGGTG 


CTGGAGCTGG 


AAGACGGTGA 


q 1 5 rt 


GAAGGCAGTT 


CTTTCTGGTC 


AGGTAGTGAC 


TCCTGCTAGT 


GTCCAGTATT 


ATGGTTTCAA 


qi on 


GCCCAATCGC CTGCGTTTTA GTCTCAAGCA GGGAGAGGTC GTTTTTGCGG TGAATTTCTT 




TAACCAGCCC 


TATCTGGCTG 


ATAAAATAGA 


GTTGGGAGCA 


ACCCTTGCTG 


TCTTTGGAAA 


/ Jww 


ATGGGACCGC 


GCTAAGGCTA 


GTCTGACTGG 


GATGAAGGTT 


CTGGCTCAGG 


TAGAAGATGA 


9360 


CCTCCAGCCT 


GTCTATCCTC 


TGGCTCAGGG 


AATCAGTCAG 


GCCAGTCTGG 


TCAAGGTCAT 


9420 


CAAGACGGCT 


TTTGATCAGG 


GACTGGACCT 


CTTGATAGAA 


GAAAATCTGC 


CCCACTCTTT 


9480 


ACTAGACAAA 


TACAAACTCA 


TGTCCCGTTG 


TCAGGCAGTC 


CGTGCTATGC 


ATTTTCCAAA 


9S40 


GTATTTGGCA 


GAATACAAGC 


AGGCTCTTCG 


CCGTATAAAG 


TTTGAGGAAC 


TCTTTTATTT 


9600 


CCAAATGCAG 


CTGCAGATGC 


TCAAGTCTGA 


AAATAGAGTT 


CAGGGAAGTG 


GTCTGGTTCT 


9660 


GAATTGGTCT 


CAGGAAAAAG 


TGACAGCAGT 


TAAAGTAAGT 


CTTCCTTTTG 


CCCTGACCCA 


9720 


AGCTCAGGAA 


AAGAGTTTGC 


AGGAAATTTT 


AACTGATATG 


AAGTCCGACC 


ACCACATGAA 


9780 
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TCGTCTCCTA 
TCCGGCAGTG 
GCAACACTTT 
TTCCTTGAAA 
TTTGATTATA 
GATTATTATC 
AGGTGACAAT 
CACAGCCTTT 
TATTGTGACG 
GGAAATTCAA 
TCTAGATTTG 
GGCAGAGGTG 
GGATTTCAAC 
CAACGTTCCC 
ACTTCACCAG 
TGCTAATCCC 
TGGATTTGTC 
CAGACAGTCA 
AGAAGAAGCA 
AGAGTGGCGC 
TAAGGAAAAC 
CTCAAAACAC 
TGAGCTGGCA 
CGTGGTTTGA 
AGAGTCATCA 
AACCACTTTC 
TCTTTTTTCG 
CACTTTCTAT 
CGATTACAAA 
CCTATAGAGG 



CAAGGGGATG 
ACAGCAGGTT 
GAGAGTTTAC 
GCTGCAGAAA 
GGAACTCACG 
GATGAGCAGC 
CCAGATGTCC 
GGAGATATGG 
CGCTGGATCA 
AAAGGTTCCC 
AAAAATGCCA 
GCTCTTCTAC 
GAGAGAAAGA 
AATGCGACTG 
CTTAGAGGTC 
AAGACGGATT 
CTTGCGGAGG 
GCACTTCCAG 
AGAAAGGTTG 
ATGATTGCCC 
TTATACTCAA 
TGTTTTGAGG 
GATAGAACTG 
AGAGATTTTC 
AAAAGAAACG 
CAAAAGAAGG 
TTTCTAAAAC 
TTACCCTTCT 
ATAAAAGGAG 
AAGGGATGAG 



TGGGGAGTGG 
ATCAGGCTGC 
AGAACCTTTT 
AGAGAGAAGT 
CTCTGATACA 
ACCGTTTTGG 
TCATGATGAC 
ATGTTTCCAT 
AACATGAGCA 
AAGTCTATGT 
TTGCCTTATC 
ATGGTAGGAT 
CGGATATTCT 
TCATGATTAT 
GTGTCGGTCG 
CTGGGAAAGA 
AAGATTTGAA 
AGTTCCAAGT 
CTAGCTACAT 
TTCATCTGGA 
TGAAAATCAA 
TTGTGGATGA 
ACGAAGTCAG 
GAAGAGTATT 
AGGACTCTCA 
AAGCAAAAAA 
TACTTTCAGC 
TTCTTGCATT 
CATGCTATGA 
GTTCCGGAAG 



962 
AAAAACGGTA 



CCTAATGGTA 
TCCCAATTTG 
CTTGGAGACC 
AGATGGGGTG 
TGTAGGGCAA 
GGCGACTCCC 
TATCGACCAG 
ACTACCTCAG 
CATCTCTCCT 
AGAGGAGTTG 
GAAGAGTGAC 
GGTTTCGACG 
CATGGATGCC 
GGGGGACAAG 
CCGCATGCGT 
AATGCGTGGT 
GGCTGATATT 
TAGTTCTATA 
AAAGAAAGAA 
AGAGCAAACT 
AACTGACGAA 
TAACATATAT 
AAGCTAGTTT 
TATGACAGTA 
CTAGTATAAA 
CCATCATCCT 
GATTACATAC 
AAAATCCAGC 
ACTTTGATGA 



GTCGCTGCCT 
CCAACAGAAA 
AAACTGGCTC 
ATTGCCAAGG 
GAGTATGCTC 
AGGCGTATTT 
ATTCCACGGA 
ATGCCAGCAG 
GTCTTGACTT 
TTGATTGAAG 
ACGACTCATT 
GAAAAAGACC 
ACGGTTATTG 
GATCGCTTCG 
CAGTCCTACC 
ATCATGACAG 
TCTGGTCAGA 
ATCGAAGATT 
GAAGCTTGGC 
CATCTGGATT 
AGGAAGCTAA 
GTCAGCTCAA 
ATACGGTAAG 
TTAGGTTTGG 
ACGATTAAAG 
CAGAAGAGAG 
AAAAGTAAAG 
ATATGCTACA 
TTTGCTAGAA 
TTTCTGGGAT 



TGGCCATCTT 
TCCTCGCAGA 
TCTTGACAGG 
GTGAGGCTGA 
GTCTTGGTTT 
TACGGGAAAA 
CCCTTGCCAT 
GTCGGAAGCC 
GGTTAGAGGG 
AATCAGAAGC 
TTGCAGGCAA 
AGATCATGCA 
AGGTTGGGGT 
GTCTCAGTCA 
CTGTTCTCGT 
AAACGACCAA 
TTTTTGGAAC 
TTCCCATTTT 
AAGAAGATCC 
AAGCTTTCTC 
CCGCAGGTTG 
AACACCGTTT 
GCGACGCTGA 
CTCTTATACT 
TAAATTACCA 
AGCGAAATGC 
AATCTAAATT 
CTTGTGGTAA 
GAAATTAAGA 
GGGGAAGTGA 
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AAAATCTTTC CACGCTTCCA TCCTACCACT TGGAGGAAAG AGATTTCCAC ATTCCTCAAG 
TCAAGTGCTA TGAGTTAACA TTTGAAGGAA GCAAGGAAGG AAAGGTCTAT GCACGCATTG 
TTCTTCCAAA GAGTGAGGAG AAGGTCCCAT TAATCTTCCA TTTTCATGGT TATATGGGAC 
GTGGCTGGGA CTGGGCCGAC ATGCTGGGCT TCACCGTAGC TGGTTACGGT GTTGTTTCCA 
TGGATGTGCG GGCCCAGTCA GGTTACTCAC AAGACGGCTT GCGTTCTCCT TTAGGAAATA 



CCGTCAAGGG GCATATTATC ^TGG^CTG TGG AAGCTCcT GGACC ACCTC ^TTTTAT AAGG 
ATGTTTATCT GGATATTTAC CAGTTGGTCG AAATTGTTGC TAGTCTGTCT CAGGTTGATG 
AGAAGCGTCT TTCTAGCTAT GGTGCCTCAC AAGGAGGGGC TCTAGCTCTA GTTGCAGCAG 
CGCTCAATCC TCGAATTCAG AAAACAGTTG CCATTTATCC CTTCTTGTCA GACTTCAGAC 
GGGTGATTGA GATTGGTAAT ACTAGCGAGG CTTACGACGA ACTTTTCCGT TATTTCAAGT 
TTCACGACCC CTTCCATGAA ACAGAGGAGG AAATCATGGC GACCCTTGCC TATATCGATG 
TCAAAAATCT TGCCCATCGT ATCCAAGGTG AGGTTAAGAT GATTACGGGC TTGGACGACG 
ATGTTTGCTA TCCCATTACC CAGTTTGCGA TTTATAATCG TCTGACCTGC GATAAAACCT 
ATCGCATCAT GCCTGAGTAT GCTCACGAAG CCATGAATGT ATTTGTCAAT GACCAAGTCT 
ACAACTGGCT CTGTGGAAGT GAGATTCCTT TTAAATATCT AAAATAAGGA GTCGACTCTA 
AGCACAAAAT CTTAAAAATT ACAAACACGC ATAGTATCAC GGGATTAAGA AAACTTTATA 
CTATGCGTTT TATCATCGAA ATATAGTAAA ATGAAATAAG AACAGGACAA ATCGATCAGG 
ACAGTCAAAT CGATTTCTAA CAATGTTTTA GAAACAAATG TGTACTATTC TAGTGTCAAT 
CTATTATATT TATAGAATTT TTTGTTGCTA GATTTGTCAA ATTGCTTAAA ATAATTTTTT 
TCAGAAAGCA AAAGCCGATA CCTATCGAGT AGGGTAGTTC TTGCTATCCT CAGGCTTGTC 
TGTAGGTGTT AATACTTTTC AAAAATCTCT TCAAACCACG TCAGCTTCGC CTTGC 
(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 
GGGGATATGA ACAACAAAAG AATATTTAAA CACTTCCAAG CTTCAAAAAT GAGTTTAAAC 
ATTTACACAA GCCCCTTGTT AGCCTTTGTT TTTGTCTTCA TAGGAGAGTT TGTGGCTTTT 
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ACTTTGTATG GTATTGGCTT GTTAGCTCTC ATCGGACTTC CTAGAAATTT TGGAGAGGCT 180 

GGTCAAAATC TTGCAAGCTA CTTGCAGACC TTGCATCAGA GCTTGACGGA TAAAACAAGT 240 

GACTTTCGTT TAATTTTAGG ATTACTGGCC TTTGGTTATT CTTAACACTG TGTTCAGATG 300 

GACAAGAAAA CTTGAGAAAA GACCTATTCG AACCTTGGGA TTTTATAGAG AGAATTTCCT 3 60 

CAGCAATCTT CTGAAAGGAT TTAGTCTAGG CCTGGCACTT TTTCTTCTGA CCTTGTTAGG 420 

TTTAGTGGTC TTAGGTCAAT ATCCTTTGGA ATCCATTCAC TTGAATCCTT ATTCTCTTGC 480 

CTTTGTCGTC TTTACTATCC CATTTTGGAT TTTACAGGGG ACAGCAGAAG AAGTGGTGGC 540 

CCGTGCTTGG CTACTTCCTC AATTGGCCTC AAGAACCAAT CTAAAACTAG CTATTCTTAT 600 

ATCTAGCCTG TTCTTTACCC TGCTTCATAT GGGCAATTCT GGTCTCACCC CTCTATCTCT 660 

AGTAAATCTC TTTTTATTCG GAGTTGCCAT GGCTCTTTAC CTTCTCAAAA CTGATACAGT 720 

TTGGGGTGTT GCAGGTATTC ATGGTGCTTG GAATTTTGCT CAGGGTAATC TCTTTGGGAT 780 

TTTAGTTAGT GGTCAACCGT CAGaACGTCT CTGATGACCT TTTTACCACA AGGCAATCAA 840 

GATTGGCTAT CAGGTGGTTC TTTTGGCATA GAAGGTTCCA TTATGACAAG TCTGGTATTA 900 

CTACTGCTGA TTGTCTATCT TGCTAATAAA TTAAAGAAAG AAAATGAAAG GATGTGACTT 960 

CGGTCCGTCC TTTTCTTCGT GAAAATACTA TAAGTATGCT AAAATAGGAA TAGCACATGG 1020 

AGAGAGGATT CTTATGATCA ATCACATTAC AGATAATCAA TTTAAACTAG TATCAAAATA 1080 

TCAACCATCA GGAGATCAAC CCCAAGCTAT CGAGCAGTTG GTGGATAACA TTGAGGGGGG 1140 

AGAAAAAGCT CAGATTCTGA TGGGGGCGAC TGGAACAGGG AAGACCTATA CTATGAGTCA 1200 

GGTCATTTCT AAAGTCAATA AACCAACTCT GGTTATTGCC CACAATAAAA CTCTGGCTGG 12 60 

TCAGCTCTAT GGGGAGTTTA AGGAATTTTT CCCTGAAAAT GCAGTTGAGT ATTTCGTATC 1320 

CTACTATGAT TATTACCAGC CAGAGGCCTA TGTCCCTTCT AGCGATACCT ATATTGAGAA 1380 

GGATAGTTCT GTCAATGACG AGATTGACAA GCTTCGCCAC TCAGCTACCT CAGCCCTTTT 1440 

GGAGCGTAAT GATCTTATTG TCGTGGCCTC AGTCTCTTGT ATCTATGGTT TGGGTTCGCC 1500 

CAAGGAATAC GCTGATAGTG TCGTTAGTCT CCGTCCTGGT CTAGAGATTT CTCGTGATAA 1560 

ACTCTTGAAT GACTTGGTCG ATATTCAGTT TGAACGTAAT GATATTGATT TCCAACGCGG 1620 

AAGATTTCGC GTTCGTGGGG ATGTGGTAGA GATTTTCCCA GCTTCCCGAG ATGAACATGC 1680 

CTTTCGAGTA GAATTTTTTG GAGACGAAAT TGACCGTATT CGTGAAGTTG AGGCTCTGAC 1740 

AGGTCAGGTG TTGGGAGAAG TGGATCATTT AGCGATTTTC CCAGCGACAC ACTTTGTGAC 1800 

CAATGACGAC CACATGGAAG TTGCCATTGC AAAGATTCAG GCCGAGTTGG AAGAACAATT 1860 

AGCTGTCTTT GAAAAGGAAG GTAAACTGCT TGAAGCCCAG CGTTTGAAAC AGCGGACAGA . 1920 
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GTATGATATC 


GAAATGTTGC 


GTGAGATGGG 


CTATACCAAT 


GGGGTTGAAA 


ATTATTCTCG 


1980 


CCACATGGAT 


GGACGGAGCG 


AAGGAGAGCC 


TCCTTATACG 


CTTCTCGACT 


TCTTCCCAGA 


2040 


TGATTTCTTG 


ATTATGATTG 


ACGAGAGTCA 


TATGACCATA 


GGGCAAATCA 


AGGGCATGTA 


2100 


CAATGGAGAC 


CGTTCGCGTA 


AAGAAATGCT 


GGTTAATTAT 


GGTTTCCGTT 


TGCCGTCTGC 


2160 


TTTGGACAAT 


CGTCCTCTCC 


GTCGGGAGGA 


GTTTGAGAGT 


CACGTTCATC 


AGATTGTTTA 


2220 


CGTTTCAGCG 


ACACCTGGTG 


ACTATGAAAA 


TGAACAGACC 


GAGACAGTGA 


TTGAGCAAAT 


2280 


CATTCGTCCA 


ACGGGACTCT 


TGGATCCAGA 


GGTGGAAGTC 


CGTCCGACTA 


TGGGACAGAT 


2340 


TGATGACCTC 


TTGGGTGAAA 


TCAATGCCCG 


CGTTGAAAAA 


AATGAGCGTA 


CCTTTATCAC 


2400 


AACTTTGACC 


AAGAAAATGG 


CAGAGGATTT 


GACCGACTAC 


TTCAAGGAAA 


TGGGTATCAA 


2460 


GGTCAAGTAC 


ATGCACTCGG 


ATATCAAGAC 


CTTGGAACGG 


ACGGAGATTA 


TCCGTGACCT 


2520 


GCGCTTGGCT 


GTCTTTCATG 


TCTTGGTCGG 


AATTAACCTG 


CTCCGTGAAG 


GAATTGACGT 


2580 


TCCTGAAGTG 


AGCCTCGTAG 


CTATTCTCGA 


TGCTGACAAG 


GAAGCTTTCC 


TTCGCAACGA 


2640 


ACGTGGACTC 


ATCCAGACCA 


TTGGACGTGC 


TGCACGTAAT 


AGCGAAGGTC 


ATGTTATCAT 


2700 


GTATGCGGAC 


ACGGTTACCC 


AGTCTATGCA 


ACGTGCTATC 


GATGAAACTG 


CCCGCCGTCG 


2760 


CAAAATCCAG 


ATGGCCTATA 


ATGAAGAACA 


TGGTATCGTT 


CCACAAACCA 


TCAAGAAAGA 


2820 


AATCCGTGAC 


TTGATTGCTG 


TGACCAAGGC 


AGTTGCTAAG 


GAAGAAGACA 


AGGAAGTCGA 


2880 


TATCAATAGC 


CTCAACAAAC 


AAGAGCGCAA 


AGAACTAGTC 


AAAAAGCTTG 


AGAAACAAAT 


2940 


GCAAGAAGCA 


GTTGAAGTGC 


TTGACTTTGA 


ACTAGCAGCT 


CAGATTCGTG 


ATATGATGCT 


3000 


GGAAGTCAAG 


GCCTTGGATT 


AGGGGAATAG 


TATGATTTAT 


TTAAGAAAGT 


TAAAGAAAGA 


3060 


AGATTTGATG 


TCTTTATGGG 


AAATGGCTTA 


TTCACAGCTT 


AATCCAGTTT 


GGAAACAGTA 


3120 


TGATGCTCCC 


TATTATGATG 


ATTATCAGTA 


TTTTTCAAAT 


TTTAAAGAAT 


TCGAACTACA 


3180 


AAAATCAGAA 


TCCATTTTAA 


GCAACTCAAA 


TCGCCTTGGT 


ATTTTTGTTG 


ATGATAAACT 


3240 


AGTTGGGACT 


GTTTCGCGTT 


ATTGGGTATG 


TAAAGAAACA 


AGATGGATGG 


AATTGGGAAT 


3300 


TGGTATTTAT 


GATAAAAAAT 


TCTGGAACAC 


TGGTATTGGG 


AAAGTTGCTA 


TGTTGCAGTG 


3360 


GATAGATAGG 


ACGTTTCAGG 


ATTACTTGGA 


GTTGGAGCAT 


CTGGGTTTGA 


CAACTTGGTC 


3420 


AGGAAATATT 


GGTATGATGA 


AACTTGCTGA 


AAAATTAAGA 


ATGAAAAAAG 


AAGCTCATAT 


3480 


TCCAAAAGTT 


CGTTATTATC 


AAGGTAAATA 


TTTTGACAGT 


ATTAAATATG 


GTATTTTGAG 


3S40 


AGAAGACTGG 


GAGAAAATAA 


ATGACGGTTA 


TTATCAAATC 


AATGGAAACT 


CCTGAAGAGA 


3600 


TAGAAGGTAA 


ATCCTTCGTT 


CACTGGCAAA 


CGTGGAGAGA 


GGCTTATGAT 


GACCTTTTGC 


3660 
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CTGCGGAATT TCACGAGACA ATGACATTAG AAAGATCTCG ACTCTTTACT CAAAAGTATC 3720 

CAGAAAATAC ATTGATTGCG ATGGATGGTG TGAAGATAGT TCGTTTTATA AGTTATGGCA 3780 

ACTGTCGTGA TCAGACTATT CAAGCTGGTG AAATTATTGC TTTATATGTT TTAAAAGACT 3840 

ATTATGGAAA AGGAATCGCA CAAAAGTTAG TGAAAGCAGC TTTGACTGAT CTTAATCATT 3 900 

TTTCTGAAAT TTTCTTATGG GTATTGAAAG ATAACAAGCG CCCCATTGCT TTCTATCAAA 3960 

AAATGGGTTT TACTTTTGAT GGACAAGAAA AAATACTTGA ACTTGGAAAG CCTATAAAGG 4020 

AAAAACGGAT GGTATTCTAT TCTAAATAAT TCTCAAAAGT AAAAGCTAAT ATGGTACCAA 4080 

GTCTGAAAAT TTAATAAATT AGAAAGCGAG TAAATTTATG TCCCGTTCCC AAT7AACAAT 4140 

TTTAACAAAT ATCTGTCTGA TTGAAGACCT CGAAAC7CAG CGCGTGGTGA TCCACTATCG 4200 

CGCCCCTGAA AACAATCGCT GGTCTGGTTA TGCCTTTCCT GGAGGTCATG TAGAAAATGA 4 260 

TGAGGCTTTT GCGGAGTCTG TCATTCGTGA AATCTACGAA GAAACAGGGT TGACTATCCA 4320 

AAATCCTCAA CTTGTCGGCA TTAAAAATTG GCCACTAGAT ACAGGTGGGC GCTATATTGT 4380 

CATTTGTTAT AACGCGACTG AGTTCTCTCC TACCCTTCAA TCTTCAGAAG AGCGAGAAGT 4440 

TTCTTGGGTG CAAAAAGACC AGATTCCAAA CTTAAATCTG GCCTATGATA TGCTACCATT 4500 

GATGGAAATG ATGGAAGCTC CCGACAAGTC AGACTTTTTC TACCCTCGCC CTACAGAAGA 4560 

CGATTGGGAA AAGAAAATCT TCTAGTCTTT TACTAAATAA CCTAGCTGAT CCAAGGCCTC 4620 

CTCGATATAG TGGAGGTCTT GTTCTGTCTC GGCTTCAACT AGGTGATAAT GAATACCATC 4680 

TGTTAACTCA GAAATTGGCT TAAAGTCAGA ACGTTCAACT TGTTCTAGAA AATGTTGCAC 4740 

GTCGCGGCGA CAGGTCAGTT TTAGTAAGGT TTCAATCTCT CCATAAACAG GATGATCAAT 4800 

CAAGATATTT TGAACGCGAC CACCATTATC TACGATAGCA AGTAATTCTC GTCCAATTTC 4860 

TTCAACTTCA TGCTTGACCT TAAATAATTT GTGATGATAA GTATTTGCAT TAGCATCTTT 4920 

ATAGATATAA CCACGATTGG TAGATAGAAT TGGAGATCCA TCAGCTCTTA AAATTGCAAT 4980 

ATCTTGAACA ATAACTTGTC GAGTGACATG AAAGTGC7CA 5020 

(2) INFORMATION FOR SEQ ID NO: 14 3: 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4965 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
AAAAAGTGGC AATCCATTGA TTGGCCACTT CATTTAGAGA ATTATCGTCT CGCCCTTGAA 
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GAAGAAGCTC GTGTAGTACT TCAGTTACTC CTATCGCTAC AACTACTACT TGAACTGCTG 120 

GACCTGGATG CAGTTCGTAC ACTCCCCACA ATACTAGACC AACCATTCTC ATAATCCGCA 180 

TCACTTCCGC CAATAGCAAA GCGATAACTT GTCGCTGGCG CTCCTGACTT ATTAGCCCAA 2 40 

TAGCTGGTAA CAGTCGAACC TGTGACCTCT ACTTCTTTTC CTTCAACAGA AACCTTCTCT 300 

GGTTTTTGAC CTGTTGATTT CAAGACTTCC GATTTCACTA CACTAGGATC TAAACCAAAG 360 

CGCTCGTTCC ~CCC AAATGCT ~ TGGGGAAGCT TGCTiG AATW ~^T^A^CA^ ATG AGCCATG 7a0~ 

TAATTAGAGT TATTAGAATA ACCTGCTCTA CGTGACAATG AATGATTATC ATCATGCCCA 480 

ATCCAGCCAC CTAGGGTTAA TCTAGGTGTC GAAAGCATGA GCCACATATT TTCGTCTTGG 540 

TTGGTTGTAC CAGTCTTCCC AATCCAATCT GCATTAGCCA GAGTAGGATT TAAAGAAGTC 600 

AGCTTAGACT TGAAGGTTGT TGTCACACGA GAGGATAGAA CTTCTCGTAG CAATCCCTGC 660 

ATAATCGTCG CAGTAGCTTT TGAATAGACT TGAACCGGTT TATCCTGATA CTCATACACC 720 

ACTCTACCAT CTGCTGCTTC AATCTTTGAA ATCACATGCT TCTGATGATA AACTCCATTA 780 

TTAGCTAAGG TCTGATAGCC ATTGGTATGC TGGGCAACTG TGACTTCAAT ACCACCACCC 840 

ATTGGCAAGC TCTCAATACC GTACTCAGGA ATCTCGTAAC CCATCTTTTC CATATAACCC 900 ' 

TTGACATCAA CACCCTTTTC ACGGAGCATA CGATAGGTCC AGTAAGCAGG GATATTCCAT 960 ' 

GAATAGTTCA GAGCTTCTCC CAAGGTCATC ATTCCTGTTC CCTTGCTATT AGCATACATA 1020 

ATCGGATTGC CATTAGCAAA GTTTGTTGGA TAGTTAGATA GAATCGTTTC ACTTCCCATC 1080 

AAGCCCTGGT CAATAGCAAT ACCGTAGGCC AGCAAGGGCT TGGTAGTAGA AGCTGGCGAA 1140 

CGTTTGGTAT CAAAGGCATG ATTATTTTGA TTTTCTTGAT AATTACGACC ACCTACAAAG 1200 ' 

CCTAGAATAG CACCTGTTTG GTTATCCATC AAGACATTCC CTACTTCTAC ACGACCTGTT 12 60 

CCATCGTCTA AAAGATAGCC ATAATCAGCA ACCGCACTTT GCATGGCAGA ATGAATTTTC 1320 

TGATCTATGG TAGTAGTAAT CTTATAACCA CCATTTTCAA TTTCCTTGGC TGCCAAATCT 1380 

CGATAAAACT TCTGAGTTGC CTCATTTTTC AACTCCTTAG CGGAGACATT GTCTCTCTGA 1440 

GCTAGATAGT CATACATACG TTCTTGAGCT TCTGCCAAAG TTGTAAAGTA TAAATAGTCT 1500 

CGTGAAATTC CTGTAACCGT GCCCGATGGT AAAAAGTCCT GTTTAAGCTC ATAATCCTTG 1560 

TACTGAGAAT ACTCGTCTTT GCTTAATGCA CCTGTACGAT ACATACTGTA AAGAACTGCC 1620 

TTAGCCCGTC TTAAGCCAAT TTCTAGGTCT TCATCACTCT TCAACTCCCC AGTATTTTCA 1680 

TAAGGAGAGT AAGTAATGGG ACTCTGTGGA AGTCCTGCTA AAAATGCTGC TTGAGGAACA 1740 

GTCAACTGAC TGGCATCTAC ACCGAAAATT CCCTCAGCTG CTTGCCGAGC CCCTGCAATA 1800 
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TTCTGTCCCT TATTATTTCG GCCAAAGGGA GCCACATTGA GATAGGTCGT TAAAATCTCA I860 

TCTTTATTCA TGGCGCGTTC CAAGGCAAGA GCATCCACAA TCTCTGCCGC CTTACGAGCC 1920 

AAGGTCGGCG CATCCCCAAC CACCTGCTGT TTAATTAGTT GCTGGGTCAA GGTTGAACCC 1980 

CCACTAGAGG AACCCAAACC TACAAATTTC CCCAAGGTCG CACGAATCAC CGCCTTGGGT 2040 

ACTACACCCT TATGTTCTTT AAAGTGTTCA TCTTCTGTCG CAATGATAGC CTTCTTCAGA 2100 

TTTTCCGAAA TTTGCTCAGA TGAGATAGAA GTCCGCAACA AATCACTCTC TATGGAAGCA 2160 

ATCACCGTCC CGTCCGAATA GGTAATCTCT GAAATAGAAG ACATGTCCTT GACCTGATTC 2220 

ACCAATTCTT CTGTCTGAGG CACCCGAACC TTGTCAAATA AGGCCACTCC GTATCCCAAA 2280 

GCAATCCCAG CTCCCAACAT TCCTCCTAGA AAACCGAGTA CAAAGAGTAA GTTAAATAAG 2340 

GCTTTTATAC TCAGTAAAAT AGCTGGGAAA ATGACTGACT TATCTAAGGT TTTAGATTTT 2400 

TTGGTACTTG AACCTTTCTT GCCAGGTCTA GCTGATTTTT TATTTTT T TG TTTTTGCTGG 2460 

AAAAATTCCA GCATTTTTCG TTTTAATTCA TTTAATTGAT TTTCCATGGA TTTCCTCACT 2520 

TTATCTATTA TACCACAAAA GGGAAATTTT CAATAAAATA GCCACTTTCT TCCCTATTCT 2580 

GCTAGGCTAT TGCCCAAGTT TGTGATACAA TAGGTAGAAA CAATAATTTT AAAAAGGAGA 2640 

AAAAACACAT GCACATTTTT GATGAGCTAA AAGACCGTCG TTTGATATTT CAAACGACTG 2700 

ATGAAGAAGC TTTGCGTAAA GCCCTAGAAG AAGGTCAAGT TTCTTATTAT ACTGGCTACG 2760 

ATCCAACTGC TGACAGCCTT CACCTAGGCC ACCTTGTCGC AATCTTGACA AGTCGTCGCT 2820 

TGCAACTAGC AGGTCACAAA CCTTATGCGC TCGTTGGCGG TGCTACAGGT CTCATCGGAG 2880 

ATCCGTCCTT CAAAGATGCT GAACGTAGTC TCCAAACAAA AGACACAGTA GATGGCTGGG 2940 

TCAAGTCTAT CCAAGGACAA CTTTCTCGTT TTCTTGACTT TGAAAATGGC GAAAACAAGG 3000 

CTGTCATGGT CAACAACTAC GACTGGTTTG GC AG CATC AG CTTCATTGAC TTCCTCCGTG 3060 

ATATTGGAAA ATACTTCACG GTCAACTACA TGATGACTAA GGAATCTGTT AAAAAACGGA 3120 

TCGAAACAGG AATTTCTTAC ACTGAGTTCG CTTACCAAAT CATGCAAGGG TATGACTTCT 3180 

TCGTCCTTAA CCAAGACCAT AATGTCACTC TTCAAATCGG TCGTTCTGAC CAGTGGGGAA 3240 

ATATGACACC TGGTACCGAA TTGCTTCGTC GTAAGGCGGA CAAGACTGGT CACGTTATCA 3300 

CTGTTCCACT AATCACAGAT GCAACTGGTA AGAAATTTGG TAAATCAGAA GGAAATGCCG 3360 

TCTGGCTCAA TCCCGAAAAC ACTTCTCCAT ACGAAATGTA CCAATTCTGG ATGAACGTGA 3420 

TGGACGCTGA CGCTGTTCGC TTCTTGAAAA TCTTTACTTT CTTGTCACTT GATGAGATTG 34 80 

AAGATATTCG TAAACAATTT GAAGCAGCGC CACACGAACG CTTGGCTCAA AAAGTCTTGG 3540 

CTCGTGAAGT TGTTACACTT GTTCACGGAG AAGAAGCCTA CAAAGAAGCA CTTAACATCA 3600 
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CTCAGCAACT CTTTGCAGGA AACATCAAAA ACCTTTCTGT CAAAGAGCTC AAACAAGGAC 3660 
TTCGTGCTCT GCCCAACTAC CAAGTACAGG CAGACGAAAA CAACAATATC GTGGAACTCC 37 20 

TCGTCTCATC TGGTATACTT AACTCAAAAC GCCAAGCCCG TGAAGACGTC CAAAACGGAG 3780 
CCATCTACGT AAACGGCGAC CGCATCCAAG AGCTTGACTA TGTCTTGAGT GACGCTGATA 3840 
AGTTAGAGAA TGAACTGACT GTTATCCGTC GTGGGAAGAA AAAATACTTT GTATTGACTT 3 900 

AOTAAACTAT^TCAACATTTA " TCT ATAAAC A "AAGG AGTT AA CCTCGAGA^ G<TTAACTC^CT 396 0 ~ ~ " 

TTTTGCTGTT AATAACTCTC ATCTATCTAT TTTTAATAGA CAGGCTACGC AGGACAATGC 4020 

GCAAGGTTGT TAGATTATGT AAGATAGAGA GATTTGAAGG ACTGAACCAA TTAAATAAGC 4080 

CAAAGCCAAT CAAACTACTA TTTACGACAA CGGTATCCTG AATATTTTTC TTGATGAGTG 4140 

TTTGCAAAGA TGATGATAAC GAATCCAACT CTTGGAAGAA ATCCAAACGA TTATCTAACA 4200 

ATAAGATATC ACTCATCTGC TTAGAAATAT CTGCACTCTC ATTCATCACC ACACCGATAT 4260 

CTCATAGAGT TAAAGCCGCT GAGTCATTCA ATCCATCTCC AACCATCAAA ATAGTGTGAC 4320 

CTGCTTTCTG CAGTTTCTCT ACTAACTCAA ATTTCCCATC AGGTTTCAAG TCTGTATAGA 4 380 

CCTGATCAAA GGGCAAATCT TTGACTAATT CCTCTGTCCT AATCAAGGTC TCTCCTGTTG 4440 

CCAGAATCAA TTTTTTCCCC TGTGCCTTAA GTTTATCCAA GGCTGTTTTT GCTTCTTTTC 4500 

TCAAAGGAGT ATGAATGCAG AACATTCCAA TCAATTCATT TTGATAAGCC AAGAATAAGA 4560 

GATTGTAGTG ACTCTTGTAC TCTTCAATTA AACCATTTTG TTCTGAACTG ATATGAATCT 4620 

GCTCATCCTG CATCAAGACA TAATTCCCAA TAAGAACTGG TTGGCCATCT ATATGAGATT 4680 

TGATCCCCTT GCTTGCGATA TATTGGACTT TCCCATGCAT TTCCTCATGT TCAATTCCCT 4740 

CTATCTCAGC TTGCTTGACG ATGGCATTAG CAATAGGATG ATAAATGTCT TCCTCAAGAC 4800 

AGGCACTGAT TCTGAGAATA TCTTCCTCAC TATAGTCTCC AAAAGGTAAC ACCTTTTCAA 4860 

CTATAGGATA ACTAGTTGTG ATTGTTCCTG TCTTATCAAA CAAGAAAGTA TCAACTTCCA 4 920 ' 

GATATTTCTC CCTGTTGTGG CCTCTGGCTG TCATCTCTGT GCTGG 4965 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
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CAGGGGCCTA TTACGTGACA ATTCAATGTA GGCTGTCGCT ACTTGCGCCA AAACAAGGAT 60 

TCGATAATGT CGGATGATAC TAACGATTAA ACCGAGCAGA AAGGATCCCA AAATTCCCCA 120 

AACTGCAATA TGCAAGGTCA GAAAGAATGC CTTTTGATAT AGTGGTAGAT ATTGTTCAAC 180 

AATGGATCAA TCCAAAAATA GAACCTCCCA TCTAGAAATA AT AC AG TT AT TGTAGCACTT 240 

AAAATCTTCT TTGGATAATA TCTATTTTTT ATTGCCGTTA TAAGGATTTT TATCATAGAC 300 

ATAAAATTTC TGAAATTTCC AAACAAAATA TTTTAAAAGT TTTGAAAAAG AGTTAAGATA 3 60 

TTTTTGTAAT ACACAAAGTA AACGCTTACT TATTAAGGAG GACATTTTAT GTCATACAAA 420 

ACAAGCAATG CAGAAGGTCA TGTAGATTTC ATCAATACCT ATGATTTGGA GCCAATGGCG 4 80 

CAACAAGTTA TTCCTAAAGC AGCATTTGGC TATATCGCTA GTGGGGCGGG AGATACTTTC 540 

ACTTCTTTCC AGTGATTTTA GCGTCAGGTT CTTTTTACTT TTTAAAGATT ATCCGTGAAT 600 

TTCTTGCTTA TTTATGATAA AATGGGAGTG TCGCAAAAAA TGACTCATCG TATTCAATTT 660 

TGAGTAAAAC TAGGAGGATC CCATGTCTAC AGAACATATG GAAGAACTAA ATGACCAGCA 720 

GATCGTTCGC CGTGAAAAAA TGGCTGCGCT CCGCGAACAA GGAATCGATC CTTTCGGAAA 780 

ACGTTTTGAA CGTACTGCAA ATTCACAAGA ATTAAAAGAT AAATATGCCA ACCTCGATAA 840 

AGAACAATTA CACGATAAAA ACGAAACAGC TACTATCGCA GGACGCTTGA TAACCAAACG 900 

TGGTAAAGGA AAAGTTGGTT TTGCCCACCT TCAAGACCGC GAAGGCCAGA TTCAGATCTA 960 

CGTTCGTAAG GATGCTGTCG GTGAAGAAAA CTACGAAATC TTCAAAAAAG CAGACCTTCG 1020 

TGACTTCCTT GGTGTCGAAG GTGAAGTGAT GCGTACGGAT ATGGGAGAAC TCTCTATCAA 1080 

GGCAACCCAC ATCACACACT TGTCTAAGGC TCTTCGTCCT CTTCCTGAGA AATTCCATGG 1140 

TTTGACAGAC GTTGAAACAA TTTACCGTAA ACCTTACCTT GACTTGATTT CTAATCGTGA 1200 

AAGCTTTGAA CGCTTTGTCA CTCGTTCAAA AATCATCTCT GAAATCCGTC GTTACCTTGA 1260 

CCAAAAAGGA TTCCTTGAAG TGGAAACACC TGTTCTTCAT AATGAAGCCG GTGGTGCTGC 1320 

TGCCCGTCCA TTTATCACCC ACCACAATGC CCAAAACATT GACATGGTGC TTCGTATCGC 1380 

GACTGAGCTT CACTTAAAAC GCCTTATCGT GGGTGGTATG GAACGTGTCT ATGAAATTGG 1440 

CCGTATCTTC CGTAACGAAG GAATGGACGC TACTCATAAC CCTGAGTTCA CTTCTATCGA 1500 

AGTTTACCAA CCTTATGCAG ACTTCCAAGA CATCATGGAC TTGACTGAAG GCATTATCCA 1560 

ACACGCTGCT AAATCAGTCA AAGGTGATGG CCCAGTCAAC TACCAAGGTA CTGAAATCAA 1620 

GATTAACGAA CCATTTAAGC GTGTTCATAT GGTGGATGCT ATCAGAGAAA TTACTGGTGT 1680 

CGATTTCTGG CAAGACATGA CTTTGGAAGA AGCTAAAGCT ATCGCTGCTG AGAAGAAAGT 1740 

TCCAGTTGAG AAACACTACA CTGAGGTTGG TCACATCATC AATGCCTTCT TTGAAGAGTT 1800 
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TGTTGAAGAA ACTTTAATCC AACCAACCTT TGTCTATGGA CATCCAGTAG CTGTATCTCC 1860 

ACTCGCTAAG AAAAATCCTG AAGACCAACG CTTTACTGAC CGTTTCGAGC TCTTTATCAT 1920 

GACTAAGGAG TACGGTAATG CCTTTACTCA GTTGAACGAC CCAATCGACC AACTTAGCCG 1980 

TTTTGAACCC CAAGCTAAAG CCAAAGAACT TGGTGATGAT GAAGCGACAG CAATCGACTA 2040 

TGACTACATT GAAGCTCTTG AATACGGTAT GCCACCAACA GGTGGTTTGG GAATCGGTAT 2100 

- CGACCGTCTC TGCATGCTCC TCACTGATAC AACAACTATC CCTGATCTAT TGCTCTTCCC 2160 

AACAATGAAA TAAATTCTTA TCCTCTGGGT CTTATCAGAG GATTTTTTGA TTCAAAAAGA 2220 

GACTGAATTT AACGAGAAAA TGAAGTGTAG TATATTGAAA TTGAAATAGT ACACTTTGAT 2280 

TTCTAAGACA TTGTTAGAAA TTGGTTTAAA TTCCCTAAGC AATTTGTGCA TGTTTTATTT 2340 

CATTTTACGA TAGTACGCTG AAACTTTTCA AAAAGTACTA GAAATTGACT TGGATTCCCC 2400 

AATTGATTTG TTCAGATTCA CTATAAATAA AAAATTAATA AGTGGGATAG GAAGTTAGCG 24 60 

TCAACTAGGA TAGTATCTTG CTTAAACAGT ATATATGGGA TTGATATAAG TCCATAGGTC 2520 

CTATTAGAGG ATGTTCTGGT GTCTTATTCA CTTGTTTTTT ATAGTATTAG TAGATAGAAT 2580 

CAGCAAATAA AAACCCAAAT CATTCATACC TCTCTCAACT AGATGTAACT TACAAAACCC 2640 

CTGACCTCAT GAGCCACTTT CTTCCTCCTC ATGAGCTCAG TTTTACTTTC TGCTGTTCCA 2700 

GTATCGTTTT TCCTCGCTAG ATTTCCTCAA AAGGGCAGAC TCCTCCCTTG GTGCGTCACA 2760 

CGATTTTTTC ATCTCGACTG TTCTTTAATG CATCATTAAC GACGCTTTTC TTCTAGGTGG 2820 

TTCATAAGGA ACAGGAAGAT TCAGGTTGAC TTTTCTAATC CTAGAATAAA GTGCTGAAAA 2880 

CAATTCGGAA TAGGCATAGA GACTAGACAA TTTGAGGAGC TGCTTGCGTC CTCTTCGAAC 2940 

ACATTTTCCC ACCACGTGAA GAAAAAGATG GCGGAAGCGT TTGATTGTTA AAGTTTGGAA 3000 

GTCACCTCCA GCTAGATGTT TGAGAAAAAG ATAGAGATTG TAGGCGATAC AGCTCATCAT 3060 

CATACGAACT TCGTTTTTGA TTAAGGTTGA ACTATCCGTT TTATCGCCAA AAAATCCCTC 3120 

CTTCATCTCC TTGATGAAAT TCTCGGCTTG ACCACGTCCA CGATAAACCT GAAACTGGTC 3180 

TTGGCTTGTT CCACTCGTCA TATTTGTAAC GAGAGAAATA ACATCGTAGA AC 3232 

(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10711 base pairs 
IB) TYPE: nucleic acid 
(C> STRANDE0NESS: double 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

CCGGAGAAAA TCATGAAAAG TTCAAAACTA TTTGCCCTTC CCGCCGTCAC ATTATTGGCG 60 

GCGACTACTT TAGCTCCATC CTCTGGATCA GCTTCAACCA CTAAAGCTGA GAAGACATTC 120 

TCATACATTT ATGAGACAGA CCCTGATAAC CTCAACTATT TGACAACTGC TAAGGCTGCG 180 

ACACAAATAT TACCAGTAAC GTGGTTGATG GTTTGCTAGA AAATGATCGC TACGGGAACT 240 

TTGTGCCGTC TATGGCTGAG GATTGGTCTG TATCCAAGGA TGGATTGACT TACACTTATA 300 

CTATCCGTAA GGATGCAAAA TGGTATACTT CTGAAGGTGA AGAATACGCG GCAGTCAAAG 3 60 

CTCAAGACTT TGTAACAGGA TTAAAATATG CTGCTGATAA AAAATCAGAT GCTCTTTACC 420 

TTCTTCAACA ATCAATCAAA GGGTTGGATG CCTATGTAAA AGGGGAAATC AAAGATTTCT 480 

CACAAGTAGG AATTAAGGCT CTGGATGAAC AGACAGTTCA GTACACTTTG AACAAACCAG 540 

AAAGCTTCTG GAATTCTAAG ACAACCATGG GTGTGCTTGC GCCAGTTAAT GAAGAGTTTT 600 

TGAATTCAAA AGGAGATGAT TTTGCCAAAG CTACGGATCC AAGTAGTCTC TTGTATAACG 660 

GTCCTTATTT GTTGAAATCC ATTGTGACCA AATCCTCTGT TGAATTTGCG AAAAATCCGA 720 

ACTACTGGGA TAAGGACAAT GTGCATGTTG ACAAAGTTAA ATTGTCATTC TGGGATGGTC 780 

AAGATACCAG CAAACCTGCA GAAAACTTTA AAGATGGTAG CCTTACAGCA GCTCGTCTCT 840 

ATCCAACAAG TGCAAGTTTC GCAGAACTTG AGAAGAGTAT GAAGGACAAT ATTGTCTATA 900 

CTCAACAAGA CTCTATTACG TATCTAGTTG GTACAAATAT TGACCGTCAG TCCTATAAAT 960 

ACACATCTAA GACCAGCGAC GAACAAAAGG CATCGACTAA AAAGGCTCTC TTAAACAAGG 1020 

ATTTCCGTCA GGCTATTGCC TTTGGATTTG ACCGTACAGC CTATGCCTCT CAGTTGAATG 1080 

GACAAACTGG AGCAAGTAAA ATCTTGCGTA ATCTCTTTGT GCCACCAACA TTTGTTCAAG 1140 

CAGATGGTAA AAACTTTGGC GATATGGTCA AAGAGAAATT GGTCACTTAT GGGGATGAAT 1200 

GGAAGGATGT TAATCTTGCA GATTCTCAGG ATGGTCTTTA CAATCCACAA AAAGCCAAGG 1260 

CTGAATTTGC TAAACCTAAA TCAGCCTTAC AAGCAGAAGG AGTCCAATTC C CAATTC ATT 1320 

TGGATATGCC AGTTGACCAA ACAGCAACTA CAAAAGTTCA GCGCGTCCAA TCTATGAAAC 1380 

AATCCTTGGA AGCAACTTTA GGAGCTGATA ATGTCATTAT TCATATTCAA CAACTACAAA 1440 

AAGACGAAGT AAACAATATT ACATATTTTC CTGAAAATGC TGCTGGCGAA GACTGGGATT 1500 

TATCAGATAA TGTCGGTTGG GGTCCAGACT TTGCCGATCC ATCAACCTAC CTTGATATTA 1560 

TCAAACCTTC TGTAGGAGAA AGTACTAAAA CATATTTAGG GTTTGACTCA GGGGAAGATA 1620 

ATGTAGCTGC TAAAAAAGTA GGTCTATATG ACTACGAAAA ATTGGTTACT GAGGCTGGTG 1680 

ATGAGACTAC AGATGTTGCT AAACGCTATG ATAAATACGC TGCAGCCCAA GCTTGGTTGA 1740 
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CAGATAGTGC TTTGATTATT CCAACTACAT CTCGTACAGG GCGTCCAATC TTGTCTAAGA 1800 

TGGTACCATT TACAATACCA TTTGCATTGT CAGGAAATAA AGGTACAAGT GAACCAGTCT 1860 

TGTATAAATA CTTGGAACTT CAAGACAAGG CAGTCACTGT AGATGAATAC CAAAAAGCTC 1920 

AGGAAAAATG GATGAAAGAA AAAGAAGAGT CTAATAAAAA GGCTCAAGAA GATCTCGCAA 1980 

AACATGTGAA ATAACTGTTG CAAAATATAA GAAAGGATTT AGTATTTCCC TTGAATGCTG 2040 

- -AATCCTTTTT TACATTTGTA~AAGAAAGATT CTAAAATGTA CGGACCCCCA AAAGTTGGAG 2100 

CCTCTTTTTG TCAGAATAGA GAAAATTTTT GTTAATTTTA CTTGTTTCCT ATTGCTTTCT 2160 

CAGCTATTAT TTGTTATATT AAAAGTATAA TTATTTTTTA TTTATCAGAC TTAAGCATTG 2220 

CACTTTCAGA GGAAGGAGTA TTTTTTAAAA AGAAAATGTA AACGTTTGCT CAAAAATGAA 2290 

AGGATTTAGA AGTTTATGAA TAAAGGATTA TTTGAAAAAC GTTCTAAATA TAGTATTCGG 23 40 

AAATTTTCAT TAGGTGTTGC TTCTGTTATG ATTGGAGCTG CATTCTTTGG GACAAGTCCG 2400 

GTTCTTGCAG ATAGCGTGCA GTCTGGTTCC ACGGCGAACT TACCAGCTGA TTTACCTACT 24 60 

GCTCTTGCAA CAGCAAAAGA GAATGATGGG CGTGATTTTG AAGCCCCTAA GGTGGGAGAA 2520 

GACCAAGGTT CTCCAGAACT TACAGATGGA CCTAAGACAG AAGAAGAACT ATTAGCACTT 2580 

GAAAAAGAAA AACCGGCTGA AGAAAAACCA AAAGAGGATA AACCTGCAGC TGCTAAACCT 2640 

GAAACACCTA AGACGGTAAC CCCTGAATGG CAAACGGTAG CGAATAAAGA GCAACAGGGA 2700 

ACAGTCACTA TCCGAGAAGA AAAAGGTGTC CGCTACAACC AACTATCCTC AACTGCTCAA 2760 

AATGATAACG CAGGCAAACC AGCCCTGTTT GAAAAGAAGG GCTTGACCGT TGATGCCAAT 2820 

GGAAATGCAA CTGTTGATTT AACCTTCAAA GATGATTCTG AAAAGGGCAA ATCACGCTTT 2880 

GGTGTCTTTT TGAAATTTAA AGATACCAAG AATAATGTTT TTGTCGGTTA TGACAAGGAT 2940 

GGCTGGTTCT GGGAGTATAA ATCTCCAACA ACTAGCACTT GGTATAGAGG TAGTCGTGTT 3000 

GCTGCTCCTG AAACAGGATC AACAAACCGT CTCTCTATCA CTCTCAAGTC AGACGGTCAG 3060 

CTAAATGCCA GCAATAATGA TGTCAATCTC TTTGACACAG TGACTCTACC AGCTGCGGTC 3120 

AATGACCATC TTAAAAATGA GAAGAAGATT CTTCTCAAGG CGGGCTCTTA TGACGATGAC 3180 

CGAACAGTTG TTAGCGTTAA AACGGATAAC CAAGAGGGGG TAAAAACAGA GGATACCCCT 3240 

GCTGAAAAAG AAACAGGTCC TGAAGTTGAT GATAGCAAGG TGACTTATGA CACGATTCAG 3300 

TCTAAGGTCC TCAAAGCAGT GATTGACCAA GCCTTCCCTC GTGTCAAGGA ATACAGCTTG 3360 

AACGGGCATA CTTTGCCAGG ACAGGTGCAA CACTTCAACC AAGTCTTTAT CAATAACCAC 3420 

CGAATCACCC CTGAAGTCAC TTATAAGAAA ATCAATGAGA CAACAGCAGA GTACTTGATG 3480 
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AAGCTTCCCG ATCATCCTCA CTTAATCAAT GCGGAAATGA CAGTACGCTT GCAAGTTGTA 3 540 

GACAATCAAT TGCACTTTGA TGTGACTAAG ATTGTCAACC ACAATCAAGT CACTCCAGGT 3600 
CAAAAGATTG ATGACGAAAG CAAACTACTT TCTTCTATTA GTTTCCTCGG CAATGCTTTA 3660 
GTCTCTGTTT CTAGTAATCA AACTGGTGCT AAGTTTGATG GGGCAACCAT GTCAAACAAT 3720 
ACGCATGTCA GCGGAGATGA TCATATCGAT CTAACCAATC CAATGAAGGA TTTGGCTAAG 3 780 

GGTTACATGT ATGGATTTGT TTCTACAGAT AAGCTTCCTG CTGGTGTTTG GAGTAACTCT 3 840 

CAAAACAGCT ATGGTGGTGG TTCGAATGAC TGGACTCCTT TCACAGCTTA TAAAGAAACA 3 900 

GTCGGAAATG CCAACTATGT AGGAATCCAC AGCTCTGAAT GGCAATGGGA AAAAGCTTAT 3960 
AAGGGCATTG TTTTCCCAGA ATACACGAAG GAACTTCCAA GTGCTAAGGT TCTTATCACT 4020 

GAAGATGCCA ATGCAGACAA GAACGTTGAT TGGCAAGATG GTCCCATTGC TTATCGTAGC 4080 

ATTATGAACA ATCCTCAAGG TTGGGAAAAA GTTAAGGATA TCACAGCTTA CCGTATCGCG 4140 

ATGAACTTTG GTTCTCAACC ACAAAACCCA TTCCTTATGA CCTTGGATGG TATCAAGAAA 4200 

ATCAATCTCC ATACAGATGG TCTTGGGCAA GGTGTTCTCC TTAAAGGATA TGGTAGCGAA 42 60 

GGCCATGACT CTGCTCACTT GAACTATGCT GATATTGGTA AGCGTATCGG TGGTGTCGAA 4 320 

GACTTCAAGA CCCTAATTGA GAAGGCTAAG AAATATGGAG CTCATCTACG TATCCACGTT 4380 

AACGCTTCAG AAACTTATCC TGAGTCTAAA TACTTCAATG AAAAAATTCT CCGTAAGAAT 4440 

CCAGATGGAA GCTATAGCTA TGGTTGGAAC TGGCTAGATC AAGGTATCAA CATTGATGCT 4500 

GCCTATGACC TAGCTCATGG TCGTTTGGCA CGTTGGGAAG ATTTGAAGAA AAAACTTGGT 4560 

GACGGTCTCG ACTTTATCTA TGTGGACGTT TGGGGTAATG GTCAATCAGG TGATAACGCT 4 620 

GCCTGGGCTA CCCACGTTCT TGCTAAAGAA ATTAACAAAC AAGGCTGGCG CTTTGCGATC 4 680 

GAGTGGGGCC ATGGTGGTGA GTACGACTCT ACCTTCCATC ACTGGGCAGC TGACTTGACC 4740 

TACGGTGCCT ACACCAATAA AGGTATCAAC AGTGCCATCA CCCGCTTTAT CCGTAACCAC 4800 

CAAAAAGATG CTTGGGTAGG GGACTACAGA AGTTATGGTG GTGCAGCCAA CTATCCACTG 4860 

CTAGGTGGCT ACAGCATGAA AGACTTTGAA GGCTGGCAGG GAAGAAGTGA CTACAATGCC 4920 

TATGTAACCA ACTTATTTGC CCATGACGTC ATGACTAAGT ACTTCCAACA CTTCACTGTA 4980 

AGTAAATGGG AAAATGGTAC ACCGGTGACT ATGACCGATA ACGGTAGCAC CTATAAATGG 5040 

ACTCCAGAAA TGCGAGTGGA ATTGGTAGAT GCTGACAATA ATAAAGTAGT TGTAACTCGT 5100 

AAGTCAAATG ATGTCAATAG TCCACAATAT CGCGAACGTA CAGTAACGCT CAACGGACGT 5160 

GTCATCCAAG ATGCTTCAGC TTACTTGACT CCTTGGAACT GGGATGCAAA TGGTAAGAAA 5220 

CTTTCTACTG ATAAGGAAAA GATGTACTAC TTCAATACGC ACGCCGGTGC AACAACTTGG 5280 
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ACCCTTCCAA GCGATTGGGC AAAGAGCAAG GTTTACCTTT ACAAGCTAAC TGACCAAGGT 5340 

AAGACAGAAG AGCAAGAACT AACTGTAAAA GATGGTAAAA TTACCCTACA TCTTCTAGCA 5400 

AATCAACCAT ACGTTCTCTA TCGTTCGAAA CAAACTAATC CTGAAATGTC ATGGAGTGAA 54 60 

GCCATGCACA TCTATGACCA AGGATTTAAT AGCGGTACCT TGAAACATTG GACCATTTCA 5520 

GGCGATGCTT CTAAGGCAGA AATTGTCAAG TCTCAAGGGG CAAACGATAT GCTTCGTATT 5580 

CAAGGAAACA AAGAAAAAGT TAGTCTCACT^CAGAAATTAA CTGGCTTGAA ACCAAATACC 5640 

AAGTATGCCG TTTATGTTCG TGTAGATAAC CGTAGTAATG CCAAGGCAAG TATCACTGTG 5700 

AATACTGGTG AAAAAGAAGT GACTACTTAT ACCAATAAGT CTCTCGCGCT CAACTATGTT 5760 

AAGGCCTACG CCCACAATAC ACGTCGTGAC AATGCTACAG TTGACGATAC AAGTTACTTC 5820 

CAAAACATGT ACGCCTTCTT TACAACTGGA GCGGACGTCT CAAATGTTAC TCTGACATTG 5880 

AGTCGTGAAG CTGGTGATCA AGCAACTTAC TTTGATGAAA TTCGTACCTT TGAAAACAAT 5940 

TCAAGCATGT ACGGAGACAA GCATGATACA GGTAAAGGCA CCTTCAAGCA AGACTTTGAA 6000 

AATGTTGCTC AGGGTATCTT CCCATTTGTA GTGGGTGGTG TCGAAGGTGT TGAAGATAAC 6060 

CGCACTCACT TGTCTGAAAA ACACAATCCA TATACACAAC GTGGTTGGAA TGGTAAGAAA 6120 

GTCGATGATG TTATCGAAGG AAATTGGTCA CTCAAGACAA ATGGACTAGT GAGCCGTCGT 6180 

AACTTGGTTT ACCAAACCAT CCCACAAAAC TTCCGTTTTG AAGCAGGTAA GACCTACCGT 6240 

GTAACCTTTG AATACGAAGC AGGATCAGAC AATACCTATG CTTTTGTAGT CGGTAAGGGA 6300 

GAATTCCAGT CAGGTCGTCG TGGTACTCAA GCAAGCAACT TGGAAATGCA TGAATTGCCA 63 60 

AATACTTGGA CAGATTCTAA GAAAGCCAAG AAGGCAACCT TCCTTGTGAC AGGTGCAGAA 6420 

ACAGGCGATA CTTGGGTAGG TATCTACTCA ACTGGAAATG CAAGTAATAC TCGTGGTGAT 6480 

TCTGGTGGAA ATGCCAACTT CCCTGCTTAT AACGACTTCA TGATGGATAA TCTTCAAATC 6540 

GAAGAAATTA CCCTAACAGG TAAGATGTTG ACAGAAAATG CTCTGAAGAA CTACTTGCCA 6600 

ACGGTTGCCA TGACTAACTA CACCAAAGAG TCTATGGATG CTTTGAAAGA GGCGGTCTTT 6660 

AACCTCAGTC AGGCCGATGA TGATATCAGT GTGGAAGAAG CGCGTGCAGA GATTGCCAAG 6720 

ATTGAAGCTT TGAAGAATGC TTTGGTTCAG AAGAAGACGG CTTTGGTAGC AGATGACTTT 6780 

GCAAGTCTTA CAGCTCCTGC TCAGGCTCAA GAAGGTCTTG CAAATGCCTT TGATGGCAAT 6840 

GTGTCTAGTC TATGGCATAC ATCTTGGAAT GGTGGAGATG TAGGCAAGCC TGCAACTATG 6900 

GTCTTGAAAG AACCAACTGA AATCACAGGA CTTCGCTATG TTCCGCCTGG ATCAGGTTCA 6960 

AATGGTAACT TGCGAGATGT GAAACTTGTT GTGACAGATG AGTCTGGCAA GGAGCATACC 7020 
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TTTACTGCAA CTGATTGGCC AAATAACAAC AAACCAAAAG ATATTGACTT TGGTAAGACA 7080 

ATCAAGGCTA AGAAAATTGT CCTTACTGGT ACCAAGACAT ACGGACATGG TGGAGATAAA 7140 

TACCAATCTG CAGCGGAACT TATCTTTACT CGTCCACAGG TAGCAGAAAC ACCTCTTGAC 7200 

TTCTCAGGCT ATGAACCAGC TTTGGTTAAG GCTCAGAAAT TAACAGACAA AGACAATCAA 7260 

GAGGAAGTAG CTAGCGTTCA GGCAAGCATG AAATATGCGA CGGATAACCA TCTCTTGACG 7320 

GAAAGAATGG TGGAATACTT TGCAGATTAT CTCAACCAAT TAAAAGATTC TGCTACGAAA 7380 

CCAGATGCTC CAACTGTAGA GAAACCTGAG TTTAAACTTA GATCTTTAGC TTCCGAGCAA 7440 

GGTAAGACGC CAGATTATAA GCAAGAAATA GCTAGACCAG AAACACCTGA ACAAATCTTG 7500 

CCAGCAACAG GTGAGAGTCA ATCTGACACA GCCCTCATCC TAGCAAGTGT TAGTCTAGCC 7560 

CTATCTGCTC TCTTTGTAGT AAAAACGAAG AAAGACTAGT ATTTAGTAAA ACCTCTTAAC 7620 

AAGATTACGG AAGCAGTCTC TATCTTTTCC AATGAGGTTT ATAGTACAGA AAAAGCCTGA 7680 

GAAGATCTCT TCTCAGGCTT TTGTTAAGCA CATAAATACA ATACTGCTAT GACAAAATCA 7740 

CCCAGAAAAA TCTGGGTGAT AAATGTTATG GTTGTGCTGG TTGAGGATTC TGATTTPGTT 7800 

GATCAGGGGT TGTATTTGAT TGTTGCGTAT TATTGTTAGG ATTGGTAGTC CTACTATTAT 7860 

TTGTGCTTCG AGTGGTTGAG CTAGACTGTG AAGTTGAACT ATCTGATGAT GAGCTTGAAC 7920 

TTTCAGTTGA TGGGGGTTGT TGTGGAGCAG GTGAGTTCCA CGTAGAACGA GCACCATTTT 7980 

TAAATACGAA TTCTCCATTT CTGTAGAGCC CCTCTGGTAT ATTCCAATCT TCTGGATTGC 8040 

TTCCTTCAGA CAGGTAGGTC ATCATAGAGC GGTAAACTTT GGCAGCGACC CTAAGGCCAT 8100 

TCqCTACAAG TGGTGTCAGA CGGTTAGAAT AGCCTGTCCA TACAGCCATT GAATATTTAC 8160 

GCGTATAGCC AGCAAATAGT TCATCAGGTG CTACAAATTG AGAGGTCTTG ATGTGGTTTT 8220 

CAATTTCCTC GTCTGTATAC TTAGAGGTTC CTGTTTTACC AGCCTGAGGG AGCCAAGCAA 8280 

GATAGGCATT TCGTCCAGTT CCATAAGTCA AGACTGTTTT CATCATGTCG GTCATCATAT 8340 

AGGCTGTCGT TTCCTTCATG GCACGAGTTC CGACATTAGA GAACTCTTTT TCACTCCCAT 8400 

CACTAAAGAC GACTTTATGG ATATACATTG GTTTATAGTA AGTTCCACCA TTTGCAAAGG 8460 

CAGCGTAAGC AGCAGCCATC TTTTCACTAC TTGCTCCATA TTTTTTGTCT GATTCGGTTG 8520 

TGTTACTTGA AATGGCATTT GAGTAGTGAA TACTTGGGTA CTCGATTCCT AGACCATTTA 8580 

GGAAACTCTT GGCGCGGTTG AGTCCGACCT TGTTTAGAGT TTCCACGGCT GGGACGTTTC 8640 

GCGATTGTTG CAGGGCGTAT TGCAAGGTGA TGTTGCCAAA GTAGCCCCTA TCCCAGTTAT 8700 

AAACAGGAGT ATTTGTCCCA GGGTAGTTAT AGGGCTCATC GTGAACGATA GTAGCACTTG 8760 

AATCGTACAC ACCGTACTCC AAGGCAGGAG CATAGTCTGT GATCGGTTTC ATAGTTGATC 8820 
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CCCAGTCGCG CTTTGTTTCT ACTGCTTGGT TAATTCCGAA GGAAACATTA CTTGACTGAT 
GGCGTGCTCC TAGCTGGGCA ATGACTTTAC CGTTAGAAAC ATCAACAATG GTAGAAGCGA 
CTTGCAATTC ATCCTCTGGA TAGGCAACGT ATTCGTCTGT ATTGTAAATA TCCCACAGAT 
GTTTTTGAGC TTCTTGGTCT ACATTTGTGT AGACATCCAT CCCAGTTGTG AGTAGGTTAT 
AGCCTGTTTC TTCTTCAACT TGATTGATGA CTTCCTTGAG GTAATTATCC ATGTAAGCAG 
GGTAATTACT TGCTGATTTG AGACTTTGTA GTCCATCAGT AATTGGTGTA TTGACTGCTT 
TCTCATACTG TTCAGCAGAG ATGTAGCCTT GATTTTTCAT TTCAGATAAG ACCAAGTTTC 
CCCGCTCTTG GGCTGCTTCT GGATGTGAAT AGGGGTCATA TTGGTTTGGT GCCTGAGGCA 
TTCCAGCCAG CAAGGCTAAC TGAGGTAAAC TTAAATTATT GAGGTCTTTA CCATAGTAGT 
TTTGAGCTGC TGTCTGCATT CCATACTTCC CATTAGACAT GTAGACCTTA TTTATATAGT 
AGGTCAAGAT TTCTTGCTTG GTTGCTTTTT GTTCTAACTG AATCGCTAAC CAAGCTTCCT 
GAGCCTTACG AGAAATAGTC TGGTCGGAAG TCGAAGTTGA AAAGTAAGTC AACTTAATCA 
ACTGTTGGGT GAGAGTTGAT CCACCTTGGA GGGAATTGCT TTGCAGATTG CGCAAGAAAG 
CTCCCAGGAT ACGGATGGTA TCAATCCCCC TGTGGTCGAA GAAGCGATGG TCTTCGATAG 
AAACGATTGC CTTAACCAAA TCTGTGGGAA TATCATTACC TTGGGCATTG ACGCGGCGTT 
CAGAACCCAA GTCAGCAATG AGTTGATTTT TATTGTCGTA GATTTTACTA GAAGTTGTTG 
CAACTAGTTT ACTCTCGGAT AGGCTAGGAG CCTTGCTAAC GT ACT AG AAA AAAACTCCTC 
CCCCTAAGAC AATGGCTGCG ATAACCAAGC TTAAGAAGCT AATGCTCAGA TACTTGATTA 
GGCGCAGAAT CGTTGGTTTG TTCATCTTGT TTTACCACCT AATAAATGTT CTTTGATAAC 
ATTGAGATAA GGAATTTGAG GGAAGCCACC AGCCTTGATT TCATATCCAT ATTCTCGAAT 
ATATTCAAGT GGCATTGATT TTTGTCCCTT ATCTTGATGA TAGAAGCGAA TCAAATCGAA 
TGCCGGCAAT AAGTAGGTTT CTTGCTGAGA AGAAAAGTGA AGAAGGACAA AGCAGATTCC 
TTGTTGGGCA AGGACTTGTT CCATATGCTG AATCTGATGT GGATGAAAAT TTTTCATCGG 
AATCGCACGT TTTTUriTru TTTCCTTGAC TTCAAAGTCG ATGTAATATC CATTATAAAC 
GCCAGAATAG TCCGTCGTTG AAGCTTGTCG AAAATAGGCT TCAACAATCT TGGCACGACT 
TCGTTGTGGA TAGTCCACTT GTACGATTTG AATAGGAGTT GGTTTCTTAT GTATAACAGC 
CAAGCCCTGA GACAAATAGT ACTCGTTGGT AGCATTGATC ATCTTTTCAA AGGGTACCGA 
GCTCGAATTC GTAATCATGT CATAGCTGTT TCCTGTGTGA AATTGTTATC CGCTCACAAT 
TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGCGTCCCT AATGAGTGAG 
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CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CACTCGGCAA ACCTGTCGTG 10620 

CCACCTCCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC 10680 

TTCCGCTTCC TCGCTCACTG ACTCGCTGCG C 107u 
(2) INFORMATION FOR SEQ ID NO: 146: 



<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11887 base pairs 
(B> TYPE: nucleic acid 
<C) STRANOEONESS : double 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 
TACATTCATT CCATCGGCTA CTCCATAATA CTTAGATAAA ACCATAGCTG AAGTCCAATA 60 

CGGATACTGT AAAGTATTAT CAATTTTAAT CAAATCATCA TTACCGATAA TACTTCTGAT 120 

TGCTTTTGGT AGTATGAACC ATACGTTGGT GAAATCTCAG ATAATGAAGA ATCATTAGAC 180 

TCTGGACCTT TTTCTAGTGT CTCACTTACC TCATATTCTT CACCCTTACT AGAAATAACA 240 

CTCAAAGCAG ATACTGTCGA TAACTGGCTA GCCAATAAAG TACTCGCAAT AATTGAAATA 300 

CCCAATTTTT TATAAACAGT TTTCTTCATT ATTGTATCCT CCTAATGTAA TTATAGCGTA 360 

CTATTCTAAA TTTCTTAATC TACTATAGAA TCAAGAAATC TACCACCTTC TTTAAATACC 420 

CTCCATTATC ACATAAACAG GTAAACTTTT CAATTAATGA CTGCGCTTTT CAATCACGCT 480 

AGAGGTACTT GCTTGCTTCT TTGATACTAA GTTCAGCCAT TCTTTCCTTG TTTTTCTCAA 540 

TAAAGCATGT TACCCAAGTG GGATTCGTTT TGGAGTAGTC TCGCAGAGTC CAGCCAATGG 600 

CTTTATTGAT AAAAAATTCT GTTTGGTTCA ACTTATGAAG GAGAATCTTT TCCATTAATT 660 

GAGTATTGGT CTTCTCTTTT CTTAACAACT GGTGGTCAAT AGCGACACGT CTCAGCCAGA 720 

TATTATCTGA TAGGCTCCAT TTTATACTCA ATGAAAATCA AAGAGCAAAC TAGGAAGCTA 7 80 
GCCGCAGTTG CTCAAAACAC TGTTTTGAGG TTGCAGATAG AGCTGACGTG GTTTGAAGAG 



840 



ATTTTCGAAG AGTATTAAGA TTATTTCTTC TAGTTCAGGG TGTTCATACA CCAAACTCCC 900 

TACTACTCGA TCTAGGATAT CTACCGTGTC CCACAAGGAT TTTGTCACGA CTAACTGCTC 960 

TAGCTTAGGC AAATCGGTTT CCTTTAGATA AGACTGCATT GCTTTCAAAT AGTTAGCAGC 1020 

CACATATTGG TATTTTCTAG GATCCTTTTC CCAGCAAGTG TCTGCAAAAT CCCAATCGAT 1090 

AATCTTTGTT TTTTTCGCTT CTGGAAAATA TTTTATAGAG TTTATTTCTT TCAGGCACCG 1140 

CAATACCTAG AAAAGAAAAT TGATGGCGCA TATAGGCTTC CATGGACCTT GCTTTTTTAG 1200 

AGTCTTTTGC TGCTTCTAGC TCCTCAAGTA AATCTGCTAA ACTCATCTAA AACTCCTCTT 1260 
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GCCCCACCAA ATGCTCCTGA AAGGCATAGA CAGCCGCCTG GCTACGATCG CTGACTTCAA 13 20 

GTTTGGCAAG AATATTGGAC ACGTGGGTCT TGACCCTCTT GAGAGAGATA AAGAGGTCAT 1380 

CTGCCATGCG CTGATTTTCG TAGCCCTTGG CGATGAGTTG GAGAACATCT CGCTCACGCG 1440 

CACTCAATTC TTCATGAAGT TCCATATGAT TGCGGTCGTA TTCAACCTTC TTGCTAACCT 1500 

CTTGCTCAAT GGCCAGCTCG CCAGCAGCTA CCTTACTGAC GGCATGAAGC AATTCATCTG 1560 

CACT AGAAGT — CTTG AGCAT A TAGCCTTTGG ^ACCAGCATC TAAGACTGGC A^TCATTTTTt" 1620 

CATTGTCCAA ATAAGAGGTC ACAATCAAAA TCTTGGCTTC AGGCCATTCT TTAAGGATTG 1680 

CTAAGGTCGC GTCAATCCCA TTCATCTCAC GCATGACAAT ATCCATGACA ATGACATCTG 1740 

GACGCACTTC CAAGGCCAAG TCAATCCCTT GAGACCCGTT GGACGCCTCA CCCACAACTT 1800 

CTACATCGTC TTGGAGGTCA AAGTAGCTTT TCAAGCCCAA TCGGACCATT TCATGGTCAT I860 

CTACTAGTAA AATTTTCATC TTTACTCCTT TATCATTCCT TATCTAACAG GGGAATACGG 1920 

ATATCAACCG CCAGCCCTTG CTTGGGAGCT GTCAAGACTT GAACTGTTCC AGCCATATCT 1980 

TCAACCCGCT CCTTGATATT TCGCAGTCCA TAACTCAAGT CGTCTAAGCT CCCTAACTGG 2040 

AAACCAATCC CATTGTCCAC CACCTTCAGT TGCAATTCAA CATCTGTCTG ATAGAGGTAG 2100 

ACATCTAGGC AAGATGCCTG GGCATGGCGG AGGGTATTGC TAATCAACTC TTGCAGGATA 2160 

CGGAAGATAT GCTCCTCGAT TTTCTTAGGC AATTTCGTCA TATTCTGCTT GAGACTAACC 2220 

CTAAGATCAC TCTTGTCCTC AAGCTCTTTT AAAAGAATTT GAATCCCTTC TATCAAGCTC 2280 

TTCTGCTCCA GTTCAACTGG TCGCAAATGC AAGAGCAAAA CCCGCAAATC CTTCTGGGCT 2340 

GTTTCTAAAA TAGCTCTGAC ACTCTGCAAC TGGGTCTGCA TCTTTTCTCT ATCCAATTTC 2400 

AAAGCCTGCT GACTGATACC CGATAAAATC ATGTGGCCCG CAAACAACTC CTGACTGACT 2460 

GTATCGTGCA AATCCCGAGC AATTCCCTTC CGTTCCTTCT CGATGATTTC CTCTTCCTGA 2 520 

GCAAGCCTCT GATTTTCAGC TTTTTGAAGA GCCTCTGTCA AAAGCTTAAG TTTACCTGAT 2S80 

AAGGACTTGA AACTGGCATC CAAATCTGGA TCTGCAACCT GAACCACTTC TTGCCCTGCT 2640 

AATAAACGCT TGAGATTAGC CTGCATTTTT CTTAGAGAAA GCTCTTCGAT CCCTCGCCAA 2700 

AACAGGGCTA AGAGACAGGT CATGGACATG CTGAAAACCA ACAATAAAAA GACAAATTTT 2760 

TCTGTTTTTT CGACATCGTG CAAAAAGATA GACCAGTCAA AATCAAGTAT TTCCAGCAAG 2820 

CTGTGGGAGA AAAAAAAGAC AAATAGGAAG GAGGTGAGAG CAATAATGAC ATAGGCTTGT 2880 

TTTTTCATCC TCTAACCACC TCCACATCAC CAATCATAGT GGTCAAGAAA ATCTTGACAC 2940 

TCTTGTTACT CTTGAGATAG TCTTTTGTTT CTTGATGATA GTGTTCATTG CGGAGGGCTC 3000 
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GCTTGGGCTC CTTCAAAAAA ATCAAATCCC CAT AG AG AC A GTTAACCCTG AGACTGACTT 3060 

CCACATCTAC AGGTACGATG ATTTTGGTCG TTCCTACCAT C TTTC T GACG ATAATGACAT 3120 

TGTCATGATT GGTTAAGATG ACCCTCTCCA GATGAATAGT GTCCTTGCCC ATGAAGCGAA 3180 

AGAGATTGAT ATCATCGAAT TGGCAAGTCT GGTAGCTTGA AAAATGATGA AGATTTCCAA 3240 

ACCAACGATT TTTCTCCTTC TTAACCGTCA CGACCTCTTC AAAAACCAAA TTGGTCTGCT 3 300 

CTTTTTCCTG CTTCATCATC GGGTAAAGAA GAAAGAGCCT ATAGATAACC GCAACAAAAA 3360 

TAGCTAGAAT CACAAAACCA TTGAGCATAA CGATGAAAAA GAACACAATC GTTGCCGCTA 3420 

CTAAAAGAAG ATTATTTCCC TCTTTACCAG TGTAGTAGCG AATCAAAAGC AAAAAGAGGA 34 BO 

ATAGTATCAG CAGAAAACGC GAAAAATGCT CTGATACCAT CAAAATCAGA GCTCCTCTCA 3540 

GAAGACAGGC TTCGATAAAT AAAAAGATTT TAAATTTTCT CATAGGTTCA TCCTCTCCCT 3 600 

TCTATTTTAT CACAATTCAA AAAAGTCACC TCAGTCTGAG GATGGAAAAA AGGCGCTGGT 3 660 

TACCCCTTTT TCATCTGATC CTTTGCTTCT TTTAATTTTC CATAAAGAAG ATACTCTACT 3720 

TTTTGTAGAT CTGCTATGGT GGCACAGTTA AGGGAACACA TAATCAAGCG TAGATCTGCT 3780 

TTCCAGCCTT GGACAATGCC AATCACTTCT TCAACTGTGT AGGTTTCAAC CAATTCCAGA 3840 

ACGGTTCGTG ACAATCCCAC AGCCTTAGCA CCAAAAACCA AGCACTTAAT CATATCCAGC 3900 

GGATTCCGAA CCCCTCCACT AACCAAGAGT TCGACCTTAT CTTTCCATTC TTGGGCATTG 3960 

AGAAGGGCCT GCATGGTAGA CTGACCCCAT TGATTGAGGT AATCACGCTG GCCACTACGA 4020 

CGGTTTTCGA TATAGGCAAA GCTGGTGCCA CCACGACCCG ATAGGTCCAC TGTACGAACA 4080 

CCGAATTCAT AGGCTCTTTC GATTGTCTTG GCATCCATTC CAAAGCCCAC TTCCTTGAGG 4140 

ACAATAGGAA CGGGAATTTG CTTGCTATAA TCTGCTAGAT GCGATTGCCA GCTTCTAAAC 4200 

TTCCTTTCTC CCTCGGGCAT GAGTAATTCC TGCATGACAT TGACATGCAC TTGCAATAGA 4260 

ACAGGATTCA TCTCTTCTAC AGTCTGAAGT CCTAACTCGA CAGGCTTGTC CAATCCAATA 4 320 

TTGGTTCCAA GGAGGAGATT GGGATGACTA GACTTGACAG AAAAAGAATC ATCCGTTGGA 4 380 

TTTTTGAGGG CTGCGCTATA AGAACCCGTT ACAAATAAAA TACCACAGGA TTCCGCCACC 4440 

TGAGCCAGCT TTTGATTGAT TTCTCTTCCC TTATTACTTC CACCAGTCAT GGCATTGATA 4500 

TAAAAAGGAA AGTCCCACTT TCGACCAGCA AACTCTGTCG AAAGATCGAT TTCATCCAGA 4560 

TTGTAAAGAG GCAAGGAAGA ATGAATCAGC TCCACCTCAT CAAAGCTATT ATAGGAACTT 4620 

TTCTGCTCAA GGGCATAGAG GATATGCTCG TCCTTACGAT TTGTCGTCAT GTCCTATCCT 4 680 

TTCTTGATAT AAGAGCTCAA TCCCCAGATC GGCCCAACGA TTTTTTAAGG TTTTGGTTGA 4740 

TTGCGCATCA AAACTCAGGG CGATGCCACA GTCACCACCA CCAGCACCAC TACTCTTGGC 4800 



WO 98/18931 



PCT/US97/19588 



981 

AACGGTCTGC AAATCTTGAC TGGCTTCTTT CAACTGTCTA AGCAAAGGCG TCTAAATATC 4860 
TGTACTCAAG CCTTCTAAAA GCTTGCTGGC TACTTCTACT TCATCGATAA TCTTTTCTGA 4 920 

TTTCCCCTGT TCCAAGGCTT CTACCAGAGA ACTCACCGTT TCTTTTGAGG AAGTTAAAAA 4 980 

ATTTTGATTG ATATTTTGCT TGATTTGCTG GACCATCTGA CTCGATACAG CCACTTCCTT 5040 
GGTCCATCCC ACTAAGAAAT CACATTCTAA AGTTGGTTTC ACTTGTGAAA TTGAAAAGCC 5100 
- CCAATCACCC TCCACAACTG TCGCCAACTT TTCTTCTTCT AACCAAGCAG CCACCTTCTG 5160 

CCGATCAAAT GACTGGTAGA GAACCAAATC CTCTGCCACA ATACAGGCAA GGTCGCCCAT 52 20 

GGAACCATTG TCTCCTCGCT TAAGCAAGAC AGCGCTAGTC AGCTTGAACA AGAGCTCCTG 52 80 

ATCAACAGAA ACATCATACA GAGCCAGTAA AGCCTTGACA ACCAAGACAA CGACGCTGCC S340 

ACTAGAACCT AGACCAAACT TTTTCCCTTC TCGTTCCATT TTGCCACAGA TTTCTAGAGA 54 00 

AAAAGGTCTT AAATTCTGAC CACGAACAGC GAGGAAGTCT CCCATCAAAG CAATCGTTTC 54 60 

TTGAATCAAG CTATAGTCAG GATTAGGCCT TAAGTCCACT GCGAAATCAA ACATATCTGA 5520 

ATAGATACGG TAGCTGTCAG AAAAAGCAAT CTCAGCCCTC ATATAGATGG GAATATCCTT 5580 

TATCAAAGCT AACTGCCCTG GCTCTAAAAT AGCATATTCA CCTGCCCAAT AGAGTTTTCC 5640 

GCAAGTTTTA ACAGCAATCA TCTTGACTCA AATCCTTTGT TTTTGACACA ATCAAGCGAT 5700 

AACGATGACC GAAAATTTCT GATAAATGCT CCAAGTCTTT CTCCTGACAG AAGACCTTAA 5760 

CATTGGGACC AGCATCCATG GTAAAGTAGC ACGCCTCTCC TTTCTCACGA AGCTGGCGAA 5820 

CAAAGGCCAT AGCCTCATAA GAGGCATCCG TCAGATAAGA AAAGCCTGGA CTACCAGTCT 5880 

TTGTCGTAGC ATGCATAGCC AGCGCATTTT TCTCCGTTAA TTCTCCAATC TTGGCAAAAT 5940 

CATTTTCCTT GAGATAAATC AGCATATCCT GATAGTCCTT CTCAGACTGA CGAACCCAGT 6000 

CGTCGAAAGT CGTCGAGGTT TCCACACAAA GTTTCATCCC GTCACGGCTA GAGATTGCTT 6060 

TTTTCTTGTC CTCTAGCACC AACATAATCA TAGCTAGTTT CAAGTCTGTC TCTACAGGGT 6120 

AAATTTCTCC ACTATCCTTA TCCCAGGCTC CTAGTGGTCC ATAAAAACTC CGAGAAGAAG 6180 

AACCTGAGGC AAATTTGGCT TCCTGTCCCA ACTGACTTCT ATCCAATCCA AGCTTGAAAT 6240 

AAGCATTACA AGCCTTGACC AGGGCGCACA AACCACTAGA ACTTGAGGAC AGACCCGCTG 6300 

CCCTAGGCAT ATTGTTTTGA GTATCGATAC GGACAAAGCC CTCACCAGCT GGACGATAAC 63 60 

GGTCAATAAT CTT ACT CATC TTGGCATGCT CGACCTCATT TTGTAGCTGA CCATTGATGT 6420 

AAAATTCGTC AGCTGTTACA TTGGCTGGTA AAGGCGACAA GGTCGTCTCT GTATACATAT 6480 

TTTCCAAACT TAGAGAAATA CTGCTAGTAG CAGGCACCAT CTCTTTTTCT TTTTTCTTTC 6540 
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CCCAATATTT GATAATAGCA ATATTTGCGT AGCAACGTAC TGTTACAGCC TCTCTATCCA 6600 

TGTCTGAACA GCTCCTTTCT CTTCTAATCT TTCTGCTAGT TCTTGTGCGT CTGTCAAATT 6660 

GGTTACCAAG GCTATGATAC AACCTCCTAG CCCACCACCG CTCATCTTGG CACCCAGAGC 6720 

ACCATGGCTA AGAGTCGTTT CAACCAAAAA GTCTGCCTCA GGGCTACTGA CTCCAATTTC 6780 

TTTTAAATGT AAATGCGCTT GACTGAGGAT TTGTCCCAGT CCTTCAGCAT CTTTTTGTGA 6840 

AATCGCAACT TCTCCTTGCT GGGTTAATTC TCCCAAGGCA TGCAAAAACG GTAGCGCATC 6900 

CTTGCCCTTA TTTTGAACCA CTTGGATGGC TTCACGACTA TGACCATAAA CACCCGTATC 6960 

GGCAATCACC AAATAGGCGG ATAAATCCAT CTCAAGTTCT GTAAATCCTA CGTTCTTGAT 7020 

AAAGCGAATA GGTTGGTCAC TAAGACAGGT CTTAGCATCC AAACCACTAG GATTCATATG 7080 

GGCAATCATT TCAGCTCGAT TGACCAAGAT TTCTAGTACA TCATGAGGCA GATCAGCCTG 7140 

ATAGTAGTCA AATACTGCAC GAATGCCCGC TATGCTGATA GCCGCTGACG AACCCATCCC 7200 

CCGTTTCTCA GGGATAGCCG AGTCAATCTC ACAACGAATG CAGGCTTCTG TGATATTCAA 7260 

ATACTCCAGT GAGGCATAAA CCGCCATGGA CAAGGTATCC TCCTCATAAA GGCGCCAAGG 7320 

ACTCTCTGCA GGAACTACCT TACAGGTCAC CTCCACCTCC AAAAGACGCA CGCAAATCGC 7380 

AGGATAACCG TAAACGACCG CATGTTCCCC TATTAAAATT ATCTTACTAT GTGCCTGACC 7440 

GACACCAACT TTTTTTGTCA TTTTTTCCTT TTACTAGACG AAAAAACCTC TTATTTTTCA 7500 

TACAAGTATT AATTCTTTCC TATCTATTTT ATTATATTTT CACAAAAAAA GCGATTGTTT 7S60 

CCATTCACAA TCGCTTCTTT CATTATTGAA CCCATTCGCC ATTATAGTTG ACAGAATAGC 7620 

CATCTACGGT CGTATTCACT GCCAAGGCAC CTGAGCGCTA TAAGCGTACT ACCATCTGCC 7680 

ATTGACCTGG AACCAACCTG TCGTCATAGA ACGACGAAAG AAACTCCATA CCATTAAGTA 7740 

AAGAGGAAAG TCGTGAGGGA GCATGCGCCA TTGACAACCT GTTTTAGTGA CGTACAAAGT 7800 

CTCATTAACA AGTACTCGTT TCGGCCATTT ATAGGTGCGG TGTTTGGAGA AATACGGTTC 7860 

AATCTTCGCC CATTCTTGAT CGTTTAAATC AGTATCATAT GCTTTGCGTA TCATAACTCT 7920 

AGCTTAACAT TTTTTTGTCA ATACAGGTTC TAAATAATCG ACCACGAAAA TTTCTTAAGT 7980 

GGAAAACGCC TTATGAAGTA TGCTACGGGA AAGTTATGCA CTTAATTTGA CAATTCAAGA 8040 

TGTAAAAATA TATACTATAG TAGATTGAAA CTAGAATAGT ACACCTCTAC TTCTAAAATA 8100 

TTGTTAGAAA TCGATTTGAC TGTCCTGATC GATTTATCCT GTTATTATCT CATTTTACTA 8160 

TAATATTTGA TAAGTTATCC TAAAAGTATT ATTATGTTGT TGTGTTATAG ATTGATTGAA 8220 

TCTAACTAAA GGATCCTATT CAATTACTAG AACTATCACA TACTCAAGGT CACCTCACAG 8280 

ATGAGCAACT ATTTTGGTTA CAATGTCTAC TAAATTTAAG TCAAACAAAT AATTTAGTCA 8340 
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AAATTAAAAA AATAGAGGAA CATAAATATG 


1 ATTACAAAAC 


: ACAATGTAAT 


1 AGTGTTCTAC 


8400 


AATTTTTACT 


' AGATAAAACT GTAAATTCTG 


AAGGAAGGAT 


CACTTCTTCA 


ACAGAATTTG 


3460 


GAAATTTCGT 


AAGTAATTTA TCATTCCAAC 


ACGGAATAGC 


TGGACTACTG 


TTTCCTCTAA 


8520 


ATAAATTGTA 


CCCCCCAGAA CTGGATTCTA 


AAATACTCTC 


TATCATCAAG 


AAGCCAGTGA 


8580 


CAATTAGAAC 


GACACACACA TATGAATATC 


AATACTCACT 


GCTATTTGCT 


GATGCAGGCT 


8640 


ATCTATGGTT 


ACTCCTACAT TTATTTTCTA 


TCAGTAAAAA 


TCAATACTAT 


CTACAATTAG 


8700 


CAAACGTCAC 


CGCTAAAAAA TTAATAGAGA 


ATTATGATAC 


TCTAGAGGAA 


ATAGACTTTG 


8760 


CATTGGGAAA 


ATCTGGTGTC CTATTATCAT 


TAATAAAATA 


CTATCAATTT 


ACCAATGACA 


8820 


ATACTCTTAA 


AATTTTCATC CACAATAGTA 


TAGGGGAAAT 


TTATCATTAT 


TTCCTACAAA 


8880 


GAGATACAGC 


CAAAGAAACC ATTTTAGACT 


ATAGCTTTGC 


TCATGGATAT 


TGTGGAATTG 


8940 


CATATGCTTT 


ATTTGCCTAT TCTAAAGTCT 


TAGAACCTTC 


TATGTTTTAT 


AATGATCTCC 


9000 


ATACATTCCA 


TACTGAATTA AAAAAATTAT 


TAGAAAAAGT 


TACTTCTAAT 


ACTGAAAATT 


9060 


TAGGAAATTT 


ACAACTTTCT TCGTGCAAAG 


GAATTTCCGG 


AATAATCTTA 


TATCTTTGTA 


9120 


TGTACGATTG 


TGACGGAAAC AAAGATATTA 


TTAGTAAATA 


TCAAGAATTT 


GTTTTTAACC 


9180 


ATCATCTAAA 


AATGATGACA GGATATTGCC 


ACGGAATAAC 


TAGCTTACTA 


CAAACCACTG 


9240 


TCTACAATCA 


AAACAAATTA CTGATGAAAA 


AAATCCAACA 


GGTAATTTTA 


GCATGTTCTG 


9300 


AACGAGATGA 


TCACGGTTTA CTGATGTTTC 


AAGGAGATAG 


TGCTAAACCA 


GATTTGTTTC 


9360 


ACTTCGGAAT 


AGGAAGCATG GGGTATATTG 


GTGTCTATTA 


AATAATAAAT 


TCCCATTCGA 


9420 


TGTGCAGACA 


TAAGGAGAAA AGTATGAAAT 


TATTTTGGAC 


AAACAACATA 


TATAGACAGT 


9480 


TGCTGCTAAA 


CAGCTGTTTT TCATCATTCG 


GCGACAGTAT 


TTTCTACCTC 


GCCATTATCA 


9540 


ATTATGTGGC 


TCAGTACAAT TTCGCTCCGC 


TAGCGATTTT 


ACTGATTTCC 


ATTTCAGAGA 


9600 


TGGTTCCCCT 


ACTATCGCAA CTCTTTCTCG 


GGATTCTAGG 


AGATTTTCAA 


GAAAATAGAG 


9660 


TCAAACACGC 


ACTCTGGATT GCCAAAATCA AAATCCTGCT CTACGCTATT TTGACAGTAT 


9720 


TTCTCGTCTT 


GTCGCCCTTT TCATTAGTTT 


CAGTCATTAT 


GATTGTCATC 


ATCAACCTCA 


9780 


TCTCTGACAC 


CTTGAGCTAC CTGTCTGCCT 


ACATGATGAA 


CGCCCTCTAC 


ATCAGTGTAA 


9840 


TTAAGGACGA 


CCTGCATGAT GCCATGGGGT 


TCAGGCAGTC 


TCTGATGAGG 


GTTGTCCGTA 


9900 


TTGTCGCCAA 


TCTGGCTGGC GCATTCCTTA 


TCAATGTTAT 


AAGTATTCAA 


ACTATTTCCC 


9960 


TTATCAACAC 


TCTGACTTTT GTCATTGCCT 


TTTTGGGCCT 


GTATGTTATT 


CGACATACCT 


10020 


TGTATGAGGT 


TGAAAAAAGA ATTGAAATGT 


CACATACAGC 


ACTGAGTTTT 


AAGAAATATT 


10080 
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TTCAACATCT 
TGTTTCTGAC 
GCTTCATCCA 
TCATCATGTC 
TCCGTTTCAC 
TCCTTTGTCA 
GCATTCTCAC 
GGACGGTGGG 
TAGCCCTATC 
TTTTAATTGC 
AAAGCATGTG 
AGTCACTTCT 
ACCCATTCGC 
CCTGAGCTAT 
CCATTACCTG 
GCTCCTGATG 
TCGCCATTTG 
ATTGCTCCTG 
ATAATACCAG 
CGTCTTTCAC 
TTAGACTGAT 
GTATAATAAC 
CCAACACTGT 
CCATTTTTAT 
GGGTAAGGAC 
ATCTCTCCAG 
TCTTTGAGGT 
TCAGAATCAT 
CCAACTTGCA 
CACCAGATGA 



TAAACAGTCG 
GACCAGTATG 
ACAGACACTA 
TTGTGGAGCT 
GCACCTCTTG 
AGCCTATATC 
CCCTCGCCTA 
CTC'TGCTCTG 
CATAGCATCG 
TCTTATCTTT 
TAGATTTCAC 
TTGATTAAGC 
CATTATAGTT 
AAGCATAGTA 
CATTTAGGTA 
AACGGAGATA 
GGTGGTCTAA 
AGGAACGGAG 
TTGTTGGATC 
CACCAAGGTA 
CATAAAGCCA 
GCTTCTCTTC 
CTTTAGTTTT 
CGAAGTAGTA 
CAATTGTACT 
ATAGAGAATC 
CCCCCTTTTT 
CTGCAAATAC 
TAGTTTTTTT 
GGCGAAATTA 



CTGGCTGTGC 
ATTGCCATCT 
GCACAACTGA 
ATCCTTGGCA 
GTTTTCTGTG 
GTAATTTTCA 
CAAGCAGCTG 
AGCACAGTGG 
GGCGTTTCGG 
TGTCAATGGT 
ATGCTTTTAA 
GAGTGTTCTA 
GACAGAATAG 
CCAGTTGCCA 
GTACCAAGTT 
GTACCATTTG 
ATAATACCAA 
GTAGTACCAC 
TAGGTAGTAC 
GTTTTCTCCA 
ACCTGTCTCT 
TTTATCTTCT 
AATCTCTAAT 
CCACTCTGAC 
ACCTTTAGAT 
AAAATAATAG 
GTAGTAGGTT 
TGTACTGGTC 
CAAAATTTTC 
TAAACTTTAC 



984 
TCCTCAGGTT 



TGGATGTGTC 
GCATTGGGCA 
ATATGACCAG 
AGATTTCCCT 
TGACCAGTTT 
TCTTTGCCCA 
ACATTCTCGC 
TGCAGTTAGC 
TAGTCAAGTT 
TCTCCCCAAT 
ATATAATTAT 
CCATCTACGG 
TTGACCTGCA 
GAACCATCTT 
TTCCCAAGGT 
GTGGTACCTT 
TTATTACCTA 
CAAGTCGAAT 
TTAATTTCCG 
AAAGAATCAT 
GAATCTTCAC 
GTTTTCCAAC 
TTTGGAAAAC 
GGAAACGGGA 
TACTTACCAT 
CTTCCGTTTT 
CCTAGCAAAC 
ATCTATATAC 
CATCGATAGT 



AAAAGATACC 
CCCTCGGCTG 
ACTCCTCGCC 
CAGTAATCTA 
ATTCACTCTA 
CATCAGTTCT 
TATCCCCAGT 
CCCGTCCCTG 
ATTGATATTT 
CAACACTCAT 
CGTCAGGTCA 
AAGCGCCCTC 
TCGTATTCAC 
ACCAACCTGT 
GATACCAACC 
TTTGCCAACC 
CCTGATACCA 
GATATTGCCA 
CATCGTTTAT 
TCTTAGCTAG 
TTTGATTAAA 
GTTTTTCCCC 
CAACAAACTC 
CTTCTAATCT 
TATATTCCCA 
CAATCACTCG 
CTTGGACAAA 
CAAAGAAAAA 
CCTCCAATAT 
TTGGCTACCT 



GTCATACTAC 
ATTGCCCTCC 
CTGCTCTCCA 
TTTAAAAATA 
ATAACTAGTA 
ACGATTATCG 
CACAAGATGC 
CTCTCCCTAT 
TTGTATCTTA 
AACTAACGAA 
AGTACAACAA 
TCATTACCGA 
TGCCAAAGCA 
CTTCATGTCT 
AGTTGCCATA 
TGTTTTCATA 
GCCAGTGGCC 
ACCTCTTTGC 
CCACCCCGCA 
ATAATACCAG 
GTAATAGTTC 
GTACTTTCTT 
TTGTAGCACT 
GATACCATTT 
GCCGACAACC 
CCAGTAGGTT 
CTGCCATCCT 
TACTGTCAGT 
TAAATCCACT 
GTAACCATTG 



10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

1062C 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11230 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 
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(2) INFORMATION FOR SEQ ID NO: 147: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11340 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEONESS : double 
fD> TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 147: 



CCGGTATGTT 


1 CTGGAATACT 


1 ACCAATCTAA 


. GCTGCCTGTG 


! CCCTACAGTT 


1 IAV.AACCCT 


60 


GTACGAATAC 


CTTAAGGAAT 


ATGACCGATT 


TTTCAGCTGG 


GTTTTCGAfIT 




120 


AAACGCTGAT 


AAAATATCCG 


ATATTCCTTT 


ATCAGTTTTG 




WTAACjAAAGA 


180 


CATGGAATCC 


TTTATCCTTT 


ATCTACGTGA 


ACGTCCCTTG 


CTG AATCCT A 




1 A rt 
J 4 0 


ACAAGGTGTT 


TCACAGACAA 


CTATCAATCG 


AACCTTATCA 


GCACTTTCTA 


U i \. 1 J I nu\A 


JUO 


GTATCTAACC 


GAGGAGGTTG 


AAAACGATCA 


GGGGGAACCT 


TATTTCTATP 


VJ 1 Art Lu l AA 1 


j oO 


GAAAAAAGTT 


TCCACCAAGA 


AAAAGAAAGA 


AACCCTTGCT 


GCCAGAGCTG 






GCAAAAACTC 


* * » W * ^%UW 4 




AGGTTTTCTA 


ACTTATATCG 


ATCAAGAGCA 


480 


CCCACAACAG 


CTTTCAAATC 


GAGCTCTCTC 


ATCATTCAAC 


AAAAATAAAG 


AACGAGATTT 


540 


AGCCATTATT 


GCCCTTCTCT 


TGGCATCTGG 


TGTTCGCTTA 


TCTGAAGCTC 


TTAATCTAGA 


600 


TCTAAGAGAT 


CTCAATCTAA 


AAATGATGGT 


TATTGATGTT 


ACTCGAAAAG 


GTTGCAAACG 


660 


TGACTCAGTC 


AATGTCGCTG 


CTTTTGCTAA 


ACCTTATTTA 


GAGAATTATC 


TGGCCATTCG 


720 


GAATCAACGC 


TATAAAACGG 


AAAAAACAGA 


TACAGCCCTT 


TTTTTAACTC 


TCTACAGAGG 


780 


TGTTCCTAAT 


CGTATCGATG 


CTTCTAGCGT 


TGAGAAAATG 


GTTGCTAAAT 


ACTCAGAGGA 


840 


TTTTAAAGTG 


CGTGTAACAC 


CCCATAAACT 


GCGCCATACA 


CTAGCAACTA 


GGCTCTATGA 


900 


TGCGACTAAA 


TCACAAGTTT 


TAGTCAGTCA 


CCAACTAGGA 


CATGCTAGCA 


CACAAGTCAC 


960 


TGACCTCTAT 


ACCCATATTG 


TTAGTGATGA 


ACAAAAGAAT 


GCTCTGGATA 


GTTTATGATT 


1020 


TTACGTATTT 


TAAATTATGT 


AAATAAATAT 


CAAAAAAAGA 


AGTTGGCCAA 


CTTCTTTTTG 


1080 


ATTTATCCAA 


CTACCGCTTC 


AGCGATTTCT 


TCACGCCTAA 


TACCAGCGAA 


GTAGCGTGTG 


1140 


ATATCAATGG 


TTTTTAGCGC 


CTTAAGAACA 


TCTTCGCGTT 


CGTATTTCAC 


CCCACGAAGG 


1200 


ACATCTTCTA 


CTGCAGCAAC 


GTCTTCAATA 


CCAAAGAAGT 


C AC CATAAAT 


CTTGATGTCT 


1260 


TGGATTTTTG 


ATTCAGTAAC 


GTTAGCAAAG 


ACTTCAACCT 


TACCACTAGT 


GAATTTGATT 


1320 
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CCACGACGGA CGTTAAATTC ACCTGATTTA CCATAGTTCC AGTCCCAAGT TCCAAACTTA 1380 

GTATCCTTGA TGCGATTGAT TTCGGCCAAT TCTTCTTCTG AAAACACCTA TTCAGTCATC 1440 

TCTGGGTACT CTTTTTTCAT GTATTCCAAG AGTAAATCAC GGAATTTTTC GACTGTGATT 1500 

TTTTTTGGTA ATTCATTGAT AATATTGGTT ACACGGGCAC GGACGGATTT CACACCTTTT 1560 

GATTCAAATT TATCTTTTGA AACCTTAAGG GCATTTGCGA GGACTGACAA ATCAACGTCA 1620 

AAGAGCAACC AACCGTGGTG CATCATACGG CCGTTGATAT AGGCTTGGGC ATTGCCACAG 1680 

AACTTCTTAC CATCAATCTC AAGGTCATTA CGACCTGTGA ACTCAGCTTT AACCCCAAGT 1740 

TGAGCCAGGG TATTGATAAC CGGAGTTGAG AAGCTCTTGA AGTCAAATGC CTTATTTTCA 1800 

TCTTCTTTGG AGATGATCGT GTAGTTGAGC TTATTTAAAT CGTGGTAAAC AGCTCCACCA 1860 

CCACTAATAC GGCGAACTAC CTCAATACCA TTTTCCCGAA CATAATCACG GTTGATTTCT 1920 

TCGATAGTGT TCTGGTGACG ACCAACAATC ATAGATGGCT TGTTAATCCA AAGTAGGAAG 1980 

ATTTGATCCT CATCCAAAAG GTGTTTAAAG GCGTATTCTT CCAAGGCAAT ATTAAAAGCA 2040 

GTGTCATTTG AATGATTGAT AATGTATTTC ATGATATCCC TTTACTTTAT ATGATAGAAA 2100 

CTGGAAATAA CCTTCCAGTC TAATCTATCT TCGTTTTATT TTTTCTTAGG TGAATGGATG 2160 

GCCATTCCTA GAACATCTGC AAACGCTTCG TACATCACTT CAGAGTAAGT TGGGTGCCCG 2220 

TGGATGGTCT TCAGCATTTC CTCAACAGTG ATTTCCATTT CGATGATGCT TGATGCTTCG 2280 

TTTATTAATT CTGCGGCTGC AGGACCAATA ATGTGTACAC CAAGGATTTC TCCCTATTTC 2340 

TTATCAGCGA TAACTTTTAC GAAACCTTGA GCTGCGTCAG ATGCAATAGC ACGACCGTTA 2400 

GCAGCAAAGT TAAACTTACC GATGGCAACA TCGTATTTCT CACGGGCTTG TTCTTCTGTC 2460 

AAACCTACTG CTGCTACTTC AGGGAGAGTC TAGATGGCTG CAGGAGTCAA ATTCAATTTG 2520 

GCAACTGCAT GATTTCCTTT AAGGGCATTT TCAGCGGAAA CTTCACCCAT GCGGAAAGCT 2580 

GCGTGAGCCA ACATCTTAGT ACCGTTGATG TCACCTGGTG CATAAATGCC TGGAACTGAA 2640 

CTTTCCATGT ATTCGTTGAC CTTGATACAA CCACGATCCA ATTCAAACTC AACCTCTCCA 2700 

ATACCTTCAA GGTCTGGCAT ACGACCAATT GAAAGAAGAG CTTTGCTTCC GATGATATCG 2760 

TCTTTTCCTT CAACCTTGAT ACGAAGTTGA CCATTTTCCT CAATGATTTC TTGCAGTTTA 2 820 

GTACCAGTCA AGATGGTCAT TCCTTTACGC TCAAGAATCA AGCGAAGGTT CTTAGAAACT 2880 

TCCACATCCA TAGCTGGAAC TATACGGTCC ATCATTTCGA TAACAGTCAC TTTTGAACCA 2940 

AATGTCATGA AGGCCTGACC GACTTCGATA CCGACAACTC CACCACCGAT GATAACAAGG 3000 

CTTTCTGGCA CTTCGTTCAT TTCAAGAATG TCATCACTAG TCATGACAAG TGGAGATTCC 3060 

ATACCAGGGA CGTTGATCTT GTTGACTTTT GAACCACCAG CAAGAATGAT TTTCTTGGTT 3120 
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TCAAGCAATT CAGAACCATT TACCAAGACG TTCTTGTCTT TAGTGATTCT ACCAATTCCT 
TTATGAACAG TAACTCCGTA GCTACGAAGA AGTCCTGCAA CACCACCAAC AAGAGTATTA 
ACAACTTTAG ATTTACTTTC TAAAAGTTTT TCCATATCAA CAGTGAAGTT AGGATTTTCA 
ATCACGATAC CACGATTTCC AGCATGACCG ATATTTTCAA TAATTTCAGC GTTATGAAGG 
TAGGTCTTGC TTGGAATACA TCCACGGTTT AAGCAGGTTC CACCAAGTTC AGATTTCTCA 
ACAAGGCCAA CCTTACCCCC GAATTGGGCA "GCTTTAATGG CTGCAACATA ACCAGCAGGA 
CCTCCACCAA TCACAACGAT ATCAAAAGCA TCATCGCTCT TACCATCATC GTTTGAGGTA 
CTTGCTACAG GTACAGGGCT AGCTTCTGGC GATGCTGCTC CAGCTGTTGG GATGTTTTCC 
CTTTCTTCAC CAAGGTAACC GATAACTTCC GTTACAGGGA CACTTTCACC ATCTCCTTTG 
AGAATGGCAA TCAAGTACCC ATCTTCTTCG GCTTCCAATT CCATGCTGAC TTTATCAGTC 
ATGATTTCCA AAAGGATTTC TCCTTCTTTT ACAAATTCTC CGACTTTTTT ATTCCATTGG 
ACGATTTGTC CTTCTGTCAT ATCCACGCCG GCTTTTGGCA TAATTACTTC TAAGGCCATG 
TCTTCCTTCC TTTATCTATA TCTTAAAAAT GAATACTCTT GCTCTTAAAT TAACATTGAG 
ATTGGCGTTT CAATCAACTC TTTCAAGTCC TTCATAAACT TAGCACCAGC CATACCATCT 
ACGACACGGT GGTCAATGCT TAATCCTAAA CTCATGATTG GGCGAATCAC AATTTCACCA 
TTGACGACAA CTGGCTTCTC GATTGTCGAA CTGACACCAA GGATAGCTGA CTTCCCTTGG 
TTAATAATCG GACCAAAGGA CTGAACACCA AACATTCCCA AATTACTGAT TCTCAATGTT 
GAATTTTGTA ACTCACTTGG AGCCAATTTA CCATCCAAGG TACGGCCAAT AACATCCTTA 
AAGGCTACAA CCAGTTCTGA AAGACTCATC TTCTCAGCAT TGTAAACAAC AGGTGTCATC 
AATCCATTAT CCATCCCAAC TGCCATGGCA AGATTGACAT AGTTGTGACT GATAATAGTC 
TTGCCATCTT CTGTCAATGA AGCGTTGATG TATGGGTGTT TCATAAGAGT CTTAACAACT 
GCAAGCGAAA GAAGGTCTGT TACAGTAGTC TTCTTCCCAG TTGCTTCCAT GATTGGCTCA 
AGAACCTTCT TACGAAGAGC CAACATTTCA GTCATATCAA CTTCATACTT GAGGGTGAAG 
GTTGGCGCAG TCAAGTAAGA TTCAACCATG CGTTGGGCAA TAACCTTACG CATTGGTGTC 
ATTGGAATAC GCTCGATTTT ACCATATGGT GTTACGTTAT CAGGGACTTC TTCCACTTTT 
TCAATCTCAG CAGGAGATTT GATGCTATCG TTTTCGATAT TTTCAGGAAG CAGGGCCAAA 
ACATCCTTCT TCATGATTTT ACCACGATGA CCGGTTCCTT GGATTTCCTG CCAAGCAATG 
TTATGTTCCA GGGCAATTCG TTTTGCAAGT GGCGAAATGC GAACCACGTT TGTGTCTTTA 
TAAGTTTCCA CGTCTTCTTT GTGGACACGA CCGTTTGCAC CTGAGCCAGA AACGTCGTAG 
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AGGTTTATCC CTAAATCATC CCCTAACTTT CTACCTGCAG GAGTCGCTCT TAGCTTGTCA 4920 
TCAGCCATGA CCTCTCCAAT TCTATTTATG ATACAAAGGG CGTCAAAAGC GACTGAAAAA 4 980 

TACGAAATCG ACGATGGCTT CGATGAAGCC AAGGAGATTT ATCTTTTTTC CGATCTTTTA 5040 
GCCCGTGCTC TAATCTAACA TATTAATGAC GAAGAGCTCT GCACCTAAAA GATACAAAGT 5100 
TTCTCGTCAG CTTTATTTTA TTTACATAAC TTATCTTATG TAACCCTATT CTTTGTTATA 5160 
AGTTTTTCGG ATTGCATCTT TGATACTTTC AACTCTTGGA ATCATTGCAT TTTCTAGGTT 5220 
TTGTGCATAA GGCATCGGCA CATCTTCTCC TGCACAACGG CGAATTGGTG CATCTAGATA 5280 
GTCAAATGCT TCTGATTCTG AAATAATAGC TGAAATTTCA CCGATATAGC CACTTGTTTT 5340 
GTGGGCATCG TTGACCAGAA CAACCTTACC AGTCTTCTTC ACTGAGTTTA TGATGATATC 5400 
CTTATCAAGC GGAACAAGGG TACGTGGGTC AACAATTTCA ACTGAAATTC CTTCTTCTGC 5460 
TAATTCTTCA GCAGCTTGAA CCACACGGCG AAGCATTTTT CCATAAGTAA CAACTGTTAC 5520 
ATCCGTTCCT TGGCGTTTGA TTTCACCAAC CCCAAGTGGA ATTGTGTAGT CTGGATCAAC 5580 

TGGCACTTCC CCTTTTTGGT TAAATTCTGA CTTGTACTCA AGTATAATAA CTGGGTTGTT 5640 
ATCACGGATA GAAGACTTAA GCAGGCCTTT CATGTCCGCA GGTGTTCCAG GTGCCACAAC 5700 

CTTAAGTCCT GGAATGTGAG TAAACCAAGA GTCTAGAGAT TGTGAGTGCT GGGCGGCAGA 5760 

GCCAACTCCG TTACCAGCTG CACAACGAAC AGTCATTGGA ACCTCACCTT TACCACCAAA 5820 

CATGTAACGT GTTTTAGCAG CTTGGTTGAC GATATTGTCC ATGGCAATAA CAGAGAAGTC 5880 

CATCAAGGTC ATATCGACGA TTGGACGAAG TCCTGTCATG GCTGCTCCTC CTGCTGCTCC 5940 

AGAGATGGCA GCTTCAGAAA TCGGACAGTC ACGGACACGT TCTGGACCAA ATTCTTCAAG 6000 

CATTCCAACA GAAGTACCGA AGTCTCCTCC GAAGACACCG ACGTCTTCTC CCATCAAGAA 6060 

CACATTTTCA TCGCGACGCA TTTCCTCAGA CATAGCAAGG ATAATGGTCT CACGGAAGCA 6120 

CATTGTTTTT GTTTCCATTT TATCTCTTTC TCCTTAGTCT GCGTAAATAT CTTCAAACCC 6180 

TGATTCAAGC GGTGGGAATG GGCTTTCCTC TGCAAATTTA ACAGAAGCTT CTACTGCTTC 6240 

CTTTACTTGC GCTTGGATTT CTTCCAATTC TTCGGCACTT GCAATGTTAT TTTCAATAAG 6300 

GTAATTGCGG AGGTTTTCGA TTGGATCTTT TTGTTTCCAC AATTCCACTT CTTCACGCCT 6360 

ACOATATTTA CCAGGGTCAG ATGATGAGTG ACCGAGCCAG CGATAAGTTA CACTTTCAAT 6420 

CAAGACTGGA CCATTGCCAC TGCGAACATG GTCCACAGCT TTCTGAAATC CTTCATAGAC 6480 

ATCGATGACA TTGTTACCGT CTTCGATGAA CATTCCAGGA ATTCCATAAG CGGCGCTACG 6540 

TTGATGGATA TGTTCTATAT TGGTCATTTT CTTGATATCC GCAGAGATAC CGTAACCGTT 6600 

GTTAATGCAA TAGAAAATGA CTGGCACGTT CCAGATAGAA GCCATCTTCA CTGCTTCGTG 6660 
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TTTATTCATA GAAATCAACT TCGGTATCCA ACAATTTATC 
CTGAAAACAA GGGTTTATCC TCTAAAAGCC ACTCAACAAA 
TCGGCTTGCT CAAAACCTCA TCATAGCGAA CCCATTCTAC 
TCAAGTCGCC CTCAAACTCC GTCACCTTAA AAACATAGGT 
TAAATTCAGG AAAACTGATG ACACCTTTTA GAACTGGCTT 
CAAGGATTTC ACGCGCCGCG CATTCCTGGG GCCTCTCTCC 
CACCAATCCA TTTCCCTTCA TGGACATCAT TGGGTTTCTT 
GTTCTTTCCC ATTATCAATG TAGCAAATCG TCGCTAACTG 
TAAGCCAATC GATTGGCTCT TGTCCTGTCT CTTTTAAGAA 
GCTTGGAACC CCAAAATCCT CTATAAACCG ACAAAGGACT 
TCAAGTGATG AGGATTGGTA ACTAATGCCT TCTTCTTACG 
CAAAAACGAC TGGTCTATCT AGATGATTGA CCACCTGAAT 
CCCAGATTtG ACCAGCATGA CCATTGGCCT GTCCAGCAGG 
GAAGCAAGAC TCCTTCCTCA GCCCAAGCTG TCAAATCATG 
CATCTGACAA TTCTTTCAAG ATATTTTGCA AGGATGGTGG 
CAGAAAAACT CAAGCCCTGC GCTPGACCTG GTCCGTGATA 
CCACCTTAAC TTCTTCAAGC AGTGTTGTCA AGAGAGCCTG 
GATAAATAAT CCCCTGAGAA TAGACCTGCT CCATAAACTG 
CCTCAGGTAA TTGCGCCTTA ATCAAAGCAT GCCAAGACGA 
(2) INFORMATION FOR SEQ ID NO: 14 8: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12127 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANOBDNESS : double 
(D) TOPOLOGY: linear 



CCCATCATAA 
GGTGTGGTCA 
CGTCCCCTCA 
GTACCAGTCT 
GGCTTTGAGC 
TCTCTCTAGC 
ATTACCATGC 
AGGCATATTT 
TGCATTGGCC 
TGGATGGGCT 
TGCATAACCT 
CACAGCATCA 
AACAGTCAAA 
ACATTTCTTA 
AGCTGGGATA 
GGGGTCTTGC 
AAAAACCTTT 
ATTGATTTTC 
GTGTTCCATA 



ACAAACTTGG 
CCTTCCCAAG 
TTGCAGTCAA 
AAATCTGGTG 
CCTGTTTCTT 
TTACCACCCA 
AGCATGAGCA 
TCTCCTTATC 
TTGGAAAAGG 
GATTCGATAA 
CCCCAGAGTA 
GTAAAAGGCT 
CAAGCATTAA 
ACTCCGATAT 
GAGTCAGGTA 
CCTAGAATTA 
TCCTTGGGTG 
CCGAAATAAC 
GCCGACTCGG 



10260 

10320 

10380 

10440 

10500 

10560 
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10680 

10740 

10800 

10860 
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11040 

11100 

11160 

11220 

11280 

11340 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 
AAAAAATAGA CTTGTTAGAC TATAAATGTA GTAAGCCTAC ACAAGAAAAA TACATAGAGA 
TAAAGGTGAT TATTATGAAA TTCAAAAAAA TGCTTACTCT TGCAGCCATT GGCTTATCAG 
GATTTGGGCT TGTTGCCTGT GGCAATCAGT CAGCTGCTTC CAAACAGTCA CCTTCAGGAA 
CGATTGAGGT GATTTCACGA GAAAATGGCT CTGGGACACG GGGTGCCTTC ACAGAAATCA 



60 
120 
180 
240 
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CAGGGATTCT CAAAAAAGAC GGTCATAAAA AAATTCACAA CACTGCCAAA ACAGCTGTGA 300 

TTCAAAATAG TACAGAAGGT CTTCTCTCAG CAGTTCAAGG CAATCCTAAT GCTATCGGCT 3 60 

ACATCTCCTT GGGATCTTTA ACGAAATCTC TCAAGGCTTT AGAGATTGAT GGTGTCAAGG 4 20 

CTAGTCGAGA CACAGTTTTA GATGGTGAAT ACCCTCTTCA ACCTCCCTTC AACATTGTTT 4 80 

GGTCTTCTAA TCTTTCCAAG CTAGGTCAAG ATTTTATCAG CTTTATCCAC TCCAAACAAG 540 

GTCAACAAGT GGTCACAGAT AATAAATTTA TTGAAGCTAA AACCGAAACC ACGGAATATA 600 

CAAGCCAACA CTTATCAGGC AAGTTGTCTG TTGTAGGTTC CACTTCAGTA TCTTCTTTAA 660 

TGGAAAAATT AGCAGAAGCT TATAAAAAAG AAAATCCAGA AGTTACGATT GATATTACCT 7 20 

CTAATGGGTC TTCAGCAGGT ATTACCGCTG TTAAGGAGAA AACCGCTGAT ATTGGTATGG 780 

TTTCTAGGGA ATTAACTCCT GAAGAAGGTA AGAGTCTCAC CCATGATGCT ATTG CTTT AG 840 

ACGGTATTGC TCTTCTGGTC AATAATGACA ATAAGGCAAG CCAAGTCAGT ATGGCTGAAC 900 

TTGCACACCT TTTTAGTGGC AAATTAACCA CCTGGGACAA GATTAAATAA AATGTTTGCT 960 

CCATAAATCT CTAAAGAGAT GCAGACGTTT CATCGTACAA TAAGATAAAG AAGGCAAGTA 1020 

GGGAGGTGTC GTATCTCCCT TACTTTCTTC ACTAGAAAGG ACAAGATGTC ACAAAACAAG 1080 

CCTTCAAACA AGCAGTTTTT AGGGCAATTT TTTTCATGAG TGCAACAGTA GCTGTTGTAG 1140 

CTATTTTGCT AATCTGTTTC TTTATTTTTA GTAATGGCTT ACCTTTCATA GCTAACTACG 1200 

GCTTTCCCCG TTTTTTATTA GGCAGTGATT GGTCGCCAAC GAACATTCCG GCAAGCTATG 1260 

GTATTTTACC AATGATCGTT GGTTCCTTAT TAATTACCTT AGGAGCGATT GTGATTGGGG 1320 

TGCCAACAGG CATCTTGACA TCGGTGTTTA TGGTTTATTA TTGTCCAAAG CCCGTCTATG 13 80 

GCTTCTTAAA ATCAGCTATC AACTTGATGG CACCCATTCC ATCTATTGTT TATGGTTTTT 1440 

TCGGCCTACA ATTATTGGTG CCTTGGATTA GAAGCTTTTT AGGAAATGGC ATGAGTGTCC 1500 

TAACCGCTTC GTTACTATTA GGAATAATGA TTTPGCCAAC CATTATCAGT TTGTCAGAAT 1560 

CTGCTATCCG AACAGTTCCC AAAACGTATT ATTCTGGTAG CTTGGCTCTA GGAGCTAGTC 1620 

ATGAACGGAG TATTTTTAGT GTCATCTTGC CAGCTGCGAG ATCTGGTATT TTATCACCAG 1680 

TTATTTTAGG AATCGGTCGC GCAGTAGGTG AAACCATGGC AGTTATTTTG GTGGCAGGCA 1740 

ACCAGCCGAT TATTCCAAGT GGACTCTTTT CAGGAACCAG AACCTTAACA ACCAATATTG 1800 

TTCTGGAAAT GGCTTACGCA TCAGGTCAGC ATAGGGAAGC CCTTATTGCA ACCTCAGCAG 1860 

TTCTCTTTTT CCTTATTCTC TTGATTAATG CCTACTTTGC CTACTTGAAA GGAAAATCAT 1920 

CTTATGAGTA AATACCTGCT AAAACTTCTC GTTTATTGTT TTTCAGCTTT AACCTTTGGC 1980 

TCTCTCTTTT TAATCATTGG TTTTATCCTC ATCAAACCCT TACCTCATCT AAGTCTATCC 2040 
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GAAAACACCT TCATTCCTCC CACCATCTCC AAACAAGCAG ACAACGATTT TACCGGTATT 6720 

TTGCATTTGC TGACTGAGGG CTGCACCGAC ACCCATCCCC ATACCACCAC C7ACCATACC 6780 

ATTGGCACCA AGGTTCCCAG CATCAAGGTC AGCGATATGC ATAGATCCAC CTTTCCCTTT 6840 

ACAGGTTCCA GTGTATTTAC CAAGGATTTC ACCCATCATT CCGTTGAGGT CAATCCCTTT 6900 

AGCAATAGCT TGCCCGTGTC CACGGTGGTT TGAGGTAATC AGATCATCTG GATTGAGAGC 6960 

— - -TAACATAGCC CCCACGTTAG CTGCCTCTTC _ ACCAACAGAA "AAGTGCGTCA TTCCTCCCAC ~ 7oTo 

TTTCCCTTTC TTTACTAATT GTGCAATTTT TAAGTCCATG CGACGGATTT CTTCCATCTT 7080 

ACGGAACATT TCTAGCAAAA GATTTTTATC TAAAGTTGAC ATCTTCTTGC CTTTCTAACT 7140 

TTCTTCTTAC CTTACTATTT TACCGCTTTT GGCAAATACT CTCAAAGTTT TTCTAAAAGA 7200 

AATTTCACAA AATAAAAAAG AAAACCCCGT GAAAACAAGG GATTTTCTTG TCAAGAATAT 7 260 

TTTTTCACAA ACTTTTTAGC ATTTGGATTT TGCTAAAGAT TCAAATCTCT TCATAATCAC 73 20 

AGTTAAACCC CAACGGTAGA GCGCCCCGCT CACAATCAAA CTAATAATCA AGCCGATCCA 7 380 

GTAAGAATAA GCTCCAAAAT CTCTTACGGA ATCAAATAGC GTAnCACAGG GATTGCTACG 74 40 

CCCCAATAAC CAAGCAAACC AAGGTAAAAA GGAATAACTC TATCCTTATA CCCCCGCAAA 7500 

ATTCCCTGAA GCGGCGCCGC AAAGGTATCT GCTAACTGGA AGAAAAGACT ATAAGTTAAA 7560 

AAACGCACTG TCAAATCGAT AAATTTTGGG TCGTTACCAT AAAGACTGGC CACATTTCCC 7620 

CTAAAAATGT AAAGGAAGGT TAAGGTGAAG GCCGCAAAAA TGAGGGCAGT CCATCTTCCT 7 680 

AGACCAATAT AGGTTTTCGC ATCATCAAAT CGCTTGGCTC CCACTTCATA GGAAACGACA 7740 

ATAGCCATAG CCGATGAGAT ACTCATAGGA AAGGCGTACA TAAGACTTGA AAAGTTCATA 7800 

GCTGACTGGT GACTAGCTAT AATCAAGGGC GAAAACTTAG CCATAATCAA GCCAACCACT 7860 

GAAAAGATAG CCACTTCCGC GAAGACAGTT CCCCCAATAG GCAGACCTAA ACGAACTCCT 7920 

TCCTTAATTT TATCCATATT AAGTGGAATT CGTTTCTCAA GGTGTAAGGC TTTGAGCTTC 7980 

TCCTGTTTAA ATAAAACCAG AACAGAAATC CCAAGCAAGA CCCAGTAGGC CAAGGATGTT 8040 

CCTAAACCAG CACCAGCCCC TCCCAGTTCT GGAACACCAA AGGCACCGTA AATCAAGAGA 8100 

TAGTTAAATC CGCTATTGAG AGGGAGTAAC AAAAGCATGA GGTACATGGA CAGTTTGGTC 8160 

AAGCCCAGCG AATCCAGCAA GGAACGAATG ACGCTAAAGA GCAACAAGGG GATAATCCCG 8220 

ATAGATAAAA ACCAAAGATA GCGAACCGCT ACTGCCGCTA CTGCTGCTTC TAACCCAATA 8280 

TGATTCAAGA TTATTGGTGC CAAGAAAAGT ACCATCCCCA GCAAGACCAC AGATAGGCCC 8340 

AAGGCCAAAT AAATAAATTG GTAAAAATCA GACGCAACTT CTTCCTTTTT GCCTCGACCA 8400 
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AGATGGTGAC 
GGATTCCAGA 
GTCATTGCAG 
AAGAAAATTT 
CTCTTTCTAT 
CGGAGGTTTA 
TCTAACTAAA 
TTGAATTTTG 
CTCCCTACGT 
GTCCCTTACA 
TAGCTGCTTT 
TGATATCTGC 
GGTTGTATGT 
AGGTCAAGCT 
TATCCTCAAC 
TGACACCTGA 
GATTCATCTT 
GTGGCGCTAC 
CACTTTCTTC 
TGACATCACG 
GTTCTCTAGC 
GAAGGCTGAT 
TGCTCTCAAG 
TAAAGTCACT 
TAATCTTCTC 
TTGGACTGGT 
CACCAGTATG 
CGACCAAGCC 
TGATCTCAGA 
CCAAACCAGA 



CAATGATAGG 
TACTGGTTGC 
TATCAACAAA 
TTAAAAATAA 
TCTGATTTAT 
TTAGATTTTG 
CCTCCGTTAT 
GAACGATTAG 
TTTCGAAAAT 
GATGGTATAT 
GGAAGCAAAA 
TGGACCATTC 
CATTTTTTCA 
GAGAGACAGG 
ATTTTTTTCA 
TTTGAGACCT 
AGCATTGCTA 
TTCTGCTGTG 
CTTAGACAAA 
CGCCATCATA 
AATATTTTCA 
AAAGGTATTG 
CGGAATACCG 
CAAGTTTTTA 
TTTGGCAGCT 
ATTAGCCATC 
AATGTCTTCT 
AGGAGCAACC 
CGCAATTTTG 
CTTGGGATCC 



CACCAAGGCT 
CATAGATACA 
AGAGGCAGAA 
TACTAACTTC 
CTAAACCAAA 
AAGTAGTATG 
CATATTGAAC 
CTGCGGGACA 
AATTCATATT 
TTAACCTGCC 
TGGGAGTCCA 
TCAGCCAAGT 
AGTAATTCCA 
GATGTTTCTA 
TCTACATGAT 
TCGATAACGG 
CCTTGTGTTA 
ACTTCTGCAC 
TCCTGGATCT 
GCGTACTCAG 
TTAAAGCCAA 
AGTTTTTTGG 
TCATCAGAGA 
CCATTAAAGT 
GACTGGAGAA 
ATGACGACAG 
TTATGTGTTT 
ACAAGACCAG 
ATAATTTTCC 
ATTACACGAC 



990 
GACACAATCC 



CCAGCCAAGT 
TAATTGGCAA 
TCTCGTAAAC 
GAGTTTCAGA 
CCAACACGCA 
CGCATGGTCA 
GTAAATTCCA 
CTAACTCCTA 
CTTTTAAGGT 
CAAAGCGGTC 
AACCTGCTTC 
TCAAGCTCAA 
AGCCAGTCAT 
GAGGCGCGTG 
CACGACGGTC 
AAAGAAGTGC 
CTAACCCCTG 
GAACATGGGC 
CCACCCCAGT 
GAACACCCTT 
CTTCCTCCAT 
AACCAACCGC 
TTTTAGTAAT 
CTGCTTGCAA 
TAGTAAAACC 
GACCAGGTTC 
TAG CATC AAT 
CATCTTGAAC 
CATTTTTGAT 



CTGTTAGAAA 
CCATACTGTT 
ATTGGTAGAT 
ACTTTGTCTT 
CCATAGTTTT 
CATGTACGAC 
GCTTTTTCTT 
CT ATT AG ATT 
TCAAGCTTGA 
TTCACCGATG 
AGCCTTGGCA 
AAAGTTGTAA 
CTCACCAGCT 
ACCAGATGGC 
GTCAGTCGCA 
TGATTCCAAA 
TTCTGTCTTA 
AGCAAACTCC 
TTTAGTTGCA 
AGCACCGCAG 
CAAACCTGGA 
GCCTTCCTTC 
ACCAGCTTCT 
GGTCGCAACT 
AGTCTCCACG 
ACCTGCAGCG 
ACGGAAATGA 
CGTTTCTGCT 
TAAGACATCA 
TAGTAGCATC 



TGTAAAGAAA 
GTATTCACCT 
CAGGATTGGG 
ATACATACTT 
TCAAACTTAG 
AATAATAGCT 
TAGTTTCATA 
TCGCTTGTCT 
TAGACGATTT 
AATGGTGAAT 
TCAAAAATAG 
AGCTTGGCTG 
TCTACTAAAT 
CCTTTGGTAA 
ATAACTGTGA 
CGAAGCGGTG 
GAGAAATGCT 
ACTACTTTAA 
TAGGCAATCA 
ATATGGAAAT 
TCTTCCTCAT 
ACAATCTTAC 
AAGAGTGCCT 
GTCTTGACAT 
TCTGAAATGG 
GCTGCTAGGG 
ACATGAATAT 
CCTTCTTCCG 
CAAACTTGAT 
TGCTTTCTCC 
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CTCTTTTCTT 
GTTATTCTGC 
TATCTTGTCC 
GATACCTTAT 
GTCTTCTTAG 
TTGCCACTCA 
CAAGCAAGTT 
GTTGCCATGC 
ACAGCTCCCC 
TCAGGCCCTT 
GAAGCCTATG 
AGCTTATTAT 
ACACCTAGAC 
AGAAAGACAG 
AACCCTTAAC 
AGATGAGCAA 
GATGGTTTTT 
CCCAAGCACA 
AAAAGGGGCA 
ATCTGGCGGT 
TCTGTTAATG 
CCTCATTCAG 
AGCTTCACCT 
AGATACCGTT 
ACGGTTCGGA 
TTAGAACAAT 
CTGGCCTTAG 
ATCAACCAAG 
CCACAAGTCT 



GGACTTATAC 
TCTTTGGTCC 
AATATACAAA 
CTGGGATTCC 
GTTTTCAATA 
TTATTCGCTC- 
ATGGACTTGG 
CAGGTATTTT 
TCATGTATAC 
CTCTAGCCCT 
CTACCGGCGT 
CTCGAAAACT 
TTATTTTACG 
ATTACTGCCT 
CGGATGAACG 
GATATTTATA 
CAACAGCCTA 
CATGGTATTC 
GCCATTTGGG 
CAGCAGCAAC 
GATGAGCCGA 
CAACTAAAAA 
ATTTCAGATA 
GACGTGTTTA 
TAAGGAAGGA 
CCTTTTTAGG 
CCTCCAAAGA 
GTCAAAGCGC 
CTGACCTTCC 



TTCTGAGAAC 
TCTTCTTTTA 
AAAAGATTCC 
TTCCATTGTT 
CTCTCTGTTA 
AACAGAAGAA 
GGCAGGTAAG 
AGCTGGAGTG 
ATTAGGTACC 
ACATATGTAT 
GATTTTGATT 
TGTGAAAGGA 
GGGATTTTCA 
TGATAGGCCC 
ATTTGGTTCC 
GTAGCAAATT 
ATCCCTTTGC 
GAGACAAAAA 
AAGAACTCAA 
GCCTTTCCAT 
CTTCAGCCTT 
AGGATTATAC 
AAACTGCTTT 
CCAATCCAAA 
AAAACCTATG 
ACTAGGGCAA 
CAAGGAGATG 
TATCGAATTG 
ATTTGTGATT 



993 
ATTTCCCTTA 
GCCTTGCCCA 
CTTTGTGTTA 
TTTGGTCTGT 
TCAGGAATCT 
GCCCTTTTAT 
TTACGGACTG 
ATACTAGCTA 
TCTACCAATA 
ATGCTGTCAA 
ATTACTCTTT 
GCTTCCTAGT 
AGCCTTAAAA 
ATCTGGTTGT 
TTCTTGCCAT 
CAACCTTAAT 
CATGTCTATC 
ACAATTAGAT 
AGATGATCTT 
TGCGCGAGCT 
AGACCCTATC 
GATTATCATT 
TTTCTTAACA 
AGATCAGCGC 
AGAAATCAAT 
CTTGTCCTTG 
GCAGAGCTAA 
ACCTGTGCCC 
AGCATCATGT 



TGCCAGCGAT 
TAGCCATTTT 
AAATCATGCG 
TTGGCATGCT 
TAACCTCAGT 
CTGTTACTGA 
TTTTTAGAAT 
TTGGCCGTAT 
CGCCAAGTAG 
GTGAGGGGCT 
TAATGATAAA 
ATGCGAACAT 
AATATTTCGA 
GGCAAATCAA 
ATTGAACGCC 
CAGCTACGTA 
TATGATAACC 
GCCTTAGTGG 
AAAAAGAGTG 
TTAGCAGTAG 
TCCACTTTAA 
GTTACCCATA 
GGAGAAATTT 
ACAGAAGACT 
TTGACTTAGA 
AAACAGCTTC 
TTATCAATAA 
GTTTGTTGGC 
CTTCTTGTTC 



TATTTCCACC 
TGCTGGTTTT 
ATTCGCCTCA 
CTTCTTTGTA 
TATCATGCTG 
TAGCATGCCT 
TCTTCTACCA 
CGTTGGTGAA 
TCTCATGTCT 
ACATGTCAAT 
TACTCTATCA 
TTTCACTCAG 
TTCAATTACC 
CTTTTCTAAA 
AAGTCCTCTT 
AGCGTGTAGG 
TGCCTTATGG 
AGAAATCTTT 
CCATGTCCTT 
AACCTGATAT 
AAATTGAAGA 
ACATGCAACA 
GCGAATTTGG 
ATATTTCAGG 
ATTGCATGAA 
AAAAGCCTTA 
GCATCATGCT 
CTTGCAGCAG 
AGACCTTGAA 



2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 
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CGTATGGGAG 


ACCATATGGC 


AGGCATTGCC 


AAAGCTGTTT 


TGCAACTAAA 


AGAAAATCAA 


3840 


CTAGCCCCTG 


ACGAAGAACA 


GTTACACCAA 


ATGGGTAAAT 


TATCCCTCAG 


CATGCTAGCC 


3900 


GATTTATTGG 


TTCCCTTTCC TTTGCACCAA GCCTCAAAAG CTATTACTAT 


TGCTCAAAAA 


3960 


GATGAACAGA 


TTGACCAATA 


TTATTATGCC 


TTATCAAAGG 


AAATCATTGG 


ACTTATGAAA 


4020 


GACCAAGAAA 


CCTCAATTCC 


CAATGGAACT 


CAATACCTTT 


ATATCATAGG 


GCATCTGGAA 


4080 


CGCTCGCTGA 


TTACATTGCT 


AACATTTGTG 


AACGCCTAGT 


CTACCTACAA 


ACAGGAGAAC 


4140 


TAGTGGATTT 


GAATTAATTC 


AACTAATCCT 


TAAAAGAGAA 


GAGTACGATT 


AAGTACTCTT 


4200 


TTTTATGGTT 


GTAAAAAAGT 


TCATTTGACC 


AATTTAAGCA 


GTGTAGATAG 


TGAGGAGTTG 


4260 


TTTCAATTCT 


ATCGTGAACG 


AGGGAATGCT 


GAAAACTTTA TCAAAGAAAG GAAACCAGGA 


4320 


TTCTTTGGGG 


ATAAGACAGA 


TAGTTCGACC 


ATGATTAAGA 


ATGAAGTACG 


TATGATGATG 


4380 


GGCTGTCTGG 


CTTATAATCT 


CTACCTCTTT 


TTAAACCAGC 


TAGCTGGTGA 


TGAAGTAAAG 


4440 


TCCTTGACTA 


TCAAGCGTTT 


TCGACGTCTC 


TTCCTTCATA 


TTGCCGGAAA 


ATATGTCTCT 


4500 


ACTGCTAGAC 


GACATATTCT 


CAAATTCTCA 


AGTCTATACG 


CCTATTCAAA 


ACAGTTTCAA 


4560 


GCCTTATTTG 


ATACAATCTG 


CCAGATAAAT 


CTGATACTCC 


CTGTTCCATA 


TAGAGCTAGA 


4620 


GGGCAGGGGA 


AAACATGCCT 


AACAGAATAA 


GTCACCTTAT 


TTTAAAAATC 


GAGCATCAAA 


4680 


CCAAGGGAGG 


AGTCTGCCCT 


TTTTTAGGAA 


AAAATCAAGA 


CAAATCTCCT 


CAATTATGTC 


4740 


TCGAACATCA 


GAAATTAAGC 


AAAATCACCA 


GAAGGACAGT 


ATTTCAACTA 


GCTTTTCTGG 


4800 


TAATTTTTGA 


ACTGTGTAGT 


TCGTTAGTGC 


CAGATATGAA 


TAATTTGGGA 


TGATAAATCT 


4860 


TTCTTCCTCA 


CGTAGCCTAT 


CATAATACTC 


TTCAAAAATC 


TTATCAAAAA 


CACTCTCTTT 


4920 


CTTTTGGGCG 


ATAGTTTCAT 


CTTCGTATGT 


AGGAGTCCTC 


ATCAAGAAAT 


ACTTCAATTC 


4980 


TAGGTATTCC 


TTATCCAACT 


CTATATAACT 


TGGCATCAAC 


TTGTAATCTT 


CAACCCCCAA 


5040 


ACGTTCAGCA 


ATATATTTTA 


ACTTTGTrAG 


TATTGGTCTG 


GATTCTCCAT 


TTTCAATTCT 


5100 


AATTAATTGA 


CGGATACTTA 


ATTCAGACTC 


ATCACCACAA 


AATTCTGAAC 


GACTGATTI r 


5160 


TTTAGCCAAA 


CGTAATCTTT 


TAATTTTTTC 


GCCAAACTCT 


CGCAACCTAC 


AAGAACTTCC 


Ti220 


TGAGTTUTTT 


ACCTCTATTA 


TAAGCATATA 


CTGAATCAAA 


CTATCTATCA 


GATTTCTTCT 


5280 


CACTTTAACT 


AAAGACTAAG 


AGTTTATCCC 


TTCGTCTCGG 


TTTTTGTCTA 


TTTTTCCACC 


5340 


ATACCCCAGT 


AATGCAAGTG 


CAAAATCCCC 


TAGAATATGA 


TAGAATAAGA 


GAAAGAACTC 


5400 


TATCAAGGAG 


GAAATCATGG 


AAAAACAAAC 


CGTCGCCGTC 


TTGGGGCCTG 


GTTCTTGGGG 


5460 


AACCGCCCTT 


TCACAAGTCT 


TAAATGACAA 


TGGACACGAG 


GTACGTATTT 


GGGGAAATCT 


5520 


TCCCGAGCAA 


ATCAATGAAA 


TTAATACACA 


CCATACTAAT 


AAGCACTACT 


TTAAAGATGT 


5580 
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CGTTCTAGAC GAAAATATCA TTGCCTACAC CGACTTAGCA GAAACATTGA AAGATGTGGA 
TCCGATTTTG TTTGTTGTCC CAACAAAAGT GACACGACTT GTTGCCCAGC AAGTTCCACA 
AACCTTGGAC CATAAGGTTA TCATCATGCA CGCATCAAAG GGATTAGAAC CTGATAGCCA 
TAAACGATTA TCAACCATTC TTGAAGAAGA AATTCCTCAA CATCTCCGTA GTGATATCGT 
CGTTGTTTCA GGGCCTAGTC ATGCAGAAGA GACCATTGTG CCTGACCTAA CTTTAATAAC 
TGCTGCTTCT AAAGATTTAC AAACACCTCA ATACGTTCAG AAGCTATTTA GTAATCACTA 
CTTCCGACTT TATACCAATA CGGATGTTAT CGGGGTTGAA ACTGCTGGTG CTCTTAAAAA 
TATTATTGCT GTCGGTGCTG GAGCTTTACA TGGTCTTGGA TTTGGTGATA ATGCTAAGGC 
AGCCATCATC GCTCGAGGTT TAGCAGAAAT CACCCGCCTA GGGGTAGCAC TCCCGCCCAC 
TCCATTGACC TATAGCGGCT TATCTCGTCT GGGAGATTTG ATCGTAACGG GAACTTCCAT 
CCACTCTCCT AACTGGAGAG CTGGAGATGC TCTCGGACGA GGAGAATCCC TAGCTGATAT 
AGAAGCTAAT ATGGGCATGG TAATCGAAGG AATTTCAACG ACTCGACCAG CCTATGAACT 
AGCCCAAGAA CTTGGAGTCT ATATGCCCAT TACACAGGCT ATTTACCAAC TTATTTATCA 
CGGAACCAAT ATCAAAGATG CCATTTATGA CATCATGAAC AATGAATTTA AACCAGAAAA 
TGAGTGGTCT TAACCCTCTA TAGAAAGGAT TTTTATGACA TCAAAACTTA GAAAGGCAGT 
CATCCCTGCT GCTGGACTAC GAACTCGATT TTTACCAGCA ACCAAGGCCC TTGCCAAAGA 
AATGTTGCCA ATCGTAGACA AACCAACTAT CCAGTTTATC GTGGAAGAAG CTCTCAAATC 
AGGTATTGAA GATATTCTAG TTGTCACTGG TAAATCAAAA CGTTCTATTG AGGACCACTT 
TGATTCAAAC TTCGAATTGG AATATAACCT CAAAGAAAAA GGGAAAACAG ATCTTTTGAA 
GCTAGTTGAT AAAACAACTG ACATGCGTCT GCATTTTATC CGCCAAACTC ATCCACGCGG 
TCTCGGAGAT GCTGTTTTGC AAGCCAAGGC TTTCGTCGGA AATGAACCTT TTGTCGTTAT 
GCTTGGTGAT GACTTGATGG ATATCACAGA CGAAAAGGCT GTTCCACTTA CCAAACAACT 
CATGGATGAC TACGAGCGTA CCCACGCGTC TACTATCGCT GTCATGCCAG TCCCTCATGA 
CGAAGTATCT GCTTACGGGG TTATTGCTCC GCAAGGCGAA GGAAAAGATG GTCTTTACAG 
TGTTGAAACC TTTGTTGAAA AACCACCTCC AGAGGACGCT CCTAGCGACC TTGCTATTAT 
CGGACGCTAC CTCCTCACGC CTGAAATTTT TGAGATTCTC GAAAACCAAG CTCCAGGTGC 
AGGAAATGAA ATTCAGCTGA CAGATGCAAT CGACACCCTC AATAAAACAC AACGTGTATT 
TGCTCGTGAG TTCAAAGGCG CTCGTTACGA TGTCGGAGAC AAGTTTGGCT TCATGAAAAC 
ATCCATCGAC TACGCCCTCA AACACCCACA AGTCAAAGAT GATTTGAAGA ATTACCTCAT 
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CCAACTTGGA AAAGAATTGA CTGAGAAGGA ATAACAAAAT CATTTATATA AAGATTAGCC 7 380 

ACACATAAAT TAAGTAAATT CTCTACTTGA ATCTACCTAT TTAATAAAAA CTAATGAAAA 7440 

CGCTATACTT GTATTTGTTT TTTCATTAAA ATAAGAGTAG AATAAATTAG TATAGTAAAA 7500 

CAAAAAAGCA CCGAATCGGT GCGCACTTTT TCAAGTTGTG TACGGACAAA GCCTTATTTT 7560 

AACTTTGCTA TGTTGTTTCT AATGGTTCCA AAATAATAAA TARTTTTAAA TTTGACTTAA 7620 

CTGTTGGAGT AGTCATGGTT AAATTAAATC AACCGAGCCG AACATAAGTT GTTTAATTTT 7680 

CTGGAAGCTA TTAATAAAAA TATAATAAGG GAGAAAGATA CCTGTAATTT TAATTTTAAA 7740 

GTAATTGCGG ACACTATCAA AGAAAAAGAT TATGGAGAAC AAATTTGTAG AATTTATCGA 7 800 

AAACAATAAA AAAGTAATCA TTTCATCACT TGCAGTTGGT GTTGTATTGG TATTAGGGTT 7 860 

TGGATGGTAT TCATATAACC AACAACAAGC AGAACAACAA GCAAAAATTG TACAATTAGA 7920 

AAAAGATAGC AAATCAGACA AAGAACAAGT TGATAAACTA TTTGAATCAT TTGATGCATC 7980 

TTCAGATGAA TCTATTTCTA AATTAAAAGA ACTATCTGAA ACTTCACTTA AAACCGATGC 8040 

AGGTAAAGAC TATCTTAATA ACAAAGTCAA AGAATCATCT AAAGCAATTG TAGATTTTCA 8100 

TTTGCAAAAA GGTTTGGCTT ATGATGTTAA AGATTCAGAT GACAAATTTA AAGATAAAGC 8160 

AACTCTTGAA ACAAATGTAA AAGAAATTAC AAAACAAATT GATTTTATCA AAAAAGTTGA 8220 

TGAAACTTTT AAACAAGAGA ATTTGGAAGA AACTCTTAAA TCTCTAAATG ATCTTGTTGA 8280 

TAAATATCAA AAACAAATCG AACTTTTGAA GAAAGAAGAA GAAAAACCTG CTGAAAAAGC 8340 

TGCTGAAAAA GCAAAGGAAT CTTCTAGTCA AAGTAATTCT TCTGGTAGTG CTTCTAATGA 8400 

GTCTTATAAT GGATCTTCCA ATTCAAATGT AGATTATAGT TCATCTGAAC AAACTAATGG 84 60 

ATATTCAAAT AATTATGGCG GTCAAGATTA TTCTGGTTCA GGAGATAGTT CAACAAATGG 8520 

TGGATCATCA GAACAATATT CATCTAGCAA TTCAAACAGC GGAGCAAATA ATGTCTACAG 8580 

ATATAAAGGC ACTGGTGCTG ACGGCTATCA AAGATACTAC TACAAAGATC ATAATAATGG 8640 

AGATGTGTAT GATGACGATG GAAATTACCT TGGGAACTTT GGTGGCGGCA TTGCAGAACC 8700 

TAGTCAACCC TAATAACTAT TTTAGAGCTG TGTTGTTTCG AATGGTTCCA AAACACATTA 8760 

AAAGCTACTC ATTTTTTAAG TAGCTTTTTT CTTATTCAAG TTTACATATT ATACTCAATG 8820 

AAAATCAAAT TCAAACCACG TCAGCATCGC CTTACCGTAG CTATGGTTAC TGACTTCGTC 8880 

AGTTTCATCT ACAACCTCAA AACCATGTTT TGAGCTGACT TCGTCAGTTC TATCTACAAC 8940 

CTCAAAGCAG TGCTTTGAGC AACCTGCGGC TAGCTTCCTA GTTTGCTCTT TGATTTTCAT 9000 

TGAGTATTAG TCGTCACAAT CCCATTCCCT TGTAGAAAAG CAAAATGGCG AGTCCTACGA 9060 

ACAAGACTAC CGCTCCTAAT CTCTGGCTGG TGTTATACAT CCGTTTTTCT CCTCTAACTG 9120 
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GAAAGATAAC 
TTCCTGGAAT 
CTAGCTCTTG 
GCCCATAAAG 
AGAGATTGCC 
TCTCCTCTAC 
ACCCAATATG 
TAGCTGGCCC 
CTGCAGTGAC 
CAGGGTAACG 
TGGTTTTCAG 
GAAAAATGTT 
GTCCTAAAAG 
AAATCTCCCT 
ACCACAAAGT 
TTCAGCGCCT 
GCGATGACCT 
TCTATAGCCT 
AATTCCGATT 
CCGCAGCCAC 
CGTAGGGACT 
CAAAGTCCTC 
CAAAGAAGTC 
CTTGTTTGAG 
TTTTTACCCT 
GTAAAAAAGC 
TCACCACTGC 
CAAAGTAACT 
CTTTGCCTTT 



TGCTAGAAAT 
CAGAACACTT 
GATATAAGGA 
ACACCTAGAC 
CATCATTCCT 
CTGCCTTCCA 
AACAAAAATG 
ATACATGGCT 
CAACATTAGT 
TCTATCAAAG 
GTATAAAGTC 
CCAGATAGCG 
CCAGACCAGG 
GTAGCTCCAG 
CCATGCGCCC 
GCTCAATCAG 
TGGCTTCTTG 
GTTTTTGCTC 
TCATAGACAA 
CCCAATAGCT 
ATCGATACCT 
GCCTGTCATA 
CATCAGTTCA 
TTCCACTTCG 
CTTTTGCACC 
TGTGTCTGTG 
TCCCTCGATT 
AGCCGCCACC 
GAAACGGATC 



CCGCCACCAA 
CCAATAATGT 
TTGCGAGTTG 
GCGCCTGCTG 
GATAAAAGAT 
AGATAATAAA 
GCAGACAAGA 
CCAAATCGAA 
AAAAATACCA 
ATTTCCTTCA 
CTGAATTTGA 
GTCATAATAG 
AACATGAATC 
TAAGGCAAAG 
CTTGGGATAA 
TTCCTGCGTT 
ATAAAAGGGG 
TTGAGATATA 
GCCCTCTATT 
AAGACTTCTT 
AGCCAAAACA 
GCAGGTTCGA 
CGCGCCAAGG 
ACTTCCATAT 
AAGAGACTCA 
ATGACATTGT 
GGGTTGACAT 
AAGGCGTCAT 
TTCACCTCGC 



997 
CTGCACCACC 
TAACCACAAA 
CATAGCGAAG 
CTAAGGATTT 
AGAGAAAGAA 
GTGAAACCAT 
GACGCCAAAC 
ATAATCTATC 
AGGCCGTCAC 
TCAATTAATA 
CAAGGATATA 
CCTCCACCGT 
AAATCAATCT 
AAAGTTTTTA 
GTTTTGGGTA 
TGAAACTCAT 
TGTTGTAAAA 
GCCTTCATTT 
CTGCTGCCTT 
CCTTAGGACT 
TCACGCCATC 
TATCAATCAA 
CTGGATTGTT 
CAAAGGCACC 
TGTCCTGTGT 
TGGTGGTTCC 
TGCGGCTAAC 
TGGCTTCATG 
AAGTTCCTGC 



GATATGGCCT 
AAGTGTCAGA 
AACAATAATC 
AGGACTAAAT 
AAACTGCTTA 
ATTAACAATG 
CTGCTCGGGA 
TGCCCTGTCA 
TAAGAGGAAG 
CCTCCTGAAC 
TCGTACTCAA 
ATCCTATCCG 
GAGATGCATC 
CCAACTGTTG 
TTAAAACCTT 
GAGAAAAAGA 
GCCCCTCGCT 
CATGCAAGAC 
CTTTTTCAGG 
CATTTGAGGG 
AACCTTTGAA 
CTCCATTCCG 
CTCAACAGGT 
TGCAACCCCT 
CAAGGCACGA 
ACCTTGAAAA 
AACTGACTGC 
AGGAAAAGCT 
AAAGAGTGTA 



GCTAGGCTGA 
TAGGATTGCC 
GCGGCAAATA 
ACAAAAACAA 
GAACCCAAAA 
AAATGTTCCC 
AACAGGCGAA 
AAGTTTCCGC 
AAACTCGTCA 
AGGAATATCA 
AGTACGACCA 
ATATCCTTTC 
CACCACTTCC 
CGGATCATAG 
CTTGCCGTCC 
GAGGTAGCTT 
TAAACCTTGG 
TTGCTTGCGT 
AAACTAGACA 
TGATGAAGAG 
AGGAGATAAC 
TCTTTTTCGT 
AGGTATCCAC 
TCTGCAACTG 
ATAGTTCCAT 
ACGCCGAAGG 
ACTTGGGTCA 
GCGTGGCCAC 
TG ACT ATT AG 



9180 
9240 
9300 
9360 
9420. 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 * 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
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TCGCAATCTG GCCGACTTTC AAATCTGGAC GAACATGGAG 
ACCAATCTCC AAAAGCACCG TCCTCATACA TGAGCATACC 
CAGGCTCAAA TAGAAAGAGC AGATTATTCT TCGGTTGCTC 
AGCCTAAGGC AATGGTCATA TGAAAATCAT GGACACAGCC 
GAGAAGCAAA AGGTAGACCT GTTTGTTCGA CGATAGGCAG 
AACCAATGGT TCGCTCCGGC TGACTTCCCT GCAGGTAGAC 
TACGAATTTG AACAAAATCC TTGCCCGTAG TCAATTTCTC 
CCTGAGTCTT GAACTCCTCC AAGCCAATCT CTGGAATCTG 
GAATCAAATC TAACATCTAT CTGTCCTCCG ATATAGCAGA 
CCCCCTCTTT TTTACTTTTA CAATTACAAG GTACGAAGCG 
TGTTGAGTTT GGCCATCAAT TTCTTTGATA ATACGAGCTG 
TTTTCTGGGA CATCTTGGGT AACAATACCT CCTGCTGCGA 
TGGACTCCTT CGATAACCAC TGCATTAGCA CCGATAAGAA 
GGTTCAGCAC TAGCTGGCTC AATCACACCT GCCAAAACTG 
TTTTTTCCAA CGATGCCACG GCCACCAAGG ATGCCACCCA 
CCGATTTCAG CACCGATATT GATAACAGAT CCCATCATGA 
TCCACCTGGT CACGGATAAT CGCACCTGGC TCGATACGAG 
AGGAAAGGAA CTGCAGAATT ACGAGCATCT TGCTCGACAA 
AAACCTTCAA GAAGCGGAGC CACATCCTTC CAGTCTCCGA 
ACAACAGAGC TAGGCACAGC AGTTGCGAGT TGCCCCTCAA 
TTCTTTTCAG CATTGGCGAT AAATTGGATA ATTTCTTGAG 
ATAGGTG 

(2) INFORMATION FOR S£Q ID NO: 149: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12566 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D> TOPOLOGY: linear 



ACCATAGAAT 
ACCAGCTTCA 
CTCAAGCCCG 
ATGCATGCGA 
GCCATCAATA 
CAAAATCCCT 
AATCACATCC 
GTGTAAATCT 
AAGAGGCTGG 
CATCCTCTAG 
GAACACCTGC 
CAACTGAACC 
CATTGTCTCC 
CACCTGCACC 
TGTCAATCAT 
TAACAGCATT 
CGTTGATAGC 
CATAATCTTG 
ATAGGACATT 
AGGTTACTTT 
CGTTCATTTT 



TGATCTGGCA 
TTTTCTTCAG 
CGCTCAAGAC 
CCTTGGTGTT 
TCTGTCCGCC 
GTCCGCCAAG 
AGCAAATAAG 
CCTCTAGTCT 
AAAAAGGGTT 
CGCTGTTTTT 
TACTACCACG 
ACTACCGATT 
GACACGGACT 
AACCTGGCTA 
GCTTCCACCA 
GTCACCAATT 
ACGCTTATCT 
ATTTTCTACC 
TCCTAGTTTG 
GACACTGGTT 
TGTAGCAGTC 



10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12127 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 
CCATCCTTCT GTTGATGTGA CAGGAATGAT GATAAATCAA CCAGTAGCTA CTCGCGAAGA 
GGTGACAGAG CCTTTCAGTC ACTTGGCGGT AGAGCACAAT AGTCTCATTG CTCGTCGAAT 



60 
120 
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CGTTGAGCCA AATCAACCTG GAGAAACACG CTTTACCTAT CCCACTTATG GTGAGGGAAA 
GCTTCCAGAA GGTCTCACCA TTTCCTCCAA GGAGAGTGCA GAAACGAGTG ATTTATTAGG 
GTCTTACTTG ATTGTATCAG GAAGTTTGGA TGGAGTGAGC TTACACACCA CCTTGAAAGA 
GCTTCGTTAT CAAGGCTTTG TTTCGAATGG AGAAGATCCA TTTTCCATAC TCTTACTATT 
GACGGCCACC CCTATGGTGC TACTGAGTTT AGCTATTTTT CTGCTGACCT TTATGAGTCT 
_ GACCCTG ATT TATCGGATCA AATCCCTTCG TCAGGCAGGG "ATTCGCTTAA TAGCTOCTCA* 
GAGCTTGTTT GGAGTTGCTC TCAGACCAGT GTTAGAAGAT GTGAGACAGC TTATCTGCTC 
AGTGCTGGTA TCCAGTCTTT TGGGATTGGG GATTCTCTGG TATCAAGGTG CCTTGTTTAT 
GGCAACGGTG CAACTGGTCA TCATTGCTCT TCTACTTTAT GGATTGACCT TGGCAGGGAT 
TTCTACCTTA CTAAGTGTCG TCTATCTACT TGGTTTACAG GAAAATACTC TGGTGGATCT 
ATTGAAAGGG AAACTCCCTC TCAAACGTAT GATGACATTG ATGATCGTGG GGCAACTCTT 
AGCTGTATTG CTCGTCGCAT CGAGTGCGAC AGCTCTCCTA CCCCACTACC GTGAAATGCA 
GGAAATGGAG ACAGCTAGCA ATAAATGGAG CCAGTCCTCA GACCGTTACC CTCTATCCTT 
TGGTTGGTCT AGTGCATTTG CCGATGAAGA AGGAACGCGT AAGGATAATC GTGAGTGGCA 
GACATTTACT GAAGAACGGT TAGCCAATAC AGACTCTTTT TATATTATGA GCAATGTTGA 
CAATTTCTCA GATGGAGCAG AAGTGGACCT AGATGGCAAT CGTCTCAGTG ACTACACACC 
GTCAGGGAAT GTTATCTATG TCTCACCGCG CTATCTGATA GAAGAAAAGA TTACCGTTTC 
TTCAGAGTTT ATGGACAAGA TGCAAAACTT GTCTGAGGGA GAGTTTGGGC TCATCTTGCC 
TGAGAGCTTG CGAGAGCAGT CTGTCTACTA CCAAGGATTG TTTACACATT ACCTGCAAAA 
CTTTTCATCT GAAAGTGTAG AAGTGACGAG TCAGAAACAC TACCTCCCAC AGGTAAGGCT 
AGCTTTTACA GAAACAGGAC AGGAACGTTT CCTCTATAAT GATGGGTACA AGACAACACG 
CCAGTACCTA AAAGATCCGA TTATTGTAGT TCTAACGCCG CAAGCGACTG GAACAAGACC 
TGTTGCAGGG ATGTTGTGGG GAACTACGGC TAATAGTGCC TTGAAACTAG ATCGATATGG 
AGACAGCATC ACAGCTCTAA AAGAGAAAGG TCTGTATCAC AAGGTTTCTT ACTTGGTAAA 1560 
AAGCCAGCTA TTTTTTGCCA AGGTACTAAA TGACAAACGG GTGGAGTTTT ACTCTCTCCT 1620 
TATTGGGACG ATTTTGACCC TGTCTACGGC TATCTTGTTA TTTGATTCCA TGAATCTTCT 1680 
CTATTTTGAG CAGTTCAGAC GGGAACTTAT GATTAAACGT CTTGCTGGTA TGACAATCTA 1740 
TGAGCTTCAT GGCAAGTATT TACTGGCGCA AGGAGGAGTT CTCTTGCTTG GCCTAGTCCT 1800 
ATCTACTATT TTGACAAGAG ATGGTTTGAT TAGCGCTCTA CTTGTACCTT TGTTTACGCT I860 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



WO 98/18931 



PCT/US97/19588 



1000 

TAACGCCCTC TTGATTTTAG TAAGGCAGGA CAAAAAAGAA GAAGCTGGTA GCATGGCAGT 1920 

ATTGAAAGGA AAATAAGATG ATTGATATTC AAGGATTGGA AAAGAAATTT AATGACCGCG 1980 

CGATTTTCTC TGGTTTGAAT CTCAAGCTGG AGAAGGGCAA GGTTTATGCC TTAATCGGAA 2040 

AGAGTGGAAG CGGAAAGACG ACGCTCCTGA ATATCTTGGG AAAGCTAGAA AACATAGATG 2100 

GTGGAAGGGT TCTCTATCAG GGCAAAGATT TAAAAACCAT TCCCACTCGT GAGTATTTTC 2160 

GAGACCAGAT GGCCTATCTC TTTCAAAATT TCGGCCTCTT AGAAAACCAA TCAATCAAAG 2220 

AAAATTTGGA TTTGGCTTTT GTTGGTCAGA AAATCTCAAA AGTAGAACGT TTGGAAAGGC 2280 

AAGTGGGGGC TTTAGAAAAA GTTAATCTAG GGTATTTGGA TTTAGAACAA AAAATCTATA 2340 

CTTTATCTGG GGGAGACGCC CAACGAGTTG CCCTTGCTAA GACTATTTTG AAAAATCCAC 2400 

CCTTGATTTT GGCAGATGAA CCAACAGCAG CTCTTGATCC TGAAAATTCA GAGGAGGTTA 24 60 

TGAATCTCTT GGTGGATTTG AAAGATGAAA ATCGAATTAT CATCATTGCG ACCCATAATC 2520 

CCCTACTCTC GAATAAGGCT GATGAAATCA TTGATATGAG GAAACTTGCT CATGTGTGAA 2580 

AAAATCCGTA TTCGCAGGGT ATCTGATTAT CCTAGTGCCA GAGGTGGTTT AGAAGATATC 2 640 

CTCATCATGG AAAATATGAC CAATCATCTC CTTTTGGTTC AAATCCGAGT GCATGGCTAT 2700 

TTGCTTGATT TTGCTAGTAT TGAAGGGCAA AGGCAAAAGC ATTATCGTTT GAAAAATTTA 2760 

CCTCAGACGG TTGAACTGAC AGTGGATGAT GTGGAGGAGG ATGTGGATTT GACCCTACCT 2820 

GAAAATCGAA GTTATCAAGA AGCTGATTTT TTTGAACGCA TGTTTCGAGA CAACTGCTAA 2880 

GGCCACTTTT AAAGATTTCC AAGACTATCT TTCTTCATGA GGAAAGATAG TTTTTTGGTA 2940 

TGATTTTCAT TCCCAAAATA CAAGGGGAAT GTGTTACAAT AGTAGTAACA GATAATAGAA 3000 

AAGAGAATAG ATGAGAATTG CAGATTATAG CGTGACCAAG GCAGTGCTGG AGCGTCACCG 3060 

TTTTACCTTT AAAAAGTCCT TTGGGCAAAA TTTTTTGACG GATACCAATA TCCTTCAAAA 3120 

AATTGTGGAT ACGGCTGAAA TTGATGATCA GGTCAATGTC ATCGAAATCG GGCCAGGTAT 3180 

TGGTCCCTTG ACAGAATTTT TGCCTCAGCG TGCAGCCCAA GTCATGGCTT TTCAGATTGA 3240 

CCACCGTTTG GTGCCAATTT TGGCAGATAC CCTGCGTGAT TTTGATAATG TGACCGTAGT 3300 

TAACGAAGAT ATTCTCAAGG TTGATTTGGC GCAACATATC CAGAATTTTA AAAATCCTGA 3360 

CCTGCCAATC AAGGTAGTGG CTAATTTGCC TTACTACATC ACGACGCCTA TTCTCATGCA 3420 

CTTGATTGAG AGTGGCATTC CTTTTTGTGA G TTTGTGGTC ATGATGCAGA AAGAAGTAGC 3480 

GGACCGCATT TCAGCCCAGC CTAACACCAA GGCTTACGGT AGCTTGTCTA TCGCCGTGCA 3540 

GTATTACATG ACAGCCAAGG TTGCCTTTAT CGTGCCTCGT ACGGTCTTTG TGCCAGCGCC 3600 

AAATGTGGAT TCAGCCATCT TGAAAATGGT GCCTCGTCCA GAGCCAGCCG TAGCAGTAGA 3660 
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4380 
4440 
4500 



1001 

AGATGAGAAC TTTTTCTTTA AGGTTTCCAA GGCTAGTTTT ACCCATCGCC GCAAGACCTT 
GTGGAATAAC TTGACAGGTT ACTTTGGTAA GACTGAAGAG GTCAAGGACA AGCTGACCAA 
GCCTTTGGAC CAGGCAGGCT TGTCACCAAG TGTGCGTGGG GAACCTCTCA GCTTGGCAGA 
ATTTGCCGGT CTAGCAGACG CACTTAAAGG GCAAGGACTC TAAGATGCAG CGACAAATCA 
TTAAAGCCTT GGCAGGTTTC TACTATGTGG AGAGTGATGG CCAGGTTTAT CAAACACGCG 
~ CCCGTGGGAjT TTTCCGTAAA AAAGGCCATA CCC^WATGT TGGGC ACTGG GTACATTTCT 
CTGCCGAGGA AAATTCAGAA GGCTATATCC TCAAAATTCA CGAACGGAAA AACAGTCTGG 
TTCGTCCGCC TATTGTCAAT ATCGATCAAG CTGTAGTAAT CATGTCCGTC AAGGAACCTG 
ATTTTAACAG CAATTTGCTG GATCGTTTCT TGGTTCTTTT GGAGCACAAG GGCATCCATC 
CCATTGTCTA TATTTCCAAA ATGGATTTGT TGGAAGATAG GGGAGAACTG GATTTTTACC 4260 
AGCAGACCTA TGGTGACATC CGCTATGACT TTGTGACCAG TAAAGAGGAA CTCCTGTCTT 4 320 

TGTTAACAGG CAAGGTTACG GTCTTTATGG GGCAGACAGG TGTTGGGAAG TCAACTCTTC 
TCAATAAAAT CGCACCAGAC CTCAATCTTG AAACGGGAGA AATTTCAGAC AGTCTAGGTC 
GCGGTCGCCA TACCACTCGA GCTGTTAGTT TTTACAATCT CAACGGGGGT AAAATCGCAG 
ATACACCAGG ATTTTCATCC TTGGACTATG AAGTATCAAG GCCTGAAGAC CTCAATCAGG 4S60 
CTTTCCCAGA GATTGCTACT GTTAGCCGAG ATTGTAAGTT CCGTACTTGT ACCCATACCC 4620 
ATGAGCCGTC TTGTGCCGTC AAACCAGCTG TTGAAGACGG TCTTATTGCA ACCTTCCGTT 4680 
TTCACAATTA CCTGCAATTC CTTAGTGAAA TTGAAAATCG TAGAGAAACC TATAAAAAAG 4740 
TCAGCAAAAA AATTCCAAAA TAAGGAGAAA CCTATCTCTC AATACAAGAT TGCTCCGTCA 4800 
ATTCTGGCAG CAGATTATGC CAACTTTGAA CGTGAAATCA AACGTCTAGA AGCAACTGGG 4860 
GCAGAATATG CCCATATCGA TATCATGGAC AGTCATTTTG TACCGCAAAT CAGTTTTGGT 4920 
GCAGGTGTGG TCGAGAGCCT TCGTCCTCAT AGTAAGATGG TTTTCGATTG CCACTTGATG 4980 
GTGTCAAACC CTGAGCATCA TCTGGAAGAT TTTGCGCGTG CAGGTGCAGA CATCATCAGT 5040 
ATCCATGTAG AAGCAACGCC TCATATTCAT GGCGCCCTCC AAAAAATTCG TTCACTCGGA 5100 
GTTAACCCTT CAGTCGTTAT CAATCCTGGC ACATCAGTTG AAGCCATCAA GCACGTCCTT 5160 
CATCTAGTTG ACCAAGTTTT AGTCATGACG GTTAATCCAG GTTTTGGTGG GCAAGCCTTT 5220 
CTCCCAGAAA CCATGGATAA GGTCCGTGAG TTGGTTGCTC TTCGTGAGGA AAAAGGTTTG 5280 
AACTTTGAAA TCGAAGTGGA TGGTGGGATT GATGACCAAA CTATTGCTCA AGCCAAAGAA 5340 
GCCGGTGCGA CTGTTTTTGT AGCAGGTTCC TATGTCTTTA AGGGAGAAGT CAATGAGCGA 5400 
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GTACAAACTC 


TCAGAAAACA 


ACTGGACTAG 


GGTTGCAGTT 


TTTGCAGGCG 


GAAACCGCGG 


5460 


TCATTATCGG 


ACACATTTTG 


ATGCTTTTGT 


TGGGGTGGAT 


CGAGGCTCGC 


TCTGGGTCTT 


5520 


GGAACAAGAC 


TTACCTCTTG 


CTCTAGCAGT 


CGCAGATTTT 


GATTCTGTGA 


CGGAAGAAGA 


5580 


GCGACAGGTG 


ATTCAAAAAG 


GTGCCCAGTA 


TTTTGTCCAA GCACGACCAG AAAACGATGA 


5640 


TACAGATCTG 


GAATTGGCTC TCTTAACCAT CTTTGAACAA AATCCTCAGG CTCAGGTCAC 


5700 


TATTTTCGGT 


GCCTTGGGTG 


GCCGTATTGA 


CCATATGTTG 


GCCAATGTCT 


TTCTCCCTAG 


5760 


CAATCCTAAG 


TTGGCACCCT 


ATATGCATCA AATAGAAATT GAGGATGGGC AAAACTTCAT 


5820 


TACTTATTGT 


CCAGAAGGAA 


TCAGTCAGCT 


AGAACCTCGT 


TCAGACTACG 


ACTATCTAGC 


5880 


CTTTATGCCA 


GTTCGGGATA 


GCCAGCTGAC 


TATTCTTGGA 


GCCAAGTATG 


AGTTGACAGA 


5940 


GGAAAATTTT 


TTCTTTAAAA 


AAGTGTACGC 


TTCTAACGAA 


TATATAGATA 


GGGAAGTGTC 


6000 


GGTAACTTGC 


CCAGATGGTT 


ATGTGGTCGT 


ACTGCATAGC 


AAGGACAGGA 


GGTAGGATGG 


6060 


AAAGTTTACT 


TATTCTATTA 


TTAATTGCCA 


ATCTAGCTGG 


TCTCTTTCTG 


ATTTGGCAAA 


6120 


GGCAGGATAG 


GCAGGAGAAA 


CACTTAAGTA 


AGAGCTTGGA 


GGATCAGGCA 


GATCATTTGT 


6180 


CAGACCAGTT 


GGATTACCGC 


TTTGACCAAG 


CCAGACAAGC 


CAGCCAGTTA 


GACCAAAAAG 


6240 


ATTTGGAAGT 


GGTTGTCAGC 


GACCGTTTGC 


AAGAAGTGCG 


GATTGAATTG 


CACCAAGGTC 


6300 


TGACCCAAGT 


CCGTCAAGAA 


ATGACAGATA 


ATCTCCTCCA 


AACTAGAGAC 


AAGACAGACC 


6360 


AACGTCTCCA 


AGCCTTGCAG 


GAATCAAATG 


AGCAACGTTT 


GGAACAAATG 


CGCCAGACGG 


6420 


TCGAGGAAAA 


ACTAGAAAAG 


ACCTTGCAGA 


CACGCTTACA 


GGCTTCCTTT 


GAGACAGTTT 


6480 


CTAAACAACT 


GGAGTCTGTC 


AATCGTGGCC 


TTGGAGAAAT 


GCAGACAGTT 


GCCCGTGATG 


6540 


TCGGAGCTCT 


TAACAAGGTT 


CTCTCTGGAA 


CCAAGACGCG 


AGGGATTCTG 


GCAGAATTGC 


6600 


AACTGGGGCA 


AATTATTGAA 


GACATCATGA 


CACCTGCCCA 


GTACGAACGA 


GAATACGCAA 


6660 


CGGTTGAAAA 


CTCTAGTGAA 


CGAGTGGAGT 


ATGCCATCAA 


GTTACCCGCA 


CAAGGCGACC 


6720 


AAGAATACGT 


CTATCTGCCA 


ATTGACTCTA 


AGTTTCCACT 


GGCAGATTAT 


TACCGCTTCG 


6780 


AAGAAGCCTA 


TGAGACAGGT 


GACAAGGATG 


AGATTGAACG 


CTGTCGTAAG 


TCACTCCTAG 


6840 


CAAGCGTCAA 


GCGCTTTGCT 


AGGGATATTA 


GGAACAAGTA 


CATAGCACCA 


CCTCGGACGA 


6900 


CCAATTTTGG 


AGTTTTGTTT 


GTTCCGACAG 


AACGTCTCTA 


CTCAGAAATC 


GTCCGCAATC 


6960 


CGGTCTTCTT 


TGATGATTTG 


AGACGGGAAG 


AACAGATTAT 


TGTTGCAGGA 


CCAAGTACCC 


7020 


TATCAGCCCT 


TCTTAACTCC 


CTATCAGTTG 


GTTTCAAGAC 


CCTTAATATC 


CAAAAGAGTC 


7080 


CCGACCATAT 


CAGCAAGACT 


CTTGCCAGTG 


TCAAGACCGA 


GTTTGGCAAG 


TTTGGTGGTA 


7140 


TTCTGGTCAA 


GGCACAAAAA 


CATCTCCAAC 


ATGCCTCTGG 


CAATATTGAT 


GAATTATTAA 


7200 
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ACCGTCGTAC CATAGCTATC GAGCGGACGC TCCGTCACAT TGAGTTGTCA GAAGGTGAGC 
CTGCGCTTCA TCTACTCCAT TTTCAAGAAA ATGAGGAAGA ATATGAAGAT TACTCACATG 
AAAAAAGATG AGTTATTTGA AGGCTTTTAC CTAATCAAAT CACCTGACCT GAGCCAAACT 
CGAGCTGGGA AAAACTACCT AGCCTTTACC TTCCAAGATG ATAGTGGCGA GATTGATGGG 
AAGCTCTGGG ATCCCCAACC TCATAACATT GACGCCTTTA CCGCAGGTAA GGTTGTCCAC • 
- ATGAAAGGAC GCCGAGAAGT TTATAACAAT ACCCCTCAAG TCAATCAAAT TACTCTCCCC 
CTGCCTCAAG CTGGTGAACC CAATGACCCA GCTGATTTCA AGGTCAAGTC ACCACTTGAT 
GTCAAGGAAA TTCGTGACTA CATGTCCCAA ATGATTTTCA AAATTGAAAA TCCTGTCTGG 
CAACGGATTG TCCGAAATCT CTACACCAAG TATGATAAGG AATTCTACTC CTATCCAGCT 
GCCAAGACCA ACCACCATGC CTTTGAAACG GGCTTCCCCT ATCATACGCC GACCATGGTG 
CGTTTGCCAG ACGCTATTAG CGAAGTTTAT CCTCAGCTCA ATAACACCCT GCTCTATGCG 
GGGATTATGT TGCATGACTT AGCTAAGGTC ATCGAGTTGA CGGGGCCAGA CCAGACAGAG 
TACACAGTGC GAGGTAATCT TCTTCGACAT ATCGCTCTCA TTGATAGCGA AATTACCAAG 
ACAGTTATGG AACTCGGCAT CGATGATACC AAGGAAGAAG TCGTTTTGCT TCGTCATGTC 
ATCCTCAGTC ACCACGGCTT GCTTGAGTAT GGAAGCCCAG TCCGTCCACG CATTATGGAA 
GCAGAGATTA TCCATATGAT TGACAATCTG GATGCAAGCA TGATGATCAT GTCAACAGCT 
CTTGCTTTGG TGGATAAAGG AGAGATGACC AATAAAATCT TCGCTATGGA TAATCGTTCC 
TTCTATAAAC CAGATTTAGA TTAATAATTT AAGAAAAATG AGCATTTTTT AGGATAAGAA 
TGTTCCTTTT TTTATGTGAA TATGGTATAA TAACTAAAAG ACAAAAATGA ATACTCTTCG 
AAAATCTCTT CAAACTAGGG TAGTATCCCC TTGTCGTATG TATATATGCA GGTATATTAC 
AGGGTTTGTC AGTTCTATTG ACAATCTCAA AACAGTCTTT TCAACCACCA CCGACCAGCT 
TTCTAGTTTG CTTTTTGATT TTTTGAATAA AAATGGAATA GGAAATAGAA ATGAAATTAA 
GAACAAGTGA TCGGATGGTT GTCATTTCCA ACTATTTGAT TAATAATCCT TATAAACTAA 
CTAGTCTCAA TACTTTTGCT GAAAAGTATG AGTCTGCTAA ATCATCCATC TCAGAAGATA 
TCGTCATTAT CAAACGCGCC TTTGAGGAAA TTGAAATCGG TCATATCCAG ACAGTGACTG 
GGCCTGCCGC ACGTGTCATC TTCACACCGT CTATTTCGAG TCAGGATGCT AACGAAATGG 
TTGAAGACTT GCGTACCAAG TTGTCAGAAA GTGACCGTAT CTTGCCAGGT GGTTATATCT 
ATCTGTCTGA TTTCCTTAGC ACACCAGCCA TCTTGAAAAA TATTGCTCGT ATTATTGCCA 
AAAGCTTTAT GGACCAAAAA ATTCACGCGG TTATGACCGT AGCAACTAAG GGTGTGCCAC 



7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

3040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 
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TTCCAAATGC AGTTGCCAAT GTCCTCAATG TCTCTTTTGT CATTGTGCGC CGTGACCTGA 9000 

AAATTACCGA AGGTTCAACT GTTACCGTCA ACTATGTTTC AGGTTCAAGT GGTGACCGTA 9060 

TCGAGAAAAT GTTCCTTTCA AAACGTACTC TTAAGGCAGG CAGCCGTGTC TTGATTGTGG 9120 

ATGACTTCTT GAAAGGTGGC GGAACGGTCA ATGGTATGAT TAGTCTCTTG CGCGAGTTCG 9180 

ACTCAGAACT GGCAGGTGTA GCGGTCTTTG CGGACAATCC CCAAGAAGAA CGTGAAAAGC 9240 

AGTTTGACTA CAAGTCACTC TTGAAGGTAA CCAATATTGA TGTCAAGAAC CAAGCCATCC 9300 

ATGTTGACCT TGGCAATATC TTTGACGAAG ATAAATAAGA GATAGAACTA AAGGTTGGAA 9360 

CGATTGTCCC AGCCTTT C TT TGCAAACAGA ATAGAAGGAA GCTTATGAAA ACACCATTTA 9420 

TCAATAGAGA AGAGTTAGAA GCGATTGTTG CCGAGTTCCC GACTCCCTTT CACTTGTATG 9480 

ATGAGAAGGG GATTCGTGAG AAGGCAAGAG CCGTCAACCA AGCTTTTTCG TGGAACAAGG 9540 

GCTTTAAGGA ATATTTTGCA GTTAAGG CT A CTCCAACTCC AGCTATTTTG AAAATTCTCC 9600 

AAGAAGAAGG TTGTGGTGTG GACTGCTCTA GTTATGTAGA GCTTTTGATG AGCCATAAAC 9660 

TGGACTTTCT GGCTTCTGAG ATTATGTTCT CTTCCAACAA CACGCCAGAC AAGGAATACG 9720 

CCTATGCACG TGAATTGGGT GCGACCATTA ACTTGGATGC CTTTCAACAT ATTGAACATC 9780 

TGGAGAGAGT AGCAGGCATT CCAGAAATCA TCTCTTGTCG TTATAATCCT GGAGGCGTTT 9840 

TTGAACTGGG GACAGACATT ATGGACAATC CTGGGGAGGC TAAGTTTGGC ATGACCAAGG 9900 

ACCAGCTCTT TGAAGCCTTT GCTATCTTGA AGGAAAAAGG AGCCAAGACT TTTGGGATTC 9960 

ACTCCTTCCT AGCGTCCAAT ACCGTGACCC ATCTCTATTA TCCAGAGTTG GCTCGTCAGC 10020 

TCTTTGAACT GGCTGTTGAA ATCAAGGAAA AGTTGGGCAT TTCGCTAGAC TTTATCAATC 10080 

TTTCTGGCGG TATTGGTGTT AATTATCATC CAGACCAGGA GCCGAACGAT ATCGCCTTGA 10140 

TTGGTGAGGG AGTTCGTAAG GTGTATGAAG AGGTTCTTAC GTCAGCAGGT CTTGGTCAGG 10200 

TCAAGATTTT CACCGAATTG GGTCGTTTTA TGCTGGCACC TCACGGTGCT CTAGTCACAA 10260 

GAGTCACTCA TAAGAAGGAA ACCTACCGTA CCTATCTAGG TGTGGATGCC TCAGCAGTCA 10320 

ACCTCATGCG TCCAGCTATG TACGGAGCTT ACCATCATAT TAGCAACGTG ACCCATCCAG 10380 

ATGGACCAGC TGAAGTGGTA GATGTGCTCG GTTCACTCTG TGAAAACAAT GATAAATTTG 10440 

CAGTTAATCG CGAACTGCCT CATACAGAAA TCCGTGATTT GCTGGTCATT CATGATACAG 10500 

GTGCCCACGG ATTTTCAATG GGCTACCAGT ATAATGCCAA ATTACGTTCT GCGGAAATCC 10560 

TCTATACCGA ACAAGGTAAA GCCCGTCAAA TCCGCCGTCC AGAGCGCCCT GAGGACTATT 10620 

TTGCAACCTT ATATGGCTTC GATTTTGAAG AATAATCTGA TAATAGATTG AAAATGAAAT 10680 

TGAAAAACAG ATTGCTTTCT AAAAAATAGG CAAAAATCTT GTTTTTCCTT CAAGTCGTGA 10740 
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TATAATAAAA 
TTATGGACAA 
TGGTATTCGC 
CCCAGACACT 
GACAGAGGCC 
AGGAGAAGCT 
AGCAATCCAC 
TGAGAAGTTT 
CAACGAAGAA 
CCAGTATCTA 
GACAGACAGA 
TGATAAGTTT 
AATTCCATAT 
ACAACTCATG 
AAGCCTGATG 
TTTTTCTTCT 
TCTGATAGCA 
ACTTGCTTTT 
AACGATATAG 
AAGTGTGACA 
AATCATTTTC 
CGATTGAAGA 
AGGAATGCCT 
AGGAGAGAAG 
GATACCGTGC 
AGTAGTCGCT 
CTTTTCAGAG 
GCGTACGATA 
GAAGTACTTG 



CTATAAAACG 
GTGACAGGAA 
TTTATTGTCT 
TGGGCTATGG 
TTGCGCTACC 
CTCTTGCAGA 
TGTACCTTGG 
GACAAACTCA 
CAGCACAAGA 
TCAGATATTG 
GATGATGACT 
GAAATTCTAC 
TCCCTCGAAG 
GATACCGAAA 
GAAACATCGG 
TATCTCGAAC 
GTTGGTTGTT 
GCTGGGTTCT 
TTGACGATAA 
CTAGCTGGTG 
TTGATTTGTT 
TAGTAAAGGA 
AGGTAGCTTG 
AAAGGCATTT 
TCTTTTTGAG 
TCTTTGAGAC 
TGAAGCGTTG 
ATGGTTACGA 
ATGCCTTCAG 



TTTTCAAGGA 
TGGTGCATTC 
TTTTGCAGGG 
AGTCCAATAA 
GTGCTTTCTC 
TTGATTTCCT 
ACACCTGTGC 
TGGCTGTCAC 
TTGTCACTAG 
GAAAACCTGT 
TGATTGAACT 
CTTATCACAC 
GAGTCAAACC 
GTTATCAAGA 
GCTTTTGACT 
GTTGTTTTCC 
CAGGGGTAAC 
TGGCTAGTTC 
ACTGTTGGAG 
ACAAGAGGGA 
CTCTTTGCAT 
CACCAGCACA 
CTTGAGCAAC 
GAAGGAGGAT 
CAGCAAAGAG 
GCGTTTGGTG 
CCTTCCATGA 
TAATGATAGC 
CCATAGGCGC 



1005 
AGGTAACGAT 
GACAGAAACC 
CTGTCACATC 
GTCACGTGAA 
GGGAAATAAG 
GATTGCTCTC 
TCTTCCTTTC 
TGACTTGGTT 
CCAAACCAAT 
CTGGATTCCC 
TGGTAAGTTC 
CATGGGTGAG 
ACCAACAGCA 
TTATATGAAA 
TGCAAAAAGA 
AGCCTTGCGA 
GTCTTTTCGT 
TTCACGGACT 
AATCATCATG 
GAAGACCACC 
TTCATCTTCT 
GGCAACCAAA 
CCCTTCAGTA 
AGGGAAACAT 
AGCTTGTTGG 
TGGCTCAAGG 
TTGGTAGATA 
GACACCAAAG 
TCCGATCGTA 



ATGTCTGAAG 
TTTGGGTCAG 
CGTTGCCAGT 
CGGACGGTAG 
GGTGGGATTA 
TTCACCAAGG 
CGTAATAAAC 
CTTTTGGATA 
AAAAATATCT 
CACGTGCTAG 
GTCAAGACCC 
TTCAAGTGGC 
GATCGCGTCA 
CGTGTACATG 
CTTAGCAAAT 
TTTTTGTGTT 
CCACTTCGTT 
TTTTTGCGAA 
AAACCACCGA 
ATCATGAGTG 
ACTCCGTGAA 
ATCATACTTG 
TGTTGGGCAG 
CCTACACCGC 
GCTTCGAGTT 
ACGTGCTTGA 
CCAAGTGGTA 
CCTAGACCTT 
TTCCAAATAA 



AAACAATTGA 
TAGATGGGCC 
ATTGCCACAA 
ATCATGTCTT 
CAGTCAGTGG 
CTAAGGAACA 
CACGTTACCT 
TCAAGGAAAT 
TGGCTTCTGC 
TTCCAGGATT 
TCAAAAATGT 
GTGAACTTCG 
AGAACGCTAA 
GATAGAAAAC 
CAGCTAAGCC 
TTTTCTTGCT 
TAGAGAAAGC 
GTTTTGGACG 
CAACCCAGTA 
GGCTCATGTA 
GTGAAAGGAG 
GAGAACCTAG 
CAAAGTAGAT 
CAAACATGCT 
TTTCTTCTTG 
GGGCGTTCAT 
AGATAATCAA 
TATCAGTAGC 
ATCCTGTTGG 



10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 
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CTGACCTGTC GTTTTATCGA CATTGACACA CCCAGTCAAG ACAAGCAACA TAGCCACTCC 

CATAGCCGAG AGTGCAAAAT CGGCGT 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5238 base pairs 
< B ) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

TGACACTCTG TAGGATTGTC GTTAATTGAT TGCTCCTACT CTCTACAATA ACCACCAAAC 60 

TAAAAACGAC ATAGAAAGAT AGCATCAGCT GTAGCCATAG CGCCTTTGAC ACCTTCTGGA 120 

TGATTATGAG TTACCTCTGC AGAAAGACTC GTAAGTCCTC TAGATGATGG CCATATACCA 180 

GTTTTCGCAT AAAAACCACA GTCCATGATC CAAGCACATG GAGAAATACG CATACCTCAT 240 

CCATTCCCAA AGCTATTATA AGGCTCACGG TTATCCCTGT TTAGCCATGC ATTAAACCGA 300 

GCACCGTAAT CAGCATTCGG ATACATTCTG CCATATTTCT TCATCCCGTC AATGAAGTCA 3 60 

TCTTTTTGTC CACCATTCAT AATTGCTTCT GCAACAGCAC AGCTCATAAC CGTGTCATCT 420 

GTAAAAAAGC AGTCCTTCCG AAATAAAGGA AAGTCCTTTG TTTTGATATT GTTCCATTCG 480 

TAAACAGAAC CGACAATATC TCCAATAATT GCTCCAAGCA TCAGATTCCT CCTTGTTCAT 540 
TTTGATGCTT TTTATATTGG TTATCTACCA TATTTATTTT AGAAAATAAC ATCCTGTTGG 
ATTTTAAAAA TTTCATTTTT TTCAAAATAC GGTTTTACCA TTTCTTTCCA CCTAGCTCTA 

TGAAAATTGA TTGATTTTAA AGGAGATAGG CCATAATTTC CCAATGCATA ACCATCATTT 720 

ACTTCAACAA CAAGTGTTCT GCCATCGCGA GTAACACCGA TATCTACTCC ATAAGCTATT 780 
GGCGCATCTT TCCAACATGA TATCGCTTCA TCAATTACAC TTGCATCAAA TTGTGCATGA 
TAATCACCTG TATAGGGTCG AACATCTAAT ACGCCACCAT CTAACACAAA ACAACGCCAT 

TCAGCTATGA ATTCTACAAC CTCACTAATC CATATAGGAT AGTCGAAAGG TAGACCAATA 960 

CCTATTAAAT CATGGGTTCC ATTAACAACT CTTCCACTAA AGACTTTTGA ACCAGCTTTA 1020 

GGCTTAATAA ATTTTCCCCA ATTATCAGGT ATATTCACAA TCTCTCCTAA AATACCAGCA 1080 

TAAATCTTTC GACCATAAAA CTCTTTAAGC TCAATAGGAT AGTCATGAAC CGGAACGTTT 1140 

AAGCCCATCA TTTTTAGTAA TGCTCTAGTC TCCATTATAT AATCTACAAC TATATCTTCA 1200 

CTTGTTAACT CTTTTATTTC AGAAAAAGAT TGATATAAAA TAACTTCTTC TCCTTGTAAG 1260 

TAGGCACCTA CTTGAGCATT GTATTTATTA ATTGAAACCT CACTTGCTAA TTTACTTTGT 1320 



600 
660 



840 

900 
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CTAATATAAA CAACCATTTC ATCACTCCTA TATCACTAGT GTTACACCAA TTTGTAAAAA 1380 
ATAATAGCAA TTTTGCTCTT ATTTTTTTGA GTAAATAGCC CCCATAATAT CATCGAAATA 1440 
ATCAACGGTA TTTAGGAGTA ATTCAATAAC CTGGGACTTT GTTAGTCGCA TTCCCCTTCT 1500 
ATCTCTAGCA TCTTCTACTA AATTTTCAAG TTTCTCTAGA TTTTTATCAT CCAAGCTAAT 1560 
CATTATTCTA TTTTTATCGG TTGCCATTTT CATCACCTCA AGTTAATTCT ATCACAGGTG 1620 
- TAACACTAGT GTCAACTCC^ 

CACAGTAACT TTAAATCTTT GGTATTAAAA AATTTTCACA ATATTTATAG AAATAAAATC 1740 
TGTCTCAAAT CAGTTATCAA ATCTAGTATA AATTATGAGC GGCTACTCTA ATACTTTCCC 1800 
TCTAAACAAG AAAAAGACTT ACACTCAAGG GTTTTCTTCC CCCCCTTCGT TATAACGTTT I860 

TGACTCTTTT ACTAGCAAAG GTATATACTC ACAAGGAACT TTGGTTGACT ATTGAATCTC 1920 

TCCAACTTCT TCTTTAACAT ATCCTTCTAC ATCTTCAATC TCTACAAACA TTGGGTCTAA 1980 

GTGACACAAG AAATGCCAAA CTTCGATCCC TTTTTTTCTG TAAAGAATCG CTTCACCGTC 204 0 

TTCACTTCCG AAAAAGCTTC TGTCGATTTC ATATCCGCGG CTTTCTAAGA AGTCTTTTGC 2100 

TTTACGATAG TTCGTTTCTC TTGTTTCGAC ATAGGCTTTA ACTTCATGGT TGTTAACGAC 2160 

ATATGCATCA ATTTTTGAAT ATCCTTCGAT CACTCTATCA TTTTTGAGGG ATAAATTTGA 2220 

AATCTCTTTC CAAATAATGT TTACATTTTC CTCAGGATCG AACATAAATT TAGATAAAGG 2280 

AACAATATTT CCGTTAAAAA TAATTTCCAT ATAATCCGGT ATGTTTTTAG GATTAAAATA 2340 

CTCCACTTCA AAACCATCTT CTGTTTCCAG AGTGTATCCC GGGATTTGAG CTACAAAGGC 2400 

TTTCCCATCT TCTATGGAAT CAAATGCTAC TAAATCTTTA GAATAATCAT TTTGCTACAA 2460 

TTCCAATATA ACCATCGATA ATCTCTCCAT TTTCATTATC AGGCTAATGT AAATAAGCAC 2520 

GTCACCTCAC CAATTCAGGC TCTCTGTATC ATCTCATCAT ATTTCCTACT TACTTTACGA 2580 

GTCTTATACC CAGAACACAC CTTATCGACC TTCGGTCTCA CCTCGTCGCA TTGGCTGAAC 2 640 

ATCTACTTTT ACTTTGCTGA TGCTTCAACT CGTACAAGCA GTGATACCGC CTCAGCGTGA 2700 

TGCGTCAGTG GGACTCAAAA GGTTCGGGGA ACCTTTTGAG GATTAACTAC GTTTCTCTAA 2760 

TAAACTTACA CATTCAACTT GTTCATCATT GTCCAAACCT ATGTTGAGAT TTTCTTCTAT 2820 

AATTGGTAGC TTAAAAGTAA TGGATTTTAG CCATTGTCCG TTAGATTGTT TTTCTTCATA 2880 

AACTTGAATT TCAGAAATCA AAGCTGAAAT TAACTGCCTA CGCTCTACAT CATTCATGAC 2940 

TTTATAGAGC TTATCAAAAT AGATCAGAAC CTTATATATG TTATCTCCTG TAAGCTTTTC 3000 

AGCTTCAATA GTCTGTTTCT TTGCTTTCGC ATCAATTAGT GATGATTCTA ATTCATCTAG 3060 - 



WO 98/18931 



PCT/US97/19588 



1008 

TTTGTCATAC ATACGATATA GTCTATCATC TAAATCCTGT TTCCTTCTCT TATAATGCTT 3120 
ATCTTCAACA TCTAAATTAT CTATTTCCTC AATTAGCTTA AACTTTGTAG AATGACTCTT 3180 
TCTCAATTCC TTTTGGTAAT TATCTATTTC TTTTTCTATT TCACAGGTAT CCACCTTCAT 3240 
GTTGATTTTT TCTTGCATCA TAGAAGCAAA TTTCGGATTA CTTACTATCT TCACAATCAC 3300 
CTCTGCAACA GCATCATCTA ACAATTCTTC TCTAATTTGC TTSCTGAATG TACACTTATT 3360 
ACCTCTTATC ATCTGCCTAT GGTTACAACC ATACTAATAA AAATCTTTAT ACTTTGTGCC 3420 
ATCTTTCTTT TTCTTGATAC ACTTGTTCCC AAACATTCCC ACTCCACATA TCGGGCATTT 3480 
TACAATTCCA GAAAGCAAGT GTGTGCGTGT ATCTTTTCCT TTATTCACAT GCTCATATTT 3540 
CTTTGCTTGA GATTTTAGCT TAACCTGAGC AGCTTGCCAA ACTTCATCGC AAACTATAGC 3600 
TTCATGTATC CCTTCAGATA TTAGATATTC ATCTTGTTCA ACCTGCTTAT ATTCATTTCT 3660 

TGTACCATGA ACTTTTTCTA AAGTTCTTCT TCCAAATGCT ATTTTCCCAT TATATACAGG 3720 

ATTCTTTAAT ATCTTTCTTA TAAGACCTGC ATCAAACAAA GGATTCTTAC CATTCTGTCT 3780 

TGGGATTTTT CTAATTCCAT GATTCTCTAA GTATTTAGAT ATCCCATTGG CTCCTATCGT 3840 

AGTATTTACA TACTGGTCGA AAATCGTTCT TATTGCAACT GCCTCTTCCT CATTTATAAA 3900 

CAGCTTGCCG TCTTCAAGTT TATATCCATA CGGAGCAAAG CCACCATTCC ATTTTCCTTC 3960 

CCCTCCTTTT TGAATCCGAC CTTCCATTGT TTGAATACTG ATGTTTTCTC TTTCTATTTC 4020 

AGCCACAGCT GATAAAACAG AAATCATTAG TTTCCCAGCA TCTTTAGATG AATCAATGCC 4080 

ATCTTCAACG CAGATAAGAT TAACTCCATA ATCCTCCATT ATATGAAGTG TAGAAAGAAC 4140 

ATCAGCCGCA TTTCTTGCAA ATCTTGATAA CTTAAACACA AGAACAAAAG ATACTCCATC 4200 

TTTTCCAGAT TTTATATCTT CCATCATTCG ATTGAACTGT ATTCTACCTT CAATAGACTT 4260 

GTCAGACTTC CCGGCATCTT CATACTCTCC AACAATTTCA TAATCGTTGT AAATAGCAAA 4320 

AGCTTTCATT CGTGATTTTT GTGCCTCTAA CGAATACCCC TCTATCTGTA TTGACGTAGA 4380 

TACTCGTGTA TAGAGGTATA CTTTTATTTT TTCTTTTGAC ATAGTATTAA CCTCAATATA 4440 

ATTTTTCTAT ATCATATATA ATTTTTTTAA TTTAAGTTTG GACTATCATT TCAAGTATAT 4500 

TATAACACTT TTATTAGTCC GTCTCAATTT GTGTTTTTGC CATGTCAAAA CTATTTTTCA 4560 

TCTCTTGATT TTTTGCTGGC GTTGGATCGG GTAGATTATC TAAATCTAAA CCACCAGCAT 4620 

ATTTTGCAAT CAGATTTGCT ATTAAATCAG CCAATCCATT CCAGTCATTG TCCAATATAT 4 680 

ACCTCCTCTA AAGTTTTATA TCTAATAATT ATTTGTTTAA TTAAGTTTTT TGACATTGAC 4740 

AAGTGCTTTG GATTAGCAAC ATAGGAATCT CACTTCCGCC TCTATTCCGG ATGAGCCGGC 4800 

TTCAACCTTA GAAGTATCAT TACCCTCATT TTCTTCATAG CGGATAGGGT ATCCCTCCCT 4860 
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ATATTCAAAC TCTTACTTAT CGCTCACTTT CTTTTTGCTT AGCACAACTT TTTTTGCCGA 4 9 20 

ATTATTCAGC CGAAAGATCT TGACGGATAG GTTATTACGC TCCAAAAATA ATTAACGTCT 4 980 

TCTCTTCGTC TATTCAATTG TTAAGGTTCA AAATTTATCG AGACTTATTA. ATCTTTTTAA 5040 

AATTTGACCA TCAGAAAATA TTTATCTTGA TGTAACAAAA TTCTATAAAT TACCCTCTTA 5100 

TACTTAACAG TGAAAAGAAG TCTTTCTTGG TAACCAATTT TGAAATAGAA TTTGCTTATA 5160 

TAAAAAGGTC CAATTCCCAC TGCATAAATA GCAGTGAAAA TTACACCCTC TTGGTAACTG 5220 

TCATCTAAAA GTCTTCTA 5238 
(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13425 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
<D> TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


151: 






GACGATTTAC 


GAAGAATCGA 


ACAAGAACCT 


GCTCCTATCA 


ATTCCCAACC 


TCTATCTCTA 


60 


AAATCTTGCA 


GTTCATGCTT 


ATACTTTTTT 


AAGAAATCTA 


GAATCATAGA 


TACGGTAGAT 


120 


GACATCGTCT 


GGTTGACATT 


GGTCAAAATA 


GAACAAACCA 


AAACGACTCG 


TTCTATACCT 


180 


CCAACCTTTC 


AAATGCATCT 


CATGTAAATG 


TTCTTCTTCC 


TTGTCCAAAT 


CAACAATGGT 


240 


GAAAATCCGA 


AATTCTACTC 


TGCTATTCAT 


TGTCTTACCC 


CAAAATTAGA 


AAACATGCCT 


300 


CGCGTTATTT 


ATTAGATAAT 


TCTTTCCACT 


TTTGACTCAA 


TCTCCAAAAA 


ATATAAGAAA 


360 


TCTGAATCGC 


AAAAACTATC 


AATAAAACCC 


AATCTATTAT 


GAAAATCAAA 


AACACTTTCC 


420 


AACTGAAAGA 


ACTACCTCCA 


CTGACAAACT 


TTGAGAAAAA 


CGGTAGTAGA 


GCTAAAAAGA 


480 


GAAATAAAAT 


AGGAAGCATC 


CGCATTGTTA 


AAATCCGTTT 


GGCATAAAAA 


AATCTTTATT 


540 


TAAACGAAAA 


TATTATGGCA 


AAATTTACGC 


CAGTTTTTGA 


ACGGCTGATG 


TAGATATTTT 


600 


ATACTTTCAA 


AATGTTTAAA 


TGTGATTATT 


TATTTTTGAA 


AAATAGATCA 


CCAGCCCGAC 


660 


TGAAAGTGCT 


TATAGAATGA 


TAATAAGTCG 


CCTGCCGAAA 


ACAGCGAAAA 


ATAGCGGTGT 


720 


TATGCGGAGA 


TAATCTGACG 


CGATGCGAAA 


GTATATTGCA 


TACTTATTTT 


CAACAATTTA 


780 


GCAGAGTATT 


TTTATAAGTG 


TGATATAATA 


GAAGTATAAT 


TTGTTCTGAT 


AGTTTATTTT 


840 


ATGGAGAAGT 


AGATTTTTAG 


AATGCGGAGG 


GTTCAATATG 


GTTGAGTTTA 


TAAAGTCTAA 


900 


GAAAGAAATG 


AGTGAGGAGG 


ATATTAAAGC 


AAATTTCATC 


ACTCCTGCTA 


TTGTATCCAA 


960 
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AGCATGGAAA 


AATGGTGAGC 


ATATCGCTTA 


CGAAGAATAC 


TTCACTGATG 


GTCGAATTGA 


1020 


ACTTAGAGGA 


GATAAGGCTC 


GTCGTAAAGA 


AGGAAAAAAA 


TCAGACTATT 


CACTGTATTA 


1080 


CCAATTTGGA 


ACTCGAATTG 


CAATTGTTGA 


GGCAAAGGAT 


AATAAACACA 


GCGTTCGAGC 


1140 


AGGATTACAA 


CAAGCTATTG 


AATATGGAGA 


GATTTTAGAT 


GTTCCATTTG 


TTTATTCTTC 


1200 


GAATGGTCAT 


GGCTTTATTG 


AACACGACCG 


TATCACGAGA 


GAAGAACGTG 


AGCTGGAGTT 


1260 


AGACGAATTC 


CCTACTCGTG 


AAGAATTATT 


TTCTCGTATG 


ACGAAGGAAA 


AAGGATTGAC 


1320 


GTACGAAATT 


ACAGAAGCTA 


TCTCAACTCC 


ATACTATACA 


GACGCCTTCT 


CAATGAAAAC 


1380 


GCCACGCTAT 


TATCAGCAAA 


TAGCTATCAA 


CCGTACTATT 


GAAACAGTTG 


CCAGAGGACA 


1440 


AAAACGAGTA 


ATGTTTCTCA 


TGGCAACAGG 


AACGGGGAAA 


ACGTTCATGG 


CTTTTCAAAT 


1500 


TATTCATCGC 


CTTCGAAAAG 


CTGGTTTGGC 


TAAACGAGTT 


TTATTCTTAG 


CAGATAGAAA 


1560 


CATCTTAGTA 


GACCAAACGA 


TGGCTGAAGA 


CTTTAGGCCA 


TTCGAAAAGG 


TAATGACGAA 


1620 


AATTACACCA 


AAACTTTTGA 


CTGCTCCTGA 


AAAATTAAAT 


TCTTTTGAAA 


TTTATCTAGG 


1680 


GCTTTATCAG 


CAACTAACTG 


GTGAAGATGG 


AACTGAAACA 


CATTATCAAA 


AATTTGACAA 


1740 


AGACTTCTTT 


GATTTAATCG 


TAATTGATGA 


AGCGCACCGT 


GGTTCAGCTA 


AGGAAAACAG 


1800 


TAACTGGCGT 


AAGGTAATTG 


ATTATTTCAG 


TTCTGCGACA 


CAGATTGGGA 


TGACCGCTAC 


1860 


TCTTAAAGAA 


ACCAAGAATG 


CTTCCAATAC 


GGAATACTTT 


GGTCAGCCAA 


TCTATACTTA 


1920 


TAGTTTAAAA CAGGGAATCG 


AGGATGGTTT 


TTTGGCTCCA 


TATCGTGTTA 


TGAGGGTTAA 


1980 


TTTAGATGTG 


GATGTGGATG 


GTTATCGTCC 


AGAAACTGGA 


AAAGTTGATG 


CTAACGGACA 


2040 


ATTAATAGAA GATAGGTACT 


ACGGCAGGAA 


AGATTTTGAT 


AAAACCATTG 


TCATTGATGA 


2100 


TAGAACGCAA 


AGAGTTGCCA 


AGTTTGTTTC 


TGATTATATG 


AAGCAAAACA 


ATGCACGATT 


2160 


TGATAAAACA ATTGTTTTTT 


GTGTTGATAT 


TGACCATGCC 


GAGCGAATGC 


GTGCTGCACT 


2220 


TGTAAAAGAG 


AATCTAGACT 


TAGTCCAAGA 


AGACTATCGT 


TATGTCATGC 


AAGTAACTGG 


2280 


TGACAACGCT 


GAAGGAAAAG 


CTCAACTGGA 


TAACTTTATG 


GATGTCAATT 


CTAATTTTCsZ 


2340 


CGCTATTGTA 


ACAACGTCTA 


AATTATTAAC 


GACAGGAGTT 


AATGCTAAAA 


CATGTCGTTT 


^400 


GATTGTTTTA 


GACTCTAATA 


TCCAATCCAT 


GACTGAATTT 


AAACAAATTA 


TTGGTCGTGG 


2460 


CACACGTCTT 


TATCCTCAAA 


AGGGGAAAGA 


ATTTTTTACG 


ATTATTGATT 


TTCGAAATGT 


2520 


TACCAATTTG 


TTTGCTGACC 


CTGATTTTGA 


TGGTGATCCA 


GTGAAGGTGC 


TAGAAACAGG 


2580 


TGCGAAAACA 


GTCAGTGGTT 


CTACGCCCGG 


TTTCGTAGAT 


GAGGAAGGTG 


ACCCAGTAGA 


2640 


AAAATATATC 


GTTACAGACA 


AGCAGGTTAC 


CATTCTTAAT 


TCTACTGTTC 


AAGTATTGGA 


2700 


TGAAAACGGG 


AAACTGATTA 


CCGAAAGCCT 


GACCGACTAC 


ACTCGAAAGA 


ATATCTTAGG 


2760 
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2820 
2880 
2940 



3060 
3120 



TAGCTACGCC ACTTTGAACG ATTTTATCAC AGTTTGGCAT ACGGCAGATA ACAAGAACCT 
TATCTTAGAC GAACTTTATA AAAAAGGAGT TTATCTAGAT GCTATTCGAG AGTCGGAGGG 
AATATCAGAA CAAGAAATCG ATCATTTTGA TTTACTCCTA AAACTTGCCT ATGGTCAAAA 
AGAATTAACC AAAACGGAAC GTATCAATAA ACTCAAACAA AGCGGATATT TATATAAATA 3000 
TAGTGAGGAA GCGCGTGCTG TTTTGGAAAT TTTACTGAAC AAATACATGG ATAAAGGTAT 
— TGGAGAACTC-GAAAGCATTG AAACATTA^^TTCCA^Aa" mCA^AT"i T GG : rGGAAC 

CTTCAAAATC ATCAATACTT ATTTTGGAGA TAAAAAACGA TATTTACAAG CAATTAAAGA 3180 
ATTGGAGCAA GAGCTATTTA CAGTAGCTTA ATGAAAGGAA AGTATGTCAA TTACATCATT 3 240 
TCTAAAAAGA ATTCAAGATA TCACTCGAAA CGATGCTGGT GTTAATGGTG ATGCTCAACG 3300 
TATTGAGCAA ATGTCTTGGT TATTATTCTT AAAAATTTAT GATAGCCGTG AAATGGTTTG 
GGAATTAGAA GAAGACGACT ATGAGTCAAT TATCCCAGAG GAATTAAAAT GGCGAAATTG 
GCCTCATGCT CAAAATGGGG AACGCGTATT GACAGGCGAT GAATTACTTG ATTTTGTCAA 3480 
TAACAAGTTA TTCAAAGAGT TGAAAGAGCT TGAAATAACT TCAAATATGC CTATTCGAAA 3 540 

AACGATTGTT AAATCAGCTT TTGAAGATGC GAACAACTAT ATGAAAAATG GCGTCTTGTT 
ACGCCAAGTC ATCAATGTTA TTGATGAAGT TGATTTCAAT AGCCCTGAAG ATCGTCATTC 
GTTTAATGAT ATTTACGAAA AAATTCTTAA AGATATTCAA AATGCTGGGA ACTCAGGAGA 3720 
ATTTTATACG CCACGTGCAG CGACTGATTT TATTGCCGAA GTTCTTGACC CAAAACTTGG 3780 
AGAATCAATG GCAGACCTTG CTTGCGGAAC AGGAGGCTTC TTGACTTCGA CTCTGAACCG 3840 
TTTAAGTAGT CAACGTAAAA CTAGTGAAGA TACCAAAAAA TATAATACAG CTGTTTTTGG 3 900 

TATTGAAAAG AAAGCATTTC CTCATCTTTT ACCAGTTACA AATCTGTTTC TTCACGAAAT 3960 
TGATGACCCT AAAATTGTTC ATGGAAATAC TTTGGAGAAA AATGTTCGTG AATATACGGA 4020 
TGATGAAAAA TTTGACATTA TTATGATGAA TCCACCTTTT GGAGGGTCAG AATTAGAAAC 4080 
AATAAAAAAT AACTTTCCAG CAGAATTACG GAGTTCTGAA ACAGCTGATT TATTTATGGC 4140 
TGTCATTATG TATCGTTTGA AAGAAAATGG TCGTGTTGGA GTTATTTTAC CTGATGGTTT 
TCTATTTGGT GAAGGTGTAA AAACTCGCTT GAAACAAAAA CTGGTAGATG AGTTCAACTT 
GCATACGATT ATTAGGTTGC CTCATAGTGT CTTTGCACCG TATACAGCAA TCCATACGAA 4320 
CATTCTTTTC TTTGATAAAA CAAAGAAAAC AGAAGAAACT TCGTTTTATC GTTTAGATAT 43 80 

GCCAGATGGT TATAAAAATT TCTCGAAAAC TAACCCGATG AAGTCAGAAC ACTTCAATCC 
TGTTCGTGAC TGGTGGGAAA ATCGTGAAGA GATTCTGGAA GGTAAGTTCT ACAAATCTAA 



3360 
3420 



3600 
3660 



4200 
4260 



4440 

4500 
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ATCATTTACA 


CCTAGTGAAT 


TCGCTGACTT 


GAATTATAAT 


TTAGACCAGT 


CTGACTTTCC 


4560 


AAAACAGCAA 


GAGGAAATCT 


TAAATCCCTT 


TCAGTTGATT 


CAGAATTATC 


AAGCGGAAAG 


4620 


AGCAACTTTA 


AATCATAAGA 


TTGATAATGT 


ATTAGCTGAT 


ATTTTGCAGT 


TGTTGGAGGA 


4680 


CAAATAATGA 


CACCAGAACA 


ACTTAAAGCA 


AGTATTCTCC 


AAAGAGCCAT 


GGAAGGGAAA 


4740 


TTAGTGCCGC 


AAAATCCCAA 


TGACGAACCT 


GCAAGTGAAT 


TATTAAAGAG 


AATTAAAGCT 


4800 


GAAAAAGAAA 


AACTTATCAG 


TGAACGAAAA 


ATCAAACGAG 


ATAAAAAGGA 


AACTGAGATA 


4860 


TTTCGTGGTG 


ATGATGGGAA 


ACATTATGGG 


AAGTTTGCTG 


ATGGAAGCAC 


TCAAGAAATT 


4920 


GATGTTCCTT 


ATGATATTCC 


TGATACTTGG 


GAGTGGGTGA 


GCTTTTCTAC 


ATTGGTTGAA 


4980 


ATTGTCAGAG 


GTGGCTCTCC 


ACGACCAATC 


AAaGATTATC 


TTACTTCTGA 


AGTAGATGGA 


5040 


ATAAATTGGA 


TAAAAATAGG 


TGATACTGAA 


AAGGGTGAAA 


AGTATATAAA 


TAATGTTAAA 


S100 


GAAAAAATCA 


AAAAATCAGG 


GCTTAACAAA 


ACTAGATTTG 


TAAAAAAAGG 


TACATTTTTG 


5160 


TTAACTAATT 


CTATGAGTTT 


TGGTAGACCT 


TATATTTTGA 


ATGTTGATGG 


TGCAATACAC 


5220 


GATGCATGGT 


TGGCTATTTC 


GAACTATGAA 


AACTCATTAA 


ATAAAGATTA 


CCTATTCTAT 


5280 


ATTCTTTCAT 


CAAATGTAGT 


TTATTCTCAA 


TTTCTATCTC 


TAATTAGTGG 


AGCTGTTGTG 


5340 


AAAAACTTGA 


ATAGTGATAA 


AGTTGCTTCT 


ATTCTTATCC 


CTCTCCCCCC 


ACTATCCGAA 


5400 


CAACAACGAA 


TAGTAGAAGC 


AATCGAATCA 


GCTTTAGAAA 


AAGTAGATGA 


ATATGCTGAA 


5460 


AGTTATAATA GACTAGAACA GCTAGATAAA GAATTTCCAG 


ATAAACTAAA 


AAAATCTATT 


5520 


CTTCAATATG 


CTATGCAAGG 


AAAATTAGTT 


GAACAAGACC 


CAAATGATGA 


ATCAGTCGAA 


5580 


GTTTTACTTG 


AAAAAATACG 


AGCAGAAAAA 


CAAAAACTCT 


TTGAAGAAGG 


CAAGATTAAA 


5640 


AAGAAACATT 


TGGACATTTC 


TATTGTTTCC 


CAAGGAGATG 


ATAACTCTTA 


TTATGGGAAT 


5700 


ATACCTATGA 


ATTGGGTTGT 


TATAAAAATA 


AAAGATATTT 


TTTCAATAAA 


TACAGGTCTT 


5760 


TCTTACAAGA 


AGGGCGATTT 


AAGCATTAAT 


AATAAAGGTG 


TTAGAATTAT 


ACGTGGTGGT 


5820 


AATATTAAGC 


CTTTAGAATT 


TTCTCTGTTG 


GATAATGATT 


ACTACATTGA 


TACACAATTC 


5880 


ATCTCCTCTG 


AGCAAGTTTA 


TTTAAAACAT 


AATCAGCTAA 


TAACACCTGT 


ATCAACCTCT 


5940 


TTAGAACATA 


TTGGAAAGTT 


TGCAAGAATC 


GATAAAGACT 


ATGATGGTGT 


TGTGGCTGGT 


6000 


GGATTTATTT 


TCCAATTAAC 


ACCATTCGAA 


AGTTCAGAGA 


TTATTTCAAA 


ATTTCTATTA 


6060 


TTTAACTTGT 


CCTCTCCGTT 


ATTTTATAAA 


CAATTGAAAG 


CAATAACTAA 


ACTATCAGGT 


6120 


CAAGCTTTAT 


ATAATATTCC 


TAAAACTACA 


CTGAGCGAGC 


TATTAATTCC 


GTTAGCTCCT 


6180 


TTTGAGGAAC 


AGGAACTTAT 


TACTCAAAAA 


GTTGAGAAAC 


TTTTTGAAAA 


AGTAAATCAA 


6240 


CTTTGAAAAT 


GATTCTTTTC 


ATCTCTTCAT 


GATTAGAAAT 


AGGGATTAAT 


AATTCGGAGA 


6300 
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TACT GGT ACT ATTTAATCTT TTCCCTTTGA TAGCATC7TT TGAATCACCT AAACTAGACA 
TAACTGGCAA AAATATCATT AAGTAATCTC TGATAATATT TTCTTTATTA GCATAGGGGA 
ATATCGATAT AATGGCTTCA TTATGAGTGG CAGGAATATC CAATATGGCA ACTTTTCCAA 
TAGATAATTT AAAACTCATT AATAAAGTTC CTTTAGGTCA AATGTCTATT TTCTTTGATT 
TTAATCCTAA TTTAGAAATA GATTCTCTCG CATTAGTTAC ATAACCAGAT ATAGGCATAT 
~ ^TATAC^ ~~ ~"~ 

TTTTTCCTAT TCTGAAGTTA ACTAGGCTAG CAAATTTAAT ATATCTCCAT GCTTCTGCGA 6720 
TTTCATATAT AGGATAAGAG GTTGTTTCGT CTTTGTTCCC ATAATAAGAG CCATAATCAC 
AAAAATAGCA GGTACTCAGT TTGACCACCT GTTATTTTTT ACCAATTAAC AATTTTATCT 
ACAATATTTT GTTGTTCAGT AGCTGTTTTC CTTAGATAAA TTCGAGTAGT TTCTATACTT 
TCGTGTCCCA TCAAATCTGC AAGCAAGGCA ATATCATTAT ACTTCGCTAA AAAATTCTTA 

GCAAATAAAT GCCTAAAAGA ATGAGGGTAA ATTACCTTAG GATTCATTTT GTATTTATCA 7020 

GCATAATTTT TTAACTGTTG AGCAACTCCT CTTGCTGTAA TTGGTTCGTT AAATTTATTC 7080 

AAAAATAAAT AACCACTTCG GCGATTTTCT GATTCTAACC AACTAAGACA ACTATTTCTT 7140 

AATTTTTTAG GAATGTACAG TCTACGAATT TTACCACCTT TTGAGTAAAT GTCAAAATAA 7200 

CCGATTTCTA CATGCTCTAC TTTTAGTTTA ATAAGTTCAC TTACACGAGC CCCACTTGCA 7260 

CCTAAAAACC AAACGACAAA ATGCCATTTT AAAATACCAT CTTTTTTCAA ACTACGTTTA 7 3 20 

AGAAAAAGGT AATCAGCATG GCTAATGACA TCTTCTAAAA ACGGTTTTTG CTGTACTTTG 7380 

ACAAATTTTA ATTTCAAATC ATCATGACCA ATAAAAGCCA GATATTTATT TACTCCTTGT 74 40 

AGTCGCAAAT TGACAGTTTT AGGTTTAAAA TTGTCTAATA AATATCCTTT GTATTCAAAT 7500 

AAATCTTCCA TTTTGAGTTC GTAATTCTCC AAGAAAAATC GAACACCATA AACGTACGAA 7560 

CGCACAGTAT TTTCAGCTAA ACCAGCTTTC TTCAAATCTA ATTCAAAATC TTTCAACGTA 7 620 

AAACTCCTAT CTTATGTTTG ATAGAAATTC CACCGCACGT AAAACTATTA TACTAAATTA 7680 

GTGCGTCAAT ATGGGCCAAA AATTGTTCGA TTTTATCAAC GATTCTGGAT TGTTCAGGAA 7740 

GGGGTGGGAG GGGGATTAAA TATTCTTTTA TAGTTTTCGT TAATAATTCT TTTTGTTTTG 7800 

TACTACCCGA CGCTTTTTCT TCAATAACTG ACTGAACAAT AGGAGAGGAA AGAAAATTAT 7860 

AGATGAAATG GCAATTAATA ACCCCCGATA AGACTCTTAT AACTGTAACA TGGCTATCTG 7920 

CAACAGCCCA GCCATAAGGA TTTTTATTTT CATGGTAAAT AGCTAATCGT CCTAACGTAC 7980 

CTAGACCTGT TGAATTCCAC ATTAAATCAC CATCTCTTAG TAATCTTTCT TTCTGGTAAC 8040 
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TATGAACTCT TTCGGGATCA 


ATAAATCTTG 


CTAAGTCAAT 


AGAAAAGCCA 


GACCATTGAT 


8100 


TACATTTCTG AGCAATCACA GGGTATATAG 


GAATATTTGA 


ATATTTTGGA 


GACTTCCCTC 


8160 


TTTGAATGTA GGAGGTTATA 


TCGTTTAACC 


TCACCCATTC 


CCAACTTTCT 


GGTATTTCAC 


8220 


AAGGTACTTC CTCATAATAA GAGTTATCAT CTCCTTGGGA AACAATAGAA 


ATGTCCAAAT 


8280 


CTTTCTTTTT AATCTTGCCT 


TCTTCAAAGA 


GTTTTTGTTT 


TTCTGCTCGT 


ATTTTTTCAA 


8340 


GTAAAACTTC GACTGATTCA 


TCATTTGGGT 


CTTGTTCAAC 


TAATTTTCCT 


TGCATAGCAT 


8400 


ATTGAAGAAT AGATTTTTTT 


AGTTTATCTG 


GAAATTCTTT 


ATCTAGCTGT 


TCTAGTCTAT 


8460 


TATAACTTTC AGCATATTCA TCTACTTTTT CTAAAGCTGA TTCGATTGCT 


TCTACTATTC 


8520 


GTTGTTGTTC GGATAGTGGG 


GGGAGAGCAA 


TTAATAATAG 


ATTAAAATTA 


TAATCATTGA 


8580 


TTGCAGGATA ACTTGTTCCA 


GTAGATTTAT 


TATTAACACG 


ATTGATAAAA 


TTATCTGATA 


8640 


ATAAATAATA TTTCAAATAT 


GTTTCGTTAA 


GTAAAGTATC 


CAAAACAATA 


AATGCTGTAC 


8700 


TAGCTATCAA ATACTCTTTA 


AGTTCTCTAA 


CTACAGCAAT 


ATTTTTTAGA 


TATGGTCTAA 


8760 


CTGTTGAAAA TAAGACACTA 


TTCTGCGAAA 


CTAATTTTCT 


AGCACGGGAA 


GCCGCTTGTT 


8820 


CAGGTGAAAG ATATTGTAGA 


TTTTTGTAGT 


TGATTATGTT 


CTTTTTTCTA 


TCAATACTAG 


8880 


ACGTATCTAT ATACCTAAAG 


GATTTCTCTG 


GCTTATTTTG 


CCCAAAATTC 


CAATAAATTG 


8940 


ATTTTATCCT CACCCACTCC 


CAAGTATCAG 


GAATATCATA 


AGGAACATCA 


ATTTCTTGAG 


9000 


TGCTTCCATC AGCAAACTTC 


CCATAATGTT 


TCTTATGTGC 


TTCAAGTATA 


TAAAAAGGCG 


9060 


TAAAAATACG CCTATAGATA 


ATGGGGTTGA 


AATAGGTTTA 


TTGTTGATGA 


GATTGTAGAT 


9120 


AATTCAATTT TTTACTTCCA 


ATCGAATATT 


CAAATCCTCC 


ACCTTTTCTG 


CCTGTAATTG 


9180 


TTCATCATAA AATTCAATAT 


CTTCAGGATT 


TTCCCCTTGG 


CAACCTCGGC 


AGAAATATTC 


9240 


TTCCGCTCGA TCAGGATTCA AAAATCGACA AGCACAAACA 


AAACAGTCGC 


CATCATCATT 


9300 


TATTGAGATA ATATAGTAGA 


TTGAAATAAG 


ATGTAAACAA 


ATCGATTAGG 


AAAGTTAAAT 


9360 


TAGTTTCTAG AAATTTTTAG 


CAGATGTAGT 


GTACTATTCT 


AGTCTCAATT 


TACTATGCCT 


9420 


TCAAATATAT CTPTCGAAAA 


AATATTTACA 


GATGTGTAAT 


TTTGAAGCTT 


GCAAAAGTTA 


9480 


GTAAACTTGT AGATTTCGAT 


TTGAAGTAAC 


TTGTTTTCTT 


GCCCGATATT 


GTTTTTGAAA 


9540 


TTGAATTTTT CCATAGTGAC 


TCCTTAATTT 


TCTTCTACAC 


GTCTGATGAT 


AAATCTAATT 


9600 


CGCAAAAGAG TCAAGAGGAT 


TTTTCGAAAA 


ATAAATAGCG 


ACCGAAATCG 


CTATTTTAAG 


9660 


GGTTATAGGT ATTTGATGGC 


TTAGACTGCT 


GTGTGACTGT 


TTACCCACAG 


GCAATCTTTC 


9720 


TTCTATATTA GTATTAGTAA 


AGGTCTAAAT 


AATTATCAAT 


TTCCCATTGT 


GAAACGAAGG 


9780 


TTGCATAACT TGCCCATTCG 


ATTCGTTTGG 


CTTCAAGGAA 


GCTAGTATAG 


ATGTGATCTC 


9840 
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CGAGAGCAGC TTTAACCACT TCATCTTCTG TCAAAGCTTT CAAAGCGTTG TGAAGAGTTG 9900 

ATGGAAGGTC TGTAATACCA GCTTCCTTGC GCTCTTCTGC TGTCATGATG TAGATATTTT 9960 

CTTCGATAGG AGCTGGTGCT TCGATTTTAT TTTCAATACC ATACAAACCA ACTTCCAAAA 10020 

GAACAGCCAT AGCAACGTAA GGGTTCGCCA TTGGATCCAC TGAACGCAAC TCAAGACGAG 10080 

TTCCCATACC ACGTGAAGCA GGTACGCGCA CAAGTGCCGA ACGGTTACGA CCAGCCCAAG 10140 

CAATGTAAAC AGGCGCTTCA TAACCTGGAA CCAAACCTTT GTATGACTTA ACTGTTGGGT 10200 

TCATGATGGC AGTATAGTTG TAAGCATGCT TGATCAAACC GCCTAGGAAA TGGTAAGCTG 10260 

TTTCTGACAA CTGCATTCCT TTTGGATCAT TTGGATCAAA GAAGGCGTTA TTTCCTTCTG 10320 

CATCAAACAA GGACATATTA CAGTGCATAC CTGATCCAGC AATACCAAAT TTTGGCTTCC 10380 

CCATAAATGT TGCGTAAAGT CCGTGTTTGC GAGCAATGGT TTTAACAACA AGCTTAAAGA 10440 

TTTGAATCTT ATCACAAGCA CGGAGAACTT CATCGTACTT AAAGTCAATC TCATGCTGTC 10500 

CAACCGCAAC CTCGTGGTGA CTCGCTTCTA CTTCAAATCC CATTTTGGTC AAGACATTCA 10560 

CAATCTCACG ACGTGTGTTG TCCGCAAGGT CAGTAGGTGC CAAGTCAAAG TAGCCACCCT 10620 

TGTCATTCAC TTCAAGTGTT GGGTCCCCAT TTTCATCCAA CTTAAATAGG AAGAATTCTG 10680 

GCTCTGGACC AAGGTTGAAG GATTTGAATC CAACTTCTTC CATGTGACGA AGAGCTCGTT 10740 ' 

TCAAATTACC ACGAGGGTCA CCCGCAAATG CTTCACCTTC TGTTGTATAG ACATCACAGA 10800 

TCAGACCTGC AACACTTCCA TTTTCATCTC CCCAAGGGAA GACTGTCCAT CTATCCAACT 10860 ~ 

CCGGGTACAA GTACATATCC GACTCATTGA TACGTACAAA ACCTTCAATA GAAGATCCAT 10920. 

CAAACATAAC CTTGTTCGAC AAGACCTTAT CTAACTGTTC ATCTGTAGCA GGAATTTCGA 10980 

CGTTTTTCAT GGTTCCCAAA ATATCTGAGA ACATAAGACG AATAAAGGTA ACATTTTTTT 11040 

CCTTGACTTC ACGACGAATA TCTGCAGCTG TGATTGGCAT AAGTTTTCTC CTTAATCTAT 11100 

GACTACTTGC GGTTGCCTAA CCGCGACCAA AAGCTGACTG TACTGAAGCA AAACGCCCCT 11160 

GTTGGAGGAC TTCATTGTGA ACTGCACGAC GTACTTCAGT CTGACTAACC GCTTTCTTGG 11220 

ATTTCGCTTC ACGTTCAGCA TATTTTTTCT TAATGGCAGC GAT ATT AT AA CCTTCAGAGA 11280 

TATAATCTTT GATTTCAAGC AGACGATCCA TGTCATTCAA GGAATACATG CGACGATTTC 11340 

CTTCGTTTCG ATCGGGCTTG ATCAACTCTT GATCTTCATA ATAACGAATC TGACGCGCCG 11400 

ATAGATCGGT CAACTTCATA ACACTGCCGA TAGGAAAAAC AGCCATATTT CGGCGAAATT 114 60 

CTTTTTCCTT CATTTACAAT TTCCTTCTTT CTCTCTATTA TAGTCTAAAA AAAGACAAAC 11520 

GTCAATTGAT AATGTTATAA AATGTAACAT TATTTTTCTT TTTTCTCTAA AAAGAGACGA 11580 
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ATACGATCAA TATCCTAATT TACGATAATT GCGACAAAAA CTCCCATAAA CGTTTCTAAT 11640 

ACACGCACAA ACACGTACAA AATTGTCTCA CCACTTGGAA TTGATAGGGT AATGATTAAC 11700 

ATAGCTGCTA CACCACCAAT AACCCCTGCT TTGTTATTCA TGGCTACATT TGTCATAATG 11760 

CTTAACATGG TGCAGATTGG AACAACTACC AAGGTCACCC AAAAGGCTTC GTGGAAAAAG 11820 

GTATTTAATA AGAAGAAGAC CAAGGCATAG AGTCCACCGA TACTATTTCC TAGAATACGC 11880 

GAAGTCCCAA AATGAACACT CTCATCAAAA CTCTCCCTCA GGCTAAAAAC GGCTGTCAAA 11940 

GCACCAATTT GAAGACCTTT CCAGCCAAAA AAGCCAAAAA TCAAGAGAAC TAGAAAAACA 12000 

GCAATACCTC TTTTAAAGGT TCGCATACCA AGTTTGAACT GGGATTTATC GAATTTATAT 12060 

TTTTTAAAAT AACTCATAAT CTCAACTTTC TATTTCCATT TT AT CAT AAA TCGGTGATTT 12120 

TTATGAGTAA TAGTTGAGAG GAAGCGTTTT TATTTTAAGC AAAAGAAAAG AGGAACTTTC 12180 

ATCCCTCTCT TCTTTGATTT ATTTATAAAA TCTTATTTTT CTGTCAAGGC TGCAAGTCCT 12240 

GGAAGAACCT TACCTTCAAG AAGTTCCATT GATGCTCCAC CACCCCTACT AATCCATGAG 12300 

AACTTGTCTG CACGGCCAAG GTTAATCGCT GCGGCAGCTG AGTCACCACC ACCGATGATT 123 60 

GATTTAACTC CTGGTTGTTT CACGATAGCG TCCATCACAC CGATTGTACC AGCTTGGAAA 12420 

TCTGGGTTTT CAAATACACC CATAGGTCCG TTCCATACGA CT G TTTTGGC ACCAGTCAAA 12480 

GCTTCGTCAA ATTTGGCGAT AGATTTTGGA CCGATGTCAA GACCAAGGAA GCCTTCAGAA 12 540 

ACTGCTTCAC CTTCAGTGTC ACGCACTTCA GTGTAACCAG CAAATGCGTT AGCTTCTTTT 12600 

GAGTCAACTG GCAAGATCAA TTTACCATTT GCTTTTTCAA GAAGACCTTT CGCAACATCC 12660 

AATTTGTCTT CTTCTACAAG TGAGTTACCG ATTTCGATAC CTTGTGCTTT GTACAATGTG 12720 

TAAGTCATCC CACCACCGAT AAGGACGTTA TCAGCTTTTT CAAGCAAGTT TTCGATAACA 12780 

CCGATCTTGT CTGAAACTTT TGAACCACCA AGGATAGCCA CGAATGGACG TTCTGGAGTT 12840 

TCAACTGCTT CTTGGATGTA GGCAATTTCG TTTTCAAGAA GGAAACCAGC AACTGCTTTT 12900 

TCAACGTTTG CTGAGATACC AACGTTAGAT GCGTGTGCAC GGTGAGCTGT ACCGAATGCA 12960 

TCGTTTACGA AGATACCATC TCCAAGTGAT GCCCAGTATT TACCAAGTTC AGGATCGTTT 13020 

TTAGATTCTT TCTTGCCGTC AACATCTTCG TAACGAGTGT TTTCAACCAA GAGAACTTGT 13080 

CCATCTTCAA GAGCGTTGAT TGCCGCTTCT AATTCAGCAC CACGAGTGAC ACCTGGGAAA 13X40 

ACAACATCTT GACCAAGTTT TGCTGCCAAG TCAGCTGCTA CAGGAGCAAG TGATTTACCA 13200 

GCTTTATCAG Cr T CrTLTTT CACACGTCCA AGGTGAGAGA AAAGAATTGC ACGTCCACCT 13260 

TGTTCGATGA TGTACTTAAT AGTTGGAAGA GCTGCTGTGA TACGGTTATC CTTAGTGATT 13320 

ACGCCATCTT TCAATGGTAC GTTGAAGTCA ACACGAACGA GGACTTTTTT ACCTTTCAAG 13 380 
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TCAACGTCTT TAACAGTAAG TTTTGCCATG TTACAAAAAC TCCGG 
(2) INFORMATION FOR SEQ ID NO: 152: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi> 


SEQUENCE DESCRIPTION: SEQ ID NO: 


152: 






GATTTATCC? 


AC CGGnG AAT TTCCGGAGGG GTTCTAGCAG 


CAATCTTAGG 


AATCTATGAA 


60 


CGAATGATTG 


GCTTTCTGGC CCATCCCTTT AAAGACTTTA 


AAGAAAATGT 


TTTGTACTTT 


120 


ATTCCAGTTG 


CCATCGGTAT GCTTCTGGGA ATCGGCTTAT 


TTTCCTACCC 


GATTGAATAC 


180 


CTGCTTGAAA 


ATTATCAGGT TTTTGTATTA TGGAGCTTTG 


CGGGAGCTAT 


TATCGGTACA 


240 


GTTCCTAGCC 


TCCTCAAAGA ATCAACTCGA GAATCTGACC 


GAGACAAGAT 


TGATTTAGCT 


300 


TGGTTATGGA 


CAACCTTTAT CATTTCTGGA TTAGGACTCT 


ATGCCTTAAA 


TTTTGTCGTT 


360 


GGAACCTTAA 


GCGCCAGCTT TCTTAACTTC GTCCTAGCAG 


GCGCACTATT 


GGCCCTTGGC 


420 


GTCTTGGTTC 


CTGGCCTCAG CCCATCAAAT TTACTTTTGA 


TTTTGGGACT CTATGCTCCT 


480 


ATGTTGACTG 


GTTTTAAAAC TTTTGATTTC TTGGGAACCT 


TCTTTCCGAT 


TGGAATTGGT 


540 


GCAGGTGCAA 


CTCTCATCGT TTTTTCAAAA TTGATAGATT 


ATGCCTTAAA 


CAACTACCAC 


600 


TCACGCGTCT 


ATCATTTCAT CATCGGTATC CTCCTATCAA 


GTACCCTTTT 


GATCTTAATT 


660 


CCAAATGCAG 


GAAACGCTGA AAGTATCCAA TACACAGGAC 


TTTCACTTGT 


CCCTTATGTC 


720 


ATCATCGCCT 


TCTTCTTTGC GCTGGGAATC TGGCTTGGTA 


TTTGGATGAG 


TCAATTGGAG 


780 


GATAAATATA 


AATAATGGCA AAAAAAGTTA AAATCAAAAA 


AACATTGGTG 


GAACAAATCC 


840 


TATCTAAAGC 


AGCTATCCCT CATCAGGGGA TTCAAATCAA TGCCCTAGAA 


GGAGAGCTTC 


900 


CTCAA 








905 



(2) INFORMATION FOR SEQ ID NO: 153: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 
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CTTGAATTAA 


ATAAAAAACG 


TCATGCGACT 


AAGCATTTTA 


CTCATAAGCT 


TGTTGATCCC 


60 


AAAGATGTGC 


GTACGGCTAT 


CGAAATTGCA 


ACCTTAGCGC 


CAAGCGCCCA 


CAACAGCCAG 


120 


CCTTGGAAAT 


TTGTGGTGGT 


ACGTGAGAAA 


AATGCTGAAC 


TGCCAAAGTT 


AGCTTATGGT 


180 


TCCAATTTTG 


AACAGGTATC 


ATCAGCCCCT GTAACCATTG 


CCTTGTTTAC 


AGATACGGAC 


240 


TTAGCCAAAC 


GTGCTCGTAA 


GATTGCCCGT 


GTTGGTCGTG 


CTAATAACTT 


TTCTGAAGAG 


300 


CAACTTCAAT 


ATTTTATGAA 


AAATCTGCCA 


GCTGAGTTTG 


CCCGTTACAG 


TGAGCAACAA 


360 


CTCAGCGACT 


ACCTAGCTCT 


CAATGCAGGT 


TTGGTTGCCA 


TGAACTTGGT 


TCTTGCATTG 


420 


ACAGACCAAG 


GAATTGGTTC 


TAACATTATT 


CTTGGTTTTG 


ACAAATCAAA 


AGTTAATGAA 


480 


GTTTTGGAAA 


TCGAAGACCG 


TTTCCGCCCA 


GAACTCTTGA 


TCACAGTGGG 


TTATACAGAC 


540 


GAAAAATTGG 


AACCAAGCTA 


CCGCTTGCCA 


GTAGATGAAA 


TCATCGAGAA 


AAGATAGAAA 


600 


GAAGAAAAAA 


TGACAGCAAT 


TGATTTTACA 


GCAGAAGTAG 


AAAAACGCAA 


AGAAGACCTC 


660 


TTGGCTGACT 


TGTTTACCCT 


TTTGGAAATC 


AATTCACAAC 


GTGATGACAG 


CAAGGCTCAT 


720 


GCCCAGCATC 


CATTTGGGCC 


TGGTCCAGTA 


AAAGCCTTGG 


AGAAATTCCT 


TGAAATCGCA 


780 


GACCGCGATG 


GCTACCCAAC 


TAAGAATGTT 


GATAACTATG 


CAGGACATTT 


TGAGTTTGGT 


840 


GATGGAGAAG 


AAGTTCTCGG 


AATCTTTGCC 


CATATGGATG 


TGGTGCCTGC 


TGGTAGCGGT 


900 


TGGGACACAG 


ACCCTTACAC 


ACCAACTATC 


AAAGATGGTC 


GCCTTTATGC 


GCGCGGGGCT 


960 


TCGGACGATA 


AGGGTCCTAC 


AACAGCTTGT 


TACTATGGTT 


TGAAAATCAT 


CAAAGAATTG 


1020 


GGTCTTCCAA 


CTTCTAAGAA 


AGTTCGCTTC 


ATCGTTGGAA 


CAGACGAAGA 


ATCAGGCTGG 


1080 


GCAGACATGG 


ACTACTACTT 


TGAGCACGTA GGACTTGCCA 


AACCAGATTT 


CGGTTTCTCA 


1X40 


CCAGATGCTG 


AATTTCCAAT 


CATCAATGGT 


GAAAAAGGAA 


ATATCACGGA 


ATACCTCCAC 


1200 


TTTGCAGGAG 


AAAATACAGG 


TGTTGCCCGT 


CTTCACAGCT 


TTACAGGTGG 


TTTACGTGAA 


1260 


AATATGGTAC 


CAGAATCAGC 


AACAGCAGTC 


GTTTCAGGTG 


ACTTGGCTGA 


CTTGCAAGCT 


1320 


AAACTAGATC 


CCTTTGTTGC 


AGAACACAAA 


CTTAGAGGAG 


AACTCCAAGA 


AGAAGCTGGC 


1380 


AAATACAAGG 


TGACGATCAT 


TGGTAAATCA GCCCACGGTG 


CTATGCCTGC 


TTCAGGTGTC 


.440 


AATGGCGCAA 


CTTACCTTGC 


CCTCTTCCTC 


AGCCAGTTTG 


GCTTTGCTGG 


TCCAGCCAAA 


1500 


GACTACCTTG 


ACATCGCAGG 


TAAAATTCTC 


TTGAACGATC 


ATGAGGGTGA 


AAATCTTAAG 


1560 


ATTGCTCATG 


TGGATGAAAA 


GATGGGTGCT 


CTTTCTATGA 


ATGCCCGCCT 


CTTCCACTTC 


1620 


GATGAAACAA 


GTGCTGATAA 


TACCATTGCC 


CTCAACATCC 


GCTATCCAAA 


AGCAACAAGT 


1680 


CCAGAACAAA 


TCAAGTCAAT 


CCTTGAAAAC 


TTGCCAGTTG 


TTTCTGTTAG 


CCTGTCTGAA 


1740 


CACGGTCACA 


CGCCTCACTA 


TGTGCCAATG 


GAAGATCCAC 


TTGTGCAAAC 


CTTGTTGAAT 


1800 



WO 98/18931 



PCT/US97/19588 



1019 

ATCTATCAAA AACAAACTGG CTTTAAAGGT CATCAACAAG TCATCGGTGG TGGAACCTTT 1860 

GGTCGCTTGC TAGAACGCGG AGTTGCCTAC GGTGCTATGT TCCCAGACTC GATTGATACC 1920 

ATGCACCAAG CCAATGAATT TATCGCCTTC GATGATCTTT TCCCAGCAGC AGCAATTTAT 1980 

GCCGAACCTA TTTACGAATT GATCAAATAA AACGATAGAA GTCTGACATC TTATCCTTCG 2 040 

ACTTCTTTTT GGAGGGAAAG TAGATGTCTC AAATCGAAAG AATCAAACAG GCTATCATGG 2100 

GGCATTCGCA GAATGCCAGC TATACAGAGC GTGGCATTGA GCCTCT CtTT~GCACCGCCAA ~~ 2160 

AAACTGCTCG CATCAATATC ATCGGTCAGG CTCCGGGACT TAAAACTCAA GAAGCAGGCC 2220 

TTTACTGGAA AGATAAAAGT GGTGACCGCT TGCGGGACTG GCTAGGTGTG GATGAAGATA 22 80 

CCTTTTACAA TTCAGCTTAT TTTGCTGTTT TGCCTATGGA TTTCTACTTT CCAGCACATG 2340 

GCAAGTCGGG TGATCTTCCG CCTCGTACAG GTTTTGCAGA AAAATGGCAT CCGCAGGTCT 2400 

TACAGGAATT GCCTGATATT CAGTTAACCC TCTTGATTCG GCAATATGCC CAAGCCTACT 24 60 

ATTTACAGGA GAAAATCAGT GGGAAGGTAA CGCAGAGGCT GAAACACTAT AAAGACTATC 2520 

TGCCACCCTA TTTTCCCCTA GTTCACCCAT CACCACGAAA TCAAATCTGG ATGGCCAAAA 2580 

ATCCTTGGTT TGAGGCAGAA GTAGTGCCAG ATTTGAAAAA AAGAATTAAA ACCATTTTAT 2640 

AGTCAATGAA AATCAAACAG CAAACTAGGA AGCTAGTCGT AGGCTGCTCA AAGTACAGCT 2700 

TTGAACTTGC AGATAAAACT GACGAAGTCG GTAACATACG CACGGTAAGC CGACGCTGAC 27 60 

GTGGTTTGAA GAGATTTTCG AAGACTATTA GAAGAAAAAG AATGAAAGAA ATAGCCTTTG 2820 

ACGCATTTTA CCAGCTTTAC CAAAACGACC AGCTTTCTTT AGTGGATGTG AGAGAAGTGG 2880 

ATGAGTTTGC AGCTCTTCAT TTAGAAGGTG CCCACAACCT ACCGCTTAGT CAATTGGCTG 2940 

ATAGTTATGA TTAATTGCAC AAAGATCGCT TGCATTATAT TATTTGCAAA TCTGGAATGA 3000 

GATCGGCGCG TGCTTGCCAA TTCCTATTAG AACAAGGTTA TAATGTTATC AATGTCCAGG 3060 

GTGGCATGTT ACCCTTTGAA GAACTTTAAA ATTTTGCATT TCTCCTACTT GGTGTGGACT 3120 

GGGTACGACA GTTTTATTTT TAGATAATTC TTATTTTTAA GAAAATTGAA AACATTTAAT 3180 

ATTTGCCTCG TGATGCTTTT TTCAGACTCC TAATCGTGGT ATACTAGGTC ACTATTTTAT 3240 

AAATATGAAG GACATTTTTA TGGCTAAAAA AGGTACCCTA ACAGGTTTGC TCCTGTTTGG 3300 

AATATTTTTT GGTGCGGGGA ACTTGATTTT TCCGCCTTCT CTAGGTGCTC TATCTGGAGA 3360 

ACATTTTCTT CCTGCCATCG CAGGTTTTGT CTTTTCAGGC GTTGCTATCG CCGTCTTGAC 3420 

CCTTATTATT GGAACGCTAA ATCCTAAAGG ATATATCTAC GAGATTTCAA CGAAGATAGC 3480 

GCCTTCCTTT GCGACTCTTT ACCTCTCAGT TCTTTACTTG TCAATCGGTC CATTCTTTGC 3540 
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TACCCCACCT ACTGCTACAA CAGCTTACGA AGTACCGATT AGCCCCCTTT TGTCGGATGC J 600 

AAATAAAGGA CTTGGCTTGA TTGTATTTAC GGTTCTGTAT TTTGCGGCAG CCTATTTGAT 3660 

TTCGCTTAAT CCATCAAAAA TCTTAGACCG CATTGGACGT ATTTTAACGC CAGTCTTTGC 3720 

AATTTTGATT GTTATCTTGG TCGTTCTGGG AGCTATCAAA TATGCTGCAA CAAGTCCTCA 3780 

AGCTGCTTCA CTGCTTATCA ACCTTCTGCC TTTGGTACAG GTTTCCTAGA AGGTTACAAT 3840 

ACCTTGGACG CCCTTGCCTC AGTGGCCTTT AGCGTAATCG CAGTTCAAAC CTTGAAACAA 3900 

CTTGCATTTT CAAGTAAGAA AGAATACATT TCAACTATTT GGGTTGTTGG TATCGTTGTT 3960 

GCCCTTGCCT TCAGCGCTCT TTACATCGGT TTAGGTTTTC TTGGAAATCA TTTCCCAGTA 4020 

CCAGCTGAAG CGATGAAGGG TGGAACACCA GGT G TTTACA TCTTGTCACA ACCCACTCAA 4080 

GAAATCTTTG GCTCAACAGC TCAACTCTTC CTTGCAGCTA TGGTTACCGT AACCTGCTTC 4140 

ACAACGACTG TTGGTTTGAT TGTGTCAACA GCTGAGTTCT TTAATGAGCC CTTCCCACAA 4200 

ATCAGCTACA AGGTTTATGC GACAGCCTTT ACCTTGATTG GATTTGCTAT TGCCAATTTG 4260 

GGTCTTGATG CGATTATC 4278 
(2) INFORMATION FOR SEQ ID NO: 154: 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 



ACCCGATCAA 


ATGACAAAAG 


CTAACTTTGG 


TGTCGTAGGT 


ATGGCCGTAA 


TGGGTCGTAA 


60 


CCTTGCCCTT 


AATATTGAAT 


CTCGTGGTTA 


CACAGTTGCT 


ATCTACAACC 


GTAGTAAAGA 


120 


AAAAACGGAA 


GATGTGATTG 


CTTGCCATCC 


TGAAAAGAAC 


TTTGTACCAA 


GCTATGACGT 


180 


TGAAAGTTTT 


GTAAACTCAA 


TCGAAAAACC 


TCGTCGTATC 


ATGCTGATGG 


TTCAAGCTGG 


240 


ACCTGCTACA 


GATGCTACTA 


TCCAAGCCCT 


TCTTCCACAC 


CTTGACAAGG 


GTGATATCTT 


300 


GATTGACGGA 


GGAAATACTT 


TCTACAAAGA 


TACCATCCGT 


CGTAATGAAG 


AATTGGCAAA 


360 


CTCTGGTATC 


AACTTTATCG 


GTACTGGGGT 


TTCTGGTGGT 


GAAAAAGGTG 


CCCTTGAAGG 


420 


TCCTTCTATC 


ATGCCTGGTG 


GACAAAAAGA 


AGCCTACGAA 


TTGGTTGCGG 


ATGTTCTTGA 


480 


AGAAATCTCA 


GCTAAAGCAC 


CAGAAGATGG 


CAAACCATGT 


GTGACTTACA 


TCGCTCCTGA 


540 


TGGAGCTGGT 


CACTATGTGA 


AAATGGTTCA 


CAATGCTATT 


GAGTACGGTG 


ATATGCAATT 


600 


GATCGCAGAA 


AGCTATGACT 


TGATGCAACA 


CTTGCTAGGC 


CTTTCTGCAG 


AAGATATGGC 


660 
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TGAAATCTTT ACTGAGTGGA ACAAGCGTGA ATT AG AC AG C TACTTGATTG AAATCACAGC 720 

TCATATCTTG AGCCGTAAAG ACGATGAAGG CCAAGATGGA CCAATCGTAG ACTACATCCT 780 

TGATGCTGCA GGTAACAAGG GAACTGGTAA ATGGACTAGC CAATCATCTC TTGACCTTGG 840 

TGTACCATTG TCACTGATTA CTGAGTCAGT GTTTGCACGC TACATTTCAA CTTACAAAGA 900 

AGAACGTGTA CATGCTAGCA AGGTGCTTCC AAAACCAGCT GCCTTCAACT TTGAAGGAGA 960 

CAAGGCTGAA TTGATTGAAA AGATCCGTCA AGCCCTTTAC TTCTCAAAAA TCATTTCATA "i 02 0 

CGCACAAGGA TTTGCTCAAT TGCGTGTAGC CTCTAAAGAA AACAACTGGA ACTTGCCATT 1080 

TGCAGATATC GCATCTATCT GGCGTGATGG CTGTATCATC CGTTCTCCTT TCTTGCAAAA U40 

GATTACAGAT GCTTACAACC GCGATGCAGA TCTTGCCAAC CTTCTTTTCC ACGAGTACTT 1200 

CTTGGATGTT ACTGCTAAGT ACCAACAAGC AGTACGTGAT ATCGTAGCTC TTGCGGTTCA 1260 

AGCAGGTGTG CCAGTGCCAA CTTTCTCAGC AG CT ATT ACT TACTTTCATA CCTACCGTTC 1320 

AGCTGACCTT CCAGCTAACT TGATCCAACC ACAACGTGAC TACTTTGGTG CTCACACTTA 1380 

CCAACGTAAA GACAAAGAAG GAACCTTCCA CTACTCTTGG TATGACGAAA AATAAGTAGG 1440 

TCAGCCATGG GGAAACGGAT TTTATTACTT GAGAAAGAAC GAAATCTACC TCATTTTTTA 1500 

AGTTTGGAAC TCCAGAAAGA GCAGTATCGG GTTGATCTGG TAGAGGAGGG GCAAAAAGCC 1560 

CTCTCCATGG CTCTTCAGAC AGACTATGAT TTGATGTTAT TGAACGTTAA TCTGGGAGAT 1620 

ATGATGGCTC AGGATTTTGC AGAAAAATTG AGCCGAACTA AACCTGCCTC AGTCATCATG 1680 

ATTTTAGATC ATTGGGAAGA CTTGCAAGAA GAGCTGGAAG TTGTTCAGCG TTTTGCAGTT 1740 

TCATACATCT ATAAGCCAGT CCTTATCGAA AATCTGGTAG CGCGTATTTC GGCGATCTTC 1800 

CGAGGTCGGG ACTTCATTGA TCAACACTGC AGTCTGATGA AAGTTCCAAG GACCTACCGC I860 

AATCTTAGGA TAGATGTTGA ACATCACACG GTTTATCGTG GTGAAGAGAT CATTGCTCTC 1920 

ACACGCCGTG AGTATGACCT TTTGGCGACA CGG m3 
(2) INFORMATION FOR SEQ ID NO: 155: 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(DJ TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
CCGGCAGTAC ACGAGCTTGG GGAACAGCCA CTGGAACGAT GAGGTGTGAG CTCAAAATAT 



60 
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CCTCCAGTTA 


TGTTTTTCCT AATAGTATAC 


CCGAAGAGTG AAAGGATTTT 


ATAATGGAGC 


120 


GGTTACAAAG 


AACCTACTTT 


CTATTAAACA 


GTATACTATG AAAATGTGAA 


AATTTAACAT 


180 


TTTTTTGTAC 


AAATTTTATA 


AATTATTGCC 


TTTTTAATAT CAATAGTTAA 


TCTCTTATCC 


240 


AGATCCCCCT 


TGTGTAAACT 


TTATCTTTAT 


AACCTTCAAG GCCCCTATCC CATCTATTTG 


300 


CAACAATTAG 


ATCACTTTGT 


TTTGTAAATA 


GTTCAAAATT CTTTTCAATA 


ATTACGTTAT 


360 


CTATACTAAC 


GTTTAAATTT 


GGTTCATATA 


CTAAAATTTT TATACCGACA 


ATCAATAGTT 


420 


CATTAATTAT 


ACTTAAAATA 


GCTGACTCTT 


TGTAATTATC TGAATTATAT 


TTCATCCCCA 


4 80 


ATTTATATAT 


TCCTACTATC TTTGGCTTTC GTTCCAATAT TTGTTTAACT 


ATGAACTGTT 


540 


TTCTATTTGT 


GTTTGAAATA 


TCAATCGCTT 


CTATCACTGG GGCATTTATT 


TCTATAAATT 


600 


CTTTTTTTAA 


TTGTTTAGTA 


TCTTTGGGAA 


GACAATATCC TCCAAATCCA 


AAAGAAGGAT 


660 


TATTATAAAA 


ATTTCCAATT 


CTTGGATCTA 


AACAAACACC TTTTATTACA 


ACTTCAGCAT 


720 


TTAAGCTTCT 


CCTCTCAGCA 


AAAGAATCTA 


GTTCATTAAA AAAGCAACAC 


GGAGAGCTAA 


780 


GAATCTCTTA 


GAAAAAAGCT 


TAATTGCTTC 


TGCTTCAGTA GGAGAAACTA 


ACATAACATT 


840 


TTTAATATTG 


GCAGTACTAT 


GAGTACTAAT 


CGAAAGGAAC AACTCTGCAA 


TTTTTCTTCC 


900 


TTCAACTGTC 


TCATCTCCAA 


CAACTATGCG 


ACTTGGATAT AAATTATCAT 


ATATAGAACA 


960 


ACCTTCTCTC 


AAAAATTCAG 


GGACAAAAAT 


GATATTTTTT CTATCAAACA 


GCCTTTTTAA 


1020 


TTTGTTTGAA 


AAGCCGATCG 


GAACTGTTGA 


CTTTAAAATA ATCTTTCCAT 


TAGGTTTTAC 


1080 


CCTCAGAATC 


TTCGATACCG 


TTTGTTCGAT 


TTCATATGTA TTAAAACTAC 


CAATTTTCTC 


1140 


ATCATAATCT 


GTCGCAAGCG 


CAATAATATA 


ATAATCAATA TTATTTTTAA 


TTTCAGAAAA 


1200 


TGTATCAAAA 


AAAGTAATAT 


TTAAGTTATT 


CTCGCAAAAA AACTTCATAA 


GCTCTTCATT 


1260 


TTTAGATGGA 


AGAATGCCCT 


TTTTTAAATT 


ATTTATTTTT ACAGAATCTA 


TATCATATGC 


1320 


AACAACTTTA 


TATTTAGATG 


CAAATAGTAA 


CGCGTAGGCC AGCCCAACAT 


GCCCCAAACC 


1380 


AATTACTGCT 


ATATTCATAA 


AACTACTTCC 


TTATTTCTTA ATCCAAAATC 


TAATAGAAVA 


1440 


AGCTGCCCCA 


TTCCTTAAAT 


ACAACTCTTT 


AATATTGTTT AAAAGTTTTT 


CAACTGATTT 


1500 


CCAGATTATC 


AAAATCTGAG 


ATTTATAGCA 


CAATATTGAT GATATTCTAT 


CAATATAATT 


1560 


TTTTTCATCA 


AGTTCCTCTT 


GATACATTTT 


TAATTCTTTA GTTTTTCCCA 


TATAACTAAC 


1620 


CATACTACTA 


TCACTTACAT 


ATGGGAAGTC 


CTCATAATAT ATTACTTTAT 


AACGCATAAA 


1680 


TTCAAGCGCC 


CTTCCAATAC 


TATTCACAAA 


AACATGAGCA ACATGGTCAC 


CAAGTGAAAG 


1740 


CGGACAATAT 


ACGACACATT 


TGTCGTCTAA 


ATGCATTAAC AGCTCTTTTA 


TGATATCATT 


1800 


CTTTAATGTG 


TCCTCATTTT 


TTAATTCACT 


ATAGATATGA CGGTATAGAA 


AATTGCCATT 


1860 
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TCTATCTTTC CTATACACAC ATTCATAGTA CGATAAGTGT CTAAAATCAC ATTGTAGACC 1920 

TTCACAACCT AACCTGTCTT CTTTCTTCCT TTCTTCAATC GGATATTTCC CAAGGTTACA 1980 

CAACTTATGA AATTGCTTAG CAGAGGGCTG TAGCTGTTGG CTCAAAGGCT AACCAGAAAA 2040 

TATAGTAATA ACAAGTACAA TTTCTCCTTC TGAAGTTAAT TTTGAAATAT AATCACCACA 2100 

GGAAAAAATT GCGTCATCTA AATGTGGAGA TAAAAAGATA TACTTAGTAT TCTTACTCAT 2160 

AACCATTCCC TCTACAATTT ATCTAAAAAC TCACTAACTG TCTCATTAAA TTCCACATCA ~ 2220 

TCAAAAAAAT TCACCTTATT CTTAATAATG AATATTTCGT TAAATAAACA TATATATAAA 2280 

TATTTCAATA TCCTTTCAAT ATCATCCTCT AAATTCTCCT CAATATTTTG TATCAGCCCA 2 340 

TTTACAATCT TATTAAAAAA GATAAGCTCT TTATCTCTAA AATTAAATAT TTTCATACAA 2400 

CTGTTGTATC GAAAAATATA TAAAATAATT TTTACTAATG TTTGAATATT TAAACAACTA 2460 

AATAAATGAG TTGTACCCGG GACACTATTT ATGTTATCAA GAACACTATC TTGAAACCTC 2520 

AACTCACAGT TCTTTTTGTG AAATTCTTTT TTATCGTTTA GATCTGATAT TTTTTTAGAC 2580 

ATTTCAACAA TCTCAGACAT TTTATATGGA TATCTAGGAT GAATGCCAAA ACTATGCAAA 2640 

ATCAACTGCA CCCCAAAAGT TAGACAGAAT AAATCTAACT TTTGGGGTGC AGTTCATAAG 2700 

ATTGGGATAT TTTTTTTTAC CTAGAACTAG TAGAAATATA TAGTCAAATA ACAGATACCT 2760 

TAAGGGTTTC TCATCTACAT AAAAAAATGA TACTTTTTTC TCTTCAGTAA TTACCTCATA 2820 

AGCTTCACAA TAGAATCTCA TGTTTCCCTC CCCTATATTC TTAAATAAAA TCCTTTGGAA 2880 

ATTGATATAT CTTAGTAAAA TATTGTTTAA GTTCCGGATG CGGAGCATCG GTAACAATAA 2 940 

TGACAGTCAA ATCCTCTCTA TCTAATATCT TACGTTCAAT CGCTAACGAA GTTCTCCTAT 3000 

CGATAGCAGA AGTTCCCTCG TCAATTAATA CTATTTTCTT ATTTCTAATT AGCCCTCTAG 3060 

CTAAAGTAAT TTTTTGTTTC TGCCCTCCTG ACAGTAATCT CCCATCATCA CCAACATAAT 3120 

AATCTAAAAT CTTATTAGGA AAATCTTTTA CACTCAAACC AACTTGCTCT AAAGACTGTA 3180 

GTATTTCTTC ATCAGTATAA TTTTCTTCCA ATAAAATATT ATCTCTAATC GTACCTTCAA 3240 

ACAAATAAGC TTTTTGATCT ACATATAGAA CATTCGAAAC CATATTTAAA TAGGAGGTTT 3300 

TTTTTATATC ATCCCCGCAG AATCGCAATT CTCCACTATA ATCTCTCAAA AAGCCATTCA 3360 

ATAATTTTAA TAATGTAGAT TTCCCGCTTC CACTTTCACC TAAAATTAAA TACTTTTCAT 3420 

TACGTTGAAA ACAAAAATTT AAGTTTTTTA ATATTTCTTT ATCTCCATAC TTATAGCAAA 3480 

TATTTTTTGC TTCATATAAC GGAAAATCTC TATTCACCTC ATTTGGTTCG ATATCATTCA 3540 

TTTTATTTGA CTCAATTGGA TTAATTGAAT ACAATTTTAA AAAAATAGGC TTCGTACCAA 3600 
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TAATAGAGGA TAATTGACCT CCTAATTCAC CTAGCGCTGT AAAAATAACA CCTGTTAGTG 3 660 

CTCCTATTGC TTCAATAGTA CCAATTTTCA CTATTCCTTT TATTGCAAGA TAGCCTGTTA 3720 

AAAAAACGAG AGATATCTGA AAAAAAATAT TGAGAAAGAA GCTAATAGCG CCTGCTAACG 3780 

TTTCTACAGT TGTCTTTCTT TGTATAACCA TCTTTAATAA AATTCCTGCT T C TTTAATTT 3 840 

TCTTAGGCAA TACATATAAA AGATTCAAGG ACGCTAACAC ATCAAATCCA TTCAATATAG 3900 

TCTCACTAGA TTTTAAAAAA GCTTCATTTT GGTTAGTTAA ATTTAGACTA ACTTCTCGCA 3960 

TTTTCGATCC AAAGATTTTT GGTACAAGTA GCATAATCAT TAATGAAAAC AAGGTGGCTA 4020 

CAGTCAATGA CCAATGATAG TGATTAAGAG TCACAACTGC AAATATAGTA CCAGAAATTC 4080 

CTTTTATTAC TAAAAAAAGT TGTTTAAACG CCTGATCATT TAAAGTCTGA ACATCATTAT 4140 

TTAGCCACCA AAGATATGTT CCTGATCATT TACTATGAAA TTCTTGATAG GTAGAGTTAG 4200 

AGATGTCTGT GGCAACTCTA TTTCGAATCT CT AG ATT AAA CTCTTGGATC ACTTCAACCT 4260 

GATAATTTTT CACTACCCAG TCAAGGAATA TTATCCCACA CCAGACAATC ATTTGGTAGA 4320 

TTGACAATTT CAAAAACCGC TCTAAATTCA TCGCAATTAA TTCATTCAAC ACCAGAGCAT 4380 

TAATAGTTGC TGCATAAATT AGCAATAATT GACCAGCAAC AATAAATATC GTTAATAAAC 4440 

TAAATTTTTT TATATTTGAT TTTATAATAG TATACACAAT ACTTTCTCAC TTTCTAAATT 4500 

TTAATTGAAC ATA G TTTTCA TATATACAAT AGAAAAAACC AAAATGATAT AATAACATAT 4560 

ATTTCAAAAA AGAAATTCGT TAAAAATTTT TTCTTCTCTT GCCTTCTT G A TTACTTTTAA 4620 

AGCCTTGCAT TTGTCTCCTA TTAATAGTAA CCGCTTTATG TTTAAAGAAT AATATTTCTT 4680 

TGTAACCAAT ATTCTCTCGT TGAAACTCAA TAAATTAAAA TATTTCCTAC AGTAATTATA 4740 

ATATTCTTCA TCTGCATTAA TTGTTTTTTG TGTCACTCCA GTGATACCGT TTTCTTTACT 4800 

GTGAGCGTAG TAATTCACCA AGAATTCTCG CACTATATCA ATTTGGTATC CTTGAACAAG 4860 

TAGTTTTAAT AAAACAACAC CGTCCTGATG TGAATCTATT TTCTCAAAAC CATTAATTAA 4 920 

TTCTAGCACC TCTTTTTTAC ACAACCAAAA TGACGTACCT GCTATATTGT GAACCATTTG 4 980 

AACAAACAAG GGATTTCCAA CAAAATCGGT CTTCTCCTCT TCTCGTGTAC CATTTGGATA 5040 

AATTATTATT CCATAACTAC AAACTAAAGC TAAATTCTTC ATTCTACTCT TTTTAAAACA 5100 

AGCCATCAAC TTTAAAATTC GATCTGGCAT ATATTCATCA TCATCGTCTA AAAATGATAT 5160 

ATACTTACCT CTAGAATTTT TGATACCTAT GTTTCTGGCA TTAGTTGCAC CTAAATCTTC 5220 

ATTACTTAAA ATTAACTTAA TTCTATGATT GCTATAGCCA AATTGATGGA TAATTTTATT 5280 

TCTTAAATTT ACATTACTAT AATTATCATC AATAATTATA ACTTCGATAT TTTTATAACT 5 340 

TTGATGTAAA CAACTTTTCA CAGCTCTAAT CAGAGATTCA TACCTATTAT GTGTTGGTAT 5400 
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TATAATACTT ACTAATTCTT GATCTATATT CCTATCCATG ACTACTCTTC TCTAATAATT 5460 

CATCATATAC TCTCATGGTT TCTACAAACA TTTTTTGCAC AGAAAAATCT TTTCTTATTT 5520 

TTGATTTACT ATTCTCACCT ATATATTTCA AATACTCAGA ATCATTGAGT AAAAAATTAG 5580 

CACAAGCACA CACTCCCTCA ACATCTTCCT TCTCAAATAA AAATCCATCA ACCCTATGTT 5640 

CAATAATTTC ACTTAACCCG CCAACATTAC TAGCTAAAAC CGGAGTTCCT TGTGACATTG 5700 

' ACTCTAAAAC ACACAf AGGT ATTCCTTCTG" TATCAGAACG " AATATACAAT AAATCCCATA 5760 

TTTGGTAAAC TATAGTAGCT GGATAGATTT CACCAAGTAA CCTGAAATTA TCTCTACATT 5820 

TCAAATGGCA AATTTTTTCT TTCAAAGCAG CCCACATACT ACCATTTCCA GCCATAATAA 5880 

AAATCACATC TTCTCTGACT AAAAATAATT TTTCTGCAAA TTCAAGGAAT CTATCCGGCC 5940 

TTTTTTCTGG ATCCAACCTT CCAACATAAC AAATGATTTT TTCTTATTTG GAATACAAAA 6000 

TTCTTTTTTA AAGTCTTGAA CACCTACTAC ATCTAAATCG CTATTTGATA CATTAATTCC 6060 

GTTATTTATT CCAACTATCT TCTTATTTTT TATTATACTC TCCAATCTTT TTTTTCATAG 6120 

TTTCACATAC ACAAATAAAA GCATCTCCCA TAGAATATGT CCAAAAATCA AAATAAGTCA 6180 

ACAATTTCTT TTTTAAGTTA TATTCAACCC ATCCATGGCA TGTTATCACT GTCTTAACCT 6240 

TTCCAAATCC ATTCTTGTCA AGTTTTTTTA ACATATATAA AAAATAATTA GTTGAGTAGC 6300 

CATGACAGTG TATAAGTTGG ATTTTTAATA ATTTTAAAAT ATTTTTAACG TGTAAGGCAG 6360 

TTTCAAAATT ATTTGAACAT TGAGTACAAT CAACATACCC AATATCTAAA TTTTTATAAT 6420 

CATCAATAAC CTTTGAATCT CTAGATACAA TTATCAAAAT AGGGAATAGA GACA 6474 
(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4792 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
TATTTAACGA TTTTTTTCAT GTCATTTCCT CCAAAATAGA ATACCTTATA ATCTTAACAG 60 

AAAAAGAGCA TTTACGCCAT TATATGATAT CTATCTCTGT GATAAGTTTT TTTTATGGGT 120 

AATTTAAAAG ACCAAACGCA AGATGGCAAT CAAGACCACT CCAAAGAGAA CTCTTCCCAC 180 

TAGATTGCGG TAGCGAAAGG CTACCCAAGC TGTTGGAAAG ACGGCTAAGA AGTCCAGTCA 240 

TTTGATTTGA GGAAGACTGC CAACCTTACC TGTCACTACG CTTGAAAGAA TCAGGGCAAA 300 
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GATAATGGAA ACAGGCAAAA ACTTCAAAAA ACGCTCAACA ATCGCAGCCA GGCCCTTATA 360 

CTTGACCAAG ATGAAGGGAA TCATACGGGG AATCCAAGTC ACCAAGCCAG AGAAAATAAC 420 

TGCTAATAAA AGATACTTAC TGACCATCTA AAACCACCCC CATGCTACAA CCAAGTAGCG 480 

TCGCAAACAG AACAGCTAGT GACTGAGACA TCACTGTCAA GAGCAAAAAG AAGGACACCG 540 

CAACAACTGC TAGGATAATG AGCAGATTGC GGACAGGAAT CCCTCTTTGC ATAATCTGAA 600 

ATTGCGAAGC AAAATACCAA TAAACATCCC AACCAGGGCA AAATCCAAGC CAAAGATTTC 660 

TGCATTTGCT AGCAGGCCAC CCAGAGCCGT TCCGACTACT GTCCCCACAA ACCAAGCCAC 720 

ATAGCTGTTA AGATTGTTTC CGTGCATCCA CATAGGATTT ACCTTGTCTG TATGGGCCAA 780 

TTCACCCATC AAAACGCCAT AGGTCTCATC TGTCAAGATA CT AG AC AT AC CGATATTGTA 840 

CCAAAGACTG GTATGACGGA AATAAGTCGA TGCGTGTAAA CTCAACAAAA AGAGACGCAA 900 

GTTGATTAGA AAAACCGTCA TAGCAATAGC TCCCACAGGA GCTTGAACCA CAATCAGTGC 960 

CAACATGGCA . AACTGGGCAC TCCCAGCATA AACAAAGAGA CTCATCAAGC CCATCTCAAC 1020 

AGGTGTCACA TAGGGCGCAC CGATAATTCC ACAGGCCAGG CCGATACTGA CATAGCCAAG 1080 

AGCCGTTGGC ATGGCTGCCT GCGCCCCCTC CTAAAATCCT TTTTCTTTCA TCTTTCTCCT 1140 

CATATTGTCT TAATAATACT CAATGAAAAT CAAAGACCAA ACTAGGAAAC TAGCCGCAGG 1200 

TTGCTCAAAA CACTGTTTTG AGGTTGCAGA TAGAACTGAT GAAGTCACCT CAAAACACTG 1260 

TTTTGAGGTT GTGGATAGAA CTGACGAAGT CAGCTCAAAA CACCGTTTTG AGGTTGTGGA 1320 

TAGAACTGAC GAACTCAGTA ACCATACCTA CGGCAAAGTG AAGCTGACGT GGTTTGAAGA 1380 

GAGTTTCGAA GAGTACAAGT AGGCTGAAAA GAATCCAACC ACAGCATGGA CT ATT AT AT A 1440 

GCAGATTGAA ATAAGATGAG AACAAATCGA TTGGGAAAGT AAAATTAATT TCTATAAATG 1500 

TTTTAGCAAT TGTTTCGTAC TATTTTAGAT TCAGTCTATT ATAACACATT CAGAAAACAG 1560 

AAAAAAGTCT GTTGATTTTG ACCATCATAA AAAGACTGGC AATCCAGTCT CAAACATATA 1620 

TTATAGAAAT TCTCCACTAA ATACTTTCAC GAATATTCAG AAGCATAACA AAGGCAACTA 1680 

GAAGAAATAG CAATAAAACA AAGCTAACTG CCAGAGTTCC AAAGCTAGTA GCAATGCTTA 1740 

CCAAACCTAT TGTAAATAAG CTAGGTAAAA CAACCGTAAT GGCACCGATA GAGGATTGAA 1800 

CTGCTCCCAT TGACTCCTCA GGTATTTGTT TAAAAACGAG TTCTTGCAAT CTAGGAGAGA 1860 

GAACACCTGC GAAAAAGGCA TCCAAGGTAC TAAAGATGAG AATCCAGTCA AAACGAACTG 1920 

TGGCAAATCC TACTAGAAGA AGCAACTGGA TGACAAGTGA GGCATAGAGA GCTGTTTTTA 1980 

TGGAAATGGT ATGTTGCAGA TAGCCACTTA CAAGGCTTCC GACAATCACG GCTGATAATT 2040 

CTAGTGTGGC TAACAAGGCA AGAGATTGAC CAGTTTGTAA ATTCAAAAAG GGCTGGTTCC 2100 
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TTAAAAATAG AGTGCAAATA GGAACCGTAA CATTTATCAC TGCTTGACTA GTACAGATAA 2160 

TAAACAAAAC CAAGAGCACC TTATTCATAT TCCATATCAA TTTCGATGAT TGGAGCAAAT 2220 

GCTGGCAAAA GGATTTTACA GAGAGTCCTT CTTGATACCT AATCGTTTTT TCTACTTTCA 2280 

AGAGGTCAGT TTTTATGAAG AGGATACCTA AAAATGCGAT TAAAAAGGTA AGAGCGTTCA 2340 

GTAAGGAAAT AAACTGCATG GATAGAATGC CTAGTAAGAC TCCTCCTAGG ATATTACTGA 2400 

~ "TTCTPTTTi^-TAAACt " 460 ' 

GCCCAATAAT TCTAATGAAA ATCGCAGTGA GCATGGCGCC TGAAAAATAA CTCAATGTGT 2520 

CAGACAACAG GTTAATCAGA CAAATAAATG CTACTAGCAA CAAGGAGAAA GACTCCCCTG 2580 

AAAGTGATAA AGACACTATA GAGTAAAGCA AAAATTTTGC AAAACTAATG ACTGTGTATT 2 640 

TCAAGACACG ATGATGTTGA AAATCCGCCA AAACTCCCAG AAAGATTTGT AGAACTTGGG 2700 

GCAGGGTTTC TGAAATCGTG ATGAGTAAAA TCGCCAAAGG GGCAAAACAT GCATCTGCCA 27 60 

CATAATTCAG GAAGGCCAGA TAAAAAATCG TATCCCCAAG CGTTGAAATC CACTGGTTGA 2820 

TAGTTAATTG CCTAAAATCT CTATTTTGAA GAAATACTTT CATCACAACT CCTTCTTAAG 2880 

TTCAAATGGG AATCTTTCCC CAAGGATAGA CCGCGATACT ACTAACAACC AAAATTACAG 294 0 

TAACATCAAA AGCTGACCAA TGCCATTGTA GACTATATGC AGTCCAATAG GCCAATAAAT 3000 ' 

TGACTTTGTC ATTCTAAATA AGACTGCAAA TATAAGACCT CCACCCATAT AGAAGACAAA 3060 

GTCTGTCAAG ACCCAACCGT GATTACTAAT GTGCCAGACC CCAAATAAAA CAGCGGAACC 3120 * 

AAGTACATCT AGCCCCCATT TCTTTCCTTT TTCCAGAGCA GTCATCACTA ATCCACGATA 3180 

AATCATGTCT TCAAAAATGG GACCTGCAAT CACAGGATAA AAAAAATACA TCAAAAATGC 3240 * 

TGTAGCCCCT GTAAAAGTCG GAGCAGCATG TTGATAAGAA ATTTCATTTC GAGTAGGTGG 3 300 

GAAAAGAAAA AAGGTAACGA AATTCCAAAC AACAAAAGCA AGCAGAGCTA GGAAGGAATA 3360 

GAAAAGATAG GATCCTTTAA ACTTTCTACT ATTGATTTTC TCCCATTTCC CCGACCAAAT 3 420 

CATAGCAATA AGAGCAAATA AAACCACAAG AAAATTCAAC ATCATATCCG ACAGATAATA 3480 

GGCAAAGTCA GATAGCCCAG TAACAAGGTC GCTGCGTAAA ACTAGAACAC TGAACTTCTG 3540 

GTCAGCAATA ACTAGTAGAA AAACTATAAT AAAGTAGCGG TGTGAGATTA TCTTTTTCAT 3 600 

ATATCACCTT TCTAATATCC AAATACCAAT AAAGTAACAA TGAGTAAGAA ACTATTCCAT 3660 

GAAGCATGCA GAGCTATAGC CCAATAGATG GATCGGGTGT AGCGAAACAT CATACAAAAT 3720 

ATCAAGCCCA TTCCAAAATA CTTTATGAAA TCTGTCGTTA TCCAACCATA CTGCAAAACA 3780 

TGCATACCGC CAAATATGGC AGCGGAAACA AGAACATCAA GATAGTATCT CTTAACTTTA 3840 
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GATAAACTTC TCATCAAAAG ACCACGACAA ACAACCTCTT CTGATACACG TGCGATAATA 3900 

CT ACT AT AAA CTATTCGCGT AACAAAATAG CTAATTCCTG TTAAATTGGT GGCTACTTCT 3960 

ACGACTGTAC TTCCATTCTG GGTACGAGCA AAGATATAGG . TTGTTAGATT TGCCCACACG 4020 

AACAATAAGA AAAAAGAAAG AAGGAAAACA CCCAGGTAAG ACCAACGAAA CTGGAAACGA 4080 

CCACACTCTT TCCAATGTTC ACTTTTGACA AAAGCAATTG TAGCTATAGT TCCCAGAATA 4140 

AGTACCAATA AAACTTGGAA CACATAGTAC ATATTATCAG ACAAAGCAAC CATAAAATCT 4200 

AAGTCTGATC TGACATTAAA AATGAGGTAA TAAGTCAAAA TCAACAAGCC AGTTGCTAGG 4260 

TGAAATTTCA CTTCTTTCAT TTTCTTCATC CTATTATCTC CTATAAGAGC CTATCTTCTA 4320 

CGGCGGCCAA ACAATCCATC TGCTAAATCT ATAGTCCAAT CAAAAGCTCC ACGATTAGGA 4380 

CTCATCCCTT GATTGCCCCA ACCAGGGTAA ATTCCTGGGA CGCCCCAACC AGATATACCA 4440 

CTTCTTCCAC CACCTCCCAT AGAATTTACG AGGTTGCCTC CTCTAACATC TTGCAACTCA 4500 

GCTTCTGTCA ATTCCATTGT TTCTGCAAAT TCTAAATTTA ACATCTTTTA CACTCCTTCA 4560 

ATTATCTTCA TTTGTAAACC ACTTCTGCGA CCTAGGATTT GCTTCAAGTG CTTTACAAGT 4 620 

ACAGTATAAC ACGAACATTG GCTTATTTTA GAAAATCGCA TATTTGATAT TTTTTCTTAT 4680 

AGAAATTTCA GATTTGCGAT TTTGGTGAAT TTGATTACTT CTCTGGTATA ATAAAGTTAC 4740 

TACTAATGAG GAGTCGAGAA ATATGAAGAA ACAAATTTTA ACATTATTGA AA 4792 
(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

CCGTTCTCGG CGACGGCCAT CTGATGAAGC TATTTATGAG GGAAACTGGC AAGCTGGAGA 60 

GTCAGAGTAT CTAGTCTTTC ACCGATTGCT GTGGCAGCAG ATGTGCAGGG AAAAGGAGTT 120 

GCTCAAACCT TCTTAGAGGG CTTGATTGAA GGTTTTGATT ATCTTGATTT TCGCTCAGAT 180 

ACGCATGCTG AAAACAAGGT TATGCAACAT ATTTTTGAAA AACTTGGTTT TAAACAACTC 240 

GGTAAGATGC CAGTAGATGG CGAACGCTTG GCCTATCAAG AATTAAAGAA ATAATGCAAA 300 

AGAAGTATGT AAAAATCCTC TACTCCTCAC CAATTGGTAT TCTATCACTT GTAGCTGATG 360 

ACCATTATTT GTATGGAATT TGGGTTCAGG AGCAGAAGCA TTTTGAGAGG GGACTAGGAG 420 

ATGAAACGAT AGAAGAAGTT GTTAGTCATC CTATTTTAGA CCCAGTTATT GCTTGCTTAG 480 
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ATGATTACTT TAAAGGCAAG CCTCAGGATT TATCCAACTT GCTCTTGGCG CCAATCGGAA 540 

CGAATTTTGA AAAGAGAGTT TGGGACTATT TACAGGGCAT TCCTTATGGT CAGACAGTGA 600 

CCTATGGACA AATTGCTCAA GACCTGCAAG TGGCTTCTGC TCAAGCAATT GGTGGAGCAG 660 

TGGGACGCAA TCCTTGGTCT ATCCTAGTAC CTTGTCATCG TGTGTTGGGA GCAGGCAACC 720 

CTCTGACAGG TTATGCTGCA GGAGTGGAAA AGAAAGCTTG GCTCTTGGAG CATGAAGGAC 780 

TAGATTTTAA AGATAGAAGC AATAGAAGGA" GAAGCACATG TTACAATTTA TCGAATACCC ~ 840 

CAAATGTTCA ACTTGTAAAA AAGCAAAACA AGAATTAAAT CAATTAGGTG TCGACTATAA 900 

AGCCGTCCAT ATCGTGGAAG AAACACCTAG CCAAGAAGTC ATTTTGAATT GGCTAGAAAC 960 

CTCAGGATTT GAATTGAAGC AATTTTTCAA CACCAGTGGT ATCAAATACC GTGAATTAGG 1020 

GCTAAAAGAT AAGGTAGGAA GTTTGTCAAA CCAAGAACCG GCTGAGTTGC TAGCAAGTGA 1080 

CGGTATGTTG TTAAAACGCC CCATTTTAGT AGAAAATGGA ACTGTTAAGC AAATCGCTTA 1140 

TCGAAAATCT TATGAGGAAC TGGGACTGAA ATAGTTTTTA TCTATCTCTT TGATAGATAA 1200 

AATATATAAC TTCCCTGTTT CAAAGTATGA TAAACTAGTA GCTAGACAAA GTCTGTATCT 12 60 

GACCGTAGCA AATAATTTCA TTGACGGCAG AAGCATGGTA GCATGAATCA TTATCAGAAG 1320 

AGGATGTTTT TATGAATGTT ACAACGATTT TAGCATCAGA TTGGTACCAA AACTTGATGC 1380 

AATTGATTCC GGATGGCAAG CTGTTTAGCC TACGTTCGGT CTTTGATGGA ATCCCTAGAA 1440 

TTCTCCAACA ACTTCCAACA ACAATTATGT TGACAATTGG TGGTGCCCTT TTTGGCTTGC 1500 

TTTTGGCGCT TCTTTTTGCC ATTGTGAAGA TCAATCGTGT CAAGATTTTA TATCCCTTGC 1560 

AGGCCTTCTT TGTTAGTTTC TTAAAAGGGA CACcGATTTT GGTGCAACTC ATGTTGACCT 1620 

ACTACGGAAT CCCTTTGGCT TTGAAAGCCC TCAATCAGCA ATGGGGAACT GGTCTCAATA 1680 

TCAATGCGAT TCCAGCTGCA GCTTTTGCGA TTGTCGCCTT TGCCTTTAAT GAGGCAGCTT 1740 

ATGCTAGTGA AACCATTCGT GCAGCCATTC TCTCAGTTAA TCCTGGTGAG ATTGAGGCGG 1800 

CACGCAGTCT GGGTATGACC CGAGCGCAAG TTTATCGACG AGTGATTATT CCTAATGCAG 1860 

CGGTGGTACC TACTCCAACC TTGATTAATT CCCTCATCGG TTTGACCAAG GGAACATCTC 1920 

TAGCTTTTAG TGCGGGTGTT GTGGAAGTCT TTGCCCAAGC TCAGATTCTA GGTGGAGCTG 1980 

ATTATCGCTA TTTTGAACGC TTCATCTCCG TTGCCCTTGT TTATTGGGTA GTCAATATCG 2040 

GAATTGAAAG CCTCGGTCGT TTCATCGAGA GAAAAATGGC TATTTCTGCA CCTGATACAG 2100 

TGCAACAGAT GTGAAAGGAG ACCTTCGTTA ATGATTAAGA TTTCGAATTT AAGCAA 2156 
(2) INFORMATION FOR SEQ ID NO: 158: 
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U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3140 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi> SEQUENCE DESCRIPTION: 


SEQ ID NO: 


158: 






GTATCTCTAC ACATGTCTTC AATCGATTTT 


GTTGTCCTCC 


AATTTAATTC 


CTTATATGCT 


60 


TTGTCTGCAT TTGCATAACA AGTTGCAACG 


TCTCCTGAAC 


GTCTTGGAAC 


TATTTTATAA 


120 


GGAATAGGGA TCTTATTAAC ACTTTCAAAT 


GTATTTACAA 


GTTGTAATAC 


ACTAGTGCCT 


180 


TCTCCCGAGC CTAGGTTATA GATATAAACA 


TCTGTTTTTT 


CAGATACTTT 


TTCTAAAGCT 


240 


TTTATATGTC CTATTGCTAA ATCTACTACA 


TGGATATAAT 


CACGCACACC 


AGTACCATCA 


300 


ACCGTATCAT AATCATTTCC GAACACACTT 


AGCTCTGATA 


GCTTACCTAC 


CGCTACTTGT 


360 


GCAATATAAG GCATCAAGTT GTTAGGAATT CCTGAGGGAT 


CTTCCCCAAT 


CAAACCAGAC 


420 


TCATGAGCAC CAATTGGATT GAAATAACGA 


AGCAACGCAA 


TACTCCATTC 


TGAATCTGCC 


480 


ACATGAACAT CTTTTAAAAT TTGCTCAAGC 


ATCACTTTCG 


TATACCCATA 


AGGATTTGTC 


540 


GCACTTGTTT GCATCGTCTC AATTAGAGGT 


GACTGATTGT 


TAATTCCATA 


TACAGTCGCA 


600 


CTTGAAGAAA AGACAATCTT TTTAACATTA 


AATTCTGACA 


TCACTTCAAC 


AAGTGCCAAT 


660 


CTACTCATAA TATTATTTTT GTAGTACATC 


ACAGGCTTTT 


GCACGGATTC 


TCCGACAGCT 


720 


TTATAACCTG CAAAATGAAT TGCAGCATCA 


ATCGATTCTT 


GTTCAAATAC 


CTTTCTCAAT 


780 


GCTTGTTTAT CACAAACATC TAATTCGTAA 


AACACGGGAC 


GTATTCCTGT 


AATTGCTTCA 


840 


ATACGGTCTA GCACCAAGAT GCTAGAGTTC 


GAAAGGTTGT 


CGACAATGAT 


AACTTCCTTT 


900 


CCTAAATTTA GTAATTCTAC TACGGTATGG 


CTACCAATAT 


AACCAGCTCC 


GCCTGTTACC 


960 


AATATTGCCA TCTGGGTTTC CTCCTAATTA 


ATTCCAACCG 


ACTTAACAAA 


TCTCATAAAC 


1020 


GCTTCATGCC CAGACGGTGT ATTCTTATAA 


ACTCCTGCAT 


CTTCCAGAAC 


TCTCGCAAAC 


1080 


ACTTGTCCTG CTTCGTGTTG AACTACGCTA 


TTAACCTCTT 


CTTTATTAAT 


GCGAGGATAT 


1140 


TTTTCTTTCA ATTGGTCGGC CCATTCTAAA 


TGATAATCCG 


CAATTGCATT 


ATCCTCTCCT 


1200 


AAAAGATATT TTCCAACTTC TTCTAACTCT 


GGTTTCAAAC 


GAGGTGGTAA 


TATCGCAAGT 


1260 


CCCATCACTT CGATTAACCC GATATTTTCC TTTTTAATAT GTTGTACATC 


TTGATGAGGA 


1320 


TGGAAAACAC CATCTGGGTA TTGTTCAGTA 


GTATGATTAT 


CTCTTAGAAC 


AATATCTAAT 


1380 


TCGTATCTCC CGTCCACTTT ACGAGCAATA 


GGAGTCACCG 


TATGGTGTGG 


GACATCTTCA 


1440 


GTCATAGCAA TGATGTCTAC TTCTAAATCT 


GAATATTCTC 


TCCACTTATT 


TAGAATTTTA 


1500 
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GTAGCTAAAT 


CTAACAAGCG ATTTTTATTT TCACTTTGTA ACCTAATTAC TGACATTGGC 


1560 


CATTTTACAA 


TACCAGCATT 


AACATCCTCA AAGTCTTTAA AACAAAATTC 


ACTCTCAAAT 


1620 


TTTGCTTTTT 


CCATTGGGAA 


AATATGTTTC CCTCCCTGGT AGTGGTTATG 


ACTAACAATG 


1680 


GAGCCTCCTG 


AGATAGGAAG 


ATCAGAATTT GAACCAGCAA AATATCCTGG CAAAATATCA 


1740 


ACAATCTCCA 


ATAATTGTTC 


AAATGTTTTA GAGGTAATAG CCATTGGTAC 


ATGTTGACTA 


1800 


TTCAAAAATA TCGCATGCTC ATTAAAGTAT GAGTAGGGAG AATACTGGAA 


TCCCCATACT 


I860 


TCGTCACCAA 


GTTTCAACCG 


AATAATTCTA TGATTCGAAC GTGCTGGATA 


ATTTATTCGC 


1920 


CCCTGATATC 


CTTCATTTTC 


CATACATAGT AAACATTTGG GATAATTAGT 


TCCTTTTACT 


1980 


AATTTTTCAG 


CAGCAATTGT 


TTTTGGATCT TTTTCGGGTT TTGACAAATT 


TATCGTAATC 


2040 


TCTAGCTCTC 


CGTATTTAGT 


TGATGCTCGA AACTCAATAT TCTTAGCAAT 


AGCAGAAGTT 


2100 


TTAATATAAT 


CACTATCTTT 


ACTTAACTTA TAAAACTCTT CAACTGCTTC 


TTGAGGTGAT 


2160 


ATATCATATG 


AACTCCAAAA 


AATATCATTT AATCGACTAG GTAAAGGAAC 


TATGAAATTC 


2220 


ATTAACTCTG 


CTCCTAAACA 


TTCCTTTTCC TCGATTAAAT CTTTAATTTT 


ACCGTTTTTT 


2280 


AAGGCGATTT 


CCACTAAGTA 


ATCTTTTATT TGTTTCAGGT CATTTTCATC 


GGAAATGCGA 


2340 


TCAATTCCCT 


CCTCACCTAT 


TAACGCTAGT ACTCTATTTT TCACATATAT 


TTTGTCAATT 


2400 


TCATTATACA 


TTCCGTATTC 


AATTACTCTA TCAACAAAAT TATCAATAAT 


TGTTTTCATA 


2460 


TATTTTTCTT 


TCTAATTTAT 


GTTCCCATAT TTTCTATACA TTATCCATTT 


ATAAATTGCT 


2520 


TGCGTAGTAT 


GAGCAATTTT 


ATCAAGGTGA TCAATAATAT CTAAAGCACT 


AATTACTTCA 


2580 


GAAACGTTCC 


CATCATCTTC 


AAATATGTAA TTCATTATTT TCTTTTCCAT 


ATTTATACTA 


2640 


AGCTCTTCTA 


TCTCATTCTG 


TTTTTGTATA ACAACCATAT CTAAACATCC 


AGATTGTTCC 


2700 


TCTCTATAAC 


AAGATATAGC 


CCTATTCATA TGCAGTCCGA TAACTTCATG 


AAGTATTTTT 


2760 


ATTTTTGAAA 


TAATTTTCTT 


CAAAATTTCA TTATTTTGAA GAATCTGTAG 


ATTTTTTAAA 


2820 


ATTTCAACAA 


TTCTATCCCC 


AATACGTTCA ATGTCAGTTG ATATTTTTAT 


TACACTAATA 


2880 


ATTCTTCTTA 


AGTCATATGA 


AACAGGATGT TGTAAACAAA TTAACTCATA 


TCCTTTTTTA 


2940 


TCAATATTTA 


GAACTGACTC 


ATTTATGATT AAATCTTCTT TAATCAATTC 


TACTCGTTCT 


3000 


TCATTTGATA 


AATATTCAAA 


TAACTTCTCA TATTTATCAA GCACAGATAC 


CCAAATGGTC 


3060 


TCTAAATTAT 


TTGATAATTC 


TATAATTTCA TTTTCTAAAT ATAACCTTAA 


CATTTAGCTA 


3120 


CCTCTTCTTA 


ACAAAGTTCG 






3140 


(2) INFORMATION FOR SEQ ID NO: 159: 
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U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9048 base pairs 
(8) TYPE; nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


159: 






CCGGATGATT TCCTGGTCAG 


ATAGGGGGAA 


AGTGACTTCC 


TCAGCAATCG 


CGCGTAGAGT 


60 


AOGAI I CCCT 


TCACGGATAA 


TATCGTTCAT 


ATCAATTAAG 


TGAGCAGCTT 


TTGTAATACG 


120 


TTCTATTGCA 


GACATTTTCT 


CTCCTTATAT 


TATGTTTAGT 


GCAGTTACCT 


ACTGCCAAAG 


180 


CCCAAGTGGT 


ATACTTGGAA 


TAAGCCACTG 


TGGATTAGTT 


CATTTTCTTT 


CATTACCTCT 


240 


ACATGATATC 


ACAAAATGAC 


AAGAATTGAA 


AGCATTATGG 


CATTTAGGAT 


TTATAGAAAA 


300 


TAGATAGGAA 


GTTCAATTCA 


ATTGTGAAAG 


AAATACTTAT 


CTGTGATATA 


ATAAAAAGAA 


360 


AAGGCTTGCA 


TAAGAAAGTA 


GGGAGAACGA 


AGATACAAAG 


AAGACAAAAT 


CGAAATCAGG 


420 


GTGGTTTAGC 


TTTTCGTTTT 


ATGAAGGGCT 


TGCTAAACTT 


TTTAGGAGTT 


ATCGCAAGTG 


480 


GAGCAATAAG 


GGATTTGTGG 


CGATACTCTT 


GCTAGCAGTT 


GGTTTATCAA 


TGCGCTTGGT 


S40 




GAAAGCTTCC 


AAGGAATCCC 


TTGACTAGTC 


AAAAACGAGA 


TACTATTTCT 


600 


CAAGAGGGGA 


CTAAGCAAAA 


GTCTCAGGAG 


TAGGAAGAGG 


AAAAAACTGC 


CAGAATTATG 


660 


GCCCACGGGG 


ATTTGCTCTA 


CCACGATGGA 


CTTTTCTTTT 


CAGCTAAAAA 


AGAAGACGGT 


720 


ACCTATGACT 


TTCATGAAAA 


TTTTGAGTAT 


GTGACTCCTT 


GGCTCAAGCA 


AGGGGACTAA 


780 


GCAGCAGATT 


TAGCTATTGG 


TGATTTTGAA 


GGAACCATTA 


ATAAGGATCA 


TTATTTAGCG 


840 


GGTTATCTTC 


TCTTTAATGC 


TCCTGTTGAA 


GTTATGGATG 


CTATTAAGGA 


GGCAGGTTAT 


900 


CATGTGCTGG 


ATTTAGCTCA 


TAATCATATT 


TTGGATTCGC 


AAATTGAGGG 


AGTTATTTCA 


960 


ACGGCCGATA TTATTGAGAA AGCTGGAATC ACTCCAATCG GAGTTTATAC 


GCACGAACCA 


1020 


CGTGATCAGG 


CTCCGCTGGT 


CATTAAGGAA 


GTGAATGGTA 


TCAAGGTTCC 


ATTGTTAGCC 


1080 


TATTCCTATG 


GTTTCAATGG 


AATTGAGCAG 


TATATTTCTC 


AGGAAGACTA 


TAATCGTTAT 


1140 


CTTTCAGATT 


TAAACGAAGA 


TAAGATGAAG 


GTTGAAATTG 


AACGGGCAGA 


GAAGGAAGCA 


1200 


GATATCACCA TTATCATGCT 


TCAGATGGGT 


GTTGAGTATC 


GATTGGAACC 


AACTGAAGAA 


1260 


CAAAAAGCTC 


TTTATCACAA 


GATGATCGAT 


TTGGGAGCGG 


ATATTATCTT 


TGGAGGGCAT 


1320 


CCTCACGTTG 


TTGAACCATC 


TGAAACGGTT 


GAAAAAGATG 


GAGATAAGAA 


ACTCATTATC 


1380 


TATTAAATGG 


GGAACTTCAT 


TTCCAATCAA 


CGAATTGAAT 


CTATGGGAGA 


TGAAGAGAAT 


1440 


GCTAAGTGGA 


CTGAACGTGG 


TGTTCTCATG 


GATGTCACCA 


TCAAGAAGAA 


GGATGGAAAA 


1500 
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ACAACTATCG 


GAACAGCTAA 


AGCTCATCCT 


ACTTGCGTCA 


ATCGAACACC 


AAAGGGAACC 


1560 


TTTTCACCAG 


AAGGATATCC 


CTTGTATCAT 


TACCAAACTT 


ATATTTTGGA 


ACATTTTATA 


1620 


GAGGATGGCA 


GTCATCGTGA 


CCAGTTAGAT 


GAAGCGACTA 


AGGAACGAAT 


TGATACAGCC 


1680 


TATAAAGAAA 


TGAATGAACA 


TGTGGGATTG 


AAGTGGTATT 


AGCTTGAATC 


CAGAGGAAAG 


1740 


TAAATGATGA 


TTAAGGTAAT 


TGCGACAGAT 


ATGGATCGGA 


CCTTGCTGGA 


TGCTAGAGGT 


1800 


^CAGCTTGATC 


TCCCACGATT 


GGAAAAGATT 


TTAGATCACT 


TGGATCAAAG 


GGGCATTCGT 


1860 


TTTGTCATTG 


CGACGGGCAA 


TGAAATTCAC 


CGCATGAGAC 


AACTACTGAG 


TCCCTTGGTG 


1920 


GATCGAGTGG 


TTCTGGTTGT 


TGCTAATGGC 


GCTCGTATTT 


TTGAAAACAA 


TGAATTGATT 


1980 


CAGGCTCAGA 


CATGGGATGA 


CGCCATTGTC 


AACAAGGCTT 


TGACTCATTT 


CAAGGGTCCA 


2040 


' GCGTGTCAGG 


ACCACTTTGT 


TGTAACGGGG 


ATGAAGGGTG 


ATTTTGTCAA 


GGAAGGTACG 


2100 


ATTTTTACAG 


ATCTTGAAAC 


TTTTATGACT 


CCAGAAATGA 


TTGAAAAATT 


CTACCAACGG 


2160 


ATCCAATTTG 


TGGATGAATT 


AACATCTGAC 


CTCTTTGGTG 


GTGTGCTCAA 


GATGAGCATG 


2220 


GTTGTTGGTG 


AGGAACGTTT 


GAGTTCGGTT 


TTGGAAGAAA 


TCAATGCTCT 


CTTTGATGGC 


2280 


CGTGTCCGAG 


CTGTATCCAG 


TGGCTATGGT 


TGCATTGATA 


TCCTCCAAGC 


TGGGATTCAT 


2340 


AAAGCATGGG 


GCTTGGAGGA 


ATTACTCAAG 


CGCTGGGACT 


TGAAATCCCA 


AGAAATCATG 


2400 


GCTTTTGGTG 


ATAGTGAAAA 


TGATGTTGAA 


ATGCTTGAAA 


TGGCTGGAAT 


TGCCTATCCG 


2460 


ATGGAAAATG 


CTGATGAGAA 


AGCCAAAGCT 


GTGGCGACTG 


CTCTAGCACC 


AGCCAACAGC 


2520 


CAAGGAGGAG 


TTTATCAAGT 


CTTGGAAAAC 


TGGTTAGAAA 


AAGGAGAATG 


AAGTGGCAGT 


2580 


ACAGTTATTA GAAAATTGGC 


TCCTAAAGGA 


ACAAGAAAAA 


ATTCAAACTA 


AGTATCGTCA 


2640 


CCTAAATCAC 


ATTTCTGTTG 


TAGAACCAAA 


CATTCTTTTT 


ATTGGGGATT 


CCATTGTCGA 


2700 


GTATTATCCT 


CTACAGGAGC 


TATTTGGGAC 


TTCAAAGACG 


ATTGTCAATC 


GAGGAATTCG 


2760 


TGGCTATCAG 


ACAGGACTGT 


I At, 1 AUAL»AA 




CATCTATATG 


GTGGAGCAGT 


2820 


AGATAAAATT 


TTTCTTCTGA 


TTGGGACAAA 


TGATATCGGA 


AAGGATGTTC 


CTGTGAATGA 


2880 


GGCTCTCAAT 


AATCTCGAAG 


CTATCATTCA 


ATCCGTTGCT 


CGCGATTATC 


CATTGACAGA 


2940 


GATTAAATTG 


CTTTCCATTT 


TGCCTGTCAA 


TGAGAGAGAG 


GAGTACCAGC 


AGGCAGTCTA 


3000 


TATCCGCTCG 


AATGAAAAAA 


TTCAGAACTG 


GAATCAAGCC 


TATCAAGAGC 


TTGCATCTGC 


3060 


CTATATGCAG 


GTGGAATTTG 


TGCCAGTATT 


TGATTGTTTG 


ACAGACCAAG 


CAGGCCAACT 


3120 


CAAAAAAGAA 


TATACAACTG 


ATGGACTGCA 


CCTCAGTATT 


GCTGGTTATC 


AGGCTTTGTC 


3180 


AAAATCCTTG 


AAAGACTATC 


TTTACTAAAT 


AGCTAAATAA 


TGTTAAATTT 


GAGCATAATA 


3240 
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TCTTGTAAAA AATTCTAAAA TCCTTTAAAA TAAAAAGTGA CGGAGGAATT TATGAATGTA 3 300 

AATCAGATTG TACGGATTAT TCCTACTTTA AAAGCTAATA ATAGAAAATT AAATGAAACA 33 60 

TTTTATATTG AAACCCTTGG AATGAAGGCC TTGTTAGAAG AATCGGCCTT TCTGTCACTA 3420 

CCTGACCAAA CGCGTCTTGA AAAGCTGGTT TTAGAAGAAG CTCCCAGTAT GCGTACTCGT 3480 

AAGGTAGAGG GAAGAAAAAA ACTAGCTAGA TTGATTGTCA A6QTGGAAAA TCCCTTAGAA 3540 

ATTGAAGGAA TCTTATCTAA AACAGATTCG ATTCATCGAT TATATAAAGG TCAAAATGGC 3600 

TACGCTTTTC AAATTTTCTC ACCAGAAGAT GATTTGATTT TGATTCATGC GGAAGATGAC 3 660 

ATAGCAAGTC TAGTAGAAGT AGGAGAAAAG CCTGAATTTC AAACAGATTT GGCATCAATT 3720 

TCTTTAAGTA AATTTGAGAT TTCTATGGAA TTACATCTCC CAACTGATAT CGAAAGTTTC 3 780 

TTGGAATCAT CTGAAATTGG GGCATCCCTT GATTTTATTC CAGCTCAGGG GCAGGATTTG 3840 

ACTGTGGACA ATACGGTTAC CTGGGACTTA TCTATGCTCA AGTTCTTGGT CAATGAATTA 3900 

GACATAGCAA GTCTTCGCCA GAAGTTTGAG TCTACTGAAT ATTTTATTCC TAAGTCTGAA 3960 

AAATTCTTCC TTGGTAAAGA TAGAAATAAT GTTGAATTGT GGTTTGAAGA AGTATGAAGT 4 020 

GGACCAAGAT TATTAAAAAA ATAGAAGAAC AAATCCAGGC AGGGATTTAT CCCGGAGCCT 4080 

CTTTTGCGTA TTTTAAGGAC AATCAATGGA CAGAGTTCTA TTTAGGCCAG AGTCACCCAG 4140 

AGCATGGCTT GCAGACTGAG GCAGGACTAG TTTATGACCT AGCTAGTGTC AGCAAGGTTG 4200 

TTGGGGTTGG CACAGTTTGT ACCTTCTTGT GGGAAATAGG TCAATTAGAT ATTGATAGAC 4260 

TGGTAATAGA TTTTTTACCT GAGAGTGATT ATCCAGACAT CACTATTCGC CAGCTCTTGA 4320 

CTCATGCAAC AGACCTTGAT CCTTTTATTC CTAATCGTGA TCTTTTAACA GCCCCTGAAT 4380 

TAAAGGAAGC GATGTTTCAT CTCAACAGAC GAAGTCAGCC ACCCTTTCTT TATTCGGATG 4440 

TCCATTTTTT GCTGTTGGGC TTTATTTTGG AAAGAATTTT TAATCAAGAT TTGGATGTGA 4500 

TTTTAAAGGA TCAAGTCTGG AAACCTTGGG GAATGACGGA AACTAAGTTT GGGCCAGTTG 4560 

AGCTTGCTGT TCCAACAGTT AGAGGTGTAG AGGCAGGCAT AGTGCATGAT CCCAAGGCfC 4 620 

GTCTCCTGGG TAGACATGCT GGGAGTGCTG GTTTATTTTC GACTATAAAG GATTTACAAA 4 680 

TCTTTTTAGA ACACTATTTA GCAGATGATT TTGCAAGAGA CTTAAATCAA AATTTTTCTC 4740 

CTTTGGATGA CAAGGAACGT TCTTTAGCAT GGAATTTGGA AGGAGATTGG CTAGACCATA 4 800 

CGGGCTATAC AGGTACCTTT ATCATGTGGA ATCGTCAGAA GCAAGAAGCC ACTATTTTCC 4860 

TATCGAATCG TACCTATGAA AAGGACGAGA GAGCTCAATG GAT ATT AG AC CGCAATCAAG 4920 

TGATGAACTT GATTCGCAAA GAAGAGTAAG GAGAGACATG TCAAATAGTT TAAAAGGGAC 4980 

TTTACTAACA GTTGTGGCTG GTATTGCTTG GGGGTTGTCA GGAACGACTC GCCAATACCT 5040 
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AATGGCACAC 


GCAATTTCGG 


CTCTGGTCTT 


GACTAACTTC 


CGTCTTTTAA 


TCCCTGGTGG 


5100 


AATTCTCATG 


CTCTTGGCTT 


ATGCTACTCC 


AAAGGATAAA 


ATACTGGTCT 


TTTTAAAGGA 


5160 


TAGAAAGAGT 


TTGCTGTCTC 


TTCTTATTTT 


TGCTCTGATT 


GGTCTTTTTC 


TCAACCAATT 


5220 


CGCCTATCTG 


TCTGCTATTC AGGAGACCAA TGCGGGAACA GCGACGGTGC TTCAGTATCT 


5280 


TTGTCCTGTC 


GGAATTTTAA 


TTTATAGCTG 


TATCAAGGAT 


AGGGTGGCAC 


CGACACTGGG 


5340 


AGAGATAGTT 


TCCATCATAT 


TCGCCATCGG 


AGGAACCTTC 


CTGATCGCAA CACATGGGCA 


5400 


GTTGGACCAG 


TTATCCATGA 


CACCTGCTGG 


TCTGTTCTGG 


GGTCTCTTTT 


CTGCCTTGAC 


5460 


TTATGCTCTG 


TATATCATTT 


TACCCATAGC 


CTTGATTAAA 


AAGTGGGGGA 


GCAGCTTGGT 


5520 


CATTGGTGTG 


GGAATGGTCA 


TAGCAGGTTT 


GGTCGCCCTT 


CCTTTTACAG 


GGGTTCTACA 


5580 


GGCCGATATC 


CCGACTAGTC 


TTGATTTTCT 


CCTTGCGTTT 


GCAGGCATTA 


TCCTTATCGG 


5640 


GACTGTCTTT 


GCCTATACAG 


CTTTCCTTAA 


AGGAGCCAGT 


CTGATAGGAC 


CGGTCAAGTC 


5700 


AAGCTTGTTG 


GCTTCAATTG 


AGCCAATATC 


GGCGATTTTC 


TTTGCCTTCT 


TAATAATGAA 


5760 


TGAACAATTT 


TATCCCATTG 


ATTTTCTTGG 


TATGGCAATG 


ATATTGTTTG 


CTGTAACTTT 


5820 


GATTTCTTTG 


AAAGATTTAT 


TCTTAGAAAA 


ATAAAAAAGA 


CTCTTTGTCC 


GTGACAGAGA 


5880 


GTTTTTGCGT 


GGTAATCTAA 


TTATTTTCAA 


GATAAAATTC 


AAAGCGTTCG 


CCTACATATT 


5940 


GACTTTTTAC 


GTATTCAAAA 


GCAGTACCAT 


CTTCTAGGTA 


GGAAACCTGG 


GTCAATCCAA 


6000 


GAATAGCATG 


TCCTTTTTCA 


ACTTCCAAAT 


AGTGGGCAAT 


CTTTTCTTTA 


GCAAGGCGAG 


6060 


CATACATGGT 


V- iu I lunun * 




\» A 1 AVn. LATu 


TTTTTGCAAG 


GTTTGGAAGA 


6120 


AATGACTGGT 


GATTTCTTCT 


TTTTTAAAGT 


CCTTAATGAA 


TTTTTCAGGA 


ATAGAAGCAA 


6180 


CTTCATAAAC 


TAGGGGAACT 


TGGTCGGCAT 


AGCGGACCCG 


CTCCATTCGC 


ATAATATTGT 


6240 


CCGTTGGAAA 


AATTCCTAGC 


TTGGCAACTT 


CTTGCTCATT 


GGGAATGGTT 


TTTTTGTAGG 


6300 


AAATGAGCTG 


GCTAGAGGGA 


ACTTTACCTT 


GGGATTTGAC 


AATTTCAGTA 


AAACTGGTTG 


6360 


TCCCTCGCAT 


CTTTTCTTGT 


ACTCGAGTAC 


TGGAAACAAA 


GGTGCCGCTT 


CCTACACGGC 


6420 


GCTCTAAGAC 


GCCTTCTTCG 


ACTAATAGAG 


ATACGGCTTG 


GCGGAGGGTC 


ATGCGACTGA 


6480 


CCGCAAACTG 


CTCAGCTAAA 


TCTCTTTCAC 


TGGGAAGCCT 


CTCACCAATA 


GCCCAACGGT 


6540 


ACTCGTCAAT 


ATCCTTTTTT 


ATCTGATCAT 


GGATTTTTAT 


ATAAGCAGGT 


AGCATATTTT 


6600 


TCACTTCATT 


TCTATCTTTT 


CTCTATTGTA 


CCCCAATAAA 


CTAGAAAAAG 


TCAAACTTCG 


6660 


CCTTGTTTAG 


TTGGTAATTC 


GCCCTTATTT 


GTGATAGAAT 


ATTGAGAAAA GATATTTCTT 


6720 


TTGAGAAAGG 


AAAAAGATGA 


GCAACATTTC 


AACTGATTTG 


CAAGATGTAG 


AAAAAATCAT 


6780 
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CGTATTGCAC TATGGTAGCC AGTACAACCA GCTGATTTCA CGCCGTATCC GTGAGATTGG 6840 

TGTTTTTTCA GAACTAAAAA GCCATAAAAT TTCAGCTGCT GAAGTTCGTG AAGTCAATCC 6900 

TGTAGGAATT ATTCTATCAG GTGGTCCAAA TTCTGTATAT GAAGATGGTT CATTTGATAT 6960 

TGACCCAGAA ATCTTCGAAC TCGGAATTCC AATTTTGGGA ATCTGTTATG GTATGCAGTT 7020 

ATTGACCCAT AAACTTGGAG GAAAAGTTGT TCCTGCAGGT GATGCTGGAA ATCGTGAATA 7080 

CGGTCAATCA ACCCTAACTC ACACACCATC AGCGCTTTTT GAATCAACAC CTGATGAACA 7140 

GACTGTTTTG ATGAGCCATG GTGATGCGGT TACTGAGATT CCTGCTGACT TTGTTCGTAC 7200 

AGGTACATCA GCTGACTGCC CATACGCAGC CATCGAAAAC CCAGATAAAC ACATTTACGC 7260 

TATCCAATTC CACCCAGAAG TTCGTCATTC TGTATACGGA AATGATATCC TTCGTAACTT 7 3 20 

TGCCCTTAAC ATTTGTAAGG CTAAAGGTGA CTGGTCAATG GATAATTTCA TTGACATGCA 7380 

GATCAAAAAA ATTCGTGAAA CCGTCGGTGA TAAACCTGTC CTTCTTGGTC TATCAGGTGG 744 0 

TGTTGACTCA TCTGTCGTTG GGGTTCTTCT CCAAAAAGCG ATTGGCGATC AATTGATCTG 7 500 

TATCTTCGTA GACCACGGTC TTCTTCGTAA AGGCGAAGCT GATCAAGTTA TGGACATGCT 7560 

CGGTGGTAAG TTTGGTTTGA ATATCGTCAA AGCAGACGCT GCTAAACGTT TCCTTGACAA 762 0 

ACTTGCTGGC GTTTCTGACC CTGAACAAAA ACGTAAAATC ATCGGTAACG AGTTTGTCTA 7680 

TGTATTCGAT GACGAAGCAA GCAAGCTCAA AGATGTGAAA TTCCTTGCTC AAGGTACTTT 7740 

ATATACAGAT GTTATCGAGT CTGGTACGGA TACAGCTCAA ACTATCAAGT CACACCACAA 7800 

CGTGGtGCTC TTCCAGAAGA TATGCAGTTT GAATTGATTG AACCACTCAA TACTCTTTAC 7860 

AAGGATGAAG TTCGTGCTCT TGGTACAGAG CTTGGTATGC CAGACCATAT CCTATGGCGC 7920 

CAACCATTCC CAGGACCAGG ACTTGCTATC CGTGTCATGG GTGAAATCAC TGAAGAGAAA 7980 

CTTGAAACCG TTCGTGAATC AGACGCTATT CTTCGTGAAG AAATCGCTAA AGCTGGACTT 8040 

GACCGCGATA TTTGGCAATA CTTCACTGTT AACACAGGCG TTCGTTCAGT CGGTGTTATG 8100 

GGTGACGGTC GTACGTATGA CTACACGATT GCAATCCGTG CTATCACTTC TATCGATGGT 8160 

ATGACTGCTG ATTTTGCCAA AATTCCATGG GAAGTACTTC AAAAAATCTC AGTACGTATC 8220 

GTAAATGAAG TGGATCATGT TAACCGTATC GTCTACGATA TTACAAGTAA ACCACCTGCA 8280 

ACAGTTGAGT GGGAATAATC GCAAAAAAAT TAAAAGCTTT GTAAAATCAA CGGTTACAGA 8340 

GGATTAAAAA CTGTAACTGG GATTAAAACG GGAACATTTG CTAAAAAGAA TAAATTGAAT 8400 

AATAGTTCCA AGTGGTTTAC ATTTGGACAA AAAATTAGAC CGTAGTTTTC AAGCTGCGGT 8460 

CTTTTGATAT ATATAATGAG AATTAATGGC TCTTTGTCAA CTCTAGTGGG TTGAAGTCAG 8520 

CTAAGCTCGA GAAAGGACAA ATTTTGTCCT TTCTTTTTTG ATATTCAGAG CGATAAAAAT 8580 
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CCGTTTTTTG AAGTTTTCAA AGTTCCGAAA ACCAAACGCA TTCCGCTTGA TAAGTTTGAT 8640 

GACATTATTG GTCCCTTCCA ATTTGGCGTT AGAATAGTGT AGTTGAAGGG CGTTGACGAT 8700 

TTTCTCTTTG TCCTTTAGAA ACCTTTTAAA GACAGTCTGA AAAAGACGAT GAACCTGCTT 8760 

TAGATTGTCC TCAATGAGTC CGAAAAATTT CTCCGGTTCC TTATTCTGAA AGTGAAACAG 8820 

CAAGAGTTGA TAGAGCTGAT AGTGATGTTT CAAGTCTTGT GAATAGCTCA AAAGCTTGTT 8880 

TAAAATCTCT TTATTCGTTA AATGCATACG AAAAGTAGGG CGATAAAAAT"GTTTATCGCT 8940 ~ 

GAGTTTACGA CTATCCTGTT GTATGAGCTT CCAGTAGCGC TTGATAGCCT TGTATTCATC 9000 

AGACTTTCGA TCCAATTGAT TCATGATTTG AACACCCACA CGACTCGG 9048 

(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10399 base pairs 
IB) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

GTACCTTTAT TGATGAATGG ACTGTTTAAA TCAGTACCAC GCCAACCAGA TATGCTTTCT 60 

GAGTTTCGTA GTTTGATGTT TTTAGGTGTT GCCTTTATTG AAGGAACTTT CTTTGTAACT 120 

CTTGTCTTCT CATTTATTAT CAAATAAATA CATGGAACGA GAACAAAAGG GAGGATTTTA 180 

GATGGAAGAA ACTATTAATC CAATCATCTC TATTGGTCCT GTTATCTTCA ATCTGACTAT 240 

CTTACCCATG ACTTTGTTGA TTGTGGGAGT TATTTTTGTC TTTATTTATT GGGCAAGCCG 300 

CAATATGACC TTGAAACCCA AAGGAAAGCA AAATGTACTT GAGTATGTCT ATGACTTTGT 3 60 

TATTGGATTT ACAGAACCTA ACATTGGTTC GCGCTACATG AAAGATTACT CACTCTTTTT 420 

CCTTTGTTTA TTCCTTTTCA TGGTGATTGC CAATAACCTT GGCTTAATGA CAAAGCTTCA 480 

AACGATCGAT GGGACTAACT GGTGGAGTTC GCCAACCGCT AATTTACAGT ATGACTTAAC 540 

CTTATCTTTT CTTGTCATTT TGTTGACACA TATAGAAAGC GTTCGTCGTC GTGGATTTAA 600 

AAAAAGTATA AAATCTTTTA TGAGTCCTGT TTTTGTCATA CCGATGAATA TCTTGGAACA 660 

ATTTACAAAC TTCTTATCTT TGGCTTTGCG GATTTTTGCG AATATCTTTG CAGGAGAGGT 720 

CATGACGAGT TTGTTACTTC TTCTTTCCCA CCAAGCTATT TATTGGTATC CAGTAGCCTT 780 

TGGAGCTAAT TTGGCTTGGA CTGCATTTTC TGTCTTTATT TCCTGCATCC AAGCTTATGT 840 

TTTTACTCTT TTGACATCTG TGTATTTAGG GAATAAGATT AATATTGAAG AGGAATAGAA 900 
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AGGAGTAACT GATGCACGTA ACAGTAGGTG AATTAATTGG TAATTTTATT TTAATCACTG 960 

CCTCTTTTAT TCTTTTGCTA GTCTTGATTA AAAAATTTGC ATGGTCTAAT ATTACAGGCA 1020 

TTTTCGAAGA AAGAGCTGAA AAAATTGCTT CAGATATTGA CAGAGCTGAA GAAGCCCGTC 1080 

AAAAAGCAGA AGTATTGGCT CAAAAACGCG AAGATGAATT GGCTGGTAGC CGTAAAGAAG 1140 

CTAAGACAAT CATTGAAAAT GC AAAGGAAA CACCTGAGCA AAGTAAGGCT AATATCTTAG 1200 

CAGATGCTAA ACTAGAAGCA GGACACTTAA AAGAAAAAGC CAATCAAGAA ATTGCTCAAA 1260 

ATAAACTAGA AGCTTTACAG AGTGTTAAGG GTGAGGTCGC AGATTTGACC ATCAGCTTAG 13 20 

CTGGTAAAAT CATCTCACAA AACCTTGACA GTCATGCCCA TAAAGCACTC ATTGATCAGT 1380 

ATATCGATCA CCTAGGAGAA GCTTAATGGA CAAGAAAACA GTAAAGGTAA TTGAAAAATA 1440 

CAGCATGCCT TTTGTCCAAT TGGTACTTGA AAAAGGAGAA GAAGACCGTA TCTTTTCAGA 1500 

CTTGACTCAA ATCAAGCAAG TTGTTGAAAA AACAGGTCTG CCTTCTTTTT TAAAACAAGT 1S60 

GGCAGTAGAC GAGTCGGATA AGGAAAAAAC AATTGCTTTT TTCCAAGATT CTGTGTCGCC 1620 

TTTATTACAA AACTTTATCC AGGTTCTGGC CTACAATCAC AGAGCAAATC TTTTTTATGA 1680 

TGTGCTTGTA GATTGCTTGA ACCGACTTGA AAAAGAAACA AATCGATTTG AAGTGACGAT 1740 

TACGTCTGCT CATCCTCTAA CTGATGAACA GAAGACTCGT TTGCTCCCTT TGATTGAGAA 1800 

AAAAATGTCT CTGAAAGTAA GGAGTGTAAA AGAACAAATC GATGAAAGTC TCATTGGTGG 1860 

TTTTGTCATT TTTGCCAATC ACAAGACAAT TGATGTGAGT ATTAAACAAC AACTTAAAGT 1920 

TGTTAAAGAA AATTTGAAAT AGAAAGTGGT GTTCTTTTGG CAATTAACGC ACAAGAAATC 1980 

ACCGCTTTAA TTAAGCAACA AATTGAAAAT TTCAAACCCA ATTTTGATGT GACTGAAACA 2040 

GGTGTTGTAA CCTATATCGG GGACGGTATC GCGCGTGCTC ACGGCCTTGA AAATGTCATG 2100 

AGTGGAGAGT TGTTGAATTT TGAAAACGGC TCTTATGGTA TGGCTCAAAA CTTGGAGTCA 2160 

ACAGACGTTG GTATTATCAT CCTAGGTGAC TTTACAGATA TCCGTGAAGG CGATACAATC 2220 

CGCCGTACAG GGAAAATCAT GGAAGTCCCT GTACGTCAAA GTCTGATTGG TCGTGTTG'iG 2280 

GATCCGCTTG GTCGTCCAGT TGACGGTCTT GGAGAAATCC ACACTGATAA AACTCGTCCA 2340 

GTAGAAGCAC CAGCTCCTGG TGTTATGCAA CGTAAGTCTC TTTCAGAACC ATTGCAAACT 2400 

GGTTTGAAAG CTATTGACGC CCTTGTACCG ATTGGTCGTG GTCAACGTGA GTTGATTATC 2460 

GGTGACCGTC AGACAGGGAA AACAACCATT GCGATTGATA CAATCTTGAA CCAAAAAGAT 2520 

CAAGATATGA TCTGTATCTA CGTCGCGATT GGACAAAAAG AATCAACAGT TCGTACGCAA 2580 

GTAGAAACAC TTCGTCAGTA CGGTGCCTTG GACTACACAA TCGTTGTGAC AGCCTCTGCT 2640 

TCACAACCAT CTCCATTGCT CTTCCTAGCT CCTTATGCTG GGGTTGCTAT GGCGGAAGAA 2700 



WO 98/18931 



PCT/US97/19588 



1039 

TTTATGTATC AAGGTAACCA TGTTTTGATT GTATACGATG ATCTATCAAA ACAAGCGGTA 2760 

GCTTATCGTG AACTGTCCCT CTTGCTTCGT CGTCCTCCAG GTCGTGAAGC CTTCCCAGGG 2820 

GATGTTTTCT ATCTCCACAG CC G TTTGCTT GAGCGCTCAG CTAAAGTTTC TGATGAACTT 2880 

GGTGGTGGAT CAATTACAGC CCTACCATTT ATCGAGACAC AAGCAGGACA TATCTCACCC 2940 

TATATCGCAA CCAACGTGAT TTCTATCACT GATGGACAAA TCTTCCTTGG CGATGGCCTC - 3000 

TTCAATGCAC GTATTCGTCC AGCCATCGAT GCGGGTTCAT CTGTATCTCG TGTAGGTGGT 3060 

TCTGCACAAA TCAAAGCCAT GAAGAAGGTT GCTGGTACAC TTCGTATCGA CCTTGCTTCA 3120 

TACCGTGAGT TGGAAGCCTT TACTAAGTTT GGTTCTGACT TGGACGCAGC AACACAGGCT 3180 

AAGTTGAACC GTGGACGTCC TACCGTTGAG GTCTTGAAAC AACCTGTTCA CAAACCATTA 3240 

CCTGTTGAGA AACAAGTAAC CATTCTTTAT GCTTTGACAC ATGGTTTCTT GGATACTGTT 3 300 

CCAGTAGATG ATATTGTTCG TTTCGAGGAA GAGTTCCATG CCTTCTTTGA TGCTCAACAT 3360 

CCAGAGATTT TGGAAACCAT TCGTGATACA AAAGACTTGC CAGAAGAAGC AGTCTTGGAT 34 20 

GCTGCGATTA CAGAGTTTCT CAATCAATCT AGCTTCCAAT AAGAATAGAG GTGTCAGATG 3 4 80 

GCAG7ATCTC TAAATGATAT TAAAACAAAA ATCGCCTCAA CAAAAAATAC GAGTCAAATC 3 540 

ACTAATGCCA TCCAAATGGT ATCGGCTGCT AAGCTAGGTC GTTCTGAAGA AGCTGCTCGC 3600 

AACTTCCAAG TTTACGCTCA GAAAGTGCGT AAACTTTTGA CAGATATCCT TCATGGTAAT 3660 

GGAGCTGGTG CTTCAACTAA TCCGATGTTG ATTACCCCTT CTGTGAAGAA GACAGGCTAT 3720 

ATCGTTATCA CTTCAGACCC CGGTTTGGTT GGAGGTTATA ATTCCTCTAT TTTGAAAGCT 3780 

GTTATGGAGT TGAAAGAAGA ATACCACCCA GACGGTAAAG GTTTTGAAAT GATCTGTATC 3 840 

GGTGGGATGG GAGCTGATTT CTTTAAGGCT CGCGGTATTC AACCACTTTA TGAATTACGT 3900 

GGCTTGTCAG ACCAACCTAG CTTTGATCAA GTTCGTAAGA TTATTTCAAA AACTCTTGAA 3960 

ATGTACCAAA ATGAACTCTT TGATGAGCTT TATGTTTGCT ACAACCACCA TGTCAATACG 4020 

CTAACCAGTC AAATGCCTGT GGAACAAATG CTTCCGATTG TTGACTTGGA TCCAAATGAA 4080 

GCGGATGAAG AGTACAGCTT GACTTTTGAA TTGGAAACCA GCCGAGAAGA AATTCTGGAG 4140 

CAGTTGTTGC CTCA G TTTGC AGAAAGTATG ATTTACGGTG CCATTATCGA TGCCAAGACA 4200 

GCTGAGAATG CTGCGGGCAT GACAGCCATG CAAACAGCGA CAGATAATGC TAAGAAAGTC 4260 

ATCAATGATT TGACAATTCA GTATAACCGT GCCAGACAGG CGGCGATTAC ACAAGAAATT 4320 

ACAGAAATCG TAGCAGGTGC TAGTGCCTTA GAATAGGCTC TAGTCCAGCT CGTATGAAAA 4380 

TGAACTTAGG ACCTAGTTGA GCTAGGAACC GACAGTATCT TATATAGAAT AGGAGAAGGA 4440 
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GATGAGTTCA CGTAAAATTG CTCAGGTTAT 


CGGTCCCGTT 


' GTAGACGTTT 


1 TGTTTGCAGC 


4500 


AGGGGAAAAA 


. CTTCCTGAGA 


. TTAACAATGC 


ACTTGTCGTC 


' TACAAAAATG 


i ACGAAAGAAA 


4S60 


AACAAAAATC 


GTCCTTGAAG 


TAGCCTTGGA 


GTTAGGAGAT 


' GGTATCGTTC GTACTATCGC 


4620 


CATGGAATCA 


ACAGATGGGT 


TGACTCGTGG 


AATGGAAGTA TTCGACACAG GTCGTCCAAT 


4680 


CTCTGTACCA 


GTAGGTAAAG 


AAACTTTGGG 


ACCTGTCTTC 


AACGTTTTCG 


GAGATACCAT 


4740 


TGACTTGGAA 


GCTCCTTTTA 


CAGAAGACGC 


AGAGCGTCAG 


CCAATTCATA 


AAAAAGCTCC 


4800 


AACTTTTGAT 


GAGTTGTCTA 


CCTCTTCTGA 


AATCCTTGAA 


ACAGGGATCA 


AGGTTATTGA 


4860 


CCTTCTTGCC 


CCTTACCTTA AAGGTGGTAA 


AGTTGGACTT 


TTCGGTGGTG 


CCGGAGTTGG 


4920 


TAAAACTGTC 


TTAATCCAAG 


AATTGATTCA 


CAACATTGCC 


CAAGAGCACG 


G7GGTATTTC 


4980 


AGTATTTGCT 


GGTGTTGGGG 


AACGTACTCG 


TGAGGGGAAT 


GACCTTTACT 


GGGAAATGAA 


5040 


AGAATCAGGC 


GTTATCGAGA 


AAACAGCCAT 


GGTCTTTGGT 


CAGATGAATG 


AGCCACCAGG 


5100 


AGCACGTATG 


CGTGTTGCCC 


TTACTGGTTT 


GACAATCGCT 


GAATACTTCC 


GTCATCTGGA 


5160 


AGGCCAAGAC 


GTGCTTCTCT 


TTATCGATAA 


TATCTTCCGT 


TTCACTCAGG 


CTGGTTCAGA 


5220 


AGTATCTGCC 


CTTTTGGGTC 


GTATGCCATC 


AGCCGTTGGT 


TACCAACCAA 


CACTTGCTAC 


5280 


GGAAATGGGT 


CAATTGCAAG 


AACGTATCAC 


ATCAACCAAG 


AAGGGTTCTG 


TAACCTCTAT 


5340 


CCAGGCTATC 


TATGTGCCAG 


CGGATGACTA 


TACTGACCCA 


GCGCCAGCAA 


CAGCCTTCGC 


5400 


TCACTTGGAT 


TCAACAACAA 


ACTTGGAACG 


TAAGTTGGTA 


CAATTGGGTA 


TCTACCCAGC 


5460 


CGTTGACCCA 


CTTGCTTCAA 


GCTCACGTGC 


CTTGGCACCT 


GAAATCGTTG 


GAGAAGAGCA 


5520 


CTATGCAGTT 


GCTGCTGAAG 


TAAAACCTCT 


CCTTCAACGT 


TACCATGAAT 


TGCAAGATAT 


5580 


CATTGCTATC 


CTTGCTATGG 


ATGAGCTTTC 


TGATGAAGAA 


AAGACCTTGG 


TTGCTCGCGC 


5640 


CCGTCGTATC 


CAGTTCTTCT 


TGTCACAAAA 


CTTCAACGTT 


GCGGAACAAT 


TTACTGGTCA 


5700 


GCCAGGTTCT 


TATGTTCCAG 


TTGCTGAAAC 


TGTACGTGGC 


TTTAAGGAAA 


TCCTTGATGG 


5760 


TAAATACGAC 


CACTTGCCAG 


AAGATGCCTT 


CCGTGGTGTA 


GGTTCTATCG 


AAGATGTGAT 


5820 


TGCAAAAGCT 


GAAAAAATGG 


GATTTTAAGA 


GGTGATCTAT 


GGCTCAGTTA 


ACTGTCCAGA 


5880 


TCGTGACACC 


AGATGGTCTC 


GTCTATGATC 


ACCATGCCAG 


CTATGTATCG 


GTTCGAACTC 


5940 


TGGATGGTGA 


GATGGGGATC 


TTGCCACGAC 


ATGAAAATAT 


GATTGCGGTT 


TTAGCAGTTG 


6000 


ATGAAGTAAA 


GGTAAAACGT 


ATCCATGATA 


AAGATCACGT 


GAACTGGATT 


GCAGTAAACG 


6060 


GTGGCGTTAT 


TGAAATTGCC 


AATGATATGA 


TCACAATCGT 


CCCTGACTCT 


GCAGAACGTG 


6120 


CTCGTGATAT 


CGATATCACT 


CGTGCAGAAC 


GTGCCAAACT 


TCGTGCAGAA 


CGTGCAATTG 


6180 


AAGAAGCACA 


AGACAAACAT 


TTGATTGACC 


AAGAACGTCG 


TGCTAAGATT 


GCTTTGCAAC 


6240 
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GTGCTATTAA CCCTATTAAT GTCGGAAATA GACTATAAGA AAAAATGAAC TTGAAAATAC 6300 

CAAGTTCATT TTTTATGGTG TTTTAAGGAG CAAAACGGAT GCAGACTGCT TCGGGAACAT 63 60 

GGAAGTCGTT GGAGAGTTCT GCTAGACGAC CATTGTCACA ATTACGTTTA AAGACAGTTG 6420 

CATTGTCAGA GTCTTGATGG ACAACAATGA GAAATTTTTG GTCGGGTGTC AAATCAAAAT 6480 

CACGTGGAGT CTGACCATGC GTTGGAACGA TTTCTAATAA CTCTAAGCTA CCGTCCGCAA 6540 

GCATGCTATA TACTGCGATA GAATCATGGC CACCGTTAGA AGCGTAGAGG TATTTACCGT 6600 

CTTTAGAGAG ATGAATAGCA GCGGTTCCAT TAAAGCCTTC GTAAGCTTCC GGTAAAGTTG 6660 

AAATGACCTG CATACGTTCA AATTCGCCAA CGCCATCGTA GATTAAAACT TCGATAGTAC 6720 

TATTGAGTTC ACAAATGAGA TAAGCGATTT TATAGTGGTT ATGGAAAATG ATATGCCCTG 6780 

AGCCTGCTCC TGGCTTGCTG TGATAGGTAT AGAGCTTACA TAATTTTCCT TCTTGATCGA 6840 

GGTCATAGGT GATGACTTGC TCACTTCCCA AGTCGCAGGT CACTAGATAG TGGTCAGGTG 6900 

TTAAATCTGT ATAGTGAACA TGGGGGGAAG CTTGATTTTC ATGTGGACCT TGGCCACTGT 5960 

GTTGATCCAT ATCACTAAGT AGAAGACTAC CATCTTCCTG GCGTTTATAA ACAAGGACTT 7020 

GTCCCTTGTG ATAGTTAGCT GCGTAAACCA AATCACGCTT TTCATCGACA CCAACATAAC 7080 

AGTGGGGAGC TCCTTCTTCA ACAACATGAT TTAACACAGT CCCGTCAGTT TGATAGGCTG 7140 

CAATTCCCCC CTTATCCTCT TGGCTACCAA CAGTGTATAA ATGTTGGTGC TGGTCAAAGG 7200 

CAAGGTAGGT TGGACTTGGC TCAGCTGCAA AAAGTTCTAG ATTTCAAACC TGACCAGTTT 7 260 

CTGTATCAAA GTCTGCCTTG TAAATCCCTT GAGAAGTACG ACGTGTATAA GTTCCAAAAT 7 320 

AAACAGTTTC TTTCATTACT ATACCTCTGT GTAAAGATAA GACTATTATA TCACAAAAAC 7380 

AAGTAAATTA AAGATATCCA ATT AG AT GT A AGCACTTTAA AAAAGAGTTA TTTTGTTTCA 7440 

AAAATGGTAT AATGAGAGAA CAATAGAAAG GAAGTATTTA TGGAGCAAAA AGAGAAACAT 7500 

TTTAGCCTAT CTTGGTTTTT CAAGTGGTTT TTAGATAACA AGGCAATTAC GGTATTTTTA 7560 

GTAACCTTAT TATTGGGACT GAATCTTTTT ATTTTAAGTA AGATTAGTTT TCTATTTTCA 7620 

CCTGTTTTAG ACTTTTTAGC AGTTGTGATG TTGCCAGTCA TTTTCTCTGG TTTCTTATAT 7680 

TATTTGTTGA ATCCTATTGT TGATTGGATG GACAAGCATA AGGTTAATCG TGTTATAGCT 7740 

ATCACTATTG TCTTTGTTAT CATCGCTCTC TTTATCATTT GGGGCTTGGC AGTCGCCATT 7800 

CCAAATCTGC AACGTCAGGT TTTGACCTTT GCAAGAAACG TTCCTGTTTA CTTAGAAGAT 7860 

ATAGATAGGA TTGTTAATGG ATTGGTAGCC CAGCACCTGC CAGATGATTT CAGACCTCAA 7920 

TTAGAGCAAG TTTTGACCAA TTTTTCTAGC CAGGCTACAG TTTTGGCAAG TAAGGTTTCA 7980 
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TCTCAGGCAG 


TCAACTGGGT 


GAGTGCCTTT 


ATTAGCGGGG 


CTTCTCAAGT 


CATTCTTCCC 


8040 


TTGATTATCG 


TTCCTTTCAT GCTCTTTTAT CTCTTGCGTG ATGGGAAAGG CTTGCGTAAC 


8100 


TATTTGACCC 


AATTCATTCC 


AACAAAATTG 


AAGGAACCTG 


TTGGACAAGT 


TTTATCAGAT 


eiso 


GTGAATCAAC 


AGTTGTCCAA CTATGTTCGA GGGCAAGTGA CAGTGCCTAT 


TATTGTAGCA 


8220 


GTAATGTTTA 


TCATCTTCTT 


CAAGATTATT 


GGTCTACGCT 


ATGCGCTTAC 


GCTGGCGGTT 


8280 


ACTGCTGGTA 


TTTTAAATCT GGTCCCTTAT CTTCGTAGCT TTCTAGCCAT 


GCTTCCTGCT 


8340 


CTAGTATTGG 


GTTTGATTGC 


TGGTCCAGTC 


ATGCTTTTGA 


AAGTAGTCAT 


TGTCTTTATC 


8400 


GTAGAACAAA 


CTATTGAAGG 


CCGTTTTGTC 


TCTCCATTGA 


TTTTGGGAAG 


TCAATTAAAC 


8460 


ATCCACCCTA 


TTAATGTTCT 


CTTTGTTTTG 


TTAACTTCAG 


GATCTATGTT 


TGGTATCTGG 


8520 


GGAGTTTTAC 


TTGGTATTCC 


GGTTTATGCC 


TCTGCTAAGG 


TTGTCATTTC 


AGCCATTTTC 


8S80 


GAATGGTATA 


AGGTAGTCAG 


TGGTCTATAT 


GAATTAGAGG 


GTGAGGAAGT 


CAAGAGTGAA 


8640 


CAATACTCAA 


CAGATGTTAC 


AGGCTTTGGA 


GGAGCAAGAT 


TTAACTAAGG 


CTGACCATTA 


8700 


TTTCGC C AAA 


GCTTTAGAAA 


ATGATTCAAG 


TGATCTTCTG 


TATGAATTGG 


CAACTTATCT 


8760 


TGAAGGGATT 


GGTTTCTATC 


CTCAGGCCAA 


GGAAATTTAC 


CTGAAAATTG 


TAGAGGATTT 


3820 


TCCAGAGGTT 


CATCTTAATC 


TAGCTGCAAT 


TGCTAGCGAG 


GATGGTCAAA 


TAGAAGAAGC 


8880 


CTTTACCTAT 


CTTGAGGAAA 


TCCAAGCTGA 


CAGTGACTGG 


TATGTCTCGT 


CTTTGGCTCT 


8940 


GAAGGCAGAC 


CTTTACCAGC 


TGGAAGCTTT 


GACAGATGTG 


GCACGTGAGA 


AATTATTGGA 


9000 


GGCCTTGACC 


TACTCAGAGG 


ATTCTCTCTT 


GATATTGGGT 


TTGGCAGAGT 


TGGATAGTGA 


9060 


GTTGGAAAAT 


TACCAAGCGG 


CTATTCAAGC 


CTATGCCCAG 


TTAGATAATC 


GCTCGATTTA 


9120 


TGAGCAAACG 


GGCATTTCCA 


CCTATCAACG 


AATTGGCTTT 


GCCTATGCTC 


AGTTAGGGAA 


9180 


ATTTGAAACG 


GCTACTGAGT 


TTTTAGAAAA 


AGCCCTGGAG 


TTAGAATACG 


ATGACTTAAC 


9240 


AGCTTTTGAG 


TTGGCCAGTC 


TTTATTTTGA 


TCAAGAAGAA 


TATCAAAAAG 


CCACCCTCTA 


9300 


CTTTAAGCAG 


CTTGATACCA 


TTTCTCCTGA 


CTTTGAAGGC 


TATGAGTATG 


GGTACAG1CA 


9360 


GGCTTTACAT 


AAGGAACATC 


AAGTTCAAGA 


AGCCCTGCGT 


ATCGCTAAGC 


AAGGATTAGA 


9420 


GAAAAATCCC 


TTTGAAACTC 


GCCTCTTGCT 


AGCTGCTTCA 


CAATrrrcrr 


ATGAATTGCA 


9480 


TGATGCTAGT 


GGTGCAGAAA 


ATTATCTCCT 


TACTGCAAAA 


GAAGACGCTG 


AGGATACAGA 


9540 


AGAAATCTTG 


CTTCGTTTAG 


CCACTATTTA 


TCTGGAGCAG 


GAGCGTTATG 


AGGATATTCT 


9600 


AGAATTGCAG 


AGTGAGGAGC 


CAGAAAATCT 


TTTGACCAAG 


TGGATGATTG 


CTCGTTCTTA 


9660 


TCAAGAAATG 


GACGATTTGG 


ATACTGCTTA 


TGAGTATTAT 


CAAGAGTTGA 


CAGGAGATTT 


9720 


GAAGGACAAT 


CCAGAATTTC 


TGGAACACTA 


TATCTATCTC 


TTGCGTGAAT 


TGGGACATTT 


9780 
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TGAAGAAGCA AAAGTCCATC CTCACACTTA CTTAAAACTG GTTCCAGATG ATGTGCAAAT 9840 

GCAAGAACTG TTTGAGAGAT TGTAACAATG TTTAACCCAA ATCATTCATA CCTCTCTCAA 9900 

CTAGATGTAA CTTACAAAAC CCCTGACCTC ATGAGCCACT TTCTTCCTCC TCATGACGTC 9960 

AGTTTTACTT TCTGCTGTTC CAGTATCGTT TTTCCTCGCT AGATTTCCTC AAAAGGGCAG 10020 

ACTCCTCCCT TGGTGCGTCA CACGATTTTT TCATCTCGAC TGTTCTTTAA TG CATC ATT A 10080 

ACGACGCTTT TCTTCTAGCT GCTTCATAAG GAACAGGAAG ATTCAGGTTC ACTTTTCTAA 10140 

TCCTAGAATA AAGTGCTGAA AACAATTCGG AATAGGCATA GAGACTAGAC AATTTGAGGA 10200 

GCTGCTTGCG TCCTGTTCGA ACACATTTTC CCACCACGTG AAGAAAAAGA TGGCGGAAGC 10260 

GTTTGATTGT TAAAGTTTGG AAGTCACCTC CAGCTAGATG TTTGAGAAAA AGATAGAGAT 10320 

TGTAGGCGAT ACAGCTCATC ATCATACGAA TTCGTTTTTC ATTAAGGTTG AACTATCCGT 10380 

TTTATCGCCA AAAAATCGG 10399 

(2) INFORMATION FOR SEQ ID NO: 161: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9409 base pairs 
<B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

GATAAGATTA AGTTAGAAAA GAAAGAACTA GGACATATCT AC CAG ATTC A GCTTTTTAAT 60 

AGCTATGGGC AGGAAGAAAT CTATCGTGTG ATTTTGATGG AGACCAATAT TAGTTCGGTT 120 

TCAACCAATA TCAAGTATGC TGCTGTCTTG ATTAATACCA GTCAGTTGGA ACAGGCTAGT 180 

CAAAAGCATG AGCAATTGAT TGTGGTCGTG ATGGCTAGTT TCTGGATTTT GTCTTTACTT 240 

GCCAGTCTCT ATCTAGCTAG GGTCAGTGTT AGGCCCCTGC TTGAGAGTAT GCACAAGCAA 300 

CAGTCTTTTG TGGAAAATGC CAGTCATGAC TTACGAACTC CACTCCCAGT TTTGCAAAAT 360 

CGCTTAGAGA CCCTTTTTCG TAAGCCAGAA GCTACCATTA TCGATGTGAG CGAAAGCATT 420 

GCATCGAGTT TGGAAGAAGT CCGAAATATG CGTTTTTTAA CGACAAGCTT GCTGAACTTA 480 

GCTCGGAGAG ATGATGGGAT TAAGCCGGAG CTTGCAGAAG TTCCAACTAG C TTTTTTAAT 540 

ACAACTTTCA CAAACTACGA GATGATTGCT TCGGAAAATA ATCGTGTCTT CCGTTTTGAA 600 

AATCGTATCC ATCGAACAAT TGTCACAGAT CAGCTTCTTC TGAAACAACT GATGACCATT 660 

CTTTTCGATA ATGCCGTCAA GTATACTGAG GAGGATGGTG AAATTGATTT TCTTATCTCG 720 
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GCGACCGATC GCAATCTTTA TTTACTTGTT TCTGATAATG GAATCGGTAT TTCGACAGAA 7 80 

GATAAAAAGA AAATTTTTGA CCGTTTTTAT CGAGTAGACA ACGCTACAAC CCGGCAAAAA 840 

GGTGGTTTTG GTTTAGGATT ATCCCTACCC AAGCAAATTG TAGATGCTCT AAAAGGAACT 900 

GTTACTCTCA AAGATAATAA ACCCAAGGGA ACAATCTTTG AAGTGAAGAT TGCCATTCAG 960 

ACACCATCTA AAAAGAAAAA ATAAAAATAT CGCTCCAATT GGGGCGATAT TTTGGATTTA 1020 

TCTTCTACGT TTTCCTTTCA TAATAGACCG TTGAA C TTTT AAAACAACTA AGCTGAATCC 1080 

GATTGCTGCG GCAAAGGCAA GAGCAGTTGA TAATTTTAAT CCTAAAAAGA TAAAACTAAA 1140 

GATAGCAATA CAGATACAAA AAACAGCGAT ATTAATAAAA AATAGGATTT CCTTGAGATT 1200 

GGCATCAGAT TGCGCTTCAG GTGTATAAGC TTCGTAATGA GGAAGCTGCT GGTTTAATTC 1260 

TTCTTGATAG TCTACCTCAT AGGATTGTAA TTTTCTTACG GGCATGATTC TCTCCTTAAC 1320 

AGTACATACC TATTTTATCA TTTTTTCGGC AGAGAATTAT TACAGAAAGG TTACAAAAAG 13 80 

AATAAAGTCC CTTTTCATTT TCAAAGCATG GCTGATTTTG GAGAAATGTG GTATAATTTT 1440 

TCTTATGGAA AAGATTGTCA TTACAGCAAC TGCTGAAAGT ATTGAACAAG TTGAACAACT 1500 

ACTCGAAGCT GGCGTAGACC GTATCTATGT CGGTGAGAAA GATTTTGGTC TTCGTCTGCC 1560 

AACGACCTTT AGTTATGACC AATTACGTGA AATCGCTAAG TTGGTTCATG ATGCTGGTAA 1620 

GGAATTGATC GTTGCGGTCA ATGCTCTCAT GCACCAAGAT ATGATGGACC GTATCAAGCC 1680 

TTTCTTAAAC TTCTTGGAAG AAATCAAGAC AG ACT AT ATT ACGATTGGGG ATGCAGGCGT 1740 

CTTTTACGTA CTTAACCGCG ATGGTTATTC ATTTAAGACC ATCTACGATG CTTCAACCAT 1800 

GGTAACTAGC AGTCGTCAGA TTAACTTCTG GGGACAAAAG GCTGGCCCAT CTGAGGCTGT 1860 

TTTGGCGCGT GAAATTCCAT CAGCTGAACT TTTCAAAATG CCAGAGATTT TGGAAATTCC 1920 

TGCTGAAGTT TTGGTTTACG GTGCTAGCGT CATCCATCAT TCTAAACGTC CACTCTTGCA 1980 

AAACTACTAT AACTTTACAC ATATCGATGA TGAAAAGACG CATAAACCTG ACCTCTTCTT 2040 

GGCTGAGCCA AGTGATCCAG AGAGCCACTA TTCCATTTTT GAAGATAATC ATGGGACCCA 2100 

TATCTTTGCC AACAATGACC TTGATTTGAT GATCAAATTA ACAGAATTGG TGGAGCATGG 2160 

CTTTACTCGC TGGAAACTAG AAGGGCTCTA CACTCCTGGT CAGAACTTTG TTGAGATTGC 2220 

AAAACTCTTT ATCCAAGCGC GTAGCTTGAT TCAAGAGGGC AACTTTAGTC ATGCTCAAGC 2280 

CTTCTTGCTG GATGAAGAAG TTCGTAAACT TCACCCTAAA AACCGTTTCC TTGATACAGG 2340 

ATTTTATGAC TACGATCCTG ACATGGTTAG ATAAAATACA TGATTCGTTG AGAGAAGGAA 2 400 

GATGCAAACA TTTCTTCTCT CAATTTTTCG TATTTCTTCA CTATTTTACA AAAATCAGCA 2460 

GGCTAGAATG CTCTATTCGA TGGGATTTTT AAGAAAAGTA GTGTTCTTGA GTTTGAAAAT 2 520 
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TATCCTATCT TTCCACGTCC CAAATGGCCC TTTTTTTGGT ATAATTTTTT ATAATGAAAA 2580 

CGATTGGTAA TCGCTATGTT GTGCTGGATT TAGAGGCAAC TAGCACAGGT AGTAAGGCTA 2640 

AAATTATCCA AGTGGGAATT CTCGTGATTG AGGACCGAGA AATCGTCGAT CACTATACGA 2700 

CGGATCTCAA TCCACATGAA CCCTTGCATC CTCATATCAA AGAACTGACA GGATTGACAC 2760 

ACCAACGTCT GGCGCAAGCA CCTGATTTTT CGCAAGTTCC CAGAAAAATA TTTGACTTGG 2820 

TGGAGGATGG GATTTTTGTA GCCCATAATG TTCAGTTTGA TGCTAATCTC TTGGCGCAAA 2880 

ATTTATTTTT TGAAGGCTAT GAGCTAAGAA ACCCTCGTGT TGATACCCTC GAATTGGCCC 2 940 

AGGTCTTTTT CCCTGAACTG GAAAAATATA CCTTGCCGAT TTTGTGTCGA GAATTAGGAA 3000 

TTCCTCTTAA ACACGCACAC ACAGCCCTTT CAGATGCCCA AGCTACAGCA GAATTACTTC 3060 

TTTTTTTACG GAAAAAGATG ACCCAGCTTC CTAAAGGTCT CTTGGAACGC TTGCTGGAAA 3120 

TGGCTGACGC TCTCCTATAT GAGTCCTACC TGGTTATTGA GGAAACTTAT CGCAACCAAT 3180 

CTATCCTGAG TTCTCCAGAC TTGGTCCAAG TTCAAGGTCT ATATTTTAAG AAAACGGAAG 3240 

CTTCTCTGGA GCCACGAAAA CTATCTCAAG ACTTTTCTAA AAATATTTCT CTGTTGAACC 3 300 

TTCAACTGAG GGAGGAACAA GAAAGTTTTG CTAAAGAGGT TGGCTTGCTA TTGAAAGATG 3 360 

AACCTGTCTC TCTGATTCAA GCGCCGACAG GGATTGGGAA AACCTATGGC TATCTCTTAC 3420 

CCGCTTTATC TCAATCCAAA GAGCGACAAA TTGTTCTTAG TGTTCCGACA AAGATTCTTC 3480 

AAAATCAAAT CATGGAAGAA GAAGGTAAAC GCCTCAAGGA AGTGTTCCAT ACAGATATTC 3540 

ATAGCTTAAA GGGACCACAA AATTATCTGA AGTTGGATGC CTTTTATCAT TCCTTGCAGG 3600 

AAAATGATGA AAATCGCTTA TTTACACGCT TTAAAATGCA AGTCTTGGTC TGGCTTACTG 3660 

AGACAGAGAC AGGAGATTTG GATGAAATCG CCCAACTCTA CCGTTACCAA CATTTTCTAG 37 20 

CAGACCTTCG TCATGATGCC AATTTATCAT CCCAGAGCTT ATTTGTGACG GAAGATTTTT 3 780 

GGAAACGTAG TCAAGAAAGG GCAGAGACTT GCAAGCTTTT AGTGACTAAT CATCCCTATC 3840 

TCGTAACCAG ACTTGAAGAT AATCCTGAAT TTGTCAGTGA CCGTTTACTG ATTATTGATG 3900 

AAGTCCAAAA GATTTTGTTA GCTCTAGAAA ATCTCCTTCA AGAGACCTAC CATATACAAT 3960 

CTATTATCGA TTTAATTGAT AAGGCTTTAG TAGGAGAAGA AAACAGGGTT CAACAACGGA 4020 

TACTAGAAAG TATTCGCTTT GAGTGTCTCT ACTTGATAGA ACAATTTCAG TCTGGCAAAT 4080 

CTAGGAAAAA TATCTTAGAT TCTCTGGACA ATCTCCATCA GTATTTTTCA GAATTGGAAG 4140 

TAGAAGACTT TGATGAGCTG CTTCGCTATT TTACAGCTGA AGGTGATTAC TGGCTTGAAG 4200 

TAACTGAAAC GAGTCAAAAG AAAATTCAGA TTTCTTCTAC AAAATCAGGC CGTACTCTTC 4260 
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TGTCCTCTTT 


ACTTCCTGAG 


AGTTGCCAAG 


TCTTGGGAGT 


ATCGGCTACT CTTGAGATTA 


4320 


CTCAGAGGGT 


TTCTTTGGCA 


GACCTTTTAG 


GCTATCCTGA 


AGCTAAATTT 


GTCAAGATTG 


4380 


AATCTCGGGG 


AAAACAGGAA 


CAAGAAGTGG 


TCATGGTCAA 


AGATTTCCCT 


CTGGTAACAG 


4440 


AAACCTCCTT 


AGAAGTCTAT 


GCCAGAGAGG 


TAGCTGCTTT 


ACTAGTGGAA 


ATTCAAGCTT 


4500 


TCCAGCAACC 


GATTTTGGTT CTCTTTACCG CTAAACACAT GCTTCTAGCA GTATCGGATT 


4560 


TACTTACAGT 


TAGCCACTTG 


GCCCAGTATA 


AAAATGGGGA 


TGTTCATCAG 


CTAAAGAAAC 


4620 


GCTTTGAAAA 


AGGTGAACAA 


CAAATCTTGC 


TTGGTGCAGC 


AAGTTTCTGG 


GAGGGAGTTG 


4680 


ATTTTTCAAG 


CCATCCTTCT 


GTGATTCAAG 


TTGTACCGAC 


GCTTCCTTTC 


CAAAATCCTC 


4740 


AAGAACCCTT 


GACGAAAAAG 


ATTAATCAAG 


AACTGAATCA 


AGAAGGGAAA 


AATGCCTTTT 


4800 


ATGATTATCA 


ATTGCCAATG 


GCCATTATTC 


GTTTAAAACA 


GGCTTTGGGA 


AGAAGTATGA 


4860 


GACGTGAATA 


CCAACGTTCC 


TTAACTCTTA 


TTTTGGATAG 


GAGAATCGTC 


GGAAAACGAT 


4920 


ACGGCAAACA 


AATAGTAGCA 


TCTCTAGCAG 


AAGAAGCGAC 


TGTTAAAACC 


ATCTCTCGAT 


4980 


CCGAAGTTGA 


CGAGGCTATT 


GATAGATTTT 


TTAATGAGCT 


TTGATAAATA 


GTATTGTATG 


5040 


AAAGTATAAG 


GTTAGTATAT 


ATGAAACCTT 


CTCTCGACTC 


AAGAGTCCAT 


TACAGTTTGC 


5100 


TCTTGCCAGT 


ATTTTTTCTA 


CTGGTCATCG 


GTGTGGTGGC 


TATCTATATA 


GCCGTTACTC 


5160 


ATGATTATCC 


CAATAATATT 


CTGCCCATTT 


TAGGGCAGCA 


GGTCGCCTGG 


ATTGCCTTGG 


5220 


GG CTTGTGAT 


TGGTTTTGTG 


GTCATGCTCT 


TTAATACAGA 


ATTTCTTTGG 


AAGCTGACCC 


5280 


CCTTTCTATA 


TATTTTAGGC 


TTGGGACTTA 


TGATCTTGCC 


GATTGTATTT 


TATAATCCAA 


5340 


GCTTAGTTGC 


ATCAACGGGT 


GCCAAAAACT 


GGGTATCAAT 


AAATGGAATT 


ACCCTATTCC 


5400 


AACCGTCAGA 


ATTTATGAAG 


ATATCCTATA 


TCCTCATCTT 


GCCTCGTGTC 


ATTGTCCAAT 


5460 


TTACAAAGAA 


ACATAAGGAA 


TGGAGACGCA 


CGGTTCCGCT 


GGACTTTTTG 


TTAATTTTCT 


5520 


GGATGATTCT 


CTTTACCATT 


CCAGTCCTAG 


TTCTTTTAGC 


ACTTCAAAGT 


GACTTGGGGA 


5580 


CGGCTTTGGT 


TTTTGTAGCC 


ATTTTCTCAG 


GAATCGTTTT 


ATTATCAGGG 


GTTTCTTGCA 


5640 


AAATTATTAT 


CCCAGTATTT 


GTGACTGCTG 


TAACAGGAGT 


TGCTGGTTTC 


TTAGCTATCT 


'..700 


TTATTAGCAA 


GGACGGACGA 


GCTTTTCTTC 


ACCAGATTGG 


AATGCCGACC 


TACCAAATTA 


5760 


ATCGGATTTT 


GCCTTGGCTC 


AATCCCTTTG 


AGTTTGCCCA 


AACAACGACT 


TACCAGCAGG 


5820 


CTCAAGGGCA 


GATTGCCATT 


GGGAGTGGTG 


CCTTATTTGG 


TCAGGGATTT 


AATGCTTCGA 


5880 


ATCTGCTTAT 


CCCAGTTCGA 


GACTCAGATA 


TGATTTTTAC 


GGTTATTGCA 


GAAGATTTTG 


5940 


GCTTTATTGG 


CTCTGTCCTG 


GTTATTGCCC 


TCTATCTCAT 


GTTGATTTAC 


CGTATCTTGA 


6000 


AGATTACTCT 


TAAATCAAAT 


AACCAGTTCT 


ACACTTATAT 


TTCCACAGGT 


TTGATTATGA 


6060 



WO 98/18931 



PCT/US97/19588 



1047 



TGTTGCTCTT CCACATCTTT GAGAATATCG 


GTGCTGTGAC 


TGGACTACTT 


CCTTTGACGG 


6120 


GGATTCCCTT GCCTTTCATT 


TCGCAAGGGG 


GATCAGCTAT 


TATCAGTAAT 


CTGATTGGTG 


6180 


TTGGTTTGCT 


TTTATCGATG 


AGTTACCAGA 


CTAATCTAGC 


TGAAGAAAAG 


AGCGGAAAAG 


6240 


TCCCATTCAA 


ACGGAAAAAG 


GTTGTATTAA 


AACAAATTAA 


ATAAGGAGAA 


AATCATGGTA 


6300 


AAAGTAGCAG TTATATTAGC TCAGGGCTTT 


GAAGAAATTG 


AAGCCTTGAC 


AGTTGTAGAT 


6360 


GTCTTGCGTC 


GAGCCAATAT 


CACATGTGAT 


ATGGTTCGTT 


TTGAAGAGCA 


AGTAACGGGT 


6420 


TCGCATGCAA 


TCCAAGTAAG 


AGCAGATCAT 


GTCTTTGATG 


GAGATTTATC 


AGACTATGAT 


6480 


ATGATTGTTC 


TTCCTGGAGG 


TATGCCTGCT 


TCTGCACATT 


TACGTGATAA 


TCAGACCTTG 


6540 


ATTCAAGAAT 


TGCAAAGCTT 


CGAGCAAGAA 


GGGAAGAAAC 


TAGCAGCCAT 


TTGTGCGGCA 


6600 


CCAATTGCCC 


TCAATCAAGC 


AGAGATATTG 


AAAAATAAGC 


GATACACTTG 


TTATGACGGC 


6660 


GTTCAAGAGC 


AAATCCTTGA 


TGGTCACTAC 


GTCAAGGAAA 


CAGTAGTGGT 


AGATGGTCAG 


6720 


TTGACAACCA 


GTCGGGGTCC 


TTCAACAGCC 


CTTGCCTTTG 


CCTACGAGTT 


GCTGGAGCAA 


6780 


CTAGGAGGGG 


ACGCAGAGAG 


TTTACGAACA 


GGAATGCTCT 


ATCGAGATGT 


CTTTGGTAAA 


6840 


AATCAGTAAA 


ACGGGAGTTA 


TTCTCTCGTT 


TTTTATGTGG 


AAAACTCAGG 


GAAATCATCG 


6900 


CTTTTTTCAT 


AAAAAAATGC 


TATAATGAAG 


GGTATGAAAT 


ATCACGATTA 


CATCTGGGAT 


6960 


TTAGGTGGAA 


CTTTACTGGA 


TAATTATGAA 


ACTTCAACAG 


CTGCATTTGT 


TGAAACATTG 


7020 


GCACTGTATG 


GTATCACACA 


AGACCATGAC 


AGTGTCTATC 


AAGCTTTAAA 


GGTTTCTACT 


7080 


CCTTTTGCGA 


TTGAGACATT 


CGCTCCCAAT 


TTAGAGAATT 


TTTTAGAAAA 


GTACAAGGAA 


7140 


AATGAAGCCA 


GAGAGCTTGA 


ACACCCGATT 


TTATTTGAAG 


GAGTTTCTGA 


CCTATTGGAA 


7200 


GACATTTCAA 


ATCAAGGTGG 


CCGTCATTTT 


TTGGTCTCTC ATCGAAATGA TCAGGTTTTG 


7260 


GAAATTTTAG 


AAAAAACCTC 


TATAGCAGCT 


TATTTTACAG 


AAGTGGTGAC 


TTCTAGCTCA 


7320 


GGCTTTAAGA 


GAAAGCCAAA 


TCCCGAATCC 


ATGCTTTATT 


TAAGAGAAAA 


GTATCAGATT 


/ Jou 


AGCTCTGGTC 


TTGTCATTGG 


TGATCGGCCG 


ATTGATATCC 


AAGCAGGTCA 


AGCTGCAGGA 


7440 


CTTGATACCC 


ACTTGTTTAC 


CAGTATCGTG 


AATTTAAGAC 


AAGTATTAGA 


CATATAAGAA 


7500 


AAAGGAATAA 


GATGACAGAA 


GAAATCAAAA 


ATCTGCACGC 


ACAGGATTAT 


GATGCCAGTC 


7560 


AAATTCAAGT 


TTTAGAGGGC 


TTAGAGGCTG 


TTCGTATGCG 


TCCAGGGATG 


TACATTGGAT 


7620 


CAACCTCAAA 


AGAAGGTCTT 


CACCATCTAG 


TCTGGGAAAT 


TGTTGATAAC 


TCAATTGACG 


7680 


AGGCCTTGGC 


AGGATTTGCC 


AGCCATATTC 


AAGTTTTTAT 


TGAGCCAGAT 


GATTCGATTA 


7740 


CTGTTGTGGA 


TGATGGGCGT 


GGTATCCCAG 


TCGATATTCA 


GGAAAAAACA 


GGCCGTCCTG 


7800 
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CTGTTGAGAC 


CGTCTTTACA 


GTCCTTCACG 


CTGGAGGAAA 


GTTCGGCGGT 


GGTGGATACA 


7860 


AGGTTTCAGC 


TGGTCTTCAC 


GGGGTGGGGT 


CGTCAGTAGT 


TAATGCCCTT 


TCCACTCAAT 


7920 


TAGACGTTCA 


TGTTCACAAA 


AATGGTAAGA 


TTCATTACCA 


AGAATACCGT 


CGTGGTCATG 


7980 


TTGTCGCAGA 


TCTTGAAATA 


GTTGGAGATA 


CGGATAAAAC 


AGGAACAACT 


GTTCACTTCA 


3040 


CACCGGACCC 


AAAAATCTTC 


ACTGAAACAA CAATCTTTGA TTTTGATAAA 


TTAAATAAAC 


8100 


GGATTCAAGA 


GTTGGCCTTT CTAAATCGCG GTCTTCAAAT TTCAATTACA GATAAGCGCC 


8160 


AAGGTTTGGA 


ACAAACCAAG 


CATTATCATT 


ATGAAGGTGG 


GATTGCTAGT 


TACGTTGAAT 


8220 


ATATCAACGA 


GAACAAGGAT 


GTAATCTTTG 


ATACACCAAT 


CTATACAGAC 


GGTGAGATGG 


8280 


ATGATATCAC 


AGTTGAGGTA 


GCCATGCAGT 


ACACAACTGG 


TTACCATGAA 


AATGTCATGA 


8340 


GTTTCGCCAA 


TAATATTCAT 


ACCCATGAAG 


GTGGAACACA 


TGAACAAGGT 


TTCCGTACAG 


8400 


CCTTGACACG 


TCTTATCAAC 


GATTATGCTC 


GTAAAAATAA 


GTTACTGAAA 


GACAATGAAC 


8460 


ATAATTTAAC 


AGGGGAAGAT 


GTTCGCGAAG 


GCTTAACTGC 


AGTTATCTCA 


GTTAAACACC 


3520 


CAAATCCACA 


GTTTGAAGGA 


CAAACCAAGA 


CCAAATTGGG 


AAATAGCGAA 


GTGGTCAAGA 


8580 


TTACCAATCG 


CCTCTTCAGT 


GAAGCTTTCT 


CCGATTTCCT 


CATGGAAAAT 


CCACAGATTG 


8640 


CCAAACGTAT 


CGTAGAAAAA 


GGAATTTTGG 


CTGCCAAGGC 


TCGTGTGGCT 


GCCAAGCGTG 


8700 


CGCGTGAAGT 


CACACGTAAA 


AAATCTGGTT 


TGCAAATTTC 


CAACCTTCCA 


GGGAAACTAG 


8760 


CAGACTGTTC 


TTCTAATAAC 


CCTGCTGAAA 


CAGAACTCTT 


CATCGTCGAA 


GGAGACTCAG 


8820 


CTGGTGGATC 


AGCCAAATCT 


GGTCGTAACC 


GTGAGTTTCA 


GGCTATCCTT 


CCAATTCGCG 


3880 


GTAAGATTTT 


GAACGTTGAA 


AAAGCAAGTA 


TGGATAAGAT 


TCTAGCCAAC 


GAAGAAATTC 


8940 


GTAGTCTTTT 


CACAGCCATG 


GGAACAGGAT 


TTGGCGCAGA 


ATTTGATGTT 


TCGAAAGCCC 


9000 


GTTACCAAAA 


ACTCGTTTTG 


ATGACCGATG 


CCGATGTCGA 


TGGAGCCCAC 


ATTCGTACCC 


9060 


TTCTTTTAAC 


CTTGATTTAT 


CGTTATATGA 


AACCAATCCT 


AGAAGCTCGT 


TATGTTTATA 


9120 


TTGCCCAACC 


ACCAATCTAT 


GGTGTCAAGG 


TTGGAAGCGA 


GATTAAAGAA 


TATATCCAGC 


9180 


CGGGTGCAGA 


TCAAGAAATC 


AAACTCCAAG 


AAGCTTTAGC 


CCGTTATAGT 


GAAGGTCGTA 


9240 


CCAAACCGAC 


TATTCAGCGT 


TATAAGGGGC 


TAGGTGAAAT 


GGACGATCAT 


CAGCTGTGGG 


9300 


AAACAACCAT 


GGATCCCGAA 


CATCGCTTGA 


TGGCTAGAGT 


TTCTGTAGAT 


GATGTGCAGA 


9360 


AGCAGATAAA 


ATCTTTGATA 


TGTTGATGGG 


GATCGAGTTG 


TCCTCGTCG 




9409 


(2) INFORMATION FOR SEQ ID MO: 162: 









U> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6415 base pairs 

(B) TYPE: nucleic acid 
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fC) STRANDEDNESS: double 
(0) TOPOLOGY; linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


162: 






CCTGGGAAAG 


TCTTGAAAAT 


TATGATAGAA 


TGGTGGAAGG 


AAAAATTCAG 


GAGAGTAGTA 


60 


GTGACTCAAA 


ATGTTGAAAG 


TCTTCTCGTA 


TCCATTGTAA 


TCAGTGCATA 


CAATGAAGAA 


120 


AAATATCTGC 


~ CTGGTCTAAT 


"TGAAGACTTA" 


"AAAAATCAAA 


CCTATCCTAA 


AGAGGATATT 


180 


GAAATTCTAT 


TTATAAATGC 


TATGTCCACA 


GATGGGACCA 


CAGCTATCAT 


TCAGCAATTT 


240 


ATAAAGGAAG 


ATACAGAGTT 


TAACTCAATT 


AG ATT GT AT A 


ACAATCCTAA 


GAAAAATCAA 


300 


GCTAGTGGTT 


TTAACCTGGG 


AGTTAAACAT 


TCTGTAGGGG 


ACCTTATTTT 


AAAAATTGAT 


360 


GCTCATTCAA 


AAGTTACTGA GACTTTTGTA 


ATGAACAATG 


TGGCTATTAT 


TCAACAAGGT 


420 


GAATTTGTCT 


GTGGGGGGCC 


TAGACCGACG 


ATTGTCGAAG 


GAAAAGGAAA 


ATGGGCAGAG 


480 


ACCTTGCATC 


TTGTTGAGGA 


AAATATGTTT 


GGCAGTAGCA 


TTCCCAATTA 


TCGAAATAGT 


540 


TCTGAGGATA 


GATATGTTTC 


TTCTATTTTT 


CATGGAATGT 


ATAAACGAGA 


GGTTTTCCAG 


600 


AAGGTTGGTT 


TAGTAAATGA 


GCAACTTGGC 


CGAACTGAAG 


ATAATGATAT 


TCATTATAGA 


660 


ATTCGAGAAT 


ATGGTTATAA 


AATCCGCTAT 


AGCCCAAGTA 


TTCTATCTTA 


TCAGTATATT 


720 


CGACCAACAT 


TCAAGAAAAT GCTGCATCAA 


AAGTATTCAA 


ATGGTTTGTG 


GATTGGCTTG 


780 


ACAAGTCATG 


TTCAGTTTAA GTGTTTATCA 


TTATTTCACT 


ATGTTCCTTG 


TTTATTTGTT 


840 


TTGAGTCTTG 


TGTTTAGTCT 


AGCATTGTTA 


CCGATCACAT 


TCGTATTCAT 


AACTTTACTA 


900 


TTAGGTGCCT 


ATTTTCTACT 


TTTGTCATTA 


CTCACTTTGC 


TGACTTTATT 


AAAACATAAA 


960 


AATGGATTTC 


TAATTGTGAT 


GCCCTTTATT 


TTATTTTCCA 


TTCACTTTGC 


TTATGGCCTT 


1020 


GGGACGATTG 


TAGGTTTAAT 


TAGAGGATTT 


AAATGGAAGA 


AGGAGTACAA 


GAGAACAATA 


1080 


ATTTATTTGG 


ATAAAATAAG 


CCAAATAAAT 


CAAAATATGC 


TATAATAACA 


ATATAGTAAA 


1140 


ACTCTTTTAA 


GGAGGAGTAG 


ATTTCTATGA 


ATAAAAAACT 


AACAGATTAT 


GTGATTGATC 


1200 


TGGTGGAAAT 


TTTAAATAAA 


CAACAAAAGC 


AGGTTTTCTG 


GGGAATATTT 


GATATTTTCA 


1260 


GTATGGTGGT 


TTCCATCATT 


GTATCTTATA 


TTTTATTTTA 


TGGGCTGATT 


AATCCAGCAC 


1320 


CTGTTGACTA 


CATTATCTAT 


ACGAGTTTGG 


CCTTCCTGTT 


CTATCAATTG 


ATGATTGGTT 


1380 


TTTGGGGGTT 


GAACGCGAGC 


ATTAGTCGTT 


ACAGCAAGAT 


TACGGATTTC 


ATCAAAATCT 


1440 


TTTTTGGTGT 


GACTGCTAGC 


AGTGTCTTGT 


CATATAGTAT 


CTGTTATGCC 


TTCTTGCCAC 


1500 


TCTTCTCCAT 


CCGTTTCATC 


ATTCTCTTTA 


TCTTGTTGAG 


TACCTTCTTG 


ATTTTATTGC 


1560 
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CACGGATTAC TTCCCAGTTA ATCTACTCCA GACGCAAAAA AGGTAGTGGT GATGGAGAAC 1620 
ACCGTCGGAC CTTCTTGATT GGTGCCGGTG ATGGTGGGGC TCTTTTTATG GATAGTTACC 1680 
AACATCCAAC CAGTGAATTA GAACTGGTCG GTATTTTGGA TAAGGATTCT AAGAAAAAGG 1740 
GTCAAAAACT TGGTGGTATT CCTGTTTTGG GCTCTTATGA CAATCTGCCT GAATTAGCCA 1800 
AACGCCATCA AATCGAGCGT GTCATCGTTG CGATTCCGTC GCTGGATCCG TCAGAATATG 1860 
AGCGTATCTT GCAGATGTGT AATAAGCTGG GTGTCAAATG TTACAACATG CCTAAGGTTG 1920 
AAACTGTTGT TCAGGGCCTT CACCAAGCAG GTACTGGCTT CCAAAAAATT GATATTACGG 1980 

ACCTTTTGGG TCGTCAGGAA ATCCGTCTTG ACGAATCGCG TCTGGGTGCA GAACTGACAG 2040 

GTAAGACCAT CTTAGTCACA GGAGCTGGAG GTTCAATCGG TTCTGAAATC TGTCGTCAAG 2100 

TTAGTCCCTT CAATCCTGAA CGCATTGTCT TGCTCCGTCA TGGGGAAAAC TCAATCTACC 2160 

TTGTTTATCA TGAATTGATT CGTAAGTTCC AAGGGATTGA TTATGTACCT GTGATTGCGG 2220 

ACATTCAAGA CTATGATCGT TTGTTGCAAG TCTTTGAGCA GTACAAACCT GCTATTGTTT 2280 

ATCATGCGGC AGCCCACAAG CATGTTCCTA TGATGGAGCG CAATCCAAAA CAAGCCTTCA 2340 

AAAACAATAT CCGTGGAACT TACAATGTTG CTAAGGCTGT TCATGAACCT AAAGTGTCTA 2400 

AGATGGTTAT GATTTCGACA GATAAGGCAG TCAATCCACC AAATGTTATG GGAGCAACCA 2460 

AGCGCGTGGC GGACTTGATT GTCACTCGCT TTAACCAACG TAGCCAATCA ACCTACTGTG 2520 

CAGTTCGTTT TGGGAATGTT CTTGGTAGCC GTGGTAGTGT CATTCCAGTC TTTGAACGTC 2580 

AGATTGCTGA AGGTGGGCCT GTAACGGTGA CACACTTCCC TATGACCCGT TACTTTATGA 2640 

CCATTCCAGA AGCTAGCCGT CTGGTTATCC ATGCTGGTGC TTATGCCAAA GATGGGGAAG 2700 

TCTTTATCCT TGATATGGGC AAACCAGTCA AGATTTATGA CTTGGCCAAG AAGATGGTGC 2760 

TTCTAAGTGG CCACACTGAA AGTGAAATTC CAATCGTTGA AGTTGGAATC CGCCCAGGTG 2820 

AAAAACTCTA CGAAGAACTC TTGGTATCAA CCGAACTCGT TGATAATCAA GTTATGGATA 2880 

AGATTTTCGT TGGTAAGGTT AATGTCATGC CTTTAGAATC CATCAATCAA AAGATTGGAG 2940 

AGTTCCGCAC TCTCAGTGGA GATGAGTTGA AGCAAGCTAT TATCGCCTTT GCTAATCAAA 1000 

CAACCCACAT TGAATAAAAA AGAAAAACGC ATAGTATCAA GTTACACAAC CTTGGTAATA 3060 

TGCGTTTTAT TATGTAGAGA CTTATACTCT TCGAAAATCT CTTCAAACCA CGTCAACGTC 3120 

GCCTTGCCGT ATATGGTTAC TGACTtCGTC AGTTCTATCC ACAACCTCAA AACAGTGTTT 3180 

TGAGytGACT TCGTCAGTTC TATCCACAAC CTCAAAACAG TGTTTTGAGc TGACtTCGTC 3240 

AGTTCTATCC ACAACCTCAA AACAGTGTTT TGAGCTGAcT TCGTCAGTTC CATCCACAAC 3300 

CTTAAAACAG TGTTTTGAGy TGACnTTCGT CAGTTCCATC TACAACCTTA AAACAGTGTT 3360 
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TTGAGCTCCC CGCAGCTAGT TTCCTAGTTP GCTCTTTGAT TTTCATTGAG TATTACTTCA 3420 

TTTTCTTCTG AAATGGAATT GTTACCCAGT CTATGCTATT GAAAATACGC CAAAACTTCT 3480 

AAGGGTTTGT GAGCGATATA ATCAGGTTGA TAGTTTAGTA GATCTGCTTG CTCTCCAAAT 3 54 0 

CCCCAAGTGA TGGCCAATTT CTGAATACCT GTTTCTCGAG CTCCCAGCAT ATCAAACTTG 3600 

GTATCTCCCA TGATGATGGC TTGTTCTGGT GCTAGTTGAT GTGTCTGCAA GGCTTGGTGA 3 660 

ATGACATCTG CCTTATGGCG "TGCTTCAGGG CTAGAACCAT "AAATGCCATC" AAAGAAATGA 3720" 

TCGATTTCCA AGTTTTTTGC CATGTCTTGA GCAGTAGATG TATCCTTTGT CGTGGTGATG 3790 

TAGAGTGGAT AACTGCTCGA TAACTCCTCA AGCAAGTCTA TAATCTGAGG AAAGAGTTGA 3840 

GCTTCATAGA TGCCTTTTGC CTTATAGTAA GAACGATATA TCTGCACGGC TTCAGAAATT 3 900 

TGGTCTTTGG ACAGGCAGGT CGCAAAACTA CTTTCGAGAG GTCCTCCCAT AAAACCACGA 3960 

ATAGTTTTGG CATCAGGGCT AGGCACCCCC AGCTCTTTAA AGGTATAGGT AAAGGCATTG 4020 

TGAATCCCGA TAGAACTATC AACGAGGGTT CCATCCAAAT CGAAAAAAAT CGCTGTGATA 4080 

GAGGTCATGG TTTCTCCTAT TTGATAAGCT TATTCTCCGA AAATTTCTTT TTGGAGGCGA 4140 

CGACCAGTAG GGGTGGTAGC GAGTCCACCT TCAGCTGTTT CACGAAAGGC ACTTGGCATG 4200 

CTTCCTCCTA CTTGGTACAT GGCATCGATC ACTTCATCCA CAGGGATTTT AGATTCGATA 4260 

CCTGCCAAGG CCATGTCTGC TGCGATGAAA GCAAAGCTAG CTCCCATCGC ATTACGTTTG 4320 

ACACAGGGAA CTTCGACCAA ACCTGCAACA GGGTCACAGA TGAGGCCTAG CATATTTTTA 4380 

ATGACAAAGG CAATAGCTTG ACTGGCCTGA TAAGGTGTTC CACCTGCAGC CAGAGTCAAG 4440 

GCGGCAGCAC TCATAGCAGA GGCTGAACCA ACTTCAGCTT GACACCCACC CTCAGCACCT 4 500 

GAGATGGAGG CATTGTTTGC GATGACTAGT CCAAAGGCAC CAGCAGCAAA GAGGAAATCC 4 560 

AATTGTTGCT CGTGGCTGAG GTCTAATTTT TCAATAGCAG CACTGAGAAC GGATGGCAGA 4 620 

CAGCCAGCAC TTCCAGCGCT TGGAGTGGCA CAGACCAAGC CCATTTTGGC ATTGTGTTCA 4680 

TTGACTGCGA TGGCATTTCG GGCAGCCGAG AGAATCGTAT AATCTGACAG AGTTTTTCCG 4740 

TTTTCGATGT AGTGATCCAA TTTGGCAGCA TCTCCACCTG TCAGGCCACT ACGAGATTTA 4 800 

TTTTCATTGA GGCCAAGTTG GACAGAGGCT TTCATAACTT CCAGATTGCG TTCCATGAGA 4860 

AGGAAGACTT CTTCACGTTC GCGACCGGTC AATTCAAACT CTGTTGTAAT CATGAGTTCT 4920 

GCGACATTTC CTTGAAAGTC CAGATCTGCT TGCTCGACCA ATTCTTTGAT AGAATAAAAC 4980 

ATGCTTCCTC CTATTTAAAG AAATTGACAT TGTGGAGATG ACGGATTTTT CGAATTTCTT 5040 

CGATAGCCTC ATCACAGTTG CGACTGTCAA CTTCGATAAT CATAATCGCT TTTTCACCAG 5100 



WO 98/18931 



PCT/US97/19588 



, iiuiuililil 1052 

CTTTTTCACG ACTGACATTC ATCTGGGCGA TATTGATACC ATAGCGGGAA AGCCCCTCTG 5160 

TAACAAGGGC AATCATACCT GGAATATCTT GATGAACGAT CATCATAGTC GGTGTATTCA 5220 

TATTGAGAGA GACGGCAAAA CCATTGAGTT CGGTTACCTG AATATTTCCT CCACCGATAG 5280 

AAATACCAGT CACGCTGATG GTCTTGTGGG CATTTTTAAC ACTAATTTTA GTGGTGTTAG 5340 

GGTGAGGGGC ATTGCTGTCT TTCTGAATGG TCCAGACAAT CTTGATACCA CGCTTGTGGG 5400 

CAATTTCCAG ACTATTTGGA ATTTCAGGAT CATCTGTATC CATTCCTAAA ATACCTGCAA 5460 

CAAGGGCTAG GTCTGTTCCG TGACCACGAT AGCTCTTGGC AAATGAGTTA AAAAGTTGGA 5520 

ATTCAACTTC TGTCCGAGTA TCATCAAAAA TGGAAGAGAC AATCTTCCCA ATACGAACAG 5580 

CACCAGCGGT ATGGCTACTA GATGGGCCAA TCATAACTGG TCCGATGATA TCAAACACAG 5 640 

ATTGAAAACG AAGTGATTTC ATCAGTTTCC CCTTATAAAA ATTCTTATCT CTATTATATC 5700 

AAAGAATGAG GGGCTTGGCT TTAATTGTGG ATGAAAACCT TTCTAATACC TCAAATAGCA 5760 

TAAAAATAGT ATCTTTTATG ACAAAAAACA CCTTATTTAG GGAAATAAAA AATAATTTTG 5820 

TAATATTTCT ACATAAAAGT GTCAAGAAAC GGTAATATTT AAAGGGTATG ATAGAACTAT 5880 

AGAAAGAAGG AGAATTTTCG AATATGAAAT CAATAACTAA AAAGATTAAA GCAACTCTTG 5940 

CAGGAGTAGC TCCCTTGTTT GCAGTATTTG CTCCATCATT TGTATCTGCT CAAGAATCAT 6000 

* 

CAACTTACAC TGTTAAAGAA GGTGATACAC TTTCAGAAAT CGCTGAAACT CACAACACAA 6060 

CAGTTGAAAA ATTGGCAGAA AACAACCACA TTGATAACAT TCATTTGATT TATGTTGATC 6120 

AAGAGTTGGT TATCGATGCC CCTGTAGCGC CTGTTCCAAC ACCAGCGCCA GCTACTTATG 6180 

CGGCACCAGC CGCTCAAGAT GAAACTGTTT CAGCTCCAGT AGCAGAAACT CCAGTAGTAA 6240 

GTGAAACAGT TGTTTCAACT GTAAGCGGAT CTGAAGCAGA AGCCAAAGAA TGGATCGCTC 6300 

AAAAAGAATC AGGTGGTAGT ATACAGCTAC AAATGGACGT TATATCGGAC GTTACCAATT 6360 

AACAGATTCA TACCTGAACG GTGACTACTC AGCTGAAAAC CAAGAACGGG TACCG 6415 

(2> INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 8494 baae pairs 
(B> TYPE: nucleic acid 
(CI STRANDBDNESS : double 
(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
TACCCCTTTC GAATTTTGGC AAAAATTCGG TAAGGC7TTG ATGGTAGTTA TCGCGGTTAT 
GCCGGCTGCT GGTTTGATGA TTTCAATCGG TAAGTCTATC CTGATGATTA ACCCAACCTT 



60 
120 



WO 98/18931 



PCT/US97/ 19588 



1053 

TGCACCACTT GTCATCACAG GTGGAATTCT TGAGCAAATC GGTTGGGGGG TTATCGGTAA 180 

CCTTCACATT TTGTTTGCCC TAGCCATTGG AGGAAGCTGG GCTAAAGAAC GTGCTGGTGG 240 

TGCTTTCGCC GCTGGTCTTG CCTTCATCTT GATTAACCGT ATCACTGGTA CAATCTTTCG 300 

TGTATCAGGC GATATGTTGA AAAATCCAGA TGCTATGGTA ACTACTTTCT TTGGTGCTTC 360 

AATCAAAGTT GCTGATTACT TTATCAGTGT TCTTGAAGCT CCAGCCTTGA ACATCGGGGT 4 20 

- — — -ATTCGT AGGG ~ ATT ATCTC AG "GTTTTGTAGG ~GGC AACTCCT TAC AAC AA^^CT AC^CTT 480~ 

CCGTAAACTT CCTGATGCAC TTTCATTCTT CAACGGGAAA CGTTTGGTAC CATTTGTAGT 540 

TATTCTTCGT TCAGCAATCG CTGCAATTCT ACTTGCTGGT TTCTGGCCAG TACTTCAAAC 600 

AGGTATCAAT AACTTCCGTA TCTGGATTGC CAACTCACAA GAAACTGCTC CAATTCTTGC 660 

ACCATTCTTG TATGGTACTT TGGAACGTTT GCTCTTGCCA TTTGGTCTTC ACCACATGTT 720 

GACTATCCCA ATGAACTACA CAGCTCTTGG TGGTACTTAT GACATTTTAA CTGGTGCAGC 780 

TAAAGGTACT CAAGTATTCG CTCAAGACCC ACTATGGCTT GCATGGGTAA CAGACCTTGT 840 

AAACCTTAAA GGTACTGATG CTAGTCAATA TCAACACTTG TTAGATACAG TACATCCAGC 900 

TCGTTTCAAA GTTGGACAAA TGATCGGTTC ATTCGGTATC TTGATGGGTG TGATTGTTGC 960 

TATCTACCGT AATGTTCATG CTGACAAGAA ACATAAATAC AAAGGTATGA TGATTGCAAC 1020 

AGCTCTTGCA ACATTCTTGA CAGGGGTTAC TGAACCAATC GAATACATGT TCATGTTCAT 1080 

CGCAACACCT ATGTATCTTG TTTACTCACT TGTTCAAGGT GCTGCCTTCG CTATGGCTGA H40 

CGTCGTAAAC CTACGTATGC ACTCATTCGG TTCAATCCAG TTCTTGACTC GTACACCTAT 1200 

TGCAATCAGT GCTGGTATTG GTATGGATAT CCTTAACTTC GTTTGGGTAA CTGTTCTCTT 12 60 

TGCTGTAATC ATGTACTTTA TCGCAAACTT CATGATTCAA AAATTCAACT ACGCAACTCC 1320 

AGGGCGCAAC GGAAACTACG AAACTGCTGA AGGTTCAGAA GAAACCAGCA GCGAAGTGAA 1380 

AGTTGCAGCA GGCTCTCAAG CTGTAAACAT TATCAACCTT CTTGGTGCAC GTGTAAACAT 1440 

CGTTGATGTT GATGCATGTA TGACTCGTCT TCGTGTAACT GTTAAAGATG CAGATAAAGT 1500 

AGGAAATGCA GAGCAATGGA AAGCAGAAGG AGCTATGGGT CTTGTCATGA AAGGACAAGG 1560 

GGTTCAAGCT ATCTACGGTC CAAAAGCTGA CATTTTGAAA TCTGATATCC AAGATATCCT 1620 

TGATTCAGGT GAAATCATTC CTGAAACTCT TCCAAGCCAA ATGACTGAAG CACAACAAAA 1680 

CACTGTTCAc TTCAAAGATC TTACTGAGGA AGTTTACTCA GTAGCAGACG GTCAAGTTGT 1740 

TGCTTTGGAA CAAGTAAAGG ATCCAGTATT TGCTCAAAAA ATGATCGGTG ATGGATTTGC 1800 

AGTAGAACCT GCAAATGGAA ACATTGTATC TCCAGTTTCA GCTACTGTCT CAAGCATCTT 1860 
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CCCAACAAAA 


CATGCTTTTG 


GTATTGTGAC 


GGAAGCAGGT 


CTTGAAGTAT 


TGGTTCACAT 


1920 


TCCTTTCGAC 


ACACTAACTC 


TTGAAGGTAA 


ACCATTTACA 


GTTCATGTTG 


CTGAAGGACA 


1980 


AAAAGTTGCA 


GCAGGAGATC 


TCCTTCTCAC 


AGCTGACTTG 


GATGCTATCC 


GTGCAGCAGG 


2040 


ACGTGAAACT 


TCAACAGTAG 


TTGTCTTCAC 


AAATGGTGAT 


GCAATTAAAT 


CAGTTAAGTT 


2100 


AGAAAAAACA 


GGTTCTCTTG 


CAGCTAAAAC 


AGCAGTTGCT 


AAAGTAGAAT 


TGTAATATAC 


2160 


TTGAGGTTGG 


AAGCTGTATT 


CCAACCTCTT 


ATTTTGGGAG 


AAAAGAATGA 


AATTTTTAAC 


2220 


ACTCAATACT 


CACAGTTGCA 


TGGAGAAAGA 


AGCAGAGGAA 


AAATTCCAGA 


TTTTGCTTGA 


2280 


AGATATTCTT 


GAAAAGGACT 


ATGATTTGAT 


TTGTTTTCAA 


GAAATCAATC 


AGGAGATGAC 


2340 


CTCGTCAGAG 


GTGGAGGTTA 


ATGACCTTTA 


TCAAGCTTTG 


CCAGCAGCTG 


AGCCTATTCA 


2400 


CCAAGACCAT 


TATGTTAGAC 


TCTTGGTTGA 


AAAGTTGTCT 


GAGCAAGGGA 


AAAATTACTA 


2460 


CTGGACCTGG 


GCCTATAACC 


ATATCGGCTA 


TAACCGCTAC 


CACGAAGGTG 


TGGCTATCTT 


2520 


GTCTAAAACA 


CCTATTGAAG 


CCAGAGAAAT 


TTTGGTTTCA 


GATGTGGATG 


ATCCAACAGA 


2580 


CTATCATACT 


CGCCGTGTTG 


CCCTAGCTGA 


AACTCTAGTC 


GATGGCAAGG 


AGCTAGCAGT 


2640 


TGCCAGTGTT 


CATCTCTCTT 


GGTGGGATAA 


AGGTTTCCAA 


GAAGAATGGG 


CACGATTTGA 


2700 


GGCTGTCTTG 


AAAAAATTGA 


ACAAGCCACT 


TTTACTAGCT 


GGAGATTTCA 


ACAATCCGGC 


2760 


TGGACAGGAA 


GGTTACCAAG 


CTATTTTAGC 


TAGTCCATTA 


GGCTTACAAG 


ACGCATTTGA 


2820 


AGTTGCTCAA 


GAGAAAAGTG 


GTAGCTATAC 


TGTTCCGCCT 


GAAATTGATG 


GCTGGAAAGG 


2880 


GAACACTGAA 


CCCCTTCGAA 


TCGATTATGT 


CTTTACTACC 


AAAGAGTTAG 


CGGTGGAAAA 


2940 


TTTACATGTC 


GTATTTGATG 


GTAACAAGAG 


TCCACAAGTG 


AGTGATCACT 


ATGGCTTGAA 


3000 


TGCTATATTA 


AACTGGAAAT 


AATAACTGAA 


AAGAGCTTGG 


AACTATAAAA 


TTCCAGCCTT 


3060 


TTCTTACTAG 


AGAAGCTACT 


GGAAATAGCC 


TAAATAAGTG 


AGACTACTGT 


AATGGAATAA 


3120 


AATATGGTAT 


AATTGATAAG 


GTAGATAGAA 


TCGAGGATGT 


TATGTCATTT 


ACGAAATTTC 


3180 


AATTTAAAAA 


CTATATTAGA 


GAAGCCTTGA 


AGGAGTTAAA 


ATTTACAACT 


CCAACAGA^G 


3240 


TGCAAGACAA 


GTTGATTCCT 


ATTUTTTTGG 


CAGGTCGTGA 


CCTAGTAGGA 


GAATCAAAAA 


3300 


CAGGTTCAGG 


TAAGACTCAT 


AcrrrciTGT 


TACCGATTTP 


CCAGCAATTA 


GATGAAGCTA 


3360 


GCGATAGTGT 


ACAAGCAGTG 


ATTACTGCAC 


CGAGTCGTGA 


GTTGGCTACT 


CAAATTTACC 


3420 


AAGTAGCGCG 


TCAGATTTCA 


GCTCACTCAG 


ATGTCGAAGT 


TCGTGTGGTT 


AATTATGTGG 


3480 


GTGGTACGGA 


TAAGGCTCGC 


CAGATTGAGA 


AATTGGCAAG 


CAATCAGCCT 


CATATTGTTA 


3540 


TTGGAACACC 


AGGCCGTATC 


TACGACTTGG 


TTAAATCTGG 


TGATTTAGCT 


ATTCATAAAG 


3600 


CCAAGACATT 


TGTTGTTGAT 


GAAGCAGATA 


TGACCTTGGA 


TATGGGATTC 


TTGGAAACTG 


3660 
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TTGATAAGAT TCCTCGCAGT CTTCCAAAAG ACTTGCAATT CATGCTCTTC TCAGCGACTA 3720 
TCCCACAAAA ACTGCAACCA TTCTTGAAAA AATACTTATC AAATCCTGTT ATGGAGAAAA 3780 
TTAAGACCAA AACGGTTATT TCTCACACCA TTGATAATTG GTTGATTTCG ACCAAGGGAC 3840 

ATGATAAGAA TGCTCAAATT TACCAGTTGA CTCAGTTGAT GCAGCCGTAT TTGGCAATGA 3900 

TTTTTGTTAA CACTAAAACG CGTGCTGATG AATTGCATTC ATATCTGACT GCTCAAGGCT 3 960 

TGAAGGTTGC AAAAATCCAT GGCGATATTG CCCCTCGTGA ACGCAAGCGA ATCATGAATC 4020 

AGGTGCAAAA TCTGGATTTT GAGTATATTG TCGCAACAGA TTTGGCAGCG CGTGGGATTG 4080 

ACATTGAAGG TGTCAGCCAT GTCATCAATG ATGCCATTCC GCAAGACTTA TCTTTTTTTG 4140 

TTCATCGTGT TGGTCGTACT GGACGAAATG GCCTACCAGG TACAGCTATT ACCCTTTATC 4200 

AGCCAAGTGA TGACTCGGAT ATCCGTGAGT TGGAGAAATT GGGAATCAAG TTTAGTCCTA 42 60 

AGATGGTCAA AGACGGGGAA TTTCAAGATA CCTATGACCG TGATCGTCGT GCCAACCGTG 4320 

AGAAAAAACA AGATAAACTT GATATCGAAA TGATTGGTTT CGTTAAAAAC AAAAAGAAAA 4380 

AAGTCAAACC GGGTTATAAG AAGAAAATTC AATGGGCGGT TGATGAAAAG CGCCGTAAAA 4440 

CCAAGCGTGC TGAAAATCGC GCTCGCGGTC GTGCAGAGCG TAAAGCTAAA CGCCAAACAT 4 500 

TTTAATAGAA ATTGTTGGAG TATTGAGCTC CAACTTTTTT ATTTATGAGA ACGAACTATC 4560 

TAAACCGAAA CACTACATTA AAGACTGCAA ATTGCGATTA AAAATGGTAT AATGATAAAG 4620 

TTATATAGTC CCGATAAGAT GGTAGGTATT TATTACGAAG AGTTTTCCTA TCAGTACTTT 4680 

GTAACTCTAT AACAATATTT TTTAAGGGGG GACATTTTTA TGTCAGAGCG TAAATTATTC 4 740 

ACGTCTGAAT CTGTATCTGA GGGGCATCCG GATAAGATTG CAGACCAAAT TTCAGATGCG 4800 

ATTTTGGATG CTATTTTAGC AAAGGATCCA GAGGCGCACG TTGCTGCTGA AACAGCTGTA 4 860 

TATACTGGTT CTGTCCACGT TTTTGGTGAA ATTTCTACAA ATGCCTATGT GGATATTAAC 4920 

CGTGTGGTTC GTGATACCAT TGCAGAGATT GGTTATACCA ATACAGAATA TGGATTTTCT 4 980 

GCTGAGACGG TGGGAGTACA CCCATCTTTG GTGGAACAAT CTCCTGACAT CGCTCAAGGT 5040 

GTTAACGAAG CCTTGGACGT TCGTGGAAAT GCTGATCAAG ATCCACTGGA CTTGATTGGA 5100 

GCAGCTGACC AAGGGCTCAT GTTTGGATTT GCAGTAGATG AAACAGAAGA GCTTATCCCA 5160 

TTGCCAATTG CACTCAGTCA TAAATTGGTT CGTCGTCTGG CAGAACTTCG TAAGTCTGGA 5220 

GAAATTACCT ATCTCCGTCC AGATGCAAAA TCACAAGTTA CAGTTGAGTA CGATGAAAAT 5280 

GACCGTCCGG TACCTGTAGA TACAGTCGTT ATTTCTACTC AGCATGATCC AGAGGCCACT 5340 

AATGAACAAA TCCATCAAGA TGTGATTGAC AAGGTCATCA AAGAAGTTAT TCCATCTTCT 5400 
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TATCTTGATG ATAAGACAAA ATTCTTTATC AATCCGACAG GTCGTTTTGT AATCGGTGGT 54 60 

CCTCAAGGGG ACTCAGGTTT GACTGGTCGT AAGATTATTG TAGATACTTA TGGTGGCTAC 5520 

TCTCGTCATG GTGGTGGTGC CTTCTCTGCT AAAGATGCGA CTAAGGTGGA TCGTTCAGCC 5580 

TCTTATGCGG CTCGCTATAT TGCCAAGAAT ATCGTTGCAG CAGACCTTGC TAAGAAGGCA 564 0 

GAAGTGCAGT TGGCCTATGC TATCGGTGTT GCGCAACCTG TTTCTGTTCG TATCGATACT 5700 

TTCGGTACAG GAACAGTAGC TGAAAGTCAA CTTGAAAAAG CGGCTCGTCA AATCTTTGAC 5760 

CTTCGCCCTG CAGGGATTAT CCAAATGCTG GACCTCAAGC GTCCAATTTA CCGTCAAACA 5820 

TCGGCTTACG GTCACATGGG ACGTApAGAT ATTGATCTTC CATGGGAACG TTTGGATAAG 5880 

GTAGATGCTT TGAAAGAAGC AGTAAAATAA GATTTTAAGA GGGGAACGTC CTCTCTTTTT 5940 

TATAGTTTTT AACTATACTG GGATACTGTT CTGAAAATCC ATTTTGCGAA AGTAGAGATT 6000 

TACATGTATA GTAGATTGAA ACTAGAATAG TACACCTCAA CTTCTAAAAC ATTGTTAGCA 6060 

ATCAATTTGA CTGTCCTGAT CGATTTCTCC TGTTCTTGTT TCATTTTACT ATATTTCTTT 6120 

AAAAATGATA AAGGTTAAGA TTTCTCCTCG TAATAGATAA TCTTGGGGAT ATTTCAATCC 6180 

AAAGTTTTAT TCGTTATCAC TTGACTATTG CAAG G TT TT C TAGAGCAACA GAGTCATGGA 6240 

ATGGACTCAT GGTTGAGATT TCTCCTTGTT GCTTGGACTT CATTCAAAAG TCTGTTACCC 6300 

AAGCCTTGTT CAAACTTCTA ATACACTAGC TGTTTCCATA GCATGACTTC TGTACTAGAC 6360 

TTTCTTTTCC GAATAAATAG ATAGAACCAC AGAATCTAGT AAACCTAGAA TTAAAATTAT 6420 

GGTATAATAT TAGCAATAAA AGAAATCTGG AGGATTAGAA TCATGGTATC AACGAAAACA 64 80 

CAAATTGCTG GTTTTGAGTT TGACAATTGC TTGATGAATG CAGCAGGTGT GGCTTGTATG 6540 

ACGATAGAGG AGTTAGAACA GGTCAAAAAC TCAGCGGCAG GAACCTTTGT TACTAAGACA 6600 

GCGACCTTGG ACTTCCGTCA GGGGAATCCT GAGCCACGCT ACCAAGATGT TCCACTTGGT 6660 

TCCATCAACT CTATGGGCTT GCCAAATAAT GGCTTAGACT ATTATTTGGA TTATCTTTTA 6720 

GATTTGCAGG AAAAAGAGTC GAACCGAACT TTCTTCTTAT CTCTGGTCGG CATGTCTCCA 6780 

GAGGAAACCC ATACTATTTT GAAAAAAGTC CAAGAGAGTG ATTTTCGTGG TCTGACTGAG 6840 

CTAAATCTTT CCTGTCCAAA TGTTCCAGGT AAACCTCAGA TTGCCTATGA TTTTGAGACA 6900 

ACAGACCGGA TTTTGGCAGA AGTGTTTGCT TACTTCACCA AACCTCTTGG AATTAAATTG 6960 

CCACCTTATT TTGATATTGT TCACTTTGAC CAAGCGGCAG CTATTTTCAA CAAATATCCG 7020 

CTCAACTTTG TCAACTGCGT TAACTCTATC GGAAACGGCC TCTATATAGA AGACGAATCT 7080 

GTCGTTATTC GGCCTAAGAA TG CT T T TGGT GGAATTGGTG GAGAATACAT CAAACCGACT 7140 

GCTTTAGCCA ATGTTCACGC CTTTTATCAA CGTTTAAATC CTCAAATCCA AATTATCGGA 7200 
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ACACCTCCCG TTCTGACTGG TCGAGATGCC TTTGAACACA TCCTCTGTCC AGCAAGTATG 7260 

GTCCAGGTGG GAACGACCCT TCACAAAGAA GGCGTCAGTG CTTTTGACCG CATTACCAAT 7320 

GAACTGAAAG CAATCATGGT GGAAAAAGGC TACGAGAGCT TAGAAGATTT CCGTGGGAAA 7380 

TTCCCCTATA TTGACTAAAT TAAATCGAAA AATCTGAAGA AAGGAGAGAC GATGCTACCC 7440 

ATTGAAGAAA GTCAGAAGTT GACTTTATCA AATTTACCCA GCCTGAGCCT ATTTACAGGG 7500 

ACAGATCAGG GTCACTTTCA ACTGATGAAG AGTCAAATGT TGAAACAGAT TGGGTATGAT 7560 

TCTGCTGACC TCAACTTTGC CTACTTTGAT ATGAAAGAAG TAGTTTACAA GGATGTGGAA 7620 

CTGCAGTTGG TCAGCCTTCC TTTCTTTGCG GATGAGAAAA TCGTGATATT AGATTATTTT 7680 

ATGGATATCA CGACTGCTAA GAAACGCTTT TTGACAGATG ATGAGCTTAA GTCATTTGAG 7740 

GAATACCTTG ACAATCCTTC TCCAACAACC AAGTTGATAA TCTTTGCAGA AGGAAAGCTG 7800 

GATAGCAAAA GACGGTTAGT CAAATTACTT AAGCGTGATG CCAAGGCCTT CGATGCAGTA 7860 

GAAGTAAAAG AACAAGAATT GCGCCAGTAC TTCCAAAAGT GGAGTCAGAA ACAAGGTCTG 7920 

CAGTTTACCA ATCATTCTTT TGAAAATCTC CTCATCAAGT CGGGGTTTCA ATTTAGCGAA 7980 

ATCCAGAAAA ATCTTCTCTT TTTACAGTCC TATAAGGCGA ATTCTGTTAT TGAGGAAGAG 8040 

GATATTGTTA ACGCAATTCC CAAGACTTGC AGGACAATAT TTTTGATTTA ACTCAGTTTA 8100 

TTCTGACTAA AAAGATGGAT CAGGCGCGCG ATTTGGTGAG AGACTTGACC TTGCAAGGGG 8160 

AAGATGAAAT CAAACTGATT GCAGTCATGC TGGGACAATT TCGGACTTTT ACTCACCTGA 8220 

AGATTTTGGC GGAGTCTGGC CAAACAGAAT CGCAGATTGC AAGTAGTTTA GGTAGTTATC 8280 

TGGGACGTAA CCCAAATCCT TATCAAATCA AGTTTGCATT AAGAGATTCG AGAGGACTTT 8340 

CTTTGACCTT TTTGAAGCAA GCTATTTCCT ATTTGATTGA G AC AG ACT AT CAGATTAAGA 8400 

CAGGTCTTTA TGAAAAAGGT TTCCT T TTTG AAAAGGCACT CTTACAGATT GCTACTCAGG 8460 

TCAATTGACA TTTGTTGAAA CTACTAACCC GCGG 3494 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 9707 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
<D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 164: 
CCGGTCAGTT CGTTCAGTAC AAGGAATCAT AATCAACGAT CAATCAGAAA AAAAGACTAG 
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AAAGAAGACT GTATGGATAA TCGACCAATT GCTTTTTTGG ATTCGGGTGT CGGGGGCTTG 120 

ACCGTTGTGC GCGAGCTCAT GCGCCAGCTT CCCCATGAAG AAATCCTCTA TATTGGAGAT 180 

TCGGCGCGGG CGCCCTATGG CCCCCGTCCT GCTGAGCAAA TTCCTGAATA TACTTGGCAG 240 

CTGGTCAACT TTCTCTTGAC CAAGGATGTC AAAATGATTG TCATTGCTTG TAACACTGCG 300 

ACTGCGGTCG TCTGGGAAGA AATCAAGGCT CAACTAGATA TTCCTGTCTT GGGTGTAATT 360 

TTGCCAGGAG CTTCGGCAGC CATCAAGTCC AGTCAAGGTG GGAAAATCGG AGTGATTGGA 420 

ACGCCCATGA CGGTACAATC AGACATATAC CGTCAGAAAA TCCATGATCT GGATCCCGAC 480 

TTACAGGTGG AGAGCTTGGC CTGTCCCAAG TTTGCTCCCT TGGTTGAGTC AGGTGCCCTG 540 

TCAACCAGTG TTACCAAGAA GGTGGTCTAT GAAACCCTGC GTCCCTTGGT TGCAAAGGTC 600 

GATAGCCTGA TTTTGGGCTG TACTCATTAT CCACTCCTTC GCCCTATTAT CCAAAATGTG .660 

ATGGGGCCAA AGGTTCAGCT CATCGATAGT GGGGCAGAGT GCGTACGGGA TATCTCACTC 7 20 

TTACTCAATT ATTTTGAAAT CAATCGTGGT CGCGATGCTG GACCACTCCA TCACCGTTTT 780 

TACACAACAG CCAGTAGCCA AACTTTTGCA CAAATTGGTG AAGAATGGCT GGAAAAAGAG 840 

ATTCATGTGG AGCATGTAGA ATTATGACAA ATAAAATTTA TGAATATAAG GATGACCAGG 900 

ACTGGTATGT TGGGTCTTAT AGTATTTTTG CTGCCCTTAA CAGTTTGAGC GACTATAAGA 960 

CAGATTTTCC TCTGTTTGAA TTCTCCAAAA TATTTGGAGA TGAAGAGTAT GGTTTCCCGC 1020 

TTTCAGTTAC TGTTTTACGC TATGGTTCTA TCTACCGTTT GTTCTCCTTT GTGGTAGACA 1080 

TGCTTAATCA AGAAATGGGA CGAAACTTGG AAGTTATTCA ACGTCATGGG GCCCTGCTCT 1140 

TGGTTGAAAA TGGGCAACTC TTGTATGTAG AATTGCCTAA AGAAGGGGTC AATGTTCATG 1200 

ATTTCTTTGA GACAAGCAAG GTCAGAGAAA C C TT G TTGAT TGCGACTCGT AACGAAGGTA 1260 

AAACCAAGGA ATTCCGAGCT ATCTTTGATA AGTTAGGCTA CGATGTGGAA AATCTTAATG 1320 

ACTACCCTGA CCTGCCTGAA GTAGCAGAAA CAGGTATGAC CTTTGAAGAA AATGCCCGCC 1380 

TTAAGGCAGA AACCATTTCT CAATTAACGG GCAAGATGGT TTTGGCAGAT GATTCTGGTC 1440 

TCAAAGTCGA TGTCCTTGGT GGCTTACCAG GCGTCTGGTC AGCTCGTTTC GCAGGTCTGG 1500 

GAGCAACTGA CCGTGAAAAT AATGCCAAAC TCTTGCACGA ATTGGCCATG GTCTTTGAAC 1560 

TCAAGGACCG CTCGGCTCAG TTCCACACAA CCCTAGTCGT AGCCAGCCCA AATAAGGAAA 1620 

GTTTAGTTGT TGAAGCAGAC TGGTCAGGTT ATATTAACTT TGAACCTAAG GGTGAAAATG 1680 

GCTTTGGCTA TGATCCCCTC TTCCTTGTAG GAGAAACAGG TGAGTCATCA GCTGAATTAA 1740 

CCCTGGAAGA AAAAAATAGT CAATCTCACC GTGCCTTAGC CGTTAAGAAA C TTTTGGAGG 1800 

TATTTCCATC ATGGCAAAGC AAACCATCAT TGTAATGAGC GATTCCCATG GCGATAGCTT I860 
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GATTGTGGAA 


GAAGTCCGTG 


ATCGCTATGT 


GGGCAAAGTC 


GATGCTGTTT 


TTCATAACGG 


1920 


CGATTCTGAA 


CTACGTCCGG 


ATTCTCCACT 


TTGGGAGGGC 


ATCCGCGTTG 


TTAAAGGGAA 


1980 


CATGGACTTC 


TACGCCGGCT 


ACCCAGAACG 


TCTGGTGACT 


GAGCTTGGTT 


CGACCAAGAT 


2040 


TATCCAAACT 


CATGGTCACT 


TGTTTGACAT 


CAATTTCAAC 


TTTCAAAAGT 


TGGACTACTG 


2100 


GCCTCAGGAG 


GAAGAGGCCG 


CTATCTGCCT 


CTATGGTCAC 


TTGCATGTGC 


CAAGTGCTTG 


2160 


GTTGGAAGGC 


AAGATCCTCT 


TTCTAAATCC 


AGGTTCTATC 


AGTCAACCAC 


GAGGTACCAT 


2220 


CAGAGAATGT 


CTCTATGCTC 


GTGTGGAGAT 


TGATGATAGT 


TACTTCAAAG 


TGGACTTTTT 


2280 


GACACGAGAT 


CACGAGGTGT 


ATCCAGGTTT 


GTCCAAGGAG 


TTTAGCCGAT 


GATTGCCAAG 


2340 


GAGTTTGAGA 


CTTTCTTGTT 


GGGGCAGGAG 


GAAACTTTTT 


TGACCCCTGC 


TAAAAATCTA 


2400 


GCTGTGTTGA 


TTGATACCCA 


CAATGCGGAT 


CATGCGACCC 


TCTTGCTCAG 


TCAGATGACC 


2460 


TATACCCGTG 


TTCCCGTTGT 


GACAGATGAA 


AAACAGTTTG 


TTGGGACGAT 


TGGACTCAGA 


2520 


GATATTATGG 


CTTATCAGA? 


GGAGCATGAC 


TTGAGCCAAG 


AAATCATGGC 


GGATACGGAT 


2580 


ATCGTTCATA 


TGACAAAAAC 


GGACGTAGCG 


GTTGTTTCGC 


CTGATTTCAC 


CATTACGGAG 


2640 


GTCTTGCACA 


AGCTAGTAGA 


TGAGTCCTTC 


TTACCGGTTG 


TGGATGCAGA 


GGGTATTTTC 


2700 


CAAGGGATTA 


TTACGCGCAA 


GTCCATCCTC 


AAGGCCGTTA 


ATGCCCTCTT 


GCATGACTTT 


2760 


AGTAAGGAAT 


ATGAGATTCG 


ATGCCAATGA 


GAGACAGGAT 


TTCAGCCTTT 


TTAGAGGAAA 


2820 


AGCAGGCCrr 


GTCTGTCAAT 


TCCAAGCAGT 


CCTATAAGTA 


TGATTTGGAG 


CAATTTTTAG 


2880 


ACATGGTAGG 


TGAGCGGATT 


TCTGAGACCA 


GTCTCAAGAT 


TTACCAAGCC 


CAGCTAGCCA 


2940 


ATCTAAAAAT 


CAGCGCCCAG 


AAGCGAAAGA 


TTTCGGCCTG 


TAACCAATTT 


CTATACTTTC 


3000 


TCTATCAAAA 


AGGAGAGGTG 


GACAGCTTTT 


ACCGCTTGGA 


ATTAGCCAAA 


CAAGCTGAAA 


3060 


AGAAGACGGA 


AAAGCCAGAG 


ATTCTATACC 


TAGACTCTTT 


TTGGCAGGAA 


AGCGACCATC 


3120 






CTCTTAATCC 


TAGAAATGGG 


GCTCTTGCCC 


AGTGAGATTT 


3180 


TAGCCATCAA 


GGTTGCGGAC 


ATCAATCTGG 


ATTTTCAGGT 


GTTGCGAATC 


AGCAAGCCTT 


3240 


CCCAACAGAG 


GATTGTCACC 


ATTCCCACGG 


CCTTGCTTTC 


AGAATTGCAA 


CCCTTGATGG 


3300 


GGCAGACCTA 


TCTTTTTGAA 


AGAGGAGAGA 


AACCCTATTC 


TCGTCAGTGG 


GCCTTTCGTC 


3360 


AGTTAGAATC 


TTTTGTCAAG 


GAGAAAGGTT 


TTCCATCCTP 


ATCAGCTCAA 


GTCTTACGTG 


3420 


AACAGTTTAT 


TCTAAGACAA 


ATAGAAAACA 


AGGTCGATTT 


GTACGAAATT 


GCAAAAAAAT 


3480 


TAGGATTAAA 


AACAGTCCTG 


ACCTTAGAAA 


AATATAGATA 


ATGGATATTA 


AATTAAAAGA 


3540 


TTTTGAAGGA 


CCCCTGGACT 


TGCTCTTGCA 


TCTGCTTTCT 


AAGTACCAGA 


TGGATATCTA 


3600 
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CGATGTGCCC ATTACGGAAC TCATCGAACA GTATCTAGCC TATGTCTCAA CCCTGCAGCC 3660 

CATGCGTCTC GAAGTGACGG GTGAGTACAT GGTCATGGCT AGTCAGCTCA TGCTGATTAA 3720 

GAGTCGTAAA CTCCTTCCGA AGGTAGCAGA AGTGACAGAC TTGGGGGATG ACCTGGAGCA 3780 

GGACCTCCTC TCTCAAATCG AAGAATATCG CAAGTTCAAG CTCTTGGGTG AGCACTTGGA 3840 

AGCCAAGCAC CAAGAACGGG CCCAGTATTA TTCCAAAGCG CCGACAGAGT TGATTTACGA 3900 

AGATGCGGAG CTTGTGCATG ACAAGACGAC CATTGACCTC TTTTTGACTT TTTCAAATAT 3 9 60 

CCTAGCCAAG AAAAAAGAGG AGTTTGCACA AAATCACACG ACGATCTTGC GGGATGAGTA 4020 

TAAGATTGAG GACATGATGA TTATCGTGAA AGAGTCCTTG ATTGGACGAG ATCAATTGCG 4080 

CTTGCAGGAT TTGTTCAAGG AAGCCCAGAA TGTCCAAGAG GTCATCACCC TCTTTTTGGC 4X40 

AACCCTAGAG TTAATCAAAA CCCAGGAGTT GATCCTCGTG CAAGAGGAGA GTTTTGGAGA 4200 

TATCTATCTC ATGGAAAAGA AGGAAGAAAG TCAAGTGCCT CAAAGCTAGA CTTGATAGAG 4260 

AGGAAAGATG AGTACTTTAG CAAAAATAGA AGCGCTCTTG TTTGTAGCGG GTGAAGATGG 4320 

GATTCGGGTC CGCCAGTTAG CTGAACTCCT CTCTCTGCCA CCGACAGGCA TCCAGCAAAG 4380 

TTTAGGAAAA TTAGCCCAGA AGTATGAAAA GGACCCAGAT TCCAGTTTGG CTTTGATTGA 4440 

GACAAGTGGT GCTTATAGAT TGGTGACCAA GCCTCAATTT GCAGAGATTT TGAAGGAATA 4 500 

CTCTAAGGCG CCTATCAACC AGAGCTTGTC TCGGGCTGCC CTTGAGACCT TGTCCATTAT 4560 

TGCCTACAAA CAGCCGATTA CGCGGATAGA AATTGATGCC ATCCGTGGAG TTAACTCGAG 4 620 

TGGAGCCTTG GCAAAGTTGC AGGCTTTTGA CCTGATAAAG GAAGACGGGA AAAAGGAAGT 4 680 

ATTGGGGCGC CCCAACCTCT ATGTGACTAC GGATTATTTC CTAGATTACA TGGGGATAAA 4740 

CCATTTAGAA GAATTACCAG TGATTGATGA GCTTGAGATT CAAGCCCAAG AAAGCCAATT 4800 

ATTTGGTGAA AGGATAGAAG AAGATGAGAA TCAATAAGTA TATTGCCCAC CCAGGTGTGG 4860 

CCAGTAGGAG AAAAGCAGAA GAGCTGATTA AGCAAGGCTT GGTGACGGTT AACGGCCAAG 4920 

TGGTGCGTGA ACTAGCAACC ACTATCAAGT CAGGCGACAA GGTCGAAGTT GAAGGTCAAC 4980 

CTATCTACAA CGAAGAAAAG GTCTACTATC TGCTTAACAA ACCACGCGGT GTGATTTCCA 5040 

GTGTGACAGA TGATAAGGGT CGCAACACGG TTCTCGACCT CTTGCCCAAT GTCAAAGAGC 5100 

GTATTTACCC TGTCGGTCGT TTGGACTGGG ATACATCAGG TGTCTTGATT TTGACCAATG 5160 

ATGGGGACTT TACAGACGAG ATGATTCACC CTCGTAATGA GATTGACAAG GTTTATGTCG 5220 

CGCGTGTTAA AGGTCTGGCC AATAAGGACA ATCTCCGCCC CTTGACCCGT GGTCTTGAGA 5280 

TTGATGGTAA GAAAACCAAG CCAGCTGTTT ATGAAATTCT CAAAGTGGAC CCACTCAAAA 5340 

ATCGCTCTGT GGTGCAGTTG ACCATCCATG AAGGGCGTAA CCATCAGGTT AAAAAGATGT 5400 
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TTGAAGCTGT TGGTCTCCAA GTAGATAAGT TGTCTCGGAC TCGTTPCGGA CACCTAGACT 5460 

TGACAGGACT CCCTCCAGGA GAATCCCGTC GTCTTAATAA AAAAGAAATC ACCCAACTAC 5520 

ACACCATGGC TGTAACTAAG AAATAATGAA ACGAATTTTA ATAGCGCCTG TGCGCTTTTA 5580 

CCAACGTTTT ATCTCACCAG TCTTTCCACC CTCTTGTCGC TTTGAGCTGA CTTGTTCCAA 5640 

CTACATGATT CAGGCTATTG AAAAACATGG GTTTAAGGCG GTATTGATGG GCTTGCCTCG 5700 

GATTTTACGT TCTCATCCCT GGTCGAAAAC AGGTAAGGAC CCCGTTCCAG ACCGCTTTTC 5760 

CCTTAAACGA AATCAAGAAG GGGAATGAGG TGGGGTAAAT AGATTTCAAA ATGATAAAAA 5820 

CGCATCCTAT CAGGTTTGAG TGAACTTGAT AGGATGCGTT TTAGAATGTC AAAATTTTAT 5880 

ACTCTTCGAA AATCTCTTCA AACCGCGTCA GCTTTCATCT GCAACCTCAA AACAGTCTTT 5940 

TGAGCAACCT GCGGCTAGTT TCCTAGTTTG CTCTTTGATT TTCATTGAGT ATTAAATTGA 6000 

GTTTGAAGTG GCTTATTTCA AAGCTTTTTG TATGTCTTCA ATCATGAGTT TTGTTGATTC 6060 

AAGTCCGCCT CCGCTTAGAT ACCAGAGGTC TGGTGTTAGT TGGATAATCT TACCATTTTT 6120 

AGCAGCAGGT GTTTCAGCGA TAAGGGCATT TTCTAGGACA CCGTCGTTGC TAGAGTTCTC 6180 

CCCACCGATG GCAAGCGTAC GGTTGATGAC AAAGAGGATG TCAGGGTTGA TTTCTTTGAC 6240 

ACTTTCAAAG CTGACTTCTT GTCCGTGGCG TGAGTCTTCA AATTTTGTAT CAGTTGGTTT 6300 

GAATTTCAAG GTTTGGTACA AGAAAGAGAA ACGAGATTTG GCACCAAAGG CTGCCATTTT 6360 

TCCTTCATTA AGGAGGATCG CAAGGGCTTT TTTGTCAGAG CTTTCATTTT TAGTACCGAC 6420 

TTCTTGGATG CTCTTGTCTA GCTTGGTCAA TTCTTCCTTG GCTTTCTGTG TACCAGTTTC 6480 

GCCGAAGGCA CTTGCTAAGG ATTCGATATT AGCCTTGGTA GAAGTCCAGT AGTCGTCCTT 6540 

GCTTGCTTGG AAGAGAACGG TTCGGGCGAT TTCTTTGAAT TTGTCTACGA ATTTTTGTGT 6600 

ACGTGGCGAA GCGATAATCA AATCAGGCTC AAGGGCGGCG ATAGCTTCTA AATCAGGTTC 6660 

TTTCATAGAA CCAACATTTT TGACAGTTCC CACTAGGTCT TTTAGATAAG TCGGAACACT 6720 

TTTTGTAGGC ATTCCGACGA TATTTTTTTC AAATCCTAAA GCGCGAATAG TATCCGCAGC 6780 

GCCGAGGTCA AAGGTCACAA TCTTTTCAGG AACTTTGGAA AGTTTGACCT CGTCCAGTGA 6840 

ACTTTTAATG GTTACCTCTG TTGGAGCAGA GCTACTGGTC TCTGTCTGAC TACTGCTTGA 6900 

GTTTGTACTA CATGCACCAA GTAGGAGCAA GAAGCTGGCC ACTAGGGCAG TGAAATAAAG 6960 

TTTAAGCGAT GTTTTCATAA TTTCTCCTTT TTAAAATGTG ATAACGATTT AGCGAGTCTC 7020 

TTAATCTTAT TGACTAAGAG ACTGAAGGTT CTCTAACTTG ACCTTTTATG TTACTACCTA 7080 

TAGATACAGA TCTTTTTGTC ATTGATATCA GCTAGCGTGA TGGGAATCTC ATAAAGTTGA 7140 
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CTGAGCACCT CAGCCTGCAT GATTTGATCG CTTCTTCCCT TGCTAAAGAC CTGGCCCTCC 7200 

TTGAAGGCGA CAATTTCATC TGCATACTGA CTGGCCATGT TGATATCGTG GAGGACGATG 7260 

ATAATGGTCT TCCCGAGTTC CTCCACCAGT CGTCGAAGAA TCTGCATCAT GCTGACGCTT 7320 

TGCTTGATAT CGAGATTGTT GAGTGGTTCG TCCAGCAAGA TAAAGTCCGT ATCCTGGGCC 7380 

AGTACCATAG CGATAAAGAC GCGCTGGAGT TGCCCCCCTG ACAGGCTATT GATGTAGCGG 7440 

TCTTTTAAGT TGGTCAGTTC TAAATAGTTC AGAGTTTCTC GGATTTTTTC CCAGTCTTCT 7500 

GATCTAAGTC GACCTCGGCT GTAGGGAAAA CGTCCAAAAC TGACCAGTTC TTCAACAGTC 7560 

AATTTGGCTT GGTAATTGAT TTTCTGTTTT AGGATGGTTA CTTCTTCGGC CAGTTCTTGC 7620 

GAATTCCAGC TCTCGATTTC ACGTCCTTTG ATACTGAGAA CTCCCTGATC TTTCTTGGTT 7680 

AGCCTGCTCA TGATGGAGAG GAGAGTCGAT TTTCCAGCAC CATTTGGACC AATAAAGGCT 7740 

GTCAGTTTTT GAGGACTGAC TTCAAGCGAA ATGCCTTGCA AAATATCCTG TTTTTGAATG 7800 

GATTTGTCAA TGTTTTCCAG TTTCACTGAC GAGACCTCCT ATATAGTAAG ATAAAGAATA 7860 

AGAAGCCACC CACACTCTCA ATGATCATAC TGATACGAAT TTCCAGTGCA AAGACTCGTT 7920 

CAATCAAGGC TTGCCCCAAG GTTAAGCTAA TAAATCCAAC CAGAATGGCC ACTATAAAGA 7980 

GTAACTTGTG CTGATACTCT TTGACAATCA GGTAGGTGAG GTTGGCCAGT ATAAAGCCGA 8040 

AGAAGGCCAT AGGTCCTACC AAGGCAGTGG CCGTTGAGGT CAAAAGCACG ATTCCCCAGA 8100 

GGAGCTCTTT CTGTTCTTTT TCAACATCGA GTCCCAATAT CTGAGCCGTT TCTCTTTGCA 8160 

GGTGCAAGAC ATCTAGAACG ACTGCTTTTC GAAAGAAAAA GATTGTCAAA GCGAGGATGA 8220 

TCAGAGAACC GATGGCTAGG ATGGAAGTGT TGAGATGTTG AAAGGAGGCA AAAAGACTAT 8280 

TTTGCAGTTT ATCGTATTCG TTTGGATCCA TTAGGACTTG AAGGAAGGTG CTGATATTTC 8340 

GAAAGAGACT TCTGAGCGCT AGACAGATCA GCAGGACGAA GACCAGGTCT TGCTTCATCA 8400 

GTGTCTTCAA GTAACCTTGT AAGGCGAGAA AGAAGAGGGA CTGGACAAGA AGTAAGACTA 8460 

GGAATTCTAA GATAGGGGAT TTGCCAAGTT GAAGAAACTT GCTTTCAAAA ACCAGTAC1 A 8520 

GGGTTTGTAG TAGGACGTAG AAGGATTCAA TTCCCAAAAT ACTAGGCGTC AGGAAGCGAT 3580 

TTTCCGTCAG GGTTTGAAAA CTAATGGTCG AAATCCCAGT CGCGATGGCT ACCAAGAGAT 8640 

AAACGATGAT CTTTTGGGAA CGCAACTTCC AAGCAAAGGC TGACAAGTGA GTGATGGGCC 8700 

AAAAGTAGAG AAGACAAGCT CCGATGGCAA GAATAATGAG AATCCAGAAG AGCTTGGTAT 8760 

GTTTGCTTTT AGTCTGCATC TTTTCGTCCC CCTCTCCAGA GAAGTAGGAT AAAGACGAGA 8820 

CTACCGATGA TTCCTAGCAA GAGACTGACA GACAACTCAT AGGGCCTAAT CAGAACTCGG 8880 

GATAGGATAT CGCAAGCCAG AACTAGATTG GCACCAACCA GTGCGACCAT GAGTTTGGTT 8940 
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TCACTTAGAT TATCTCCATA GCGCTTGCCA ACAACATTCC GAACGATAAC TCCGAGAAAT 9000 

GGTAGGCCAC CCACGGTAAT CATGGTGACC CTTGTCGTTA GCGCCACCAG AAAGAGGGCC 9060 

AGTTTTTCAA GTAGGGAGTA GCAAATCCCC AAACTCTCGC TGGTTTCTTT CCCTAGATTC 9120 

ATGATGGTGA AGGTTTGGCA TAATTTCCAA ACGCTTATCA GGATGATGAG GCCTAAGAAG 9180 

AGCCACTCAT ACTGATGGGT CTGAATCATG GAGAAGGAGC CCTGGGTCCA GGCAGTCATA 9240 

CTCTGAACCA-CATTGAAACG ATAGG^ 9300 

TAGATGATCC CAATCAGAGG CAACATCCAC CTTTCCTTTA CAGTAAAAAT GGTCATAAAG 93 60 

GCTAGGAAGA AGAGGGTGAA TACGATGGAT GAAACAAAAG CGAAGAGCAT CTTGTGGGTC 9420 

ACACTAGCCG ATGGAAAGAC AAAAAGGCTC AGCACCATTC CCAGTTTGGC GGCTTCAGTC 94 80 

GTTCCAACTG TACTCGGTGC AGCAAACTGA TTTTCCCTAA TAGTCTGCAT GAGAAGGCCT 9540 

GCCATACTCA TACTAGAGCC AGTCAGGAGA ATACTGATAG TTCTTGGGAG ACGGGACTCT 9 600 

TGAAAGAGGA GCCAGGTCTG CTGGTCGAAA TCAAATAGCT TTCCCCATGA AAAATCACTG 9660 

GTCCCAATGC TAATAGAGAG AAAGACTAGG AGTAGAAGTA AGCCAGG 9 707 

(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5910 base pairs 
(B> TYPE: nucleic acid 
(C) ST RAND EON ESS : double 
(DJ TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 
CCCCAATTAT GCTTGAAAAG GAGTATACTT ATAAGTAACG CAAACGTTTG CGTCTCAAAA 60 

ATACGCAACG TTCCATTATT TTAACACACG AGGTGCTATT ATGAAAAAAC GTCAAAGTGG 120 

TGTCTTCATG CACATCTCTT CTCTTCCAGG AGCTTACGGA AT CG GAT CAT TTGGTCAAAC 180 

TGCTTACGAC TTCGTTGATT TCTTGGTCCG TACAAAACAA CGTTACTGGC AAATCCTTCC 240 

ATTAGGAGCA ACTAGTTACG GGGATTCTCC TTACCAATCT TTCTCAGCCT TCGCAGGAAA 300 

CACTCATTTT ATCGATTTAG ATATCTTGGT GGAGCAAGGT TTGTTGGAAG CAAGTGACCT 3 60 

TGAAGGAGTT GACTTTGGTA GCGATGCGTC TGAAGTTGAC TATGCTAAAA TCTACTATGC 420 

ACGTCGTCCT CTTTTAGAAA AAGCGGTGAA ACGTTTCTTT GAAGTCGGAG ATGTTAAAGA 480 

TTTTGAGAAA TTTGCTCAAG ACAACCAATC ATGGCTTGAG CTCTTTGCTG AGTATATGGC 540 

TATCAAAGAG TATTTTGACA ATCTTGCTTG GACTGAATGG CCAGATGCAG ATGCTCGTGC 600 
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TCGTAAAGCT TCAGCACTTG AAAGCTATCG TGAGCAATTG GCAGACAAGT TGGTTTACCA 660 
CCCTCTCACT CAATACTTCT TCTTCCAACA ATGGTTGAAA TTGAAAGCTT ACGCTAACGA 720 
CAACCACATC GAAATCGTTG GGGACATGCC AATCTACCTA GCGGAAGATT CAAGTGATAT 780 
GTGGGCAAAT CCACATCTCT TCAAAACAGA TGTCAATGGT AAGGCTACTT GTATCGCAGG 840 
ATGCCCACCA GATGAGTTTT CTGTAACTGG TCAGCTTTGG GGTAATCCAA TCTATGACTG 900 
GGAAGCAATG GACAAAGACG GCTACAAATG GTGGATTGAA CGCTTGCGTG AAAGCTTCAA 960 

AATCTACGAT ATCGTTCGTA TCGACCACTT CCGTGGCTTC GAATCTTACT GGGAAATCCC 1020 

TGCTGGTTCC GATACAGCAG CACCTGGTGA GTGGGTGAAA GGTCCAGGTT ACAAGCTTTT 1080 

TGCAGCCGTT AAGGAAGAAC TTGGTGAGCT AAACATCATC GCAGAAGACC TTGGCTTCAT 1140 

GACAGATGAA GTGATCGAAT TGCGTGAACG TACTGGCTTC CCAGGAATGA AGATTCTTCA 1200 

ATTTGCCTTC AACCCAGAAG ACGAAAGCAT TGATAGCCCA CACTTGGCAC CTGCTAACTC 1260 

AGTTATGTAC ACAGGAACAC ACGATAACAA TACGGTTCTT GGTTGGTACC GTAATGAGAT 1320 

TGATGATGCG ACTCGTGAGT ACATGGCTCG TTACACGAAC CGTAAAGAAT ACGAAACAGT 13 80 

GGTACACGCT ATGCTTCGTA CAGTATTTTC ATCAGTTAGC TTTATGGCAA TTGCAACTAT 1440 

GCAAGATTTA CTAGAATTGG ATGAGGCAGC TCGTATGAAC TTCCCATCTA CCCTTGGTGG 1500 

AAACTGGTCT TGGCGTATGA CTGAAGATCA ATTGACACCA GCTGTCGAGG AAGGTTTGCT 1560 

TGACTTGACA ACAATTTATC GCCGAATTAA TGAAAATTTG GTAGATTTAA AGAAATAAGA 1620 

CAATAATCAG GAGACAACTA AACATGTTAT CACTACAAGA ATTTGTACAA AATCGTTACA 1680 

ATAAAACCAT TGCAGAATGT AGCAATGAAG AGCTTTACCT TGCTCTTCTT AACTACAGCA 1740 

AGCTTGCAAG CAGCCAAAAA CCAGTCAACA CTGGTAAGAA AAAAGTTTAC TACATCTCAG 1800 

CTGAGTTCTT GATTGGTAAA CTCTTGTCAA ACAACTTGAT TAACCTTGGT CTTTACGACG 1860 

ATGTTAAAAA AGAACTTGCA GCTGCAGGTA AAGACTTGAT CGAAGTTGAA GAAGTTGAAT 1920 

TGGAACCATC TCTTGGTAAT GGTGGTTTGG GACGTTTGGC TGCCTGCTTT ATCGACTCAA 1980 

TTGCTACTCT TGGTTTGAAT GGTGACGGTG TTGGTCTTAA CTACCACTTT GGTCTTTTCC 2040 

AACAAGTTCT TAAAAACAAC CAACAAGAAA CAATTCCAAA TGCATGGTTG ACAGAGCAAA 2100 

ACTGGTTGGT TCGCTCAAGC CGTAGCTACC AAGTACCATT TGCAGACTTT ACTTTGACAT 2160 

CAACTCTTTA CGATATTGAT GTTACTGGTT ATGAAACAGC GACTAAAAAC CGCTTGCGTT 2220 

TGTTTGACTT GGATTCAGTT GATTCTTCTA TTATTAAACA TGGTATCAAC TTTGACAAGA 2280 

CAGATATCGC TCGCAACTTA ACTCTCTTCC TTTACCCAGA TGATAGTGAC CGTCAAGGTG 2340 

AATTGCTCCG TATCTTCCAA CAATACTTCA TGGTTTCAAA CGGTGCGCAA TTGATCATCG 2400 
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ACGAAGCAAT 


CGAAAAAGGA 


AGCAACTTGC 


ATGACCTTGC 


TGACTACGCA 


GTTGTCCAAA 


2460 


TCAACGATAC 


TCACCCATCA 


ATGGTGATTC 


CTGAATTGAT 


TCGTCTTTTG 


ACTGCACGTG 


2520 


GTATCGATCT 


TGACGAAGCA ATCTCAATTG TTCGTAGCAT GACTGCCTAC ACTAACCACA 


2580 


CAATCCTTGC 


TGAAGCGCTT GAAAAATGGC CTCTTGAATT CTTGCAAGAA GTGGTTCCTC 


2640 


ACTTGGTACC 


AATCATCGAA GAATTGGACC GTCGTGTGAA GGCAGAGTAC AAAGATCCAG 


2700 


^ CTGTTCAAAT 


CATCGATGAG 


AGCGGACGTG 


TTCACATGGC 


TCACATGGAT 


ATCCACTACC 


2760 


GATACAGTCT 


TAACGGGGTT 


GCAGCACTCC 


ATACTGAAAT 


CTTGAAAAAT 


TCTGAGTTGA 


2820 


AAGCCTTCTA 


CGACCTTTAC 


CCAGAAAAGT 


TCAACAACAA 


AACAAACGGT 


ATCACTTTCC 


2880 


GTCGTTGGCT 


TATGCATGCT 


AACCCAAGAT 


TGTCTCACTA 


CTTGGATGAG 


ATTCTTGGAG 


2940 


ATGGTTGGCA 


CCATGAAGCA 


GATGAGCTTG 


AAAAACTTTT 


GTCTTATGAA 


GACAAAGCAG 


3000 


TTGTCAAAGA 


AAAATTGGAA 


AGCATCAAGG 


CTCACAACAA 


ACGTAAATTG 


GCTCGTCACT 


3060 


TGAAAGAACA 


CCAAGGTGTG 


GAAATCAATC 


CAAATTCTAT 


CTTTGATATC 


CAAATCAAAC 


3120 


GTCTTCACGA 


GTACAAACGC 


CAACAAATGA 


ACGCTTTGTA 


CGTGATCCAC 


AAATACCTTG 


3180 


ACATCAAAGC 


TGCTAACATC 


CCTGCTCGTC 


CAATCACAAT 


CTTCTTTGGT 


GGTAAAGCAG 


3240 * 


CTCCAGCCTA 


CACAATCGCT 


CAAGACATTA 


TCCATTTAAT 


CCTTTGCATG 


TCAGAAGTTA 


3300 * 


TTGCTAACGA 


TCCAGCAGTA 


GCTCCACACT 


TGCAAGTAGT 


TATGGTTGAA 


AACTACAACG 


3360 


TTACTGCAGC 


AAGTTTCCTT 


ATCCCAGCAT 


GTGATATCTC 


AGAACAAATC 




3420 


CTAAAGAAGC 


TTCAGGTACT 


GGTAACATGA 


AATTCATGTT 


GAACGGAGCT 


TTGACACTTG 


3480 


GTACTATGGA 


CGGTGCTAAC 


GTGGAAATCG 


CTGAGTTGGT 


TGGAGAAGAA 


AACATCTACA 


3540 - 


TCTTCGGTGA 


AGATTCAGAA 


ACTGTTATCG 


ACCTTTACGC 


AAAAGCAGCT 


TACAAATCAA 


3600 


GCGAATTCTA 


CGCTCCTGAA 


GCTATCAAAC 


CATTGGTTGA 


CTTCATCGTT 


AGTGATGCAG 


3660 


TTCTTGCAGC 


TGGAAACAAA 


GAGCGCTTGG 


AACGTTTTTA 


CAATGAATTG 


ATCAACAAAG 


3720 


ACTGGTTCAT 


GACTCTTCTT 


GATTTGGAAG 


ACTACATCAA 


AGTCAAAGAG 


CAAATGCTTG 


3780 


CTGACTACGA AGACCGTGAC 


GCATGGTTGG 


ATAAAGTCAT 


CGTTAACATT 


TCTAAAGCAG 


3840 


GATTCTTCTC 


ATCTGACCGT 


ACAATCGCTC 


AGTATAACGA 


AGACATCTGG 


CACTTGAACT 


3900 


AATACTCTTC 


GAAAATCTCT 


TCAAACCACG 


TCAGCTTTAT 


CTGCAACCTC 


AAAGGAGTGC 


3960 


TTTGAGCAAC 


TGCGGCTAGC 


TTCCTAGTTT 


GCTCTTTGAT 


TTTCATTGAG 


TATAAGATAC 


4020 


AAATTTATAC 


TAATACATTT 


TGTAAAAAAG 


CGAGTTTCGA 


TTGAAATTCG 


cttttttaat 


4080 


GATGTAGATT 


TGGGTCAATC 


TTGTCTAAAA 


ATAGGGAAAT 


CCTAGATACA 


GTGAAGGCTT 


4140 
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TAAATGCTGG 


TTTTTACTCT 


CCTCACCCTT 


ATATTTTTTC 


GTAGTTGGTT 


ACCTCATATC 


4200 


TATTATATTC 


GCTTACATAA 


AGTATTATAA 


TATAATTGTA 


GGAAAGAAGG 


TGTTTTTATG 


4260 


ATATACACAC 


TTAAATTGGT 


GTTGTTTATT 


ACCTTTCTTG 


TAATAAGCTT 


GTTACCTGAT 


4320 


AAGATTTTTG 


CAAAAAATAA 


AAAAATTTGG 


AAAATAGTTT TTGCAATATT GACGGCAGTG 


4330 


GCAGCATTGT 


CATTTATGTA 


CTAAGTTATT TTAAGAATGT AGGGAAATAA ACCCTACATT 


4440 


CTTTTTAGTT 


TTTTCTGTTT 


TCTAAATTCT 


ATTTATCCAA 


GCGATTCAAC 


ATTTCTTGCT 


4500 


TCTTCGCTTC 


AAGTTCTGCA 


CGCTTTTCTT 


CGATTTCGGC 


ATGTTTTTTC 


TCGAGTTCAG 


4560 


AACAACTTGC 


ACCATTGCTA 


AATTCTTTTC 


GCCATCAGGA 


GATAGGGTGA 


GTCGACATGT 


4620 


CTATTACTCA 


CCCAAAGCAG 


TCCTACAAAG 


CAGGAATTTT 


CTCTTACTTT 


TTTGGAAATA 


4680 


GTAACGTTTA 


TACAGCTTTG 


ACACTTCCTA 


TCAAAGCGCC 


AAACACACTC 


CGAGGCGTTT 


4740 


ACAGAAAGCA 


GAAAACGAAT 


GATCTGGTAT 


AAGATCATTC CTTTTCyCTC 


TTTTTCTTTA 


4800 


ACTAATTATA 


TACAATGTAC 


GACGAAGTCG 


TCATTGCAAT 


GCTGATCCAC 


CACCTAAAGG 


4860 


GAACTTTAAA 


CAACATTGAT 


AAGATAAAGA 


ATATAAACAA 


CGAAAATACG 


TTATACCCAA 


4920 


TTAATTTTAT 


TGTATATCTC 


ATGATTAAAA 


GTTAATCCTT 


CCGTTGTTAG 


GAATGGCATC 


4980 


ATTTTTATCC 


CATAATTGTG 


CTAAATAAGT 


CCCCGGTGAT 


AATAAATTCA 


TAGCGAATTC 


5040 


TAAAGCAACA 


TCATTTACAA ACCAACTACC 


TAGATATCTA 


GAAATTGCTG 


AACGAATAGC 


5100 


ACTTTTTGCT 


GCATGTTTTC 


CTTTTACTTT 


AATTAGATTT 


GCAAGGCCTG 


CAGTAGTTCC 


5160 


TCCTAATGCT 


AAAGCTATTG 


CAGTATCTAA 


TAGAGCACCC 


ATTTGATTAA 


CTCTAATACC 


5220 


TTGCCAAACT 


GCTCTAAATG 


GAGAGTATGT 


AGGTGGGATT 


GTATAATCGC 


CTTGTAATTG 


5280 


TCGGTTAATT 


ACTTCTTTGA 


TCCATTGTTG 


TGAGACGTCT 


GGATGAAAAG 


ATTGCATTTC 


5340 


GTTTGCAAGT 


GTATTGATTT 


GTTCTTCTGT 


TAGAGAAGTG 


ACAGGTTGAA 


GTTCCATATT 


5400 


TGTTTCAATT 


TGTGATACTT 


GTTCAGAAGC 


GTATACAGCT 


GAAACACTTG 


GAATCGCTGA 


5460 


TACAATTAAC 


ACAATTGACG 


TCAAAAAAAC 


CGAAATAAAT 


TTCATTAATT 


TGTTCATGAG 


5520 


CTTTTCTCCT 


TTTTATTTGC 


ATCTGCTTAC 


ATTTTATCAT 


ATACTGTTAT 


TATAGTCAAA 


'SSSO 


AAAATATGCT 


ATTATGTTAA 


AAAAATATTT 


TTCAAAATAT 


AAATGCACGG 


ATTTATTTTG 


5640 


GATTTTATTT 


GTTATTTTGA 


CCTGCCTCTA 


TATTGGTAAC 


CATGATTTGT 


TTACTCTCAA 


5700 


TCATCAAGAA 


TTCTCTTTTC 


GTGGTAGCGT 


TTGGGGTCTG 


GTACTGGCCT 


TATATCACTT 


5760 


ACTATTCATT 


GATAAGTTTG 


TTATATCCAA 


TCGAAAATAA 


AGATTAGAGC 


TATGCTTGAC 


5820 


TGTGTACTTT 


TAGGATTTAT 


TTTGGAGGAA 


GATTTTGTCT 


CTATTATTTA 


TTATTTTAAA 


5880 


TTTATTTAT T 


TTGTATAAGA 


TCTATTCTTT 








5910 
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(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5406 base pairs 
<B) TYPE: nucleic acid 
<C| STRANDEDNESS: double 
(D) TOPOLOGY: linear 

fxiT SEQUENCE DESCRIPTION: SEQ ID~ NO? 166: 

GGCATAGCGA CTCATTTTTT CAACTGTCCA GGCTGGATAC CAGACTAATT TAACCTCAGT 60 

ATCCCTTACT TCTGGAACCT CTATCATAGC ATCATAAATC TGGTCTGTCA AAAGGTCTGC 120 

TAAGGGACAA CCCATAGTTC TCAAAGTCAT CTCAATCTCT GTTTGCCCTG TGTCACCGTC 180 

AAAACGAATC TCATAGATCA AACCAAGATT GACAATATCG ATTCCCAACT CAGGGTCGAT 240 

GACTTCTTCC AAGGCTGTTA AAATCCGTGT TTTGATGTTT TCAATTTGCT CTTCTGTATA 300 

AGCCATATTT TCCTCACTCT TAGTCTTCAA TAAAATCACG AAGCGGTTTG CTACGACTTG 3 60 

GTTGGCGTAG TTTTCTCAAA GCCTTTGCTT CAATCTGACG GATACGCTCA CGAGTTACGT 420 

TAAAGACTTT CCCCACATCT TCAAGTGTGC GCATTTTTCC ATCATCTAGT CCAAAACGTA 480 

GACGCAGAAC ATTTTCTTCA CGGTCTGTAA GAGTATCTAA GATTTCATCC AATTGCTCAC S40 

GCAAGACGAT ACGAGTCGTA TAATCCACTG GATTTTCAAT CACTTCATCT TCGATAAAGT 600 

CTCCAAGCTG GCTATCGTCC TCTTCACCGA TAGGAGTTTC AAGAGATACT GGTTCTTGGG 660 

CAATCTTCAA GATTTCACGA ACCTTATCAG GTGTCATATC CATTCCTTCA GCAATCTGTT 720 

CTGGTGTCGG ATCTTGCCCC AATTCTTGAA GGAGATTCCG CTGTTCACGA ACCAATTTAT 780 

TGATAGTTTC AACCATGTGA ACTGGGATAC GGATGGTACG AGCTTGCTCC GCAATAGCAC 840 

GAGTGATAGC CTGACGAATC CACCAAGTTG CATAAGTTGA AAACTTGAAC CCTTTAGAAT 900 

AGTCAAACTT GTCAACCGCC TTCATCAAGC CCATATTTCC TTCTTGAATC AAGTCAAGGA 960 

ACTGCATACC ACGACCGACA TAGCGTTTGG CAATGGAAAC AACCAAACGA AGATTGGCTT 1020 

CCGCAAGACG TTGTTTGGCT TCGATATCAC CAGCTTCAAC AGCCAGTGCC AACTCTTTCT 1080 

CCTCTTCATT GGTCAAGACA GGAACGACCC CTATTTCTTT CAAGTACATA CGGACAGGGT 1140 

CATTGACCTT AGCAGAAGTT GACCCAATCA AGTCCTCATC GCTGAGTTCT GGTTCTTCTT 1200 

CATTGCTGAG AACACGCGCA CTTGGATTTC CTTCCTTATC TGTGATAGAA ATGCCTGCAT 1260 

CCTGAATCCG TTGCAAGAGA TCTTCAATCC CATCAGCGTC CAAGGTAAAA GGAATAACCA 1320 

GACTTGCATT GATTTCATCA TCTGTTGCTG TCCCTTTTTG CTTATGATTA CGGATAAATT 1380 
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CTGCTACCTG TACGTCAAAT GTTGTTACTT Cm TlV l Tr TGTTGCCATT ATTACTCCAT 1440 

TCTTCTCTTT TGGGAAATTA AACGTTCCAA TTCTTCTAGG GCTGTATCTC TATCTCCTAC 1500 

ATGGCTAGCT TCCTGCACCT TCTTTTTGAT TCTCATATTG TCCTGATTCA AGAGAGCCTT 1560 

GTTTCGAGTC ATCTCTACTT CACTAAGTTC CTGCGGCGAT ATCTCAGCAG GCAAATCCTG 1620 

AGCTAAAACT TGGTACCAAG CTCTTTCAAC TTCCTCTGTC TGCTCTGCTA AAACTTCTGG 1680 

AGGAAGATTT CCATACTGGC CAAGCAAGTC ATATAAGACC TGAAATTCAC GTGTAGCAAA 1740 

TGCAAAGTCT TCTCGCAAAC GGTAATCGTT CAAAACAAGA GGGGATTCCA TCATCCGATA 1800 

GAGTAGATGG GCTTCTGCCC TCATAATAGC CGATAACTGC TTGGTGACAG GCATGGTGAT 1860 

TGGCGTCGGT CTGGAAATTC CTTCCATGCG ATTCTGCCTT TGCACCTGAC GACTCTCATT 19 20 

AACAATCTGC TCAATCTCGG TATAATCAAA GGACGCCAGA CTGTCAGCTA AAATATGAAT 1980 

ATAGCTGTTT TGAGCAGCGA TGGA C TTTTC TTGAACAATC AAGGGAGCTA TTTTTTCAAG 2040 

AAACTCAATC TGAGCCTGCA GATTTTCACT GTTTTCAGGT TTGTACTGAT GAATGTAGAA 2100 

CTCAATCGGA CTAATACGAG TTTTCGTTAA TAGATAGGCC AAGTCTTCTG GACCATTTTT 2160 

TTGTAGATAC TCATCAGGAT CCAAGTTATC AGGCATGCTG ACGATTTGCA CAGGCATATC 2220 

ACCAATTTCA TCCAATGCTT TCAATGTCGC GGCTTGCCCA GCCTTATCTC CATCGTAAAC 2280 

AAGAACCAAT TTCTTGGTTA ACCTTTTCAG ATGCTCAACA TGCTCTCGAC TCAAGGCTGT 2340 

TCCCATCGAC GCCACAGCAT TTTCGATTCC AGCCCGATAG GCTGCAATAA CATCCATGAA 2400 

TCCTTCCATC AGGTAAATCT CACTACCTTT TCCAGAAGAT CTTTTTGCCC TATCCATATG 24 60 

ATATAATTCG TAA C TTTTGT TAAAAATTGC AGTCGATCGG CTGTTTTTAT ACTTAGAAGT 2520 

TTGTGAATCC GTTTTTTGCC AGATACGACC TGAGAAGGCA ATGACCTTTC CTTGGTCATT 2580 

TGTCAGGGGA AACATAATGC GATTGTGAAA GGTGTCTACA AATTGATTGG CATCCGAGAG 2640 

ATAAAACAGG CCTCAATCCA GTAAATCCTC TTCACGATAC TGATCAGACA AACGTTGATA 2700 

GAGATAGTTT CGTTCTGGAG GTGCTAAACC AATCCAAAAA TGTTTAAGCA CTTCATCTGT 2760 

CAACCCCCGC TGATAAAGGT AATTTCTGGC CTCTTCGCCC ATAGTCGTTG TCATGAGAAT 2820 

AGCATGGTAA AATTTGGCTG CATCTTCGTG CATATCATAA AGAGCTTGGT GAGGTGAGGC 2880 

TGACTTCTGC TCACTATAAA GCG G TTTT TC AACCTCAATT CCAACACGCT GACCTAAGAT 2940 

TTGGACTGCT TCTATAAAGG GAACCCCTTG GTACTCCTCG ATGAACTTAA AGACATCACC 3000 

TGAGCGACCA CAACCGAAAC AGTGATAAAA CTGCTTGTCC TCTACAACAT TGAAAGATGG 3060 

TGTTTTTTCA CCATGAAAAG GACAGAGCCC TAGATAGTTC CGTCCTGCCT TTTGTAAAGA 3120 

AATCACATCT CCTATGACTT CCACAATGTT GGCATTGTTT TTGATTTCTT CAATGACTTG 3180 
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TTTGTCAACC 


ATACACAATA 


CCTCCATGTT 


ATCATAGTTT 


ACTTTATATA 


CTATACTTTA 


3240 


TTTCAGAAAA 


AAAGTAAACC 


ATTTCACTCA 


TTTTCCCTAC 


TTTATTCAAA 


GAGTTGATAA 


3300 


TAATCAGAGA 


TTTTCATTTT 


TGCTTTTTCT 


TCTTGGTTTA 


AATCTTCGAT 


AATTCGTCCT 


3360 


TCTTTCATGA CAATCAAGCG ATTGCCGTAT TTGAGAGCAT CTTCCATATG 


ATGAGTAATC 


3420 


ATAAGGGCTG 


TTAGCTGATC 


TTTCTTAACA 


AATTCATCTG 


TCAATTCCAT 


CAAAGCAACA 


3480 


CTAGTCTTTG 


GATCCAGGGC 


"agcagtatgc 


TCATCTAACA 


GGAGTAATTC 


AGCTCGCTTC 


3540 


AAGGTTGCCA 


TCAAGAGACT 


CAAAGCCTGT 


CTTTGTCCAC 


CTGATAACAA 


CTCAATCGGT 


3600 


GTATTCAAGT 


GTTTCTCAAG 


ACCATTTCCT 


ACTTTTTCAA 


TGGTTGCCTG 


AAATTCATCC 


3660 


TTATAGCTAG 


TCAAGCCTCG 


TGGTAACAAT 


CCACGCTTTT 


CACCACGAAA 


CTTGGCGATT 


3720 


AAAAGATTTT 


CAGCGACCGT 


CATACGGGGA 


GCTGTCCCCA 


TCTTTGGATC 


TTGGAAGACA 


3780 


CGAGACAGGT 


ACTTGGCACG 


CTTCTCGGGT 


GAAAACTTAG 


TGAGATCTTC 


ACCTAAAATA 


3840 


CGGATAGTTC 


CACTAGTTAG 


TGATAAGGTC 


CCTGCTATAG 


TGTTAAAGAG 


AGTTGATTTT 


3900 


CCAGCACCAT 


TTCCGCCCAA 


AATCGTGATA 


AAGTCCCGTT 


CAAAAATTTC 


TAACGAAACA 


3960 


TCATTTAAAA 


TAATCTTTTC 


TTCATCAAAG 


CCATTTTTAA 


CGATTTTGGT 


TGCATTTTTT 


4020 


AATTCTACAA 


TTGCTGTCAT 


TTGCTTAACT 


TGGCTCCTTT 


CAAGATTGTT 


TGCTTAAATG 


4080 


TTGGAATCAT 


GAGGCAGACT 


GCTAAAATCA 


AGGCACTGTA 


TAAACGAAGG 


TAACTTGTAT 


4140 


TAAAGCCAAG 


TGCGATAACT 


GCCCACACTA 


AAAATTGATA 


AGCGATAGAA 


CCTACAACGA 


4200 


TAGTAACCAA 


ACGCTCTGCC 


AAGCTCAAAC 


TCTTGAAAAT 


AACTTCTCCA 


ATAATCAAAC 


4260 


TTGCAAGCCC 


CACAACGATA 


ACCCCGATCC 


CTCGAGACAC 


ATCGGCATAA 


CCTTCTTGCT 


4320 


GAGCAATGAG 


GGCACCTGCA 


AGGGCAATCA 


CACCATTTGA 


TAAGACCAAG 


CCCATGAGCT 


4380 


CCATGCGTCC 


AGTATGAATC 


CCGAAACTTC 


TAGCCATATC 


AGGATTATCC 


CCTGTAGCAA 


4440 


TATAGGCTTG 


TCCGAGTTTA 


GTGTCCAAGA 


AAAAGAGCAT 


GAGAGCAATA 




A C Art 


CAAAGATCAG 


ACCTGTCAAG 


AGTTGATTCA 


AATCCGAATC 


AAAAGGCAAA 


ACATCCTGAA 


4560 


TTTGCTTGCT 


TCCAAGCAGG 


CCTAAATTCG 


CACGTCCCAT 


AATCAAGAGC 


ATGATTCAGT 


4620 


GACAAGAAGT 


CATCACCAAA 


ATCCCTGAGA 


GCAAGGTTGG 


GATCTTCCCT 


TTTGTATAAA 


4680 


GAAGGCCTGC 


TGCCATTCCA 


GCCAAACAAC 


CTGCTCCTAC 


AGCAACAAGT 


GTCGCTAAAA 


4740 


ATGGGTTCAC 


GCCTTTGGTT 


ATCAAAGTGA 


CAGCAACAGC 


TCCCCCAAGA 


GGGAAGGAAC 


4800 


CTTCTGTCGT 


CATATCTGGA 


AAGTTTAAAA 


TCCTAAATGT 


CATAAAGATT 


CCCAGACCTA 


4860 


GAATAGCCCA 


GACAAATCCT ' 


TGAGAAATAA 


TGGAAACAAT 


CATATTTTAT 


TTAATCCTTT 


4920 
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CTATATTCAT CTTTTTAAAA AATGGGAAGA GTCTCCTCCT CCCTACCTTA TTTATTCGAT 4980 

GACTTGTCCT GCTTCTTTGA GAACAGACTC AGGAATAGTA ATACCTAGTT CTTGTGCTAT 5040 

TTTTTTATTG ATGACTGACT TACCAGTTCA AAAGACATTG ACTGGGGTAT CGGCTGGTTT 5100 

TGCACCTTTC AAGACTTGCA CAATCATTTT ACCTGTTGCC ACACCAAGGT CATCTTGGTC 5160 

AATTACAACT GATGCCAAAC CACCTACTTC TACCATAGCT GTCGCACTGG GATAAATTGG 5220 

TTTCTTAGAA CTTTGATTGC TAGAGACAAC CGTTGGAAAT CCTGATGCAA TGGTGTTATC S280 

AATTGGAACC CAAATAGCAT . CTACCTTGCT AGTCATAACA CTGACACTTC AGGCAATTTC 5340 

ATTTGTTGAA GGAACTGCAA ATGTTTCCAC TGTCAGACCT GCCTTTTCAG CATAAGCCTT 5400 

AAATTC 54(J6 

(2i INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9711 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 



CAGCTTGCTC 


TTACTATTAT 


AGCAGATGTT 


ATAGCTGGAA 


TTATCTTGTA 


TTTCGTCTGC 


60 


AAATGGCTAG 


ATGGTAAGAA 


GTAGACCGAA 


TGACTACCCT 


ATAAACACCC 


GTTAAATCGC 


120 


TAAGATACGT 


CAAAAAAGCC 


CTTAACTATG 


GCACTAGTTA 


GGGGCTTTCG 


TCTTCTAATG 


180 


AACCTTATAC 


ACTAACTACA 


TTCTAGCATA 


TAAGCCCAGA 


TATTTCAAGA 


GTTTTATTTA 


240 


TTGTTTAAAG 


TTCTGAAAGG 


TCTATAATGA 


AGTTAGCCAT 


CTAGTATCAA 


AAAACCGACT 


300 


AGCTCTTATG 


AACTAGTCGA 


TTTCTCATCA 


ATGCGCCAAC 


ATTTCTTGGG 


CGATTTCTTG 


3 60 


GCCAGATAGG 


TTATCTGGCT 


AGTAGGTTGG 


CCAGTTGTCC 


ATTTCTTCAA 


AGAGGGCTTC 


420 


TTGGCTTGTG 


CCTCCAAAGA 


AGATATGGAA 


ATGTTCTGCC 


TTAACTGGGG 


CAACATTC4 G 


480 


GTCACTAAAC 


TGAACATACT 


TGAATTGTCC 


AGCGTCAGCA 


TCTGTGGCTT 


CAAAGAGGAA 


540 


ACGCACGCCA 


CGATTGCCTT 


TCTTGTAAGT 


CAAAATTTTC 


TTACCGACAT 


ACTTGTAAGT 


600 


GTATTTCTTG 


CTTTGTCCAC 


CTTGAACAAA 


TTCCATAGTA 


TTATCAGTAA 


TGTTAATCTT 


660 


AGTCACATCT 


GTATGATAGC 


cttttctata 


GTAAGCCTTG 


TACTCAGCCT 


GGGTCATCTT 


720 


ACCAGTCAAC 


TTAGCCTTGT 


AGTCAAAGAC 


TTGGTCAAAC 


GTGCCGTCTT 


CAAGGAAAGG 


780 


ATAAACTGAT 


TGCCAGTTAC 


CTGCATAGTC 


ACTCAAGGTG 


CGGTCCTTGA 


CAGCTGCATC 


840 


CTCGAAGTAA 


CCATTTTGGA 


CTGTCTTGGT 


ATCCTCTGCC 


TTTTCAGGTT 


CAATTGCTGG 


900 
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GCCTTCTTGG TCTGTTGTTT GTTTCAAAGC CTTGAGGTTT TTCTCCATCA CGGAAATGTA ' 9 60 

GTTTTCTCCA GCCTTGGTGT CCTCTTCTGT CAGACTTTCT AAAGGATTGA GGACATCAGT 1020 

TTTGACACCT GCTTCTTTTG AAAGTGTGTT AGCAAGGGCT TGTGAGGCAT TTCTTCAAAA 1080 

TAGATATAGG CGATTTTATT TTTCTTGACA TACTCTGTCA ATTCTGCCAA GCGAGCAGCT X140 

GATGGCTCTG CATCTGGAGA AAGTCCTGAC ATTGCGACTT GTTTGAGTCC ATAGTCCAAG 1200 

GCAAGATAGT TAAAGGCTCC GTGTTGAGTC ACAAAGCTCT TTTGTTTTGC TTGAGACAAA 1260 

CCTTCTGCGT AAGCCTTATC CAAGGCTTGC AATTTTTCGA TATAGGCAGC TGCATTCTTC 1320 

TCAAAGGTCT CTTTTTTATC AGGATAATCT GCTGACAAGC TGTCGCGGAT GTGCTCTACT 1380 

AGTTTAATCC CACGAACTGG TGATAACCAA ACATGGGGGT CAAACTCATG GTGATGACCT 1440 

TCTTCTCCAT GGTCATGGTC TCCCTCTTCT TCCTCGCCAC CTCGCAAGAG CAACATATCG 1500 

CCTCTCCCCT TGATGGTTTT CACTTTTTTC TTATCCAAGG TATCTAGCAA TTTAGGTACC 1560 

CATCTTTCCA TGTTTTCATT TTCATAAACG AAGGTATCTG CATCTTGGAT TTTGGCAACT 1620 

GCCTTGGCAG ATGGTTCGTA TTCATGAGGT TCTGTCCCAG CACCGATTAG GAGTTCTACA 1680 

TTAGCCGTAT CTCCTGCGAC TTGCTTGGTA AATTCATAGA CAGGGTAAAA GGTTGTCACG 1740 

ATATTGAGTT TACCATCTGC CTGTTTTTGA TTGGAACAAG CCACTAAAAA CAAGGCACAT 1800 

AGACTGGCTA GTAATAAGCT AATTTTTTTC ACGTTCGTCT CCTATTTGAT AAAACGTCTT 1860 

ACTAAACTGA TTAGTATAAA GACAGTTACA AAAATAATGG TAATACTTGC ACTTGCAGGT 1920 

GTTTCTGCAT AGTAGGAAAT GTAAAGTCCT GCTACCATTC CCAAAAAGCC AATCGCACTG 1980 

GCAAGCAGCA TAACCGATTT AAAGTTTTTC CCCAGACGCA GGCCAATACT AGCTGGCAAG 2040 

ACCATAATGG TCGATACCAG AAGAGCTCCT GCTGCAGGAA TCATAAGGGC AATAGCCACC 2100 

CCTGTCACCA TGTTAAAAAG AATGGACATG GTACGAACTG GCAAGCCATC CACAAAGGCC 2160 

GTATCTTCGT CAAAAGTTAA GATATACATA GGACGAAGAA AGAGAAAGGT CAAAATCAAA 2220 

ACAACCGCCG CAATGACAAA GAGGGAAATG ACCTGTTCTT CACTGATAGT CACGATCGAA 2280 

CCAAAGAGAT ATTGGTCCAA ACTCATTGAA CTCGAGCTTT TACCCTTGCT CATGACAATC 2340 

AGAGAAACAG CCAGACCTGT TGACATGAGC ATAGCTGTCC CGATTTCCAT AAAGCTCTTG 2400 

TAAACCGTAC GGAGATACTC CAGAAAGACC GCCGCAATCA AGACAATGGC AATAGTAGAA 2460 

ACAGTTGGAG AAATCCCCAA AACCAGACCA AAGGCTACAC CTGAAACTGA GACGTGGCTA 2520 

AGGGTATCAC TCATCAAACT CTGACGACGC AAGATGAGGA AGGTTCCCAA TACCGGTGAG 2580 

AAAAGACTCA TAGCAATAAC CGCCAAAAAG GCGCGTTGTA TAAAGTCGTA AGATAATAAA 2 640 
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CTAAGCATCG 


CCCACCTCCT GGCCATTCTC 


ATGAACATTG 


AAACAACCCC 


ATGGCGAGTC 


2700 


TTGGTTACGG 


ACTAGATGAA TATTGCGATC 


CGCATAATCC 


TTAACTTCTT 


CAGGGTCATG 


2760 


GGTAATCATC 


AAAACAGCCT TGCCATGATG 


ATGGGCGCTG 


TGGTGCATGA 


GTTCGTAAAA 


2820 


TTCATTTTTA 


CTTCCTGCAT CCATCCCCGT 


TGTCGGCTCG 


TCTAGGATAA 


ACACATCAGG 


2880 


GTCACAAGCA 


AACATACGCG CAATTACCGC TCGCTGCTTT TGTCCCCCAG 


ATAGAGACCC 


2940 


CAAGCGTTTG 


TCTCCATGTT CCCACATGCC 


AACTGAGTCC 


AGACTAGCCT 


TGATATGCTC 


3000 


CTCATCATGA 


GCATTCAAAC GACGGAACCA 


GCCTTTTCTC 


GGATAGCGAC 


CCGACTTGAC 


3060 


AAATTCATAG 


ACCGTACTTG GAAAACCAGC 


ATTAAAACTG 


GCAATTTGTT 


GAGGAAGATA 


3120 


GGCTATTCTC 


AATTTCTTAC CTTGCGTATT 


TGTCTTTGAA 


ATAGCCACCT 


TTCCAATCCC 


3180 


TGGTTGCAGA 


ATTCCAAGAC TAGCCTTGAT 


GAGCGTCGTC 


TTAGCCCCTC 


CATTTTCCCC 


3240 


AGTCAAGGTA 


ACAAATTCCC CACTATCAAC 


ACAATAATTG 


ATATGTTCAA 


GAACAGGCTC 


3300 


CTTATCATAA 


TAGAAGGACA AATCCTCTAC 


CGTAATATAT 


CTCATTATTT 


GATTTCTCCT 


3360 


ACTAAAGCAG 


TCAAAAACCG CTGAATCACT 


TTTTGTTCAT 


TTGGAGTAAA 


CTGAGTCGCC 


3420 


ACTTGTTCAT 


AGGTTAAAAG TGTATGCTCA 


TGGTGATGGT 


GGTGCTCCTC 


AGCGATTGGA 


3480 


CGAGCCAAGT 


CAGTCAACTG ATAAAAAATC 


ACACGCGCAT 


CTTTAGAATC 


TTTAGATGTT 


3540 


TCCAACATCC 


CTTCCTTGAC CAAAGACTTA 


ATGGCCTTGG 


TAACTGCCGC 


CTGACTGACA 


3600 


TTGAGACGAC 


GGGCCAATTC TGAATTTGTT 


AAAGATTCCT 


CTGACAAGAG 


CATAAGGATA 


3660 


TGCTCCTGAG 


TATTGGTCAG GGCCACCTCG 


CTAGTGCAAT 


GACCTATTAG 


GATTTCATGC 


3720 


TGATTTTCCG 


CCTGCAAAAT CACCTCATTC 


AAAAAAGCAT 


TGATATCCTT 


TGCTAGCTGT 


3780 


CTCATATCTG 


ACTCCTTTCC TTTTAGACTT 


CTCTTTTTTA 


AGAGAAAAAT 


ACTATTCTTT 


3840 


GACATTTTGT 


TTACCAGTTA ATTATATCAC 


AAGCAAAAAA 


AGAGTCAAGA 


AAAAACGTGA 


3900 


AAACTAGTTT 


CATTCTTGAA CTCTTCTATA 


TTATATTATC 


TATTGAAATT 


CTTTGACATC 


3960 


TCCATCATAA 


GTCGCCCAAT CTTTGCTGAA 


AAAGCGCTCA 


TTCAGATGGT 


AAGTCGGAGC 


4020 


TGGTGTGGGA 


TTGGATAGGA AAGGATCAAC 


TGCCTTGTCA 


AAAGCCAACC 


AACCCAACCA 


4080 


ACCAAGGTGA 


ATGGTGTCCT TCATAAAGAA 


AGGCTCCCCG 


CCGTCCTTAG 


AAAAATCTGC 


4140 


TATATTGGTA 


AAACCTTGAC TTTCTAACTG 


GTAGCGAATC 


TTCTGCACCG 


TTTGTTGGTA 


4200 


CATATCCTCT 


CGTAGACCAG CATAGTTCAT 


CCATTTTTTA 


TTAACAGGTG 


GAATGATAAA 


4260 


AATCGGGTTT 


ACCTTAGATT TAGAAAACTG 


TGTTAAAACC 


AACTGCAAGT 


CATTATACTC 


4320 


TGGCGACTTG 


AGATAGGTAA AGCTTTTCTG 


AGAATCCTTT 


AATTTCTTCA 


AATCCTTCTT 


4380 


GATCTGCTCA 


TTATAGAAAT AATTTTCCAT 


TCCCATCTCA 


TTATTGGAAG 


tattthttc 


4440 
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ACCATCTGCT TTGACAACAT CTTCTATTGC CTGATAAGAA AACTGGTCTG GCAAGATTTT 4500 

TAAATACTTA GCTACATGCT TATCGTAGTT AACATAGCCT CTAACCGAAA ACTGACCAAA 4560 

AAAGGAAGCT TGGCGTTCAT TAAAACGAGC CAATAATTCA ATCATTTCAT TGTCTGCTGT 4620 

CGACAATTCT TCTTTACTTG CCAACTTCTG AACCAGCTCC TTCATAGCTA CGTTTGGGAA 4 680 

CTGTTGCAGT AAGCGAGTCG CTGCATATTG ACTAGCCTGA TCCCCACATT GATGTTTCAG 4 740 

AAAACTAGTC AACTGGTCTC CATTAAAATA CTGCTGGAAG GCTGCTGCAT CATAGCCATT ~ 4 800 

TTTACTGAAC CACTGAGCTG AGATAACATA CACAACTTGT TTATTCTCCA GCTGTGGTAA 4860 

CATCTGTTGC ATTCCAAAAT ATTGGTTAAG CGATGCAGCT CCCCCCTGTC CTAAAAGATA 4 920 

AGGACGGTAG GAACGATTGT ATTTCTCAGC TAATACCGCA GGATGAGCAC CGTCAAAACG 4980 

AAGCCATTCA CTAGAGCCAA AGAAGGGAAC AAAACGCACA TTTGGATCAG ATAGTGCTCT 5040 

CACTTTTTCA CTTCGCTCCT TAAAACTATC GATAGTAGTA GCCACTGCTG AACGCTTTTC 5100 

AGCTCCTAGA TTATGATGCA TCTCAGTAGC ATAAAAGAAA ATGACCAGAA AAACCAACAA 5160 

ACCAGCGATC AAGACCGGTC CGAAGATCAT CCATAAGCGT TTAAGCATTT TGTAGCTCCA 5220 

CAATACCAGC TATGATTTTA TTAGCTGTAT TCCAGTCGTC ACGACCAAAC TCTGTTACAG 5280 

GGACACGAAT GTCAAAACGG TTCTCAATCT CCACAATCAA CTCAACCGTT CC CAT ACTA T 5340 

CCAAGACACC TGCATCAAAA AGATCTTCAT CCATCATGTC AGAAACATCT TCCATAAACA 5400 

ACTCATCAAT AATTTCAATA ACTTCTGATT TGATATCCAT ATTTTATTTC CTTTTATTTT 5460 

TTAAACCATA GATTATTCAA GAATCCAGAA AAGATTAAGA ATGACAACAT GACAACATGG 5520 

AAAGTGACAA CCATGCCAAG CAACTGAATC CAGCGATTCT CAGGTAGCGC AGCCTTCCCT 5580 

GCTTTTTTCC GTTCCTTATT GAGCGTTTTT TTCTTGCGAA CCCAGGCATC ATTGATGACC 5640 

AAGCCTAGTC CATGAAAGAG TCCATAGGCG ATATAGTACC AGGTCACACC ATGCCAAAAT 5700 

CCCATAATCA GCATATTTAC AATGTAGGCC ATGCTTGAGG TTACATTACC ATTTTTAAAG 5760 

ACTTTCTTTC TGGTTAACAC CATCACCATT CGCATAAAGA CAAAGTCACG GAACCAGAAG 5820 

GACAGACTCA TATGCCAGCG ATTCCAAAAC TCCTTTAAAT CCCTTGATAA AAAGGGCTTG 5880 

TTAAAGTTGA TAGGGCTACG GATTCCCATC AAGTTTGAGA TGGCCAAAGC AAACATAGAA 5940 

TAACCTGCAA AGTCAAAGAA GAGTTCCAGA CCAAAAGTAT ACATAACTGC CAAGCCATAG 6000 

AGATTAAAGA AGCCACCTGA CTGCAAGGCT AAATTCTTCA GAGGAGGTAG TAAGGTCTCT 6060 

CCTAAAACAT GAGCTAGGAT AAACTTATAC AAAAAGCCCC ACATGATATA GCGGACAGAT 6120 

TCATCCAGCA TATCCATCAA CTCATCTCGC TCAGGAATAG CCTGATAATT TTCATTAAAT 6180 
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CGCTTAAACC GATCCATTCG ACCACTCGAG AAAGTTGGCA TGAAGAGAAG GAAACGGAGG 62 40 

AATTCCCAGA GGGTAAAATC CTTAATCACT CCATCTCTCA GCTCGATGAC AATTCCAACC 6300 

GAACGAAAGG TCAGGTAAGA AATTCCCAAG AACCCAACCA AAGACTGCGT TCCATTGATA 6360 

GCTGGTTGCA CCTTGACAAA GATAATCGGA AGTAGGGACA GAAAACTAAC TAAGTAGAAG 6420 

ACCCACTTGC CATCCTTGCT TTTTCGATAA TGCTTGTAGA AAAGCAGGAG CAATATTTCC 64 80 

CAGCAAAGGT AAATACCCAA GGCAGCTAGT TGATTGGTCT TTCCACCCAC CAACATGCTG 6540 

ACAATAAAGA AGAGACTTAC CAACACTTCA TACCAGGCAA AGCGTTTCTT GAAAAAGAGA 6600 

CCTATAAAGA TGGGCAAGGT TGCAGCAATC ACATAAACAA AATACTGAGG ATTGCCATAT 6660 

GGCTCTAAAT GAGGAAGCTG TTGAAAAAAC TCCATCATCT CTTATTCACC TCGTTAATCA 6720 

ATCCTTTGAT GTCAATCTTT CCATTTGGAG TTAGTGGCAA ACTGTCTCGG TAAAGGAATT 6780 

TAGATGGCAT CATATAGGAC ATCATGATGT CTGTCAGGTC TTCCTTGATG GCCTTGGTAA 6840 

TATCGATATC TCGCTCAAAC TGCTCACGAA CACCGTCTTT TAAGATGACA TAAGCCAATA 6900 

GATTTTGTAC CTTGTGGTCC TTGTTATAGC GCGGTACTGC GACAGCAGAT TCGATAAAGC 6960 

GAGACTTGTT GAGGTTTTGA GAGACATCTT CTAACTCAAT GCGGTAACCG TTAAACTTAA 7020 

TCTGGAAGTC CATGCGTCCG CCGTAGAGAA GCAAGCCCTC ATCTGTCATG GTTCCCACAT 7080 

CGCCTGTGTG ATAGGCTGGC AGATCTTCAA ACTCAAAGAA GGCTTCTGCT GTTTTTTCAG 7140 

GATTGTTCAT ATAACCTTTT GAAACAGCTG GCCCAGAAAC AATGATTTCT CCCTGCTCAC 7200 

CATTTGGCAG TTTATTTCCT TCCTCGTCAA TGATAAAGGT TGGAGAATCA GCCTPGGTAT 7260 

AGCCGATTGG TAGGCGTTTG AGAGTCGCTA ACATCTCGTC TGTCACGGCA ACTGCTGACA 7320 

GAGCTACTGT CGCTTCTGTT GGGCCGTAAG CATTGATGAT ACGGGCATTT GGGAAACGCT 7380 

CGCGCAGTTT TTGAGCTGTT TTGACCGTCA ATTCTTCACC ATCAAAGTAG AAATGCGTGA 7440 

TTCCAGGCAT TTTCTCACTG TTGAAGTATT CAGACAACAT GGCCATATCT GCAAAGGATG 7500 

GTGTTGATGT CCAGATAGCG ATTGGCAATG AAAAGATAGC CGCAAAGAGT TGCTTAAAcT 7560 

CCTGACTGAT GACTGAAGGA AGAGTGAAAA GCGTACCACC AAGTGCCAAG GTCGGTGCCC 7620 

AATACATGAC AGACAAGTCA AAAGAATAAG GTGGCTGTGC CAGCATTTGC GGACGACTCG 7680 

GTGTCGCAAA TTCCTTATCC GTAATCATCC AGTTTGTAAA GCTGAGGAGA TTATCATGTG 7740 

AAATCTGCAC TCCCTTAGGC TTACCAGTCG TACCAGAAGT AAAGATAATG TAGTAATTAT 7800 

CATCTCCCTT GACTGGATGC GTGATTTCAT AGTTATTCCC TTGGGCAAAG GCTTCTTGAA 7860 

CCTGAGCTAG ATTTATCATT GGTGTAGAAA CCTGCTCCAA GGGAAAGGCT GAAATGGCAA 7920 

TAATCAAGCT TGGCTCTGCT ACTTCTAAAA TAGCTGAAAC TCGCTCCAAG GCCGAATGGC 7980 
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TATCAATTGG AATGTAGGCA TGACCTGACT TAGTCAGCGC TACAAAGGTT GCCAACATTT 8040 

CATATTCTTG GCCACCAAAA ACAACCACAG GAGACTTCTC AGGCAAGCCT AGTTGGTCAA 8100 

TGACTGCAGC CAAACTATCC CAATCACCCT TTAAATCGCC ATAAGTGTGT TCCTGCCCCA 8160 

AAACATTATA GACAGGATAG CTAGCCTGTG TCTCAGCAAA ATGCTCAATG GTTTCAATCA 8220 

TATCTGCTAT TGGTTTATTT GACACAATAG GGATTCTCCT TCAAGTTAAA ATTCATTATA 8280 

GATAAAGCTT CCTTGACCCT GACCAAGATA GCTAAAGAAG TAAAGCAGCC CTAGAAACAT 8 340 

AAGAAAATAC AAGGCTGTCC GACCAAGAAA GAGGTACAAT TCTTTTCTCT GTTTCATCAA 8400 

GAAAAACCAT TCATTTCTGT AA T TT T TCGC TAAAATAAGA GTGATTCTTA CTAGCTTATT 84 60 

TTTCTACCAT TGTACCACTT TATATAGTAT CTTTTCAATT GTTTACCGTA TGTTTCCAAT 8520 

AGATTTCAGC TTATTTTAAG GATTATACAG TTTTTCTATG TATATTTTCA AATAGAGTCA 8580 

TCCTGCTTCA AAACTCCATT TCAGCACACA ATGAAGTAAA TCTTCCCATA ATAAAACACA 8640 

CAATATCAAG TTTTTTCAAC ACCTGATACT ATGCGCTTTT CTGATTTTTA AAGACTTTTT 8700 

AACCACTCTC TCATTTAAAA TAATCTCGTC TGATATAAAT TAAAATAGCT TCTATCATCA 87 60 

GACAAATGGC TGATAGCCAA AAACTGATGC TAATACCAAA ACTCTCAGTA ATATAGCTCA 8820 

TTAGCAAAAC AAATACTGAA AATGCTAATG TAGAAATCAC TTCAACAACG GAATAGACAT 8880 

TAACTAAATG ATTTTCCTCT ACTGTTTCCT GAAGAAATAC ACTTTCAGGA ACTTCTTTTA 8940 

GTTGCGATAA CATACCAACT AAAGCTGAAA ATAATAAAAA CATCTGTGCG TTTGGAAAAT 9000 

ATAGAATAGT CAGTGTCACT ATTPCCATAG CTACAACAGC AAAAAGAATA CTTTCCCCCC 9060 

AAATCATTCA TACCTCTCTC AACTAGATGT AACTTACAAA ACCCCTGACC TCATGAGCCA 9120 

CTTTCTTCCT CCTCATGAGG TCAGTTTTAC TTTCTCCTGT TCCAGTATGG TTTTTCCTCG 9180 

CTAGATTTCC TCAAAAGGGC AGACTCCTCC CTTGGTGCGT CACACGATTT TTTCATCTCG 92 40 

ACTGTTCTTT AATGCATCAT TAACGACGCT TTTCTTCTAG GTGCTTCATA AGGAACAGCA 9300 

AGATTCAGGT TGACTTTTCT AATCCTAGAA TAAACTGCTG AAAACAATTC GGAATAGGCA 93 60 

TAGAGACTAG ACAATTTGAG GAGCTGCTTG CGTCCTGTTC GAACACATTT TCCCACCACG 9420 

TGAAGAAAAA GATGGCGGAA GCGTTTGATT GTTAAAGTTT GGAAGTCACC TCCAGCTAGA 9480 

TGTTTGAGAA AAAGATAGAG ATTGTAGGCG ATACAGCTCA TCATCATACG AACTTCGTTT 9540 

TTGATTAAGG TTGAACTATC CGTTTTATCG CCAAAAAATC CCTCCTTCAT CTCCTTGATG 9600 

AAATTCTCGG CTTGACCACG TCCACGATAA ACCTGAAACT GGTCTTGGcT gTTCCACTCG 9660 

TCATATTTGT AACGAGAGAA ATAACATCGT AGAACAAGTA TCCTTCTTTT C 9711 
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(2) INFORMATION FOR SEQ 10 NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3025 base pairs 

(B) TYPE: nucleic acid 
<C) STRANOEDNESS : double 
(D) TOPOLOGY: linear 



(xi» 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


168: 






CCCCTTTGTC 


AAAACTGTAA 


AATTAACGAC 


TCAACAATTC 


ATCTTTACAC 


CAATCTCAAT 


60 


GGAAAACAAA 


AACAAATTGA 


CCTCTGTCAA 


AACTGCTATA 


AGATTATCAA 


AACAGATCCT 


120 


AACAATAGCC 


TCTTCAAAGG 


TATGACGGAT 


CTGAACAATC 


GTGACTTCGA 


TCCCTTTGGT 


180 


GATTTCTTCA 


ATGATCTAAA 


CAATTTCAGA 


CCTTCTAGCA 


ATACTCCTCC 


TATTCCCCCA 


240 


ACCCAATCAG 


GTGGAGGTTA 


CGGTGGAAAC 


GGCGGTTATG 


GTTCCCAAAA 


TCGTGGATCT 


300 


GCTCAAACTC 


CGCCACCTAG 


CCAAGAAAAA 


GGCCTGCTGG 


AAGAATTTGG 


TATTAATGTA 


360 


ACTGAAATTG 


CCCGTCGTGG 


AGACATTGAC 


CCCGTTATTG 


GGCGCGACGA 


TGAGATTATC 


420 


CGTGTCATCG 


ACATTCTCAA 


TCGTAGAACC 


AAGAATAATC 


CTCTCCTTAT 


CGGTGAACCT 


480 


GGTGTCGGAA 


AAACGGCCGT 


TGTCGAAGGT 


CTAGCTCAGA 


AAATTGTCGA 


TGGCGATGTG 


540 


CCACATAAAC 


TCCAAGCTAA 


ACAAGTCATC 


CGTCTGGATG 


TGGTTAGCTT 


AGTTCAAGCA 


600 


ACGGGGATTC 


GAGGACAATT 


TGAAGAACGC 


ATGCAAAAAC 


TCATGGAAGA 


AATTCGCAAA 


660 


CGTGAAGACA 


TCATCCTCTT TATCGATGAA ATCCATGAAA TTGTTGGTGC 


TGGTTCTGCG 


720 


AGTGATGGTA 


ATATGGACGC 


AGGAAATATC 


CTCAAGCCAG 


CCCTTGCTCG 


TGGAGAACTG 


780 


CAACTAGTCC 


GTGCTACTAC 


CCTCAATGAA 


TACCGTATCA 


TTGAAAAGGA 


TGCTGCCCTC 


840 


GAGCGTCGTA 


TGCAGCCTGT 


TAAAGTCGAT 


GAACCAACGG 


TGGACGAAAC 


AATCACTATT 


900 


CTCAAAGGGA 


TTCAAAAGAA 


ATACGAAGAT 


TACCACCACG 


TTCAATATAC 


AGATGCTGCG 


960 


ATTGAAGCAG 


CTGCAACTCT 


TTCCAATCGC 


TACATCCAAG 


ATCGCTTCTT 


GCCTGACAAG 


1020 


GCCATTGACC 


TCCTAGATGA 


AGCTGGTTCT 


AAGATGAACT 


TGACCTTGAA 


TTTTGTGGAT 


1080 


CCTAAAGTAA 


TTGATCAGCG 


CTTGATTGAG 


GCTGAAAATC 


TCAAGTCTCA 


AGCTACACGA 


1140 


GAAGAAGATT TTGAGAAGGC GGCCTACTTC CGCGACCAGA TTGCCAAGTA 


TAAGGAAATG 


1200 


CAAAAGAAAA 


AGATCACAGA 


CCAGGATACT 


CCTAGCATCA 


GCGAGAAAAC 


TATTGAGCAC 


1260 


ATTATCGAGC 


AGAAAACCAA 


TATCCCTGTT 


GGTGATTTGA 


AAGAGAAAGA 


ACAATCTCAA 


1320 


CTCATCCATC 


TAGCCGAAGA 


TCTCAAGTCT 


CATGTTATTG 


GTCAAGATGA 


TGCAGTCGAT 


1380 


AACATTGCCA 


AGGCTATTCG 


CCGTAATCGT 


GTCGGACTTG 


GTACCCCTAA 


CCGCCCAATC 


1440 
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GGAAGCTTCC TCTTCGTTGG GCCAACTGGT GTCGGTAACA 


CAGAACTTTC 


CAAACAACTG 


1500 


GCTATCGAAC TTTTTGGTTC TGCTGATAGT ATCATTCCCT 


TTGATATGAG 


TGAATACATG 


1560 


GAAAAACATA GTGTGGCTAA GTTGGTCGGC GCTCCTCCAG 


CTTATGTTGC 


CTATGATGAG 


1620 


GCTGGTCAAT TAACTGAAAA AGTTCGCCAC AATCCATATT 


CTCTCATCCT 


TCTCGATGAA 


1680 


GTGGAAAAAG CTCACCCAGA TGTTATGCAC ATGTTTCTTC 


AAGTCTTGCA 


CGATGGTCGT 


1740 


_. TTGACAGACC GGCAAGGACG CACCGTTAGC TTCAAGGATG 


CCATCATTAT 


CATG AC CTCA 


1800 


AATGCAGGTA CAGGAAAGAC CGAAGCTAGC GTTGGATTTG 


GTGCTGCTAG 


AGAAGGACGT 


1860 


ACCAATTCTG TCCTCGGTGA ACTCGGTAAC TTCTTTAGCC 


CAGAGTTTAT 


GAACCGTTTT 


1920 


GATGGCATTA TCGAATTTAA GGCTCTCAGC AAGGATAACC 


TCCTTCAGAT 


TGTCGACCTC 


1980 


ATGCTAGCAG ATGTTAACAA GCGCCTCTCT AGCAACAACA 


TTCGTTTGGA 


TGTAACTGAT 


2040 


AAGGTCAAGG AAAACTTGGT TGACCTAGGT TATGATCCAA 


AAATGGGAGC 


ACGCCCAcTT 


2100 


CGTCGGACTA TTCAAGACTA TATTGAGGAC ACAATCACTG 


ACTACTACCT 


TGAAAATCCA 


2160 


AGCGAAAAAG ATCTCAAAGC AGTTATGACT AGCAAGGGAA 


ACATTCAGAT 


TAAATCTCCC 


2220 


AAAAAACCTG AAGTTAAAAG TTCTGAAAAA GAAAAATAAA 


TCCTATAAAA 


AAGGAGTAGA 


2280 


AAATGAAATT TTTCTGCTTC TTTTTTTACT AAAATAACTG 


TAATTTCTTG 


ACAGCTTGCC . 


2340 


CTTTGTCCAT TATGATATAT AGTAGACTGA ATCTGAAATA 


GTACGAAACA 


ATTGCTAAAA 


2400 


CATTTATAGA AATTAATTTT ACTTTCCCAA TCGATTTCTT 


CTCATCTTAT 


TTCAATCTCC 


2460 


TATAGTCAAT TGAAACAAGA ACAAGACAAA AGAGCCTCAT 


AAAAGGTATT 


GCAACTTGGT 


2520 


AATACCTTTT TGAGGTGCTT TTTGATATGA GCCCATGTTT 


TCTCAATAGG 


ATTGTACTCA 


2580 


GGTGAGTAGG GAGGAAGAGG TAAAAGTTTA TACCCAAACT 


CTTCACACAA 


GAGTTCTAAC 


2640 


TTACCCATTC TATGGAATCT TGCATTATCC ATAATAATAA 


CCGATGGTGT 


GGTTAATGTT 


2700 


GCTAAGAGAA ACTTCTGAAA CCAAGCTTCA AAAAAGTCGC 


TCGTCATCGT 


CTCTTCGTAA 


2760 


GTCATTGGAG CGATTAACTC ACCATTCATT TGTTAGACCT 


GCAACCAAAG 


AAATTCTCTG 


2820 


ATATCTTCTT CCAGATACTT TGCCTCTTCT TAACTGACCT TTTAATGAGC 


GACCATATTC 


2880 


TCGATAAAAA TAAGTATCGA ATCCTGTTTC GTCAATCTAA 


ACAGGTGCTA 


GGTGCTTTAA 


2940 


ACTATTAAAA TTCTTAAGAA ATAAGGCTAC TTTTTCTGCG 


TCTTGTTCAT 


AGTAGGTGTA 


3000 


GTTCTTTTTT TTTTCGAGTG TAGCC 






3025 


(2) INFORMATION FOR SEQ ID NO: 169: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4104 base pairs 
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(B> TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 


169: 






TTTAAGGTTT TAAAAAAAGT TTTCGAAAGG TTTCTTCTTT 


ATTTTTTAAG 


GGAGAGATAA 


60 


CGTTGATATC TAAATCGTGG TCAAAGCCGG CAATTTTTCC 


TTTAGATGTG 


TATTGGTCAA 


120 


TATCATAATC TAAATCAGTT TTAGGACTGC TCTCCAAAAA 


TCCTGAGTCT 


GAGCCGTAGA 


180 


CGGAATCCAA ACAGAGGTAA ACTTGCCTGT ATCAATACTG 


TGTTCTTCCA 


TGAAGTAGAC 


240 


ACCAACGTAG ATGCCGATGT TTTTAGCACC CAGTGATGCT 


AGTTTTGCTC 


GAAAGTTTTC 


300 


GACACCTTCG TTCATATTAG ACATGGTTTT GTCTTCCACG 


TCAAGCCAAT 


AGTAACTAGG 


360 


GCTGTAAGGA GAGGCACCAT TGTAGAAAAC TTCGGCAGCC 


TTTTCCATTT 


CTTGGACACT 


420 


TTTTCCAGCT ACATAAGCGT AGACAGCAAC TGGGACATTC 


CGCTTTTGAA 


GTrCAGTGAT 


480 


ATGACTCTTA TAGGCCTTGT CTATTCCATT GATAAATGAA 


GCATCATTTT 


CTTTTGTCGT 


540 


TTGAGCACCA CTGTGAACAC GAACAATAGC ACCTGAAATA 


TTTTGTGAGA 


GGGCATCGTA 


600 


GTTGATTTCC TCAGGACGCT GCCAGCCAGA GAGCTCAATA 


ATCGGTTTGT 


CTAAGTGTTT 


660 


CAAAGCCTGT CCTTCAATCT GTGCTATATT GGATTTTGTT 


TTAAACGATT 


CGCTGTCATT 


720 


AAGTGGGCGA TTGATGATTA AAATGAACAT CATAATCCCA 


AAAAAACTAA 


ATAAAATAAG 


780 


TGGATGAATT TGTTTTCTCA TATCTTATAA TTCTACCCTA 


AAAATCAAAA 


AAAATCAAAA 


840 


AAATGGGTTA AGGAAGAGAC TTTAGAGCAT TTTTTCATTC 


AAGAGTGCCG 


AATGATTTGA 


900 


AATATGGTAT AATAAAAGGG AATTTCTACA GAAAAGAGAA GATTATGTCA AATTTTGCCA 


960 


TTATTTTAGC AGCGGGTAAA GGGACTCGCA TGAAATCTGA 


TTTGCCAAAA 


GTTTTGCACA 


1020 


AGGTTGCGGG TATTTCTATG TTGGAACATG TTTTCCGTAG 


TGTGGGAGCT 


ATCCAACCTG 


1080 


AAAAGACAGT AACAGTTCTA GGACACAACG CAGAATTGGT 


TGAGGAGGTC 


TTGGCTGGAC 


1140 


AGACAGAATT TGTGACTCAA TCTGAACAGT TGGGAACTGG 


TCATGCAGTT 


ATGATGACAG 


1200 


AGCCTATCTT AGAAGGTTTG TCAGGACACA CCTTGGTCAT 


TGCAGGAGAT 


ACTCCTTTAA 


1260 


TCACTGGTGA AAGCTTGAAA AACTTGATTG ATTTCCATAT 


CAATCATAAA 


AATGTGGCCA 


1320 


CTATCTTGAC TGCTGAAACG GATAATCCTT TTGGTTATGG 


ACGAATTGTT 


CGTAATGACA 


1380 


ATGCTGAGGT TCTTCGTATT GTTGAGCAGA AGGATGCTAC 


AGATTTTGAA 


AAGCAAATCA 


1440 


AGGAAATCAA CACTGGAACA TACGTCTTTG ACAACGAGCG 


TTTOTTTGAG 


GCTTTGAAAA 


1500 


ATATCAATAC CAATAACGCT CAAGGCGAAT ACTATATTAC 


AGACGTCATT 


GGTATTTTCC 


1560 



4 
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GTGAAACTGG TGAAAAAGTT GGCGCTTATA CTTTGAAAGA TTTTGATGAA AGTCTTGGGG 1620 

TAAATGACCG TGTGGCGCTT GCCACAGCTC AGTCAGTTAT GCGTCGTCGC ATCAATCATA 1680 

AACACATGCT CAACGGTGTT AGCTTTGTCA ATCCAGAAGC AACTTATATC GATATTGATG 1740 

TTGAGATTGC TTCGGAAGTT CAAATCGAAG CCAATGTTAC CTTGAAAGGG CAAACGAAAA 1800 

TTGGTGCTGA GACTGTTTTG ACAAACGGTA CTTATGTAGT GGACAGCACT ATCGGAGCAG 1860 

GAGCGGTCAT TACCAATTCT ATGATTGAGG AAAGTAGTGT TCCAGACCGT GTCATAGTCG 1920 

GTCCTTATGC TCACATTCGT CCAAATTCAA GTCTGGCTGC CCAAGTTCAT ATTGGTAACT 1980 

TTGTTGAGGT GAAAGCATCT TCAATCGGTG AGAATACCAA GGCTGGTCAT TTGACTTATA 204 0 

TCGGAAACTG TGAAGTGGGA AGCAACGTTA ATTTCGGTCC TGGAACTATT ACAGTCAACT 2100 

ATGACGGCAA AAACAAATAC AAGACAGTCA TTGGAAACAA TGTCTTTGTT GGTTCAAATT 2160 

\ CAACCATTAT TGCACCAGTA GAACTTGGTG ACAATTCCCT CGTTGGTGCT GGTTCAACTA 2220 

TTACTAAAGA CGTGCCAGCA GATGCTATTG CTATTGGTCG CGGTCGTCAG ATCAATAAAG 2280 

ACGAATATGC AACACGTCTT CCTCATCATC CTAAGAACCA GTAGGAGCCT ATCATGGAGT 2340 

TTGAAGAAAA AACGCTTAGC CGAAAAGAAA TCTATCAAGG ACCAATATTT AAACTGGTCC 2400 

AAGATCAGGT TGAATTACCA GAAGGCAAGG GAACTGCCCA ACGGGATTTG ATTTTCCACA 24 60 

ATGGGGCTGT CTGTGTTTTA GCAGTAACGG ATGAACAAAA ACTTATCTTG GTCAAGCAGT 2520 

ACCGCAAAGC TATCGAGGCT GTCTCTTACG AAATTCCAGC CGGAAAATTG GAAGTAGGAG 2580 

AAAACACAGC CCCTGTGGCA GCTGCCCTTC GTGAATTAGA GGAAGAAACA GCCTATACAG 2640 

GGAAATTAGA ACTCTTGTAC GATTTTTATT CAGCTATTGG CTTTTGTAAT GAGAAGTTAA 2700 

AACTATATTT AGCAAGCGAT TTGACAAAAG TGGAAAATCC GCGTCCGCAG GATGAGGATG 2760 

AAACCTTGGA AGTCCTTGAA GTGAGCTTAG AAGAAGCGAA AGAATTAATC CAATCAGGTC 2 820 

ATATCTGTGA TGCCAAGACA ATTATGGCTG TTCAGTATTG GGAGTTGCAG AAAAAATAGA 2880 

GGAGGTCAGT ATGGGTAAAT CTTTATTAAC GGATGAAATG ATTGAAAGAG CTAATAGAGG 2940 

CGAAAAAATT TCAGGTCCTC CTTTGCTAGA TGATAATGAG GAAACTAAGA TTTTACCAAC 3000 

CTCTTCTTCC CCTTTTGGTT ATGCCAATCC TAAGGATCAT GGTTTTAGCC AGGAAACCTT 3060 

GAAGATTCAG GTCGAACCAT CTATTCATAA AAGCCGTCGT ATTGAAAATA CCAAGAGAAA 3120 

TGTCTTCAAT TCTAAGTTGA ATAAAATCTT ATTTGCGGTC ATCTTTCTCT TGATTTTGCT 3180 

TGTTTTAGCA ATGAAACTTT TGTAATAGAA AAGGAATTGA AATGAAAATA GGAATTATTG 3240 

CTGCTATGCC AGAAGAACTG GCTTATCTGG TCCAGCATTT AGATAATGCC CAGGAGCAAG 3300 
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TTGTTTTTGG GAATACCTAT CATACAGGAA CCATTCCTTC TCATGAAGTC GTTCTTGTAG 3 360 

AAAGTGCAAT TGGTAAGGTC ATGTCTGCTA TGAGTGTGGC GATTTTCCCT GATCATTTCC 3420 

AGGTGGATGC CCTTATTAAT ACGCCTTCAG CTGGGGCAGT AGCAGAAGGT ATCGCTGTTG 3480 

GGGATGTCGT GATTGCTGAC AAATTAGCCT ATCATGACGT GGATGTCACA GCTTTTGGCT 3 540 

ATGCTTATGG ACAAATGGCG CAACAACCGC TTTATTTCCA ATCAGACAAA ACCTTTGTTC 3600 

CTCAAATCCA AAAGAGTTTA TCTCAATTGG ACCAAAACTG GCATCTTGCT TTGATTGCTA 3660 

CAGGAGATAG TTTTGTTGCA GGAAATGACA AGATAGAAGC GATTAACTCC CATTTCCCAG 3720 

AAGTTTTAGC CGTGGAGATG GAGCGGGCAG CTATTGCTCA AGCAGCGCAT GCCCTCAATC 3780 

TCCCAGTCTT AGTCATCCGA GCTATGAGTG ACAATGCCAA CCATGAAGCA AACATCTTTT 3840 

TTGATGAGTT TATTATCGAA GCTGGACGTC GCTCTGCCCA AGTCTTCTTG ACCTTTTTGA 3900 

AGGCTTTAGA TTAAGCGGAA ATTTGACAGT TTTTCTAGCT TATGATAAGA TTTAAGTAAA 3 9 60 

GAAAAGCTAG AAAACGTTTC AGAGGATATT ATCAGTATTG AAATGACCGT CAGTGAGATT 4020 

GCAGAGGTCT TAGGATTATC TCGCCAAGCA ATCAATAACC GTGTCAAAGA ATTACCAGAA 4080 

GAAGACACAG ATAAAAATGA CAAG 4104 
<2> INFORMATION FOR SEQ ID NO: 170; 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(O) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

CACGGATAGG CTCGGCTTTC ATCAGTCCTC AGGCTGATTT ACTAATAGCA ACTTTCCTCG 60 

ACAAAGTCCA CAGCGATACG TnTGGGTATC AATCCTACGC TTACGCTGAT ACCTTTGCTG 120 

GCAGGATTGG CAACGATAGA GCTTGATTGG CTTGGAGTTA CTATTGGGCA AGGATGCTAC 180 

AAACCGTAAT CCATCCACTG CTTTCAACAG TTCCTTAAAA TCCCGATCCT TGTGTTGATA 240 

GCCTTTCCCT TGAAAATAGA GGTGATAATG ACAGAGTTCA TGTCGGACAA TTTTCCTAAA 300 

AACGTCCAAC CCCAGTTCCT GATAAACCTT GGGATTAAAA TCCAAATGCC CATCTTTGGG 3 60 

GAAAAATCGC CCACCTGTCG AACGTAGACG CCTATTCCAC TGGACATGAT GGATAAAAGG 420 

TCTGCCGAAG TCTTCTAGTG AAACCTGCTT GACGTAATCA GTCAGTTTCA TTTGGAGCTA 4 80 

GGAGAGACAG ATTAACTTTT TCACGTTCAG TATCAATTTT CTTAACCCAA ACGCTCACCA 540 

AATCTCCAAC TGCCACCACT TGACTAGGGT GTTTGATAAA CTTGCGACTC ATATGGGAAA 600 
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TATGGATGAG 


ACCGTCCTCA 


TGAATTCCCA 


TATCAACAAA 


AGCACCGAAA 


TCAACAACGT 


660 


TACGCACCAC 


TCCTTCTAGC 


TTTTGTCCAA 


CCACTAAGTC 


CTTGATATCT 


AGGACATCTT 


720 


GGCGAaCACA 


CCTGCGTCAA 


AGGAATCACG 


GAAATCTCGA 


CCTGGTTTGA 


GAACATCTGC 


780 


AATGATATCT 


TTAAGAGTTT 


CTGGACCAAG 


GTCTAACTCT 


TGCGCCATTT 


CCTTGACTGA 


840 


AAGCGACTTG 


AGTTTGCTTT 


GGGCTTCTTC 


GTTTAGCTCT 


TTAATATCTA 


AACGTTTGAA 


900 


GAGTTCCTTA 


ACTGCACTGT 


AATTCTCTGG GTGAACTCCT "GTATT ATCAA 


GGATATTGCT 


960 


ACTTTCAGGG 


ATACGAAGGA 


AACCAGCAGC 


CTCCTCAAAG 


GCCTTGGCTC 


CCAGACGAGG 


1020 


AACTTTCTTG 


ATTTGGGCGC 


GTGAAGTGAT 


TTTTCCTTCT 


TCCTCGCGGT 


ATTTGACAAT 


1080 


ATTTTCAGAG 


ATAGTTTTGT 


TGAGTCCAGC 


TACGTGTGAA 


AGAAGAGCTG 


GGCTAGCTGT 


1140 


ATTGACATTG 


ACACCAACTT 


GGTTAACCAC 


TGTATCGACA 


ACAAAGTCCA 


GACTCTCAGA 


1200 


TAGTTTCTTC 


TGACTGACAT 


CGTGTTGGTA 


TTGACCGACA 


CCAATTGACT 


TAGGATCGAT 


1260 


TTTGACCAAT 


TCCGCAAGAG 


GATCTTGCAA 


ACGACGGGCG 


ATAGAAATGG 


CAGAGCGTTT 


1320 


TTCAACGGTC 


AAGTCTGGAA 


ACTCCTGACG 


AGCAAGTTCG 


CTGGCAGAAT 


AGACAGAAGC 


1380 


ACCACTTTCA 


TTAACGATAA 


CATAGCTGAC 


TTCAGCGAAA 


TCTTTCAGAA 


CTTCCGCTAC 


1440 


AAAAGCTTCA 


CTTTCACGAC 


TGGCCGTTCC 


ATTTCCAATG 


GCAATAATCT 


CTACACCGTA 


1500 


TTGACCAATT 


AAATCTGCTA 


AATCTTTCTT 


GGCTTCTTCG 


ATTTGACGAG 


CTGATGCTGG 


1560 


TTTAACAGGA 


TAAATAACCT 


GAGTTGTCAG 


CATTTTTCCT 


GTTGCATCCA 


CGACAGCTAG 


1620 


CTTGGCACCT 


GTACGAAAGG 


CTGGGTCAAA 


TCCAAGAACC 


ACGCGCCCTT 


TCAGTGGAGC 


1680 


AACCAAGAGG 


AGATTGCGCA 


GATTGTCAGA 


AAAAAGTTGG 


ATAGCTCCTT 


CTTCAGCTTT 


1740 


CTCAGTTAAT 


TCTGTCCGAA 


TACGACCCTC 


GATAGCAGGC 


AAGACCTTTT 


TCTTAACGGA 


1800 


TTGCTGAACA 


ACTTCATCAA 


TATAAGCATT 


TTTCACCTTG 


AAACGAGTAG 


CAAAGAAGGC 


1860 


AAGAATACGG 


TCCGTCGCAT 


CTTCAAAACC 


GATCTTCAAG 


ACACCAAGTT 






ATTGAGAGCC 


AAGGTACGAT 


AGCCTTGCAT 


AGTTCCAACT 


GTCTCTGAAA 


AATCATAATA 


1980 


AATCTGAAAA 


ACCTGCTTTT 


CATCAAGACT 


TTCATCCTTG 


GCTTGAGAAG 


TAAGTTTACA 


2040 


GTGTCTCAGC 


ACTTCCTGAT 


AAGTCATAGA 


ACGCAAGGTC 


ACATCTTCCG 


ATAAGGCTTC 


2100 


GACCAAAATA 


TCAACTGCAC 


CGGTCAAGGC 


TTCCTTGCCA 


GTCGCAAATC 


CTTCACAGAC 


2160 


GAACTTTTCA 


GCTTCTTTCT 


CTAAGTCAAC 


TATATTCTGC 


AAAATCAAGC 


GAGCAAGAGG 


2220 


AAAGAGTCCA 


GCTTCACGGG 


CAATGGTTGC 


CTTGGTACGA 


CGCTTTTCCT 


TATAAGGAAG 


2280 


ATAGAGTTCr 


TCAACGTCTG 


CTAATTTTTC 


GGCAACTAAG 


ATAGCTTCTT 


CCAATTCCTT 


2340 
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GGTCAACTTA 


CCTTGTTCTT 


GAATCTTACC 


TAAGACAGCT 


TCCTTACCGT 


CATTGAGATT 


2400 


TGTCAGACTT 


TTATCCAAAT 


CAATAATAGC 


CTTAATCGCC 


ACCTCATCCA 


GACTACCAGT 


2460 


CATGTCCTTG 


CGATAACGCG 


CGATAAAGGG 


AATAGTCGCC 


CCTTCAGCTG 


TCAAACTTAG 


2520 


AACGCTATCA 


ATTTGCTTTA 


ACGTCACTCC 


CAAATCCTGA 


GAGATTTTTT 


CATATTTTTT 


2580 


ATCCATAAAT 


CTATTATACC 


ACAAGCTAAA 


CGTTTCAAAT TAACTCGTAG AACATTTAAA 


2640 


AAATATGTAG 


GAAATAGATT 


TATATGCTAC 


AGCGCAATAA 


CTTGCACTTA 


AAGAGCATTG 


2700 


CCACCTTTTT 


TTAACCAAGC 


CATGATATCA 


AAAGTATTTA 


ATGGATCAGA 


CATAATAGCC 


2760 


AGTTCTGCAA 


GATGTTCCTG 


ACCTGGAATA 


ACACATTGAC 


TTTTCAAATT 


TTTATATGGA 


2820 


CGATTGACTA 


AAATTAATTT 


ATTAGAATAA 


GGAAGATTAT 


CCATCTTATT 


TAAAATTTCT 


2380 


TCACTAGCTG 


AATCTTTATT 


ATCAAATTTA 


AAATAAAGAT 


TATTCCAATT 


TATGCGTTTT 


2940 


TTTCTTTTTT 


CCCACTTAGT 


TCGTGCTTCr 


TCAATACTAC 


AATAATGTAG 


AAAATGAATA 


3000 


TCTATATCTC 


CTAAGTGCCC 


CAAAGGATAA 


ACTTCATGAG 


TCCAGCTCGG 


TGAAATAAGT 


3060 


TCCTCTTCGA 


AAACAAGTTC 


TTGTTCCATA 


TAATAACGAA 


AATGCTTTGT 


AAGTTTATAA 


3120 


TAATCATCAG 


GAAGAATAAA 


TAAACCAACA 


AAAGGTGTTC 


TATATTGAAA 


ACCAAGCTGT 


3180 


TTATAAATTA ATCCTCCAAC ACAATTATTA 


CTTATAATCG 


TAAAATCTAA 


TCTATCAAGC 


3240 


TCAAGAAAAG 


GGAAAATTCC 


TTTCTCTGCA 


GCTATTAACT 


TATGATAAAC 


AATATCAGAA 


3300 


TCTAAATATT 


CACCGTCATT 


TTTTAACCAA 


GCACTAAAAT 


TTGCCAATTC 


TTGAATATAT 


3360 


TGTTTTTTCG 


CTCTTTCTAT 


ATCATAGTTT 


TCTAAGACGG 


CGCAATCTTT 


GATTCTATTT 


3420 


TCATAATTTT 


CTAATATGAT 


TTTGTAGGAG 


TCTTTTAGAG 


GTTTAGCATC 


TATAACAGGT 


3480 


TTATAGATAT 


ATGTCGGGAA 


ATTAATATAG 


GTTGCAGTTT 


TAGAGTGAAT 


ATAAAGTCTC 


3540 


CAAATAAGGT 


TGTTTATATC 


AAATTGATTT 


ATTTTTCGTA 


AAAGCTTACT 


ATTGAATAAT 


3600 


TTTCCAAATA 


ATGAGCGATA 


TTGTTTTCTA 


ATTCGATGAT 


CTGTATCATC 


CATCTTTTGT 


3660 


AAAACTTGAA 


CATTCGTTAA 


ATTTTCTGTC 


AACCAATTAT 


CCCCCCAAAA 


AGGATAAAaG 


3720 


TAAAATACTC 


CATCAACCAA 


ATCAGCAAAA 


TGACCAAGAA 


CAACATCAGA 


ATCGGATAAT 


3780 


TTTATCGCAT 


GATACATCTT 


TTCAAATGTC 


CAATCAAATA 


ATGAATCATT 


TGAAGATAGA 


3840 


AACGTAATAT 


AATCTCCTGT 


AATCATATCA 


GACAACTCAG 


CAAAAGAATT 


CTCATCTATA 


3900 


ATCTTAATAT 


TAAATGATAG 


ATTCATCTGT 


TGGCTAATGG 


AAGCTATCTC 


CTCTGTAGAT 


3960 


TGATTTACAA 


TAATAACTTC 


TATATCTTTT 


AATGTTTGTC 


TCTCCACTAT 


TGACAAAGAC 


4020 


TCTAATAAAC 


TATTTTTATC 


TCCTTGATGT 


AACAAAACAA 


CACTAATTGA 


GTAAGTCAGT 


4080 


TTGACTACCT 


CCCATAATTT 


TCTGATAATG 


ATTTTCTTTT 


TATTTAATTA 


TAGCACAATT 


4140 
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ATGATATATA TCAGGTAATA TCAAGCTATA TTATCTCTTA GCTACTCAAT TTGAAATTTT 4 200 

AACTTTTCCC TTTTCCGCAA AATAATAGTA TAATAGAGGT ACAATCTACA ATCGAGGTAC 4260 

ACCTATGGCT GTCAAATTTA CAAAACGAGA CGACTTGGAC AAGATGTTTG AAGAGTTTGC 4 320 

TAAACTCCCT GATTTGAAAC AACTTACTTT CCCTGATGAC AAAGAGAAAA AAGTCAAAGC 4 380 

AGAAAAGAAA AACTAGATGA CTGCTTTTCA ACAACTCCCA TCTAGTGTAC TTCAAACTGG 4440 

AG C CATTTTT CTCTCCAT TA" TCATf^GC'cC^CCrTC TotctCATXG GAAGCATTGT 4 500 

CTCAGGGCTG ATTGAAGTTT ATATCACACC TGACAAGGTT TATCATTTTC TCCCTCGAAA 4 560 

TCGTTGGGGG AGAATCTTTT TTGGGACCTT TGTCGGTATA CTTTTCCCTT CTTGTGAATG 4620 

TGGAATCGTC CCCATCATCA ATCGTTTTCT GGAAAAAAAG GTTCCAACTT ACACGGCCGT 4680 

TCCTTTTCTT GTGACAGCAC CTGTTATCAA TCCCATTGTT CTTTTTGCGA CCTATTCTGC 4740 

CTTTGGCAAC TCCTTCCATG TCGCCCTATT ACGAGCTCTG GGTTCCATTC TTGTGGCTCT 4800 

AATACTAGGA ATTTTTCTAG GATTTTTCTG GCAAGAACCG ATTCAGAAAG AAAATCGTCT 4860 

GGCTTGTCAT GAGCATGATT TTTCTTACTT GAGTTCTGCA AAAAAAGTTT TTCAAGTCTT 4920 

TGTGCAGGCC ATTGATGAAT TTTTTGATAC GGGGCGTTAT TTGGTATTTG GCTGCCTCTT 4980 

TGCTTCTATA ATACAGGTCT ACGTTCCGAC TCGGATTCTG ACCTCTATCA GTGCGACCCC 5040 

TCTTTTTGCC ATCCTGCTCT TGATGATTTT AGCCTTTCTT CTTTCGCTCT GTAGTGAGGC 5100 

GGATGCCTTT ATAGGTCCTT CTCTTCTCTC GAGTTTCGCT TTGGCACCAG TTCTGGCCTT 5160 

TCTCGTCATT GGTCCAATGC TGGATATCAA AAATATTCTC ATGATGAAAA ATTACTTGAA 5220 

AGCACGATTT ATCAGTCACT TCATAACAAT TGTAACTCTT GTCGTCTTAG TCTATTCTCT 5280 

CTTGATTGGA GTTATCCTAT GATTCGATTT TTAGTTTTAG CTGGCTATTT TGAACTGACT 5340 

ATTTACCTCC ATCTGTCGGG CAAACTAAAC CAGTACATCA ACATCCACTA TTCCTATCTG 5400 

GCCTATATCT CCATGGTGCT TTCTTTTATC TTGGCTATCG TTCAATTGTA TATCTGGATG 5460 

AAGCAAGTCA AAACCCACAG TCATCTGAAC AGCCGATTAG CCAACATAAC GAGTATTTCT 5520 

CTTCTGGCTA TTCCACTTGT CATCGGCTTA ACTTTCCCAA CTGTTAGCTT GGATTCTCAG 5580 

ACTGTTTCTG CTAAAGGTTA TCATTTCCCC CTATCGGAAG GAACGCATCT AGCCATTCAG 5640 

ACAAGCGAAG GGACGACAAG CCAATATTTG AAACCAGATA CCAGTTCTTA TTTTTCAAAA 5700 

TCAGCCTATG AAAAGGAAAT GCGAACGGCG GCGGATAAAT ACTTATCCCA AGATAGTATT 5760 

CAGATCACTA ATGAAAACTA TATGGAAGTC ATGGAGGCTA TCTACGACTA TCCAGATGAG 5820 

TTTGAGGGCA AGACAATCCA GTTTACAGGC TTTGTCTATA ACGACCCCAG TCATGCCAAT 5880 
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AGTCAATTTC 


TGTTCCGATT 


CGGCATTATC CACTGTATCG 


CAGATTCTGG 


TGTCTATGCA 


5940 


TTGCTGACCA 


AGGGCAATAC 


CCGGCAGTAT GAAAACAACA 


CTTGGATAAC 


AGCCAAAGGA 


6000 


AAACTCGTCA 


ATCACTACCA 


TAAAGAACTC 


AAACAAAACC 


TTCCAACCTT 


GGAAATCGAC 


6060 


AGCTTTACCA AAGTCGATAA ACCAGAAAAT CCCTATCTAT ATAGAGCTTT 


TTAAGAAAAT 


6120 


CAAGATAAAA 


ACGAACAAGT 


TCTCTTCTGA 


ATAACAGAAA 


AAGRGCCTCT 


TCGTTTTTTG 


6180 


TTATATGAAA 


ATTAGTGACT 


TGTAGATTTT 


CATCTTATAC 


CATTCCCAGC 


AATACAAGTA 


6240 


GCTCATAGAA 


AATAAGCGAG 


CCACTCATTC 


ATTAGACTAG 


CGATTTCTTT 


AGGTGCTTGA 


6300 


GTATAAAGCT 


CATGGCCAAA 


GTTTTCTAAA 


AAAATAGTAT 


CAAAATAGTC 


TGGCAATTCT 


6360 


TTTAGGGCTT 


CCTCTCTCCA 


TGTAGCTTCA 


TTAGCATAGC 


GAGGACTAAT 


AAACAAGCTA 


6420 


TCTCCCACTT 


CTCTCTTAAA 


AGCTTGTATT 


TTTCTCCGTA 


GcGGAGTATC 


GCTTCTATAT 


6480 


TTTCATAATT 


TATAGCCAAC 


TCATATCTAT 


TATACTCAAC 


ATTCCAGTGA 


TAAGACTGTC 


6540 


TTACAGCTTT 


CTCCATATTT 


TCTGACCAAT 


GCTTTGCTTC 


AGATTTTTCT 


TTAGAAGTAA 


6600 


GAACATCTAA 


GTCCGAAACA 


ATTTGAGATT 


TGATATAATT 


TTTAGTTTCC 


TCTAACTCTG 


6660 


TATCCAAAGG 


TAAAATCTTA 


TCTAAATCTA 


GATAGCCACC 


ATCCAAAAGA 


ATCAGTTTCT 


6720 


TTACTTCTTC 


AAATTCCGAT 


GCGAAATAAC 


GAGCTAAATC 


TCCTCCAAGA 


GAATGGCCTA 


6780 


TCAGACAGAT 


AGATTCTTCC 


TCTACAATTT 


CATTTTTAAA 


CCATGATTTC 


AATTCTGTTT 


6840 


CATCTCGAAG 


ATGCTTTTCA 


TATGGATTTA GAAAATAGAC 


CTGCGAATCT 


AGTTCTTGAA 


6900 


GAAAATCCTT 


GCTATGATAG 


GCATTGCTTC 


CCAAACCGCC 


AATAAAATAT 


TTTTTCATTC 


6960 


TCTACTTAAT 


ACTATGCTTA 


TTCATCTTTT 


GTTCAAAGAT 


AGTTGTGATA 


ATCTGACGCA 


7020 


ATTCTTCGCG 


TTTTTTTTCT 


GGAATCTCAC 


CACTTGTTTG 


AGCTACAGCG 


TAGAGTTCAG 


7080 


GGTATTCAAT 


TGAAATGCCT 


TTAATCGTAC 


GTGTTGTAGC 


ATGTTTTCTG 


ACAAAAAACG 


7140 


GGATTCGCTT 


AATCAAGTCT 


TGTGGGACTA 


GCGCCAGAAT 


CTTCTCAGTA 


GTTTCTTTGT 


7200 


CACTAATATT 


AGACATTGTA 


AGCCTTTTCT 


TAATCATTTC 


CTGTTCTTTT 


TCTGTAAAAT 


7260 


CTTTTAATTC 


CATTCGATTA 


GTCCTCCTAT 


TTTCTCTAAG 


TTAAATTATG 


TACTAATACA 


7320 


GATGAAACTA 


CAAAGAATAA 


ACTTTAAGAA ATCTTCTCAC 


TGATAAGATT 


TTAGCATTAG 


7380 


ACTTCCTGCG 


AAACAAAATA TGGTATAGTA 


GTTCTATGAA 


TTATGAAGCA 


AGTAAACAAC 


7440 


TAACTGATGC 


ACGATTTAAA 


CGTCTPGTPG 


GTGTTCAGCG 


CACGACTTTT 


GAAGAGATAT 


7500 


TAGCTGTATT 


AAAAACAGCT 


TATCAACTTA 


AACACGCAAA 


AGCTGGACGA 


AAACCTAAAT 


7560 


TAAGCCTAGA 


AGACCTTCTT 


ATGGCCACTC 


TTCAATATGT 


GCGAGAATAC 


CGCACTTATG 


7620 


AAGAAATTGC 


GGCTGATTTT 


GGTATTCACG 


AAAGCAACTT 


AATCCGTCGG 


AGCCAATGGG 


7680 
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TTTAAGTAAC TCTTGTTCAA AGTGGTGTTA CCATTTCAAG AACTCCTCTC ACTTCTGAGC 7740 

ACACGGTAAT GATTGATAGC CATTCCCATC AATATCGTAT CTTTGGACAT AGCCAATAAA 7800 

TGTTTCATTT TTGCGTGGTT TCTGGCTATT AACGATTGAA ATAACCCACC AACTTATCAA 7860 

AAATAGAAAT AAAAATCCTA AGATTACTGT CATATCATAA CACTATTAAA GTTTAACCCA 7920 

CTTATCATTA TCCATGATAA AAGGCTTAGC CAGTCCCTCG CCTGTATAAT CCGCATACTT 7980 



GGTGCCCAAA TACTTGTAGC AATCTTCCTT ACTAGCAAAT TTAATCGCTT GGTAGGGCTC 8040 

TTCGAAAGTC AATTTCTCTA CAAATAAGAA ACCGTCATCA GCAGGTACTA AGACCCCAAC 8100 

GTGGCCTACA AACAGATACT CCCCATCCAA ATTCTCCTGC AAGACTACAG ACAGCATTCG 8160 

AGCTTTTTCA TTGAATTGAA ATTGTGAGAA GAATGCTTCC ATCTTTTCAG CGTGAACCTT 8220 

GACATCTGTA GTTGACTCAG TTGGAACTCT CGAAAATAGA ATATCAAACT CTTCCTTATC 8280 

TTGTGAATCA AAGACCTTTC CTTTATCAAT CGCATCATTA TCTAGGAAAA GCAACTGGTC 8 340 

ATTCTTTTCA AGCTTTGGAA TGGTGACTGA ATTTTTCAAA AGACAATAAC TATTGATACG 8400 

GCAGTTGGTC CCAACAAAAT CGCCCTTCTT TTGATTCCAG AGATGACTGA TTTTCTCAAC 8460 

ATCGTATTCC GTGTGAGTAA AGGAAGTGAA ATCTCCTGAT AAGCCAGTTG AGCCGACAAT 8520 

GGTATTATAG TCATTAACGA GATTAAAAAA TGCATCAACA CTATTTGGAT CCAAGTGAGC 8580 

TGATAAGAGA GATTTGACCT CTTCTGTACT TACCTGGTTG TTTAGGTTGG TGTATGAAGC 8640 

TTTCCATGGA ACTTTCGCTG AACTGCTTTG CCTTTGATTC GTCCCCTCAC AAGTAGCATG 8700 

TTGTTGTTGA CAAGCAGCCA AGCCTAAAAA CAAGGCTGAA CAGATTCCTA ATGTGGCTAA 8760 

TTTTCTTGAT TTCTTCATTT CTTTCTCCTA AATGTCTTGG ATTAAAGTTT CTTTAACTAT 8820 

TGCTTTACAG ATATTGATTA CTTTCTCATT TAATGTGTTC ATCGTCTTTC CTCCGG 8876 
(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14736 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

CGCAAACTTT CGCGGTCGGA AGGTA G TTTT ATGACACGAT TTGAGATACG AGATGATTTC 60 

TATCTCGATG GAAAATCATT TAAGATTTTA TCTGGTGCCA TTCATTATTT TAGGGTTCCT 120 

CCAGAGGATT GGTATCATTC GCTCTATAAC TTGAAGGCTC TTGGTTTTAA TACGGTAGAG 180 
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ACTTATGTTG 


CTTGGAATTT 


ACACGAGCCT 


TGTGAAGGTG AGTTTCATTT 


TGAAGGTGAT 


240 


CTGCATTTAG 


AGAAATTTCT 


CCAAATAGCG 


CAGGATTTGG GTCTCTACGC 


AATTCTGCGT 


300 


CCCTCTCCAT 


TTATCTGTGC 


GGAATGGGAA 


TTCGGTGGCT TACCAGCTTG 


GCTCTTGACC 


360 


AAGAACATGC 


GAATTCGCTC 


ATCCGACCCA 


GCATATATCG AGGCAGTTGG 


TCGCTACTAT 


420 


GATCAGTTAT 


TGCCAAGACT 


GGTGCCTCGT 


TTGTTGGACA ATGGTGGCAA 


TATTCTCATG 


480 


ATGCAGGTTG 


AAAATGAGTA 


TGGTTCTTAC 


GGAGAAGATA AGGCTTACCT 


GAGAGCGATT 


540 


CGACAGCTAA 


TGGAAGAGTG 


TGGCGTAACC 


TGTCCCCTCT TTACATCAGA 


TGGTCCATGG 


600 


CGAGCTACTC 


TGAAAGCTGG 


AACCTTAATT 


GAAGAGGACC TCTTTGTAAC 


AGGAAACTTT 


660 


GGTTCTAAGG 


CACCTTACAA 


CTTTTCGCAG 


ATGCAGGAAT TCTTTGATGA 


ACATGGTAAG 


720 


AAATGGCCAC 


TCATGTGTAT 


GGAGTTCTGG 


CATGGTTCCT TCAATCGCTG 


GAAAGAACCG 


780 


ATTATCACAC 


GGGATCCTAA 


GGAATTGGCA 


GATCCAGTTC GAGAGGTTTT 


GGAACAAGGC 


840 


TCTATCAATC 


TTTACATGTT 


CCACGGTGGT 


ACAAACTTTG GTTTCATGAA TGGTTGCTCA 


900 


GCTCGAGGAA 


CTTTGGACCT 


CCCACAAGTT 


ACGTCTTATG ATTACGATGC 


CCTTCTGGAT 


960 


GAAGAAGGAA 


ATCCAACTGC 


TAAATATCTT 


GCAGTCAAGA AGATGATGCC 


AACACATTTT 


1020 


TCAGAGTATC 


CGCAGTTGGA 


ACCACTCTAC 


AAAGAGAGTA TGGAGTTGGA 


TGCTATTCCA 


1080 


CTAGTTGAAA 


AAGTTTCTTT GTTTGAAACC TTAGATAGCT TGTCAAGTCC TGTAGAAAGT 


1140 


CTCTATCCTC 


AAAAGATGGA 


GGAGCTGGGA 


CAAAGTTATG GCTACCTACT 


TTATCGAACA 


1200 


GAAACAAACT 


GGGATGCAGA 


AGAAGAAAGA 


CTTCCTATCA TTGATGGTCG 


AGATAGGGCC 


1260 


CAGCTGTATG 


TCGATGGTCA 


GTGGGTTAAA 


ACTCAATATC AGACAGAGAT 


TGGGGAAGAT 


1320 


ATTTTTTATC 


AAGGTAAAAA 


GAAAGGGCTA 


TCTAGGTTAG ATATCTTGAT 


AGAAAATATG 


1380 


GGGCGTGTCA 


ACTATGGGCA 


TAAGTTCTTA 


GCGGATACGC AACGTAAGGG 


AATTCGGACA 


1440 


GGGCTCTGTA 


AGGATCTGCA 


TTTCTTACTA 


AACTGGAAAC ACTATCCACT 


CCCACTAGAC 


1500 


AATCCTGAGA 


AAATTGATTT 


TTCAAAAGGA 


TGGACTCAAG CACAACCAGC 


CTTTTACGC? 


1560 


TATGACTTTA 


CAGTCGAAGA 


GCCAAAAGAT 


ACTTACCTAG ACTTGTCTGA 


GTTTGGTAAG 


1620 


GGGGTTGCCT 


TTGTCAATGG 


GCAGAATCTA 


GGACGTTTTT GGAACGTTGG 


CCCAACTCTC 


1680 


TCACTTTATA 


TCCCTCATAG 


CTATCTCAAG 


GAAGGTGCCA ACCGCATCAT 


TATCTTTGAA 


1740 


ACAGAAGGTC 


AATATAAAGA 


AGAGATTCAT 


TTAACTCGTA AACCTACACT 


AAAACATATA 


1800 


AAGGGGGAAA 


ACTTATGACA 


ATTGTAGGAT 


GCCGTATTGA TGGACGTTTG 


ATCCACGGAC 


1860 


AAGTAGCCAA 


TCTTTGGGCT 


GGAAAACTAA 


ATGTTTCACG CATTATGGTT 


GTAGACGACG 


1920 


AAGTTCTCAA 


CAACGATATT 


GAAAAGAGTG 


GTTTGAAACT TGCGACACCA 


CCACGTGTGA 


1980 
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AATTGACTAT 


TTTGCCAGTT 


GAGAAAGCTG 


CAGCCAATAT 


TCTTGGTGGC 


AAATACGATA 


2040 


GCCAACGTCT 


CTTTATCGTG 


GCTCGTAAAC 


CAGACCGCTT 


CCTTGGTTTG 


GTAGAAGCAG 


2100 


GTGTACCACT 


TGAAACCCTT 


AATGTTGGGA 


ATATGTCTCA 


AACACCAGAA 


ACTCCTTCTA 


2160 


TTACACGTTC 


TATCAACGTA 


GTAGACAAGG 


ATGTGGAAGA 


CTTCCACAAA 


CTGGCAGAAA 


2220 


AAGGTGTTAA 


ACTTACTGCT 


CAGATGGTTC 


CAAATGATCC 


AATTTCAGAC 


TTTTTGAGCT 


2280 


TATTAAAATA 


GGAAAAAAAT 


TTTTAGGAGG 


TCATTGTTAT 


GATACAATGG 


TGGCAAATTT 


2340 ~ 


TACTTCTCAC 


TTTGTACTCA 


GCTTATCAAA 


TCTGTGATGA 


GTTGACGATC 


GTTTCATCTG 


2400 


CAGGTTCCCC 


TGTATTTGCT 


GGTTTCATTA 


CTGGTTTAAT 


CATGGGAGAT 


GTGACTACTG 


2460 


GTTTACTTAT 


CGGTGGTAAC 


TTGCAACTGT 


TCGTTCTTGG 


GGTTGGTACC 


TTCGGTGGTG 


2520 


C7TCTCGTAT 


CGACGCAACT 


TCTGGTGCGG 


TTCTTGCGAC 


ACCTTCTCTG 


TTTCACAAGG 


2580 


AATTGATGCA 


CCGCTTGCCA 


TTACTACAAT 


CGCTGTACCA 


GTAGCACCTC 


TCTTGACTTA 


2640 


CTTCGACGTT 


CTTGGTCGTA 


TGACTACTAC 


CTTCTTCGCT 


CACCGTGTGG 


ATGCTGCAAT 


2700 


CGAACGCTTT 


GACTATAAAG 


GTATTGAACG 


CAACTACTTG 


CTTGGTGCGA 


TTCCGTGGGC 


2760 


TCTATCTCGT 


GCCCTTCCAG 


TCTTCTTTGC 


CCTTGCTTTT 


GGTGGTGCCT 


TTGTACAATC 


2820 


AGTAGTAGAC 


TTCGTTGAAG 


CCTACAAATG 


GGTTGCAGAT 


GGCTTGACAC TTGCACGACG 


2880 


TATGCTTCCA 


GGTCTTGGAT 


TTGCAATCTT 


GCTTCGTTAC 


CTTCCAGTTA 


AACGTAACCT 


2940 


TCACTACCTT 


GCTATGGGAT 


TTGGTTTGAC 


AGCTATGTTG 


ACTGTTCTTT 


ACTCATATGT 


3000 


AACAGGTCTT 


GGTGGCGCTG 


TTGCTGGTAT 


CGTAGGTACT 


CTTCCTGCTG 


AAGTTGCTGA 


3060 - 


AAAAATTGGT 


TTCGTGAACA 


ACTTCAAAGG 


TTTGTCTATG 


ATTGGTATTT 


CTATCGTAGG 


3120 


TATTTTCCTT 


GCAGTGCTTC 


ACTTCAAAAA 


TAGCCAAAAA 


GTAGCTGTAG 


CAGCACCTTC 


3180 


TACACCATCA 


GAAAGTGGGG 


AAATCGAAGA 


TGACGAATTC 


TAATTACAAA 


CTTACAAAAG 


3240 


AAGATTTTAA 


TCAAATCAAC 


AAACGTAGCT 


TGTTTACTTT 


CCAATTAGGT 


TGGAACTACG 


3300 


AACGTATGCA AGCTTCTGGT 


TACCTTTACA 


TGATCTTGCC 


TCAGTTGCGT 


AAAATGTATG 


3360 


GTGATGGAAC 


TCCTGAATTG 


AAAGAAATGA 


TGAAAGTTCA 


TACTCAATTC 


TTCAATACTT 


3420 


CACCATTCTT 


CCATACCATT 


ATCGCTGGTT 


TTGACCTTGC 


CATGGAAGAA 


AAAGATGGTG 


3480 


TAGGTTCAAA 


AGACGCCGTT 


AACGGTATCA 


AGACAGGTTT 


GATGGGACCA 


TTCGCTCCTC 


3540 


TTGGGGATAC 


AATCTTTGGT 


TCACTTGTAC 


CTGCTATCAT 


GGGGTCAGTC 


GCAGCAACTA 


3600 


TGGCTATCGC 


TGGCCAACCT 


TGGGGGATCT 


TCCTTTGGAT 


TGCAGTTGCA 


GTAGCGTATG 


3660 


ACATCTTCCG 


TTGGAAACAG 


TTGGAATTTG 


CTTACAAAGA 


AGGGGTTAAC 


CTTATCAACA 


3720 
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ACATCCAAAG 


TACCTTGACA 


GCTTTGATTG 


ACGCTGCATC 


TGTACTTGGT 


GTCTTCATGA 


3780 


TGGGTCCTCT 


TGTAGCAACA 


GTGATTAACT 


TTGAAATTTC 


TTACAACTTC 


CCAATCGGTG 


3840 


AAAAGATGAT 


TGATTTCCAA 


GACATCTTGA 


ACCAAATCTT 


CCCACGTTTG 


CTTCCAGCAA 


3900 


TCTTTACTGC 


CTTTATCTTC 


TGGTTGCTTG 


GTAAGAAAGG 


TATGAACTCT 


ACTAAAGCTA 


3960 


TCGCTATTAT 


TATCGTACTT 


GCTTTGGCTC 


TTTCTGCCCT TGGTCACTTT 


GCACTTGGAA 


4020 


TGTAATTCCT 


TATGACTAAA 


TCATTAATTT 


TGGTGAGCCA 


TGGTCGCTTC 


TGTGAGGAGC 


4080 


TTAGAGGTAG 


CACAGAAATG 


ATTATGGGCC 


CACAAGACAA 


CATTTACACA 


GTAGCTCTTC 


4140 


TTCCAGAAGA 


TGGCCCAGAA 


GAATTTACTG 


CTAAATTTGA 


AGCTGTTATT 


GAAGGATTGG 


4200 


ATGATTTCCT 


AGTCTTTGCG 


GATCTTCTCG 


GTGGGACACC 


TTGTAATGTG 


GTGAGTCGCT 


4260 


TGATCATGGA 


AGGTCGTGAT 


ATTGACCTTT 


ACGCAGGGAT 


GAATCTTCCA 


ATGGTGATTG 


4320 


AATTTATCAA 


TGCGAGCCTT 


ACAGGCGCAG 


ATGCGGACTA 


CAAGAGCCGT 


GCTGCAGAAA 


4380 


GCATTGTGAA 


AGTTAATGAC 


CTGTTAGCGG 


GCTTCGATGA 


TGACGAAGAT 


GAATAATACT 


4440 


CTTCGAAAAT 


CTCTTCAAAC 


TACGTCAACG TCGCCTTGCC GTAGgTATAT GTTACTGACT 


4500 


TCGTCAGTCT 


TATCCGGCAA 


CCTCAAAACG 


GTGTTTTGAG 


CTGACTTCGT 


CAGTCTTATC 


4560 


CGGCAACCTC 


AAAGCAGTGC 


TTTGAGCAGC 


CTGCGGCTAG 


TTTCCTACAG 


ATTTTAGTTG 


4620 


GAACTCGATT 


CAATTCATGT 


GACAACGTGA 


AAATCGTTAG 


AGCATTTTAT 


ATAGAATATA 


4680 


CATGGGAATG 


TAGCTTACTC 


CCATTCCCAT 


ATTTAATAGA AAAAGAGGAA 


CTCAATGCTA 


4740 


CATTATACAA 


AAGAAGACTT 


GCTCGAATTG 


GGTGCAGAAA 


TCACTACGCG 


TGAAATCTAC 


4800 


CAACAGCCTG 


ATGTATGGAG 


AGAAGCTTTT 


GAATTTTATC 


AAGCAAAACG 


TGAAGAAATT 


4860 


GCAGCCTTCC 


TACAAGAAAT 


CGCTGATAAA 


CATGACTATA 


TTAAGGTTAT 


CTTGACAGGT 


4920 


GCTGGGACTT 


CTGCTTATGT 


GGGAGATACC 


TTGCTACCTT 


ATTTTAAGGA 


AGTCTATGAC 


4980 


GAACGCAAAT 


GGAATTTCAA 


TGCTATTGCG 


ACAACAGATA TCGTTGCCAA 


TCCAGCAACC 


5040 


TATTTGAAAA 


AAGATGTGGC 


AACTGTCCTT 


GTGTCTTTTG 


CTCGTAGTGG 


GAATTCGCCT 


5100 


GAAAGTTTGG 


CGACTGTTGA 


TTTGGCCAAA 


TCCTTGGTGG 


ATGAGCTTTA 


TCAAGTGACG 


5160 


ATTACTTGTG 


CAGCAGATGG 


TAAATTGGCT 


CTTCAAGCTC 


ACGGTGATGA 


TCGTAATCTC 


5220 


TTGCTCTTGC 


AACCAGCTGT 


CTCTAATGAT 


GCTGGATTTG 


CCATGACTTC 


TAGCTTTACG 


5280 


TCTATGATGT 


TGACAACTCT 


CTTGGTCTTT 


GATCCTACAG 


AATTTGCTGT 


TAAGTCTGAA 


5340 


CGTTTTGAAG 


TTGTATCTAG 


TCTTGCCCGT 


AAAGTTTTAG 


ACAAGGCAGA 


AGATGTCAAA 


5400 


GAGCTCGTTG 


ATTTAGACTT 


TAACCGTGTC 


ATCTATCTAG 


GCGCTGGTCC 


TTTCTTTGGA 


5460 


CTTGCTCATG 


AAGCTCAGCT 


CAAGATTTTG 


GAATTAACTG 


CTGGTCAAGT 


TGCGACCATG 


5520 
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TATGAAAGCC CAGTTGGCTT CCGTCACGGT CCAAAATCTC TTATCAACGA CAATACAGTT 5580 

GTTTTGGTCT TTGGTACAAC GACAGACTAC AC7CCTAAGT ACGACTTGGA CTTGGTTCGT 5640 

GAAGTTGCTG GTGACCAGAT TGCTCGTCGT GTTGTGCTTT TGAGTGATCA AGCTTTTGGT 5700 

CTTGAAAATG TCAAAGAAGT GGCCCTTGGT TGTGGCGGTG TCTTGAATGA TATTTACCGT 5760 

GTCTTCCCTT ACATCGTTTA TGCCCAACTC TTTGCTTTAT TGACTTCACT CAAGGTAGAA- 5820 

AATAAACCAG ATACACCGTC TCCTACAGGT ACAGTAAACC CTGTAGTACA AGGTGTCATA 5880 

ATTCACGAAT ATCAAAAGTA AGACAGTGTT TATGAATTCT TGACAAGAGG ATTTGTAAAT 5940 

TATCAGATAA ACCATAGATT GTCAGTACGC TTTCTATGGT TTGTTTGCTT GAGAGAAATA 6000 

GTAAAAGGAG AACAGAATGA AAGCATACAC AGAGCGTGTA TTTGGAAATG TTGAGGGTGA 6060 

GGATGTCTTG GCCTATCGAT TTGAGACAGA CGGTGGCTAC CAACTTGAGG TTATGACTTA 6120 

TGGTGCGACT ATCTTGCGCT ATGTCGCACC TGACAAGGCT GGAAATTTTG CCAATGTTAT 6180 

CTTGGGATTT GATGACTTTG ATAGTTATGT AGGCAATAGT CCCAAGCATG CAGCAAGTGT 6240 

AGGTCCTGTA GCGGGTCGTA TTGCAGGTGC GACCTTTGAG CTCAATGGTA AGACCTATGA 6300 

CCTTGAGGTT AATAATGCTA GCAACTGTAA TCACAGTGGT TCAACTGGTT GGGATTCCAG 6360 

CTTGTTTGAA GTTGAAGAAG TAAGCGATCA TGCCTTCACT CTCTACACAG AGCGTACAGA 6420 

TGGGACAGGA GGGTTCCCTG GAAATCTCAA GATTTGGATC AGTTATCACT TGGAAGAAAC 6480 

TGGTGCCTAT GAAATCAGCT ACAAGGTAAC GACCGATCAG GATACGCTGG TCAATCCAAC 6540 

CAACCACAGC TATTTCAACT TGTCTGGTGA TTTCACGCAG ACGATTGACC GTCATGTCTT 6600 

CCAACTAAAC ACAGAGGGCA TTTACTCAAT CGCTCCTGAC GGTGTTCCTG CCAAAACTCC 6660 

AGAAGCCAAC CGTGATGTGG TCAAACACGT CTACAATGGT ACCTTGTTGA AGGATATCTT 6720 

TGCAGAAGAA CATGAGCAAA TCCAGCTGGC ATCAGGTTTG GATCATCCAT TTGCCCTTCC 6780 

TGCAGGCCAT GACAATGCTG GATTCCTTTA TGACCAAAAT TCAGGTCGCT TCCTGCTTTT 6840 

CAAGACAGAA CCTCCTTCCT TTGTGGTCTA CACAGCAAAC TTTGTGGATG AAAGTGTCAT 6900 

CATAGGAGGT CAGCCAATGC TACAGCACAA TGGGATTGCT CTTGAAGCGC AAGCTTTACC 6960 

AGATGCCATT CACAGTGACC TTAAAGGCCA AGTCATTCTT AAAGCTGGTC AAACCTTCAC 7020 

CAGTAAGACA CGTTATGAAC TTGTTGTGAA GTAAAAGAGT CATTGCGCCT ACTTTTGGGA 7080 

GCTAGGAATA GGTACGCAGA GACAAATAGT AGGAAAATAT GATATAACTA AGCGTTGAAA 7140 

GCTATCTGTT AATATAATAT TCAAACTACA ATAAGGAGTA AGAAAGAAAC GAAGAAAATT 72 00 

GTATTTGCTA GTGCCTTGGC TTTGACCTTG GCTGGAGCAG TTTTGACAAA TGATGTTTTT 7 260 
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GCGAACGACA 
TCAGAGGTGC 
GCTCCTCATC 
GAAGGTTTGG 
TTTGCCAGAG 
CTTTGGAGTG 
GACGGTTTTC 
AAAATAGTGG 
TTACACACAT 
TTGCTCAAAC 
CTGTTCAGTG 
TTGTAGAAAA 
AATCTAATGG 
CAGATGGTAA 
TTTCTGGTAG 
ATGGTAGCAG 
AAAATGGCAC 
TCGGACCACA 
CAGATGGTTA 
AAGCATTCAT 
TGGCCTCTTT 
GTAAGCTAAT 
GTAATTAGTC 
AATTTACTGT 
TATCGATTTA 
TATTATTTTT 
AAAAGCGAGG 
AGAAAGTTGT 
GGGATTCAAT 
GAGGGTAAAT 



GACTTCTCGC 
TAAAACCTTC 
AACAATCTAT 
TAGATAAGTA 
CTACAGAAGC 
CCTTTCCAAC 
TAAAAGCTAT 
TTCAGACAAC 
GGGGATCGCA 
TCTAGGTGAT 
TACAGAAGTG 
ACCACTGAAA 
TAAATGGTAT 
ATGCTACTAT 
CTGGTATTAC 
ATGGTTCTAC 
TTGGTATTAC 
CTGCTACTAT 
CCGTGTAAAT 
CTTACTTAGC 
TGATTTATAA 
ATTTTATAGC 
TGAAGTCCAC 
ATTTTTTATA 
TTGTATAATG 
ATGTTTATAC 
TTAATATGAG 
CAGTGGGATT 
CGGTATCGGC 
TGACAGACGA 



AACACAAACT 
TAGTGGCAAT 
TTTGGATGCC 
TGTCCCTATC 
ATCTATAACC 
TTCTAATAGT 
TGAACAATGG 
GGGAAATCTG 
GCTTTTAAAA 
GATGCTTCTT 
ACTGCCTCAA 
GATTTTAGAG 
TTCTATGAGT 
TTGAATGACT 
TTCAGCAATT 
TTTGACGGCT 
CTTGACGAAG 
GCCTACGGTT 
GGTAATGGTG 
AAAAAGAATG 
AGATTGGATT 
CCATTAAAAG 
ACTTACTTGT 
GTATGCAGAA 
ATAAGTAATT 
TTTTATGCTA 
CCGAAAAAGC 
GCTATCACTT 
CGATAATATG 
GGAAAAATCC 



1090 
ACTGATGGTA 



CTTTTGGTTG 
ATCAATGCTA 
AAATGATCAA 
ATGGATCATA 
ATAATGGGAG 
CCTGCTGAAA 
GTCACTATGA 
ATCCTAACAA 
CAGAGGAATT 
ACCTTTCAAC 
CGTCTACCTC 
CTGGTGATGT 
TAGGTGTCAT 
CAGGTGCTAT 
CAGGAGCTAT 
CAGGTATCAT 
CAGGAGCTTT 
AATGGCTAAA 
AACGATAAGA 
CTTGTCGCCT 
CATAAGCGGT 
TGAGATGTTA 
TATTTTTAAG 
GTTGAAAAGT 
TAAAATATAG 
ATTGGTGAGA 
ACTGTATCTA 
GAAAGTCCAA 
TTGCTGGTAG 



AAAATGAAAA 
GAATCAAAGG 
TCTGTAAAGA 
CTGACCTAGA 
CCCGTCTTTC 
AAAATTTGGC 
AAGCAGATTA 
GTCGCTAATT 
TCAATACAAA 
GGCTGGTAGA 
AGTTAAAACT 
TGATCAGTCT 
GAAGACAGGT 
GCAGACTGGA 
CTTTACAGGC 
GAAGACAGGC 
GAAGACAGGT 
GGCTGTGAGC 
CTAGGCTCAG 
AAGAGGTTGA 
CAATTTCAGA 
AATCTAATTT 
TCTCTGTTTT 
TATATTTCAA 
ACTCAGAAAA 
ATTGATATAA 
AACGCCATAG 
GTTTCTTTTT 
TTCATTATAA 
AGGCCCTTCC 



TGTATTGACC 
AGAATTTGTG 
AGCGGCTGAC 
AAAGGCAGCT 
TAGCAAAGAT 
ATGGAATCAT 
TGTGGAGAAA 
AACCCTAAAT 
GCTATTACAA 
TATGGTTCTG 
AAAGCTACGG 
GGTTGGGTCG 
TGGGTGAAAA 
TTTGTAAAAT 
TGGGGAACAG 
TGGTACAAGG 
TGGTTTAAAG 
ACAACAACAC 
GCCATAGGTA 
TGGCGAACAT 
CTTTTCTATT 
AAAAAATGCT 
TTATCGTTA? 
TAGAAATTTC 
TTCCATACTA 
AGAATATAGA 
TTTCTCGATG 
GATGAGTCAA 
GTATATGACC 
ACAACTGGCT 



7320 
7380 
7440 
7500 
7S60 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
3160 
8220 
8280 
8340 
8400 
8460 
8520 
3580 
8640 
.1700 
8760 
8820 
' 8880 
8940 
9000 
9060 
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GAAGAATCAG 


ATGATACTTA 


TTACTTGGTT 


TATAGATCTC 


AACAGTTTTT 


ACCGAATACA 


9120 


GGTTTTAACC 


CAACTGTTGG 


TACTTTCCTT 


TTTACTGCAG 


GATTGAGCTT 


GTTAGTTTTA 


9180 


TTGGTTTCTA 


AAAGGGAAAA 


TGGAAAGAAA 


CGACTTGTTC 


ATTTTCTCCT 


GTTGACTACC 


9240 


ATGGGAGTTC 


AATTGTTGCC 


GGCCAGTGCT 


TTTGGGTTGA 


CCAGCCAGAT 


TTTATCTGCC 


9300 


TATAATAGTC 


AGCTTTCTAT 


CGGAGTCGGG 


GAACATTTAC 


CAGAGCCTCT 


GAAAATCGAA 


9360 


GGTTATCAAT 


ATATTGGTTA 


TATCAAAACT 


AAGAAACAGG 


ATAATACAGA 


GCTTTCAAGG 


- 9420 


ACAGTTGATG 


GGAAATACTC 


TGCTCAAAGA 


GATAGTCAAC 


CAAACTCTAC 


AAAAACATCA 


9480 


GATGTAGTTC 


ATTCAGCTGA 


TTTAGAATGG 


AACCAAGGAC 


AGGGGAAGGT 


TAGTTTACAA 


9540 


GGTGAAGCAT 


CAGGGGATGA 


TGGACTTTCA 


GAAAAATCTT 


CTATAGCAGC 


AGACAATCTA 


9G0C 


TCTTCTAATG 


ATTCATTCGC 


AAGTCAAGTT 


GAGCAGAATC 


CGGATCACAA 


AGGAGAATCT 


9660 


GTAGTTCGAC 


CAACAGTGCC 


AGAACAAGGA 


AATCCTCTGT 


CTGCTACAAC 


GGTGCAGAGT 


9720 


GCGGAAGAGG 


AAGTATTGGC 


GACGACAAAT 


GATCGACCAG 


AGTATAAACT 


TCCATTGGAA 


9780 


ACCAAAGGCA 


CGCAAGAACC 


CGGTCATGAG 


GGTGAAGCCG 


CAGTCCGTGA 


AGACTTACCA 


9840 


GTCTACACTA 


AGCCACTAGA 


AACCAAAGGT 


ACACAAGGAC 


CCGGACATGA 


AGGTGAAGCT 


9900 


GCAGTTCGGG 


AGGAAGAACC 


AGCTTACACA 


GAACCGTTAG 


CAACGAAAGG 


CACGCAAGAG 


9960 


CCAGGTCATG 


AGGGCAAAGC 


TACAGTCCGC 


GAAGACACTC 


TAGAGTACAC 


GGAACCGCTA 


10020 


GCGACAAAAG 


GCACACAAGA 


ACCCGAACAT 


GAGGGCGAAg 


CCGCACTAGA 


AGAAGAACTT 


10080 


CCGGCTTTAG 


AGGTCACTAC 


ACGAAATAGA 


ACGCAAATCC 


AGAATATTCC 


TTATACAACA 


10140 


GAAGAAATTC 


AGGATCCAAC 


ACTTCTGAAA 


AATCGTCGTA 


AGATTGAACC 


ACAAGGGCAA 


10200 


GCAGGGACAC 


GTACAATTCA 


ATATGAAGAC 


TACATCGTAA 


ATGGTAATGT 


CGTAGAAACT 


10260 


AAAGAAGTGT 


CACGAACTGA 


AGTAGCTCCG 


GTCAACGAAG 


TCGTTAAAGT 


AGGAACACTT 


10320 


GTGAAAGTTA 


AACCTACAGT 


AGAAATTACA 


AACTTAACAA 


AACTTGAGAA 


CAAAAAATCT 


10380 


ATAACTGTAA 


GTTATAACTT 


AATAGACACT 


ACCTCAGCAT 


ATGTTTCTGC 


AAAAACGCAA 


10440 


GTTTTCCATG 


GAGACAAGCT 


AGTTAAAGAG 


GTGGATATAG 


AAAATCCTGC 


CAAAGAGCAA 


10500 


GTAATATCAG 


GTTTAGATTA 


CTACACACCG 


TATACAGTTA 


AAACACACCT 


AACTTATAAT 


10560 


TTGGGTGAAA 


ATAATGAGGA 


AAATACTGAA 


ACATCAACTC 


AAGATTTCCA 


ATTAGAGTAT 


10620 


AAGAAAATAG 


AGATTAAAGA 


TATTGATTCA 


GTAGAATTAT 


ACGGTAAAGA 


AAATGATCGT 


10680 


TATCGTAGAT 


ATTTAAGTCT 


AAGTGAAGCG 


CCGACTGATA 


CGGCTAAATA 


CTTPGTAAAA 


10740 


GTGAAATCAG 


ATCGCTTCAA 


AGAAATGTAC 


CTACCTGTAA 


AATCTATTAC 


AGAAAATACG 


10800 
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CATGGAACGT ATAAAGTCAC GGTAGCCGTT GATCAACTTG TCCAAGAAGG TACAGACGGT 10860 

TACAAAGATG ATTACACATT TACTGTAGCT AAATCTAAAG CAGAGCAACC AGGAGTTTAC 10920 

ACATCCTTTA AACAGCTGGT AACAGCCATG CAAAGCAATC TGTCTGGTGT CTATACATTG 10980 

CCTTCAGATA TGACCGCAGA TGAGGTGAGC TTAGGCGATA AGCAGACAAG TTATCTCACA 11040 

GGTGCATTTA CAGGGAGCTT GATCGGTTCT GATGGAACAA AATCGTATGC CATTTATGAT 11100 

TTGAAGAAAC CATTATTTGA TACATTAAAT GGTGCTACAG TTAGAGATTT GGATATTAAA 11160 

ACTGTTTCTG CTGATAGTAA AGAAAATGTC GCAGCGCTGG CGAAGGCAGC GAATAGCGCG 11220 

AATATTAATA ATCTTGCAGT AGAAGGAAAA ATCTCAGGTC CGAAATCTGT TGCGGGATTA 11280 

GTAGCGAGCG CAACAAATAC AGTGATAGAA AACAGCTCGT TTACAGGGAA ACTTATCGCA 11340 

AATCACCAGG ACAGTAATAA AAATGATACT GGAGGAATAG TAGGTAATAT AACAGGAAAT 11400 

AGTTCGAGAG TTAATAAAGT TAGGGTAGAT GCCTTAATCT CTACTAATGC ACGCAATAAT 114 60 

AACCAAACAG CTGGAGGGAT AGTAGGTAGA TTAGAAAATG GTGCATTGAT ATCTAATTCG 11520 

GTTGCTACTG GAGAAATACG AAATGGTCAA GGATATTCTA GAGTCGGAGG AATAGTAGGA 11580 

TCTACGTGGC AAAACGGTCG AGTAAATAAT GTTGTGAGTA ACGTAGATGT TGGAGATGGT 11640 

TATGTTATCA CCGGTGATCA ATACGCAGCA GCAGATGTGA AAAATGCAAG TACATCAGTT 11700 

GATAATAGAA AAGCAGACAG ATTCGCTACA AAATTATCAA AAGACCAAAT AGACGCGAAA 11760 

GTTGCTGATT ATGGAATCAC AGTAACTCTT GATGATACTG GGCAAGATTT AAAACGTAAT 11820 

CTAAGAGAAG TTGATTATAC AAGACTAAAT AAAGCAGAAG CTGAAAGAAA AGTAGCTTAT 11880 

AGCAACATAG AAAAACTGAT GCCATTCTAC AATAAAGACC TAGTAGTTCA CTATGGTAAC 11940 

AAAGTAGCGA CAACAGATAA ACTTTACACT ACAGAATTGT TAGATGTTGT GCCGATGAAA 12000 

GATGATGAAG TAGTAACGGA TATTAATAAT AAGAAAAATT CAATAAATAA AGTTATGTTA 12060 

CATTTCAAAG ATAATACAGT AGAATACCTA GATGTAACAT TCAAAGAAAA CTTCATAAAC 12120 

ACTCAAGTAA TCGAATACAA TGTTACAGGA AAAGAATATA TATTCACACC AGAAGCATTT 12180 

GTTTCAGACT ATACAGCGAT AACGAATAAC GTACTAAGCG ACTTGCAAAA TGTAACAC7T 12240 

AACTCAGAAG CTACTAAAAA AGTACTAGGA GCAGCGAATG ATGCAGCCTT AGATAACCTA 12300 

TACTTAGATA GACAATTTGA AGAAGTTAAA GCTAATATAG CAGAACACCT AAGAAAAGTA 12360 

TTAGCGATGG ATAAATCAAT CAATACTACA GGAGACCGTG TAGTTGAATA CGTAAGTGAG 12420 

AAAATCAAAA ATAACAAAGA AGCATTTATG CTAGGTCTTA CTTATATGAA CCGTTGGTAC 12480 

GATATTAATT ATGGTAAAAT GAATACAAAA GATTTATCTA CGTACAAGTT TGACTTTAAC 12 540 

GGAAATAATG AGACTTCAAC GTTGGATACT ATTGTCGCAT TAGGAAATAG TGGACTAGAT 12600 
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AACCTCAGAC CTTCAAATAC TGTACGTTTA TATGCGAATA AACTTGCATC GGTAAAAGGA 12660 

GAAGATTCAG TCTTTGACTT CGTAGAAGCG TATAGAAAAC TCTTCTTACC AAACAAAACA 12720 

AATAACGAGT GGTTTAAAGA AAATACAAAG GCATATATAG TCGAAATGAA GTCTGATATT 12780 

GCAGAAGTAC GAGAAAAACA AGAATCACCA ACAGCCGATA GAAAATATTC ATTAGGACTT 12840 

TACGATAGAA TATCAGCACC AAGTTGGGGG CATAAGAGTA TGTTATTACC ACTACTAACT 12900 
TTACCTGAAG AATCTCTGTA TA^nTCATCG AATATGTCTA CACTTCCATT CGCTTCGTAT " 12960 

GAAAGATATC GTGATAGTGT GGATGGAGTT ATTCTTTCAG GAGATGCTTT ACGAACTTAT 13020 

GTAAGAAATA GAGTTGATAT AGCAGCGAAA AGGCATAGAG ACCATTATGA TATTTGGTAC X3080 

AATCTTCTTG ACAGTGCTTC AAAAGAAAAA CTTTTCCGTT CTGTGATAGT TTATGATGGA 13140 

TTCAATGTAA AAGATGAGAC AGGAAGAACT TATTGGGCAA GGTTAACGGA TAAAAACATC 13200 

GGCTCTATTA AAGAATTCTT CGGACCTGTT GGGAAATGCT ATGAGTATAA TAGTAGTGCA 132 60 

GGAGCGTATG CGyAtGGAAG TTTAACGCAC TTTGTGTTAG ATAGATTATT AGATGCTTAT 13 320 

GGAACGTCGG TTTATACTCA TGAAATGGTT CATAATTCTG -ATTCTGCAAT CTACTTTGAA 133 80 

GGAAATGGTA GACGTGAAGG ATTGGGAGCG GAGTTATACG CACTTGGTTT ACTGCAATCT 13440 

GTAGATAGTG TAAATTCTCA TATTTTAGCT TTAAATACGT TATATAAAGC AGAAAAAGAT 13500 

GATTTGAATA GATTGCATAC ATATAATCCG GTGGAACGTT TCGATTCGGA TGAGGCGCTT 13560 

CAAAGTTATA TGCATGGATC ATATGATGTA ATGTATACAC TTGATGCGAT GGAAGCAAAA 13 620 

GCGATATTAG CTCAAAATAA TGATGTTAAG AAAAAATGGT TTAGAAAAAT AGAAAATTAT 13 680 

TACGTTCGTG ATACTAGACA TAATAAAGAT ACACATGCAG GAAATAAAGT CCGTCCATTA 13740 

ACAGATGAAG AAGTAGCTAA CTTAACATCG TTAAACTCAT TAATCGACAA CGACATCATA 13 800 

AATAGACCTA GCTATGATGA TAGTAGAGAA TATAAACGAA ATGGCTACTA TACTATAACT 13 860 

ATGTTCTCTC CTGTATACGC AGCGCTAAGC AATTCGAAAG GTGCTCCTGG AGATATTATG 13 920 

TTTAGAAAAA TAGCTTATCA ATTACTTGCG GAAAAAGGTT ATCACAAAGG ATTCCTACCT 13 980 

TATGTTTCTA ATCAGTACGG AGCAGAAGCA TTTGCCAGCG GAAGCAAAAC ATTCTCATCA 14040 

TGGCATGGAA GAGATGTTGC TTTAGTGACA GATGATTTAG TATTTAAGAA AGTATTCAAT 14100 

GGTGAGTACT CATCATGGGC TGATTTCAAA AAAGCAATGT TTAAACAACG TATAGATAAA 14160 

CAAGATAATC TGAAACCAAT AACAATTCAA TACGAATTAG GTAATCCTAA TAGTACAAAA 14220 

GAAGTAACTA TAACAACGGC TGCACAAATG CAACAATTAA TTAATGAAGC GGCTGCGAAA 14280 

GAT ATT ACT A ATATAGATCG TGCAACGAGT CATACCCCAG CAAGTTGGGT GCATTTATTA 14 340 
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AAACAAAAAA TCTATAATGC ATATCTTCGC ACTACAGATG ACTTTAGAAA TTCTATATAT 14400 

AAATAAGATT GTAGAGTTTC ATTGTTGAGT AGTGTTTCTT GTAAGGATGA GGACTCAGAT 14460 

GACAAATCGA CTCCTTTTTC TTATGGATCG ATGTAGAGAT TTCATTGAAT GCAGATTGCA 14520 

GGAATCATCT TCAACTCATC AACGACCAAT GGTGACAAGG TGCATTTCAA TCCCACAGAA 14580 

AATGTTGATT TGAGAAATAA CTTTGCTAGT CTAGTAAAAT AAATACAAAA CAATCCTACA 14640 

AGATTTTTTC TGGGATTGTT TTTTGCTGAG TGGGATGCTT CAAGTTGTCT GGCTTGACTT 14700 

TCTTGAGGGA AGTTATATAA TAGTTGTAAT AATTAG 14736 

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11770 base pairs 
( B> TYPE: nucleic acid 

(C) STRAND EON ESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172; 

ACAGGAAAGC ACGATAGCAA TCTCTTTGGA AGATTTAAAA AATATTCCTC AAAGTTTCGC 60 

TGTTGCTTAC GGTGATACGA AAGTATCTTC GATTCTCTCT GTCTTGCGTG CTAATTTAGT 120 

AAATCATTTG ATTACAGACA AAAATACAAT TTTAAAAGTT TTGGAAGAAG ATGGGGATTT 180 

GACTTTTAGA GAGATTCTAG GTGAGTGAAA ATGATAGACT GATTCAGTTT ATCGTTTTTC 240 

TTTTTAGTTG ATTGCACATT TGTCCTTATA TAAACAAAAA TACTTTATCT GTTGTTTTTG 300 

GATTGACAAC TTTATTATGT ACTTGTATTC TATAGTTACA AAAGAAAATT TTAAAATTTC 360 

AAATGAAAAA AGCTTTTTAC ATAGTGAAAT GAGGAGGAAT TTATGGAAAT GATTGTTCCA 420 

GATCAAATTA TCATGGGTTT AATTTTATAT GCTGGTGATG CGAAACAACA TATTTATAAA 480 

GCGTTAGATT ACATAAAAAA TGGTACATGT GAACGGTGTG AAGAAGAAAT ACAGTTAGCT 540 

GATGCAGCCT TATTAGAAGC TCATAATCTA CAAACAAAAT TTTTGGCACA GGAAGCCTCT 600 

GGTACAAAGA CAGAAATTAC AGCTCTCTTT GTTCATTCAC AAGATCATCT CATGACCAGT 660 

ATGACGGAGA TTAATTTAAT CAAAGAAATT ATTAGTTTGA GAAAAGAACT TCATAAAAAA 720 

TAATACTAGA GTATTATCAT TGTTATTAAC ATAGAGGAGG AAAACATAAT GGTGAAGATT 780 

GGTTT G TTTT GTGCAGCAGG TTTTTCTACT GGTATGCTTG TAAATAATAT GAAAATTGCA 840 

GCGCAATCTA GTGGAGTTGA GGCAGAAATA GAGGCGTTTT CTCAGTCTAA ATTAGCGGAT 900 

TATGCGCCAA ATATAGATGT TGCACTATTG GGTCCACAAG TTGCTTATAC ATTAGATAAA 960 

TCAAAAGAAA TTTGTGATAA GTGTGATGTT CCGATAGCTG TTATTCCGAT GATGGACTAT 1020 
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GGTATGTTAG 


ATGGGAAAAA 


AGTATTAGAT 


TTGGCCCTAT 


CTTTGATTAG 


TGGGTAAGAA 


1080 


AAGGAGATTT 


ATTATGTCAA 


AGATGGATGT 


TCAGAAAATC 


ATTGCACCGA 


TGATGAAGTT 


1140 


TGTGAATATG 


CGTGGCATTA 


TAGCTCTAAA 


AGATGGGATG 


TTAGCAATTT 


TGCCATTGAC 


1200 


AGTAGTTGGT 


AGTTTGTTCT 


TGATTATGGG 


ACAATTGCCG 


TTCGAAGGAT 


TAAATAAGAG 


1260 


CATTGCTAGT 


GTTTTTGGAG 


CTAATTGGAC 


AGAGCCGTTT 


ATGCAAGTAT 


ATTCAGGAAC 


1320 


TTTTGCTATT 


ATGGGTCTAA 


TTTCTTGTTT 


TTCAATTGCC 


TATTCTTATG 


CTAAGAATAG 


1380 


CGGAGTAGAG 


GCTTTACCAG 


CTGGAGTTCT 


ATCTGTATCT 


GCATTCTTTA 


TTTTGCTAAG 


1440 


ATCATCTTAT 


ATCCCTAAAC 


AAGGTGAGGC 


GATTGGGGAC 


GCTATTAGTA 


AAGTTTGGTT 


1500 


TGGAGGCCAA 


GGAATTATCG 


GTGCTATCAT 


TATAGGTTTG 


GTAGTAGGAA 


GTATTTATAC 


1560 


CTTCTTTATA 


AAGAGAAAAA 


TTGTTATTAA 


GATGCCAGAA 


CAAGTTCCAC 


AAGCTATTGC 


1620 


CAAACAGTTT 


GAAGCAATGA 


TTCCAGCATT 


TGTAATTTTC 


TTATCTTCTA 


TGATTGTATA 


1630 


TATTTTAGCG 


AAGTCATTGA 


CTAATGGCGG 


AACATTCATA 


GAAATGATTT 


ATTCTGCTAT 


1740 


TCAAGTTCCG 


TTGCAAGGTT 


TAACTGGATC 


TTTGTATGGT 


GCTATTGGAA 


TTGCATTCTT 


1800 


TATATCATTT 


TTGTGGTGGT 


TTGGTGTTCA 


TGGGCAATCG 


GTAGTAAATG 


GAGTAGTGAC 


1860 


AGCTCTGCTT 


TTATCTAATC 


TTGATGCTAA 


TAAAGCTATG 


TTAGCCTCTG 


CTAATCTATC 


1920 


ATTAGAAAAT GGTGCACATA TTGTTACTCA ACAATTTTTA GATTCATTTT TAATTCTATC 


1980 


AGGTTCAGGG 


ATTACGTTTG 


GTCTTGTAGT 


TGCCATGCTT 


TTTGCAGCAA 


AATCAAAACA 


2040 


ATACCAAGCC 


TTAGGAAAAG 


TTGCAGCTTT 


TCCAGCAATA 


TTTAACGTAA 


ATGAGCCAGT 


2100 


TGTATTTGGA 


TTTCCGATTG 


TCATGAATCC 


AGTTATGTTT 


GTACCTTTCA 


TTCTTGTTCC 


2160 


TGTACTTGCA 


GCTGTGATAG 


TATATGGAGC 


TATTGCAACA 


GGTTTCATGC 


AGCCATTCTC 


2220 


AGGGGTAACA 


TTGCCTTGGA 


GTACACCAGC 


TATTTTATCA 


GGATTTTTGG 


TGGGTGGATG 


2280 


GCAAGGAGTT 


ATTACTCAGC 


TGGTGATATT 


AGCGATGTCT 


ACATTGGTTT 


ATTTTCCATT 




CTTTAAAGTA 


CAGGATCGTT 


TAGCTTACCA AAATGAAATC 


AAACAATCTT 


AGAGGTATTT 


2400 


GTGTGTTACT 


GTTAAACTCA 


CACATTTGTG 


CTAAAAATTA 


GAGAGTTAAA 


ATTTTTCTAG 


2460 


TTAAAAGCTT 


GAAAATTTCT 


ATAAAAATCG 


GT ATT AT ATT 


TTCGAAAGAA 


ATAAAAATAT 


2520 


TTTCGAAAGA 


AAGGTGCTTA 


CGATGGTAAA TACAGAAGTA 


GCAAGAACAA 


CAATCAAGAC 


2580 


AGAATATTTT 


GGCAGCCTTA 


CTGAAAGGAT 


GAACAAATAT 


CGAGAAGATG 


TTTTAAATAA 


2640 


AAAACCTTAT 


ATTGATGCTG 


AGAGAGCAGT 


TCTAGCAACA 


CGCGCCTATG 


AACCATACAA 


2700 


GGAACAACCT 


AATGTCCTAA 


AACGTGCATA 


TATGCTGAAA 


GAAATTTTGG 


AAAATATGAC 


2760 
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TATCTATATT GAAGAAGAAT CTATCATTGC GGGAAATCAA CCTTCTTCCA ATAAAGATGC 2820 

TCCTATTTTT CCGGAATATA CGCTAGAATT TGTTCTCAAT GAGTTCGATC TTTTTGAAAA 2880 

GCGTGATGGA GATGTTTTCT ATATTACAGA AGAAACAAAA GAACAACTTA GAAGTATTGC 2940 

TCCGTTTTGG GAAAATAATA ATTTACGTGC TAGACCTGGT GCCTTATTAC CTGAAGAAGT 3000 

GTCTGTTTAT ATGGAAACAG GATTCTTCGG TATGGAAGGT AAGATGAATT CTGGAGATGC 3060 

TCACTTAGCA GTTAACTATC AGAAACTTTT GCAATTTGGT TTAAGAGGTT TTGAAGAGCG 3120 

GGCTCGTAAA GCAAAAGTAG CTCTAGATTT AACAGATCCA GCAAGTATTG ATAAATATCA 3180 

TTTTTACGAC TCTATATTTA TCGTAATCGA TGCTATTAAA GTATATGCAA AGCGCTTTGT 3 240 

TGCTCTTCCT AAAAGTTTAG CCGAAAATGC AAATCCTAAA CGTAAGAAAG AATTACTTGA 3300 

GATTGCAGAT ATTTGCTCTA GAGTCCCATA TGAACCGGCA ACTACTTTTG CAGAAGCTAT 3360 

TCAATCAGTT TGGTTTATTC AATGTATTTT ACAAATTGAA TCTAATGGCC ACTCTCTTTC 3420 

ATATGGCCGT TTTGATCAAT ATATGTATCC ATATATGAAG GCTGATTTAG AAAGTGGTAA 3480 

AGAAACAGAA GATAGCATTG TTGAACGTCT GACAAATCTT TGGATTAAGA CAATTACAAT 3540 

TAATAAGGTT CGCAGTCAAT CACATACATT TTCTTCAGCA GGAAGTCCTT TATATCAAAA 3600 

TGTTACAATT GGTGGACAGA CTCGAGATAA GAAGGATGCT GTTAACCCAT TATCTTATTT 3660 

GGTATTAAAA TCAGTTGCAC AAACCCATCT ACCGCAACCT AATCTAACTG TACGTTACCA 3720 

TGCAGGTTTA GATGCTCGTT TCATGAATGA GTGTATTGAA GTGATGAAAC TTG G TTTT G G 3780 

TATGCCTGCA TTTAATAATG ATGAGATTAT TATTCCTTCT TTTATTGCAA AAGGAGTATT 3840 

GGAAGATGAT GCTTATGATT ACAGTGCCAT TGGATGTGTT GAAACGGCAG TTCCAGGGAA 3900 

ATGGGGCTAT CGTTGCACAG GTATGAGTTA TATGAACTTC CCTAAGGTTC TACTTATCAC 3960 

GATGAATGAT GGAATTGATC CGGCTTCGGG TAAACGGTTT GCACCAAGCT TTGGTCGTTT 4020 

TAAGGATATG AAGAACTTTT CTGAATTAGA AAATGCTTGG GATAAAACAC TAAGATATTT 4080 

GACACGAATG AGTGTTATTG TTGAAAATTC TATTGATTTA TCATTGGAAC GAGAAGTTCC 4140 

TGATATTCTA TGTTCAGCAT TGACTGATGA TTGTATTGGT CGTGGAAAAC ACCTTAAAGA 4200 

AGGTGGAGCA GTATATGATT ATATATCAGG ATTGCAAGTT GGAATTGCAA ATTTGTCGGA 4260 

TTCATTAGCT GCAATTAAAA AATTGGTGTT TGAGGAAGAA CGTATAAGCC CAAGTCAGCT 4320 

TTGGCATGCA CTGGAAACAG ATTATGCCGG AGAAGAAGGT AAGGTCATTC AAGAAATGTT 43 80 

GATTCATGAT GCACCTAAGT ATGGTAATGA TGATGATTAT GCTGACAAAT TGGTTACTGC 4440 

TGCTTATGAC ATTTATGTTG ATGAAATTGC TAAATATCCT AATACACGTT ATGGAAGAGG 4500 

GCCTATTGGA GGAATTCGTT ATTCAGGAAC ATCTTCTATC TCAGCCAACG TAGGGCAGGG 4560 
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ACGTGGAACA TTAGCAACTC CAGATGGACG CAACGCGGGT ACACCGTTAG CAGAGGGTTG 4620 

TTCACCATCA CATAATATGG ATCAACACGG CCCTACATCT GTTTTAAAAT CTGTTTCAAA 4 680 

ATTACCAACA GATGAAATCG TAGGTGGGGT TCTCTTAAAT CAGAAAGTAA ATCCTCAAAC 4740 

GTTACCCAAA GAAGAAGATA AATTAAAACT AATTCCTTTC TTACGAACAT TCTTTAATCG 4800 

TTTACATGGG TACCATATTC AATACAATGT TGTTTCCAGA GAGACGCTGA TTGACGCTCA 4660 

GAAACATCCT GAAAAACACA GAGACTTAAT^TGTTCGTGTT GCAGGATACT CTGCATTCTT 4~920~~ 

CAATGTTCTT TCTAAGGCAA CCCAAGATGA CATTATAGGA CGTACTGAGC ATACTTTGTA 4 9 80 

AAATAAAGAG GTTCTTTTTA TGGAATTTAT GCTTGACACA TTAAATTTAG ATGAGATTAA 5040 

AAAGTGGTCT GAAATTTTGC CGCTAGCTGG GGTAACTTCA AATCCCACTA TTGCAAAAAG 5100 

AGAGGGTTCT ATTAATTTTT TTGAACGAAT CAAAGATGTA AGAGAATTGA TTGGCTCTAC 5160 

ACCCTCTATT CATGTTCAGG TGATTTCTCA AGATTTTGAA GGCATCTTAA AGGATGCTCA S220 

TAAAATTCGA AGACAAGCAG GAGATGATAT ATTTATCAAA GTACCTGTTA CTCCAGCTGG 52 80 

ATTACGTGCA ATAAAGGCGC TAAAAAAAGA GGCCT ACCAT ATCACTGCAA CAGCTATTTA 5340 

TACAGTTATT CAGGGATTAT TAGCTATCGA AGCAGGAGCG GATTACCTAG CTCCATATTA 5400 

TAATAGAATG GAAAATCTGA ACATTGATTC AAATTCTGTC ATTCGTCAAT TAGCTCTTGC 54 60 

TATTGATAGA CAGAACTCTC CTAGTAAGAT TTTAGCTGCA TCCTTTAAAA ATGTAGCACA 5520 

ACTAAATAAT GCTTTAGCTG CAGGTGCGCA TGCTGTTACA GCAGGAGCGG ATGTTTTTGA 5580 

ATCAGCTTTC GCCATGCCAT CTATCCAAAA GGCGGTTGAT GATTTTTCTG ACGATTGGTT 5640 

TGTTATTCAA AATAGTCGTT CCATTTAGAT AGAGAGGAAA TACATATCAC AATTTTTGCT 5700 

AGTCCTTCTA GATATATTCA GGGGGAAAAT GCCTTGTTTG AAAATGCCAA ATCAATTTTG 5760 

GATTTGGGAA ATTGCCCTAT TCTATTATGC GATCAGTTGG TTTATGATAT TGTTGGAAAA 5820 

CGATTTGAAG ATTACCTACA TAGGTATGGT TTCCATATTG TTCTGGCGCT ATTTAATGGT 5880 

GAAGCTTCTG ACAATGAAAT CAATCGAGTT GTTGCCTTGG CTGAGAAAGA AAATTGTGAT 5940 

AGTATTATCG GTCTTGGTGG GGGAAAGACG ATTGATAGCG CAAAAGCTAT TGCAGATTTG 6000 

ATTGAAAAGC CTGTTATTAT TGCTCCAACA ATTGCATCGA CCGACGCACC TGTATCTGCT 6060 

TTATCTGTTA TTTATACAGA TGAAGGTGCA TTTGATCATT ATCTATTTTA TTCTAAAAAT 6120 

CCAGATTTAG TTTTGGTTGA TACAAAAGTT ATTTCACAAG CCCCTAAGCG TTTATTAGCG 6180 

TCTGGTATTG CAGATGGTTT AGCAACTTGG GTTGAGGCGC GTGCGGTTAT GCAGGCAAAT 6240 

GGAAAAACTA TGTTCGGACA ACAGCAAACA TTGGCTGGAG TTGCAATTGC GAAGAAATGT 6300 
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GAAGAAACGC TCTTTGCACA TGCTTTACAG GCTATGGCAG CTTGTGAAGC TAAAGTGGTG 63 60 

ACACCAGCAT TAGAAAATAT TGTTGAAGCT AATACTTTAT TGAGTGGTCT AGGTTTTGAA 6420 

AGTGGAGGAT TAGCTGCGGC GCATGCAATT CATAATGGTT TTACTGCATT GACAGGTGAC 6480 

ATTCATCATT TAACACATGG TGAAAAAGTA GCTTATGGAA CTTTAGTACA ACTATTATTG 6540 

GAAAATAGAC CTAAAGAAGA ACTTGATAAG TATATTGAGT TTTACAAAAA AATTGGTATG 6600 

CCAACAACTC TAAAAGAAAT GCATTTGGAT CAAGTTGGAT ATGATGATTT AATAAAAGTT 6660 

GGTAAACAAG CAACTATGGA GGGTGAGACA ATTCATCAGA TCCCGTTTAA GATTTCCCCT 6720 

TCAGATGTTG CTCAAGCTAT TATCGCTGTA GATGCCTATG TAAATTCAAA ATAAACAATA 6780 

AGGACTACTG TTTTCCAAAT GGTAGTCTTT TATTGATCCC TGTATTGAAT TCTATACAAG 6840 

ATTGAAATAG GATGAGAACA AATCGATTGG GAAAGTAAAA TTAATTTCTA TAAATGTTTT 6900 

AGCAATTGTT TCGTACTATT TCAGATTCAG TCTACTATAT GTTCTTCATA AATCAAAAAG 6960 

CGACATAGGT TGTCGGCTAT TTATTGTGAA TACATTAATT AGCATTCCAG TTTTATCTTC 7020 

GGTCTAAAAT AAGTATTTTG TGCTATACGA GATAAGCTTC TTGACTTACT CCTTGATTTA 7080 

CTGCATAACA ATGGGATAAA AAGTGGGAGA TAGAGCAATT CATAGTCATC AAAATTAATC 7140 

AGATACAGTA TACAGTTTTT CCTTTAAACA CATTTCAAAT TCCCTCAAAA ATGGTATAAT 7200 

AGTAACATCA CAAAATTGGA GAGAGACCAT GAGTTTTTAC AATCATAAAG AAATTGAGCC 7260 

TAAGTGGCAG GGCTACTGGG CAGAACATCA TACATTTAAC ACAGGAACAG ATACATCAAA 7320 

ACCTAAGTTT TATGCGCTTG ATATGTTCCC TTATCCGTCT GGAGCTGGTC TGCACGTAGG 7380 

ACACCCAGAA GGTTATACTG CAACCGATAT CCTCAGTCGT TACAAACGTG CGCAAGGCTA 7440 

CAATGTCCTT CACCCAATGG GTTGGGATGC TTTTGGTTTG CCTGCAGAGC AATACGCTAT 7 500 

GGATACTGGT AATGACCCAG CAGAATTTAC AGCGGAAAAC ATTGCCAACT TCAAACGTCA 7 560 

AATTAATGCG CTTGGATTTT CTTATGACTG GGATCGTGAA GTCAACACAA CAGATCCAAA 7620 

CTACTACAAG TGGACTCAAT GGATTTTCAC CAAGCTTTAC GAAAAAGGCT TGGCCTATC A 7 680 

AGCTGAAGTG CCAGTAAACT GGGTTGAGGA ATTGGGAACT GCCATTGCCA ATGAAGAAGT 7740 

GCTTCCTGAC GGAACTTCTG AGCGTGGAGG CTATCCACTT GTCCGCAAAC CAATGCGCCA 7800 

ATGGATGCTC AAAATCACGG CTTACGCAGA GCGCTTGCTC AATGACTTAG ATGAACTAGA 7860 

TTGGTCAGAG TCTATCAAGG ATATGCAACG CAACTGGATT GGTAAATCAA CTGGTGCCAA 7920 

TGTAACTTTC AAAGTAAAAG GAACAGACAA GGAATTTACA GTCTTTACTA CTCGTCCGGA 7980 

CACACTTTTC GGTGCGACTT TCACTGTCTT GGCTCCTGAA CATGAATTAG TAGACGCTAT 8040 

CACAAGTTCA GAGCAAGCAG AAGCTGTAGC AGACTATAAA CACCAAGCCA GCCTTAAGTC 8100 
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TGACTTGCCT CGTACAGACC TTGCTAAAGA AAAAACAGGG GTTTGGACTG GTGCTTATGC 8160 

CATCAACCCT GTCAATGGTA AGGAAATGCC AATCTGGATT GCAGACTATC TCCTTGCTAG 9220 

TTATGGAACA GGTGCGGTTA TGGCTGTGCC TGCCCACGAC CAACGTGACT GGGAATTTGC 8280 

CAAACAATTT GACCTTCCAA TCGTCGAAGT ACTTGAAGGT GGAAATGTCG AAGAAGCTGC 8340 

CTACACAGAG GATGGCCTGC ATGTCAATTC AGACTTCCTA GATGGATTGA ACAAAGAAGA 8400 

CGCTATTGCC AAGATTGTGG CTTGGTTGGA AGAAAAAGGC TGTGGTCAGG AGAAGGTTAC 8460 

CTACCGTCTC CGCGACTGGC TCTTTAGCCG TCAACGTTAC TGGGGTGAGC CAATTCCAAT 8520 

CATTCATTGG GAAGATGGAA CTTCAACAGC TGTTCCTGAA ACTGAATTGC CGCTTGTCTT 8580 

GCCTGTAACC AAGGATATCC GTCCTTCAGG TACTGGTGAA AGTCCACTAG CTAACTTGAC 8640 

AGATTGGCTT GAAGTGACTC GTGAAGATGG TGTCAAAGGT CGTCGTGAAA CCAACACTAT 8700 

GCCACAATGG GCTGGTTCAA GCTGGTACTA CCTCCGCTAT ATTGACCCGC ACAATACTGA 8760 

GAAATTGGCT GATGAGGACC TCCTCAAACA ATGGTTGCCA GTAGATATCT ACGTGGGTGG 8820 

TGCGGAACAT GCTGTACTTC ACTTGCTTTA TGCTCGTTTC TGGCATAAAT TCCTCTATGA 8880 

CCTCGGTGTT GTTCCGACTA AGGAACCATT CCAAAAACTC TTTAACCAAG GCATGATTTT 8940 

GGGAACAAGC TACCGTGACC ACCGTGGTGC TCTTGTGGCA ACCGACAAGG TTGAAAAACG 9000 

TGATGGTTCC TTCTTCCATG TAGAAACAGG GGAAGAGTTG GAGCAACCGC CAGCCAAGAT 9060 

GTCTAAATCG CTCAAGAACG TTGTTAACCC AGACGATGTG GTGGAACAAT ACGGTGCCGA 9X20 

TACCCTTCGT GTTTATGAAA TGTTTATGGG ACCACTCGAT GCTTCGATTG CTTCCTCAGA 9180 

AGAAGGTTTG GAAGGAAGCC GTAAGTTCCT TGACCGAGTT TACCGTTTGA TTACAAGTAA 9240 

AGAAATCCTT GCGGAAAACA ATGGTGCTCT TGACAAGGTT TACAACGAAA CAGTCAAAGC 9300 

TGTTACTGAG CAAATTGAGT CTCTCAAATT CAACACAGCT ATTGCCCAAC TTATGGTCTT 9360 

TGTCAATGCT GCTAACAAGG AAGATAAGCT TTATGTTGAC TATGCCAAAG GCTTTATTCA 9420 

ATTGATTGCA CCATTTGCAC CTCACTTGGC AGAAGAACTC TGGCAAACAG TCGCAGAAAC 9480 

AGGTGAGTCA ATCTCTTATG TAGCTTGGCC AACTTGGGAC GAAAGCAAAT TGGTTGAAGA 9540 

TGAAATTGAA ATTGTCGTCC AAATCAAAGG AAAAGTTCGT CCCAAACTCA TGGTTGCTAA 9600 

AGATCTATCA CGTGAAGAAT TACAAGAAAT CGCTTTAGCT GATGAAAAAG TCAAAGCAGA 9660 

AATTGACGGT AAGGAAATCG TGAAAGTAAT TGCGGTACCG AATAAACTCG TTAATATCGT 9720 

CGTTAAATAA CGAGTTTATT AGCTCTATCT GCCACCTTCA ATAGTCCACT GGACTATTGA 9780 

AsCCAACTAA ATTAGTTAAC ATTGTTGTGA AATAAGATAG GAGTCCTTCA GAGTAGAATC 9840 
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TGGAGGATTT 


TTTGAATCTT 


CTTATGAAAG 


TATGATATAC 


TATGGGCAAC 


TATAAAGTTT 


9900 


GAAAAGTCAA 


ATAAGGAGAA 


TAAGATGCCA 


GTAAATGAAT 


ATGGTCAAAT 


GATTGGGGAG 


9960 


TCAATGGAAG 


CTTATACTCC 


AGGTGAATTG 


CCTTCTTTTG 


ATTTCTTAGA 


AGGGCGTTAT 


10020 


GCTAGGATAG 


AGGCTCTTTC 


AGTGGAAAAG 


CATGCGGAGG 


ATTTATTAGC 


TGTTTATGGC 


10080 


CCTGATACGC 


CTCGGGAGAT 


GTGGACCTAC 


CTCTTTCAGG 


AGTCAGTAGC 


AGACATGGAG 


10140 


GAACTGGTCA 


GCCTTTTAAA 


TCAGATGTTG 


GCTCGTAAGG 


ACCGTTTTTA 


TTATGCAATC 


10200 


ATAGACAAGG 


CAACTGGTAA 


GGCTTTGGGA 


ACTTTTTCCC 


TCATGCGAAT 


TGATCAGAAT 


10260 


AACCGAGTAA 


TAGAACTGGG 


AGCTGTCACT 


TTTTCTCCAG 


AGCTCAGGGG 


GACACGGATA 


10320 


GGAACAGAAG 


CCCAGTATCT 


CTTGGCTTGC 


TATGTCTTTG 


AGGAGCTTAA 


CTATCGTCGC 


10380 


TATGAGTGGA 


AATGCGATGC 


TCTTAACCTG 


CCATCCAGAC 


GAGCAGCGGA 


ACGTTTGGGA 


10440 


TTTATTTATG 


AAGGAACCTT 


CCGTCAGGCA 


GTGGTTTATA 


AGGGGCGTAC 


AAGAGATACG 


10500 


GATTCCTTGT CTATGATTCA TAAGGACTGG CCTCAAGTCA AAGCTCGATT GGAAATATGG 


10560 


TTGCGTCCTG 


AAAACTTTGA 


TAAAAATGGA 


CGACAGCACA 


AG AGCTTGAG 


AGAACTTTAA 


10620 


GAGGTGTTGA 


GATGATTACT 


ATTAAAAAGC 


AAGAAATTGT 


CAAGCTAGAG 


GATGTTTTGC 


10680 


ATCTCTATCA 


GGCTGTCGGT 


TGGACAAACT 


ATACCCATCA 


AACAGAGATG 


CTGGAGCAGG 


10740 


CCTTATCTCA 


TTCATTAGTA 


ATTTATCTGG 


CACTTGATGG 


TGATGCTGTG 


GTGGGCTTGA 


10800 


TTCGTTTGCT 


TGGAGATGGT 


TTTTCATCAG 


rmTGTACA 


GGATTTGATT 


GTTTTGCCTA 


10860 


GCTATCAGCG 


TCAAGGGATT 


GGTAGCTCCT 


TGATGAAAGA 


GGCTTTAGGA 


AATTTTAAAG 


10920 


AGGCCTATCA 


AGTCCAGCTG 


GCGACAGAAG 


AGACAGAAAA 


AAACGTGGGA 


TTTTATCGTT 


10980 


CTATGGGCTT 


TGAAATCTTA 


TCCACCTATG 


ACTGTACAGG 


AATGATTTGG 


ATAAACAGAG 


11040 


AAAAATAAAA 


AAACTTcrrr 


GTTCTTAAGC 


AAAGTTTAAG 


GATGGTCTAG 


TATCATATAG 


11100 


TCATTAAATA 


AAGACCTCCT 


AACTTTATTT 


AATAAAATCC 


TAAACTTTTT 


TCATCACAAT 


11160 


CTCCTAATGA 


AGCCACCCAA 


TCAGGTGGCT 


TTTTTGCGGT 


ACGACGGGCA 


TGTCCTATAT 


11220 


CTGAGGTGTA 


AGTCCTCAGC 


CTGACTATCG 


TGAGGTAGCA 


GGGAGAGGAA 


GGGATAGCGA 


11280 


AATCGTGGCT CTACGAACAG GAACGTGATA GTAAGGCGTA TATAGCGGAT 


AAGGAGGCTT 


11340 


CAAACTCTAA 


AGTCCAAAAA 


GGTAGTCGTA 


ACCTATATGT 


GTAAATCACG 


AGAGTAATTG 


11400 


AATTCGGACT 


AAGGTTTGTG 


TGAAAAAGAT 


AAATCTTTCT 


AGAGTCTAAA 


GACTCTGCGT 


11460 


CAGATTTCCT 


ATTTTCACTG 


TAACCTTTTA 


ACGTCCTCAT 


ATCTTGTATA 


AACGAGGAAA 


11520 


GATGTACGAC 


TTATCCCGTG 


AGGTTTCATG 


AGCGCTGAAA 


, GCGTAGTAAC 


AACGAATCAT 


11580 


GAGAAGTCAG 


CCGAGCCCAT 


AGTAGTGAGG 


; AAACTTCCGT 


AATGGAAGTG 


GAGCGAAGGG 


11640 
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GTGAATACTC AAACAGTCTG GGGAGAGACT GTTTGAGGTC TGTCGCTAGA AAGAGAAAAC 11700 

GACAGATCGA AGTAATCCTA CTTCACTTGT GTCTGTAAAA TGAGTGGTCT GATAGAACTG 11760 

GACTTTGAGG 11770 
(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS : 

(A ) LENGTH: 4185 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
(Di TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

CGCGAAACTA CTTTCTTAGT ATAACACTTT CAGAATCATT GTCAATAGAA ATGACTTGAT 60 

TTTTTCAATT TTTTCAAGCT ATTTCCAAGG GTTGTAAAAT CGTCCCTGAT TCTGCAAGAT 120 

AAG TACT AAA CTAACTACTA AAAACAAGGT TGCCAAGAGC AAGGTAATAT AGTCTCCTTT 180 

TTTCAAGGCC TGATAACTAT ACCATGTGCG TTTTTTCTCT TTCCCAAAGC GGCGAACTCC 240 

ATGGCAGTCG CAATGGTATC AATGCGTTCT AGCGAGCTAA AAATCAAGCG CGTAATAATG 300 

AGCAGATTGC CTTTGATTCG TTGCATAAGA GAAGCTTTCT TGGATAATTC CATCCCACGC 360 

GCCTCCTGAG ACATCTTGAT AGTAAAGAAT TCTTCCTGCA AATCTGGAAT ATAGCGCAAG 420 

GTCACGCTGA CAGAATAAGC AATCTTATAG GGCACACCAA TTTGATTTAA ACTGGAAGCA 4 80 

AACTGACTAG GATGGGTTGT CATCAAAAAG ATAATAGCCA GAGGAATGGT GCAAAGATAC 540 

TTAATGGCCA AATTTAGCAG ATAAAAGAGC TCCTGGCTGG TTAGAGTGTA GACACCGATT 600 

CCCTGCCAAA TCACACTTCT CTCTCCATAA AGTCCAACCC CATACTCCGC AGAAAAGAGA 660 

TAGACCATCA AAACGTTTAA AACGGCAAAT ATCGTCGCAA AAACGGCTAC AAAGGAAACA 720 

TCTTTAAAGC GAATTTCTGA TAAATAGAGG AGAAAGACTG AAAAGATGGC AATCAGCAAC 780 

AGCATTCTGG TATCATAGCT AATCATGGCC GCCAATGATA CCAGAATGAA AAAGAGAAGT 840 

TTCCCAGCTC CTGACAAGCG ATGAATCACA GTATCTCTAT GCTGGTAACC CATTAATTTA 900 

GCTTGCATCC CTCTCTCCTT TCTTTGTAAA ATGCCGTTAA ATCCAGTGGA TCCACATCTA 960 

G TTTCTTAGC CAAGTTAAAG ATGGAGGTTT CTTTTAGATT GGCTTTTACT AACAGCTCAG 1020 

GATCGCTCAA CAGACTGGCT GGAACAGTAT CGGCAATCAA TTCTCCATCC ACCATGACAA 1080 

GGACCCGGTC TGAATAATCC AGCATCAATT GCATATCATG GGTAATCATG ACAATGGTAT 1140 

GCCCTTTTTG ATGTAACTCT TCGAGAAATT CCATAATCTC AGTATAGTTC TTCTGATCTT 1200 
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GACCTGCAGT CGGTTCATCT AGGAGAATAA TTTCAGCTCC TAAGACCAAA ATTGAAGCAA 1260 

TGGTGACACG TTTTTTCTGA CCAAATGACA GGGCAGAAAT AGGCCAATTA CGGAATTCAT 1320 

AAAGTCCACA GATTTTCAAG GTTTCATATA CTCTCGTTTC AATTTCCTTC TCATCCACAC 1380 

CTCGCAAACG GAGCCCTAGA GCCACCTCAT CAAAAATCAT ATTGGTTGAA ATCATTTGAT 1440 

TAGGATTTTG TAGCACATAT CCTACTCGTT CCGCCCGCTC TGCAACAGAA TCGCCTTTTA 1500 

TATCCTGTTT TTCCCAAAGA TAGCGTCCTT CCGTCTGAAT AAAGCTACTT ATAGCCTTGG 1560 

CTAGAGTTGA TTTCCCTGCT CCATTTTTTC CGACAATAGC AATCTTTTCA CCCTTTTTAA 1620 

TATCTAAATG TAGGGATTTT AAAATCGGTC TATCATCATA AGAAAAAGAT ACTTCCTCTA 1680 

GTCTAAAGAG TGACTGCAAT GCTGGGGTTT CTTTTGCCAG TTCATTCTGC AACTGAACCT 1740 

GACCTTTTGA GATAGACAAG TTATCCAGAT TCGCTAATTG TTCTTCCTTG ACTAAGTCCA 1800 

CACCTAATTG ACGGAGAGTC CTTAGATAAA GGGGTTCTCG AATTCCATTT TGAGTCAATA 1860 

AATCAGTCGC AAGCAACTGG TCAGGGCTCC CATTAAAAAG GATACGACCA TCGTTTATCA 1920 

AGACAATCCG ATCCACAGGG CGATGCAGAA CGTCCTCCAA ACGGTGCTCG ATAATAAGAG 1980 

TCCTCCTCCC CTCTTCCTTA TGAATCTGGT CAATCAATTC GATAATATCC TGACCTGAC? 2040 

TGGGATCTAG ATTGGCGAGT GGCTCATCAA ACAAGAGAAT CGGACTTTCA TCAATCAAGA 2100 

CACCAGCCAG ACTCACTCGC TGCTTTTGTC CACCTGACAA ATCCTGAGGA CGCTGATCCA 2160 

GTAAACGAAG AAGGTCCAGC TTTTCAGCCC ATTTATAAAC ACGACCTTTC ATCTCATCTA 2220 

GGGCTGTCAC ATCATTTTCC AGAGCAAACG CCAAATCTTC TGCCACAGAC AAGCCAATAA 2280 

ACTCCCCATC TGTATCCTGC AAAACTGTGC TAACCAGATG AGACTTATCA TAGATGCTCA 2340 

TATCAAAGGC TACTTGACCC "TTATCAAAA ATTCTCCATA TGTCTGACCC TTGTAAATAT 2400 

TGGGAATAAT CCCATTCAAA CACTGACCCA AGGTAGATTT ACCTGACCCA GATGGTCCAA 2460 

CAATTAAGAC TTTCTCTCCC TTGTAAATGG TCAAGTCTAT CCCTTGCAAG GTCGGTTCTT 2520 

GTTGTGTTTC ATACCGGAAA GAGAAATCCT TCCACTCAAT TaTAGCTTCT TTCATCTI AC 2580 

TCTCTTCATT CGCTTCTTAG ACTTCTATTT TATCATAAAT CAACCCCTTC TTGCAGTCTC 2640 

TCCTCTTAAA ATCTTACCGC CAAAAAGATT CCTATCCTAG CTTACTTGCC TAACTAATCT 2700 

ATAAACATCG AAAAAGACTA GTTGCCCAGC CTTCCCCATC ATTTTATACT CTTCGAAAAT 2760 

CTCTTCAAAC CACGTCAGcT TCGCCTTGCC GTAGGTATGG TTACTGACTt CGTCAGTTTC 2820 

ATCTACAACC TCAAAACCAT GTTTTGAGCc TGCTTCGTCA GTTCTATCCA CAATCTCAAA 2880 

ACACTGTTTT GAGCAACtGC GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAT 2940 

TAGTCCTTTT TCAAACTTCC TGCACGAGTT TGGGTTCCTG CATAGGCAAG TAAGAGAAGA 3000 
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GTTCCTGCAA 


TAGCTACAGA 


TACACCATTG 


GCAATTCCCG 


CAACAATCCC 


TTGTGCAAAT 


3060 


ACTTTTTCTG 


CCGCTTCTTG 


ATAAATCACA 


ACATCTCCAA 


GTGGTCCCAA 


GACACCCCAA 


3120 


ACAAGGGCAT 


TTGCAAGTAC 


TTGAATGAGA 


TTAAAAATAA 


GAATATCTTT 


CCAGTCAAAA 


3180 


ACACCATTGA 


TCACGCGAAC 


GTACTTTCTA 


AAAAGTCCCA 


CAACTAAACC 


AAAGAGTCCG 


3240 


CTAGCGATAA 


TCCAAGTCCA 


CCATAGACCA 


TAACCAACAA 


GAGAGTCCTT 


GATTGCATGA 


3300 


CCAATCAACC 


CGACAAGCAA 


ACCGATAATC 


GGTCCAAAAA 


TAATAGAAAG 


TAGCGCTTGT 


3 3 60" 


ACCGCATACT 


GAAGCTGGAT 


GCTTGTATTT 


GG AACAGGGG 


TTGGAATGTT 


GATCATCCCG 


3420 


ATGACGACAA 


AGAGGGCAGC 


GCCAATTCCG 


ACAGCAACAA 


CTTGTTTAAT 


TGTAAATTTG 


3480 


ATTTCCATAC 


TATTCTCCTA 


TTTTATCCTT 


CTATTTTCTT 


TATTTCAATG 


GTCCAAGATG 


3540 


AACCGACACC 


T AC ATT AT AG 


GCCTTGGCAA 


AGGAACCTTG 


GTTGATAGCC 


AAACCTAAAC 


3600 


GATAGAGAGA 


GTTGATGTAA 


AGGATGGGTT 


GCCCAATTCT 


CACATCTGCA 


AATGATTTGC 


3660 


CATAGACAAC 


CTGATTTTGA 


TAGACCAGCA 


TATCAGCATG 


ATAGATGGTC 


ACTTCAAAAC 


3720 


GATCACCAAA 






ATTCTTCCCC 


TGTGATAGAG 


GTCCAAAGCG 


3780 


AACCGAAACG 


CACATCCAGA 


ATATCAATGG 


CTCCCTTCAC 


CAGATGATCT 


TCTATGATGG 


3840 


TCCCTACGAC 


TGGAAGCTCT 


ACAATCTGTT 


CCACACTGAG 


CTCTGGCCCT 


ACTTCCTCAA 


3900 


AAGTAATGTG 


ACCACTGGCC 


AGTTTAGCAC 


CAGTATAGGC 


ATAGACATCA 


CGACCGTGGA 


3960 


AGGTATAAGA 


ATGCTCTGTG 


TTTTCACGCC 


TATTGGCCAC 


CTCAGAAATC 


TCACGAATGG 


4020 


CTACAATGCC 


AACGTGTTTC 


TTGATAAAGG 


AAAGCGTCCC 


ATTATCTGGC 


GTGACAATGT 


4080 


ATTGATTTTT 


TGCAGTCTTG 


GCAACTACAC 


TCTTACGTTT 


CGAACCGACA 


CCTGGATCGA 


4140 


CAACCCATAC 


AAACGTCGTT 


CCCTCAGGCC 


AGTAATCCAC 


CCTCT 




4185 



(2) INFORMATION FOR SEQ ID NO: 174: 

<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2069 base pairs 
{ S) TYPE: nucleic acid 
(CJ STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 
TGATAGAGTT AAAGCCGCTG AGTCATTCAA TCCATCTCCA ACCATCAAAA TAGTGTGACC 
TGCTTTCTGC AGTTTCTCTA CTAACTCAAA TTTCCCATCA GGTTTCAAGT CTGTATAGAC 
CTGATCAAAG GGCAAATCTT TGACTAATTC CTCTGTCCTA ATCAAGGTGT CTCCTGTTGC 
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CAGAATCAAT TTTTyCCCCT GTGCCTTAAG TTTATCCAAG GCTGTTTTTG CTTCTTTTCT 240 

CAAAGGAGTA TGAATGCAGA ACATTCCAAT CAATTCATTT TGATAAGCCA AGAATAAGAG 300 

ATTGTAGTGA CTCTTGTACT CTTCAATTAA AGCATTTTGT TCTGAACTGA TATGAATCTG 360 

CTCATCCTGC ATCAAGACAT AATTCCCAAT AAGAACTGGT TGGCCATCTA TATGAGATTT 420 

GATCCCCTTG CTTGCGATAT ATTGGAGTTT CCCATGCATT TCCTCATGTT CAATTCCCTC 480 

TATCTCAGCT TGCTTGACGA TGGCATTAGC AATAGGATGA TAAATGTGTT CCTCAAGACA 540 

GGCACTGATT CTGAGAATAT CTTCCTCACT ATAGTCTCCA AAAGGTAACA CCTTTTCAAC 600 

TATAGGATAA CTAGTTGTGA TTGTTCCTGT CTTATCAAAC AAGAAAGTAT CAACTTCCAG 660 

ATATTTCTCC AGAACATCTC CATCCTTAAT CACCATTTCA CGGTTCAACC CTTCCTTGAT 720 

AACTGTCAAA TAAGCTACAG GAGTAGAGAT TTTCAAACCG CAGGAGAAAT CGACCAATAG 780 

GAAAGAAATA GCCTTAGAAA AAGAACCTGT CAATAGGTAA GTCAGCCCAG CCCCCAAGAA 840 

ATTATATTTG ACGACTTTAT CCGCCATCTT GATGAAATAG CGTTGTTTCG TTTTCTTGTT 900 

TTCTTCAGAT TTCTTCATCA ACTCAATCAG CTGTAAAATA CGGCTGTTCA TCTGATTATC 960 

TGTTACACGA ATGCGTAACT CTCCAGTTTC TAATACTGTA TTTGCACAAA CCAAATCAGA 1020 

CTCTCTTTTT TCAACTGGAA AACTCTCTCC TGTCAAGGAA CTTTCGTTGA CCATACCTAA 1080 

ACCTGAAACT ACTTGTCCAT CAAACAGAAT TTCATTTCCT TGAGATAAGA TCAAGACATC 1140 

TCCTATTTGA ACATCGGAAC TCTTGATACT AACAACCGTA TCGCCCTGTA CTAGGAATAC 1200 

ATCGCTCTCT TTTGCAAGAA GACTCTGTTC TAAATCTGTT GCAGTTTTTT TCAAGGACCA 1260 

CTGATCTAAA TGATTCCCCA AATCAAGCAT AAACATGATA TTGCTAGCTG TCTTGGATTG 1320 

GTTCATAAAC AAAGACAATA AAATAGCCGA ACAGTCCAAG ACTTCCATCG TTAGTyCCTT 1380 

ACGCGCTAGT GTTTGATAGG CTTCTCTAAT ATAACCCAAA GCCTGATAAC AAGTCCATAT 1440 

ATAGCGAATA GGATACGGCA CAAAACTACG AAAAAGTACA CGCTTAACCG CTOCACCTGA 1S00 

AACAATAGAA TAAGCACTCT CTTCTCTACG AATGGGAAGA GTCATCAACT CAGAAACTTT 1560 

CCCTTTATCA ATTCTTTTTA AAAAGGCTTC TGCATTATCT AATACAGAAA AGCCTTCTTT 

TATGCGTAGA GTAAAGTGCT GTTGATCCAT GTAAAACTGG ATAGACTCAA TCCCCTTTTC 

ATCTCTCGCC AAGGAACGAA GATAGTCTTG AATATCCAAG GTAAGTGAAA AAGAAGATGA 

TAGTCGGATA TGTTGGTATC CTCTATGTAG CACTTTAAAA GACATATTAT TCACCTATAA 

GGCTATCTAA TTGCTCTTCT TTTTTCTCTT GCTCGTACAA ATATTTGGCA TCTTGCAAGA I860 

CATCGTCTCC ATGTTGCTTC ACAACAGAAA CAGATGCATC TAGCTCGTCT TTCAACTTGT 1920 

AAGCCTTAGC CAAAGCTTTA GAATAACCTT TTTTAGCTTC CTTACTTGCT AAGATTTTCA 1980 



1620 
1680 
1740 
1800 



WO 98/18931 



PCT/US97/19588 



1105 

AACCAAGGGT ACCAAATGCG ACACCACCCA AAAATAATGA AGATTTTTTC GCAACTTTTG 
CAACGGTTAA TACTTCTTTT AACATAGGG 
(2) INFORMATION FOR SEQ ID NO: 175: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4597 base pairs 

(B) TYPE: nucleic acid 

(CI STRANDEDNESS :^ .double _ 

{ D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 
AAATCTTGCG CAATAAAGCT CATCTCCATC TCCCGATTGA AACAGTCACT CCCCCGACTG 
TTTCAACGTC CCAAGACATA ATCTTAGGCA GATTTCTAAA ATTACACTCA AAGTGGAAGT 
CATTGACCTT TCGAATGACA GTTGAAGTTG AAATGGCCAG CTGATGGCCA ATATCGGTCA 180 
TAGAAATCTT TTCAATTAAC TTTTGCGCAA TCTTTTGGTT GATAATACGA GGAATTTGGT 
GATTTTTCTT GACGATAGAA GTTTCAGCGA CCATCATTTT CAAGCAATGA TAGCACTTAA 
AACGACCTTT TCTAAGGAGA ATTCTAGTAG GCATACCAGT CGTTTCAAGG TAAGGAATTT 
TATAGGGTCT TTAATGTCTA GTAATTTTGT GATAAAATGT AATTGTTCCA TATGATTCTT 
TCTAATGAGT TGTTTTGTCG CTTTTCATTA TAGATCTTAT GGGACTTTTT TTCTACCCAA 
AATAGGCTCC ATAATATCCA TAGGGAATTT ACCCACTACA AATATTATAG AGCCCAAAGT 
TTTAGGTCGC TTCATAATAT GCGTTTTTTG AATTTTATAG ACTGCTCGTT TAAACTCTAT 
TTACTTCGTA CCTTCTGGAG CGAGACGGAA TATTAGTCAC ATACAAAATG ACT ACT ATT A 
GGATTTTATT TTCATGTACA ATTTCAGCCA GTCTTGTTAT AATCAGCCTA TAGGAATCAA 
GGAGGTGACT CTTATGGCTG TTTTTGTGTC TTTGGATGGA ATTGTGGTAG AAGTCCTTGA 
TGTCTTTTCT TCTTTTAATG GGGATAGTGA GTTTTTCTTG TGTATAGCAT TTTGAATCTG 
GAATAGGACG CCATGACTGC TAAAAGATTT CTATAAATTA ATTTGATTTT CCTAATCAAT 
TTGTTCATAT CTTATTTCAT TCCACTATAA ACGTCTTAAA GACAAGAGTC AGTTTGTTAT 
GGAACGCTCT CAGTTCGAGG AGATGTTCCA ACTTCAAAGT AGTCGCTTGA CGACGCAAGA 
AAAATTACAA TTGTTTACCT CTGTGTTTGC TGGCCGTTAT GATGTTTATG CTAAGAATTT 1080 
TATCAATGAA CAAGGGAAAA TTCAGTATTT TCCTTCCTAT GATTATGGTT GGAAGCAGTT 
GCCACCTGAA AAACGGAGTT TCCAGACATT GACGAACTCC GTTTTGAAAT CTCATTTTCG 
TGGGGAGGCA GCTATCGGTA TCTTTCCTAT GCACTTAGAT GATAGCTGTT ATTTTTTGGT 



60 
120 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



1140 
1200 
1260 



WO 98/18931 



PCT/US97/19588 



ACTGGATTTC 
GGAACGCCAG 
GTTCTTCTTT 
AGAACTGGCA 
AAATCAGGAT 
AGAAGCTTAC 
CCAATGGACG 
CCAAGAGGAG 
GATCCAACTT 
TCCTTTTCTA 
AAAACATTGT 
TTACTATATT 
TTATTTCGAG 
GATAAATTTA 
GAATTTAAGG 
GAAAATGGTT 
ATCTCTGAAC 
TGGCTAGATC 
GCATCAGGTC 
CTCAGTAAAC 
AGTCTTTTGG 
ATGTTTGAAA 
CCTGAGCGTA 
ACTGCTGATA 
GGTCATTTGG 
GCTACTGACT 
GGACGGAATA 
TTGAAAGAGA 
AGAACGAGTT 
GGAAAATACA 



GATGAAGGAG 
ATGGAAGCCC 
GAGGAAGCGA 
ATGCAGGAAA 
GTCCTTCCTA 
CATCAAGGGC 
TATCTACAAG 
TTAGGCAAGC 
GAAAAATCGT 
ATCCCGAATT 
TAGAAATCGA 
TAAAGCAGGC 
AATCCGATCA 
AGCAGGTATC 
GAGAACTCAC 
TACTTCATGC 
GGAAAACAAA 
GCTTAAACTG 
GTGAAAAGGT 
TGCTTGATGT 
ATGAGTATGA 
AAGTTGTTGC 
AGAATGGTCA 
AGAGGGAAAC 
AAATTGAAAA 
CAGTGAGGAA 
TCTTGGTTTT 
AAGAGGTTGA 
TACTGGAGAC 
TTGGCGAAGG 



ATTGGAAAGA 
ATTTAGAGAT 
TTCCGAGTCG 
GTATGCAACT 
AGGGGGGATT 
GAACGGTCTT 
AAATTCAGAG 
AAGAATTGGA 
CTGTGACATC 
TTATAGTAGA 
TTTGACTTTC 
TATGCGACAG 
TTATTTATGG 
TGTGGAAGAT 
TTTTGAGCAA 
GGAGACTGGT 
AACAATTATT 
CTTTTTGACT 
AATCGGCTAT 
CGTTATGATT 
GATGATGATT 
TCAGTTTAGA 
TGAGCCTATT 
GGATTTTAAA 
GACCAAAGCA 
TCAGATGATT 
AGTTAATCGA 
TGACTGTTAC 
GTTAGAACAG 
TTTTGACTTA 



1106 
AGCTGGTTTA 

TTCTCGTTCG 

AGAGGCTCGC 

GTCCTTTGAT 

TGGAAATTTG 

TGTGGATGAA 

GATTTCAACT 

TAAGGAGTTG 

CAAGGCACTT 

TTGAAACTAG 

CTGATCGATT 

CCAACCTATC 

TTGCCAAGAG 

AGGAGAAAGG 

GAGTTAGCCC 

TTTGGGAAGA 

CTAGTCCATA 

TTCGAAGAGG 

GTTGGGCAGT 

CAATCTCTAT 

GTGGATGAGT 

GGGAACTATC 

GTTTTTCAGA 

CGGCAATTGC 

AGTAATTTTA 

CTCAAGGATA 

ATTCAACAGA 

ATTATTAGCG 

TTAGATAAAG 

CCTCAGTTGG 



ACCATTCGAA 
GGTCACGGAC 
TTGTTTGGAA 
TCTTTTGATC 
ATTGCCTTGC 
CAGTTTCAGC 
GCTAAAGTGG 
AAGGTCGTTT 
TTTTCTTGAA 
AATAGTACAC 
TGTCCTGTTA 
AAATTCCTGA 
GTTTGCTGTA 
TACAAAGCTC 
TCTCAGATAT 
CCGTTTTAGG 
ATAAGCAACT 
AGGAGGCTAT 
ACGGTGGGAC 
TTAAGTTGGA 
GTCATCATGT 
TTTACGGTTT 
GAATTGGTGA 
AATTAAGATT 
TACAGCTTAG 
TTCTAGCCCA 
TAGATGTCTT 
GAAAAACCAA 
GGTPTGTTTT 
ACACGCTTAT 



GAATAGCCAG 
TCCATATTTG 
AGAAACTGAT 
GCATGTTTCC 
CTTTTCAAGG 
CTTATGAAGA 
CACTGTTAAT 
TATCCAATAT 
AAATATGGCT 
CTCTGCTTCT 
TTATTTCATT 
GAGAATGTAT 
TCCATTGCAA 
TATTAGCGTG 
GACTTCTAAA 
TGCTGCTCTT 
CTTAGACCAA 
CCGTTATACG 
TAAGAAATGG 
AAATAGTCAA 
CTCTGCCTTG 
GACGGCTACG 
GATACTCC^T 
CACTTCTTTT 
TGATTGGATT 
AGTGGCAGAA 
TGAAAAATTA 
AGTCCGAGAG 
GTTGTCTACT 
CTTGGCAGCA 



1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
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CCCTTTTCTT 


GGAAAAATAA 


TTTGATTCAG 


TATGCAGGTC 


GGATTCATAG 


AAACTACAAG 


3120 


GATAAGTCTT 


TGGTGCGTAT 


TTTCGATTAT 


GTGGATATTC 


ATGTTCCTTA 


TTTAGAAAAG 


3180 


ATGTTTCAGA 


AACGACAAGT 


AGCTTATCGA 


AAGATGGATT 


ATCGTGTCAT 


CGAGGGTGAG 


3240 


GAGAAACAAT 


TCGTTTATCT 


TGATAGTAGA 


TATGAGAAGG 


TGTTGAGAGA 


GGACTTAGCA 


3300 


GGGGAAAGAC 


AGGAATGTCT 


GCTTATTTTA 


CCTTATGTGC 


AC C AG AC AAA 


ACTGATGAAT 


3360 


TTTCTAXAAG 


AATTTAGGAT 


TAGTCAAATT 


GAGATATGTA 


TACCAGAGAC- 


GGTTGOAAAT — 


- 3420— 


AAAGCATGGC 


TAGACCAGTT 


GAAGAGCCAG 


AAAATTAAAG 


TGTCTTTTAC 


TCAATCAAAA 


3430 


ATAGTAACGC 


CTATTCTTTT 


GGTGAATAAG 


ACTATTGTTT 


GGTATGGTGC 


AATGCCATTA 


3540 


TTAGGGAAGG 


TAGATGAGAT 


GACCATATTA 


CGTTTGGAAT 


CAGCTAGTAT 


AGTTTCTGAA 


3600 


CTAGTGGCAG 


GTTTACGATA 


GAGAAAATTT 


TTAAAAATTT 


CTATGTATGA 


TTTTCATTTC 


3660 


TTTAGTGAGA 


CTGTTGCCAT 


TATCACATTC 


GAATCACACA 


AAATAAAAAA 


ATTTTTATAA 


3720 


GTACTTGACA 


AATAGATTGA 


AATATCATAA 


AATAAAAACG 


GTTACAGAGT 


TATTAATTAT 


3780 


TTAAGCTTCA 


TGTCACCATT 


AAAAATTGAA 


ATAAAAGGAT 


GTTATCACTA 


ATACAAGTGA 


3840 


GCAGGAACCT 


ATTTAATCAC 


ATCAGAAGAA 


GTTTCTTGAT 


GTTTTTAAGT 


AGGTTCCTTT 


3900 


TATTTTAAAA 


GGGAAATTTT 


ATGATCATAA 


AACGAATACT 


AAACCACAAT 


GCCGTAATTG 


3960 


CGCAAAGTAA 


AAAAGATATC 


GATATTCTTC 


TTTTTGGAAG 


GGGAATAGCT 


TTTGGAAGAA 


4020 


AAACTGGAGA 


TAAAGTAAAT 


CCAATTGATA 


TTGAGAAAAG 


TTTTTTTCTC 


AAAAATAGAG 


4080 


ATAATATGAC 


CCGTTTTACA 


GAGATGTTTA 


TTAACGTTCC 


TTTGGAGTTG 


GTGTACATCA 


4140 


CCGAAAAAAT 


AATTAACCTA 


GGTAAAATAA 


CATTGGGTAA 


TAATTTTGAT 


GAAATTATCT 


4200 


ATATTAATTT 


AACGGATCAT 


ATTTCTTCGA 


GCATAGAACG 


TTATAAAGAA 


GGGATTATTA 


4260 


TTTCGAATCC 


CCTACGCTGG 


GAAATATCGA 


AATATTATAA 


AGAAGAATTT 


GAACTTGGGA 


4320 


AAAGGGCTTT 


ACAAATAATA 


AAAAAAGAGT 


TAGGTATTGA 


ACTTCCAATT 


GACGAAGCTG 


4380 


CATTCATAGC 


GCTACATTTT 


GTTAATGCTA 


ATTTAGAAAA 


TAATTTTCAA 


GAGTCGTATA 


4440 


AAATCACTGA 


AATAATTATG 


GCAATTGAGA 


AAATCATTCA 


AGATTTCTAT 


TGTACTGAGT 


4500 


TTAACCAAGA 


TTCTATTGAT 


TATTATAGAT 


TCATAACTCA 


TATGAAATTA 


TTTGCCCATC 


4560 


GCTTGGTTGA 


GAATACAACT 


TATTGTGACG 


ATGATGA 






4597 


(2) INFORMATION FOR SEQ ID NO: 176: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3984 base pairs 
< B > TYPE: nucleic acid 
(C) STRANDEDNESS : double 
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(O) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 



CGCCTTATTT 


ACTACTTGTT 


CCATCATATA 


TGGAATATGC 


ATGAAACCTG 


CTCTCATATT 


60 


AGGGAATTTT 


TTATCCACTA 


AATAAAGAGC 


TTGGTACATC 


AAJCTCATTGC 


AAACAAAGGT 


120 


TCCTGCACTA 


TTGGATACAA 


CTGCCGGAAG 


TCCCTGTTTT 


TTGATAGCTT 


GTACCATCGC 


180 


TTTGATAGGT 


AAACTACTAA 


AATAGGCCGA 


TGCTCCATCA 


ATACGAATCG 


GTGTATCAAT 


240 


TGGTTGATTG CCTTCGTTAT CAGGT ATGCG AGCATCATCT TGATTAATAG CCACTCGTTC 


300 


AGGTCTTAAG 


CCGGTCCTGC 


CGCCTGCTTG 


TCCAATACAA 


AGTACAGCAT 


CTGGTTGATA 


360 


TCGTAATATT 


TCTGCCTCTA 


AAACTTCTGA 


CGACTTATAA 


AAAACCGTTG 


GAATTTCTAC 


420 


CCAGCGAACT 


TCAGCCCCAT 


TAATCTCAGA 


TGCTAATAAT 


TTTACAGCCT 


CCAAAGCTGG 


480 


ATTAATCTTT 


TCACCTCCAA 


AAGGATTAAA 


ACCTGTAACC 


AATATTTTCA 


TTTTATTTTC 


540 


CTTTACTAAA 


ATGCGAGAAA 


GTACATTAAG 


AATATGTGAA 


TAACAATCAT 


TACTAGAGCA 


600 


ACACCTGCTT 


GAGCCTTTAT 


AACGCCATTC 


TGATCTTTCA 


TATCCATCAA 


TGCTGCTGGT 


660 


AGAGCGTTAA 


AATTAGCAGC 


CATTGGGGTC 


AATAAGGTCC 


CACAATAACC 


TGCTGTCATC 


720 


GCAAGAGCAC 


CAGCCACAAT 


TGGATTAGCT 

s 


CCCAGAGCAA 


ATACAAAGGG 


AACTCCAACA 


780 


CCTGCTGTAA 


TAACGGTGAA 


TGCTGCAAAA 


GCATTTCCCA 


TAATCATTGT 


GAATAGAACC 


840 


ATTCCAAGAA 


CATAGCCCAA 


AACTCCTATA 


AAGCGACTAT 


CTGAAGGAAC 


AATACCGCTA 


900 


ATCAGATGAG 


AGATAACATC 


ACCAACACCT 


GCTACAGTAA 


AAATAGCCCC 


CAAAGCCCCT 


960 


AATAATTGAG 


GAACAATCCC 


ACTTGTTGAA 


ACTTGCTGAG 


TCATTCGATT 


ATTTTCTGAT 


1020 


AACAGACTCT TAGGGTGACT ATTGGTAATC 


ACAAGAACAG 


AAATTGTAGC 


AAACAAGGCG 


1080 


GCAAGGCTAA 


TCGAAATCTT 


GCTAAATTCT 


GGAATCATTT 


GCGCTAAGAC 


CAACGCAAGT 


1140 


ATTGCCATCA 


GCATAACTGG 


AATAAAAATT 


TTATTTTTCA 


ACCTGTTAGA 


TTCAATATTG 


1200 


GCTTTCATTT 


CATCTAAGGA 


TGGCAAGGTT 


CCGATACGGA 


CTTGCTTAAA 


CAATGTTAAC 


1260 


AGCGATAATA 


GGATTACAAT 


AATACCAATA 


CTCATATTTG 


GCATATAGGA 


ACCACCTATA 


1320 


AACGTAATAG 


ACAATAGAGT 


CCAAAATGCA 


GATGTCCCAA 


GTCGAACTGG 


GTTTCTTTTA 


1380 


TCTTTATAAC 


TACAATAGGC 


TGTATGGAGA 


AATTGACAAC 


CAATCACAAT 


ATAGGTCAAC 


1440 


TCTAATAGTT 


GCTTTGCCAA 


CTCTGTCATT 


TTTGTTCTCC 


TCCCCTAGTC 


TTTTTTGATA 


1500 


TCAATTTTTT 


ATCAAATAAA 


TAATTATAAA 


TCCCCACTAC 


AATAAGTGTT 


ATAACAGCAA 


1560 


CAATAATAGA 


TGTAGAAGCA 


ATCCCTGCAT 


AATTGCTTTC 


ATAGCCTAAC 


TGATCTAATG 


1620 
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TTCCCCCTAT 


CAAGAGGACT 


CCCCCAGCAC 


CTACAAACGT 


ATTTTGAGCA 


AAGAAATTTC 


1680 


CAAAATTTTC 


ATTCGCAGCC 


GCACGCGCTT 


TTATTGTCTC 


ATCTTCAACC 


TCTGTTAACT 


1740 


TTCTACCTAA 


TTGAGACTCT 


GCAGCTGCTT 


CTCCCATAGG 


TTGAACCAAA 


CGTCTGACAA 


1300 


ACTGAGGGTG 


TCCTCCTAGA 


CGAATTGAAA 


AGAAACCAGC 


TAACTCTCGA 


ATAAAGAAAT 


1860 


AAACTGTATA 


GAAGTTTCCA 


ACTGTCAGAC 


CTTTAATCTT 


TCGAATCAAA 


TCGATTGATC 


1920 


GTTGCTTGAG 


TCCAAAGGTT 


TCTGACAGCC CCACAAGAGG 


CAACGTAACC ATAAAAATCG 


1980 


TGAGCACTCG 


CTGATTGCTA 


AATTCTTTTC 


CCAAAATCTC 


CAAAAATTCA 


ACGAGAGAAA 


2040 


CACCTGAAAC 


TAAAGCTGTA 


ACCAAACCAG 


CTAAGACTAC 


TGTTGCAATT 


GTATCAAATT 


2100 


TTAAAATAAA 


ACCCACAACA 


ATGATTGCTA 


TTCCTATTAA 


TCTAATCCAC 


TCCATATCAA 


2160 


ACTCCTTTAT 


ATTCAAAATG 


ACAGTATTTT 


TAAAATTTTA 


TCAAGATCAA 


TACCATTCCT 


2220 


TATTTAATGT 


GTTTTTCTAG 


TTCTTTTTGG 


TATTTGCTAT 


TGGATTCCAA 


TTTTTCTTTT 


2280 


TGCCATTTTT 


TAAAAACCTC 


GTTATATTCT 


TTTGTTGTAA 


CAATATCTTT 


TTGCAATTTC 


2340 


ATTCCTTTAA 


AGATATATGG 


ATCCCCCTTA 


ATACCAACTT 


GTGAGTATGG 


TTTTGAGAAT 


2400 


GGTACTACGT 


TACTTACAAC 


TGGAGAACCA 


CCAGATGAAG 


CTGTTGGCAT 


CAATAATGAA 


2460 


CTATCTGTCG 


ACCAAGCTTG 


AGCTTTGGCA 


TATTTTTCAT 


ATCTTTTCTC 


TAGGTCAGTG 


2520 


GTCTCAGAAA 


CAGCATCTTC 


TAACAATTTC 


TTATATTTAT 


CCAAACCAGG 


TTTAGCTACA 


2580 


ACATCCTTAT 


CTTTTCCTTT 


CGTAATACCA 


AGGTGTTTCA 


TGGCAGAACC 


AGATTTTGGA 


2640 


TCTATAATAT 


TCAAGTGAGA 


CGCTGGATCT 


TGATAGCTTG 


CAGCCCATCC 


TGTACTGTTC 


2700 


AAATCATAGT 


CTTTTTGAGA 


AGGAGCAACA 


TTGCCGTATT 


TATCATTTTC 


CATCAAACCA 


2760 


TCAATAACAT 


TTCCAATAAC 


GTCTGTCCTC 


GATGTTCGAG 


TCGCTATACT 


GTAGCCCAAT 


2820 


GATGCTGGAT 


CTACTGCATA 


GACATAAGAA 


AATGTTGTCG 


GTGCATCTGC 


TTCTTTATCA 


2880 


GTTTTTCCAC 


AAGCCACTAA 


AATAGCTGAC 


GTGCTCAGGA 


CCACTCCTGC 


TGTTAAGAGC 


2940 


CACTTTTTCT 


ATTTCATAAA 


GAATCTCCTT 


TGGTTTATTT 


TAATCTACTT 


TTACAATCCA 


3000 


ACCTTCTGGC 


GCTTCAATAT 


CGCCAAACTG 


AATACCCGTC 


AATTCATTAT 


ATAATTTACG 


3060 


CGTCACAGGA 


CCTACTTCTG 


TTTCACTATA 


GAATACATGG 


AAATCATCAC 


CATGTTGAAT 


3120 


ACCTCCAATT 


GGAGAAATAA 


CCGCTGCTGT 


ACCACAGGCA 


CCTGCCTCTA 


CAAAACGGTC 


3180 


AAGATTATCA 


ATTGGAACAT 


CACCCTCAAT 


AGGAGTTAAT 


CCCAAGCGAT 


GTTCTGCCAA 


3240 


ATAAAGCAAG 


GAATACTTGG 


TAATAGATGG 


CAAGATAGAT 


GGACTCAATG 


GTGTTACAAA 


3300 


TTCATTATCA 


GCTGTAATTC 


CAAAGAAGTT 


AGCTGATCCG 


ACTTCTTCAA 


TCTTTGTATG 


3360 
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AGTTGATGGG 


TCCAGATAGA 


TAACATCTGA 


GAAATCACGT 


GACTTGGCCA 


TTTTTCCTGG 


3420 


• 


TAAGAGACTT 


GCAGCATAGT 


TTCCACCAAC 


CTTAGCCGCA CCTGTACCAT 


TTGGTGCTGC 


3480 


- 


ACGGTCGTAC 


TCATCCTGAA 


TCAAGAAGTT 


GGTTGGGACC 


AAACCACCTT 


TAAAGTAATT 


3540 


• 


TCCAACTGGC 


ATAGCAAAGA 


TGGTGAAAAT 


GTACTCTTCT 


GCCCGTTTTA 


CCCCGATAAT 


3600 




ATCTCCGACA 


CCAATCAAAA 


GACGGCGAAG 


ATATAAGCTT 


CCACCTGTTC 


CGTATGGTGG 


3660 




TACGTATTCT 


TCATTCGCAC 


GGACAACTGC 


TTTACAAGCT 


TCTACAAACA 


TGTCTGTCGG 


3720 




AACTTGTGGC 


ATCAAGAGAC 


GGTCACATGT 


ACGTTGCAGA 


CGTTTAGCAT 


TTTCATCAGG 


3780 




ACGGAACAGT 


TGAACACTGC 


CATCCTTAGT 


ACGATAAGCT 


TTCAAACCTT 


CAAATGCTTG 


3840 




TTGTCCATAG 


TGAAGACTTG 


GAGAAGACTC 


TGAAATATGC 


AAAGTTCCAT 


CCTCTGTAAC 


3900 




CTCTCCTTGA 


TCCCATTGTC 


CATTTTTGAA 


ATGAGCAAGA 


TAGCGATAAG 


GTAATTTCAT 


3960 




ATAGGAAAAA 


CCGAGGTTTT 


CCGG 








3984 





(2) INFORMATION FOR SEQ ID NO: 177: 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8703 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

TATCTAATTA TTGGTTTTTA TCGCTGACCT TGGCTATTGT TGGG G TT GT T TTACCCTTGT 60 

TCCCTACAAC ACCTTTCCTT TTGTTGTCTA TTGCTT G TTT CTCCAGAAGT TCCAAGCGAT 120 

TCGAAGATTG GCTTTATCAT ACCAAGCTCT ATCAAGCATA TGTAGCTGAT TTTCCTGAGA 180 

CCAAGTCTAT TGCGCGTGAA CGAAAGAAAA AAATCATCGT CTCTATCTAC GTCTTCATGG 240 

GAATTTCTAT TTATTTTGCA CCTCTTTTAC CAGTCAAAAT CGGTCTGGGT GCTTTGACCA 300 

TCTTTATTAC TTATTATCTC TTCAAGGTCA TTCCAGACAA AGAATAGTTA AAACAGTAC T 3 60 

TATTTGCCTT GATAAAATTG AAAGCATATT CATAACAATA TGATATAATA AAATTGAAGT 420 

AATATTCAAG GAGAATCAAA TGATTTACGA ATTTTGTGCT GAAAATGTGA CTTTACTTGA 480 

AAAAGCCATG CAGGCTGGAG CTCGTCGGAT TGAACTCTGT GATAATCTAG CAGTTGGTGG S40 

GACAACACCC AGCTATGGAG TGACTAAGGC AGCGGTTGAA CTGGCAGCTA ACTACGATAC 600 

AACCATCATG ACCATGATTC GGCCACGTGG TGGTGACTTT GTCTATAATG ACCTAGAAAT 660 

TGCTATCATG CTAGAAGACA TTCGTTTGAC TGCTCAGGCT GGAAGTCAAG GGGTTGTATT 720 

TGGAGCTTTA ACTGCTGATA AAAAGTTGGA TAAGCCTAAT CTGGAAAAGT TAATTGCTGC 780 
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ATCAAAAGGA ATGGAAATTG TCTTTCACAT GGCCTTTGAT GAACTAAGTG ATGAAGATCA 840 

AGCGGAACCT ATTGACTGGC TCAGTCAAGC CGGTGTCACT CCTATCCTAA CTCCTGCTGG 900 

TGTGTCTGGC GACTCCTTAG AAAAACGTTT TGTTCACTAT CACAGAATTT TGGAGTACGC 960 

TAAAGGTAAA ATTGAAATTC TACCAGGTGG GGGGATTGAC CTTGAAAACC GTCAAACCTT 1020 

TATCGACCAG GTGGGGGTAA CACAATTGCA TGGTACTAAG GTTGTTTTTT AAAAAATAGA 1080 
AAGGAACTGC TAGCTTTGGG TAGCAGTTTT CACTTATGTT TGAAATTTTT AAATCCTATC ' ""1140 

AATTTAATCA AGAAAAGGCT CATGATTATG CTTTTATAGA AAATAGCGAA GTCTGGACAT 1200 

ATAGTTGCCA GA T TTTGCAA GGTGACTTTG TCATGACTGT GTCCATCACT GCTGATAATG 1260 

TGAACTTTCA AGTCTTTCAC CAAGAGACTG GTGACCTCTA TCCTCACGTT T AT AT GG AAA 1320 

GCATGAGGGG AAGTTTTGTC GGAAATGTCC GTGAGGCTTG TCTGGAGATT CTTTACCAGA 1380 

TTCGGAAGGC TTGTTTTGAT GTGCAAGATT TTATCTGTCA TCAGACTAAG CGTATCATGA 1440 

CTCAAGTTCA GGAAAAGTAT GGAAACCAGT TGGAGTATCT GTGGGAAAAA TCGCCTGATA 1500 

CAGCTGTATT GCGCCATGAA GGCAATCAAA AGTGGTATGC CGTCTTGATG AAAATCTCTT 1560 

GGAATAAGCT GGAAAAGGGC AGAGAAGGAC AAGTGGAAGC AGTCAACCTC AAGCATGACC 1620 

AAGTAGCTAA TTTGCTTTCA CAAAAGGGGA TTTATCCAGC CTTCCATATG AGCAAGCGCT 1680 

ACTGGATTAG TGTGTCCCTT GATGATACTT TATCAGATGA AGAAGTACTG GAATTGATAG 1740 

AAAAAAGTTG GAACTTAACC TCTAAAAAAT GAAATATTTT AATAATTTTC ATGAACTTTC 1800 

AATTAGCTAA ATATTCTTTA CTGAAGAGAT TTTTAGAAAA TATAGGATTT ACCACACTAG 1860 

AGGAATATGG TGCCATCTTC AAATACCTGA TTGAGAATGT CAAGACGGAT CGTCAGATCA 1920 

TCTATTCGCC TCACTGTCAT GATGACCTCG GAATGGCAGT GGCAAATAGC CTTGCTGCTG 1980 

TCAAGAATGG TGCAGGACGT GTTGAAGGGA CTATCAATGG TATTAGGGAG CGAGCTGAAA 2040 

ATGCTGCTTT GGAAGAAATT GCAGTGGCTC TCAATATTCG CCAAGATTAC TACCAAGTAG 2100 

AAACCAGTAT TGTCCTAAAT GAGACCATCA ATACGTCAGA AATGGTTTCT CGCTTCTCTG 2160 

GTATTCCAGT TCCTAAAAAC AAAGCCGTCG TTGGTGGCAA TACCTTCTCC CACGAATCTG 2220 

GTATTCACCA AGATGGAGTC CTTAAAAATC CTCTCACTTA TGAGATCATC ACACCTGAAT 2280 

TGGTTGGTGT TAAGATTCTG CTTGGAAAAT TATCTGGTCG CCATGCTTTT GTTGAGAAAC 2340 

TGAGAGAATT GGCCCTAGAT TTTACAGAAG AGGATATCAA ACCACTCTTT GCTAAGTTCA 2400 

AGGCACTGGT CGATAAGAAG CAAGAAATCA CAGATGCAGA TATTCGAGCT TTGGTAGCTG 2460 

GAACCATGGT TGAAAATCCA GAAGGCTTCC ACTTTGATGA TTTACAACTT CAAACTCATG 2520 
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CAGATAATGA 


CATTGAAGCG 


CTCGTTAGCC 


TACCCAATAT 


GGATGGTGAG 


AAAGTCGAAT 


2580 


TTAATGCCAC 


AGGGCAAGGT 


TCCGTTGAAG 


CAATCTTTAA 


TGCTATCGAT 


AAGTTCTTTA 


2640 


ACCAATCTGT 


TCGTTTGGTG 


TCCTACACTA 


TCGATGCGGT 


AACAGATGGA 


ATCCATACCC 


2700 


AGGATCCGCT 


TTTGGTCACT 


GTTGAAAACA 


GAGATACAGA 


AACCATCTTT 


AATGCAGCAG 


2760 


GGCTTGATTT 


TGATGTGTTG 


AAGGCTTCTG 


CTATTGTCTA 


TATAAACGCT 


AATACCTTTG 


2820 


TTCAAAAAGA 


GAATGCAGGT GAGATGGGAC GCAGTGTTTC TTACCACGAT 


ATGCCTAGTG 


2880 


TGTAAAGGAG 


AAGGCTATGG 


CAAAGAAAAT 


AGTACCTCTA 


GCAGGAGACG 


GAATTGCCCC 


2940 


AGAAATCATG 


GAGGTTGGTT 


TAGAAGTTCT 


GGAGGCTCTA 


GCTGAAAAAA 


CAGGTTTTGA 


3000 


CTATGAGATT 


GACAGACGAC 


CGTTCGGAGG 


TGCAGATATT 


GATGCAGCAT 


GACCTCCCTT 


3060 


ACCTGATGAA 


ACCCTTAAGG 


CAAGTAGGGA 


AGCAGATGCT 


ATCCTACTAG 


TAGCTATCGG 


3120 


TAGTCCTCAG 


TATGATGGAG 


CAGTGGTTCG 


CCCTGAACAA 


GGCCTGATGG 


CTCTCCGTAA 


3180 


GGAACTCAAT 


CTTTACGCTA 


ATATTCGTCC 


TGTAAAAATC 


TTTGACAGTC 


TCAAGCATTT 


3240 


GTCACCACTC 


AAACTGGAAC 


GAATTGCTGG 


TGTAGACTTT 


GTCGTGGTGC 


GTGAATTGAC 


3300 


AGGCGGGATT 


TACTTTGGAT 


ATCATATTCT 


TGAAGAGCGC 


AATGCGCGTG 


ATATCAACGA 


3360 


CTATAGCTAT 


GAGGAAGTGG 


AGCGGATTAT 


TCGCAAAGCC 


TTTGAAATTG 


CAAGAAATCG 


3420 


CAGAAAAATC 


GTTACTAGTA 


TCGATAAGCA 


AAATGTTCTA 


GCGACCTCAA 


AACTCTGGCG 


3480 


GAAAGTAGCT 


GAGGAAGTCG 


CACAGGATTT 


CCCAGATCTA 


ACCTTGGAAC 


ATCAGCTGGT 


3540 


AGACTCAGCT 


GCTATGCTTA 


TGATTACCAA 


TCCTGCTAAG 


TTTGATGTTA 


TTGTAACGGA 


3600 


GAATCTTTTT 


GGAGATATTT 


TATCTGATGA 


ATCAAGCGTC 


TTATCTGGTA 


CACTTGGGGT 


3660 


TATCCCATCA 


GCCAGTCATT 


CTGAAAATGG 


ACCAAGTCTC 


TATGAACCTA 


TTCACGGTTC 


3720 


AGCACCTGAT 


ATTGCAGGTC 


AAGGAATTGC 


CAATCCTATT 


TCCATGATTT 


TATCAGTTTC 


3780 


CATGATGTTG 


AGAGATAGTT 


TCGGACGTTA 


TGAGGATGCA 


GAGCGTATCA 


AACGTGCTGT 


3840 


TGAGACAAGT 


CTGGCGGCAG 


GAATTTTAAC 


GAGAGATATA 


GGAGGTCAGG 


CTTCAACAAA 


3900 


GGAAATGACG 


GAAGCTATTA 


TTGCAAGGTT 


ATGAAGTTAG 


ACGAAAAAAT 


TACTCTAGTC 


3960 


CTTTTGATTT 


GGAATGTCAT 


CATTTTCTTG 


ATTTATGGTA 


TTGACAAATC 


TAAGGCAAGG 


4020 


AGAAGAGTTT 


GGCGCATCCC 


TGAGAAAATC 


TTACTTATTT 


TAGCCTTTAC 


TTTTGGTGGT 


4080 


TTTGGTGCCT 


GGCTAGCAGG 


AATCATCTTT 


CACCACAAGA 


CTCGAAAATG 


GTACTTTAAA 


4140 


ATAGTTTGGT 


TTCTTGGGAT 


GGTGACCACA 


CTAGTAGCCT 


TATATTTTAT 


TTGGAGGTAA 


4200 


TGGATGGCAG 


GGTCTTCGAG 


GGAATACGCT 


GCTTGGGCTC 


TAGCGGACTA 


TGGTTTTAAG 


4260 


GTCGTGATTG 


CAGGATCTTT 


CGGTGACATT 


CATTACAATA 


ATGAACTCAA 


TAATGGCATG 


4320 
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TTGCCAATCG TTCAGCCTAG AGAGGTTAGA GAGAAACTAG CCCAGCTAAA ACCAACCGAC 4 380 

CAGGTAACTG TGGACTTGGA ACAACAAAAA ATCATCTCAC CAGTTGAAGA ATTCACCTTC 4440 

GAGATAGATA GCGAGTGGAA ACATAAACTC CTAAATAGTT TGGATGATAT CGCTATTACC 4 500 

TTGCAGTATG AAGAGTTGAT TGCTGCTTAT GAAAAACAAC GACCAGCCTA CTGCCAGGAT 4 560 

TAGAAAAAAT AGAAAAGGAG ATATAGTAAA CTGAAATAAG ATGTAAACAA ATGAATTGGA 4 620 

GCTTAACATC CATTTCCAGC AATTTTTTAG AAACTACAGT GG ACT AT TCT GG ATTCAAC A 4680- 

CATTATAAAA TTATGACAAA ACACATTCAC AAGAAGGCTA CGACATTTTA AAAGGTGAGG 47 40 

GCGGATGTAT CGTTTGCCCT ACTAAAGTTG GTTACATTAT CATGACCAGT GACAAGGCAG 4800 

GACTTGAGCG TAAGTTCGCA GCCAAAGAAC GTAAGCGTAA CAAACCAGGT GTTCTTCTCT 4 06C 

GCGGTAGCAT GGATGAACTT TGCGCTTTAG CGCAACTCAA CCCAGAAATT GAAGCATTCT 4920 

ACTAAAAACA TTGGGATCAA GATATTCTTC TTGGTTGTAT CCTTCCTTGG AAACCAGAAG 4 980 

CCTTTGAAAA ACTCAAAGCA TACGGGGATG GCCGTGAAGA ACTTATTACT GATGTACGTG 5040 

GTACTAGCTG TTTTGTTATC AAGTTTGGAA AAGCAGGTGA ACAATTGGCT GCCAAGCTTT 5100 

GGGAAGAAGG TAAAATGGTC TACGCCTCAT CTGCTTCAAT GACAAAACGA TTGAAACTCG 5160 

CTATGAGCAA GGTGTAATGG TGTCTATGCT CGATAAGGAC GGCAAACTCA TCCCAGAACA 5220 

AGGAGGACCA CGTTCAACTT CACCAGCTCC AGTTGTGATC CGTAAAGGGC TTGACATTGA 5280 

TAAAATCATG ATGCACCTGT CAGATACTTT TAACTCATGG GACTACCGTC AGGTTGAGTA 5340 

TTATTAGGAT AGAGAAGAAG TCTAGTGTTA TGAGATATTA AAGCTCCTAA CACTGGGCTT 5400 

TTGTTTAGAA TTTCTTTTCT TTTTCTATAG GATATGGTAT TCTATGTAGA AAATATATGT 5460 

TAATAAGTAA TGCCAATATT TAAACATCAT TAGTAAAAGG AGTTAGATTG ATGAATAAAA 5520 

GAAAAGTTAG TTTAGAAGAT TTTTATAAAT GGTATAGTCT AAATAAAGAA GAGTTATTAA 5S80 

ATAAGGCAAC TGTTGGTGAA AAGTTTAATG ATAAATTAAA AGAACACTTT CTCCAGGAAT 5640 

GGCCTTTGGA TAGGATTTTA ACAATGTCAA TCGATGAATA TGTAATAGGA AAGGGACAGC 5700 

AAAATAAGTC TTTATGCTAC GCTCTTGAGA AGGGAAAATA CAAAAATCTA TTTCTTGGAA 5760 

TTTCTGGTGG CTCAGCTTCA AAATTTGGTA TTTATTGGAA TAAAAAAACA AACAAATATA 5820 

AAGATCAAGC TAATAATGAG ATTTCAGAGT TGGATCAGCG ATTTTCAAAA TTAAAATCAG 5880 

ATTTGTATGA AATTATCAAA GAAGGTATTC GTTTTAACTT TGAAAATCCT ATTTTTGATA 5940 

TGAAAAGATC AACAAATGAA TTTATTGGTC GTTCTGCTAT GGTGACAAAA TTACTTTGTA 6000 

TCTATACTGA GGGAGATCCT TTCTTTGGTG TAAATATTAA TAGTCAGAAA GAATTTTGGA 6060 



WO 98/18931 



PCT/US97/19588 



1114 

ACCACTTTGT TTCTCAGACA AATCAACGTG GACCTTATCT GCAAAATCAT AAAATAATTG 
AACTGGTGTC CAAAACTTAT CCTGAGTTGG AGCCATCGAA ATTAGGAACT ATGCTTTTTG 
AGTATTCTAA GCTTTTTATG GAAAATAAGG AAGACAATAG TACAATGGAT TCATCAAACA 
ATTTTCGTCA TCAATTAACT CAATCTCTAT TAAAGTCTCC AAACCTCATC CTCCGCGGTC 
CTCCTGGCAC GGGAAAAACT TATCTTGCPA AAGAAATTGC TAAAGAATTA ACGGATGGCA 
ACGAAGATCA AATCGGATTT GTACAATTTC ACCCATCATA TGATTATACG GATTTTGTAG 
AAGGTTTAAG ACCAGTATCA AATGGGGATC GAGCTATTGA GTTTAGGCTA CAGGACCCTA 
TTTTT AAAGA TTTTTGTCAG AAAGCAAAAG AAACCCAATT GATTGGAGGA CAAGATAATT 
TTGATGACGC TTGGGATTCT TACTTAGAAT ATATAAATGT TGCTGAAGAA AAAGAATATA 
TAACAAAAAC ATCTTACTTA TCTGTTAATA GTAGACAAAA TTTGTCAGTA AATTATGATA 
GTGGTGTTCC AGGATGGTCA CTACCTAGCA AATATGTTTA CGAGTTGTAT AAAGATAAAA 
ATTATAATAA GCAAGAATAC TACAAAAGTG CTGGAAAAAC TGTCCTAGAA ACATTGAGAA 
AGAGATTTGG TTTGAAAGAC TATGTTTCCC CAACAGAAAT TGATACTGAT AAGAATTTTG 
TCTTCATCAT CGATGAGATC AATCGTGGGG AGATTTCTAA GATTTTTGGC GAACTCTTTT 
TCTCTATCGA CCCCGGCTAT CGTGGTGAAA AAGGAAGTGT TTCTACCCAA TATGCAAATC 
TACACGAAAC TGATGAAAAG TTCTATATCC CCGAAAATGT TTACATCATC GGAACTATGA 
ATGATATTGA TCGTTCAGTG GATACCTTTG ATTTTCCTAT GCGTCGTCGT TTTCCTTTTC 
TTGAAGTTAC TGTCGAGGGT CAAGCTGGCA TGTTGGATAA AGAGTTGAAT ATCCATGCAG 
AAGAAGCAAA AATTCGTCTA AGAAACTTGA ACCCTGCTAT CCAAAATATT CAGGAATTAA 
ACAGTCATTA TCATATTGGA CCAAGTTATT TTCTTAAGTT GAAGGATGTA GATTTTGACT 
ATGAATTACT CTGGTCTGAT TATATTAAGC CTCTCCTAGA AGACTACTTG CGAGGTTCTT 
ATGATGAGGT TGAAACTTTG GAaACTTTGA AAAAAGCATT TGATCTGACA AATAATGAGC 
AAAAAGATCA GGCACTAGCT GATGACAATG AAGGCGATGA AAACGATCAT GCCGATTAUT 
GATAATCAAC ACAAGATTAT TAAAGAAAAA TTTGTTGAAG AATATCCTAA ACTAAGCAAT 
CCTCTTTTAG ACAGAACCTT GGAAAGTCTA TCCCAAGATG AACGTATTTT CATTTTTCCA 
AATGATTwCA CTCATACTCC TGATTTGGAT AAGGACCAAA AGATTTTTGA AACAGTCAAT 
CAGAAAATCA ACACAGGGAA CGTGATTGGT TTTCTTGGAT ATGGTCAGGA AAGATTAACG 
ATTTCCTCAC GATPTTCTGA TGAGAGTAAT GACCACTTTT TGCATTATCT CTTAAACAAG 
GTTCTTCATA TCAATCTCAC TAGTTTAGAT GTTGCTTTGT CTCGTGAAGA GAGGCTTTAT 
CAACTTTTGG TGTATCTCTT TCCCAACTAT CTACAAGCTG CTATTCCAAA AGGTCTTTAT 



6120 

6180 

6240 

6300 

6160 

6420 

64SO 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7S00 

7S60 

7620 

7680 

7740 

7800 

7860 
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AAGGAATATC ATCGATTTTC TCATAACGAC AGTCATGTTA AGGGAGTGAT TCATGTAAGA 7920 

AACCATCTCA AGAAAAATCT TCCTTTCACG GGAAATATTG CCTACCCAAC GAGAGAGTTC 7980 

ACCTATCATA ATCCCCTCAT GCAGTTGGTC CGTCACACTA TTGAATACAT TAAGAATCAG 8040 

AAAAGCATTC GTCAAGGGGT ACTAGATAAT CTCTCAACTA GTCGTGAAAA CGTATCTGAA 8100 

ATCGTGCGTG TAACGCCCTC TTATAAACTA GCTGATCGTG CTAAGATTAT TCCGGGAAAT 8160 
CAATCTAAAC CTATACGTCA TGCATACTTT ^TACGAGTACA GAAACTTACA AGAACTTTGT ' ' 8220 

CTGATGATCC TAAACCAAGA AAAGCACGGT TTAGGGTATC AAGATCAAAA AATCTATGGT 8280 

ATTCTCTTTG ATGTTGCCTG GCTTTGGGAA GAGTATGTTT ACACCTTGTT GCCAAAAGGT 8 340 

TTTGTACATC CCAGAAATAA GG ATAAGACG GATGGAATTT CAGTATTTTC TGTTGGGAAA 8400 

CGAAAAGTAT ATCCAGATTT TTATGACAGA GAACGAAAGA TTGTTCTAGA TGCAAAATAT 8460 

AAAAAACTGC AATTGACTGA AAAAGGAATC AACCGTGAGG ACTTATTCCA GCTGATTTCC 8520 

TATTCTTATA TTTTAAAAGC TGAGAAGGCT GGACTGATTT TTCCTAGTAT GGAGCAGTCA 8580 

GTAAATAGTG AAATAGGAAA AGTAGCTGGC TATGGAGCTC AATTGAAGAA GTGGTCTATT 8640 

CCAATCCCTC AGAATGCCTC ATTCTATAGT ACATTTTGTA AAATCATCGA AAATTCAGAA 8700 
GAG 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4854 base pairs 
(8) TYPE: nucleic acid 

(C) STRAND EONESS : double 

(D) TOPOLOGY: linear 



8703 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

CATCACCACT TTTAGATGGC TTTAACAGTG AAATTATTGC TTTTAATCTT TCTTGTTCGC 60 

CTAATTTAGA ACAAGTACAA ACAATGTTGG AACAGGCATT CAAAGACAAG CACTACGAGA 120 

ATACGATTCT CCATAGTGAC CAAGGCTGGC AATATCAACA CGATTCTTAT CATCGGTTCC 180 

TAGAGAGTAA GGGAATTCAA GCATCCATGT CACGTAAGGG CAACAGCCAA GACAACGGTA 240 

GGATGGAATC TTTCTTTGGC ATTTTAAAAT CCGAAATGTT TTATGGCTAT GAGAAAACAT 300 

TTAAATCACT TAACCAATTG GAACAAGCCA TTATAGACTA TATTGATTAT TACAACAATA 360 

AGAAAATTAA GATAAAACTA AAAGGACTTA GTCCTGTGCA GTACAGAACT AAATCCTTTG 420 

GATAAATTAT TTGTCTAACT GTTTGGGGGC AGTACACAAG AAAGCGCTTT AAAACCAGTA 480 
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CACCTTTTCA TAAGGTTCGC TTGATGTACC AAGATGAGGC TGGTTTCCGT AGAATCAGTA 540 

AACTGGGATC TTGTTGGTCT CCAATAGGAG TAGGTCCACA TGTCCATAGT CACTATATAC 600 

GAGAATTTCG CTATTGTTAT GGAGCTGTTG ATGCCCATAC AGGCGAATCA T TT TTC TT AA 660 

TAGCTGGTGG ATGTAATACT GAGTGGATGA ACGCCTTTTT AGAAGAGCTT TCACAAGCTT 720 

ATCCAGATGA TTATCTTTTA CTCGTTATGC ACAATGCTAT ATGGCATAAA TCAACTACCT 780 

TAAAGATTCC CACTAATATT GGTTTTACCT TTATTCCTCC ATACACACCA GAGATGAACC 840 

CATTGAACAA GTCTGGAAAG AGATTCGTAA ACGTGGATTT AAGAATAAAG CCTTTCGAAC 900 

TTTGGAAGAT GTCATGAATC AACTCCAAGA TGTCATACAA GCATTGGACA AGGAGGTGAT 960 

AAAGTCCATC GTTAATCGGA GATGGACTAG AATGCTTTTT GAAAACACAT GAGTATAAAA 1020 

TTGAATTGCT TATAAAAAAG CTCCATACAC TGGATGTGTA TAGAGCAATG GGGCTTTATT 108 C 

TGATATAGAG TTCTTGGTTT TTTAGGACAA TTTCTCGGAT ACTTGCAAAC TTTTTAAGTT 1140 

TTTTGATTTC TTCTGGATGA GTGACGAGAG TGATAACATA ACCTTCCTTG CCCATACGAC 1200 

CAGTACGGCC AGCACGGTGT GTGTAGGTTT CGCTATCTCT AGGAATATCA AAGTTTACGA 1260 

CACATTCTAG GCTATCGATA TCAATTCCAC GAGCCAAAAG GTCAGTTCCA AGAAGCAGGG 1320 

TTAGTTGGTT ATCTTTAAAC TTTTCTAAGA TGATTTTTCT AAATTTAACA TTAACATCAC 1380 

TAGCGAGGGA AACAGCCAAT ATATCACGAT ACTGTAGTTT TTCCTCGGCA TTCCCAAGGT 1440 

CTGACAGGCT ATTGAAGAAG ACTAGACCAC GGAAATCCTC TACATGAGCC AGTTTTCGTA 1500 

GCATATCCAC TCGATGACGT TGGTCTACCT GCATGTAGAA ATCCTGGATA TTGTCCAATT 1560 

TTTGATCAGA GAGATCAATA GTGCGTGTAT TCGGCACAAT CTTTTCTTGG TCAAACTTGG 1620 

TCGTGGCACT CATGTAGACC AGTTGGTGGT CACGAGGTCC GTAGTGAGTG ATTTTTTCTA 1680 

CAAAGTGAAT CTGAGAATCA TCTAGTAATT GGTCAAATTC ATCCAGGATG ATGGTTTCCA 1740 

CATTCATCAT CTTGATTTTT TTAAGTTTAA TGAGTTCAAA GATACGGCCA GGAGTTCCAA 1800 

TCAGAATTTC TGGCCCCTTT TTAAGACGTT CAATTTGTCG TTTCTGACTT GAACCTGAAA 1860 

GGAAGAGTTG AGCAGTCAAT CCGATAGCTT CTGCCCACGT TTTACATACA TCAAAAATCT 1920 

GTCCAGCAAG TTCCGTATTT GGTGCTAGAA TCAAGAGTTG TTGGGCTTTT TTCTTTTGTA 1980 

GTCTGAGAAG ACTTGGTAGG AGATACGCTA GGGTCTTACC AGTTCCGGTT TGGCTCACTC 2040 

CTAGGAGGTT TTCTCCAGCA AGAAGGGGCT CAAATAGTTG AGTTTGAATG GGGGTGAATT 2100 

CTTGGAAACC GAGTTGGTCA CTCAGTTCTT GCCATTCAGT CGGTAGTTTG GTTTTCATTT 2160 

TTCTGCCTCA AATCTAATGC CAGCAGTCTC GCGCATGGTA TATAGTAGCT CATGAACAGA 2220 

GCCTGCATCA TACAGCCAAG TTTGGTAGAG ATTCAGATCT GGTTGCTGGA 7CATGTGTGC 2280 
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AAATCCAGCC ACTTCCTCAG TCATCGTATG AGGAGCCTGT TGGATAGGAA CCTGGACTTG 2340 

ATTTCCTTGG TGGTCGGTAA AAATAGCTGA GCGAATATGC TCAATCGTGT TGAGAGTCAA 2400 

GGTTCCATCT GTTGTATAAA TCTCGCAAGG AAGATTGGAA GTGATGTTTT TTCCAGCCTT 2 4 60 

GATGTGAACT TGATAGTCTG GGTAGAAGAG GATACCATCT CCATTTAGGT CAATGCTATT 2520 

GTCAAGCTGT TGAGCATGGT AAGTCGCGTC ATTGGCTTTT CCAAAAAGAC CAACAGCAGC 2580 

ATAGAGGGGA TAAATCCCCA AATCCATGAG GGCTCCACCA^GCAAAACGGT CTGAAAAGAC - 2640" 

ATTTGGTGTT TGTCCAGCCA ACAAGTCAGG CATCTTGGAA GAGTATTTGG CATACTTGAA 2700 

ATCTGCTCCT AACACTTGCT TATCTGCTAA AAACTTTTTC ATAGTAGTAA AGGCTTTCTC 27 60 

GTGGTAATTA CGAGCTGCTT CAAAGATAAA ACAGTTATTT TTTTCAGCTG TTTGAATCAA 2820 

ATCAAACCAT TCTTGTGGTT GAGAGACAGC TGCCTTTTCG AGAATAACAT GTTTACCACC 2880 

AGACAAGGCA GCTTTTGCCT GAGCAAAATG TAAGGAGTTT GGACTGGCGA TAT AG ACT AA 2940 

ATCAAAAGAA GATTTGAAGA AGACTTCTAA TTGATCGAAT AGTTGGATAT TCTGATAGCG 3000 

AGAAGCAAAG GTTGCTGCAG TTTCTAGTTT TCTAGAATAG ATTGCGACCA GTTGGTATTC 3060 

TCCACTGGTA TGCGCTGCTT CTATGAAATG ATGGCTGATA GCGCCAGTTC CGATGACACC 3120 

TAATTTTAGC ATAAATACTC CTTTTCCGAT TTTAAATCCT TCTTTCATTA TAACATAGAT 3180 

AGACGGGACT ATCCAACACA GAGGAGAAAA TTTCAAATAA GCTATTAGCT TTCTTTTCCG 3240 

AATAAATAGA TAGAAGCATA GAATCTAGCA AACCTAGATT TAAAAATGTG CTATAATAGA 3 300 

AGGAGGAAAA GGAGGATTCT CAGACATCTA GGTATCACCC CAACTAATGA TTTGTCAATT 3 3 60 

TATCCGCGAT ATCCTGGACT TGCCAGCAAA AAATGTGACG ATTTTGGAGG GAAGTAACAT 3420 

TCACGTCTTG CCTTCCATGC CCTACTCAGC GTAAGATTTC TATACTAGTA TAGACGTCTT 3480 

GGCGGAGTTA GATAATGGAA TCCAAGTTAT CATCGAAATT CAGGTTCATC ATCAGAATTT 3540 

TTTCATCAAT CGCCTATGGC CTTATCTGTG CAGTCAGGTT AATCAAAACC TAGAAAAAAT 3 600 

TCGCCAACGT GAAGGTGATA CCCACCAGAG CTACAAACAA ATCGCACTAG TATACGCTAT 3 660 

CGCAATTGTC GATACTAATT ACTTCTCAGA TGACCTAGCT TTTCATAGTT TTATAGTAAA 3 720 

ATGAAATGAG AACAGGACAA ATCGATCAGG ACAGTCAAAT CGATTTCTAA CAATGTTTTA 3 780 

GAAGTATAGG TCTACTATTC TACCTTCAAT CTACTAGAAA TTCCATAGAT AGAAAACTAC 3840 

ATAATCTCTA CAGATACGGA TGTTGGAGTT GATGTAAGAT GCTTTGGCTT CCTAGAGGAA 3900 

TTGTGGATTG CCAAATTGTA TCATTGAAAT TATTGCTCAA ATTTGTTATG ATATAAATAT 3960 

GAATAAAAGT AGACTAGGAC GTGGCAGACA CGGGAAAACG AGACATGTAT TATTGGCTTT 4020 
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GATTGGTATT TTAGCAATTT CTATTTGCCT ATTAGGCCCA TTTATTGCTT TTAAGATCTA 
CCACCAAAAA AGTTTTGAGC AAAAGATTGA ATCGCTCAAA AAAGAGAAAG ATGATCAATT 
GAGTGAGGCA AATCAGAAGG AGCATTTTCG TCAGGGGCAA GCCGAAGTGA TTGCCTATTA 
TCCTCTCCAA GGGGAGAAAC TGATTTCCTC TCTTACGGAC CTGATAAATC AAGATGTTAA 
GGACAAGCTA GAAAGTAAGG ACAATCTTGT TTTCTACTAT ACAGAGCAAG AAGAGTCAGG 
TTTAAAGGGA GTCGTTAATC GTAATGTGAC CAAACAAATC TATGATTTAG TTGCTTTTAA 
GATTGAAGAG ACTGAAAAGA CCAGTCTAGG AAAGGTTCAC TTAACAGAAG ATGGGCAACC 
TTTTACACTT GACCAACTGT TTTCAGATGC TAGTAAGGCT AAGGAACAGC TGATAAAAGA 
CTTCACCTCC TTCATAGAGG ATAAAAAAAT AGAGCAAGAC CAGACTGAGC AGATTCTAAA 
AAACTTCTCT GACCAAGACT TCTCTGCATG GAATTTTGAT TACAAGGATA GTCAGATTAT 
CCTTTATCCA AGTCCTGTGG TTGAAAATTT AGAAGAGATA GCCTTGCCAG TATCTGCTTT 
CTTTGATCTT ATCCAATCTT CGTACTTACT CGAAAAAGAT GCGGCCTTGT ACCAATCTTA 
CTTTGATAAG AAACATCAAA AAGTTGTCGC TCTAACCTTT GATGATGGTC CAAATCCAGC 
AACGACCCCG CAGGT ATT AG ACACCCTAGC TAAATATGAT ATTACAAGCG GGGT 
(2) INFORMATION FOR SEQ ID NO: 179: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

48S4 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179; 
TAAACAGGTG TTAGGTGCTC TAAACTATTA AAATTCTAAG GAAATAAGGC TACTTTTTCT 60 
GGGTCTTGTT CATAGTAGGT GTGGTTCTTT TTTTCGAGTG TAGCCCATAC CTTTGAGCGC 120 
ATAGTGGATG GTAGTTGGAT GACAGCCAAA TTCAGAAGCT ATTTCAGTCA AAT AAGCA'i C 



180 



TGGATTGTCA GTAAGATAGT TTTTAACTCT ATCTCTATCA ACTTTTCTTG GTTTTGTTCC 240 

TTTTACTTGG TGGTTTAGCT CTCCTGTTTT CTCTTTTAGC TTTAACCAGC CATAAATGGT 300 

ATTACGTGAG ATTTGGAAAA CGTGTGATGC TTCTGTTATA CTACCTGTTC GCTCACAATA 360 

AGAGACAACT TTTTTACGAA AATCTATTGA ATATCCCATA AGAAGATTAT ACCACATTGT 420 

GTACTATTTT TGGTTCATTT TACTATATTT CTAAACACTT AGAAATAATA AAACAAATTA 480 

AATATTATTT CTAAATATTT GAAAATAACA TCTATTTGTA TTATACTATC TTTGAGGTAA 540 

CTATTATGAA CTATATCAAA AGACCACATT ATTTAGATTT TTTAAGAAAA CATCGTGACC 600 
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GACCAATCAT 


CAAAGTTGTG 


AGTGGAGTTA 


GACGAGCTGG 


TAAATCTGTG 


CTTTTTCAAC 


660 


TCTATAAAGA 


GGAGTTACTA 


GCAACTGGGG 


TAGACGAGGA TCAGATTATA TTCATCAATT 


720 


TCGAAGATTT 


GAGTTACTAT 


GATCTGCGAC 


ATTTTCAAAC 


ATTATTCGCT TATATAAAAG 


780 


ATCAATTAGT 


TAGCAAGAAA 


ACATACTATA 


TCTTTTTAGA 


TGAAATTCAA 


TATGTTGAAA 


840 


AATTTGAACT 


GGTAGCAGAT 


AGTCTATTCA 


TCTTAGCAAA 


TCTAGACCTC 


TATTTGACTG 


900 


GATCTAACCC CTACTTTATG AGTAGCCAAT TAGCAACAAA CTTGACTCGT 


CGGTATGTTG - 


~~ 960 


AGATAGAGGT 


TCTTCCTTTG 


TCATTTGAAG 


AATATCTATC 


AGGTCAATCT 


CTCACAGAGA 


1020 


ATCTGAATAC 


AACAGAAATT 


TTTAACAATT 


ATCTCTTTAG 


TGCTTTCCCT 


TACTTATTGC 


1080 


AAACATCATC 


TTACGATGAA 


AAAATTGACT 


ATCTCAGAGG 


AATATATAAC 


TCCATACTGT 


1140 


TAAATGATAT 


TGTCACTAGA 


TTGGGAAAAC 


CAAATCCTAC 


TATTATTGAG 


CGCATTGTCC 


1200 


GAACCCTTCT 


CAGTAGTACA 


GGTAGCTTAA 


TATCAACAAA 


TAAGATTCCC 


AATACCCTAG 


1260 


TCAGCCAAAA 


TGTTTCAATA 


TCCCATAATA 


CTTTGGAAAA 


TTATTTGACA 


ACTTTGACAC 


1320 


ATAGTTTACT 


TTTTTATTCC 


GTTCCACGTT 


TTGATGTAAA 


AGGTAGAGCA 


TTATTGCAAC 


1380 


GTTTAGAAAA 


ATATTATCCC 


GTTGATTTAG 


GTTTACGACA 


TCTCTTATTA 


CCAGACCAGA 


1440 


AAGAAGACAT 


TAGGCATATC 


TTGGAAAATA 


TGGTATATTT 


GGAATTGAGA 


CGTAGATATT 


1500 


CACAAGTATA 


TGTTGGTAAT 


TTAGATAAGT 


ATGAGGTTGA 


TTTTGTTGTT 


GTAACTGATC 


1560 


TTGGCCACTA 


CGCTTATTAT 


CAGGTCAGTG 


AAACAACACT 


TGCTCCAGAA 


ACACTAGAAA 


1620 


GAGAACTTAG 


ACCACTAGAA 


GCCATTAAAG 


ATCAATTCCC 


TAAATATCTA 


TTAACAATGC 


1680 


ATACGATTCA 


GCCAACAGCC 


AATTACAATG 


GAATCGAGAA 


GAAAAGCATT 


ATAGATTGGT 


1740 


TACTAGAAAA 


ATAGATAAAT 


ATAAATCATA 


CAGCTAATTA 


GATTTGCAAC 


AGTCTGTTAT 


1800 


CAATGATTCT 


ACCCAAATCC 


TAACAAGATA 


TAGTGAATTT 


CGAATACGCT 


ATATAATACG 


10DU 


GACACTTGAA 


AATAGAAATT 


GGGGATGAAA 


GGGGATCTAT 


AATTTCTGGA 


AGTACTATCA 


1920 


AAAATTAATA 


TCATAGTCTT 


ATTAGAGAAT 


AGCATCACCC 


ACTTTCTCAA 


ATAAGATTAA 


1980 


ATTGTAACTG 


AATTATAATG 


AAAAAGAGAC 


TGAGCAATCA 


GTCTTTAAAA 


TCAGAAAAGC 


2040 


GCATAGTATC 


AGGTATTGAA 


CAACCTTGAT 


AATATGCGTT 


TTATTATGGA 


AATATTTGCT 


2100 


TCATTTTCTC 


CTGAAATAGA 


GCTTTTGCTA 


TCCTATTTTT 


CTCTATTTCT 


AATGATTTAC 


2160 


TTCAACTTCT 


TACCTCTTGG 


GAAAAA 








2186 


(2) INFORMATION FOR SEQ ID NO: 180: 









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3236 base pairs 
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(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

GTCACACGTT TGACTTCACG TATTTCATAA GTATAAACTT TCTTTTTATC GGTTAGATAA 60 

ATCTTCATGC CATTTTTAGC ATTATCTAAA GGAGAAAATA ACATTTTATT AGCATTATCA 120 

ACACCAAAGA TATGGTGACT AGCTAGACTA TAATTTCCTT CTCCCATTAC TTGCTCGCGT 180 

TTCATTGTAC CAGCTCCGTA GAAGAGATTA ACATTATCAA GTCCTTTAAA AATCGGCAAA 240 

TTCATTTCCA ATTCAGGAAT TGCAATTCCC CCAATAACTG GTAATTTTTG AGCATCCCAT 300 

TGAGAAGTTA GAACAGCTTC CGAAGAGATA GCTTTGACAG AATCAAAGTC AAAATTGCCT 360 

TCTGTATCCT GATTTTCTTC TAATTTTTCT TTTGATACCT GGCTAACTTG ATACTTATTG 420 

GTATTCCAGA CTATGAAAAT ATTTCGAATT TGAGTATTAA AAATCAAAGC CAGTGACAGT 480 

AATATCAGAA ATCCTGCTAG GATATTTGTC AGCAGATTTT TTCGCTTGTT TTTCTTTTTA 540 

TTATTTTTTT GAGACATTAT GCTTCACCTT CTGTTTCGTT TTCTGTCCCA ACTTCTTCTT 600 

TTTCTGCCAC CGCAACCGTT GTGAAAGTCA CTATCTGAGC ATCTTGATCC AGGCGCATTA 660 

CTTTAACTCC CATAGTTGCA CGTCCT G TTT GTGAAATATT GGCAAGATTG GTTCGAATCA 720 

TGACACCTGT ATCAGTGATA ATCATCAAAT CCTCATCCCC TTGAACAGTC ATAAGACCGG 780 

CCAGCAAGCC A T TTTTTTCC GTAATTTTAG CTGTCTGCAT TCCCTTACCA CCACGACCTT 840 

TTGTTGGGTA TTCAGTAGCG ACTGTACGCT TACCATATCC TTTTTCTGTG ATAATAAGAA 900 

CCTCATCTTG ATCAGTAATC AAGCTGGCAC CAACAACTGT GTCTCCTTCA CGAACGTTAA 960 

CACCTTTCAC ACCAGTGGCG ATACGGCTCA TACCACGAAC GGCTGATTGA TTAAAGCGAA 1020 

CTGCATAACC AAACTTGGTA CCAATGATAA TATCCATATC TCCTTCTGCC AACAAGACAT 1080 

TGATTAACTC ATCTTCATCC TTTAAATTCA GCGCTTTGAG ACCATTTTGA CGAATATTGG 1140 

CAAACTCCTT AACACTGGTT CTCTTCACAA TACCGTGACG GGTTGTAAAG AAGAGATAAG 1200 

CATCATCACT GCGATCAGAC TCAACATTGA TAACCGTCTG AATACTTTCG TCTTCATCCA 1260 

ATTTCAAGAG ATTGACTACT GGTAGCCCTT TGGCAGTCCG AC CAT ACT C A GGAATTTCAT 1320 

AACCTTTAAG ACGATAGACA CGTCCCTTGT TTGTGAAGAA GACCAGATGA TCATGGGTGC 1380 

TAGTTGACAC TAACTCACGA ACAAAGTCAT CATCTTTCAC TCCCGTTCCT TGGACACCAC 1440 

GACCCCCACG TTTTTGAGCA GTGAACTCGT CCTGATCCAA ACGCTTAATG TAGCCTCTGT 1500 

TAGAAAGGGT AATCAAGACA TCCGATTCTT CAATCAAGTC CTCATCCTCG AGACTCAAGA 1560 
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CCTCTCCAAT CATCAACTCT GTACGGCGCT TATCAGAAAA TTTACGTTTA ACTTCATCCA 1620 

ATTCCTCTTT GATAATTTGA GAAACACGTT CAGGCTTAGC AAGAATATCT GCTAAATCCG 1680 

CAATCAGAGC CAAGAGGTCA TCATACTCAC ATTGAATCTT ATCGCGTTCC AAACCTGTCA 1740 

AACGACGAAG ACGCATATCA AGGATAGCTT GACTTTGACG TTCAGAAAGC TTAAACTTGC 1800 

TCATCAACTC AGCTTCAGcT TCCGCATcCG CTTCACTACC ACGGATGATA CGAATCAyTC 1860 

GTCGATATGC TCTAGCGCAA TCAAGAGACC TTCTAAGATA TCACCCCGCG CTTCCGCTTT 1920 

TTCCTTATCA AAACGTGTAC GACGAACAAC CACTTCTTTT TGGTGCTCGA TATAAGCATC 1980 

CAAAATCTGA CGAAGAGACA AAATTTTCGG TATACCATTT TGGATAGCGA GCATATTGAA 2040 

AC CAAAATTG GTTTGCATTT GGGTCATTTT GAAGAGGTTA TTGAGAATAA CATTGGCTGA 2100 

GGCGTCGCGC TTGACTTCAA TAACAAATCG AACACCTTCA CGGTTTGACT CATCACGTAC 2160 

TGCTGTGATA CCCTCAATGC GTTTTTCCTG AACCAAGCGA ACAATATGCT CATGCACCTT 2220 

GGTTTTATTG ACCATGTAAG GAAATTCTGT TACAACGATA CGCTCACGAC CAGTCTTAGT 2 280 

CGTTTCAATC TCTGTACGAG AACGTAGGAC AATCGAACCT TTACCTCTTT CATAAGCCTT 2340 

ATGGATACCT GATTTCCCCA TGACAAGAGC ACCAGTTGGA AAATCTGGTC CAGGCAAGAC 2400 

TTCCATCAAG TCCTTGGTAG TCACTTCAGG ATTATCCATG ACCAACTTCA CTGCATCAAT 24 60 

gGTTTCACCC AGATTATGAG GTGGAATATT GGTTGCCATC CCAACCGCGA TACCAGTTGC 2520 

TCCATTAACC AAAAGGTTTG GAAAACGCGC TGGCAACACC AAGGGTTCCC GTTCATTGGC 2580 

ATCATAGTTA TCAACGAAAT CAACTGTATT TTTGTTGATA TCACGAAGCA TTTCCAGAGC 2 640 

AATCTTGCTC ATACGTGCCT CGGTATAACG TTGAGCGGCA GCACTATCTC CATCCATGGA 2700 

ACCAAAATTC CCATGACCAT CTACAAGCAT GTAACGGTAG CTCCACCATT GAGCCATACG 2760 

GACCATGGCT TCATAAATAG AGGAATCCCC GTGTGGCTGA TATTTACCCA TGACATCCCC 2820 

TGTAATACGA GCAGATTTTT TATGGGGTTT GTCTGGCGTC ACACCCAATT CATTCATTCC 2880 

GTAGAGAATG CGACGGTGAA CAGGTTTTAA GCCATCTCGA ACATCAGGAA GAGCTCGCGC 2 94 0 

TACGATAACA CTCATGGCGT AGTCGATAAA ACTTGCCTTC ATCTCCTTTG TCAGATTGAC 3000 

ATTCACTAAA TTTTTATCCT GCATTAATAA ATGCCTCATT TCACAATTAG TAAGTAACAA 3060 

CATTATACCA TAAATTCCCA TCTATTTCAG CCTCTAAACC ACTAAAACGT TTACATCGAG 3120 

AACTATAAGG CATATTCGTG ACAAAGTTTT TTAAAAGTGA TACAATGAAG TTGTCTAGGG 3180 

AAAACCCCTA ATAGAATAAG GAGATGGTTA nACAATGACT CTGACTAACA CACAAA 323 6 
(2) INFORMATION FOR SEQ ID NO: 181: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
f D > TOPOLOGY: linear 



<xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


181: 






AGGTCCTGAA 


GTATTGGAAC 


AGGAAGGTCA 


AGAGTTTTTG 


GAACATTTCA 


AAAAACTCTT 


60 


CGAGTCAGTT 


GAAGTAGTAG CCATCTCACG TACTCTGCCA GCTGGCCTTC CAGTTGATTA 


120 


CTATGCGAGC 


TTGGTAGAAC 


TTGCTAATCA 


AGCTGGCAAC 


CATGTAGTCT 


TGGACTGCTC 


180 


AGGTGCAGCA 


CTTCAGGCTG 


TTCTTGAATC 


ACCCCATAAA 


CCAACAGTCA 


TCAAACCAAA 


240 


TAATGAAGAA 


TTGTCTCAGC 


TTCTTGGAAG 


AGAAGTTTCT 


CAGGATTTGC 


ATGAATTAAA 


300 


AGAAGTACTT 


CAAGAACCTT 


TGTTTGCAGG 


GATTGAATGC 


ATTATCGTTT 


CACTTGGTGC 


360 


CAACGGTACT 


TTTGCCAAAC 


ATGGTGACAC 


TTTCTACAAG 


GTAGATATTC 


CTAGAATTCA 


420 


GGTGGTAAAT 


CCTGTTGGAT 


CTGGAGACTC 


TACTCTGCCA 


GGAATTTCTT 


CAGGACTTCT 


480 


TCACAAAGAA TCGGATGCAG AATTACTCAT CAAGGCAAAT CTCCTTCGTA 


TGCTCAATGC 


540 


TCAAGAAAAA ATGACTGGTC ATGTCAACAT GGCCAACTAT 


CAAGCTCTAT 


ATGATCAATT 


600 


AATAGTAAAA 


GAGGTATAAA 


ATGGCTTTAA 


CAGAACAAAA 


ACGTGTACGC 


TTAGAAAAAC 


660 


TTTCTGATGA 


AAATGGTATC 


ATCTCAGCTC 


TTGCATTTGA 


CCAACGTGGT 


CCTTTGAAAC 


720 


GCCTCATGGT 


TAAACACCAA 


ACAGAAGAAC 


CAACTGTGGC 


CCAAATGGAA 


GAACTTAAAG 


780 


TCTTGGTAGC 


AGATGAATTG 


ACTAAATATG 


CTTCATCTAT 


GCTTCTTGAC 


CCTGAGTATG 


840 


GACTTCCAGC 


AACTAAAGCT 


CTTGATGAAA 


AAGCTGGTCT 


TCTCCTTGCT 


TATGAAAAAA 


900 


CAGGTTATGA 


CACAACAAGC 


ACAAAACGCT 


TGCCAGACTG 


CTTGGATGTT 


TGGTCTGCAA 


960 


AACGTATTAA 


AGAAGAAGGT 


GCAGATGCAG 


TTAAATTCTT 


GCTTTACTAT 


GATGTAGATA 


1020 


GCTCAGACGA 


ACTCAATCAA 


GAAAAACAAG 


CCTACATCGA 


ACGCATCCGT 


TCTGAGTGTG 


1080 


TGGCTGAAGA 


TATCCCATTC 


TTCCTTGAAA 


TCCTTGCTTA 


CGATGAAAAA 


ATTGCGGATG 


1140 


CAGGTTCTGT 


AGAATACGCT 


AAAGTAAAAC 


CACACAAAGT 


TATCGGCGCT 


ATGAAAGTCT 


1200 


TTTCAGACCC 


ACGCTTTAAC 


ATTGATGTTT 


TGAAAGTTGA 


AGTTCCTGTT 


AACATTAAAT 


1260 


ATGTTGAAGC 


JcTCGCTGAAG 


GTGAAGTACT 


TTATACACGT 


GAAGAAGCAG 


CAGCCTTCTT 


1320 


CAAAGCGCAA 


GATGAAGCAA 


CGAACTTGCC 


ATACATCTAC 


TTGAGTGCTG 


GTGTATCAGC 


1380 


TAAACTCTTC 


CAAGATACTC 


TTGTATTTGC 


TCATGAATCA 


GGTGCGAACT 


TTAACGGAGT 


1440 


TCTTTGTGGC 


CGTGCTACAT 


GGGCAGGATC 


AGTTGAAGCT 


TACATCAAAG 


ATGGTGAAGC 


1500 
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AGCAGCTCGC GAATCCtCCC ACAACTGGAT TTGAAAACAT TGACGAACTC AACAAAGTTC 1560 

TTCAAAGAAC AGCAACTTCA TGGAAAGAAC GCGTGTAAGA AAGTCCTCCT AGTTTAGGAA 1620 

CATGAATCTA AAAAAATTTA AAAAAACTTG TATGTAAAGG CTTACAAAAT AACTTACTTG 1680 

TGCTATACTT AAATCACAAG TTAATATGAA TTAGAAAGTA ACTATATGAA GTATAATAAA 1740 

AATAGGATAT AGTTTATTTT ACGAGCTAGG AAGGAAAAAT ACGGAAACAA TATTGCCAGA 1800 

ATAAACTATA TTTAGATCCA CATTTCATTC ATTGTTTTAT AAAAGGAGAA GATAAACGGC " I860 

TACTAAAAAG AGTTTTAAAG CGTTAGTTGT AGGACTAGGT ATTGTTTCAA TATTCTTATC 1920 

AGCCTTACCT ATGGTTACTG GTTCTGTATT TGCAGATAGT GCCCTAACTA CAGTAGATAA 1980 

AGCAAATGAT ATTGTTTTGA ATGTTGATGG GAATAAATTT TATAATGTTT CGCTTTCAGA 2040 

AGATATTGTA AATGCTGGTC AAATTTTGGA AGATTATTTT TATGTAGATA AATTTGGAAA 2100 
TATAAATTTA AAAGGCACTC CTGAAGAGTT AGCAAAAAAT ATTGGTATTT CTGTACAAGA 
AGCAAGTTTG ATGTATGGAG CTGTAAAAGA GTTACCCAAC GTTTACGAAA GAGGTCCTCT 
AGGTTTTCGT TTCAATCTTG GTCCTCAAGT GAGGGGGATG GGTGGCTGGG CTGCTGGAGC 
TTTCGCTACT GGATATGCTG GATGGCATTT GAAACAATTT GCGCTTAATC CTGTTACATC 
TGGATTTGTT GCTGTAATAA GTGGTGCGAT TGGCTGGGCT GTAAAAACTG CTCTAGAAAA 

TTATTGGACA GTTGCTGTAG CTACAGTAGA AGTGCCGTTT GTGAACCTTG TTTACACCAT 2460 

AGATTTACCT TAGAGGTTAT TTCTTTATGA ATCATTCTTT TAAAAAAATA ACTGTATTTT 252 0 

GTTTTATAGT TTCTTGTGTT CTTTGTTTAT TAGACTTAAT GAATTTTAAA AATGTAGCTA 2580 

CTTTITTATT TTTCTGTCTT CCTCTTTTTG TTTTGATTTA CAAAAATAAA TAAAAACAGA 2640 

GCCTCTGTTT GATGAATTTT AGAACATAGT TAAGTTTTAA AAAAAGTTGT ATGTAAAGGT 2700 

TTACAAAATA ACTTACTTGT GCTATACTTA AATCACAAGT TAATACAAGG TGAGTGTTAC 2760 

TAAGTAATAT TAGGCATGAT CACAGGTGAA TTAGAAATCA GCTGATTTTC TAGTTCATTT 2820 

GTGGTCATTT TTTGTACTTA TATACCTTTA AGATATAAAA GGAGGTTGAC ATGTATCGAA 2880 

TTCTAAATCC AATGAATCAC AATGTCTCGC TTGTCAGAAA TGATAAGGGA GAAGACGTGA 2940 

TTGTAATTGG TAAGGGAATT GCATTCGGAA AGAAGAAGGG GGATTTGATT GCTGAAAATC 3000 

AGGTTGAGAA AATCTTTCGG ATGAAGACCG AAGAGTCCAG AGAAAACTTT ATGGCTCTTC 3060 

TCAAAGATGT TCCGCTTGAT TTTATCACAG TGACCTATGA AATCATTGAT AAGCTATCAA 3120 

AGAAATATCA TTATCCGATT CAAGAGTATC TCTATGTAAC CTTGACAGAT CATATTTACT 3180 

GTTCTTATCA AGCTCTAACT CAAGGAAGGT ACAAGGATAG TAATCTGCCA GATATTTCCG 3240 



2160 
2220 
2280 
2340 
2400 
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CTAAGTATCC 


TGTCGCTTTT 


CAAATCGCAA 


ATGAAGCTTT TGAAATTTAC 


CGTCAGAAGC 


3300 


TAGCAGATCA 


TTTTCCTGAG GACGAAATTA TTCGGATTGC TTATCATTTC 


ATTAATGCTG 


3360 


AAGGTGAAAA 


TGAAGTGGAA CTTGTGGAGT CGATTGATAA GAGGAAAGAA 


ATTCTCAGGA 


3420 


ATGTTGAAGA 


AGTTTTAACG GACTATGCAA TTCAACGAAC TAAAAAGAAT 


AACCATTTCT 


3480 


ATGATCGCTT 


TATGATCCAT TTGAATTATT TCTTGGATTA TTTAGACAGA 


TCTAGAGATC 


3540 


ATAACCAATC 


ACTTCTGGAT 


ATGGAAGATC 


ATATTAAACA 


ATCCTATCCA 


AAAGCCTTCG 


3600 


AGATTGGTTC 


CAAGATCTAT GATGTGATTA CCCAACATAC GGGTCTTGAT 


TTGTATAAAA 


3660 


GTGAACGAGT 


TTATCTAGTT CTACATATCC AACGTTTATT GTCATAAAAA 


TTTATTTAAA 


3720 


ACTATATAAG 


GAGAATTCTA 


TCATGAATAG 


AGAAGAAGTA 


ACATTGTTAG 


GTTTTGAAAT 


3780 


CGTAGCCTAT 


GCTGGCGATG 


CTCGTTCAAA 


ACTATTGGAA 


GCCTTGAAGG 


CTGCTGAAGC 


3840 


TGGTGATTTT 


GAAAAAGCGG 


ACGCTCTGGT 


AGAGGAAGCT 


GGTAGCTGTA 


TTGCAGAGCC 


3900 


TCACCACGCG 


CAAACAAGTC 


TATTGACTAA 


GGAAGCTTCA 


GGTGAGGACT 


TGGCTTATAG 


3960 


TGTAACCATG 


ATGCATGGCC 


AAGACCACTT 


AATGACAACT 


ATCTTGTTAA 


AAGATTTGAT 


4020 


GCATCATTTA 


ATTGAACTCT 


ACAAGAGAGG 


AGTTCAATAA 


TGAATAAACT 


AATTGCATTT 


4080 


ATCGAGAAAG 


GAAAGCCTTT 


CTTTGAAAAA 


CTATCTCGTA 


ATATCTATCT 


TCGTGCTATT 


4140 


CGTGATGGTT 


TCATTGCAGG 


TATGCCTGTT 


ATTCTCTTCT 


CAAGTATCTT 


TATCTTGATT 


4200 


GCCTTTGTAC 


CAAACTCATG 


GGGCTTTAAA 


TGGTCTGATG 


AAGTTGTAGC 


CTTTCTGATG 


4260 


AAACCTTATA 


GCTATTCTAT 


GGGTATTCTG 


GCTCTCTTGG 


TACCTGGTAC 


AACAGCTAAG 


4320 


TCATTGACTG 


ACTCAGTAAA 


CCGGAGCATG 


GAAAAAACCA 


ATCAAATCAA 


GTATATCTCA 


4380 


ACATTGTTGG CAGCAATTGT TGGTTTGTTG ATGTTGGCAG CTGATCCTAT 


CGAAAGTGGT 


4440 


CTAGCTACTG 


GATTCTTGGG 


GACAAAAGGT 


TTGCTTTCAG 


CCTTCCTTGC 


TGCCTTTGTT 


4500 


ACTGTAGCCA 


TCTATAAGGT 


TTGTGTTAAG 


AACAACGTCA 


CTATTCGTAT 


GCCTGACGAA 


4560 


GTTCCACCAA 


ATATCTCACA 


AGTCTTTAAA GATCTCATTC 


CATTCACTCT 


ATCTGTTGTT 


4620 


TCTCTTTATG 


CTCTTGACTT 


ATTAGCACGT 


TATTTTGTTG GTTCTAGTGT 


GGCAGAATCA 


4680 


ATCGGTAAAT 


TCTTCGCACC 


ACTCTTCTCA 


GCAGCAGACG 


CATACCTTGG 


TATTACCATT 


4740 


ATCTTTGGTG 


CCTTTGCCTT 


CTTCTGGTTT 


GTTGGGATTC 


ATGGTCCATC 


TATCGTTGAA 


4800 


CCAGCTATCG 


CACCTATTAC 


CTATGCCAAT 


GCCGAAGTTA 


ACTTGAACCT 


TCTCCAACAA 


4860 


GGGATGCATG CAGACAAGAT TCTTACTTCT GGTACACAAA TGTTTATCGT TACCATGGGT 


4920 


GGTACAGGTG 


CGACATTGCT 


CGTTCCATTT 


ATGTTCATGT 


GGTTGACAAA 


ATCGAAACGT 


4980 


AACCGTGCAA 


TCGGACGTGC 


TTCAGTAGTT 


CCTACCTTCT 


TCGGTGTAAA 


TGAACCAATC 


5040 
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TTGTTTGGTG CACCTCTTGT TTTGAATCCA ATCTTCTTCA TTCCATTTAT CTTTCCTCCA 5100 

ATTGCAAACG TATGCATTTT CAAATTCTTT ATTGAAACTC TTGCAATGAA CTCATTCACT 5l60 

CCTAATCTAC CATGGACAAC TCCAGCTCCA CTAGGTCTAG TTCTTGGAAC TAACTTCCAA 5220 

GTGCTATCAT TCATTCTTGC TGCCCTTCTA ATCGTGGTTG ACCTTGTCAT TTACTATCCA 5280 

TTCCTTAAGG TCTATCATGA ACAAATTCTT GAAGAACAAC GTTCAGGTAA GTCTAATGAT 5340 

GAATTGAAAG AAAAAGTTGC TGCAAACTTC AACACTGCAA AAGCGGATGC TATTCTTGAA 5400 

AAAGCGGGTG TCGATGCAGC ACAAAATACC ATCACTGAAC AAACAAATGT CCTCGTTCTC 5460 

TGTGCAGGTG GAGGAACAAG TGGTCTCCTT GCAAATGCTT TGAATAAGGC AGCAGCAGAA 5 520 

TACAATCTCC CTGTGAAAGC AGCAGCAGGC GCCTATGGTC CTCACCGTGA AATGTTACCA 5580 

GAGTTTGATC TTGTTATCCT TGCCCCTCAA GTTGCTTCAA ACTTTGAAGA TATGAAAGCA 5640 

GAAACAGATA AGCTCGGTAT TAAACTAGCG AAAACAGAAG GCGCTCAATA CATCAAATTA 5700 

ACTCGTGATG GAAAACGTGC TCTTGCATTC GTACAAGCGC AATTCGATTA ACGCTAGAGA 5760 

CTCTGAAATA CTCTCCCATC GTTACGGAAA TCGCTATGGC GAATTTCCTA TTATTAATTC 5820 

GTCGCTAAAA AGATATCGTT TTTACCTCCT CATGTCACAA TTCCGTGACT TGGTACAAGA 5880 

AGTGAGATGG AGAAGGATGG CTCACTGACT CCTCTCCTCT CACTTTTACT TTATTTAAAT 5940 

CAAGAAATAG GTGAAAAAAA TGACAAAAAC ACTTCCAAAA GACTTTATTT TTGGTGGCGC 6000 

AACAGCTGCT TATCAAGCAG AACGTGCTAC ACATACTGAT GGAAAAGCAC CAGTTGCTTG 6060 

GGATAAATAT CTTGAGGATA ACTACTGGTA CACTGCCGAA CCAGCTAGTG ATTTTTACAA 6120 

TCGATATCCA GTTGACCTCA AGCTAGCAGA ACACTATGGT GTCAATGGTA TTCGAATTTC 6180 

TATTGCTTGG TCACGTATTT TCCCGACTGG TTACGGCCAA GTAAATGCTA AAGGTGTTGA 6240 

GTTTTATCAT AATTTATTTG CAGAGTGTCA CAAACGTCAT GTTCAGCCTT TTGTAACTCT 6300 

TCATCACTTT GACACGCCAG AAGCTCTCCA CTCAAATGGA GACTTCTTAA ACCGTGAAAA 6360 

TATCGAACAT TTTCTAGACT ACGCTGCCTT CTGTTTTGAA GAATTTCCAG AAGTAAACTA 6420 

TTGGACAACC TTTAATGAAA TTGGACCAAT CGGTGATCGT CAATATTTGG TTGGGAAATT 6480 

CCCTCCAGGT ATCCAGTACG ACCTTCCCAA AGTCTTTCAA TCACACCACA ATATGATGGT 6540 

GTCTCATCCA CCCGCGGTAA AATTGTACAA AGAGAAAGCC TATAAAGGGG AAATTGGTGT 6600 

TCTTCACGCC CTGCCAACTA AATATCCTCT AGATCCTGAA AATCCAGCAG ATGTTCGTGC 6660 

AGCTGAGTTG GAAGATATCA TCCACAATAA ATTCATCTTA GACGCAACTT ATCTAGGTCG 6720 

CTATTCAGCT GAAACCATGG AAGGTGTCAA CCATATCTTA TTAGTCAATG GTCGTACTTT 6780 
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GGATCTTCGT GAAGAAGATT TTACAGCATT AGAACCTGCA AAAGACTTGA ATGATTTCCT 6840 

AGGAATCAAC TACTATATGA GTGACTGGAT GGAAGCCTTT GATGGAGAAA CTGAAATTAT 6900 

CCATAATGGT AAAGGTGAAA AAGGAAGCTC TAAGTATCAA ATCAAAGGTG TTGGTCGTCG 6960 

TGTAGCTCCT GACTATGTAC CACGCACGGA TTGGGATTGG ATTATCTACC CTCAAGGTTT 7020 

GTATGACCAA ATCATGCGTG TGAAGAAAGA TTATCCTAAC TACAAGAAGA TTTACATCAC 7080 

TGAAAATGGT CTCGGCTATA AAGATGAGTT CGTTGATAAC ACTGTTTACG ATGATGGTCG 7140 

TATTGATTAC GTGAAGCAAC ACTTGGAGGT TTTATCTGAT GCCATTGCAC ATGGAGCTAA 7 200 

TGTAAAAGGT TACTTCATTT GCTCATTAAT GGATGTCTTC TCATGGTCAA ACGCTTATGA 7260 

GAAACGTTAT GGTCTCTTCT ACGTAGATTT TGAAACTCAA GAACGTTATC CTAAGAAATC 7320 

AG CTC ACTGG TACAAGAAAG TAGCGGAAAC TCAGATTATA GACTAGTAGA ATTAGTCATT 7380 

AGATATAGAA TTTTAGTGAG TCAAAAAGAT GTTCAAAGAT TTTATCCAAT CTATTTATGA 7440 

AAAAAAGTTT ATATTATAAA TTTCGAAAAA TGCTCTCAAA TACCGTGTTT GACGAGTGAA 7500 

GAATTGAAAA GTCTTGGAAA ATGGTATGTC TCGACTGGTA AAGAATGGAT TTGTCATTCA 7560 

GATGATGAGC TGGAAGAATT TAAAAATCTA TTTTTAAATT TTATCAATCC TGAAGAATGG 7 620 

GATACTATCT CCTTTGATTC AGATTTTATG CCGTTTCAAC AATCGTAACC AATTTCTCAA 7680 

AAAAGTTAAA TCTTATATTT AGTACTCTGT AAAACTCTTA TCTAATCACG TTGCTTATAC 7740 

TCAATGAAAA TCAAAGAGCA ACTTTAAACT AGGAAGCGAG TCGCAGATTT CTCAATGCAT 7 800 

AGCTTTGAGG AATTGGGCAA AAAGTCTTTG ATATAGAAAA ACGCATAGTA TCAGGTGTTT 7860 

CAACACCTGA TACTATGCGT TTTATTGTGG GAAGATTTAC TTTTTTTCTT CTGAAATTGA 7920 

GTTGTTACCC AGGCTCTTTC AGTTTATPAA GGCTTGATGA CTTTAATGTG TTTAGATAGC 7980 

TTAAAAAGGA TTGAATCACT TAGTTTAGAA TCTGAAACAA TAGTATCAAG ATTTGATACA 8040 

TTATAAAAAG TATAAAAATC AAACTTATTG AACTTCCTAT GATCTGCGAG TAAATATTTT 8100 

TTATTAGAAT TATTTAAAGC GATGCGTTGA GCCTCTCCCT CTTCCTCGCT AAAAGTAGCT 8160 

AGAGCTCCGT TTTGAATACC ATTACAGCTA ACGAAAGCTT TAGAAAATTG GAGATTAGAG 8220 

AGATTTTGTA GGGTCAATGT ACCAACAAAA GCACCTGTAA TATCGCGATA ATTTCCACCT 8280 

ATTAAAATCA AATCTGTTAA TTTTCGTTCG CTTAAAATCA GAAAAACAGG TAGACTGTTG 8340 

GTTACGACGC GGATATTGTC AATAGGCAAC TCACGCGCAA AAAACTCTAA TGTTGTTCCT 8400 

GGTCCAATGA AAATAGTTTC TCTTTCTTCT ACTAGACTGC CTGCAAAATG GGCTATTTCT 8460 

TGTTTTTCTG CCGTTTGGAG GGCTTGTTTT TCAATATTTG ATCGCTCATT AGTCAAAAGG 8520 

GAGTTGGTTC GAAGTTTTTC AGCTCCACCA TGCACACGAA TCAGCAAATC TTTATCAGCT 8580 
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AATTCCTCTA AATACCCCCT TGCAGTCATA TCTCAAACGC CTATTTCCTC CATAATCTCT 8640 
TTAACTGTTA T 

(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3786 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEONESS : double 

. . ...... f . D .j TOPOLOGY: linear - - - ■ 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 
AATCTCCAAT CAGTGCCACT TCAGCTACAA AGAAGAGGAG GATAATAACT CCGTTCACAA 
GGACAGACAA GAATAATTGA TAGAAGGAGT CGCTTTCACT TGCTTGACTT GGTCTTGTAA 
TGATwTGGAG ACTGGCAAGC AGAATGATTC CAATGCTAAT CACACACAAG AGGGCTGTAA 
ATCGTAGGCT ATCAAAGAAA GCAAAGAAAC TAGCAATACC AGTCACCAmG ATTGGAATTG 
CCAAGAGTTG ACTATATTGT TGGAGAACCT TGTCTAGCGT CCAGTCCTTT TCCTGGTCCA 
TAAATCGTCT CACAACGAAA CTACCCAAGA GGAATGAAAA GAAGAAGAGT GTTCTCGCTA 
CTAGGATAGA GATGATAGAA AAAAGAGTTA AAGGAGCTAG CTGCTCAGGG AAGCGACTCT 
TAATGCTTGC TATATGTCCA TAGTAAGCAT GTTTGATGTG ATAGATACTA AAGAAAAAGG 
AAGATGCAGA AAACAGAATG AGCAAGAGAA AGGCTGTGTA ACTGTGTGTG ATACTTCTTT 540 
CCAACTTACT TGTAGGAGAT TTGATCGCTT CCACTAGCCA AGACCAAAAA TCAAGCACTT 600 
GCTCTTTCCA TTTATCCCTA GATTTTGGAG CTTGGTCCGG GATATAAGGA CTTTCTAAAG 660 
ATTTACTGAT AAGAAGTGGC TCTTTCGTGG TTGCTTTTTG CTGAGGAAGA GCTTCTTGGC 720 
TCTCTTCAGC TATAGTGACT TTTTCTGTTT CTTTAGAAAG CTCTGGCTCT TCTTCAGTAG 780 
AATTAGATGC CTTCTTTTCT TCTATTTCTG TTCTCGCTTC ACTGTCTTCA GGAGCTTCAA 
TTTTCTCTTC TTGCTGGCTT TCCAATTCGA CTTCAGCTTG AGGGACTTCC TCCTCTAACT 
GAGTATTTTT TTCAATTGGT GTATCGAGAT CGGCTATCGT TTCTTCACCC TTGTCTGCAA 960 
CCTCTTGAGC TTGCTCTTCA GGCTTGTTCT TGCTTGTTGT TTTTACAAAA TCATTACTTT 1020 
CAAACCATTC TTGTTTCATG GTAGAACCTC CTTTTTAGTT AGATAAATAT GTTTCCATAG 
TAGCAAATGT AAGCGTTTTT GTCAACGTCT GCTTGGTGTG GATATTAGAT CAATATTATC 
ATCAGATCTC GCAATGAGTT GATCCTTGAC ATCGGTTTTT TCAGTTTTGT AAGGGTTGCT 1200 
TAATTCCGTA CCTCTTGATT CAGGCTTTTC TCTTGTGAAT TGGAAGATAG AACCATAGTT 1260 



60 
120 
180 
240 
300 
360 
420 
480 



840 
900 



1080 
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CCTTGAGATG TCCCAGTTAA TTCGTTGGCT TTCTTTCTCG TCTAGGATGA TTCTGAGATA 1320 

ATCTTTGCCA GTCAGTTCAA CCTTGCCATG GACTTGGATA TTTTCACCGT GGAAGTGATT 1380 

CTCTCTTGAC TCTAGCTGAC TATCTCTAAC AACTGTATCA AAGATATTAA CGATATTGGG 1440 

CGTTGTGAGT TTACTGTTTT TGATACGACT TCCTTCAATT CGGAGGATAT AGCTGTTTGT 1500 

ATTGAGGGTC GCATTTTCAA GGCTAGCATT TATGATGGTG GTTTGTCCGC GATTGGCTGA 1560 

GATGTTGATC CCTTTTAGAG TTCTCCCTTT TGGTAGTCGG AGAATAACTT CTTCAAAACG 1620 

ACTAGACTAG CTACTTGCGA TATGAAGAAT CCCACCAATT CCAGAAGAGA GAAACGGAGT 1680 

TTCAGACAGT TTCTTATCAG TGAGACTCAG AGTTCTATCG TTCTGATTGG TGATAAGATC 1740 

ATGGTGAGCA GAAAGAGATG GATGGTAAGA AATGTGGATT TGATCATCGA AAGAGTCTGT 1800 

GATGGTGAGC GTGTGTTGGT GGAGAGTAAT TTCTACCTTT TCGACTTCCT TGCCAAAGGT 1860 

TAGCTTTTCC GTACGGCTAT CATAGACAGG TTCTTTGGAC ATGGAAAGTA GGCTCTTAAC 1920 

CCCGTCAGAT TGGATACCTA CAAAAAGCAG GATAAAGCCG ATAACGGTAG TCACCACACC 1980 

AAAGATGAGA AATCCTTTTG TCCATTTACG CATGCTGATT ACCTCTCTTT CCTTTTTTAA 2040 

GAACAAATTG TACCAGACGA ACAATGAGTA GACCGAAGAA GCGAGTTGCA TAGGAAATGC 2100 

CAAGTAAAAC TAGCGAAGAA GCACCCATAG CCAGTAAACC AGAACCAAAA ATCAAGATAA 2160 

AGGCTGATTT GGCTTGGGCG AGGACAGTGA AACTTTCAAC TAAAAATAGG AATCCGCCGA 2220 

TGATACCCAG TATGGAAACT GCAAAGAAAG CCAGAATGAC AGTCAAAGCG GCTACAAGAA 2280 

TTGCGAACAG GGTCACGAGG ATGGCGATTC CCAGAGGAAT GCCGATAGGT GCTGCAAGCA 2340 

CGGCTAACAA GGCGATATGT AAAATTTGTC GGTTATTTTT TTGAGCGGGT GCTTCATTGA 2400 

TTTTTTTATC GAGAAGATTG GATAGAACTT CGTGGGCCGC TTCTTTGGGA GTTCCCAAAC 2460 

TAGCGATGAG TTCTTCTTCT CCTTCGACTC CAGCATCGTC AAAGAGCTCT CTGAAATAGT 2520 

CCATGGCTTC GATACGGTCA GCTTCAGGTA GTTTCTTGAG ATAGAGTTCT AGCTGACTCA 2580 

GCTATTCAGT TCTTGTCATG GCGGATACTC CCTTCTATGA TGCCATTGAT GGTGTCTGTA 2640 

TAGAGTGCCC ATTCATCTTT TAGGGTCAAG AGCTGCTCTA TACCACCGTT TGTCAAGGAG 2700 

TAGTATTTGC GCATGCGACC TTGGAACTCT CTAGAATAGG TTGTCAGAAA GCTATTGCCT 2760 

TCCAATTTTT TGAGAATCGG ATAGAGTGTG GATTCTTTGA TATTAGCGAT CAGCTTAATG 2820 

GTTTGGCTAA TCTCATAACC ATAAGAATCA CCCTGCTCCA GTACAGCCAA GATGAGAAAT 2880 

TCAATCAAGG CAGAGGATGT TGGAAAGTAC ATGGGAAACC TCCTTTTCTA ATGTGTAAGA 2940 

TTTTTATATA TAATTTTTCT ACACATACAT TCTACATCTA AAAGAAAGCC CTGTCAAGAG 3000 

AAATGTGTAA AATTTTTATA TATAAAAAAC TTCTAGCTAA AACTAGAAGT TTAAAGGATC 3060 
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TTATCCGCTC TGTCCACTGT AAAGACGGCC ACAGTCATCA GGATATCGAT GAGCAAGAGG 3120 

GCAGCTACAG ATGGTACCCA AGAGTGGAAC AGGTCAAAAC TGTAACCAAA GAGGGTTGGC 3180 

CCAAAGGCTG CTAGGATATA GCCTCCTGTT TGAGATAGGC CGGACAATTG GGCTGTCTTT 3240 

TCAGGGGCGC TTGTCTTGAG TGAAAAGTTG ACCATGACAT AAGGGAAGAG GGCACTGGTT 3300 

GCGGTTCCGA TGAGGAGATG GATGGCAAGC CAGTAAATGA AATTATTGAT TGGGAAAAAG 3 360 

AGCATGGAAA TCCCGACCAC ACCACCTAGT GAAACCACAG TGAGCATGAG CTGACGGTTG 3420 

CGAGTAGATA AACTGGTTGT CAGGCTTGGG ATGGTCATTG AAAAAGCAAT GCTAATCAGA 3480 

GATAAGATAG AAGTCAGCAA GCCAGCTTCG TGACTGGATA GACCTGCATG GATAGACATG 3540 

GTAGGTAACC AGCTCATGAC GGTGTAAAAG ATCAAGGATT GAAAACCTGA AAAGATAATA 3 600 

ATTGCCCAAA CCTGTTTATT ACGCATGACC TTTATTTGAC TTTTTTGTTT GGTTTGTGGA 3 660 

GCTAGTCTAT GATTATAGCG GTGATTTGCG AGCCAGACCA AAAAAGTTGC TAGACAGAGT 37 20 

AACGTGAGGA GAAGGATAAG TCCTTTCCAA GAACTGGCTT GTGTAATGGG CACAGCTAGA 3 7 80 

TAGGAA 3786 



(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3054 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

TCAGCTAAAA AACATTGCTA AATTCATTGA AGCTGGTGCT ACACATTCCG ATTCAACTTC 60 

TCACACGGCG ACCACCAAGA ACAAGGTGAG CGTATGGCAA CTGTTAAACT TGCGGAAAAA 120 

ATTGCAGGTA AAAAAGTTGG TTTCCTTCTT GATACAAAAG GACCTGAAAT CCGTACAGAA 180 

TTGTTCGAAG GTGAAGCTAA AGAATATTCA TACAAAACTG GTGAAAAAAT TCGTGTTGCA 24 0 

ACTAAACAAG GAATCAAATC AACTCGTGAA GTGATTGCGT TGAACGTTGC TGGTGCTCTT 300 

GATATCTATG ATGATGTTGA AGTTGGTCGT CAA G TTTTGG TTGACGATGG TAAACTTGGT 3 60 

CTTCGTGTGG TTGCTAAAGA TGATGCAACT CGTGAATTTG AAGTTGAAGT TGAAAACGAT 420 

GGTATCATCG CTAAACAAAA AGGTGTGAAC ATCCCTAACA CTAAAATTCC TTTCCCAGCT 480 

CTTGCTGAAC GCGATAACGA CGATATCCGT TTCGGTCTTG AACAAGGTAT CAACTTCATC 540 

GCAATTTCAT TCGTACGTAC TGCAAAAGAT GTGAACGAAG TTCGTGCAAT CTGTGAAGAA 600 
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ACTGGAAACG GACATCTTCA ATTGTTCGCT AAAATCGAAA ACCAACAAGG TATCGATAAC 660 

TTAGATGAAA TCATCGAAGC AGCTGATGGT ATTATGATTG CTCGTGGTGA TATGGGTATC 720 

GAAGTACCGT TCGAAATGCT TCCAGTTTAT CAAAAAATGA TTATCAAGAA AGTCAATGCT 780 

GCAGGTAAAG TTGTTATCAC TGCAACAAAC ATGCTTGAAA CAATGACTGA AAAACCACGT 840 

GCAACTCCTT CAGAAGTATC AGATGTATTC AACGCTGTTA TCGACGGAAC TGACGCTACA 900 

ATGTTGTCAG GCGAGTCTGC AAACGGTAAA TACCCACTCG AGTCAGTAAC TACAATGGCT 960 

ACAATCGACA AGAACGCTCA AGCTCTTCTT AATGAATACG GACCTCTTGA TTCAGATTCA 1020 

TTTGAGCGTA ACTCTAAGAC ACAAGTAATG GCTTCTGCTG TTAAAGATGC TACTAGCTCA 1080 

ATGGATATCA AATTGGTTGT AACTCTTACT AAGACAGGTC ATACTGCACG TTTGATTTCT 1140 

AAATACCGTC CAAATGCTGA CATCTTAGCA TTCACATTTG ACGAATTGAC AGAACCTGGC 1200 

TTGATGTTGA ACTGGGGTGT TATCCCAATG TTGACAGATG CTCCATCTTC AACTGACGA? 1260 

ATGTTCCAAA TCGCTGAACG TAAAGCGGTA GAAGCAGGTC TCGTTGAGTC AGGCGATGAT 1320 

ATCGTTATCG TTGCTGGTGT GCCAGTAGGA GAAGCTCTTC GCACAAACAC AATGCGTATC 1380 

CGCACAGTAC GTTAAGAAAA ATATAAAAAC CTATCATATC CAGCTTTAGA CCTTGTGTGA 1440 

TAGGCTTTTT GTATAGAGGG TAAGAAATAG GCAAAACTTT CATAATGGAT TGATACTCTT 1500 

CGAAAATCTC TTCAAACCAC GTCAGCGTCG CCTTACCGTA TATATGTTAC TgACTTCGTC 1560 

AGTTCTATCT ACAACCTCAA AGCAGTGCTT TGAGCAACtG CGGCTAGCTT CCTAGTTTGC 1620 

TCTTTGATTT TCATTGAGTA TGAAATAAGA TATGCACAAA TTGATTAGAA AGTCAAATGA 1680 

ATTTCTACAA ATGTTTTAGC AATCGTAATG TACTTGTCTA GATTCGATCT GATATATTTT 1740 

CGATTTAATG ATATGGTATT TAAAACCTCC AAAGTAGCTT ACTCCATTCT TTTACTTACG 1800 

TGAGTGTAGA TGTTATTTAC TGTTTTAGCG TTTTTGTGTT CCACTCTAAC CATTATAGCA 1860 

TTCTTCTCAG CTAGTGTACT AAGGAGTGTG TGCCTGAAAA TATGGGAACT AAGGGGCTGG 1920 

TTTATCGGTT TCTCTAGTTT AGTATTTGCC TTTTGCAAAG TGATCTTAAA TGCCTTTCl'C 1980 

TAAATTTACA TATCACTATT GTTTAACAAA ATCTAATCTA TTTTAGGTCA CTTATTCTTT 2040 

TTTTGAAATG TAGAATGAAC TTTTTCAAAG TTTTTCGAAT CTTTTAAAAT CTGTTTGCTT 2100 

TATATCGCCA TTCTCCCCCC TTTTTTAATT CTCCCTATAT AGCCTGACAG CTTTCCCGAT 2160 

GGTACGAATA TGGTTGCTTT CGTCTAGGTG GATGTCGGGG TATTCGCGAT TGACTTTTTT 2220 

TGAGGCAGCC TTGGCGGAGT TTCTTGACAT AGTTAGTGCC GTCTACTTGG AAGATGCCGA 22 80 

TGGTATTATA GTCAATCTGT GGGGTATTCT TGATAAATAG GTAGTCGCTG TTTCTTATCT 2340 

TTGGCTCCAT GGACTTGCTG ACGACATAAG CGATTGGGTC GTAGTCGTCT GGGATAATGG 2400 
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AAACTCCATA TCTAAATCGT TGTCCTGCAT CGAGCGGCTA CCTGCAGAGA TAAACTACCT 2460 

AACACGAGAG TAAGTAGTCT GTCTGTAGTC GTCCAGTCTG ATGATTTTTA CGATACTTCG 2 520 

TTTTTCTGAT CATACAGTTG CCTCTCGGCA TAGGTCAGAA CTTTACCTTG TCTGGGTGGT 2580 

TCCCGTTGGT CGTAGATAGA TTGGATATCG CTAGGAGAAT CCTTTTGAAC TGGAGGAAAG 2640 

AGGGCATCGA TCAAGCTACT GAATACTTTA ACTAAGTCAA ATATAGTATT TTTCTTAGTA 2700 

GACCTAACCC TTTTTTCATA ATTTCTAATG GTGTTTTTAC TTATACCTAT CTTAGTACCC 2760 

AATTCTTATT GAGTCCAACC ATTACTAGTC TATATTGTTT TATAGTTGAT TGAGTTTGGA 2820 

ATAGTACGCT GTAGCTGCTA AAACATTTCT AGAAATTAAT TTGACTTTCC TAATAGAGTT 2 880 

GTTCATATCT TATTTCAATC TATTATGTTT TTCACCTCTA ACAATCGCAA TCTCTTCT T T 2940 

ATCCATGAAT GAAATCGCTT TCTATTTTTG TAAGTAAAGC ATAACACGAA ATCCACGAAA 3000 

ATGAAAACCT TTGTTGTGTT TTCGTAAAAA ATTTGTTGAC AGAGCACGAA ACGC 3054 

(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1590 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

yGAAAATTTG GTAAAATATA TCTTAATCAT TTTCAGGAGG ACAAAAATTT 60 

CAGAATTTAG TAAATGGAAA ATGGAAATCA TCTGAACAAG AAATTACGAT 120 

ATCAATCAAG AAGAATTGGG TACAGTTCCA GCCATGACTC AGACTGAACC 180 

ATGCAAGCTG CGCGTGCAGC CCTGCCAGCA TGGCGAGCTT TATCAGCAGT ' 240 

GCTTATTTGC ATAAAACAGC AGCTATTTTA GAACGCGATA AGGAAGAAAT 300 

CTTGCCAAAG AAGTAGCAAA AGGGATTAAA GCAGCAATTG GAGAAGTAGT 360 

GACTTGATTC GTTATGCTGC TGAGGAAGGT CTCCGTATCA CTGGACAAGC 420 

GGTGGTTTTG AGGCAACAAG TAAAAACAAA CTGGCTGTTG TCCGTCGTGA 480 

ATCGTGCTAG CGATTGCTCC CTTTAATTAT CCAGTTAATT TATCTGCTTC 540 

CCTGCCTTGA TTGCACGGAA TGTGGTCATG TTTAAGCCAC CAACACAAGG 600 

GGACTCTTGT TGGCTAAAGC ATTTGAAGAA GCAGGGATTC CGGCAGCTGT 660 

ATTACAGGTC GTGGTTCAGA AATTGGGGAT TATATCATTG AGCACAAAGA 720 



TGTGATTTTC 
GACAAGATAT 
TTATTCACCA 
TGATGAGGCT 
TGAACGTGCG 
TGGTACTATC 
GCGTACAGCA 
AATGGAAGGT 
ACCAGTTGGT 
TAAAATTGCA 
TTCCATTTCT 
TTTCAACACC 
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AGTCAACTTC ATCAACTTTA CAGGTTCAAC TCCTATTCGA GAACCTATTG GTCGTTTACC 780 

TGGTATGCCT CCTATCATGT TGGAACTTGG TGCGAAAGAT GCAGCTCTTG TACTAGAAGA 840 

TGCAGATTTG GAACATGCTG CCAAGCAAAT TGTTGCGGGA GCCTTTAGCT ACTCAGGACA 900 

ACGTTGCACG CCCATTAAAC GTGTCATTGT TCTCGAAAGT GTAGCAGATA AATTAGCTAC 960 

TTTGCTTCAG GAAGAAGTTT CTAAATTAAC AGTTGCTGAT CCATTTGACA ATGCTGATAT 1020 

TACACCTGTT ATTGACAATG CTTCAGCCGA CTTCATTTGG GGCTTGATTG AGGATGCACA 1080 

AGAAAAAGAA GCTCAGGCTC TTACACCAAT CAAACGTGAG GCCAATCTTC TCTGGCCAGT 1140 

GCTTTTTGAC CAAGTTACAA AAGATATGAA AGTGGCATGG GAAGAGCCAT TTGGTCCTGT 1200 

TTTACCAATC ATTCGTGTGG CTAGTGTAGA GGAAGCTATT GCCTTTGCCA ACGAATCTGA 1260 

ATTCGGCCTT CAATCATCAG TCTTTACAAA TGATTTCAAA AAAGCCTTTG AAATTGCTGA 1320 

AAAACTTGAA GTAGGTACAC TCCACATTAA TAATAAAACC CAGCGTGGTC CAGATAATTT 1380 

CCCATTCCTT GGTGTCAAAG GTTCTGGAGC TGGAGTGCAA GGAATTAAAT ATAGCATTGA 1440 

AGCGATGACA AATGTCAAAT CCATTGTTTT TGATGTGAAA TAACGTGTAA AACCAGGAAA 1500 

TTGTTTTCCT GGTTTTATTT TTTTGCTATA AAATAATAAT AATTATAGAA AAAATACGAA 1560 

CTTTTTGGTA TTATAATAGA TTGAAACCGG 1590 
(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4848 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

CCTGCAGTTG TCAGACCTGT AATTTTCTTT TTATCTGTAA TAAGAATCGT TCCAGCGCCT 60 

AGAAAACCCA CACCTGATAT AACTTGAGCT CCTAATCGTG TAGGATCTCC TGTCCCAAAT 120 

TTATAAGATA CGTATTCATT CGTCATCATA ATCAAACATG CACCTAGACA AACAATACTA 180 

TAAGTTCGGA TGCCTGCAGG CTGGGATTTG CTCCCTCTCT CTAAACCAAT TATACTACCA 240 

ATGACTACTG ATAAAACAAT CCTGACAACT ATTTCAATAT TTGATAACCC AAGACTAGTG 300 

GCTGTCATGA TTATTTCCTT ACTTTACGCC CCGGTCTTTG TGTGAAGTAT AATACCGTTC 360 

CAGAAATAAT CATCAGAACA ATTGTATAAA CAAATACCAG AGCTTGTGCA TTAGATGTTG 420 

CTGTTTCATC ACCTGCAGAT CGAATCGTAA TACCTAATGG TTGACCTAGG GGATGGTAAA 480 

GGAATACAGA TAAGTCGAAG TCAGTTAATA AAGAGTTAAA GTTTAAAGCA ATAACAGAGA 540 
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GAACAACCGC TAAAATAAAT GGAATGATAA CCTTCATCAT AGTATAAAAA GGTGAAGCAC 600 

CCATACTTCT TGCTGCATCT TCCATCTCAT CATCAACACT AAATAAAATA GCACGTACCA 660 

TTCTATAAGA AAATGGGATT TTTACAACTA TATATGCAAT AAGTAGAATT ACCAAACTAC 7 20 

CTACCAAAAT CTGATTCAAG ACAAGAAATT GTGGCTGATT AAAAGTAAAT AATAAACTTA 7 80 

CTGCTAAAAG TGTACTTCGT AGTAACCAAG GAAGTAGAGC ACCATATTCA AATAAGAAAT 340 
CAAAACGAGA TTTATGTTTT CTGACAACAC GACCAAATAC AACTCCGAGA ATTGTTGCTG ~ 900 

TTGTCGCAGC AATAATAGAA TAAATAAAGC TGACCAAGAA TGGAGAGAAT GCCGCACTAT 960 

TACTAAAGAA TAAGCGATAA TTTTCTAAAG TAAAGTTTGA TAATGTTAAG TTACCTGTTT 1020 

GAATTGCAAC TGGATCTGTA AATGAGTATA AT ACT AT AAA AATTAGTGGA AGCATGAAAA 1080 

CTGTGAACAA TCCATATGCT ACAATGTGAG CAATGATATT CCAAGGCTTA GACGCAATTT 1140 

TTTGTTTTTT AAGAGGCGCT TTAGTCTTAG AG AT AG AAAT ATAATTTCCA CCTTTTTCTA 1200 

TCTTATTCAT GATAGTAAGC AAAATTGTAG TTGCAATACC TAAAATAATT GCAAGTAGGG 12 60 

CAGCTAAATC ACGAGAATTC CCCATCCCTG CAAATGTAAT AATCATTGGA TTTATAGTTT 1320 

GAAATTCTTT ACCACCAACA ATCATGGGTG CTGCTACTGC AGATAAACCA CTAAGAAAAA 13 80 

CCATAATAGT AAGTGCAAAT AG AGTTGG AA TTAAGGTTGG TAACACTACT TTTCGGAAAA 1440 

CAGTAAATGG TTTTGCTCCC ATATTTCGAG CAGCCTCAAT AGTGTGATAG TCAACGCTTC 1500 

GAATTGTATT TGTTAAAAAC AATGTATGAT TAGCAGTTCC TGAAAATGTC ATAATGAATA 1560 

AGACTGCACC ATACCCAATA AACCAGTTAG GGTCTAAAGA AGGGATAACA TTTTGTAAAA 1620 

ATTTTGTAAT CAATCCATAA GGACCATAGA CAAATTTATA TCCAGTCGCT AAAACCACTC 1680 

CTC CAT AAAT TAAAGAGGTC ATATAACCTA ATTTTAAAAT TTTAGCACCT TTAATATCAA 1740 

AGTACTCTGT AAATAGAACA CAAAGAATAC CTACGACATT AACTGTAATA ATGAGTGAAA 1800 

ATGCTAACTT AAAACTGTTC ATAATACTCT GAAGTGCCCT CTGAGATTTT AGAACACGAT 1860 

GTACAGCATC AAGGGAAAAT TCTCCTCCTT TT AC AAAT AC ATTCACTACT AG AT CAAAGT 1920 

TTGGATAAAT AATAAATGTT ACTAAGAACC AGATTAACCC TAAACGAATA AGCCAATCTT 1980 

TTAAATTTAA TTTATGACGC ATACTGCACC TCCTTAAAAT TGCAGAACGT CTGATGGTGT 2040 

GATAAATAAT TCCACACTTT CTCCGACAGA TCTAATAGCA GCCTGACTAT CAATACTTGT 2100 

TACATTAAGA ATCTGACTTT CAGAAACTTT TATTGTATAG TGAATTGTAA CTCCAGAAAA 2160 

CTCAACATCA ATAATTGTCC CTTTTAGAAT AAAATCTTCT TCAGTTTCAC GATTGAATCC 2220 

AACTTTCTCT AATCGAATGT ATCCTTTTTT ATCCTCTAAG AAAACGCTTG TATTTTTCAA 22 80 
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TAATACTTCG TGGACTGTTT CATCGGTCAA AACATTAATA TCTCCAATAA AATCACATAC 2 3 40 

AAATTCAGTT TGAGAATTAT GATAAATCTC TACTGGTGTA CCGACCTGTT CGATGTATCC 2400 

ATTCTTAAAG ACTGCAATTC TATCAGATAA AGTCAAGGCT TCCTCTTGAT CATGAGTAAC 2460 

ATATAAAGTA GTAATACCTA ACTCTTTTTG AAGTCTTTTC AACTCTTTTC TCAAATCTAC 2520 

ACGTAATTTT GCGTCAAGGT TTGACAATGG TTCATCTAGA CAAACAATTT TAGGTTCAAG 2580 

AACCAGAGCA CGAGCCAATG CTACCCTTTG TTGTTGACCC CCAGATAATT CTGATACATT 2640 

ACGCTGTAAC TGTTGATCAG AGATCTTAAT TTTTGCTGCC ACTGCTGATA CTTTAGCTTT 2700 

AATAACATCT GGAGCTACCT TCTTAACTTT TAAACCAAAT GCAATATTAT CAAAAACAGT 2760 

CATAGTTGGA AATAGCGCAT AAGATTGAAA TACAATACCA ATTCCACGCT TTTCAGGTTC 2 320 

CAAATGAGTG ACATCTGTTC CATTAACTTC AATACTTCCT GATGATGGAT CTAGAAAACC 2880 

TACCAATGCT CTCAAACTAG TTGATTTACC ACATCCTGAA GGCCCAAGAA ATGTAAAAAA 2940 

TTCCCCTTCA TGTATATCTA AATTCAGATT ATCAATTGCA ACAAAATCAC CATATTTAAT 3000 

TTGAATATTA TCAAATTTAA TCATCTCACT AACTCCCTCT ATT ACT AAA C CAAAAGCCTC 3060 

TCTTTATTTC TTCCATAAAT TTAGAAATAA TAGAGACACT TGGACATAAA AATTAACTCT 3120 

TATTTCTTAT TGTACGTATT CTAATTCAGC TTTTTCTACC CATTCATCCA AATGCTTTCC 3180 

AACAGCTTCC CAGTCAATAT TTTGTGGTTT CACTTGATCA ACAAATTTCT TCGTATCTTC 3240 

AGCTAGATCT TTGAGGGCAT CTTTATTTGC ACGAATAGAT CCAAAGTTCT TACTATATTC 3 300 

TACTTGAATT TCTGATTGAC CAAACCAATC AATAAATTCT TTAGCTAACG CITGTTTTTT 3 360 

ACTAGTGCTT AAAACCATAG TTTGTTCAGT TACAAATGGT ACACCAATCT CAGGAGTCAT 3420 

AACTTTGAAA ACAACATTTT GTTCTTTTTG TCCAACTAAT GCACCAGAAC CCCACATCAT 3480 

TCCATATTGT ATTGGATCTT CTTTGTCTAA CATCTTAACA ATTGAACTTT CTCCCTTTTG 3540 

AAGAGTGTAT GCATTTTTCA AATATTCTTT TGCTACTTCC CAACCTTTTT CGGAAACACC 3600 

TAATTCACCT TTATCATCAA GGTATCGAAC TAAGATACTT GCTAGAATTG CCCGTCCTGT 3660 

ACCTCCTTGA AGACCAGAAA TTGAATATTT ACCTTTATAC TTACTACCTA ATTCAGTCCA 37 20 

ATCTTTAGGC ATTTCTTTTA CATCAGGCGC CCCAATTAAA ACTAATGGTT GAACAATCAC 3780 

AGGATTATAA TAATTATCTT TATCTGATAA AGATTGATCA ATTTTATCTA ACCATTTAGG 3840 

CTTGTACTGT ACTAGTAATT TTTGATCTCT AATTTTATTT GAATCAACAG CACCAATTCC 3900 

AAATACCATA TCTGCAACTG CATTATTCTT CTCAGCAATA ACACGGTCTG CTAATTGAGC 3960 

GCCAGCGATA TCAACCATTT TT AT ATT AAA ACCAGCTTCT TTTGCTTTAG CAGTTAACCA 4020 

ATCACCACGA CCATTTGAGA CTGAGTTCGA ATAGATAACT AATTCTTGAC TTTTATCAGC 4080 
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TTTTTCPT.CA GATGAAGAAG CAGTCGTAGA ATTTGAACCT CCAGAGCAAG CAGCAAGTGT 4140 

AGTAAgAGCA ACTCCCGTTG CAAGTACAGT AGACCAAACT TTCATTTTTT TCATGATAAG 4200 

TTCTCCTTTT TTATTATTTT ATTTAAATTT TTCGTGATAT GG AACAAATT GTCTCATATC 42 60 

TTCAAATACA GTATAGTCAA TACGGTTTAC ACTAATAGTT GGAATCTTCT CTAATAAAAT 4320 

TTCAGTTAAT TCTGCTCTGA CTTTAGTAAA CTCTTCTTCC TCCTCTTCGG TTAGAGGAAT 4 3 80 

CCGAAGATAC CCAATTGAAA TATGGAATTG ATATCTATCA TGATTACCGA AACAAACACC 4 440 

TGCTTTTTCT GAGACATAAG TACGAATTTC TTCTAATCTC TTTGCAGAAG CTTCATCTGC 4500 

AGGTTCAACT ACTATGTTTT GTTTTCCCAT TTCAGTTATA CGCATATGAA TTTCTTCATC 4560 

CAACAATGGA AAAATTTCAA GTTGTTTAGC AAAGTAATCA TGTATTTCCT GTAAAGGTGT 4620 

ATCTAGAGGA AGATTACTGC TCCAAAACTC gtTCACGATT TTCATGGCAC AACAATTCAA 4 680 

TTACAGTCAT CTGAATAGAA TTCCTTGGAG TTAAAGTAAA CTTATCCATA AATGGTAATT 4740 

CTCTATAACG TGATTGAATA ATATCAACAA CTTCCATCAA ATCTTGTTTA GTATAAAGAT 4800 

TTGCTACAAC TGTATTCCCA GGG AAATGAT TAAATTCCCC ATTCTCGG 4848 
(2) INFORMATION FOR SEQ ID NO: 186: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3763 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

GTTATAAGCA ACACCTTCTT GCTTGCCATA AGTTGTGAAA TGGGTAGAAT CGATATCTAC 60 

AATGAGTTGG TTTAGCTGGT GAAACTGTAA AAAGAATTCG ACCAATTCAA GGTTGAGGCA 120 

TCGCAAACTA TGGACTGTTT CCTCGTCAGT TCTGGAAAGA AAACGGGATA AGGTTGGCTG 180 

TGAAGCAAGC TGCCCTCCTT CCAATAATTT TGGAAAGTAG GCATCAGCTG ACAATTCTTT 240 

ACAAGCATAG TCCGTTCCAT AACCTGTTAA CAGTTGAAAG AGGAACTGGA CAAGGATATC 300 

TGAATCCGAA TAACGACAGT AGCGGCGTTG GTCATTCGTT ACTAAATACT TAGAAATCCG 3 60 

CTCTTTTAGT TTCAACTGGG AAAAAAGTTC CTGAAAAAAG ATAAGACCAC CATACTGGGT 420 

TAAATGACCT CCATCGAAAG ATAGTTGGTA AAAAGACTTG TTTTGGAAGT GATGATTTGG 480 

TAAACTGTTC ATGTGAGTTT CCTTTCTTTT TGTGTTTTTT TCTACACTTA TACCATAAAG 540 

GGGAAACTCT TTTTTGTCTA GTAAAAAACA CCCATTGGGT GAAAAAAGAA ACCATCCAGG 600 
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ATCTAAGCTA 


AGGCAAGGAT 


TCTGGATGCT 


TTTTAGATTT 


GGGGTGAATA 


ATTGGGGATT 


660 


TAGGAGAAAT 


CATGCTATCT 


TCCAAATCAA 


AATCAACTTC 


ACTCCATAGT 


CTCAACTGAT 


720 


TGATTTTCCC 


ATCTTGATAG 


GTCACATCCT 


TGTCAAGGAT 


AAACTGAGTC 


AACACCTCAT 


780 


GTTGACCTTG 


ACACCTGATG 


TCATCTACCA 


AGAGCCAGAC 


ATCCTCTACC 


AACATGAGGA 


840 


TTTTTCTCCT 


GTGAAGATAA 


GGCAAATCAG 


GTTCTGCTGA 


CCAATAAGCC 


CCCTCAATAT 


900 


AATGCACTCC 


CTCCCTTTCT 


TTATGGTGAC 


AAAACAGGGA GTGAGGATAG TATTCATATT 


960 


CCCAGGATCC 


CGTGATTCTT 


TCCGGAGCTT 


TCCCATCTAC 


AATGCAGGTC 


GAATGACTCC 


1020 


AAGCACTCTT 


TAAGAGATAA 


CGTTCATATA 


TCTCCCGATA 


AGAATAACGC 


CCAGCATCTA 


1080 


TGAAAATAGG 


TTGGCCTTGA 


TACTGTAAGC 


AAAAACTATT 


CTCGTCACTA 


TGACTATGGG 


1140 


CACTTCCTAG 


CGGACCATTT 


TTGAAAAATA 


GATAACGATG 


TTCATCCTTA 


ATGCAGACAT 


1200 


GTCCAGAGTC 


TTCAAAGATC 


ATGGACTTAG 


GCTGCCAAGC 


TCTCTTTTCA 


AATTCCTGCA 


1260 


GTCGCTTGAC 


CTTTTCTCGC 


CCCAGGAACA 


AGAGGCTAAG 


CAAATCAACT 


TTAACATCCA 


1320 


GACCCTTAAG 


AAGGTCTTCC 


TGGTTCAAAA 


CCACAGCAGA 


CAGGCTCAAA 


ATTTCTGTCG 


1380 


TTTCTGTAGA 


ATCGCTATCA 


CCAAAAGCCA 


AAGTCCGTCC 


ATCTAAGCCT 


GTCATCATTT 


1440 


GAATATACGT 


CGCCATCTTT 


TCCAGCAACT 


CTTGGTAACT 


ATCTTGCAAG 


TCTGGAAGCA 


1500 


AGAGACACAA 


ATCCAGCAAG 


GCTTTATAAA 


CCTCTACATG 


ATAGAGAATC 


GACTGTTCAA 


1560 


ACTGGCTTCC 


ATCTCCTAAA 


ATCTGTGTCT 


CAATTTGCTG 


TTTCAACTCC 


TCTGAAGCAA 


1620 


AATGGTAAGC 


TTCTTCTAGA 


TCCATCTTAT 


CTGAAAAGAA 


ATGATAGATA 


GCAAGCATCG 


1680 


GAATTGTTTG 


TAAAATCCCC 


CAGTTACTAA 


GGGTGTACTT 


GGCGCGATAG 


TAGCTTTTCA 


1740 


TAAAGTCAAT 


CTGCTTTTCT 


AGACTGACCA 


AAATTTTCTC 


TAGTTCTTTC 


TCCTCTAGCA 


1800 


AGTCAAATTT 


CAAGAGGAGC 


AAGACTAGTT 


TCAACCAAGT 


AAAGGAACGA 


ATACCCGTAT 


1860 


CCAACGTTCT 


AGTCATCAAG 


GATTGAGGAG 


AAAATTCTCT 


CACCTGCTCA 


ATCCAATCAA 


1920 


ATAGAAAGAA 


CTTGCACTTT 


TGAATATAGT 


CCTTATCTCC 


TTCTACCAGA 


TACCCTATCA 


1960 


TAAACTGCAA 


GAGATATTCT 


TGTCGATTGA 


GCATATAAGA 


CCATTCTGGA 


TCATCTTCAA 


2040 


ATACTTGATC 


CCATACCATC 


GGCTGGATTT 


GATGGATTTT 


TGAACAAGGC 


TCCATATCCC 


2100 


AAGGACTATC 


AAACATAAAA 


CGATTGTCCA 


TCAAGCGTTC 


AAGGGAACTC 


TTGACTTTCT 


2160 


CATAGTCTTT 


TGAACAGTGC 


GACAAGATAT 


AATCACGACA TTGATTTCCA TCGACTCTTT 


2220 


CAAAAAATTG 


TCTTCTTTCT 


TCTTTCATTA 


TCTATTACCA 


GAAAAAGAAC 


TACTTAAAAA 


2280 


GCAGTTCTTT 


TGTCTTTCCC 


ATTACACTTT 


CCTTTTCTAC 


ATGGATGACC 


ACACCTTTTG 


2340 


CAATCTGCAA 


GGAGACCAAG 


TCATCTTGGA 


TAGAAATGAT 


TTTTCCATGA 


ATTCCAGACA 


2400 
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ATAACAACAC 


TTCATCACCA 


AATGTTAAAG 


AAGCTAAATA 


CTCTTCTCGT 


TGCTCCATCT 


2460 


GTTTGCGAAG 


CAACTTTTGC 


TGACGAATAG 


AATGAAACCT 


TGACAGTAAA 


AGGGGACTCA 


2520 


CTGCCAAGAC 


AATCACTATT 


CCATAAAACA 


ATGTTGTATC 


CATTAAGCTA 


TAATCTTAAG 


2580 


CCAGCTTCCG 


ATAATTCCGA 


TGATAACTCT 


TAAAATAACG 


AGTTTATATG 


TTGTCCATTT 


2640 


CTTTTCTTTG 


ATCAAGTAGT 


AAACTAAAAG 


TGTAAATAGG 


GCTGGTAGAA 


GAGCTGGAGC 


2700 


- - AACCTTATCA 


AGCATTCGCT 


GAATACTTAC 


GATACTTTGT 


TTAGCGTCTG- CTTTAACTTC 


2760 


CCCTGCAGCA 


AAGGTAATCG 


GCACCATAAT 


CTTAACAGAT 


CTCGCTGCCA 


AACCAGCAAT 


2820 


TACGtTACAC 


CGATAATATT 


GGCAATACGA 


GAAATCGTTG 


CCATCTGTTC 


GCTTAGTTTA 


2880 


TCAATCACAG 


TTGTTCCTAG 


TTTGTATCCA 


TACAGACCAG 


TTGACAATTT 


AATCGCTGTT 


2940 


AAAATCGTAT 


TCATCGCAAG 


GAAGAACAAG 


ATTGGACCGA 


CAACCAAGCC 


TTCTTGAGCA 


3000 


AACGAAGCTG 


CGATGGTTGA 


GAACAATGGA 


GCTAAACAGA 


ATTGAGAAAG 


AGAATCCCCA 


3060 


ATACCTGCCA 


ATGGTCCCAT 


CAAGGCCATC 


TTGATGCTAC 


GTGTTTCTTT 


TGCCGGACGG 


3120 


CCATTTTCCA 


ACATTACAAG 


ATGCAAGCTG 


GTAATAAAAG 


GCAGGAAGTG 


TGGGTTGGTA 


3180 


TTATAGAATT 


CACAGTTTTC 


TTCCAAGGCT 


TGGTAGAAAC 


CTTCCTGATC 


CTCTCCATAG 


3240 


TGTTTTTTCA 


AAGCAGGATA 


CATCACATTG 


GCATATCCCA 


ACCCTTGATA 


GTTACTATAG 


3300 


TTAAATCCAT 


TTTGACAAAA 


GAATGCCCGC 


AAAGACGTTT 


TAAGATAATC 


ACGTTTTGTT 


3360 


AATTTGTTAG 


ATCCAGTCAT 


CGTGTGCTTC 


CTCCTCTACC 


ACATGATCCG 


CTGTTTTTGG 


3420 


CTTGTTATAA 


AATTCAATCA 


AAGCAAAGAT 


AGTACCTACA 


ATTGCAATAC 


CAATTGTTGG 


3480 


GATGTTTAGA 


TAAGCTGCAC 


AAACATATCC 


CAACAAGACA 


AAGGGAATCA 


ACTCTTTCTT 


' 3540 


AGCCATCACT 


GACAAGATCA 


TCGCAAAACC 


GATAGCTGGG 


AGCATTTTAC 


CAGCAACTGT 


3600 


CAAACCTGTA 


AGTAATACCG 


GTGGAATGTA 


GTCTACGAGT 


TTCAACAAGG 


TATCCATTGA 


3660 


AAGGGCACCA 


AGCAACCCAA 


GGTAAATCCA 


ATAAAGGCAA 


ACAACCAAAT 


TGTTGCATTT 


3720 


AGAGTGAACT 


TAAATTTCTT 


CAAATTATGG 


TTTTTCAAGT 


GCT 




3763 


(2) INFORMATION FOR SEQ ID NO: 187: 









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5053 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 
(O) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 187: 
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CAATCTCTGA 


GTATGTGCGG 


TCAATACTAw 


CAAAGGGAAT 


yCCTGACGTC 


AAGTAATGTT 


60 


CAATTGGmCT 


ATAGGTAATG 


GCAACCACTC 


CATCAACTTT 


ATTATGACGC 


AACATCTCCA 


120 


GATAGTCTTG 


CTCTCTATTT GTACCATTGA TAGAACATAA GAGTAATTTG TTATTTCTCT 


180 


TATAGACTTC 


ATTTTCCACA 


TGCATAGCAA 


ATTCTGAAAA 


GAAGGCATGC 


CAGATACTTG 


240 


GTACAATGAT 


TGCAATCGTT 


TCTGTTCGAT TTTTTTTCAT TCCTCTAGCG TAGTAATCTG 


300 


GAATGTAATT 


CAAAGTTTTA 


ATCGCTTGTT 


CCACTTTTTT 


CAAAGTTACT 


TCTTTAATGC 


360 


CTTTTTCTTT 


ATTAATTACA 


CGTGAAACAG 


TTCCAACACT 


AACTCCTGCT 


TCTAAACCAA 


420 


CATCTTTCAT 


GGTAATTGAT 


tttctttgtt 


CT AC CAT ATT 


ATCACCTCCT 


TTCAATATAT 


480 


AGTATCATGC 


AAATGCTTTT 


TAAGCAACTA 


TTTCTCAATC 


ATTTTTGGCC 


AGATCATTTA 


S40 


TCCCATCATG 


AATAAAATCA 


CTCCAATTAG 


CTTTTGAAAA 


TACTTCAATT 


TTCATGTGTA 


500 


AACATCTACA 


TAAAACAGGA 


AAAGCCTTGG 


TTTCATGGCT 


TTTTTCGTAT 


CTTCTATAAA 


660 


AAAAGCAAGA 


GTTTTAGATG 


GCTATAAATC 


TAGATGTACA 


TTTTGCTTAA 


ATGATTGAAG 


720 


GTCTTTTCTT 


AACAAAAACA 


CCCCCAAAAT 


TAGACTTTTT 


CTGTCTAACT 


TTTGAGGTAC 


780 


AGTTCAAACG 


CGAAATAGCG 


TTTTTTTGTT 


ATTTTTGGTT 


ACTCATCTAA 


TCGAATAAAC 


840 


ATCATGGCAT 


TTAACAAGTA 


TATGAGTGAG 


ACCCTGTTTA 


TATTATTTGA 


ATAGATGAGT 


900 


CTCTTATTTT 


CAATAGGAGG 


AATAATAAAA 


TTAGAAATAA 


TGATATCATA 


AGGTGAATCT 


960 


TCTAAAGATT 


CCTTTGATAA 


TTCTAATTCA 


GTCCAAACTT 


CCAGTTCAAA 


ATTATTGCTA 


1020 


CAATAATAAG 


AAAGTGTCTC 


TGCAACGAAT 


TTTGCATGAT 


ACTGATCAAA 


ATTACTCATA 


1080 


ACTAAAACCT 


TTAGTTTAGG 


CTGATTTTGT 


AGCAAATTAA 


TCACCAAATG 


TTTGGTATGA 


1140 


GTGATGAAGG 


TATAAGATAG 


ATGATTTACC 


ATCATTGAAC 


TAGAACAAAC 


CTCAAGAGTC 


1200 


TCTAAATAGT 


GAGAAAGCTC 


TTTTTTTATA 


TCTGAAACAA 


ATTTTGGAAA 


AATATTTTGA 


1260 


AAGTTCCTGA 


TTGTATTCCC 


TTTTTGATCA 


AATAAAATAA 


ACTCAGTAAA 


CAACTCTTGA 


1320 


CGATACAGAT 


GTGCGGTATT 


ATGCAGATGC 


CAAATCAGAT 


TATCCTTATT 


CTCCATTTCii 


1380 


ATCTGATACT 


TGACTGAAAT 


CTGATCAATA 


AAATCACTCA 


ATAGATGGTA 


AGATTTTTCA 


1440 


ACATAACTAT 


CCTTTTTTAC 


GCATTTCATA 


AAGAGACTTT 


CATCTATGAA 


AAACATTTTT 


1500 


TGAAAGTAAG 


ACACAAATAA 


TTGGCAAACA 


ACTTCTTCAT 


CTAAAGAGAT 


ATTGTATTCT 


1560 


GATTCAAAAC 


TCTGAGCAAC 


ACCTTCTATT 


CCTTCTGCCT 


GCATTAAAAA 


ATCCAAACTT 


1620 


TGGTCGTTAA 


AAGAATCTTT 


ATCTACTTCC 


ATAAAATGAC 


CAAACTTTAT 


TCTATATAGG 


1680 


TTCGTAACTA GGAGCAACTT 


TAGCATTCTA 


TGCGTTGACA 


AATTCATTGG 


AAAGCTTGTT 


1740 


TCCTTATAAA 


CCAATTCTAA 


CAATTGAGAT 


AGTGGCTCTG 


ATGAAAAATT 


TTCAAATGGC 


1800 
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CATTCTAGGA AATAATATTT TTCTGAAAAA TATTGTGCAA AAAAGTAACG AATGTCTCTC 1860 

TCATTTCCAA TGATTTGAAC AGGGGTCAGA CTAACTTCAA ATTGAAATTG CCTTTTAATC 1920 

ACTTTATTGA TTTGGCTAAT AATACGATAG AGCGAAGATG AACTGATATA AAATTCTTTA 1980 

CAAATACTCT CAGCTTGACA ACCTTCATTA AAGAAGATGA ATTCTAAAAT CGAAAAATGA 2040 

GTTGAATGTT TAAAGAAATG ATGGTAAACC ATTTCAATAT CACTATCATC GGTATTAATA 2100 

ATGCGTAfAC~CATTAGTAGA AGAATGAAAA "ATCAAGTCAG GAAAAGCAGA TTTAACATGG - 2160 

CATAGATCAT CTTTGACTGC ACGTTCTGTA CAATTTAATA ACTCTGCTAG TTCAGAACGA 2220 

TGAAACCAAC GTTTATCTTC AAATAATAAT TCTAATAATT CTAATTGCCT ATGACTTTTT 2 280 

TTAGATAATA AATCTCTCAT GAATATCTTT CTCTCTTTAT AAATTATCGG ATTAAACCTC 2340 

TTGCAATTAT ACCACAAAGA ATAGGTATAG CATGATATAA CGACTTTTCC TAAAATCTTT 2 400 

TATTTCGTAT AATAACACTA CGGAGACAAT ATATAAACAA TTTTCTTATT TTACCGTCTA 2460 

TTGAGGGCGT GAATACAGAA TCAAATTCAA GTCTAAAGAT TATATTTTTA ATTTTAAAAA 2520 

TTATATAATA GCAACAATTA AAGAATTTGA TTTTTTAAAA TTATATAATA ATAACAATCG 2 580 

AAATAATTGA CTTTTCTATA TTAAAGTTAT ATAATAGTAA TAATCAAAGA AATTGATTTT 2 640 

TTGATATTAA AATAAAAAAG GAGGGTAGGC AGTGTTGTGA TCAATTATTG CTGGAGGTCT 2700 

TATTGGTCTC TTGGCAGGTA AAATCACTAA AAAAGTAGTT CTATCGGAAT CATCGCAAAT 2760 

GTATTCGCTG GTTTAGTCGG GGCATATGCA GGACAATCTC TTTTAGGTAG TTGGGGTCCA 2820 

GCAATCGCTG GAATGGCTTT GCTCCCATCT ATTGTAGGTG CAGCGATTGT GATTACTGTA 2 880 

GTGTCATTCT TTACAGGTAG AAAGTAAACT TTTCGCCAGT AAAGTTAGCA AACTATTTTT 2940 

AAATCAATGA CGGGAAAAAT AGTTTAAATG TTAAATCGAA AGGATTGTAT ATGTCAAAAG 3000 

CAAAGAAAAT ATGTTTCATT ATTTTCTGTA TTTTAATCTT GACAATTTTC CTTCCTGTTT 3060 

TGATAGATTA TCATCAAGTT AGTGATCTAG GTATTCATCT ACTTAGCTGG AGACAGAACT 3120 

CCGTAGTTGA ATTCTATCTT GCTAGATATG TCTTTTGGGG GACAGTGGTT CTATCAACTT 3180 

TAGTTTTATT ATCCATTTTA GTTGTGATGT TTTATCCTAA ACGTTACTTG GAAATCCAAC 3 24 0 

TTGAAACTAA AAACGATACA TTAAAATTAA AGAATTCGGC AATCGAAGGT TTTGTTAGAA 3 300 

GTTTGGTGAG TGATCATAGA TTGATCAAGA ACCCAACTGT TCATGTAAAT TTACGAAAAA 3 360 

ATAAATGTTT CGTTCATGTA GAAGGTAAAA TTCTTCCTTC AGACAACATC GCTGACAGAT 3420 

GCCAAATAAT TCAAAATGAA ATAACTAATG GATTGAAGCA GTTTTTTGGT ATTGAGCGTC 3480 

AAGTAAAACT TGAAGTTGCA GTAAAAAATT ACCAACCAAA ACCTCAAAAC AAAAAGACTG 3 54 0 
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TTAGTCGTGT GAAGTAAGGA AGTAAAAAAT GGAATGGCTT AAACAATATC GATATCCAAT 3600 

TATCGCTGGT CTCATAGGCG TATTTCTGGC TTGTTTGATT GTCTCCTTTG GCTTCTTCAA 3 660 

AACAATATTT GTATTCATTT TAGGAGCACT GGGAGTTGCA GCTGGATTAT ATATCGAAAA 3720 

AAACTATATA GATAAATAAA AAAATAAAAA TTACTAATTT AATTAAAGGA GTTTCATATG 3780 

TCAAACGAAA AAAACACAAA CACTAACGTA GAAAAGAAAG ATGCTACTGT TGTAGCTCAC 3840 

GAAATCAAAG GGGAACTTAC TTACGAAGAT AAAGTTATCC AAAAAATCAT TGGTCTPTCA 3900 

CTAGAAAACG TTTCAGGTCT TTTGGGAATC GATGGTGGTT TCTTCTCAAA TCTTAAAGAA 3960 

AAAATCGTTA ACAGCGATGA CGTAACAAGT G G TGTTAACG TAGAAGTTGG TAAAACACAA 4020 

GTTGCACTTG ACTTAAACGT TATTGTTGAG TACCAAAAAA ATGTTCCAGC TTTATATTCA 4080 

GAAATCAGAG AAATCGTATC TTCAGAAGTT GCTAAAATCA CTGACTTGGA AATTGTTGAA 4140 

ATCAACGTAA ACGTTGTCGA CATCAAAACT AAAGAACAGC ATGAAGCAGA CTCAGTAAGC 4200 

CTTCAAGATC GCGTATCTGA CGTTGCTGAA TCAACAGGAG AATTCACTTC AGAACAATTC 42 60 

GAAAAAGCTA AATCTGGTCT TCGATCTGGT TTCTCAACTG TTCAAGAAAA AGTTAGCGAA 4 320 

GGTGTAGAAG CTGTTAAAGG TGCAGCAAAT GGTGTAGTAT CTCACGAAAA CACTCCTGTA 4 380 

AACTAAGATA AAATAAATAT AACAGGAGAA ATTATCATGT CAGTAGAAGA AAAATTAAAT 4440 

CAAGCTAAAG GTTCTATTAA AGAAGGTGTT GGGAAAGCCA TCGGTGATGA AAAAATGGAA 4500 

AAAGAAGGTG CAGCTGAAAA AGTTGTTTCT AAAGTAAAAG AAGTTGCCGA AGACGCTAAA 4560 

GACGCTGTAG AAGGTGCTGT AGAAGGTGTT AAAAACATGT TGAGTGGCGA CGATAAATAA 4620 

GGTTAAAAGT TACTTTATCT TTTTAGTAAT ATTAGTCAAA AGAGTCTGAG TCAAGATGAT 4680 

TCTCAGAAAA CAAAAACCTA GAGATTCCCA ATTGCGGAAC TCTAGCTTTT TAATTTTGCC 4740 

TCTTTCTCTT ATTATATTTC AGCAGGTTGT TGGCCATGAG TACGAATCCC ATGTCAATTC 4 800 

TCACTTGACG CTTACCTCTC AGATGACATC TCTTATAACC CAAACAAACC TTTATCTGCC 4860 

CAAAGACAGA TTTCATATCA ATCTTACGTT TAGCGAAAAT TTGTCTACCC TTGGAAGATA 4920 

AAAGTGCCTG ATATTCTTTA GTTTTTAAAC ACTGGTAACG TTCATTCATA TACACTCTCT 4980 

TTTGAGGGGC TGATTCAGGT TCATAATCGC AGTCAACATT GATTTCAAGG CTGTTTGCTT 5040 

TCTATCTCCC CGG 5053 
(2) INFORMATION FOR SEQ ID NO: 188: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6492 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
(O) TOPOLOGY: linear 
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(XI) 


SEQUENCE DESCRIPTION: 


SEQ ID NO : 


133: 






AATTCTCTTT 


TTTCfAAP A A 


AATGTATGAC. CTGCACTTGA 


ATACTTCTCA 


TTGTTTGTAC 


60 


ATTCATCTAC 


TTTPATATAA 


TCTTTTACAA 


AATCATAATA 


TGACATAACA 


CACTATCCCT 


120 


TTTACACAAT 


/\1 ll*^A/\i In 


GCCTTATTAA 


TTCAAAACTA 


TTCTATTAGT 


AATTATAACA 


180 


GATGTATAAT 


AUAAAAvjCAA 


TG AT AG AT AT 


TATCAATTAA 


GCGAATTTAT 


ATCTAAAAGG 


240 


GATATTAAAG 


AAAGGAGATA 


TGCTTATGAA 


GATTTACAAA 


AAACTATTTG 


CTTATGTCCA 


300 


AGATAAGAAA 


1 A lVl LWjU 


TTTTGGCCAT 


AATTTTTTCT 


GCTATATCTG 


CTGCACTTAC 


360 


AGTATATGGA 


TATTATTTAA 


TCTACAAATT 


TCTAGATAAG 


TTAATAATTA 


ATTCAAACTT 


420 


ATCCGGTGCA 


UAOAVj i AT ACi 


% Mil ft a ft ft 

CATTAAAATC 


TGTTATTACA 


CTAACAAGTG 


GAGCGATATT 


480 


TTATTTTGTC 


I U AIAjAATvjT 


TTTCACATAT 


CTTGGGATTC 


AGGCTTGAAA 


CAAATTTAAG 


540 


AAAAAGGGaA 


It- OA IAjOTLTT 


GGAAAAAGCA 


AGTTTTAGGT 


TCTTTGACTT 


AAATCCATCT 


600 


GGTCAAATAA 


GAAAGATTAT 


AGATGACAAT 


GCTGCACAAA 


CTCATCAGGT 


GGTAGCACAC 


660 


ATGATTCCCG 


ATAGTTCTCA 


GGCAATAATC 


ACACCCGTAC 


TTGTACTTCC 


ACTTGGCTTT 


720 


ATAGTAAGTA 


TAAGAGTTGG 


CATAATTTTG 


CTTGCTCTTA 


CTATAATTGG 


TGGCTTAATT 


780 


TTAGGGGCAA 


TGATGGGCGA 


GCAAGAATTT 


ATGAAGATAT 


ACCAAGAATC 


CCTATCTAAA 


840 


CTAAGTGCTG 


AAACTGTTGA 


GTACGTGAGA 


GGAATGCAAG 


TTGTAAAAAT 


ATTTAAACCA 


900 


AATGTAGAGT 


CTTTTAAAAG 


CTTTTATAAG 


GCGATAAAAG 


ATTACTCAAA 


GTATGCTTAT 


960 


GATTATTCCC 


T ATCTTGT AA 


AAGGCCTTAT 


GTTTTGTATC 


AATCCTTATT 


TTTTGGACTG 


1020 


ATTGCAATTT 


TAATTATTCC 


TATAGTTTAT 


TTTATGACTA 


GCTTAGCTAG 


CGCAAAGGTG 


1080 


ATTTTACTtJ 


AGC7TATCAT 


r * *'. TTT^* ATTT 


TTATCAGGAG 


TTCTCTTTGT 


TTCATTCATG 


1140 


AGAATGATGT 


GCACTCCATG 


TATATTTCl'C 


AAGGAAATTA 


TGCAGTAGAT 


ACTTTAGAGG 


1200 


CGCTTTACGA 


AGATATGCAA 


AAAGACAAAT 


TAGTGCATGG 


TAATGTCAAT 


AATTVTAAAA 


1260 


ACTATAATAT 


AGAATTTGAG 


AATGTTAGCT 


TTGCTTATAA 


TGATAAAGCT 


GTCATTGAAA 


1320 


ATTTATCCTT 


TAATTTAGAA 


GAAGGAAAGT 


CCTACGCACT 


TGTCGGTTCA 


TCTGGATCAG 


1380 


GCAAATCAAC 


AGTAGCAAAA 


CTTATATCAG 


GTTTTTACAA 


TGTTAATAAA 


GGAAGCATAA 


1440 


AGATAGGCGG 


GATAGCAATA 


AGTGAATATT 


CTGACGAAGC 


CTTAATTAAA 


GCCATTTCCT 


1500 


TTGTTTTTCA 


AGATTCAAAA 


TTATTCAAGA 


AGAGCATTTA 


TGATAATGTA 


GCGTTAGCTA 


1560 


ATAAAGATGC 


GACGAAAGAT 


GACGTTATGA 


GAGCCTTAAA 


ATTAGCAGGA 


TGCGATTTAA 


1620 
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TATTAGACAA 


ATTCCCAGAA 


AGAGAAAATA 


CAATCATAGG 


CTCAAAAGGT 


GTTTATTTAT 


1680 


CCGGTGGAGA 


AAAACAAAGA 


ATTGCAATTG 


CTAGAGCAAT 


TTTAAAGGAT 


TCCAAAATTA 


1740 


TTATTATGGA TGAAGCATCA GCATCTATTG ACCCAGATAA CGAGTTTGAA 


TTGCAAAAAG 


1800 


CTTTTAAAAA 


TCTTATGAAG 


GATAAAACAG 


TTATCATGAT 


TGCACACAGG 


CTATCTACAA 


1860 


TTAAAGACCT 


TGATGAAATT 


ATTGTCATGG 


ATAGTGGAAA 


AATTATAGAA 


AGAGGGTCTG 


1920 


ACAAAGAATT 


AATGTCAAAA 


GATACAAGGT 


ATAAGAGCCT 


GCAAGAGATG 


TTTAACAGTG 


1980 


CGAATGAATG 


GAGGGTTTCA 


AATGAAAGAG 


TTTTATAAAA 


AAAGATTTGC 


TCTTACAGAT 


2040 


GGAGGAGCAA 


GAAATTTAAG 


TAAAGCAACA 


CTGGCTTCAT 


TTTTCGTTTA 


TTGTATAAAC 


2100 


ATGCTTCCTG 


CCATATTACT 


TATGATTTTT 


GCTCAGGAAG 


TTTTGGAAAA 


TATGGGCAAA 


2160 


AGCAATGGCT 


TTTATATAGT 


ATTCTCAGTT 


TTGATTTTGA 


TACCAATGTA 


TATTTTGCTT 


2220 


TCTATCGAAT 


ACGATAAATT 


ATATAACACA 


ACCTATCAAG 


AAAGTGCAGA 


TTTAAGAATA 


2280 


AGGACAGCGG 


AGAATTTATC 


AAAATTACCT 


CTATCTTACT 


TTTCTAAACA 


TGACATTTCC 


2340 


GACATTTCAC 


AAACAATCAT 


GGCTGATATT 


GAAGGCATAG 


AGCATGCAAT 


GAGCCACTCA 


2400 


ATACCAAAGG 


TGGGCGGCAT 


GGTACTGTTT 


TTCCCATTAA 


TATCTGTAAT 


GATGCTAGCG 


2460 


GGCAATGTCA 


AGATGGGTTT 


AGCTGTAATT 


ATTCCATCTA 


TTTTAAGCTT 


TATATTTATA 


2520 


CCTTTATCTA 


AAAAATATCA 


GGTTAATGGA 


CAGAATAGAT 


ATTATGATGT 


CTTAAGAAAA 


2580 


AACTCAGAAA 


GCTTTCAAGA 


AAATATCGAA 


ATGCAAATGG 


AGATTAAACC 


ATATAATTTA 


2640 


TCGAAGGATA 


TTAAAGATCA 


CTTATATAAA 


AAAATGGAAG 


ATAGTGAGAA 


AGTACACTTA 


2700 


AAGGCGGAAG 


TAACTACAAT 


TTTAACTTTG 


TCTATATCTT 


CAATATTTAC 


CTTTATATCT 


2760 


CTTGCTGTTG 


TGATATTTGT 


CGGCGTAAAT 


CTAATTATTA 


ATAAAGAGAT 


AAATTCTCTC 


2820 


TACCTTATAG 


GATATTTACT 


AGCTGCTATG 


AAGATAACAG 


ACTCTTTAGA 


TGCATCTAAA 


2880 


GAGGGCTTGA 


TGGAAATATT 


TTATTTATCG 


CCCAAAATAG 


AAACATTAAA 


AGAAATTCAA 


2940 


AATCAAGATT 


TACAAGAAGG 


CGATGACTAT 


AGCTTAAAAA 


AATTTGATAT 


TGATCTAAAA 


3000 


GATGTTGAGT 


TTGCCTACAA 


TAAAGACCCA AAAGTTTTAA 


ATGGTGTAAG 


TTTTAAAGCT 


3060 


AAGCAGGGAG 


AGGTCACTGC 


TTTGGTAGGT 


GCAAGTGGCT 


GCGGTAAAAC 


AACTATCTTG 


3120 


AAACTTATAT 


CAAGACTTTA 


TGATTATGAC 


AAGGGACAAA 


TCTTAATCGA 


TGGCAAAGAT 


3180 


ATAAAGGAAA 


TATCAACAGA 


ATCCCTTTTT 


GATAAGGTGT 


CTATTGTTTT 


CCAAGATGTG 


3240 


GTTCTCTTTA 


ATCAAAGCGT 


TATGGAAAAT 


ATTAGAATCG 


GTAAGCAAGA 


TGCAAGTGAC 


3300 


GAAGAGGTTA 


AAAGAGCAGC 


AAAACTTGCA AATTGCACAG 


ATTTTATAGA 


AAAAATGGAT 


3360 


AAAGGTTTCG 


ATACAGTTAT 


TGGTGAAAAC 


GGAGCTGAGC 


TATCAGGAGG 


AGAAAGACAA 


3420 
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AGATTATCAA 


TAGCCAGAGC 


CTTCTTAAAA 


GATGCGCCGA 


TATTGATCTT 


AGATGAGATA 


3480 


ACAGCAAGCC 


TTGATGTTAA 


CAACGAGAAA 


AAGATTCAAG 


AGTCTTTAAA 


TAATTTAGTT 


3540 


AAAGATAAAA 


CTGTTGTAAT 


CATTTCACAT 


AGAATGAAAT 


CCATAGAAAA 


TGCAGACAAG 


3600 


ATAGTAGTTC 


TTCAAAACGG 


AAGAGTAGAA 


AGCGAAGGTA 


AGCATGAAGA 


GCTTTTACAA 


3660 


AAATCAAAAA 


TTTACAAAAA TTTAATAGAA 


AAGACAAAAA 


TGGCAGAAGA 


ATTTATTTAT 


3720 


TAGGAGGACT 


"acaatggata 


ATAAAAAATT 


AAAAGTAAAA 


GATTTAGTAA 


"GCATCGGtGT" 


"T780 


TTTTGGCGTA 


ATTTATTTTG 


CCTTCATGTT 


TGGAGTTGGT 


ATCATGGGCT 


TGATTCCAAT 


3840 


ATTGTTCTTA 


ATATACCCGA 


CAGTATTAGC 


CATAGTTGCA 


GGAACTGTTC 


TTATGTTATT 


3900 


TATGGCTAAG 


GTTCAAAAGC 


CATGGGCACT 


ATTTATATTT 


GGTATGATAT 


CACCACTTGT 


3960 


GATGTTTGCA 


GCTGGTCATA 


CCTACGTAGT 


TGTGGTTTTA 


TCACTTATAG 


TAATGATAAT 


4020 


AGCAGAATTA 


ATTAGAAAGA 


TTGGTAATTA 


TAATTCATTT 


AAATACAATA 


TGCTTTCTTA 


4080 


TGCAATCTTC AGCACATgGA TATGTAGCTC TTTAATGCAA ATGCTTTTAG 


CAAAAGAAAA 


4140 


ATATATGGAG 


TGGTCTTTGA 


TGACTATGGG 


AAAAGATTAT 


GTTGATGTAT 


TAGAAAAGTT 


4200 


AATAACTTAT 


CCTCACATGG 


CTTTAGTAGC 


CTTAGGTGCT 


TTCTTACGAG 


GAATTCTTGG 


4260 * 


AGCATATATA 


GGCAAGGCTC 


TATTGAAAAA 


ACACTTTTCA 


AATGGATTAT 


ATTGTGTGGG 


4320 


ATACTTTACT 


CCTTGCCTAA 


TTTTATGGTG 


CTATCTGAAT 


TAAACCCTAT 


AGTTAAGATG 


4380 


TTTTTGAGTA 


TACCTATTGT 


TATTAGAATG 


TTTATTTTAC 


CATTTATGGC 


AGCAAGCTTT 


4440 


ATGATAAAGA 


CCTCGGATGT 


AGGCGCAATA 


ATTTCATCGA 


TGGATAAGCT 


TAAGATTTCA 


4500 J 


AAGAATGTAT 


CCATACCTAT 


TGCGGTTATG 


TTTAGATTCT 


TCCCATCTTT 


TAAGGAGGAG 


4S60 


AAGAAAAACA 


TCAAAATGGC 


TATGAGAGTA AGACCCATAA ATTTTAAAAA CCCAGTCAAA 


4620 


TATCTTGAAT 


ATGTTTCTGT 


GCCACTACTC 


ATTATATCAT 


CTAATATATC 


AGATGACATT 


4680 


GCAAAAGCGG 


CAGAAACAAA 


GGCAATAGAA 


AATCCAATTC 


CCAAGACCAG 


ATACATTCGC 


4740 


GTAAAGATAC 


AGCTAATTGA 


TTTTGTTTAT 


GTTTTAGCGG 


TTGCTGGACT 


TATTGTGGGA 


4800 


GGCTTAATAT 


GCTTGAAATA 


AAAAATTTAA 


GTCTTGATTA 


TGGTGAAGAG 


CATATATTAG 


4860 


ATGATATATC 


ACTATCCATA 


GCCGAGGGAG 


AGTGCGTGCT 


ATTTACAGGA 


AAAAGTGGAA 


4920 


ATGGTAAGTC 


ATCTTTAATA 


AATTCAATCA 


ATGGACTAGC 


TGTAAGGTAT 


GATAACGCAA 


4980 


AGACAAAGGG 


CGAAATAATT 


ATTGATGGTA 


AGAATATAAA 


AAATTTGGAA 


CTTTATCAAA 


5040 


TCTCAATGCT 


TGTTTCAACT 


GTTTTTCAAA 


ATCCTAAGAC 


ATATTTTTTT 


AATGTCAATA 


5100 


CGACATTAGA 


ATTATTATTT 


TATTTGGAAA 


ATATCCGTCT 


TGCAAGAGAA 


GAGATGGACA 


5160 
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GGCGTTTGAA 


GGATATACTT 


GAGATATTCC 


CGATAAAAAA 


TCTTTTGAAC 


AGAAATATAT 


5220 


TTAATCTATC 


CGCCGGTGAA 


AAACAAATTC 


TTTGCATTGC 


AGCTTCTTAT 


ATAGCAGGTA 


5280 


CAAAGATTAT 


AGTTATGGAT 


GAGCCTTCAT 


CGAATTTAGA 


TATTAAAAGC 


ATAAGTGTTT 


5340 


TGGCAAAGAT 


GCTAAAGATA 


TTAAAAGAGA 


AAGGCATAAG 


CATAATTGTT 


GCAGAGCATA 


5400 


GAATTTATTA 


TTTGATGGAC 


ATAGTTGACC 


GTGTATTTTT 


AATAGATAAA 


GGAAAGCTTA 


5460 


AAAAAACTTA 


TACTAGAAGT 


GAATTTTTAA 


AGCTAGATAA 


AAATGAATTA 


AATGCTTTAA 


5520 


GTTTAAGAGA 


TAAAGAATTA 


AGTAAATTAA 


AAGTTCCTTA 


TTTAAAAGAA 


GGTGGAGAGT 


5580 


ATCAGATAAA 


AAATCTTAGT 


TACAAATTTA 


CTGATGATGA 


GTGTTTAACC 


TTAAAAGATA 


5640 


TTTCGTTCAA 


GCTTGGGAAA 


ATTTATGGCA 


TAATAGGATC 


CAACGGACGA 


GGAAAATCAA 


5700 


CGCTTTTAAG 


ATGTTTAATA 


GGTCTTGAGA 


AAAAATCAAA 


AGAAGAAATT 


TATTTTAAGG 


5760 


GAGAGAAGCT 


ATCTAAAAAA 


GAAAGACTCA 


AAAACTCTTC 


ACTTGTTATG 


CAAGATGTAA 


5820 


ATCATCAATT 


ATTCACAGAT 


GAAGTATTCA 


ACGAGCTTAG 


ATTAGGAGTA 


AAGAATTTTG 


5880 


ATGAAGAAAA 


GGCGAAAATC 


ATTTTAAACC 


CCAATTATTC 


ACCCCAAATC 


TAAAAACCAT 


5940 


CCAGAATCCT 


TGCCTTAGCT 


TAGATCCTGG 


ATGGTTTCTT 


TTTTCACCCA 


ATCGGTGTTT 


6000 


TTTACTAGAC 


AAAAAAGAGT 


TTCCCCTTTA 


TGGTATAAGT 


GTAGAAAAAA 


ACACAAAAAG 


6060 


AAAGGAAACT 


CACATGAACA 


GTTTACCAAA 


TCATCACTTC 


CAAAACAAGT 


CTTTTTACCA 


6120 


ACTATCTTTC 


GATGGAGGTC 


ATTTAACCCA 


GTATGGTGGT 


CTTATCTTTT 


TTCAGGAACT 


6180 


TTTTTCCCAG 


TTGAAACTAA 


AAGAGCGGAT 


TTCTAAGTAT 


TTAGTAACGA 


ATGACCAACG 


6240 


CCGCTACTGT 


CGTTATTCGG 


ATTCAGATAT 


CCTTGTCCAG 


TTCCTCTTTC 


AACTGTTAAC 


6300 


AGGTTATGGA 


ACGGACTATG 


CTTGTAAAGA 


ATTGTCAGCT 


GATGCCTACT 


TTCCAAAATT 


6360 


GTTGGAAGGA GGGCAGCTTG 


TTCACAGCCA 


ACCTTATCCC 


GTTTTCTTTC 


CAGAACTGAC 


6420 


GAGGAAACAG 


TCCATAGTTT 


GCGATGCCTC 


AACCTTGAAT 


TGGTCGAATT 


CTTTTTACAT 


6480 


GTTCACCAGC 


TG 










6492 



(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 
AACTGAACGT AAAGGCTTCG ACGCAGAACG TGACGCTGCC CAAGCTGCCC TTGATGACCT 



60 
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TAAGAAAGCT CAAGAAGACA ACAACTTGGA CG AC AT G AAA ACAAAACTTG AAGCATTGAA 120 

CGAAAAAGCT CAAGGACTTG CTGTTAAACT CTACGAACAA GCCGCAGCAG CGCAACAAGC 180 

TCAAGAAGGA GCAGAAGGCG CACAAGCAAC AGGGAACGCA GGCGATGACG TCGTAGACGG 240 

AGAGTTTACG GAAAAGTAAG ATGAGTGTAT TGGATGAAGA GTATCTAAAA AATACACGAA 300 

AAGTTTATAA TGATTTTTGT AATCAAGCTG ATAACTATAG AACATCAAAA GATTTTATTG 360 

ATAATATTCC AATAGAATAT TTAGCTAG AT ~ATAGAG AATT ATATTAGCTG AACATGAT AG 420 

TTGTATCAAA AATGATGAAG CGGTAAGGAA TTTTGTTACC TCACTATTGT TGTCTGCATT 480 

TGTATCGGCG ATGGTACCAG CTATGATATC ATTAGAAATA CAAACATATA AATTTGTAAT 540 

ACCGTTCATA ATTGGTATGA TTTGGACAGT AGTTGTATTT CTTATGATCA ATTGGAATTA 600 

TATAGGCAAA TACTAAGAAG AGACAAAAAT ATATAAATAT TTCTGTACTT ATAGGATATT 660 

TAAAATCAAA ATAAAGTTAA TTTACTTATT TGCAGAGGTT GCAACCCAGC CTCTGTTTTT 720 

CGATAAAAAG GGACGGAATC TCA T TT G TTT GG G TTTTGTC TCATCAATAG AAAGGAACAA 780 

AGAGTGTTCG TAACTGAACA CGGGTTTCAG AATTTCTTAC TAAATATAAA AGAAAGGAAT 840 

TGAACCCGAC CTAAATGGTG GTTCGATTCA GAACATCAAT AGAAAGGAAT AAGGGTGTTC 900 

GTAACTGAAC ACGGGCTATG GACTGTGCCA AAAAGATAGT TTTTTCTACG ACGTAAGCGT 960 

CCGTCGTCAA AACTCCTAGA TGGCTGTGTC CGTTTGACGC CCTTTGTATC TTGAATTATG 1020 

AACAATACTG AATTTTATGA TCGTCTGGGG GTATCCAAAA ACGCTTCGGC AGACGAAATC 1080 

AAAAAGGCTT ATCGTAAGCT TTCCAAAAAA TATCACCCAG ATATCAACAA GGAGCCTGGT 1140 

GCTGAGGACA AGTACAAGGA AGTTCAAGAA GCCTATGAGA CTTTGAGTGA CGACCAAAAA 1200 

CGTGCTGCCT ATGACCAGTA TGGTGCTGCA GGCGCCAATG GTG G TTTTGG TGGAGCTGGT 12 60 

GGTTTCGGCG GTTTCAATGG GGCAGGTGGC TTCGGTGGTT TTGAGGATAT TTTCTCAAGT 13 20 

TTCTTCGGCG GAGGCGGTTC TTCGCGCAAT CCAAACGCTC CTCGCCAAGG AGATGATCTC 13 80 

CAGTATCGTG TCAATTTGAC CTTTGAAGAA GCTATCTTCG GAACTGAGAA GGAAGTTAAG 1440 

TATCATCGTG AAGCTGGCTG TCGTACATGT AATGGATCTG GTGCTAAGCC AGGGACAAGT 1500 

CCAGTCACTT GTGGACGCTG TCATGGCGCT GGTGTCATTA ACGTCGATAC GCAGACTCCT 1560 

CTTGGTATGA TGCGTCGCCA AGTAACCTGT GATGTCTGTC ACGGTCGAGG AAAAGAAATC 1620 

AAATATCCAT GTACAACCTG TCATGGAACA GGTCATGAGA AACAAGCTCA TAGCGTACAT 1680 

GTGAAAATCC CTGCTGGTGT GGAAACAGGT CAACAAATTC GCCTCGCTGG TCAAGGTGAA 1740 

GCAGGCTTTA ACGGTGGACC TTATGGTGAC TTGTATGTAG TAGTTTCTGT GGAAGCTAGC 1800 
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GACAAGTTTG AACGTGAAGG AACGACTATC TTCTACAATC TCAACCTCAA CTTTGTCCAA I860 

GCGGCTCTTG GTCATACAGT AGATATTCCA ACTCTTCACG GTGATGTTGA ATTGGTTATT 1920 

CCAGAGGGAA CTCAGACTCC TAAGAAGTTC CGCCTACGTA GTAAGGGGGC ACCCAGCCTT 1980 

CGTGCCGCTG CAGTTGCTGA CCAATACGTT ACTGTTAATG TCGTAACACC CACAGGCTTG 2040 

AACGACCGCC AAAAAGTAGC CTTGAAAGAA TTCGCGGCTG CTGGTGACTT GAAAGTAAAT 2100 

CCAAAGAAAA AAGCCTTCTT TGACCATATT AAAGATGCCT TTGATGGAGA ATAATACTCT 2160 

TCGAAAATCT CTTCAAACCA CGTCAGCGTT GCCTTCCCGT ATATATGTGA CTGACTTCGT 2220 

CAGTCGTATC TACAACCTCA AAACAGTGTT TTGAGCAGCC CGTGGCTACT TTCCTAGTTT 2280 

GCTTTTTACT TTATAGATTT TTTAAGACTT TCCTAAGTAA TGACGGACGG TACTGACCTC 2340 

CTTCGAAGTT CCATACCTAA ACTTTGAACC TAAGTTTTAA AGTTTCCGGA CAGCTGAAAC 2400 

CAAGCTGTTT CACGTGTTTT CATTACGCCA GAAAGTCTTC CATTTACTTC TGAAATGGTG 2460 

AATGATACTC TTCAAAAATT TCTTCAAACC ACGTCAGCGT CGGCTTGTCA TGGGTATGCT 2520 

TACTGACTTC GTCAGTTCTA TCCACAACCT CAAAACAGTG TTTGAGCTGA CTTCGTCACT 2580 

TCTATCCACA ACCTTAAAAC GGTGTTTTGA CCAGTCTGTG CCTAGCTTTC TAGTTTGCTT 2640 

TTTGATTTTT ATTGAGTATG AATTACCTAA ATTATGATGC ATAGTTGATG GGATATATAT 2700 

AATAGATTGA AATAGAATAT GAACAAATTG ATAAGAGGAT TTTAAAGTAA TCTCTAACAA 2760 

TGCTTTACAA ACTATGGTGT GCTATTCTAA ATTCAATTCA CTATAACTTC TTTACGTTTT 2820 

AAAAAAGAGC CGTCGGGCTC TTTTTACTTA TCTTCAGTTC CCTGCATTTC TTTTATCACA 2880 

GCTAGTCTAG TCTGGATATC CTTTTCCAAG ACCTTAAACT TGTAAGTCAA GTCTTCTTGG 2940 

TATTCCTTGA TAAGTTCTTT TTGCTGGTTA ATGATTTGCA GGCTGTTTTG GATAATATCC 3000 

ACATCGTCCT TGATAGCTTG AACGCGGTCA GTGGTATTCA AGACTTCATC TGTGATGGTT 3060 

TGGCGATTTT TTGTAACCAG ATAACTTCCG GCTGCAGCTC CTGCAAATAG CAGTAGGTTG 3120 

GATAATTTCA TAGCAACTCC TTAAGCGTTT TTGATGGTTT CAGCGACTTG AGCAAGTTTG 3180 

TCAAAGTCTG GTTCGTGGGC GATAAAATCA ATCTTGAGGT CATCGTCAGC ACTCTAGCGA 3240 

GGCACAAGGT GAACGTGAGT ATCAAAAACT GTTTGACCAG CGACTTCTTC ACAGTTGGAA 3300 

ATGATATTCA TACCAGCAGC CTTAGTGACT TTCATGACTT TTTGAGCTAC TTTTGGTACT 3360 

TGGGCAAAGA GTTGGcTGGC GCTCGTAGCA TCCATCTCCA AAAGATTGCG ATAGTGTTCT 3420 

TTTGGCACGA CCAAGGTGTG TCCTAGTGTT ACTTGAGAGA TATCAAGAAA GCCAAGCACC 3480 

TGCTCATCrr CATATACTTT TGAAGCAGGA ATTTCCCCTG CGATGATTTT ACAAAAAATG 3540 

CAATCTGACA TAAAATCTAC CTCTACTGTA CTGAATTTTG ATATAATATA GCTACATTAT 3600 
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ACCACATTTG GAGAAAATAT GTTAGAAATT AAAAACCTGA CAGGTGGCTA TGTTCATGTT 3660 

CCTGTTTTGA AAGATCTGTC CTTTACTGTT GAAAGTGGGC AGTTGGTCGG TTTGATTGGT 3720 

CTCAATGGTG CTGGGAAATC AACGACGATC AATGAGATTA TCCGTCTGTT GGCACCTTAT 3780 

AGTGGCTCCA TCAATATCAA TGGCCTGACT CTGCAAGGAG ATGCGACTAG CTACCGCAAG 3840 

CAGATTGGCT ACATTCCTGA GACGCCTAGT CTGTATGAGG AATTGACCCT CAGAGAGCAT 3 900 

ATCGAAACGG CTGCTATGGC TTACGGTATT GAGCAAAAAG TGGCTTTCGA ACGAGTAGAC "3960 " 

CCCTTGTTAA AAATGTTCCG TTTGGAACAG AAATTAGACT GGTTCCCTGT TCATTTTTCA 4020 

AAAGGGATGA AGCAGAAGGT CATGATTATC TGTGCTTTTG TGGTGGATCC AAGTCTTTTC 4080 

ATCGTGGATG AGCCTTTCCT TGGTCTTGAT CCGCTGGCTA TTTCTGATTT GATTCAGCTT 4140 

TTGGAAGTGG AGAAGCAAAA GGGCAAGTCT ATTCTCATGA GTACCCACGT GCTGGATTCG 4200 

GCGGAGAAGA TGTGTGATGC CTTTGTCATT CTTCACAAGG GAGAGGTGCG TTCCAAAGGC 4260 

AATCTCCTGC AACTACGTGA AGCCTTTGAT ATGCCTGAGG CTAGTTTGAA TGATATTTAC 4320 

TTGGCTCTGA CCAAAGAGGA GGATCTATGA AAGACTTGTT TTTAAAGAGA AAGCAGGCCT 4380 

TTCGTAAGGA GTGTCTTGGT TATCTGCGCT ATGTGCTCAA TGACCACTTT GTCTTGTTCC 4440 

TGCTTGTCCT GTTGGGCTTT CTAGCCTACC AGTACAGTCA ACTCTTACAA CATTTTCCTG 4500 

AAAATCATTG GCCTATCCTT TTGTTTGTAG GAATTACGTC T G TTTTACTT TTACTTTGGG 4560 

GAGGAACTGC CACCTATATG GAGGCTCCAG ACAAGCTCTT TCTCTTAGTT GGAGAAGAGG 4620 

AAATTAAGCT CCATCTCAAG CGTCAAACTG GCATTTCCCT AGTCTTTTGG CTCTTTGTAC 4680 

AGACCCTTTT CTTGCTGTTA TTTGCGCCTT TATTTTTAGC AATGGGTTAT GGCTTGCCAG 4740 

TTTTTCTGCT CTATGTGCTT TTATTGGGGG TAGGAAAATA TTTCCACTTT TGTCAAAAGG 4800 

CCAGCAAATT TTTCACTGAA ACTGGACTGG ACTGGGACTA TGTTATTTCT CAAGAAAGCA 4860 

ACCGTAAGCA AGTCTTGCTT CGTTTCTTTG CCCTCTTTAC GCAGGTCAAG GGAATTTCAA 4920 

ACAGCGTTAA GCGTCCTGCC TATCTGGACT TTATTTTAAA GGCTGTTCAG AAGGTGCCTG 4980 

GCAAGATTTG GCAAAATCTC TATCTGCGTT CTTATCTGCG AAATGGCGAC CTCTTTGCTC 5040 

TCAGTCTTCG TCTTCTCTTG CTTTCCTTGC TGGCGCAGGT TTTTATCGAG CAAGCTTGGA 5100 

TTGCGACAGC AGTGGTAGTT CTCTTTAACT ACCTCTTGCT CTTCCAGTTG CTGGCCCTCT 5160 

ATCATGCCTT TGACTACCAG TATTTGACCC AACTCTTTCC GCTGGACAAG GGGCAAAAGG 5220 

AAAAAGGCTT ACAGGAGGTA GTTCGAGGAT TGACCAGTTT TGTTTTACTT GTGGAATTAG 5280 

TTGTTGCGTT GATTACCTTC CAAGAAAAAC TAGCCCTTCT AGCCTTACTA GGAGCTGGTT 5340 
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TGGTTTTACT AGTCTTGTAT TTGCCTTATC AGCTAAAACG TCAGATGCAC GACTAACATT 5400 
GCTGATACGA CACTAAAAAA GAAGTTGAGT TCAGTCTCTC TCAACTTCTT TTTTGTTACT 5460 
ACACGATAAT GGTTGGTCCG TAGAGACTTA TACTCTTCGA AAATCTCTTC AAACCACGTC 5520 
AGCGTCGTCT TACCGTACTC AAGTACAGCT TGCGGCTAGC TTCCTAGTTT GCTCTTTGAT SS80 
TTTCATTGAG TATTAACTTG GTCTTGACTT GGTCAAAGTG GAAGCGGTCA TACGCCCGCC 5640 
AAGCGGCGCG AGTTGGAGCA TCTGGATCAA GAGCGCTGAG TCCCATGAGA AGACTCGAAG 5700 
TCTGGTAAAA TTTTTCTAGT TCAATCAAGA ATCGATTATC CACTGTTTCA GCCTTGGCTA 5760 
GAAAACCAAG AATAGAGTTT AATTGCTCCT GAAAGCGGAC GTCGTCAGCG CTTGCCTGTT 5820 

TGCATGCTTG GTAGGCTTTG TTTAAGTCAG TAATCAAAGT ATGAGCTCTT TTGATGGGGT 5880 

CTGTATCTGT CATGGGAATG CCTCCTTTAA TCTGCGTCCC AGTCTTACTT CTGCCAACTG 5940 

TGTTTTGATA CTGTTAGTTT ATCACTTTTA ATTCTTTTTT TTTATTCAAA TCTTTAATTG 6000 

TCATTGAAAT GTCTTGAATT GCGCTGAGTG AATTTTATCA TAAAATAGTT GTAAGCTCAT 6060 

CATGATGTTG TAGAAAATAA TCCTTTTAGG AGTTTTCAAA GACTGTTTAC GATTGGGTGT 6120 

GCTTGGGCTA GACCTTTTCT GTTATTCTTT TCTTAGGAGG AGAATCCAAT GAAATATATG 6180 

ATTATTCAGA CGCAGAAAAC AGTCTATAAA GTAAACATCG ACGATATCTA CTATATCCAA 6240 

ACACATCCAA CTAAAGCCCA TACCGTACAG ATTGTTACAG AAGAAGCTAG TTTTAATATG 6300 

CTTCAAAATT TAAGTAATCT TGAGAACCAA TGTGGGGAAA CCTTGATGAG ATGTCATCGA 63 60 

AATTGTTTGG TTAATCTTGA TAAATTAAAA TCGATTGATT TTCAAGAAAG AATCCTTTTT 6420 

CTCGGAGAAG AAGGTCAATA CGCTGTCAAG TATGCCAGAC GTCGCTATAG AGAAATTCCT 64 80 

CAAAAATGGT TGAAAGAGGG ACAGTAAGAA GATGAGAATA TTTGTTTTAG AGGATGATTT 6540 

TTCCCAACAG ACTAGAATTG AAACGACGAT TGAGAAACTT TTGAAAGCAC ATCATATCAT 6600 

TCCTAGCTCT TTTGAGGTAT TTGGCAAGCC GGACCAACTG CTGGCTGAAG TGCATGAGAA 6660 

GGGGGCCCAT CAGCTATTCT TTTTGGATAT TGAGATTCGA AATGAAGAGA TGAAGGGACT 6720 

CGAAGTGGCT AGAAAGATTC GGGATCGGGA TCCTTATGCC CTGATTGTCT TTGTGACGAC 6780 

TCACTCGGAG TTTATGCCCC TGTCTTTTCG CTACCAAGTG TCTGCTTTGG ACTACATTGA 6840 

TAAGGCCTTG TCAGCAGAGG ACTTTGAATC TCGGATCGAG ACAGCCCTCC TCTATGCCAA 6900 

TAGTCAAGAT AGTAAAAGTC TGGCGGAAGA TTGCTTTTAC TTTAAATCAA AATTTGCCCA 6960 

ATTTCAGTAT CCTTTTAAAG AGGTTTACTA TCTCGAAACG TCGCCCAGAG CCCATCGTGT 7020 

TATTCTCTAT ACCAAGACAG ACAGGCTGGA ATTTACAGCG AGTTTAGAGC ACGTTTTCAA 7080 

GCAGGAGCCC CGTCTCTTGC AGTGCCACCG CTCTTTTCTC ATCAATCCTG CAAATGTGGT 7140 
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GCATTTGGAT AAGAAACAAA AACTGCTTTT CTTT 7174 
(2) INFORMATION FOR SEQ ID NO: 190: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

CCACCAGGGA AAATCATTGA AGTTGGTAGT CACCAAGAGT TAATGCAGGC GCAAAGTTTC 60 

TACCATCATC TATTCAATAA ATAAGGAGAA TGTCATCAAT CCTAATCTTT TTAGAAGCGT 120 

CGAGTTTTAT CAGAGACGTT ACCATAACTA TGCGACAGTG TTAATTATAC CTCTTTCATT 180 

ACTATTTACT TTCATCTTGA TTTTCTCCCT TGTTGCCACA AAAGAAATTA CTGTTACTTC 24 0 

CCAAGGAGAA ATCGCCCCTA CAgTGTCATT GCCTCCATTC AGTCAACCAG TGATAATCCT 300 

ATCCTAGCTA ATCATTTAGT GGCAAATCAA GTAGTTGAAA AAGGGGACTT ACTCATCAAA 360 

TACT CTG AAA CAATGGAAGA AAGTCAGAAA ACTGCCTTAG CAACTCAATT ACAAAGACTT 420 

GAGAAGCAAA AAGAAGGACT TGGAATTTTG AAACAAAGCT TAGAAAAAGC GACTCATCTT 480 

TTTTCTGGCG AGGATGAATT TGGCTACCAT AATACCTTTA TGAATTTTAC TAAACAATCC 540 

CATGATATTG AACTGGGTAT CACAAACACT AACACCGAAG TTTCAAATCA AGCTAATCTT 600 

TCCAATAGCA GTTCATCAGC TATTGAACAA GAAATTACAA AAGTTCAACA ACAAATTGGA 660 

GAATATCAAG AGTTGAGAGA TGCTATCATA AATAACAGAG CACGCTTACC AACTGGCAAT 720 

CCGCACCAGT CAATTTTGAA TCGTTATCTT GTAGCCTCAC AAGGACAAAC ACAAGGAACT 780 

GCAGAGGAGC CATTTTTATC TCAAATTAAT CAAAGTATTG CAGGTCTTGA ATCATCTATC 840 

GCAAGCCTCA AAATTCAGCA AGCTGCTATC GGAAGTGTAG CAACTTATGA TAACAGTTTA 900 

GCAACCAAAA TTGAAGTACT CCGCACTCAG TTTTTACAGA CAGCCTCACA CCAACAACTA 960 

ACTGTGGAGA ATCAATTAAC AGAATTAAAA GTACAACTAG ATCAAGCCAC ACAGCGTTTG 1020 

GAAAACAATA CCTTAACCTC CCCAAGTAAA GGTATCGTTC ATCTGAACAG CGAATTTGAA 1080 

GGTAAAAATA GAATTCCAAC TGGTACAGAA ATTGCTCAAA TATTCCCTGT CATCACAGAT 1140 

ACAAGAGAAG TACTAATCAC TTACTACGTA TCTTCTGACT ATCTACCTCT ACTAGATAAA 1200 

GGACAAACTG TAAGATTAAA ACTGGAGAAG ATTGGAAATC ACGGCACCAC CAT CATC GGC 1260 

CAACTTCAGA CAATTGATCA AACTCCTACC AGAACAGAGC AAGGAAATCT CTTTAAATTA 1320 
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ACCGCTCTTC CAAAACTATC TAACGACGAT AGTAAACTCA TCCAATATGG CTTACAAGGT 1380 

CGCGTCACTA GTGTAACTAC AAAGAAAACA TATTTTGATT ATTTCAAAGA TAAAATTTTA 1440 

ACACATTCTG ATTAATTTTC ACATAACACT CTATAACTAT TTATTATCTT ATCAAAAAGG 1500 

AGAATCATAA CATGGATAAG AAACAAAACC TAACTTCATT TCAAGAACTA ACAACTACCG 1560 

AACTCAATCA AATTACAGGT GGAGGATTGT GGGAAGATTT ATTATATAAC ATTAATAGAT 1620 

ATGCTCATTA CATCACATAA GAACTTCATC ATCCAATACA ACTATAAAAA AATAAGACCG 1680 

AGAAACAAGT ACTCTCGGTC TTATTTTTCA TCATTCTGTA TCTATCACAG TAAGTACCTG 1740 

ACGAAAGACT TGATTTTGAC AGGTGGTATT TAGACTGGTA TTAGGATGGC TTTCCACAAT 1800 

CTTCATG AC G GTATAGAGAC CAACTCCTCT CTCCTCCCCT TTAGAACTGG CTCCAAAGGA 1860 

GAAGATTTCA GAAATATCGA TGCCCTCTTC TTTGATGGAG TTTTCGATGA TAAAGGTCTC 1920 

CTGTGCTCCA TTTTTTAAAA AGGCGATTGA AACATGAGGT TGACTAGCTT CCACACTGGC 1980 

TTCAATAGCA TTGTCACAAA GGATAGACAC AATGGTTAGA AAATCAAGTA GACTCATCCC 204 0 

CTCGACCTGA ATCTCCTCAG GAACTTCGAC ATTAAAGACA ATGTTCTTAT CTCTGGCTTT 2100 

TAAAAATTTC CCTGCTAGAA GACTTTTGAG GGCTTTATCA CGAATATTTA CCAATCTGCC 2160 

CAGGTCATAT TTATTGTTCT GCAATTTCTG ACTGGAATCC TTTAAGACGG AGCCATAGAC 2220 

CTCTTTTATC TGCTCCATAT CCTCCTCTTC AATGCCCAGA CGTAAGCTAG TCAAGAGGTT 2280 

GGTATAATCA TGACGAAAGC TCCGTACTTC CTTGTAAAGC TCCTCTATAT GCCGACTATA 2340 

GCGTTCCATA TCTCTATAGC GCAGGGCCTG CTCTTGTTCC AATCTCTCAT AGAGTTTTTC 2400 

CTTCAAATAG GTATCCAATT TCTTGATAAC CCCCATAAAA AAGAGTAGGT AAAAGACTAG 2460 

GATGAGATGG CGAACAGTCT TTGATTGAAT ACTTTGTTCA TATTCAAAAA AAGACAGACT 2 520 

TTCCATGACT AGATAGTAGC CACCCATTAT CCACTTAATC TGAGTCAGGG ACTTTTGAAA 2580 

GGCTTTATCG AGAATCTCCT TTCTCAAGCT AGTAAAATCG TAGTCCAACC ATTTCAAAAA 2 640 

AGCTAGAGAA ATGAAGAAAT TGAAAATTAT TATACATAAC CCAGTAAATG ACT AGC CATC 2700 

ATATACTTGC CCTTGTCCCA AAAATGGAAG CACAAAATAG GAGACTCCTC TATAAAAGAG 2760 

ATTCACCAAT ATCATTGGAA AGAGACCATA AAAGAAAAGG A G TTTTTTAG GAAGCCCTCT 2820 

CAATAATAAG AAAGATAAGC CTATGCCGTA CAAGGGTTCC ATAAAATAAG ATAGGTAAAC 2880 

ATTTCCTACT ATATAGCTAA TCATCACAAA AACAAAGGCC AACAGTATCT TCAAAAGAAA 2940 

GGCCTTAAAA ATCCTCTCGA AAGTAAGATC AATTCCATCC ACCTTAAAGA AGATGACAAT 3000 

TTCTAGTCCA TTAGTAACAA GTGTATACAA CAATATCCAA GCAATGTTCA TAAATTCTCC 3060 

TAGCTCAGTG TAATTTATTG ATGGCCTCAG ACACTTCCCT GACCTTATAA CGGGCGATTA 312 0 
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GACAACTTCC ACCATTGGGA GAGAAGAGCA GTTTTTCTTT CTTATCCAAA TGCACCACAT 3180 
TTGCAGGATT GATGAGAAAA GAGCGGT 3207 
(21 INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10357 base pairs 

(B) TYPE: nucleic acid 

(C) _ STRAND EDNESS double 

(D) TOPOLOGY: linear 



lxi> SEQUENCE DESCRIPTION: SEQ ID NO: 191: 
CTGAATCAAG TGTACTGCAC CAGTTCGTGC ATCAGGCATA ACAACATCTA CAGATATAAT 60 
ATTGTTTTCT GAGTCCGCCT CATAAGTTAA AATCATAAAT TTTTCGATAT TCGAATTTTT 120 
AGTAGCTTGT TCAATTTCTT GAATCATTTC ATCAGAAACT AACTCCATCT CAATTGGAAA 180 
GGAATGACTA TTTTCATCAT TTTTGTAGGA AGAATCTTGA TTAAGATAAA GTGTATTCAT 240 
CTGAGCATAT TCAAATAAGT AGCCACTCTT ATTTTTTTGT ACCAAAGGAA ATTGGTTTGT 300 
AAGTCGCTTC TTACCCTTTA TAATTAACAA TACTTTCCCA TATTTTTCTC TATTTGTTTC 360 
AAATTCTAAA TATCCCCAAG TCTGTCCTGC TAATTGTAAT TTATACTCAA ACAAATCTGC 420 
TGATGCAAAT GCAGTATCAA TATGATTAGG TCGCGTCCAT GCATAACCAT TCGACACTAT 480 
CATTGTCTCT CTTTTTTCTA GACGTTCATC TACATAATCT TTTTCCCCTT TCATCAAAGT 540 
ATCTACAATT TTTTGTGCCT CAAGCGAATC AAAGAGATCC TGATTCAACA TAATTCTTCC 600 
TCCTCCAAAT ACTTTTTAAT GAATTATACC ATTTTCTTAA AGAAATTACT ACAATAATTA 660 
TCTTTTTCTT AAAGTTCTGT GTCAGAGTAA TTTAGAAAAT TATATCTTCT ATAGTAAAAT 720 
CAATTAAAAA CTGAACAAAT TTATTGGGAA ATTCAAATCG CTTTCTGAAA ATATTTTAGG 780 
AACCGTAGTG TAATATTCCA GATTCAATTC ACTATAAAAC TGACCTTTCT CCTGCAAAAG 840 
AAAAAGGAAA GACTTCCTTT CCTGCCTTTC CTCTTACTTG CTACTTGTTT GATTATTTTT 900 
GGTAAGCTAC TGCTTGTCTG ATAAAATCCT GAATCGGCTC TCCTTGGTGG AGAGCTTTTA 960 

CTATTTTCGA ACCGACGATA ACACCATCTG ACACCGCATT GAAGCGTTCC AGATCGGCTT 1020 

GACTAGATAC ACCAAAACCT GTCAAGACTG GGATGTCGGC CACTTGATGA AGTTGCGCCA 1080 

AGTGCTTGTC CAAATCTGCA CGGTAATTGC CTGATTTCCC TGTCACTCCA TTGATGGCAA 1140 

CGGCATAGAT GAATCCCTCC GCCCCTTCAA TCAACTCTTT CTGGCGCTCA ATTCCTGTGG 12 00 

TCAAGCTTAC TAAAGGAATC AAGGCGATAT CTGTATTTGC CAAAAATGGT TCTACAAAGT 12 60 
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TGGCATGTTC ATGAGGCAGC TCTGGGATAA TCAAGCCCTT CACAGCTGTA TCAGCCAGAT 1320 

CTTTGACAAA GTTCTCCACA CCGTACTGAA AGAGGGGGTT GAAGTAGGTC ATGATGACCA 1380 

GTGGAATCTC TGTTTCAATG CTTTTCAAGG TTTCAACTAA AGCCTGGGTA GAGCTCCCGT 1440 

GGGCTAAACT GCGCAAGCCA GCTTCTTCGA TAACAGGTCC ATCTGCAACA GGCTCTGAAA 1500 

AGGGAATACC CACTTCAATT GCAGAGACAC CCAAATCTTC TAAAAAGTGA ATTGTTTCAG 1560 

CAAGACCGTC CAAACCTTTC TCGTGGTCAC CAGCCATGAT ATAGGGAACA AAAATTCCTT 1620 

TTCCAGCTGC TTTAATAGCA TTTAATTTTT CTGTTAGTGT CTTAGGCATG AGCTTCTCCC 1680 

TTCTTTGCTG CATCTGCTTC CAAGCGGTCC TTGACTTGAA CCACATCCTT GTCCCCACGA 17 40 

CCTGATAGGC AGACAATCAT AGACTTTTCT GGTCCAAGTT CTTTGGCCAA TTTCACCCCA 1800 

AAGGCGATAG CATGGCTAGA TTCCAAGGCT GGGATAATCC CTTCCACACG AGACAAGAGT 1860 

TGGAATCCTT CCAAGGCTTC TTCGTCTGTC ACAGGGACAT AGCTGGCACG TTTAATATCG 1920 

TGGTAGTGAG AATGCTCTGG ACCGATACCA GGATAGTCCA AACCTGCTGA GATAGAGAAG 1980 

GCTTCAAGAA TTTGACCATG GGCATCTTGG AGCACATCCA TGAGGGAACC GTGAAGGACA 2040 

CCTGGACGAC CCTTGGTCAA GGTAGCTGCG TGGTGCTCTG TATCCACACC AACCCCTGCT 2100 

GCTTCAGTTC CATACATAGC TACTGACTCA TCTTCTACAA AGGGATGGAA GAGCCCGATA 2160 

GCATTCGACC CACCACCAAC ACAGGCTACT AGGGCATCTG GCAGATCTCG ACCTGTCAAG 2220 

TCACGGTACT GTTGTTTAGC CTCTCGACCG ATGACACTTT GGAAGTCACG AACGATTTCT 2280 

GGAAATGGAT GAGGCCCCAA GGCAGAACCA AGGATATAGT GGGTATCGTC GATATTACCC 2340 

ACCCATGAAC GAAGGGCTGC ATTGACCGCA TCCTTGAGCA CGCGCGAACC ATCTGTTACA 2400 

GCCTCGACCT TGGCTCCCAA AAGCTCCATG CGGAAGACAT TGAGGGCTTG GCGTTTCACA 2460 

TCTTCCTCAC CCATGTAGAT GGTACATTCC ATGTTAAAGA GGGCTGCAGC AGTTGCAGTT 2520 

CCCACACCGT GCTGACCAGC ACCCGTTTCT GCCATAATTT TCTTTTTACC CATGCGTTTG 2580 

GCAAGCCAAA CTTGTCCTAA GGCATTGTTA ATCTTGTGGG CTCCTGTATG GTTAAGGTCT 2640 

TCCCGTTTGA GATAAATCTT GGCTCCGCCA ATATGCTGGG TCAAGTTTTT TGCGTAATAA 2700 

AGAGGAGTTT CACGTCCTAC GTACTGGCGC AAAACCTGGT TTAATTCCTC TTGGAAACTT 2760 

GGGTCTGCCT GACTTTCACG GTAGGCCTTC TCCAACTCCA AAACTGCTCT CATCAATGTT 2820 

TCTGGGACAA AACGTCCGCC GAATTTTCCG TAAAATCCAT CTTTATTTGG TTCCTCATAT 2880 

GCCATGCTTT ACCCTCTCTA TAAATCTTCT AATCTTTTCA TGATCTTTTT GTCCATCTGT 2940 

CTCCACTCCG CTCGATACAT CTACTGCATA GGGAGTAAAG TCTTGAATTG CTTTTACTAC 3000 

ATTATCTTCA TTAAGGCCAC CTGCGATAAA GAAGGGCTGT GCTAGTCCAG TCGTATCCAG 3060 
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TTGACCCCAA TCAAAGGGCT GGCCACTTCC TGCCACAGGG GCATCAAAGA GTAGATAATC . 3120 

TGCCTGAGAA TTCGGGACAT GCCCATTTCC ATCTACCTGC ACAGCCTGAA TACTGGCACA 3180 

AGGCAAATTC TCAAATAAAT CATCTGCCAC CTGACCGTGA ACTTCAACCA ACTCCAACCC 3240 

AACTTTGTCA ATCGCTTCCA GCAGTTCTAC CCGACTTGGT GAAACAAATA CTCCAACCTT 3300 

TTTCACATCT GCAGGAATAA GCTTTGCCAA CTCAGCTGCC TCTTCTAAAG TCACCTGTCT 3360 

TTTACTAGGT GCAAAGACAA AACCGATATA GTCGGCTCClT GCTGAAACGG CTGTTTCCAC 3420 

CGCTTCTTTG GTCGATAGTC CACAAATTTT AACCTTTGTC AATCTGCAAC TCCTTGATTC 3480 

TCTGGGCCAC ATTTTCTGCC TGCATAAGAG CTGTCCCTAC CAAAATTCCG TTAAAGTATG 3540 

GGGCTAGTCG TTCCGCATCC TGCCCTGTCA AAATGGCAGA TTCAGAAATG TAATAGCGAC 3600 

CTTCCTCAAA GTAAGGGGCT AAATCTACAC TGGTCTGCAA GTCGACCTCA AAGCTAGTCA 3660 

AGTTCCGGTT GTTGACCCCG ATAATCTCAG CACCAAGTCT GTGGGCTACC TCTAGTTCAG 3720 

CTAGATTGTG ACTCTCCACT AAGACTTCCA GACCAAGCTC TGTCGCGTAG TCATACAGTT 3780 

CCTTGAGGCG TTCTTCGGAC AAGGCTGCCA CAATGAGCAA GATAACTG7C GCACCTGCAT 3840 

TGCGAGCGCG GATGATTTGC TTTTCATCGA TGATAAAGTC TTTCTTGAGC GTCGGAATCT 3 900 

CTACCTGACT GGAAATTTCC CGTAGATAAT CCAAATGCCC TTTAAAGAAA ACCTCATCTG 3960 

TCAACACCGA AATCATCACT GCTCCGTTTT CTTCATAAGT CTGGGCCTGT TGCACAATAT 4020 

CCACATCGAG ATTGATATCT CCCAAACTAG GCCTACCTTT CTTGACCTCA GCGATTACCT .4080 

GCAAGCGGTC CTGATGATTC TTCAAAAATT CTGCCAAGCG ATAGGTCTGG CGCAGAGGCT 4140 

GGATTTGCTC CAGCTTCATC TGCTCCACCT CACGCGCCTT CTCCTCTAAG ATTCGTGCTA 4200 

AAAATTCCTG ACTCATTTTT GGTACTCCTG TAACAGTCTC AGTTTTTCAA GGGCCTTGCC 4260 

TCTAGCAATC ACTTGACGGG CCAAGGCAAC CCCTTCCTTG ATGCTATCAA TCTTACCATT 4320 

AGCATAGAAA CCAAGACCAG CATTGAAGAC TGTCGTTTCC AAGAATGGAC TTGCTTCGTT 4380 

TTTCAGAACG CTAAGCAAAA TTTCTGCATT TTCCTGAGCA TTCCCACCAC GAATATCTTC 4440 

CATAGCATAG CCTTCCATTC CCAAATCCTC TGGAGTAAAG CTTGACAAGC TGATTTCGCC 4500 

ATTTTCAAGA AGTGCAATCT TGGTTGTTCC GTTCAAGCCA GCTTCATCCA ACCCTTCTGG 4560 

TCCAGCAACC ACGATGGCAC GTTTGCGACC CATATTTTTC AAAACCTGAG CTGTACTTTC 4620 

TAGGAGTTCT GGACGACTAA TTCCAAGAAG CTGTGTTTCT AAAGCCATTG GATGAATCAG 4680 

TGGACCAGTC AAGTTCATAA TCGTTGGAAT TCCCAATTCC AAACGAGCTG GCATGATGTA 4740 

TTTCATAGCT GGGTGCATAT TTTTAGCGAA GAGAAAGACG ATTCCAGTTT TATCAAAGAC 4800 
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CTTACCTAGT TCAGCTGGTT TGAGCTCAAG ATTGATTCCC AAGGCTTCGA GGACATCTGC 4860 

GGAACCAGAT TTAGAAGATA TCGAGCGGTT ACCGTGTTTG GCCATGTCAA TACCGCCACC 4920 

AGCCAAGACA AAGGCTGCAG TTCTGGAAAT ATTAAAACTG AAAGACTTCT CCCCACCTGT 4980 

ACCACAGTTG TCCATGGCAT CATGAATCTC AGTTGGAATA TGCTGGGCAT GTCCTCTCAT 5040 

GACTTGGGCA ATGGCTGTGC GTTCTTCAGG TGTTTCCCCC TTCATCTTAA GAGCTAAGAG 5100 

GAGAGAAGCA ATCTGCGCTT CAGTTACACG CCCAGTTACG ATACGCTCAA TCACATCCGT 5160 

CATTTCCACA CCTGATAAAT TTTCAAATTT TGCTAGTTTT TCAATAATCT CTTT CA TCCT 5220 

AGTTTCCTCA CTTTACAACC TCCTCGATAA AATTCCGAAT AGAAGACAAG CCGTCTGGCG 5280 

TTCCAATGCT CTCTGGATGG TACTGGAAGC CATAAATCGG TAGGTTTTTA TCTTGAATCC 5340 

CCATGATGGC TTGGTCATCA GTCGAACGAG CTGTCACTTC AAAGTCTTCT GG CATTTCCT 5400 

CAATCAAAAT ACTGTGATAA CGCATGACCG CACGGCCATC CTCAATACCT TGATACAAAA 5460 

CAGATGGCGC TTCAAAGTTG ATATTGCTCT GTTTCCCATG CATGACTTTT GGAGCCAAAC 5520 

CTAGCTTACC ACCAAAGACT TCTGCAATGG CTTGGTGGCC CAAACAAATC CCAAGAATCG 5580 

GCTTCTTGCC TGCAAAATCA CGAATCATGT CTTCCATCTT TCCAGCATCA ACTGGCCAAC 5640 

CAGGACCAGG AGAAAAGACC AGACCATCTG CTTTTTCAGC TTCTTCATAC AGCTTGGAAT 5700 

CATCATTTCT CAGAACCTGA ACTTCTGCAA AATTCCCAAT GTATTGGGCC AAGTTATAGG 5760 

TAAAAGAATC ATAGTTGTCA ATCAATAAAA TCATGGTCTT AGTTCTCCAA TTCTAGTCAT 5820 

AGATTTTGCT TTGTTAATGG TTTCTTGGTA TTCGTTTTGG GCGATAGAGT CGTAGACAAT 5880 

CCCTGCCCCA GCCTGCACAT AGGCTCTTTG ATTTTTGAGA ATCATGGTTC GGATGGCGAT 5940 

GGCCAAATCC ATATCACCCG TCGCAGACAA GTAGCCGATT GCCCCAGCGT ATACTCCCCG 6000 

TTTTTCCGTT TCCAGTTCAT AGATACGTCT CATCGCTCGA ATCTTTGGTG CTCCAGAAAC 6060 

GGTTCCAGCA GGAAGCGTTG CTTTCAAGGC ATCCATGGCA GTGAGTTCTC GAAGCAAACG 6120 

CCCCTTGACT ACGCTGGTCA AATGCATGAC GTAGCGGAAG AGCTCCACTT CCATATACTT 6180 

AGTGACTTGG ACACTGGTCG TTTCAGAGAT GCGGCCAATA TCGTTACGCC CCAACTCTAC 6240 

CAACATTCGA TGTTCTGCTG TTTCCTTCTC ATCAGAGAGG AGGTCAGTCG CCAAGGCCTT 6300 

GTCTTCTTCA TCCGTAGCCC CTCTTGGTCG CGTCCCTGCA ATCGGATTGG TTGTCACGAT 63 60 

GCCATTTTTG ACAGAAACCA AACTTTCTGG ACTAGCTCCG ATGATTTGAT AATCCCCAAA 64 20 

ATCATAGAAA TAAAGGTAAT TAGAAGGATT AGTCACGCGG AGATTTCTGT AGAAGTCAAA 6480 

TGGATTTCCA GTAACTTCTG CTGAAAAACG CTGGCTGAGT ACACATTGGA ACATATCTCC 6540 

GTTACGAATC AAGTCACGAG CT GTT TCTAC CATTCCCTCA AACTTATGTG GAGCGATATG 6600 
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ATCTCTCATT 

niw k \m k \~r\ k A 


cctgt err Afi 


ATATCTCCTG 


»P ft ft /*> ft /^/"» MM 

T AA C AGGCTG 


TGCGATAAAG 


7320 


GGCACTCCCT 


TOAfiAATfSAT 


Ul 1 1 1 V— I lv.1 


t- r\_tj 1 1 ia.Au 


ft ft ft ✓"»#■*>/■* « m 

ATGAAC-CCAA 


CTTTACAGCT 


7380 


T w rv x, r £*tgct* r 


villi I. AV9t_A 


Av.V_ALAAU»i, 1 


t- i Liu riiAGA 


^* ft ft ft i - "* ft ft 

GAAAGAACTG 


TAATTTTTCC 


7440 


ATCTATTATT 


i i 1 i nVi^ 1 lw 


1 AU 1 A*j lv> 1 Vj 


vaa 1 UuLAuV. 


TAGGTCCTTG 


CCTCCACGAC 


7500 


C AGAG AC ATT 


CI ATfi A AH Af! A 


*^^^^P^Pl^ & ^^^^^^ 
i VJ 1 1 V.A IV. 


va* l alalui T 


TATACTCTTC 


GaaaATCTCT 


7S60 


TCAAACCGCG 


TCAA£*£?TfV:P 


1 VJV.V-VV4 Al* 


vi l A IWuAL 


TGACTTCGTC 


AGTTCTATCT 


7620 


GCAACCTCAA 


a iPWPWfn 

rt/V_rtVj 1 \j ill 


Tt*l A**VTV^ A f*T 
IVjAlA. IviAs, 1 


4A.v» i LAu rru 


ft ^/^/^ » ft ft 
TATCCACAAC 


CTCaAAACAG 


7680 


4 *J 4 4 4 4 WO\JW 


1 MAW 1 1 1 V. 


Av» I ILIA 1 v_k_ 


Ak*AAU LTLAA 


AAI.AO 


Iftfl ft AMUt ft / 

TGAGCTGACT 


7740 


TCGTCAGTTC 


TATCCACAJM" 1 


(TP A AA. AP Art 


lUi 4 4 i VjAVj*— 


auLL iGv.GvjC 


TAGTTTCCTA 


7800 


W 4*4 4^44 


4 1114 V. A 1 


1 **J/\Vj 1A1 1 AV» 


1 AvjL l"l~i"l"l I" 


t, GT ATT AGTC 


C AGC L~i~i~i~i~;' 


7860 


GTTTGCTTTT 


AGTAGTAGGC 


ATGGAGCTGT 


AGATAGAACT 


CAAGTTCATC 


AAAGCGACTT 


7920 


AAGGCCCTAA 


TAAAAGATAA 


ACCAAACGAC 


GGATAGAAAA 


AAGCCCACAC 


ACAGAATATA 


7980 


CTTCCGTGTG 


AGGCCGTTGG 


TAACGCGGTG 


CCACCTCAAT 


TATAAAGGGA 


CTATCCCTTT 


8040 


ACATCTCTGC 


CTTCTTtAAC 


AACAAGCTGC 


ACTGTAAGGT 


GTGCGCACCG 


AATTTTCATT 


8100 


GTTTCAAATT 


CATTTTCAAA 


ATCAGCCCAC 


TTTCACTACT 


TCCAACCACC 


TATTCACAAT 


8160 


CACCACAGGC 


TCCCTGAAGA 


TCAAAAATAG 


TTACTTTTCT 


GATTTGTTGA 


ACTTATTTTA 


8220 


ATACTTTGTT 


TTTTCTTTGT 


CAAGACTTTT 


TTACGATTTT 


TTTGAAAATA 


TCATTCGAAT 


8280 


ATGACCATGT 


CTTCCTTAGA 


TCGAACATGA 


ACATGTCCCA 


CTTCTTAGAA 


ATTGGATCCA 


8340 
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ACTCAATACA AACTGAATGG AGCCTAAACA GAACTTATTT TAGAACACTC CATCTTTTCC 8400 

ACTAGGATTT TCAAGAATTA AACAATACTA GAAACTCTGT CTCCTAACAA ATTTAGGACA 84 60 

AACTTCAACA GATGTGACAC TTTCCCCTTT AATAATTGCT AAAACACCTT CTATCATTTC 8520 

TTTAGCCAAT TTAACATAAT TGGCAGCAAT TGTAGACAAA GCTGGAGTAT AATACTGACA 8580 

AATAGGAATA TTATCAAATC CAATGATAGA AATATCATCT GGAATAAGAA TTCCTTTCTC 8640 

ATAGCACGCA CGAATCAAGC CCTGAACCTT TTCATCTCCT GAAACAAAAA TAATGTCCGG 8700 

ATAATTTTGG GTAGTCAAGT GCTGCATTGC ATAAGAATAA ACTGAATCAA TTGTAGATAA 8760 

GCCATAAATG ACTTTTAAAT CCATAAAGTA ATTTTTATCA TTCAGAAAAG AACGCACACC 8820 

TCTTTCACGA TCCTTATTAA CATGGGATTC TCCTCCCATA AGCAACCACA TATTTTTAAA 8880 

TTTTTCTTCA G TTAC AGCTT TCATCATATC ATAAGTAGCT TGAAAATTAT TATTAGATAC 8940 

ATAGACTACT CCAGACGTTT GAGATTCACC GAAAACAAGA AAAGGCATAT GGTTCTTCTT 9000 

TAAATACTGA ATTCTGATAT CAT CT AC ACT TTCATAAAAA ACAATAACAC CATCTACTAG 9060 

GCTACCTGTG CTTGATATAA TTGAATTACT AATTGTATCC TCCTCTCCAA AGTACTCAAC 9120 

TATAGCATTA ACACCAAATT CTTTACACGT CCGTAACACT TTATCTAACA GCGTATGAAA 9180 

CCAAATTAAA GGAAAAGAGT CGATTTTTTT TACAGAAATC AATATATTTA TAGCTTCTTT 9240 

TTTAGTTAAA TTTTTTGCAT ACGCATTTGG AATATACGAC AATTCCTCTA TAACTTTTTG 9300 

AATCGCTTGA TAAGTTTCTT CTTTAACATT TACTCCACCA TTAATAACTC GTGAAACTGT 9360 

TTTTGGAGAA AAACCTGATA AACGTGCAAT ATCATAAATA GTTACCTTTT TCCCATTTAT 9420 

ATTTTTCATT TCAGTCCTCC ATTACGAACA TTCTAATATT ACTATACAAT ATTTAATTTT 9480 

TTTTAACAAG AGAATTTAGT AAATTATTTA AGATCCACAA ATTCACAAAA TTAATTTTAC 9540 

AAATATTCTT CCCCTTCAAA AAAGTTTAAA TTGCATTTCA CACCTTTATT TTTAAGAATG 9600 

TTTCCAACTT CACGACAAAT AAATTCATAT GAGAAAAAAC TGCCATAAAA TTGTAGATTA 9660 

ACTTTTTCAG TAAAATGTGT AGGATTTATA AAAACATATA ATAGCCTGTC AATGTAACAT 9720 

TTTAACATAG AGTTAATTTT TTCTTTAAAG ATAACATTTG TTATCAACTC ATCAGGAGGT 9780 

AAATGAAAGG CAAACACCAT TTCACAAATA TCATAAAAAG AAATAAATTT GTATACTTGT 9840 

ATCAAACAAT TATTATCAAA ATATTCTATT TTACCTAAAT CAAAATTGAT TTTATAATCT 9900 

TTCATAAAAA CCTCTGAGCA AAAATCTACT CAAAAATTAG ATGATTAAAA CATCTAAAAA 9960 

GCAAAAGGAC AAAAACATCT GTCCCTTTGT TTACTAAATT TCAGCTAATT TCTTCGACAT 10020 

AAATAACACC TACAATATTA GCAATTTCTT CCATCAGTCG AAGATGTTCA AATCTACCTG 10080 

ATAATTCCAG AGTAATAAAT GACCCTATTT TTTTCTCCGG AACATCAAAG TATTCAATTC 10140 
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TCTCAGAATT AACATCTCCA AACGCTCTTC TTGAATCGGT CATTCTGATA CCATTTTCTG 
CACAATAAAC CAATACACGA TTATAGGCTT CTGTAGATTT AACCACTATA TACAATTCAA 
TCATTTTAGA ACGATTTTGC AGATATTTTT TTACTGGTTG GAACATGGAT ATCACACCCC 
AAACAGAAAT GGCTACTAAA AGAGCTCCCT CATAAGG 
(2) INFORMATION FOR SEQ ID NO: 192: 

( i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6867 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



10200 
10260 
10320 
10357 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

CGGGACATTC TCAATCTTCT GTCTTTTGTT TTTCTCTTCT TTCTATGATA CAATGGAAAA 60 

AATAAATTCA AAAGGAGTTT TTTTATGACT TATCCAAATC TCTTGGACCG CTTCTTAACC 120 

TATGTTAAGG TCAACACGCG CTCTGATGAA CACTCTACTA CTACTCCAAG TACACAGAGT 180 

CAGGTTGACT TCGCAACAAA TGTCCTAATT CCTGAAATGA AACGTGTTGG ACTGCAAAAT 240 

GTTTACTATC TACCGAATGG TTTTGCTATT GGAACCTTGC CAGCCAACGA TCCGTCTTTA 300 

ACACGTAAGA TTGGTTTTAT ATCGCACATG GATACTGCTG ATTTTAATGC TGAAGGAGTC 360 

AATCCACAGG TAATTGAAAA CTACGATGGT GGTGTGATTG AACTAGGGAA TTCTGGTTTC 420 

AAACTCGATC CAGCTGACTT CAAGAGTCTT GAAAAATATC CAGGACAAAC GCTCATCACA 480 

ACAGATGGAA CAACCTTGCT AGGTGCTGAT GACAAGTCAG GAATTGCTGA AATTATGACA 540 

GCCATTGAAT ATCTAACTGC TCATCCTGAA ATTAAGCACT GTGAGATTCG TGTTGGTTTT 600 

GGTCCAGATG AAGAAATCGG TGTTGGTGCC AATAAATTTG ATGCAGAAGA TTTTGATGTG 660 

GATTTTGCCT ACACTGTTGA TGGTGGTCCA CTAGGTGAAC TTCAGTACGA GACTTTCTCA 720 

GCCGCTGGTG CTGAATTGCA TTTCCAAGCT CGTAATGTCC ACCCTCCTAC TGCCAAAGGG 780 

CAGATGGTCA ATGCCCTTCA GCTAGCAATT GATTTTCATA ATCAACTTCC AGAAAATGAC 840 

CGACCTGAGT TAACTGAAGG TTACCAAGGT TTTTACCATC TAATGGATGT GACAGGTAGT 900 

GTTGAGGAGG CGCGTGCAAG CTACATCATT CGTGATTTTG AAAAAGATGC CTTTGAAGCG 960 

CGTAAAGCAT CCATGCAATC TATCGCTGAT AAGATGAATG AAGAACTTGG GAGCGACCGT 1020 

GTCACTCTCA ACTTGACAGA CCAGTACTAC AATATGAAAG AAGTCATTGA AAAAGATATG 1080 

ACTCCAATTA CCATTGCTAA AGCCGTTATG GAAGATCTAG GTATCACGCC TATTATCGAA 1140 
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CCAATCCGGG 


GTGGAACACA 


CGGCTCTAAG 


ATTTCCTTTA 


TGGGAATCCC 


AACTCCGAAT 


1200 


ATCTTTCCAG 


GTGGCGAAAA 


TATGCACGGA 


CGTTTTGAAT 


ACGTTACCCT 


TCAGACTATG 


1260 


GAACGTGCAG 


TTGATACCAT 


CATTGGCATC 


GTAGCTTATA 


AAGGCTAAAA 


AGACGAGGTA 


1320 


GCTCAGCTAC 


TTCGCCTTTC 


TTTTTATTCT 


ACTGGTTTTT 


CTTGATTTCC 


AGTAGTTGTA 


1380 


GAAGATTCTG 


TTGTTTCATT 


TTCTGAAGTT 


GATTCAGCAG 


CTTTAGAATC 


TCTTGTATTG 


1440 


CTTGGTTTGT 


TTTCCTCGCT 


AGCAGTTTCA 


ATCTTACATT 


CTGCAGTTGC 


GTTTGGTTGG 


1500 


TTCTCAGCAC 


TGGTGTTATC 


ACCATTTGCT 


TCAGCATTTC 


TTGCTGGACT 


TGTTTCTTCA 


1560 


CTTGCGCTAG 


CTTTTGACTG 


GATTTGATGA 


TTCAAAACTA 


GAATAGCTTT 


TGTCGATTCA 


1620 


AGTAAAGCTG 


TTTTGTCTTT 


ACTCTT AG C A 


GAAACTTGAT 


CTAATAATGC 


ATCCACCTTA 


1680 


TCAAAGTCCG 


CATCAGATCC 


ATTATTACTT 


TCTAAATAAG 


AGTGAAGCGA 


CATGAGAATA 


1740 


TCGTAGAGTT 


TTTGATAGAG 


TACAAGTCTC 


TGACGATCTT 


GCTCAGCATT 


TTCCTTTTCT 


1800 


TGTTGAAGGG 


CGCTAGCGAT 


ACGAGTCAAG 


ACATCTTTTA 


CCTGACTGTT 


TACTTCATCC 


1860 


AAGTCTGCAT 


CAGCCTTGTT 


TGTGGCAGCT 


TTTAGATTTT 


CTACTTCTTC 


TGCCAAGGAT 


1920 


TGTCTGATTC 


CTTCTTCATG 


GATTTGTTCC 


AACAGTTGAT 


TTGCCTTGCT 


CAAAAGACTT 


1980 


TCTACTTCTT 


CCTTGCTATC 


TGTCGCAGAT 


TATTGGTTGC 


TATCTACCAT 


CTACTCCTAA 


2040 


AACAGGAGAG 


TTATAATCCA 


AGATTACAAG 


GCCTTACAGA 


AATAAGAAAT 


CCAGATAAGA 


2100 


CAATGTTCGT 


CCAAGACGCT 


ATTCGCTTCG 


CACAGCAGCA 


CGGATTCAAT 


ATGCTTTAAT 


2160 


TTTAAAGTTT 


AGGTGTCAAG 


ACCTCTTTTT 


AGTGTGCCCA 


AAATTTAGAG 


AAGTAATCAA 


2220 


TCAACTAACT 


TTTATTTTTT 


TCAAACTTTC 


AGTAAACTGA 


CCTAAAGCTA 


ACTCAATCTG 


2280 


TCTTTGTAGA 


TGCTTCTGCT 


ATCAGCTAGA 


AGTTGATCTA 


CTTTTGCCAA 


GACTGCCTTC 


2340 


TCATCAAAAG 


TTCCAGGTTC 


ATAGTTGGAT 


TGCAGGGATG 


GAATCTTGTT 


TTTCAAAGCC 


2400 


GCTTCATATC 


CCTTAGTTTG 


AACCTTGATG 


TAGTGATTGT 


GGTCGCCATG 


AGGAATCACA 


2460 


AAACCTTCTG 


AATCTTCACT 


TATAATTCGA 


TTGGCATCAA 


AACCATGACC 


ATCTTCTTCC 


2520 


TCATGATGGA 


CATGTAGTGA 


CGGATTACTT 


AATACAGAAC 


TAGAAGAACT 


TCCTACCTCT 


2580 


TCCCTGTTAG 


AGTGTGATGG 


GGGATTGTTA 


AGAGATGACT 


TAGGAATATA 


GTGATAGTGA 


2640 


TCCCCATGTC 


TTACTATATA 


AGCATCACCT 


GTATCTCTGA 


CAATATCATT 


AGGGTTAAAG 


2700 


ACATATCTGG 


CTGCTAATTC 


ACCTGCCGAC 


AAGTCACTCT 


CAGGAATGAA 


ATGATAGTGA 


2760 


CCACCATGTG CTACTATAGT 


AGATTGAAAT 


AGAATATGAG 


CAAATTGATA 


AGGGGATTTT 


2820 


AAAGTAATTT 


CTAACAATGA 


TTTAGAAACT 


ATGATGTGCT 


ATTCTAAATT 


CAACTCACTA 


2880 


TATATAACCA 


TCATCGGTAG 


TATAACGTCC 


CTCTAATTTT 


GCTACAGATA 


CTTCTGCACT 


2940 
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ACCTCCTTTA TCCTCTTTAC CATGTTCTTG TTTTTGCCGA TTGATTTCAT CTTTTGTTCG 3000 

TACATTTTCT GCATGAGCTT GATCTTTAAG GTAAACATAA TACTTTCCAT CTACCTTAAT 3060 

AATATATCCT CCCTTAACCT AACTGACGAT ATCTTGATCT TTCGCCTGAT AGTTGGGGGC 3120 

TTTCATTAAT AGCTCTTCAC TAAAGAGCGC ATCAAAAGGA ACTTTACCAT TATAGTAGTG 3180 

ATAATGATCG CCATGAGAAG TTACATAACC TTGATCTGTA ATCTTAATAA CAATTTGTTT 3240 

TGCTTGAATT CCTTCTTTTT GACTAACCTA GTCTGGAGTC AAATTTTCAG TCTTCTTAGT 3 300 

GTCTTTATTA CTGTTTACAT ATGAAACACG ATTTTTATCT GTATTGCCCT GTTAGCTATG 33 60 

TTGGTTCAGA GCATAAACAC ACAGACTTAA GGAAAGGATA ACAACAGATC CAGCTGCTAT 3420 

ATATTTCTTT TTAAATTTCA TAATTACCTC ATTTCTATAA TTATTTATAT GATGTCTTCA 3480 

TTATTAAATG ATTAAATAAA TTAATTAACC AATTAATTAA CTAGTAAATA TTCCACCTCT 3540 

TTTTAAGTTG TATGTCAAGA AATTTTATAT ATTAATAATA AAATGAAATT CTCCCAAAGT 3600 

CAGAGTTTTA TTTCTAACTT TTGAGAGAAC TTCATTTTTG ATTCAGACTT TTTCTACTGC 3660 

TATTCCTTAC GCTATGAGAT CAGATAAATT CTTTTTTATC ACTTCTCCAC TTGGCAATCT 3720 

TAATTCAATC GTTCCATCCA TATTGAATAT AACACTATCT AAGCCTAATC CGTAACTACC 3780 

TGTAAATTTT TCTAATTTTT CTTGTACAGG ATCTACTGCT GGAGCTTCCT CTAATGCTGG 3840 

ATCTAACATA GGCTCACTCC CCACATTCCC TTCTGGATTC AACATTCCAT TATCCGTTGA 3900 

GTTTTCTGCT TTTACAGGTT TTTCGTTTGG TGCCTCTGGT AAAGAATCTG CTGGTTTATT 3960 

TTCTGTTGGT TGGTTCTCAA CTGTTCCAGT AGATACTTTT CCATTTTCAG ATGGTTTATT 4020 

TTCACCATTT CCTTGAGGTG CTTCTCCTGT AAAATCTGCC ATATTCTTTT TAATGACTTC 4080 

TCCCGATGGT AAATATAATT CAATTGTTCC GTCCATATTA AACAAGACAT TTTCTAGCTT 4140 

CATCCCATAA CTTTCAGCAA ATTTTGCTAC TTTTTCTTGT ACAGGATCCA CTGTAGGAAC 4200 

TTCTTCTAAC GTTGAATTAC TAGTACTATT CCCAGTTTCA GAAAGTTTTT CTTTTTCTAC 4260 

CTTCTCACTA GTCTTTGGTT CTTCTACCTT TTCATCAAGT TTTAAGTTTT CTTGTGCTTT 4320 

ATTCCTTTTA AATTGTGGTA GAATACTTGG TTTATCAGTT TGATTTTCTT TTTCCAAGAT 4380 

AGGTACTTCC ACAATATAAG TCGATTGATT GTCCAAATAA GCATTTGCCA TGAAGGTTAC 4440 

AGGAATTTTA TTTCCGGCCG TTCTGGTTGT TCCTTGGTTT AATTTCGGAA TCGGTAATTT 4500 

GATTTCACCA ACTTTATAGT TATTTTCTAA ATAAGCATTT CCATGAAATT CATCAAACAC 4560 

TCTGACTAAA GCATCAGTTC CTTTAGGCAC TGCAAATTGA GGGTTCACTC TTAAATAAGT 4620 

ATCCCCTGCA TGGAAAGGAT AGAAAATCGT TTGACTGGCC ATTTTGTAAG CTAAAGAGGT 4680 
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TCGAACTCTA AATGTACCAT 


CATAACTTAC 


TTCTGGATAA 


TCTTTTGAAG 


CGATAGTATA 


4*740 


CTTAAATCTT TGTCCTGGTA AATAAGGTTG ATCTAATTCA AAGTTTGCAA 


TATTCCCTAC 


4800 


TCCTTCTCCA 


AATACTTTAC 


CAGATACTTT 


CTCCAATACT 


TTTCCATCTG 


GTGTTATTAA 


4860 


TTTTACTAGC 


ATATTGATAC 


CTAATTTTTT 


CTCCAATTCA GGCGGAAAAC 


TAAAAGAAAC 


4920 


GCGTTTTTGA 


CCATTGGCTA 


GAGTAAAGTT 


TTGATTATTA 


AACGTACTAT 


TTTTTAACAA 


4980 


ATTAACAACA 


TTCGTTAATT 


CTTCTCCAGT 


ATAAACTTTA 


TTCCCTTCTT 


TTTTAGCAAC 


5040 


TCCTTCTTCG 


GGTTTAAACA 


GTTCATAGTT 


ACTGTGAGAA 


TGACCAATTC 


CAACCGGTTT 


5100 


ATGTTCATCA 


ATCGGATCTG 


CATGATGGTG 


ATCTCCATGC 


GGATAAATAA 


TCGCATTTTT 


5160 


TTCTTTATTC 


ACGACAATAC 


TTTCACGTTT 


GACACCATAT 


TGTTTCATAA 


TGCCAGCAAT 


5220 


TTTTTCTTCG 


ATTTTTTTAT 


CTAAATCTTT 


CATTTCTTTG 


GCATTACTTG 


GATAATCCTG 


5230 


TTCATGAGAT 


GACAAAGAAT 


CTAATCCATT 


ATGACTAGTT 


TTAACTTCCT 


CTAAATGTTT 


5340 


TTGCGCAsCT 


TAATTTGCTC 


TTCTGTCAAG 


TCCTTCTTGA 


AGAAATAATG 


ATTGTGGTCT 


5400 


CCGTGACTCA 


TGACAAAACC 


TGATTCATCT 


TCAGCGATAA 


TACGATTAGC 


ATCAAATCCG 


5460 


TATCCATCTT 


CTTCATGTTT 


CTCATGTGAA 


GTTCCTGGAT 


TGATTGGAAG 


AGATGGAGAA 


5520 


GGTGTTGCTA 


GACTATTGTT 


TGGAAGACTC 


GGTTGCCCAA 


TTTGATTTGA 


TTTTGGAATG 


5580 


TAATGGAAAT 


GATCACCATG 


TCTTACAATA 


TAAGCTGTAG 


CCGTTTCTTC 


AACGATATCT 


5640 


TTTGGATTAA 


AAATATAACC 


ATCAGATGCT 


GAAGAGAGCT 


CCTTACTTGT 


CGTTAAAGAA 


5700 


GAAGGATTGC 


TTGAAAGACT 


GCCTAGACTA 


GACACTACTT 


CATTAGGTTT 


TGCATTTGTA 


5760 


GAAACTGTAG 


AACCAGTTCC 


ACTGATAGGC 


ACCATTCTGG 


CAATCTTTTC 


TTCTAAGGCA 


5820 


GAAAGCTTGC 


TGTAAGGAAT 


AAAGTGGTAA 


TGGTCGCCAT 


GCGCAATCGC 


AACTCCATTT 


5880 


GGTGTACGAC 


TGATAATCTT 


AGCAGGGTCA 


AAGACCAGGC 


CATCTGATTC 


ACTGTAACGT 


5940 


TGGCCGCTAG 


GTGAATCATA 


GAGTTCCTTC 


AAAAGACTCT 


GGAGATTTTC 


AGATTTATTT 


6000 


GCTGGCTTGC 


TAGTTGATCC 


TTTTGCTACA 


GATTGCGTGT 


TATTGTCACT 


ACCTCTTGAA 


6060 


GAATAGCTTA 


ACTCACTCGG 


TTGCATATTT 


TTTCCAGCCA 


GATGTGCTTT 


AGCTGCTGCT 


6120 


AATTCACTAG 


CAGATAAATC 


GCTTTTGGGA 


ATGTAGTGAT 


AGTGACCTCC 


ATGAGGAACG 


6180 


ATATAAGCAT 


TACCCGTATC 


TTCGATAATA 


TCAGCTGGAT 


TAAAGACATA 


ACCATCATTT 


6240 


GTCGTATATC 


GTCCCTGAGA 


CCTTGCTACA 


GCAACATTAG 


AGTTAACCTT 


CTCATTATCT 


6300 


TTGACATGTT 




ACGATTGATT 


TCATCTTTAG 


TTCGAACATT 


ATCAGCATGA 


6360 


GCTGCATCTT 


TCAGGTAGAC 


ATAATATTTT 


CCATCGACCT 


TGATGATATA 


ACCACCCTTG 


6420 


ACTTCATTGA 


CAATATCAGC 


GTCTTTAAGT 


TGATAGTTTG 


GATCCTTCAT 


CAAGAGTTCT 


6480 
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TCACTAAAGA GGGCATCATA AGGAACTTTC CCATTATAGT AATCATAGTG GTCACCCTCT 6540 

GACGTTACAT AGCCCTGATC TGTAATTTTG ATTACAATTT GCTCAGCCTG AATTCCTTCT 6600 

TTCTGGCTAA CCTGGTCTGG TGTCAAGTTT TCACTTTTCT GACTTGACTG GCTCCCATCC 6660 

ACATAAGAGA CACGATTATT GTCCTTATTT TCCTGCGAAC GATGCTGGTT TAGTGCATAG 6720 

GCACATAGAC TCAAGGATAC GATAACAGCT GATCCAGCTG CTATATATTT TTTACTAAAT 6780 

TTCATAAATC CCTCATTTCA ATAAATGATG AAGTTTTTTC TCAACTTCTT TT ACTTTATT ~ 6840 

AAATAGTTTT CTAAACCCGG GGGTACC 

(2) INFORMATION FOR SEQ ID NO: 193: 

(iJ SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 999 base pairs 
<B> TYPE: nucleic acid 
<C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 
CGTTCTAAAA ATGCAGTACG TTTGATTGAG AAATCAGTTA AAGGTATGCT TCCACACAAT 

ACACTTGGAC GCGCTCAAGG TATGAAGTTG AAAGTATTTG TTGGAGCTGA GCACACTCAC 120 

GCTGCACAAC AACCAGAAGT TCTTGACATT TCAGGACTTA TCTAAGGAAA GGAACAATAA 180 

AGTATGTCAC AAGCACAATA TGCAGGTACT GGACGTCGTA AAAACGCTGT TGCACGCGTT 240 

CGCCTTCTTC CAGGAACTGG TAAAATCACT GTTAACAAAA AAGATGTTGA AGAGTACATC 300 

CCACACGCTG ACCTTCGTCT TGTCATCAAC CAACCATTCG CAGTTACTTC AACTGTACGT 360 

TCATACGACG TTTTCGTTAA CGTTATAGGT GGTGGATACG CTGGTCAATC AGGAGCTATC 420 
CGTCACGGTA TCCCTCGTGC CCTTCTTCAA GTAGACCCAG ACTTCCGCGA TTCATTGAAA 



60 



480 



CGCGCAGGAC TTCTTACACG TGACTCACGT AAACTTGAAC GTAAGAAACC AGGTCTTAAG 540 

AAAGCTCGTA AAGCATCACA ATTTAGTAAA CGTTAATTCG AAAGAATTAC TATACTTATA 600 

CAGAGCACCT TTCGGGGTGT TCTTTTTTTA TACTTTCTTA CTAAATTGGT GCAATTGACA 660 

CAGTTCTTGC GACTTTAGTC GCTTACAAAT GTGCCTGCAA CCTGACATGG TCACTTGCCT 720 

CAAAACGTTA ATCAATACGA TTATATCAAC GTTTCAAAGC ACTCAAGGGT TTACCCTATG 780 

GGTGCTTTTT TCTATACTTT CTAAAAAAGT TTACCCTAAA ATTTGCCCTA AAATTACCCT 840 
ACTTATTTTT AAGATGTTGG TAGGCAACTT GTCCAGCAGA TAATGGAACT ATGTTTGAAG 



900 



TATTAACATA AGTCTTAGTT GTAACGGTAT CGCTATGAGT TAATGCTTCA GAAATGGCTT 960 
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CTAAGCTCAT TCCTGCTTTT TTAGCAAGTG TCGCTCCTG 999 
(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D> TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

AATATTATCA CTGTTCTTGA AGGCAGAACA CAAGCTGTCA TCCGAAATCA CTTTCTTCGC 60 

TACGATAGAG CCGTTCGTTG TCAACTGAAA ATCATTACGA TGGATATGTT TAGTCCTTAC 12C 

TATGACTTGG CTAAACAGCT TTTTCCGTGT GCTAAAATCG TTCT AG AT CG TTTCCATATT 180 

ATCCAACATC TCAGCCGTGC CATGAGTCGT TTTCGTGTTC AAATTATGAA TCAGTTTGAA 240 

CGAAAATCTC ATGAATACAA GGCTATCAAG CCTTACTGGA AACTCATCCA ACAGGATAGT 300 

CGTAAACTCA GCGATAAACG TTTTTATCGC CCTACTTTTC GCATGCACTT AACAAATAAA 360 

GAAATTCTTG ACAACATTTT AAGCTATTCA GAAGACTTGA AACACCACTA TCAGATCTAT 420 

CAACTCTTAC TTTTTCACTT TCAGAACAAA GACCCTGAGA AATTTTTCGG ACTCATTGAG 480 

GACAATCTGA AGCAGGTTCA TCCTCTTTTT CAGACTCTCT TTAAAACCTT TCTAAAGAAC 540 

AAAGAGAAAA TCGTCAACGC CCTTCAACTA CCCTATTCAA ACGCCAAATT GGAAGCGACC 600 

AATAATCTCA TCAAACTTAT CAAACGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 660 

AAAAAACGGA TTTTTATCGC TCTGAACATC AAAAAAGAAA GGACGAAATT TGTCCTTTCT 720 

CAAGCTTACC TTTTCTTCAA CCCACTACAG TTGACAAAGA GCCTATTTTC GCTGATTCTC 780 

CACTACATTT GACTGGATTC TAATTTTTTA GAGAAATACA AAAGAGCTAG CTTTAGCTAG 840 

CTCTTTTCCT ATGCGGAGAG AGGGACTTGA ACCCTCACGA CCTAAAGCGG TCACAGGATC 900 

CTTAGTCCTG CGCGTCTGCC AATTCCGCCA TCCCCGCGTC GATTACTTTA CTAGTATATC 960 

AACTTTTGGG ATGCTTGTCA ACACTTTTTT TCAAATTTTT TCATTTTCAC CAACCAGGTT 1020 

ACTCAAAAAG TTCATTTAGA TTTTCATCTA CTAACTTAGC TCCGAGTGTA TTTTTGAAAT 1080 

GACCTAGGGC AAATTGATGA TTTTCAGGCC AGATGGAAGC AACAGCTGGT TTAACAATCT 1140 

CGATGTCATA TCCTAGATTA TAGGCATCTA TAGCTGTATG TAGGACACAG ATATCCGTCA 1200 

AGACACCTGT TAAGATAACG GTAGACACTC TACGCTCTCT CAAACGAATA TCTAGGTCAG 1260 

TCCCTGAAAA AGCTGAGTAA TGGCCTTTAT CCATCCAAAA GACACGACTG TCTGAACCAT 1320 

GCTCTTGATA AAACATCCCC AAATCTCCAT ATAAATTCCG TCCACTCGTC CCAATCAGAT 1380 
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TATGAGGAGG AAATAACTTA CTTTCCGGAT GGAAACAATC GTTTTCTTCA TGAGCATCAA 14 40 

TAGTAAAGAA GATATAATCT CCTCGTTCAA AAGCTAATCG AGTTACCTTG CTGATGGCAT 1500 

CCCAAATCGC CTGAGCTGGA GCACCTGCTG TTAGTTTCCC ACTATCAGCA ACAAAATCTT 1560 

CTGTATAATC AATCGAAATT AAAGCCTTTG TCATTAGTAA TCTCTTTTCT TCACTTCTTC 1620 

AAAAATATCT GAAATCAAGA CCTTAAGATA GGTTCCCTTC ATTCCAAGTG AGCGACTTTC 1680 

AATAATCCCC GCAGACTCAA GTTTACGAAG AGCATTGACA ATCACAGAGC GAGTGATTCC 1740 

GATACGATCT GCAATCACTG ACGCAGTCAA CTTCCCTTCA TTTCCATTTA ATTCCCCTAA 1800 

AATTGCTGAA ACAGCACGGA GTTCGGAGTA AGAAAGGGTA TTGACCGCCA TGGTGACAGC 1860 

AGTACGACGA CGAATATTTT TCTCATCTTC TTCACGTTGG AAGTTAAGAA GCTGAATCCC 1920 

AACAACGCTA CTGGCAATCT CAACAAGAAC CAACTCCTCA TCTTCGAATT TTTTATCATT 1980 

ACGCCAAATA ATCAAAGAAC CAAGGCGAAT CCCCGATACA TGAATCGGTG CAATAGTCGT 204 0 

CAAGCCATCT GGAAAATCAT CTCTACTCTC AATAGGGAAA ATACTCATAT CATGCTCAAC 2100 

AGGCAAGTTT GCTTCTGTTT CGTAAATCAT ATTAGCCCCT TGAACGTAGT CATCTGGGAA 2160 

AATCTTAGTT TGGAAGAATT GCTtACGCGA TCTGTATTTG TTTTATAACG CATAAAATAG 2220 

CCAAGCAGAC CTCCCTTACT ATTGATAATG CAGGCATTGC AATGAATAAT ATCCGCTAAC 2280 

TGACGCGTAA TAGCGTTGTA AGGGAGCTCA TCTCG 2 315 
(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xii SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

CGATTTCTTC CATTTCTTCA AATAAGAATA CTTCATCTGA CATATCTCTT ACCTTCTTCA 60 

TCAAAAATTA TTTTGTAATC GATTACATTG CAGATCGTAA CATAAAGAAA AACAGATGTC 120 

AAATATTAAA CGTAAAAACA TGGTCACTAA AGAACTATAA GAGAAAAGGT AAACCTAGCG 180 

ACGCGATGAA CGCTGGGTCG TTTGGTTTCG ATTGCTCTCT TCCTCTTGTT TTTTCTGTTC 24 0 

TTCTTCTTGT TTTTTCTCAG CTTCCTTGGC CTCTTGTTTG GCTTTTTCCT CAGCTTCCAT 300 

AATTAATTTA TCCGCCACAG TGTAGCTGTA GATTCCAGCT TCCATGTCGA CCACACTCGG 360 

TTCTGACAAT TGAGGCTTAA TCTTACTGTA ATATGGCAGT TTCTTACTCA TTTCAGATAG 420 
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AGGAACCAAG ACTTCCTCCG AATCATTCAT GGTCAATCGA ATTAAATCCG ATGTCACCTT 480 
GCTTGGGGCT AATTCCACCT TTTGGATAGC CGCCTTGAGT TCTGGGCTAA TTTGAGCAAG 540 
TTCTGAGACA AAAACTTTGA TTTGTTCACT ATCATTAAAG AGAACTGATA AATAAGTTTC 600 
TGGTAAACTG TTCAGACTCA CAGAACTAGT CTCAAGCTGA CCACTGGAAA GAATAGGATA 660 

ATGATTTTCA CCAGAAATAT ACTAGGCCAC AATATCATAT TCCTTGACCT TAATAGTGAA 720 

CTTAGTTGGA AATTGATAGA CAAGTTGAGC TGATTCAACC CAATAGTTAG ACTTAATCTG 7 80 
CTTTTCATAT TTTGCCTTGT CTAGCAGAAG GTTAATCGTA TAATCCGAAT CCTGAATGCC 840 

TGAAGCCTGT CGAATATCAT CAGCTGTAGT TTGCACCGTT CCCTCAACAC GAATATCTTT 900 

CATGGTCGCA TAACGACTGA GCAAGTAGGC AGAGACAAAC AATAGAAGCA GACTTGGAAA 960 

TAAAATCGTC AAGGCTCGCA AGATATGGAT ACCAGGAATC TTTGCTTTGG CTGGTTTTTC 1020 

CTTTGTAGCC TTTTTAGCAA GCTTTTTATC CTGTTCCTCC TTCTCTTTAG ACTCTGGTTC 1080 

TTCTTTCTCT TCTTTCTCTT TGTCAGCCTC TGAGGATGCT ACTTTTTCTT CAGACTCTTC 1140 

CTTAGCTGAT TCTGAATCTT CCTGGTCTGT TTCACTCTCC TGGTCCTGTT TATCCTCTGA 12 00 

CTTCTCAGAT TCTTCTCCCA TTCGAGCTTG TCTTTCCTTT TCCTTCTCCT CAGCTAGAGC 1260 

CGCCTCTTCT TCAGCCTTCT TTTTTAGATA TTCTTGGTTT CGTTTCTGCC ATTCTGATAA 1320 

CTCTTTCAAT TCTTCGAGGG TTTCTTTGTC CTCATTTTTC TTATCTTTTG ACATTTACTT 1380 

TCCTTATGAT AAATCTTTTT TCAACAATTG ATAAAAATCT CCTACACATT TCAATTCCTT 1440 

AGAAGCTTTC ATCTTAGCTT GGTAATCTTC CTTGTGACTT AGTAAGTGAG AAAGCTTCTC 1500 

TTCCAAACTA TCCAAGGTCA AATCGCTTTC TTGAAGGTCT TCTGCATAGC CTTTCTTAAC 1560 

AAAGTAAGCT GCATTTTCAA TCTGGTCACC ACGACTAGCT TCACGACCAA GCGGCACAAT 1620 

GACATGCAAT TTTGCTATCG CCAAGAGCTC AAAAATCGTA TTGGCACCAC CTCGTGTCAC 1680 

AACAATATCA GCCAATTCCA TCAAGGGTTG ATAGAGATCG GTCACATAGT CAACACGAAA 1740 

AAGATTTTGC CTCAACTCAT TCAGACTAGA ATCTCCAGTT AGATTGATAA TATTGTAGCG 1800 

CTCTGTTAGT TCTTTCTTAT GGTCTGTCAC CAATTGGTTA AAGACACGAG CGCCTGCAGA 1860 

ACCGCCAACA AACAATACAG TTGGCAATTT GGGATTAAAG TGGGTTTGAA TATCCACCAA 1920 

TTCATCTGGT TCTGGAGTGT TTTTGTCCGA AACCTTGGTC ACCGCTCCCA CATGCTCAAC 1980 

CTTAGCCAAA CTCGAAGCTT GTTCAAAGGT TGAATACATC TTAGTCGCAA ATTTATAGGC 2040 

GATTTTATTG GCCAAGCCCA TAGACAGGTC AGATTCGTGA ATAAAGACAG GCACTCCTGA 2100 

CACACGCGCA GCGATAACAG GCGGTACTGA GACAAAGCCC CCCTTTGAAA AAAGGGTCTG 2160 

TGGACGCAGT CGCAACATGA TAAAGAGCGA TTGGACAATT CCCCAACCAA CTTTGAAGAC 2220 



WO 98/18931 PCT/US97/19588 



1165 



GTCCACCATA 


TTTTGCCAAG 


AGAAATAGCG 


ACGCAATTTT 


CCACTCGCAA 


TAGAATGGAA 


2280 


GGTGACATCC 


AAACCTGACT 


TAAGGATTTC 


TTGCTGTTCG 


ATACCACACT 


TGTCCCCGAT 


2340 


ATACTGGACT 


TCCCAACCAT 


CTTCGATGAA 


CTTGGGCATT 


AACAAAAGAT 


TGAGGGTAAC 


2400 


GTCTCCAACC 


GTCCCCCCAC 


CTGTAAAGAC 


AATTTTTTTC 


ATATTATTCT 


TTTAACTC C*Ci 

L L k r\f\\* * li— w 


2 460 


CTACTGTGTC 


GATAAAGAGG 


TCGCCACGTA 


CTTCAAAGTT 


AGCATACATA 

******* * «V^«\ 1 ^\ 






CATTGGCAGG 


ACTAAGAAGA 


ACCACATCTC 


CTTGAGTCGC 


AAGCTCATAG 






TCGCATCTGC 


AATATCTGTC 


GCCTCCACAT 


AAGCGACACC 


AGCCTTGTCT 




? AAA 


TGACACGTTC 


TGCAGATTGA 


CCCAGGATGA 


CCATCTTCTT 


GAGTCCAGTA 


ATGTCTCG C A 


2700 


CCAATTCGTC 


AAACTCATTG 


CCACGGTCCA 


AACCACCTGC 


AATCAAGACG 


Af*PTT*GPT<VP 


2760 


TGTCAAATCC 


TGACAAGGCT 


TTTTGAGTAG 


CCAACATATT 


AGTTGATTTA 


CTGTC GT**P A 1* 


2820 


AGAATTTAAC 


ACCCTTGATG 


TCATCCACAA 


ACTGGAGACG 


GTGTTTGACA 


CCACCGAAGG 


2880 


CTGAAAGAGT 


TTCCTTGATG 


GTTTGATTGT 


CCACATCACG 


AAGCTTGGCT 


AC AGCAAT AG 


2940 


TCGCAAGGGC 


ATTTTCCACA 


TTGTGGCTAC 


CTGGAACACC 


GATTTCATTC 


GCTGCCATGA 


3000 


CTACTTCACC 


ACGGAAGTAG 


AGTTGACCAT 


CTTCCAGATA 


AGCTCCATCA 


ACCTTTTCAA 


3060 


GTGTTGAAAA 


TGGTACAACA 


GTGGCTTCTG 


TCTTGGAAGT 


CAAGTCTTTT 


GCCAAGTCTT 


3 120 


GATTAAAGTT 


CAAGACAAGG 


AAATCAGCTG 


CTGTCATCTT 


GTTCTGGATA 


* * 1 A 


3180 


CTGCTACATA 


TTCCGAAAAT 


GACCCATGGT 


AGTCGATATG 


AGTTGGCATG 


AGGTTGGTAA 


3240 


TAACCGCAAT 


CTCTGGATGG 


AATTCTTGAA 


CACCCATGAG 


TTGGAAAGAA 


GAAAGTTCCA 

WWW 4 A V» WA 


3300 


TAACAAGCGT 


GTCCTTATCT 


GATGCTATTT 


GAGCAACCTG 


ACTAGCTGGA 


TAGCCGATAT 


3360 


TCCCTGATAA 


AAGACCATGT 


TGGCCAGCAG 


CAGTCAAAAC 


TTCCCCAATC 


ATAGTCGTTG 


3420 


TGGTTGTCTT 


ACCGTTCGAT 


CCTGTGATAC 


CAATAATCGG 


TGCTTCTGAA 


ATCAAATAAG 


3480 


CCAATTCCAC 


CTCAGTCAAG 


ACTGGAATTC 


CCTTGGCCAA 


AGCCTTTTCA 


ATCATGGCAT 


3540 


TGTTGTAGGG 


GATACCTGGA 


TTTTTCACCA 


TAAGGGCAAA 


CTCTTCATCC 


AAGAGTTCCA 


3600 


AAGGATGGCC 


ACCTGTAATG 


ACCTTGATCC 


CTTCTTCCAG 


CAAACTTTGG 


GCAGCTGGAT 


3660 


TGTCCTCGAA 


AGGTTTCCCA 


TCATTTACTG 


TCACAATGGC 


ACCTAGCTTG 


TCCAACAAAC 


3720 


GAGCTGCAGA 


TTCACCAGAC 


TTGGCCAAAC 


CTAAAACAAG 


GACTTTCTTA 


TTTTTAAATT 


3780 


GATCTATTAC 


TTTCATGTCT 


CGAACTCCAT 


TTCTACTCCT 


ACTATTTTAC 


CATTTTTATG 


3840 


GAAATAAAAA 


AGCCACAAAG 


TGTGTTTGTG 


ACTCTTTCTT 


CTAACTGAAT 


CTTACCATAT 


3900 


CATCTATGTG 


ATAAATCGGT 


AACTCGAATG 


ACCTGATCCA 


CTTGCTCCCA 


AATCAGAGGA 


3960 
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TTATCGGTCG CAATAATAAT GGTCCGATTC GGATTTTTTA AAGATTCTAG GATGGAAAGT 4020 

AATTCCTCAG ACTTTTTGGG GTCTAAGGAA GCGGTTGGTT CATCTGCGAG GATCAAAGGT 4080 

GGATCCTTTA AAATTATCTT CGCTAGTGCA ACACGTTGTG CTTCTCCTCC TGATAACTCA 4140 

AATATAGGTT GCTTCAAATC CAAATAAGAG AGGTTTACAC GGTTTAGAGC TTGTTTCATC 4200 

AAAGAGATTT TCTCTTTTTC CTTCAACTTT TTACCAACTA AACCCAGATT GAGATTCTCT 42 60 

TTGACGGTTT GGCTTTCAAT TAAGCCAAAA TCTTGAAATA AGTATCCTAA GTAATCTCTA 4 320 

AAGAAAACAG AAGGCTTGAT GTCCTTAAGA GAAGTGCCAT CATAGATGAT TTGCCCTTTG 4380 

TCATATGGCT CCAATCGTCC AATCATATTC AAGAGTGTTG TCTTACCACA GCCACTTGTA 44 40 

CCGATTAAGG CATAAATTTT CCCACCTTCA AAATGAAGAT TCATATCTGA AAATAGCTGA 4 500 

CGGCTTCCAA ATTTTTTAGA TATATTCTTT AGTTCAATCA TCCTATTTTC CTTTCATAAT 4560 

TGTCATAGAA ACACGAGATT CTTTCTGCGC TTGACGCTAA AGCGTCAAAA CTGCACTAGC 4620 

TAGAAAGACC AATAAAGTGA GCAAGCCAAT CACCAAGTCT CGACTGCTTA AAATAAAGAG 4680 

ACTAGCACCA AATACAAAAC TAGCAAATTG GCTAACCATA TACTGAGCAT GTGTTTCAAA 4740 

AAATCGTAAA CCTGAAATTC GTTTAATCAA GATATCTCGG CGGAATTGCT CGAAATATAG 4800 

AAGATTGACA GAATAAAAGA GTAACAAGGA ACTGGCTATT CCAACAATAG CTCCTAAGAT 4860 

TAAAGTTGCT GTTTCAGTTT GAACTTCATT ATAACGAGTT AGATAAACAC TTCTTCCTTC 4 920 

TTTAAGATAG GATACTTGCT CATAAATTCC AGCTTTCTTC AAGAGTTCTA GCCCACTCTC 4980 

ATATCCTTTG ATAAAGAGTT G TTTTCCAGC ATTGATAGAC CAACTAGATA AGGATATAAA 5040 

ACTATCACCT GTAGAAGTCG GCGTGAATAC CACTAAAATC GGATCAGTCA AATACTGACT 5100 

AGATACGGGA TTCTCACCGT TATTATAAAC AAACCGCTTT TCTCCCATTG AAAGATAACT 5160 

AACGTGCGCT TTCATCTCAT AATCCAAAGG AGCACTTGCC TCCTCACCAG ATTTTCCATA 5220 

ATAACTCAAT CTTTCTTCAA AAACTTTCTT AAGTTCTGCT TCTCGAGAGC GCAAATGTTC 5280 

TGGGAGCAAG AGGATAAACT CACCTTTTTG GACATGGGCT AACTTCTGTT TGGTCTCAGC 5340 

ATCTACCACG ACCTTTTCCT TGTCCAAATA ACTGGGACTA ACATAGAGCG TATTAGCATC 5400 

TGAACTATAG GTATCCAGTG TCTCTCCCTG TTCATTTTTT CCTTGTGGAT TGGCAAAATG 5460 

GAGCAGATTA TCCTTTACAT AAAGAGCTTG TTCTTCTTCG ATTGCTTCCT TGGCAAAGGC 5520 

ATACCACTTG CTCTGATTTT CTGTATCTTT TCCTCTATCA CCTAAGCCAA AGGAAATCTG 5580 

GTAATAGTCT GCTCTGTCCT GCCATGCTTG TTTTGAAATT TCAAGTTCTT TCAATCGTTG 5640 

GTAAGACGTC AAACCTGTCT TAACAGCGTA GCCTACTGTA AAAACAGCTA CTAACTGACA 5700 

CAATAGGGTT AAAGCCATCA AGCGTTTAAG GGGTAATCTT CCCTTAATAA CGGGAACTAA 5760 
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TCCTTTGTAA CTCAAACTCA TTAGGTAAAG G AGO ATT ACT AAAATTCAAA TCGCCAATAA 5820 

AAACAACACA TAGAAACTAA TCCCAAAACC ATAGGTGGCT AACAAGATAG GATAAAACAA 5880 

ACCTTGACTA AAAAGAACGA CTCCCCCACC TAGGAAGGAA AGGAGGGCTG ATAGAAGGAG 5940 

CCATTTGATA TCAGTAGATA AAGAATGCCC CATGATGGAT AAGAGAGTCT GACCAGAAAA 6000 

GAGTTTTATA CCTGCTGCTC TCATTTCCTT AATCCGAGTG ATAATCACTA AAGCAAAGAA 6060 

AGATAAGCCA "AATATTGCTA "AACTAATTAA AATAAGCGCA TTTAGTAATA TTCGAAAAGC " ~6i~20 

AAGAAAATAG GGCGGTATCT TTCGGTCAGC ACTTGCTTTA TAACCCAAAT CTCCTAATTT 6180 

ATCGGCAAGC TTTTCTTTCG TCAAGGAGCC TGACAAAAGG AGATAACTAT TTAGCGGAnT 6240 

ACACGTTCAC GACTTTCTTG GCTAGCTTCT TGGAATTCTT TTGGTAAAGT TCCCTGACCA 6300 

TAAGTTGCAT AAGTAAAGTG AGTCGTCCCA TCCTTACTCG GCTCTACAAT TCTTCTAGCT 6360 

ATTAAACTCT GTTCTGAGTT TGCAAAATTC TCCAATTCCT GTTCAAATAC CTCACGCGTC 6420 

GGTTCCTGAG TATCTTTTTT GACACGAAGT AAAGAAACGG AATCATAGCT TGCATATAAA 6480 

TATTGTGGCG CACGTAAGAC AATAATCCAA GCAAGGAAGA AGCTGAGAAA AAAAGTTGAT 6540 1 

AATAATATGA ATAGTTTCTT CATAGTAGAC TCCTTGTAAA CAAAATTCCC CCTGTAATTT 6600 * 

CTTACAAGGG GAACGATTTA AATCAATGAA CGATTAGTCA TAATCACAGT AAAATGCTAC 6660 

TTGTTCTCCC CATTTAGTCC AAATCCATGC AGG 6693 
(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 
(C> STRAND EDNESS : double 

(D) TOPOLOGY: linear 1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

CCGGTCTATG TACCCACTAC TTTGGGACAA TATGGGGATC AGCTACCCAA AACTAATCGA 60 

GCGTTTGGTT GACCTTGCCA AGGAAAGTTT TGACAAGCGC GACGATTTGA TATAAAATGA 120 

AAGAGAGGGT AGAAGCCAGA ACCATCACTG CACGGTGACT AGAGTTCTCG GACTTCAGCC 180 

CTTTTTAAAG GAGTAGAAAT GAAATTAACA ATCCATGAAA TTGCCCAACT TGTTGGAGCC 240 

AAAAATGATA TCAGTATCTT TGAGGACACC CAGTTAGAAA AAGCTGAGTT TGATAGTCGT 300 

TTGATTGGAA CTGGAGATTT ATTTGTGCCA CTTAAAGGTG CGCGTGATGG CCATGACTTT 360 

ATTGAAACAG CCTTTGAAAA TGGTGCAGCA GTAACCTTGT CTGAGAAAGA GGTCTCAAAT 420 
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CATCCTTACA 


TTCTAGTAGA 


TGATGTTTTG 


ACAGCCTTTC 


AATCCTTAGC 


ATCCTACTAT 


480 


CTTGAAAAAA 


CGACTGTTGA 


TGTCTTTGCT 


GTTACACGTT 


CAAATGGCAA 


GACAACGACT 


540 


AAGGATATGT 


TGGCGCATTT 


ACTGTCAACA 


AGATACAAGA 


CCTACAAAAC 


ACAAGGCAAT 


600 


TACAATAATG 


AGATTGGCCT 


TCCTTACACA 


GTTCrrCATA 


TGCCTGAAGG 


AACAGAAAAG 


660 


TTGGTTTTGG 


AGATGGGACA 


GGATCACTTG 


GGCGATATTC 


ATCTCTTGTC 


TGAATTGGCT 


720 


CGTCCAAAAA 


CAGCCATCGT 


GACCTTGGTT 


GGAGAAGCCC 


ATTTGGCCTT 


TTTCAAAGAC 


780 


CGTTCAGAGA 


TTGCTAAGGG 


AAAAATGCAA 


ATTGCAGACG 


GAATGGCTTC 


AGCTTCCTTG 


840 


CTTTTAGCGC 


CGCCTGACCC 


TATCGTAGAG 


GACTATTTGC 


CAACTGATAA 


AAAGGTGGTT 


900 


CGTTTTGGGC 


AAGGGGCAGA 


GCTGGAAATT 


ACTGACTTGG 


TTGAGCGCAA 


AGATAGTCTG 


960 


ACCTTCAAGG 


CCAATTTCTT 


AGAGCAAGCC 


CTTGATTTGC 


CAGTAACTGG 


CAAGTACAAT 


1020 


GCGACAAATG 


CTATGATTGC 


ATCCTATGTT 


GCCTTGCAAG 


AAGGAGTTTC 


AGAGGAGCAA 


1080 


ATTCGTTTGG 


CCTTCCAAGA 


TCTTGAATTG 


ACGCGTAACC 


GTACCGAGTG 


GAAGAAAGCA 


1140 


GCCAATGGAG 


CAGATATCCT 


GTCAGATGTT 


TACAATGCCA 


ATCCAACTCC 


TATGAAACTG 


1200 


ATTTTAGAGA 


CTTTCTCTGC 


CATTCCAGCC 


AATGAACGTG 


GCAAGAAAAT 


TGCAGTGTTG 


1260 


CCGGATATGA 


AGGAGCTTGG 


TGACCAGTCT 


GTTCAACTTC 


ATAATCAGAT 


GATTTTGAGC 


1320 


CTTTCTCCAG 


ATGTGCTTGA 


TACCGTGATT 


TTCTATGGAG 


AAAATATTGC 


TGAATTAGCC 


1380 


CAATTGGCCA 


GTCAAATGTT 


CCCAATCGGC 


CACCTTTACT 


ACTTCAAGAA 


AACAGAAGAC 


1440 


CAGGATCAAT 


TTGAAGACCT 


AGTCAAGCAG 


GTCAAGGAAA 


GCCTTGGAGC 


CCATGACCAA 


1500 


ATCCTGCTCA 


AAGGCTCTAA 


CTCTATGAAT 


CTAGCCAAGT 


TGGTAGAAAG 


TTTAGAAAAT 


1560 


GAAGACAAGT 


GATTTTGTCA 


AGTATTTGCA 


AAGAATGATT 


GCCATTACAG 


ATACTGGCTT 


1620 


AACCTTTACA 


AAAGATCCGT 


TTGACCGTGA 


GCGCTACGAA 


CACTTGCGAA 


GTCTCTTATC 


1680 


TGAAATGTTG 


AATCAAGCAT 


CAGACCTTGA 


TTCCGAAGAA 


GTGGCAGAAG 


TCTTGAACCC 


1740 


AACTTCTGCT 


TATGCGACTC 


CGTTAATGGA 


CGTCCGTGCT 


TGGATTGTTG 


AGGATGAGAA 


1800 


GATTTGTCTG 


GTTAGGGGAC 


AAGGAGAGGA 


TAGTTGGGCT 


TTGCCGG 




1847 


(2) INFORMATION FOR SEQ ID NO: 197: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 
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CAAGCGAAAA CATTTTTTAT TCCAAATAAA CAGAGCATTT TAGGAGAACA AGAGATTTTG 
AATGCCAAGT CCATCTTGGC CTTGCTAGAC GGTTTGGAGT CACATAGCTA TGATGTAGTC 
TATCTCCGTC AGCCTCTTAA TCGTCTCGAA TATATCGAGT GTGCGATAGT GGGGCAATCA 
CAATTTCTCT TTAAGGTCAG TTATGCTGAT GGTCAAAAGC CTTACCCTGT CGATCTTCCT 
GACCTACTAA CAAAGACAGA CTGGCAGATT ATCAAGTCAT TTTTAGATGC TTTGCTTGCT 
TATAGAGGGA- CTGATATTGA AGGGCTACAT GGTTTTGATT TTGAACCTTA TTTCCAAGCA 
AGTATTCAAG CCTATCTAGC AGACCCTGTA GCTCGTTTTA CGATTTGCCA AGGAATTTTT 
AATCCTATTT TCTTTAGTCG TGAGAACTTG AAAAGCTTTT TAGAGGCAGA TGGCTTGGCT 
CAGTTTGAAG CGCGTGTGCG TGCGGTTCAA GACACAGATG CCTACTTTGC CAGACTTTCC 
TTCTATCAGG ATGGAGAAGG AAAAGTGCAT GGCGTTTACC ATCTAGCTCA AGGAGTCAAG 
ACAGTTTTAC CGAGAGAACC GTTTGTTCCT GCAGCCTATA TTGAGCAATT CCTGCATAAC 660 
GAAGTCCAGT GGGAGATTGA CTTGGTTCAA ATCACAGGAG ATGGCTCTAA ACCAGAAGAC 
TATGAAGCCA TTGCTCGCTT GGACTATGCA AAATTC7TAC AGGTATTACC CCCATCTTTT 
TACCACCAAC TAGACGCCAA TCAAATAGAA GTGCAACCCA TATTAGACAA AGATTTTAAA 
ACATTAGCAC AAGAAAAGTA AAGCAGAAGC AGGTCAATCG ACTTGCTTTT TTGACATAGA 
AAAAATCCTG CCAACaTGAC AGGATTGCTA CTCAATGAAA ATCAAAGAGC AAACTAGGAA 960 
CCTACCCGCA GCTGTACTTG AGTACGGTAA GGCGAAGCTG ACGTGGTTTG AATTTGATTT 1020 
TTGAAGAGTA TGAACTTTAA AGAAAAGCCA AGATACGAAG AT u)62 
(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
ID) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

TATCTACAAC CTCAAAAACA TGTTTTGawG gCTCCTCAGT CTATCTACAA CCTCAAAAAC 60 

ATGTTTTgAa JeGCtcGTCAG tTCTATCTAC AACCTCAAAA ACATGTTTTG AcaGCcTcCT 120 

CAGTTCTATC TACAACCTCA AAAACATGTT TTGAGCTGAC TTCGTTAGTT TCATCTACAA 180 

CCTCAAAAAC ATGTTTTGAG CTGACTTCGT TAGTTTCATC TACAACCTCA AAAACATGTT 240 

TTGangnCnT CGTCAGTTCT ATCTGCAACC TCAAAGCACT GCTTTgagcG CTTCGTCAGT 300 



60 
120 
180 
240 
300 
"360 
420 
480 
540 
600 



720 
780 
840 
900 
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TCTATCTACA ACCTCAAAAC ACTGTGTTGC GCACCCTTTA ATCAGCCGCC TAGTCCGC7C 3 60 

TATGGTATTC ATTAAGTCAA CATCTCTTGT TTAAGAGCAC CAAATCAGCA AATCTTCTCG 420 

ATTCCCTGAT TTTTTCTATT TACGTTTTCG TGTTGAGCTA CGTTCTGTCA AACCATGAGG 4 80 

TAAGAGAACT TCACGTTCTT CCAACTCTTC CTTATGCATA ATCTTCGTCA ACATACGCAT 540 

ACTAATGGCA CCAAGGTCAT AAAGAGGTTG GGCAATCGTT CTCAAGTTTG GACGGGTAAA 600 

CCCTGAGATT TGTGAATCAT CACTAGTAAT AATTTCAAAA TCTTCTGGCA CAGAAACACC 660 

CTTATCAGCC AAACCGTTCA AGACTCCTGC TGCCAACTCA TCACCTGTCA CAACTGCTGC 720 

AGTTGCATTT GATGAAATCA AACGCTCTGC TAAGGCGTAA CCATCATCAT AGCTATATTT 780 

AGATTCAAAT ACCAAACCCT CACTATAAGT GATTCCTGCT TTTTTCAAGG TTTCCTTCTA 840 

GCCAACTAAA CGAACCTTAC CATTGATGTC ATCCACTAGC GGACCCCTAA CGAAAGCAAT 900 

ACGCTCATTT TCTTTACCAA GGTAACTCAC TGCATCAATT GTTGCTTGCT TATAGTCAAT 960 

ATTGACACTT GGCAACTGGT GCTCAACATC GACAGTTCCT GCGAGAACAA TCGCAGTACG 1020 

TGAACGCGAA AATTCTGAGC GAATTTTATC TGTCAAGTGA TACCCCATAT AGATAATGCC 1080 

ATCTACCTGC TTTGAAAAGA GGGTATTGAC AACAGAAACT TCTTTCTCGT TATCTTCATC 1140 

GCTATTAGCT AGGACAATAT TGTACTTGTA CATTTCTGCA ATATCATCAA TCCCCTTACC 1200 

CAAACTCGAA AAATAACCAT TGGTAATATT TGGAATCACG ACACCGACAG TCGTTGTCTT 1260 

TTTACTTGCA AGACCACGCG CAACTGCATT TGGACGATAA TCCAAACGAT CAATTACCTC 1320 

TAGCACTTTT TTACGGGTAT TCT C TTTTAC ATTTTTATTG CCATTGACCA CACGGCTGAC 1380 

CGTCGCCATG GAAACACCTG CTTCACGAGC GACATCATAA ATGGTTACTG TATCATCTGC 1440 

ATTCATTCCT TTTCCTGTCC TTTCTATCTC ACACATTCTT TTACAAGTAG AGGTACTGAT 1500 

TGAAGCTCTA TATCTACTTA CAAAAGTGAA GATGTGAAAA TTTCGTTTTC ATATTTCTAC 1560 

TTATTCCATT CTATCACTAA TTGTAAACAC TTTCAAGTGT TTTTTGAAGA TTGATTGAAA 1620 

AAATTTCATA GAAAACCTAG CTTTAGCTCC TTGCTACCAC CTTAGACTAA ACAAAAAGGA 1680 
GGAAACTAAG CCCTCCTAAA GTTATAGTAA AATGAAATAA GAACAGGATA AATCGATCAG " 1740 

GACAGTCAAA TCGATTTCTA ACAATGTTTT AGAAGTAGAG GTGTACTATT CTAGTTTCAA 1800 

TCTACTATAG GTATTGTTCC ATTCACTACC GTCAATTTTA GCACATAGTC TTCATGAAAA 1860 

TATTATATCA TCATAACCAA CCAGATTCTT TCGCGATATT AGCTGCCTCT GTTCGATTAC 1920 

CTGCATCTAG TTTCGAAAGA ATATTGGTGA CATAGTTTCG GACTGTTCCG TTGGATAGAT 1980 

AAAGTTTGTC TGCAATTTCT TGCTTAGAGA AGCCCTGAGC AATTCCCTTT AAAACTGCGA 2040 

TTTCTTGCTC CGTTAATGGA TTGGGATGCA TCATCACCAC TTCCATCAAT TCAGGCGAAT 2100 
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ACTCCTTGCG 


TCCTTCGAGG 


ACGGTGTGCA 


AGGTTTGCAT 


GAGGTCTGCA 


ATGTTTCTTT 


2160 


CTTTTAATAC 


ATAAGCATCT 


ACTCCAGCCT 


TGACCGCACG 


TTCAAAATAC 


CCAGGACCCT 


2220 


TGAAGGTCCT 


CACCACAACC 


ACCTTTGTTT 


CAAGCTTTTC 


TGCTCGTATC 


CACTCCAAGA 


2280 


CTTCAAGACC 


TGTCTTAACA 


GGCATTTCTA 


CGTCAAGGAT 


GGCGATATCT 


ACAGACTCCT 


2340 


TTTCTAATAG 


TTGGATTGCT 


TCTTGCCCAT 


TCTTGGCTTG 


AAAGACAGAC 


TCTACATCCG 


2400 


- GTTGAAGCAT GACCAACTGG 


CACATGGCAT 


CTCGCAXCAT 


ACTTTGATCT 


TCTGCGACTA " 


2460 


ATACTTTCAT 


CTACTTTCTC 


TCCTTATAAA 


GTAGTCGAAC 


CTGCACTTCA 


GTTGGATGTT 


2520 


TCTGACTGAT 


TACACTTACT 


TCTCCTGAAA 


ATGGAAAAAC 


ACGATTTCGG 


ACTGTATGGA 


2580 


GCTCATCCCC 


GCTTATAGAG 


GCAAAGCCAC 


AGCCATCATC 


TCTCACTGTT 


AGAATGAGTT 


2640 


CTTTCTCTGT 


CCGTTCTAAT 


TTCAAGTAGA 


CTTTAGACGC 


TTTAGCATGT 


TTGATGATAT 


2700 


TGGTCACTAA 


TTCAAGCAAA 


ATCATGGAAG 


CCGTTGACTC 


CAATTCCTGA 


GTTAAGCTAG 


2760 


ACTTGTCCAA 


GTGATTCTCA 


ACTTGAACCT 


CAATTCCAGC 


AATTTCTAAC 


ATCTTTTTCA 


2820 


CAGTCTCTAG 


TTCGGATGTC 


AAAGTTCTAG 


ACTTAAGATT 


TTCCACAATG 


GTTCGCACTT 


2880 


CATTCATGGA 


CCCTTGCTGA 


TCTGGTGAAT 


TTCTTTTAAT 


TCCTTTTCCA 


CCTGTGGATA 


2940 


AGCCTCCATC 


TGAAATAACT 


GCAAGGCTAA 


ATCTGTCTTG 


ACACTCAGCA 


TAGCAAAGGT 


3000 


ATGTCCCAGA 


CTATCATGCA 


AATCCTGACC 


GATACGACTA 


CGTTCATTTT 


CAGCAAGCAA 


3060 


TAGATTTATC 


TGAGCATTTT 


GCTTGACCTG 


AGCTTCTTTC 


AAATCCTCGA 


CAATACGAAT 


3120 


CCGAACCAAT 


CCAAAAGTCA 


TTAAATCGAC 


AAAAGTAAGA 


ATTACAAGTA 


GATAGAATAG 


3X80 


AAACTCAACT 


TCGATTCTCT 


GAAAAATCAA 


CAGTTGCCCC 


ACAACAAGGA 


CTTGAGCAAG 


3240 


AAGAAAAGTC 


CAGACATGTA 


AAGACTTTAA 


ACTACGTACG 


CTGAAATGAT 


AACTTAAGAG 


3300 


ATTGGATAGG 


AAAAAGAAAA 


ACCAGATATA 


ATTAACAGCA 


ACAAAGGCAG 


TATTCCCAAC 


3360 


TACATAAGTC 


AGCATGAGGC 


CCCAATATAG 


CCAAGATAGG 


CGCTCGCTCT 


TAGTTGTTAA 


3420 


AACACCCAAA 


TATGCCACTA 


CAAATAGAAT 


ATCAATCAAT 


AAATGCCAGG 


CAGAAAGCCA 


3480 


CCCAGTCACT 


ACAGACAGGA 


TGGGGAAAAT 


CATAAAAATT 


AAACTGATCC 


AAAACATATA 


3540 


ATGTATTCTT 


TTCAGTCTTT 


CAAGCATTAA 


GCATTCTCCT 


TATGACCTTG 


AAGGTAAATG 


3600 


GTCAAACCAA 


ACAAAACTAC 


TGAAAAAACA 


AGTAAATAAA 


CTGTGGCTGA 


TAGATTGATG 


3660 


CCACCCTCAT 


TTAAGAAGGT 


CTTGAGCAAC 


TCCATCAACT 


GATAGGTCGG 


GAGACACTTA 


3720 


CCTACTACTT 


GCATCCAGTC 


TGGAAATAAA 


GAGATAGGCA 


TCCAGAG7CC 


ACCTAAAACA 


3780 


GCCAACCCTA 


GATAAAGAAG 


ATTGCCCACG 


ACAGACATCA 


ACTGACTAGT 


TGGTAAGAGA 


3840 



WO 98/18931 PCT/US97/19588 



1172 



CTCAAGGTCA 


AACCAACCCC 


TACGAAGGCA 


ATACTTCCTA 


CTATCAGCAA 


AAGTGCAGCC 


3900 


CCAATCCAAT 


TTCCAAGAGA 


CATGTCCACA 


CCTCTTACAA 


AATGCCCAAC 


TGAGAAAACC 


3960 


ACCAAGATTG 


AAACCAAATA 


ATCAACCAGC 


ATACTTGTTA 


TCTTTGATAG 


ATAATATTCT 


4020 


ACCATATTTA 


CAGGGCTATG 


ACGCAATGTT 


TTCTGCCAGT 


TGTTGATCTT 


GTCCGTATGT 


4080 


AAAACAACTG 


GGAATGAGAA 


GATAGCTGTT 


GACATCATGG 


AAAATGCAGT 


CATGGAGATA 


4140 


AGATAATCAC 


GCATAAAATT 


CGCGAGTTCA 


CCTGGTGTGT 


CCTGATAGAT 


ACCAGAAAAA 


4200 


AATAAATAGA 


AAGCCGTCGG 


CATCCCTACT 


GACAATAGAT 


AATAGATCAA 


TTGTCGTTTG 


4260 


GTCAATAAAA 


ATTCTATCTT 


ACTAAGTGCT 


AGCCATCGTT 


TCATCTTAGT 


TATCTCCCTT 


4320 


CTGCGTTTCT 


TCAAAGATTG 


TATCCAACAA 


ACTACGATTA 


TTAACTTCAA 


TTTCTTGTAT 


4380 


GCCACATCCT 


GCTTGAACTA 


ACAGTTCCCA 


AAAAGCATCT 


GCTTCGCGTG 


TGACTACTTG 


4440 


TAG AG CATC C 


TGTTTTTGTG 


ACCAGTTTTC 


AACCAAGTTA 


GACTGCTCAA 


TGACTTCCTT 


4500 


GTATGCCAGA 


GGAAGGATAA 


AATGCTTTTC 


AATTCCCTCA 


CTACGCATAG 


CTAGAGGCGT 


4560 


CGTATCACGA 


ATCAACTCTC 


CCTTATTTAA 


AACCAAAATC 


CGGTCAGCCG 


TATGCTCTAC 


4620 


CTCTTCAATA 


TAATGAGACG 


AATAGAGAAT 


CGTGACTCCT 


TGCGCTTTTA 


GGTCCCGAAC 


4680 


GATTTCCCAA 


AAGCGTTGAC 


GAGTTGAAGT 


ATCCATGGCA 


GCAGTTGGTT 


CATCTAAAAA 


4740 


GACAAGCTTT 


GGTCGCCCAA 


TCAAGGTCAA 


GACAAAAGAG 


AAGAGACGCT 


TTTGCCCGCC 


4800 


TGACAATTTT 


TCTGCGAATT 


GCTCTTTTTG 


TTGCTGGTCA 


AACTGCAATA 


GTTGATCGAT 


4860 


TTCCTCATCG 


CTCAAGGAAT 


TTCGATAGAT 


ACGTTGAAAG 


AAAGCAATCA 


ACTCTTTGAC 


4920 


CTTTAATTTC 


TGAACGATGA 


CATTTTCTTG 


AGGCAGATAA 


CCTCTAATAT 


AGTCTAACTG 


4980 


AGAACTCGTC 


ACTGACAAGC 


CTTGGATGGA 


TACTTGACCG 


CTTGTGACCA 


GTTTATCTCC 


S040 


AAGCAGACAG 


TCCAAGAGTG 


TGGTCTTCCC 


AGCACCATTG 


GGCCCAATCA 


AGCCGACGCA 


5100 


TTCACCTTCA 


GCTACCTCAA 


AGGAAATACC 


CTTCAAAATA 


GCCTTGCCCT 


TGATGTTTTT 


5160 


ATTTAGGCTT 


TCTACCTTAA 


TCATATTCAT 


GATATTCTCC 


TTTCAACCAC 


TCCATTCTCA 


5220 


TAAGGAAAAC 


GACGAAAATC 


ATAAATCCAA 


ACCCCAAAGC 


ACCACGAATG 


AATTGGCGAA 


5280 


gCAAGGTTTG 


GTCAAACCAA 


CCTGTAAACA 


TTTCCACTAA 


CCATACCAAG 


AGTGACAGGC 


5340 


CGATAAAGAA 


ATAGATGATC 


CCTCTCTTCA 


TTCCTCAAGC 


TCCTTTTTCA 


CATCTCCCAC 


5400 


TAATTTCAAA 


CCTTCTCTAA 


CAAGCCAAGA 


CATCATTCCA 


AAGCCAGCAA 


AGAGCTCCCA 


5460 


AGGAAAATGA 


TAGAAACTCT 


CATCCAATCC 


CGAAAACATG 


ACTTAGGTCA 


TAACTCCTGC 


5520 


TACTACTAAA 


CTCACTGCGA 


TAATCATTTT 


ATTTCTCATC 


TCTTCTTCCT 


CCATTTCATA 


5580 


CTACAATTAT 


AGTCTTTTGA 


AATCAGAGGA 


GACAGAAGCT 


TCTGTCACTA 


GAAAATATGA 


5640 
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CAAATCTCAT AAAAAATTCT GTTCAAAACA AGCAAGATAC ACTATACAAT AAAACACAAT 5700 

TAGAAAAATC TAAGGCAACT TCCTCAAAAG AGATATCAAA CCCAATTCAC ACCATAATGT 5760 

AAACTAATAC TTATTTAAAA TCAAAAAGAG TAGAAATTTT TATCAGACAA AC AC AT AT AT 5820 

AGTGTATTGA ATCTATAACA GTAGGCCTTA AATACTAAAA TATTTCTATA AATTAATTTA 5880 

ACTTTCCTGA TAGAGCTGTT CATATCTTAT TTCAATTCTC TAAATTATAC CTTGAACAAA 5940 

ACCCTTCTAT TTCTTTCTTA AAGATTTATA AGAGTTATAA AATCTGTTAA ATTTCAATGT 6000 

GTATACCTAA ACTACGGTAT TTATTGAAAA GACTGGAGAC AAAAAGTATA CGCTGCCAAA 6060 

ATGAATTACT GAAAATCAAA AAAGAGAGAA CCAAACTGAT TCCCTCTTAA TGTATATAAT 6120 

ATCTAGTTTT AAAAATACAC ACTCACATAT CTCTGTAATG AATCGGGAAG ACAGGATTCG 6180 

AACCTGCCAC ACCTTGGTCC CAAACCAAGC ACTCTACCAA GCTGAGCTAC TTCCCGAGTT 6240 

AAATAGAAAA ATGCACCCTA GAGGAGTCGA ACCTCTAACC GCCTGATTCG TACTCACCTA 6300 

CTCTATCCAG TTGAGCTAAG GGTGCTCCAT ATTATCCCGA GGACCGGAAT CGAACCGGTA 6360 

CGATCGTTAC CAATCGCAGG ATTTTAAGTC CTGTGCCTCT GCCAGTTCCG CCACCCCGGC 6420 

CTCTCTAACC GAACGACGGG ATTCGAACCC GCGACCCCCA CCTTGGCAAG GTGGTGTTCT 6480 

ACCACTGAAC TACGTTCGCA CTGTTTTCTT CTATCTAAAA ATGCCGGCTA CATGACTTGA 6540 

ACACGCGACC CTCTGATTAC AAATCAGATG CTCTACCAAC TGAGCTAAGC CGGCTCATTT 6600 

GTTATATCTT AATGCGGGTT AAGGGACTTG AACCCCCACG CCGTTAAGCG CCAGATCCTA 6660 

AATCTGGTGC GTCTGCCAAT TCCGCCAAAC CCGCATATAT GACCCGTACT GGGCTCGAAC 6720 

CAGTGACCCA TTGATTAAAA GTCAATTGCT CTACCAACTG AGCTAACGAG TCTAAAATAA 67 80 

cTTGCGTTAC CTTAAACGGT CCCGACGGGA ATCGAACCCG CGATCTcGCC CTGACAAGGC 6840 

GACGTG 6 846 
(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2911 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

GAATTCATTT TAAATAAAGA TACGGGAGAG GTAAGTGAAT TAAAACCTCA TAGGGTAACT 60 

GTGACCATTC AAAATGGAAA AGAAATGAGT TCAACGATAG TGTCGGAAGA AGATTTTATT 120 
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TTACCTGTTT ATAAGGGTGA ATTAGAAAAA GGATACCAAT TTGATGGTTG GGAAATTTCT 180 

GGTTTCGAAG GTAAAAAAGA CGCTCGCTAT GTTATTAATC TATCAAAAGA TACCTTTATA 240 

AAACCTGTAT TCAAGAAAAT AGAGCAGAAA AAGGAGGAAG AAAATAAACC TACTTTTGAT 300 

GTATCGAAAA AGAAAGATAA CCCACAAGTA AACCATAGTC AATTAAATGA AAGTCACAGA 360 

AAAGAGGATT TACAAAGAGA AGAGCATTCA CAAAAATCTG ATTCAACTAA GGATGTTACA 420 

GCTACAGTTC TTGATAAAAA CAATATCAGT AGTAAATCAA CTACTAACAA TCCTAATAAG 480 

TTGCCAAAAA CTGGAACAGC AAGCGGAGCC CAGACACTAT TAGCTGCCGG AATAATGTTT 540 

ATAGTACGAA TTTTTCTTGC ATTGAAGAAA AAAAATCAAG ATTAAGATAA AAGCTATAGA 600 

AAAAAATCGT TTATGTACTG AGATTAGATA GTGAGGTGAT GACATAGTTT TGTGAAAATA 660 

GCCATTTATA ACTCAATTAT TTAGTTTACT TTACTTTACT AGTGATACTA TTTGGAGTTA 720 

TTAATGGACT TAGTTTATAT AACTAATGAA TTGATTGAAA GGGTTAGTAT TGACAATATT 780 

GGTCATATTG ACTAGAAAAT AGAGTCTATC AAAATTTAAA GGCTAATACA GGTGATGAGA 840 

CAATTTCGGC TCTTTGTCAA CTGTAGTGGG TTGAAGTCAG CTAAGCTCGA GAAAGCACAA 900 

A T TTTGTCCT TTCTTTTTTG ATATTCAGAG CGATAAAAAT CCGTTTTTTG AAGTTTTCAA 960 

AGTTTCGAAA ACCAAAGGCA TTGCGCTTGA TAAGTTTGAT GAGATTATTG GTCGCTTCCA 1020 

GTTTGGCATT AGAATAGTGT AGTTGAAGGG CATTGACAAT CTTCTCTTTA TCTTTGAGGA 1080 

AGGTTTTAGA GGATGAACTT GATTCAGATT GTCCTCAATG AGTCCGAAAA ATTTGTCAGG 1140 

CTCCTTATTC TGAAAGTGAA AAAGCAAGAG TTGATAGAGA TTATAGTGGT GTTTCAAGTC 1200 

TTCTGAATAG CTCAAAAGTT TATCTATAGT AGATTGAAAC TAGAATAGTA CACCTCTGCT 1260 

TCTAAAACAT TGTTAGAAAT CGATTTGACT CTCCTGAATG ATTTGTCCTG TTATTATTTC 1320 

ATTTTACTAT AAATCCACGT TTACGAATCT CTTTCCACAC TTGTTCAATG GGGTTCATCT 13 80 

CTGGTGTGTA TGGAGGAATA AATGCAAAAC CAATATTAGT CGGAATCTTT AAGGTACTTG 1440 

ATTTATGCCA TATAGCATTG TCCATAACGA GTAAAAGATA ATCATCTGGA TAAGCTTGTG 1500 

AAAGCTCCTA TTCCTAAAGC CCCTTTATAA CCTCTTGCGA GAGAGACTAT TGACTCAGCC 1560 

CTTACTTCAT GCGGATGAAA CTTCTTATCG GGTTCTAGAG AGTCATAGCC ATCTGACCTA 1620 

CTATTGGACC TTTTTGTCTG GGAAAGTTGA GAATCAAGCA ATCACGCTGT ACCATCATGA 1680 

TCAGAGTCGG AGTGGTTCGG TAGTACAAGA ATTCCTAGGA GATTATTCTG GCTATGTTCA 1740 

TTGTGATATG TTGCGGCAGT AACTTAGGAC TTTAGTCCTC TAGTTCTGCC TATGCGATAG 1800 

CAGTCCAAGG TTTAGGAGCA AGGCGACGCT AAGCTTGGTA AACTGCGAAC CCCTAGAAGC 1860 

TTATCGTCAA CTGGAAGAAG CTGAACTTGT TGGATCTTGG GCGCATGTGA GAAGGAAATT 1920 
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TTTTGAAGCG ACCCCCAAGC AAGCAGATAA ATCATCCTTA GGAGCTAAAG GTTTAGCTTA 1980 

TTGTGATCAG TTATTTTCCT TGGAAAkAGA CTGGGAGGCT TTGCCACCTG ATGAACGACT 2040 

ACAGAAACGT CAAGAACATC TCCAGCCCCT AATGGAAGAC TTCTTTGCTT GGTGCCGCCG 2100 

TCACTCAGTT TTAGCAGCTT CAAAACTAGC AAGGGCAATT GAATACAGCC TCAAGTATGA 2160 

AGAAACCTTT AAGACTATTT TGAAAGACGG ACATCTGGTC CTTTCCAATA ATCTAGCTGA 2220 

-ACGeGCCATT-AAATCATTGG-TTATGGGACG GAGTAAAAGAH7TCCAGTGGA CTCTTTTAGC 2280~ 

CTGAGCTCAG TTTAAAAAAG CGAGGGTGGT TATTTTCTCA AAGTTTTGAA GGAGCTAAAG 2340 

CAAGAGCTAT TGTTATGAGC TTGTTGGAAA CAGCTAAACG TCATCAATTA TAGTGCGTTG 2400 

AATCTATAAC AGTACGCATC GACTGCTAAA ACATTTCTAT AAATCAATTT TCCTTTCCTA 24 60 

ATCGATTTGT TCATATCTTA TTTCAATCCA TTATAAATAG CGAGAAATAT CTATCCTATC 2520 

TTCTAGAATG TCTTCCAAAC GAGGAAACTC TCGTAAACAA AGAGGTTTTA GAGGTTTATT 2580 

TACCATGGAC TAAAGTTGTA CAAGAAAAGT GCAAATAAGA AATCTCCAGA TTAGGAACTA 2640 

TCCGTGAGTT CACTAATCTG GAGATTTTTC AATAGAtTCG TTATTGGGCG GTTACGATAT 2700 

GATCACTACT TCGTCAGTCT TATCTACAAC CTCAAAACAG TGTTTTGAGC AACCTGCGAC 2760 

TAGCTTCCTA GTTTACTCTT TGATTTTCAT TGAATATTAG AACAGAAAAA ATGCTTGGAG 2820 

TATTTGTTTG TGTGTTTATT TTTATATAAC AAACTATAAA CAAAATAAAA ATATAAAAAA 2880 

AGAGACAAAA AAGAACAGAA AGTAATTGAC A 2911 

(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6854 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEONESS : double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 200: 

GAAAATAAGT CTTGACAGAA AGCGCTATCA ATGATAGAAT GAATTCAGAT AAAAAGATTT 60 

ATTTTTAAAA CAAAAATGAA ACGTTTCAAA AAAAGAAATA AAGAGACAGC GCCAAGCGCT 120 

ATCTTTTCTA GAAAAAAATG AAACGTTTCA AAAAAGGAGG TTGCTATGAA TAGCAAAGCG 180 

AAGCAAGTTT CTCTTTGGGA AAGAATCAAG AAACAAAAAC TCTTGTTATT GATGACTGTC 240 

CCCGGTTTAG TTTTAACCTT TATCTTTAAA TACATCCCTA TGTATGGGGT TTTAATCGCA 300 

TTTAAAGATT ACAATCCTTT AAAAGGAATT TTAGGGAGTG ATTGGATTGG TTTTTCTGAG 3 60 
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TTTACAAAAT TCATATCCTC TCCCAACTTT GGTATCTTGT TAGCCAACAC ATTAAAATTA 420 

AGTATCTATG GTTTATTGCT TGGCTTTTTA CCACCAATCA TTCTCGCGAT TATGCTCAAT 480 

CAACTCTTGA GTGAAAAAGT CAAAAAACGA ATTCAGCTCA TTTTATACGC ACCAAACTTT 540 

ATCTCAGTCG TTGTTATTGT CGGTATGATT TTCCTCTTCT TTTCAGTGGG AGGACCAATC 600 

AACAATTTTC TTTCTATGTT TGGAATGAAG GCTGACTTCT TGACAAATCC AGACTTCTTT 660 

AGACCTTTAT ACATCTTTAG TGGTATCTGG CAAGGAATGG GCTGGGCTTC AACGCTCTAC 720 

ACGGCAACAT TGGTAAATGT AGATCCAGCC TTAGTAGAAG CAGCCCGACT GGATGGAGCC 780 

AATATCTTCC AACGAATCTG GCACATTGAT ATTCCAGCTC TTAAGCCTAT TATGGTTATC 840 

CAATTT G TTT TAGCTGCAGG TGGAATTATG AATGTCGGAT ATGAAAAAGC ATTCTTGATG 900 

CAGACATCGT TAAATTTGCC AACTTCTGAA ATTATCTCGA CATATGTCTA TAAAGTTGGT 960 

CTTGTATCAG GAGACTATTC TTACTCAACA GCGGTTGGTT TGTTTAATGC AGTGATTAAC 1020 

GTAGTATTGC TTGTTGCAGT TAACCAAATC GTTAAACGCA TGAATAATGG TGAAGGAATT 1080 

TAAGGAGGAA AGTATGAAAA ATTCGATTAT GGATACAAAA TTTGATAGAC GTATCTTACT 1140 

CTTAAATAAA ATCATTATTG TCTTTATCGT TTTGATGACT TTGCTTCCTT TACTTTATAT 1200 

CGTCGTAGCA TCCTTTATGG ATCCTAAGGT TCTGGTTAGT AGAGGGATTA GCTTTAATCC 1260 

AGCCGATTGG ACTGTAGAAG GTTACCAGCG TGTATTCAGT GACCAATCTA TTCTAAGAGG 1320 

TTTTATCAAT TCTCTACTAT ACTCTTTTGG ATTTGCAGCT TTAACAGTCT TGCTATCTGT 1380 

GTTTACAGCT TATCCTCTTT CTAAGAAAGA CTTGGTTGGA CGTCGTTGGA TTAACTACTT 1440 

CTTGATTGTA ACTATGTTCT TTGGTGGTGG TTTAGTCCCA ACTTACTTGC TCGTAAAAGA 1500 

ATTGGGAATG CTCAATACTC CATGGGCTAT CATTGTTCCA GGTGCTGTTA ACGTTTGGAA 1560 

TATTATTCTT GCTAGGGCCT ATTTCCAAGG ATTGCCTGAA GAATTAGTTG AAGCTGCTGT 1620 

CATTGATGGT GCAAATGATT TACAGATTTT CTTCAAAATC ATGCTTCCTC TTGCAAAACC 1680 

AATTATGTTT GTTCTCTTCC TTTATGCTTT TGTAGGACAG TGGAACTCAT ACTTTGATGC 1740 

AATGATTTAT ATCAAGGATC CAAACTTGGA ACCATTGCAA CTTGTACTTC GTAAAATTCT 1800 

CATTCAGAGC CAACCAGGTC AAGACATGAT TGGAGCACAA GCGGCTATGA ATGAAATGAA 1860 

ACGTTTAGCT GAATTGATTA AATACGCAAC TATTGTCATT TCCAGCTTGC CATTGATTGT 1920 

TATGTATCCA TTCTTCCAAA AATACTTTGA TAAAGGAATT ATGGCTGGTT CACTTAAAGG 1980 

ATAAAAAAAG AAAAAATAAA AGGAGTTTTC TCATGAAATT CAAAACATTC TCAAAATCAG 2040 

CAGTTTTGTT GACAGCTAGT TTAGCAGTAC TTGCAGCCTG TGGCTCAAAA AATACAGCTT 2100 

CAAGTCCAGA TTATAAGTTG GAAGGTGTAA CATTCCCGCT TCAAGAAAAG AAAACATTGA 2160 
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AGTTTATGAC AGCCAGTTCA CCGTTATCTC CTAAAGACCC AAATGAAAAG TTAATTTTGC 2220 

AACGTTTGGA GAAGGAAACT GGCGTTCATA TTGACTGGAC CAACTACCAA TCCGACTTTG 2280 

CAGAAAAACG TAACTTGGAT ATTTCTACTG GTGATTTACC AGATGCTATC CACAACGACG 2340 

GAGCTTCAGA TGTGGACTTG ATGAACTGGG CTAAAAAAGG TGTTATTATT CCACTTGAAG 2400 

ATTTGATTCA TAAATACATG CCAAATCTTA ACAAAATTTT GGATGAGAAA CCAGAGTACA 24 60 

_ _ _ AGGCCTTGAT-GACAGCACCT-GATGGCK:ACA-TTTACTCATT"TCCATGG^ 2 ~520 

GAGATCGTAA AGAGTCTATT CACAGTGTCA ACGATATGGC TTGGATTAAC AAAGATTGGC 2580 

TTAAGAAACT TGGTCTTGAA ATGCCAAAAA CTACTGATGA TTTGATTAAA GTCCTAGAAG 2640 

CTTTCAAAAA CGGGGATCCA AATGGAAATG GAGAGGCTGA TGAAATTCCA TTTTCATTTA 2700 

TTAGTGCTAA CGGAAACGAA GATTTTAAAT TCCTATTTGC TGCATTTGGT ATAGGGGATA 2760 

ACGATGATCA TTTAGTAGTA GGAAATGATG GCAAACTTGA CTTCACAGCA GATAACGATA 2820 

ACTATAAACA AGGTGTCAAA TTTATCCGTC AATTGCAAGA AAAAGGCCTG ATTGATAAAG 2880 

AAGCTTTCGA ACATGATTGG AATACTTACA TTGCTAAAGG TCATGATCAG AAATTTGGTG 2 940 

TTTACTTTAC ATGGGATAAG AATAATGTTA CTGGAAGTAA CGAAAGTTAT GATGTTTTAC 3000 

CAGTACTTGC TGGACCAAG7 GGTCAAAAAC ACGTAGCTCG TACAAACGGT ATGGGATTTG 3060 

CACGTGACAA GATGGTTATT ACCAGTGTAA ACAAAAACCT AGAATTGACA GCTAAATGGA 3120 

TTGATGCACA ATACGCTCCA CTCCAATCTG TGCAAAATAA CTGGGGAACT TACGGAGATG 3180 

ACAAACAACA AAACATCTTT GAATTGGATC AAGCGTCAAA TAGTCTAAAA CACTTACCAC 3240 

TAAACGGAAC TGCACCAGCA GAACTTCGTC AAAAGACTGA AGTAGGAGGA CCACTAGCTA 3300 

TCCTAGATTC ATACTATGGT AAAGTAACAA CCATGCCTGA TGATGCCAAA TGGCGTTTGG 33 60 

ATCTTATCAA AGAATATTAT GTTCCTTACA TGAGCAATGT CAATAACTAT CCAAGAGTCT 3420 

TTATGACACA GGAAGATTTG GACAAGATTC CCCATATCGA AGCAGATATG AATGACTATA 3480 

TCTACCGTAA ACGTGCTGAA TGGATTGTAA ATGGCAATAT TG AT ACT GAG TGGGATGATT 3 540 

ACAAGAAAGA ACTTGAAAAA TACGGACTTT CTGATTACCT CGCTATTAAA CAAAAATACT 3600 

ACGACCAATA CCAAGCAAAC AAAAACTAGA GGTTGATTAT GGGAGATAAG AAATACACAG 3660 

TAGAAAAAGC CAATCGTTTT ATAGCAGAAA ATAAACATCT CGTTAATACT CAATATAAGC 3720 

CTGAAGAACA TTTTTCAGCT GAGATTGGTT GGATCAATGA TCCAAATGGA TTTGTCTATT 3780 

TTCGTGGAGA ATACCATCTC TTTTATCAAT TCTATCCATA TGATAGTGTT TGGGGGCCTA 3840 

TGCACTGGGG ACATGCTAAA AGTAAGGACT TGGTGACTTG GGAGCACTTG CCAGTGGCAC 3900 
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TTGCTCCTGA CCAAGATTAT GACCGAAATG GTTGTTTCTC AGGCTCTGCC ATTCTCAAGG 3960 

ATGATCGCCT CTGGCTCATG TACACTGGAC ATATCGAAGA AGAAACCGGT CTCCGCCAAG 4020 

-TGCAAAATAT GGTATTTTCA GATGACGGGA TTCACTTTGA AAAGATTTCC CAAAATCCAG 4080 

TTGCAACTGG ATCAGACTTA CCAGATGAGT TGATTGCTGC TGATTTCCGT GATCCAAAAC 4140 

TCTTTGAAAA AGATGGACGC TATTACTCCG TAGTACCTGC CAAACACAAG GATAATGTGG 4200 

GCTGTATCGT TCTACTAGGG TCCGATAACC TAGTAGAATG GCAGTTCGAA TCCATCTTTT 42 60 

TAAAAGGGGG AGAACACCAA GGTTTTATGT GGGAATGCCC AGATTACTTC GAGTTAGATG 4 32 0 

GGAAAGATTG CCTTATTATG TCACCCATGC GTTATCAGCG TCAGCGACAC TCATATCATA 4380 

ACATCAACTC ATCGCTTTTG TTCACGGGTA AGGTAGATTG GAGAGAAAAA CGTTTTATCC 4440 

CAGAATCAGT TCAAGAAATT GATCATGGCC AAGACTTCTA TGCGCCTCAA ACATTGTTGG 4500 

ACCATCAAAA TCGTCGTATC CTGATTGCTT GGATGCAGAC ATGGGGGCGT ACCCTTCCAA 4560 

CCCATGACCA AGAACACAAG TGGGCATCTG CCATGACTCT ACCTAGAATT CTAAGATTGG 4620 

AAGATGGCAA ACTAAGACAA TTCCCTGTTA AAAAAGGCCA ATATCAAATC CAAATAGATA 4 680 

AAGATTGTCA TTACCACTTA GGAAATGATA TAGATTATCT TGAATTTGGT TATGACAGTA 4 740 

ATGCGCAGCA AGTTTACATT GATCGTAGCC ATCTTATTCA AAAAATTCTA GGTGAAGAAG 4800 

AACAGGACAC TAGTCGACGG TATGTAGATA TTGAAGCTAA AGAATTGGAA GTTGTTCTAG 4860 

ATAAAAATTC CATCGAGATT TTTGTCAATC AAGGTGAAGC AAGCTTGACT GCAACTTATT 4920 

ACTTAACGGT GCCAGCTGAG CTATCACGAA TTGATTAAAA ATTAAGTTAT TTCTCCTAAA 4980 

GAAAAAGTTC TCTTTCTAAA ATAGTGGAAA GAGGACTTTT TGTCTTTTGG GTATATAAGC 5040 

TTAGTTTATG GTATTTGTAA AATTGGTGTT GGATTATGAT TTAAGCTAGT TTTCTAAAGA 5100 

ATTTGAAAAA AATTTTATTT AAGCAAAAAA ACCTTGGTTC CAAGGCTTTT CCTGTTGTAT 5160 

TTAGATGCCC CCTACAGGGA TTGTAGGAGA TATGTTGCTT AGATGTTCTT GATTTTCTGG 5220 

TGTTTTGTAA CGTTTAAATG AGTTTTTTGA GTTTGTTGGT GGGGCGTTGC CCGGCAATTG 5280 

CCCGACTTAT TGCTTGAAAA AGAATTTAAA ATATAGTATA GTTAATTATA GATTAACACT 5340 

TGCTTGGAGG AACTGATGAA GAACAATGAA AGATTAGGTA TTAAATTAAG TAGAGATAGC 5400 

GTTTTAGGAT TGAGGGAAGT TAGAAGGCTT TATTTAGGCA GTTCAGATAT CCCAGTTTCT 5460 

GATGGCTATG TGATTGAAGT TGCTTATAAC CAGATATCAC ATGAGATTGA TATTATTGAT 5520 

TGGGTAGAGT TGAACAAGTC AAAAATTAAG ATAAGTGAAA TTAGTGAAAG CGTGGATATA 5580 

GATGCCACTA GCTTGAGAAC AACTTTGACT TTAGACACAT TAGTATATGA AGGTATGAGA 5640 

GATATACAGT TAAAGTTGAG AGAGCTTACA AAGGGGAGAG TATTCTTTTC ATTTGTAGTG 5700 
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AAGTTAGTTT TGTTTGCTTC TATTTTAAAG AAAAAAGATT TACTAGAAAA ATTTCAAGAA 5760 

AAGTCTTAAT CAAGTATTGA CACTTTATCT GGATTTCGGT ATAATATGCT TAGAAAGGAA 5820 

TCTTTCTAAA TTTTTTTCGT CCTTATGTGT TAATCAAAGA CGAATACAAA AACATATTTT 5880 

TTTACTCTAA AAAGTGTTAA TCAATGATGT ATTTGTTAGA GAGGTACATA AATGGAATTC 5940 

AGAGCACCAC CAGTTATAAT AGTATAAAAC GTATAATAAA AATATTTTAA CTTGAATTAT 6000 

AGAAAACGAG -AAACAAATCA TGAAACAAAA ACAACCGATT GTTTCTAGAA CGAAACAACA 6060 

TACATTTGAA GAGCTTATTC AAGACCAAAA GTTAGAAAGA TTGGCTAAGT TGTCGCCCGA 6120 

TTTGGTTGGA AGGTATGGTT TTACTGCTAG CTGTGCGTCT TCATTTGCGA ACTTGATTAA 6180 

AGAAGCGTAT GGGGGTAAAA ATCTAAACGT ACTTTATGCG AGTCGGATGT TGGCTCTCTG 6240 

GAATATTGCT TGCAGTTGTT ATCATAAGGC TGATGGGTAT TCTTTACCAG ATGCGCTTTT 6300 

TAGTGATAAA AAAATTTGTC TAGATTCTTA CTATTACCAC AAGAATACCT CTAATACCAT 6360 

AACTAGTGAT GTGATAAAAG ATGTTTACGA TAATTATAAT AATTATATGG TTTTAACTCG 64 2 0 

AGAAGCGACA CCTGAATACA TTTATGTTGT ACAAACTGAA ATGCCAAAAG ATTCAGATTT 6480 

ATATTTTTAT ATTAGAGAAG TTCTGGGATT ATCCTTTAGT ACCATGCATT ATGCATTTTT 6540 

AGTCAAGGTT CTTGCAGGAG CGCTTGCTAG AAAATATAAG CCATATCGAA ATTGAATTAT 6600 

TTAAATTTAT ACTCTTCGAA AATCAAATTC AAACCAAGTC AGCTTCGCCT TGCTGTACTC 6660 

AAGTGCTGTC TGTGGCTAGC TTCTTAGTTT CCTTTTTGAT TTTCATTGAG TATTACTCTT 6720 

ATGGTAGTTA TTTATGGCAT AATAATATTG ATTTGGGAGT TATAGCGAAA ATTTTAGGTT 6780 

CTATAATATT TGTAGTGGCT AAACCACTAT AGATATTATG GAGCCTATTT ATTGTAGAAA 6840 

AAAGTCCCAT ATGA 68 54 

(2) INFORMATION FOR SEQ ID NO: 201: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 3895 base pairs 
(Bi TYPE: nucleic acid 

(C) STRANDEONESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 

TCCTTGCTAA GTTTATACTC AATGAAAATC AAAGAACAAA CTAGGAAGCT AGCCACAGGT 60 

TGCTCAAAGC ACCGCTTTGA GGTTGCAGAT AAAACTGACA CGGTTTGAAG AGATTTTCGA 120 

AGAGTATTAA TTTACATAAA TAGCCAGTGT TTGATAGGGT TTGAGTAGAA TTTTCTCAGA 180 
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CACTTCTGCA 


TCTTCATAGT 


TTGATATCAA 


AATCTGTCCA 


TTTTGGTAGA 


CTGCTGGCAA 


240 


GTCGATTTcA 


CTTCTTTAGC 


ATAAAAGTTA 


TTGAGCACTA 


GTAACTTTTG 


ATCCTCAAAC 


300 


TGGCGTTCAA 


AAGCCTACAC 


TTGTTTGCTA 


TCTTCAAAGG 


CTGCTTTGTA 


ACTTCCTTCT 


360 


GAAATGATTG 


GCATTTCCTT 


ACGCATCGAA 


TCAAGTCTTG 


ATAGAAGGTA 


AAAATCGGAC 


420 


CCTGGATTTC 


ATTTTCTACA 


TTGATGTATT 


TATAGGATTT 


ACCAGCTTTC 


AACCAAGGAG 


480 


TGCCTGTTGA 


AAATCCTGCA 


TTTTCCGAAG 


CATCCCACTG 


CATGGGAATG 


CGTGAATTAT 


540 


CACGCGACTT 


AGCTTGAATA 


ATCTGGAAGG 


CTTCTTGCTG 


ACTCTTTCCT 


TCTTCTAAGA 


600 


GCATCTGATA 


GGCATTAAGC 


GATTCGACAT 


CCACATAATC 


AGCCATAGAA 


TCATAGTCTG 


660 


GGTCAATCAT 


CCCGATTTCC 


TCACCCATGT 


AGATATAAGG 


TGTCCCACGT 


GACAGGTGAA 


720 


TGCTGGCTGC 


TAGCATGGTG 


GCTCCTTCCT 


TGCGGAAGTT 


TTGAATATCG 


ACAAAACGGT 


780 


TCAAGGCACG 


TGGTTGATCG 


TGATTATTCC 


AAAAGAGGGC 


ACTCCAACCG 


TCTTTATCAC 


840 


TCATTTCCTT 


ACCCCAACTA 


TGGTAAAGAC 


TCTTCAACTC 


TTCAAAATCA 


AAGGGAGCCA 


900 


AGGTCCACTT 


TTCTCCATCC 


TTATAGTCCA 


CCTTGAGGTC 


ATGAAAATTA 


AAGGTCATGG 


960 


ATAATTCCTG 


ACGATCAGGC 


GACGAATAGA 


GGACACAGTT 


TTCCATGGTG 


GTAGAAGACA 


1020 


TTTCCCCAAC 


TGTCATAAAG 


CTATCGTCGG 


ATCCAAAAGT 


GGCTTGGTTC 


ATCATACGCA 


1080 


AATAGTTATG 


AACGATGGGT 


TTGTCTGTAT 


AAGCTGGCTT 


CCCTTCATTT 


TCAGGACAGT 


1140 


CCACTGAAAC 


CTCGTCCTTA 


CCGATCAAAT 


TGATCACATC 


AAATCGGAAA 


CCTTTGACAC 


1200 


CCTTGTCGCG 


CCAGAAATTA 


ACAACCTTGA 


AAAGCTCCTT 


ACGGACATTG 


GAATTGCGCC 


1260 


ACTTAAGGTC 


AGCCTGGGTC 


TCATCAAATA 


GGTGAAGATA 


GTATTTCCCA 


GTATCCCCGA 


1320 


AAGGCGTCCA 


TGCAGAACCA 


CCAAACTTAG 


ACTGCCAATC 


TGTTGGTTGG 


TCTTGGATGA 


1380 


AGAAAAAGTC 


TTGATAATAC 


TTATCACCAG 


CTAGCGCTTT 


CTGAAACCAT 


TCATGCTCTG 


1440 


TCGAACAATG 


ATTAAGTACC 


ATGTCCAGCA 


TAAAGTCAAT 


CTTGTGCTCT 


TTACCGACAC 


1500 


ACACCATTTT 


CTCAAAATCA 


GCCATATCAC 


CAAAAAGAGG 


ATCCACTGCC 


ATATAATCTG 


1560 


AAATATCGTA 


ACCATTATCC 


CGTTGAGGGC 


TTGGATAGAA 


TGGATTGAGC 


CAGACCATAT 


1620 


CCACACCTAG 


TTTGGCTAAA 


TAGGGAATTT 


TTTCGATAAT 


CCCACGGAAA 


TCCCCAATAC 


1680 


CGTTTTCAGT 


GGTGTCTTTG 


TAAGATTTTG 


GATAGATTTG 


ATAGACTACT 


TTTCCTTTAT 


1740 


CAAGTGTCAT 


CTGTTTCTCC 


TTTTCTGATA 


AAAGGGAGGA 


AGCAGTCTTC 


CGTCCCTATT 


1800 


TGTGCTATTT 


CAATTATACT 


CAATGAAAAT 


CAAAGAACAA 


ACTAGGAAGC 


TAGCCACAGG 


1860 


TTGCTCAAAA 


CACTATTTTG 


AGGTTGCAGA 


TAGAGCTGAC 


GTGGTTTGAA 


GAGATTTTCG 


1920 


AAGAGTATTA 


GATTCGTGTA 


GCGACCATGA 


GAGATGCTCC 


AGCTTGGATC 


GTTGTCGGAT 


1980 



WO 98/18931 



PCT/US97/19588 



1181 

AAGTTCCGGC AATAGTCGCT GTATAAGCAT CTTGGTTGCT GATGATAACA GGAGTTTCTG 2040 
TCACCAGACC TGCAGCCTTA ATGACATCCA TATCAAAACG AATCAGTTGC TGACCAACTG 2100 
TAACGTGATC TCCTTGGACT ACAACACTTT CAAAACCTTT GCCATCAAGA CCTACTGTAT 2160 
CCATACCGAT CTGGATGAGC AATTCAACTC CCTCGTCAGA GACAATGCCG ATGGCATGCT 2220 
TGGTAGGGAA AAGAACCGTC ACTGTCCCAT TAACTGGAGA GGTCAACTCA CCTTGGCTTG 2280 

GTTCAATGAG--TAGACCTTGC - CCCATGACAC CTGATG 2340" 

TCAATTCTTT CACTTGGCCA GTTACTGGGC TGATAATTTC TACCGAAGTA AGTTCTACTG 2400 

GTTCATGGTT CACAAATTCT GCTTCTTCTT GAGCAACGAA TTCTGCCTGC AACTTCCTAT 2460 

CGCCCTCTGT TTTTC7AAAG AGACCAGCCT TGCGGAAGAA GAAAGTCAAC AGCATTGGAA 2520 

CAACAATCGC AACTAGCATA GTTCCTGCAA ATGGCAGCAT GTATTGAGGT TGAATAGAGA 2580 

GAATACCTGG CAAACCACCG ATACCAATAG AAGCCGCAGT TACATTAAAA GTAACGGATA 2640 

ACATGCCTGC AAGGGCTGAA CCAGTCATCC CAGCAACAAA TGGATAAATA TATTTTACGT 2 700 

TAACCCCAAA AACAGCTGGT TCTGTAACAC CGAGATAGGC TGAAATGGTT GCAGGAAGTG 2760 

AAACCTGAGC CPCACGCTCA TCATGGCGAT GCATGAAATA ATAGGCAAAC ACGGCTGACC 2820 

CTTGAGCAAT ATTAGAAAGA GCAATCATTG GCCATAGGGC AGTGCCACCA GCATCCGCAA 2880 

TCAATTGTGT ATCAATGGCA TTGGTCATAT GGTGCAGACC TGTGATGACA AATGGAGCGT 2940 

AGAGGGCGCC AAAAATTGCA CCGAAGAGCC ATTTAACTGG ACCAGTTAAA CCTCCCAAGA 3000 

CAACTGATGA AAGTCCTTGT CCAATTGT CC AACCGATTGG TCCCAAAACA GTATGAGCCA 3060 

AAATCAAGGC TGGAATCAAT GACAAGAAAG GTACAAAAAT CATAGAAATG ACTTCTGGGA 3120 

TATGCTTGTG CCAGAAGATT TCAAGATAAG ACAGACTCAA ACCTGCAAGC AAGGCTGGGA 3180 

TAACTTGGGC TTGGTAACCG ATACGATTAA CAGTAAAATA GCCAAAATTC CAAACCCAGT 3240 

TTGCCGCGAT ATCAGCTGCT GGCGTTGAAG CAACCGCATA GGCATTGAGC AACTCAGGTC 3 300 

ATACCAAACA GATTCCGAGA ACAATTCCCA AAATTTGGCT GGTTCCCATC TTACGAGAAA 33 60 

CAGACCAAGT AATCCCTACT GGTAAGAACT GGAAGATAGC TTCACCAGGC AACCAGAGGA 3420 

AGTGATTGAC ACCTGCCCAA AACTGAGAGG ATTCTGTGAT GGTCTTGCCA TCCAACATCG 3480 

ACCAATGGAC ACCTTCCAAG ACATTACGGA AACCGAGGAT CAATCCTCCG ACTATCAAGG 3540 

CTGGAATAAT CGGAGTAAAA ATCTCCGCCA GAGTGGTCAT AACACCTTGG ACCACGTTTT 3600 

GATTACTCTT AGCTGCAGAC TTGGCTGCTT CTTTGGAAAC ACCCTCAATA CCTGAAACGG 3660 

CTGTAAAATC ATTATAAAAG ATGGGCACGT CATTTCCAAT GATTACCTGA AATTGACCTG 3720 
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CATTTGTAAA GGTTCCTTTA ACAGCTGGAA TTGACTCGAT AGCTTTAACA TTAGCCTTCT 3780 

TATCATCTCC TAAAACAAAC CGCATCCGTG TCGCACAGTG ACTTACGGCA GTCACATTTT 3840 

CTTTGCCTCC GATTGCCTGA AGCAGATCTT TGGCTTCTTG TTCAAATTTT CCCGG 3895 
<2) INFORMATION FOR SEQ ID NO: 202: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3936 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 

AGGATCGCCG CTCCAGCTAC TAAGTCTCGT GCAGTGCCGA TTTATCAAAC AACATTTTTT 60 

GTTTTTGATG ACACGTAGGA AGGTGCCGAT CTGTTTGCCT TGAGGAAACC AGGGAACATT 120 

TATACTCGTA TCACCAATCC TACAACAGCT GCCCTTGAAG GTGGTGTTGA AGCGCTAgcA 180 

ACAGCATCAG GTATGACTGC AGTGACTTAT ACGATTTTGG CGATTGCCCA TGCTGGTGAC 240 

CATGTAGTGG CTGCTTCGAC TATTTACGGT GGAACCTTCA ATCTTTTGAA AGAACCCCTT 300 

CCTCGTTATG GTATCACAAC AACCTTTTTC GATATTGATA ATTTGGAGGA AGTAGAAGCA 3 60 

GCTATCAAAG ACAATACCAA GCTTGTCTTG ATTGAAACCT TGGGTAACCC CTTGATTAAT 420 

ATTCCAGACC TGGAAAAACT GGCAGAGATT GCTCATAAAC ATCAAATCCC ACTTGTGTCA 480 

GACAATACTT TTGCAACACC TTATTTGATT AACGTCTTCT CTCATGGCGT TGACATTGCC 540 

ATTCACTCTG TCACTAAGTT TATCGGTGGG CATGGTACAA CTATTGGAGG AATAATTGTC 600 

GATAGTGGTC GTTTTGACTG GACGGCTTCA GGGAAATTCC CTCAATTTGT TGACGAGGGT 660 

CCAAGCTGCC ACAATTTGAG CTATACTCGT GATGTGGGTG CAGCAGCCTT TATTATAGCT 720 

GTTCGAGTTC AATTGCTTCG TGATACAGGT GCAGCCTTGT CACCATTCAA TCCTTTCCTC 7 80 

TTGCTACAAA GACTTGAAAC CTCTTCACTT CGTGTGGAAC GCCATGTACA AAATGCTGAG 840 

ACAATTGTTG ATTTTCTTGT CAACCATCCT AAGGTAGAAA AGGTAAATTA TCCAAAACTT 900 

GCAGATAGTC CTTATCATGC CTTGGCTGAG AAATACTTGC CAAAAGGTGT CGGTTCAATC 960 

TTTACCTTCC ACGTCAAAGG TGGCGAGGAA GAAGCACGCA AGGTCATTGA TAATTTAGAA 1020 

ATCTTTTCTG ACCTTGCAAA CGCGGCAGAT GCTAAATCGC TTGTTGTCCA TCCAGCAACA 1080 

ACCACTCACG GTCAATTGTC AGAAAAAGAC CTAGAAGCAG CAGGTGTCAC ACCAAACTAA 1140 

ATTCGTTTGT CAATCGGTCT TGAAAATGTA GAAGATTTGA TTGAAGACTT GCGCTTGGCC 1200 

TTGGAAAAAA TTTAAAGTAA AAGAAGATAA ACAGTGGGCT TCGACTCACT GTTTTTGATT 1260 
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TTCCCTCAGG CATGATATAA TGGTTACAGA AGTCTAGAAA GAGGAACGAT ATGAACGAAA 1320 
TCAAATGTCC CAACTGTGGG GAAGTCTTTA CAGTAAATGA GAGTCACTAT GCCGAACTCT 1380 
TGTCCCAAGT GAGAACGGCA GAGTTTCATA AGGAACTACA CGATAGGATG AAGCAGGAAC 1440 
TCCCCTTCCC TGAGCAAAAG GCCATGAATG AGCAACAGAC TAAACTGGCT CAGAAGGATC 1500 

AAGAAATTGC GCAATTACAG AGTCAGATCC AAAACTTTGA TACAGAAAAA GAATTGGCCA 1560 
AGAAAGAGGT TGAACAGACA AGCCATGAGG CTCTCTTGGC TAAGGACAAG GAACTACACC - - 1620 

TCTTAGAAAA TCAGTTGGCT ACCTTGCGTT TGGAGCATGA AAATCAACTA CAAAAGACCC 1680 

TTTCTGACCT AGAAAAAGAA CGGGATCAGG TTAAAAACCA ACTACTTTTG CAGGAAAAGG 1740 

AAAATGAATT ATCTTTGGCT TCTGTTAAGC AAAACTACGA AGCCCAGCTC AAGGCAGCTA 1800 

GTGAACAAGT CCAGTTTTAT AAGAATTTTA AGGCTCAACA ATCTACAAAA GCGATTGGGG 1860 

AAAGCCTAGA ACAGTATGCA GAGAGTGACT TTAACAAGGT TCCTACTTTC GCCTTTCCAA 1920 

ATGCTTACTT TGAGAAGGAT AACAAGGTCT CTTCGCGTGG GTCTAAAGGG GACTTTATCT 1980 

TCCGTGAGTG TGATGAAAAT GGAGTTGAAA TCATTTCTAT CATGTTTGAG ATGAAAAACG 2040 

AAGCGGACGG AACAGAGAAG AAGCACAAGA ATGCAGATTT TTACAAGGAA TTGGACAAGG 2100 

ACCGTCGGGA GAAGAACTCT GAGTATGCCG TTTTGGTGAC CATGCTTGAG GCTGATAATG 2160 

ACTACTTTAA CACAGGGATT GTTGACGTCA GTCACGAGTA TGAAAAAATG TATGTTGTTC 2220 

GTCCTCAATT CTTTATCCAA TTCATTCGTC TCTTACGTAA TGCGGCGCTA AATTCCCTAA 2280 

AATACAAGCA GGAGTTGGCC TTGGTTCGCG AGCAAAATAT TGACATTACG CATTTTGAGG 2340 

AAGATTTGGA TGCCTTTAAG CTAGCTTTTG CTAAGAACTA TAATTCAGCT TCGACTAACT 2400 

TTGGAAAAGC TATTGATGAA ATCGACAAGG CCATCAAACG CATGGAAGAG GTTAAGAAAT 2460 

TCCTGACCAC ATCTGAAAAC CAACTCCGTT TAGCTAACAA CAAATTGGAA GATGTCTCTG 2520 

TTAAAAAATT GACCCGGAAA AATCCAACAA TGAAAGCGAA GTTCGAAGCA CTGAAGGGGG 2580 

AGTAGAAAGC AAAAATGAAC GGTATTATTA ACTTAAAAAA GGAAGCAGGA ATGACCTCGC 2640 

ATGATGCGGT TTTTAAACTG CGTAAGATTT TGGGAACCAA GAAAATTGGT CATGGTGGAA 2700 

CCTTGGATCC GGATGTGGTG GGTGTTTTGC CGATTGCGGT TGGCAAGGCG ACACGCATGG 2760 

TCGAGTTTAT GCAGGACGAG GGTAAGATCT ATGAGGGGGA AATCACTCTG GGCTATTCCA 2 620 

CGAAGACTGA GGATGCTAGT GGGGAAGTGG TCGCAGAAAC CCCTGTTTTG TCTCTCTTGG 2880 

ATGAAAAGCT TGTTGATGAA GCGA7TGCTA GCTTGACTGG GCCTATTACT CAGATTCCCC 2940 

CTATGTATTC GGCAGTTAAG GTTAATGGTC GCAAGCTCTA TGAGTATGCG CGTGCTGGTC 3000 



WO 98/18931 



PCT/US97/19588 



1184 

AGGAAGTCGA GCGTCCAGAA CGTCAGGTGA CCATTTATCA ATTTGAGCGA ACAAGTCCGA 3060 

TTTCTTATGA TGGCCAACTT GCCCGATTCA CTTTTCGTGT AAAATGCAGT AAAGGGACGT 3120 

ACATCCGTAC TTTGTCACTT GATTTGGGTG AAAAGCTTGG TTATGCGGCT CATATGTCCC 3180 

ATTTGACTCG TACTAGTGCT GCTGGCTTAC AATTAGAAGA CGCTCTTGCC TTGGAGGAAA 3240 

TTGCTGAAAA AGTAGAGGCT GGGCAATTAG ATTTTCTCCA TCCTTTAGAG ATTGGGACAG 3300 

GTGACCTTGT CAAAGTTTTC CTAAGTCCAG AAGAGGCTAC AGAAGTTCGC TTTGGTCGTT 33 60 

TTATTGACCT AGACCAAACG GACAAAGAAC TGGCTGCCTT TGAAGATGAT AAATTGTTAG 3420 

CCATTCTAGA AAAACGGGGC AATCTCTATA AGCCAAGGAA GGTTTTTAGC TAGATCGTTT 34 80 

AGGAATAAAA ATCGGGTGAT AGATAACAAT TGCTTGATAA AACCCCATAC TAATAGTAGA 3 540 

ATGGTTTTGG GAATTATAAT ATTCCAATTG TTGCGAGTTG TAGGTACTCA AATAATCTAT 3 600 

ATAGAAATTT AGAGGTGTGA AATGAAGCAA TTTAAAATTC TTTCAGATAA ATATTTAGAG 3 660 

TCCATTACAG GTTCTGATGG GAACTTAGGC CCAGGATTTG CTCTGATAAT TCCATGATCC 3720 

GAAATGAGTT TCGAGAAAGG GTGGAGCAAC TTCTTCAACA AAAAGAAATA AATGAAAATA 3780 

GTGAGTTGAG TCACCTGTTT CGTCTTGCTA TACAAAATTT AGACAGAAAT GAAAAATACC 3840 

AATCGGTCAT GGCCAATTTG AGTCAAGGGT TGTCACTTTA CCTCATGACG CATCATTACC 3900 

AGGCACCTAA GTCTGTCATT GATTTT GG TT TATGGA 3936 



(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3230 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 

CATCCAGCAA CTGCTCCTCT GAGCGTTTCA AAATTGATGT AATTTTTCTA GTTTTTTCTA 60 

ATAAATGTGC CATTTTTCAC CTCGAATTTA ATCGCTATCA TTATAACATA AAAACGTCTC 120 

TTTTTCAATA ATTATCTGAA AATTCCTTAT TGACTTGCAT TGACTTACAA TTTAATTAAA 180 

AACCAGAATA TTTTTAATTA AATTGTTCCT TTTCTATTGA CAAGTTGCCT ATTTTTGTGT 240 

ATCATAATAT TATAAAAGAT AATATAATAA TTTTATTTGT CTTTTCACAT TCGGTCTCCT 300 

TATATAAAAA AGCGATTCAT TTTGAACCGC TTTTTCTTAT TTATCGCCTT TGTTACGAAT 360 

AACAAAGCCT GTTTGCTTTT CGCTTAAAGT ATTGCGTGGT TTTTTATTAT CCTTACGGTA 420 

ACGTTTTTCC TTATCAAAAC GATCGTTGCC ACGACTTCCT TTTTTGAACT CATCACGGCG 480 
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ACCATTCCCA CGCCGATCAC GCTCTCGACG GTCCTCCCCA CGACGCCCTC CACGACCTCC 540 

CTTAGCTTTA CCACCGAAAC CATTACCTGA TGGTTTAAAC GCTACTCGtT TTTCACGTGC 600 

AATCTCCACT TCTGGAAGGC TATCTGGGTC TTGGACTGTC AGACTCAAGA TATACATTGC 660 

CAATTCTTCT GGAGTAAACT CAGCAGCCAA TTTGCGACCA TCCTTACCAA ATTTCTCAAA 720 

GTTGGCACGA ATGGTTTCAT CTGCAAAATC ACGTTCGATT TTCTTGAGAG CTACCTGTTT 780 

TTTTGATTGG AAGGATTCTT CTACACTTGC AGGTTTGAGA CCTTTCATGC GTTTCTTAGT 840 

CAAGTTTTCA ATGATTTGAA GGTAACCCAT TTCGTTTGGA GCAACAAAAG TAATAGATTG 900 

ACCTGACTTA CCAGCACGAC CTGTACGACC GATACGGTGA ACATAACTCT CAGGATCTTG 960 

TGGAATATCG TAGTTGTAGA CATGGGTCAC ACCTGAAATA TCCAAACCAC GCGCTGCAAC 1020 

GTCTGTCGCA ACCAAAACAT CAAGATTGCC ATTTTTAAAC TCACGAAGGA CACGAAGACG 1080 

TTTGTTTTGG TCTAGCTCGC CATGAATTCC TTCTGCACGG AACCCACGAA TTTTCAAACC 1140 

ACGAGTCAAT TCATCCACAC GGCGTTTGGT ACGACCAAAT ACAATAGCGA GTTCTCGTTG 1200 

TGCCACATCC ATGAGACGAG TCATGGTGTC AAATTTTTCT TGTTCCTTAA CACGGATATA 12 60 

GTACTGGTCA ACCAATTCTG TTGTCAATTC CTTAGCCGCA ATCTTGACAT GTTCAGGGGC 1320 

TTTCATAAAC TGAACACCGA TACGTTTGAT GGCATCTGGC ATAGTTGCTG AGAAAAGCAA 1380 

AGTTTGACGG TTCTCAGGTA CACGGGAAAT AATGCCTTCG ATGTCTTCAA GGAAGCCCAT 1440 

GTTAACCATT TCATCCGCTT CGTCAAGGAT AAGGGTTTCA ATCTCTTGTA ATTTCAAGGC 1500 

CTTGCGTTTA ATCAAGTCCA AGAGGCGACC TGGAGTTCCC ACCACAATAT GGGCACCAGA 1560 

TTTAAGAGCC TTAATTTGTT TTTCAATGCT TGATCCGCCA TATACTGAAC GGACTTTGAC 1620 

TCCCTTACTA CGACCAAAGC GGAAGAGTTC TTCTTGACTT TGGACAGCTA GTTCACGAGT 1680 

TGGAGCGATG ACCAAGGCTT GGATAGTCGC TTCTTCTGTA CGGATTTTTT CAAGGGTAGG 1740 

CAAGCCAAAG GCTCCAGTTT TTCCTGTACC AGTCTGAGCT TGACCGATAA CATCCTTGTC 1800 

TTCAAGGGCC AAAGGAATAG TTTGTTCTTG GATAGGACTA GCTTCTACAA AACCAGCTTT 1860 

TTCAATTTCT GCTAGCAAAT CAGCAGACAA GTTTAATTCA TTAAATTTCA CGTTATTCTT 1920 

CTTTCTAAAG GTGGTGCGAA GCCACCCTAT AGGGCTTAGT TTATACTTTT CTTTTTATGA 1980 

CGTATTTTCA TATAACTAGA TATAAAATCG TGTTGCTTCT TTTCCACAAA AGAAAAGTAC 2040 

TGTTTTCTTT GCAACCTATC TAGTATAACA CAAGACCAGA GCAAAAGATA GCCCCATTTC 2100 

TACAGAAAAT CATGTAAGCG CTTTTTGACT TTCTTTTTTG ATTGAACGAC CTAGATAATA 2160 

AGACAAAGCC AAGGCGATAC TGTATAAAAT GAGAAAAACG AACAAGGTTT GTGTGTACGA 2220 
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ATGAGCCATT TTATAAGTCT CTGCTAATAA AATAGGTCCC GCTAAACCAG CCATTGCCCA 2280 

AGCTGTTAAA ATATAACCAT GCAGAGCGGC CAATTCCTTG GTTCCAAAAA TATCACTGAG 2340 

ATAAGCTGGA ATCAAAGAAA AACCAGCTCC ATAGCAAGTC ATCAAAATAG ACATAGCAAC 2400 

TACAAATAAA ACGGAATCTG TAAAGAGCCA AAGTGAGAGA GAAAAGAAAA GATTCACAAG 24 60 

CAGTAATATA CTAAAGGTTA GAGGGCGACC GATATAGTCA GACAAACTCG CCCAGAGCAA 2520 

GCGACCAAAT CCATTGAAAA TCCCCAAAAC ACCCACCATT ACTGCTGCAT GACTTGTAGA 2580 

CAAGCCAGCC ATCTCCTGTG CCATTGGCGA TGCCGCTGAA ATTAAGCCTA AACCACAAGC 2640 

TATGTTCATA AAGAAAATAA TCCAAAGCAT ATAAAACCGA TTGCTTTTTA GAGCCTGATT 2700 

TGCACCCATT CCTTGCGTCA AAGACCCTGT TTTTTCTTTC CCTGAAGAAG ATAAAATTCC 2760 

AAGCTCTTGC TCATTTCGAC GCTTAATGAA TTGTGAAGCT AGGAGCATGA TAATAAAGTA 2820 

ACTTGCTCCT AAAATATAAA AAGTTTCTAC AAGCCCTACC CCTGCGATGA GGTGTTGCGC 2880 

TATGGGACTA GTCAATAAAG AAGCAAAACC AAACCCCATA ATCGCTAAAC CTGTTGCGAG 2940 

ACCACGTTTA TCAGGAAACC ATTTTATAAT CGTCGACACA GGGGTAATAT AGCCTGCTCC 3000 

CAAACCAACC CCACCTAAAA TGCCATAAGC GAGATACAAC AACCACAGCT CTGACGGTCT 3060 

ATTGCAAATC CTGTTAAGAT ATTTCCACCT GCGTATAGAA AAGCAGATAG ACTTCCCATG 3120 

ACTTTCGGAC CAAATTTTTC TACCAAACGC CCCATAAATG CAGCCGATAA GCCCAAACAA 3180 

AAGATTGCTA GACTAAAGGC GAAGGCAACA GAAGCCTGAT CCCATCCCGT 3230 

(2 J INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5096 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 

CCTATGAAGA CTGTCCCAAC TGGGTGTCCT TCTAGGCTAT CTGGTCCTGC CACTCCAGTC 60 

AAACTAATTC CAAAATCAGA CTGGGTCTTG CTTCGTGCCT GCTCAGCCAT CTTCTGAGCT 120 

GTAAATTCAG ACACCACACC ATGTTCTTCC AAATTCTTGG CAGGAATATC CAACATCCTT 180 

GATTTTTCCT CCAAGCTATA GGTCACAAAA CCACCCTTAA ATATACTTGA AACTCCAGAA 240 

AAATTCGCCA CGGTAGCTTG GAAAAGACCT GCCGTCAAAC TCTCTGCAGC CGCGATGGTT 300 

TTCCCTTGCC TTTTCAGTTC TTCTACCACA ATGCTGGCTA AACTAGTTTC TTCCCCATAA 360 

CCATAGCAAA AGTCTCGTAA AGAAATTCCT TCGAAAGTCT GGCAGTCCAA GATTTGATTT 420 
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TCCAAGATAT CCAGCGCTTC ATTCCCCTCT TCTTGACTGC TAGCCTTTGT TGACAGACGT 480 

AGAGTGACTT CTCCTGTCTT GGCATAAGGG GCCAAGGTAG GATCGATCTC ATTATCAATT 540 

AAATCAGCCA AAATCGTAAC CAACTGGCTC TCGCCAATCC CAAAGAAACG AAGAACTCGG 600 

GAATACAGCT TGCTCCCTGT CATCAACTTG GGTAGAAGTT GGTTTAAGAC CATGCGTTTC 660 

AATTCACTTG GCGCACCTGG AAGGACGACA TAGGTCACTC CGTCTACTTC TAATTTTCCT 720 

-CCAACAGCCA-GTCCTGTrrC~GTtlKW'CAGT^ "7 a 0 ~ 

CTTTCGTTAT TCGGTGTTCG GGCATAGTCT GGTCGCAGGG TAAAAAAGAT ATCCAACTTC 840 

TCCTGAGCCT GAGGATCAAA GACTAATGCT TTCCCTAAAA ATTTAGCTAG G G TT TG TTTG 900 

GTTAGGTCGT CCTCAGTTGG CCCCAAACCG CCTGTCAAAA TCACCAGACT GCTACGTTGA 960 

CTGGCAATCT CAAGCAAAGA CAAGAGACGA ACTTCATTGT CTCCTACAGC CGTCTGAAAA 1020 

TATACATCTA CCCCAATCTC AGCTAGTTTT TCCGACAAAA ACTGGGCATT GGTGTTGACA 1080 

ATCTGCCCTG TCAAAATCTC TGTTCCAACA GCAATGATTT CTGCTTTCAT GTTTCCTCCT 1140 

ACCTATCTAT TCGTATTTTT TTGAAAAAAT CGCAGGAATT TTCCTACGAT TGATTTTTTT 1200 

ATTTGTATCA AAAGTTAATT ATCTTCATCA CCAACAGGTG CTCTCCCAAA TAAATCTTCA 1260 

AATAAAACCG CATTGGTTTC AAGCTGAGTA ACTTCTTCTT GTCCCAAAGA ACGTCGGAGT 132 0 

AGATTTTGCA TTTCCAACAT ATGTGCTCTC GAAACAATCT GGTAAGAAAC ACCTTGAAGT 1380 

ATCTCTCCTT CACCCTGCAA CTGCTGAGTT TCAATGGTTT TAAATGAATC TTTATAGCCT 1440 

AGCAAGTTAG GGATACTTTT TGCAGACAAA TCAATATTGG TCTGCATATT GTCACTCAAA 1500 

GCTTTTAGAA TCTCTTGATA ATGACCAATG CTATTTAAAC TGAGACCTTT TTCCATGACT 1560 

TTTTGAATAA CTTCACGTTG ACGTTTTTGA CGACCATAAT CCCCCTCAGG ATCTTGGTAA 1620 

CGCATTCGTG CATAGACTAG GGCTTCTTCT CCCCCAATAT GTTGCTCCCC AACACCGATA 1680 

GAAATAGTAT TAAATTCTTC TTGGTCACTG ATAGAAATTG GGAAACCTAG GATATTATTG 17 40 

ACTGTAATAC CTCCTACTGC ATCCACTAGT TTTTGCAATC CTCTCATATT GACCATCACA 1800 

TAGCGATCAA TATGGATATT CATCATTTTT TGAATGGTTT CTATAGCAAG CTCTGCTCCA 1860 

CCATCTGCAT ATGCTGAGTT CAGTTTCGCT TCATGAGCCT GACCATTCCC TGATTCAATG 1920 

CGCGTCAGAA TATCCCGCTC TAAACTCATC ATTGTTGTTT TTTTCGTTTT AGGATTCACT 1980 

GTCATCAAGA TCATGCTATC ACTTCTACCG ACCCAAGTTT CAGTTCGTTC AACATTTCCG 2040 

GTGTCCACTC CCATTAACAG AATGGTTAGA GGTTCAGTCG CTTCAATAAC CTTGGTTTCT 2100 

TCACCGATTT TTTTATAGGT TTTAGCTAAG GTTTCTGTCC CTTGTTGATA AATAGTATAA 2160 
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GCAAAAACAC 


CTACTCCTAC 


TACAGTTACA 


GAAAGTAAAG 


CTAGCACCAT 


TCCAATAATT 


2220 


TTTTTAACCA 


TATTTCTACT 


AACCTATCAG 


TTTACCCATC 


AAGTAAACAT 


CGATAAATTT 


2280 


CCCTTCTTCT 


ATATATGCCC 


CACGCTCTTG 


GCTACCTTCA 


ATGACAAAGC 


CATGCTTTTG 


2340 


ATAAAGATCG 


ACTGCTGCTT 


GATTACGAGT 


TTGGACAGTC 


AGTTGGAGAC 


GACGCAGAAT 


2400 


GCCACTTGCT 


TGTGCCCACT 


CTATCGCTTC 


TTCTAGCAAC 


AAACTTCCCA 


AGCCATTATT 


2460 


CCAAT ATCTT 


TTTCCAATCA 


CAATGAAGAG 


ATCTCCAATA 


TGACGGACTC 


TCTTACGCTG 


2S20 


ATCAGCTGTA 


ATATTTACAA 


TACCAGCAAT 


TTTGCCATTT 


AAGAATGCAA 


GTAAGGTTAT 


2580 


CTGATTGTCC 


GAACTAGCTT 


GCTTGTTGAG 


GAATATTTCC 


ATCTCCTCAC 


TAGTCAAGAG 


2640 


AATACCATCT 


CCGTCTAGGC 


TGGTAAAGTC 


TGTCTCCAAA 


CTCACACGAT 


TTAAAAAGGC 


2700 


CACTAATTCA 


GCTGCATCTT 


TGGGCTCTGC 


TTCCCTAATG 


AGCAATTCAT 


ACTCCATATT 


2760 


GAAGCTCCTC 


TAACAATTTC 


TCAGCACGCA 


AACCCTTTGC 


CTGAAAATTT 


AAACGGCGTC 


2820 


CATCTGCTTC 


TTTTACAATT 


TCCAATTCTA 


AATAAGCATC 


TGGCAAGGCA 


TCTCCTAAGA 


2880 


GATTTCCCCA 


CTCAATAACA 


GTCACGCCGC 


CACCAAAGAT 


AAACTCATCC 


AAGTCGATAG 


2940 


AATCAGCATC 


TCCTTCAATA 


CGATAAACAT 


CTAGGTGATA 


AAGTGGAAGT 


CGACCTTCAT 


3000 


ACTCTCTCAC 


GATAGTATAG 


GTGGGACTTT 


TAATCATTTG 


AGAAATCTGT 


AATCCTTTTG 


3060 


CAAGTCCTTT 


AGTAAAGGTC 


GTTTTACCTG 


CACCCAGTTC 


TCCAGTTAAG 


ATTAAAACAT 


3120 


CATTCTTTGC 


TAATAGATGG 


CCCAAACGCT 


CCCCTAAGGC 


TTGCAACTCT 


TCTTCATTTT 


3180 


TTGTGTACAT 


ACTCTTATTA 


TACCAAAAAC 


1TTTCTTTTG 


TGTCTATTTT 


CCTACTAAAC 


3240 


TTATCATCAT 


AACATCCATA 


AAAAACAGGC 


TTTCTCTAAA 


AGAAAATGAG 


CGTAACAATG 


3300 


ACCAATACAA GATCTCGGAA 


AATATGACCA 


TAAAAGGAAA 


CTTCCTTCTT 


AACCGAATTT 


3360 


GGGACAAGAT 


AGGCTGCAAA 


AAACAAGCCC 


AGTCCAATAT 


AAATCAGAAG 


TGAGACAATG 


3420 


GTCATTGGAT 


TTCTTAAGAA 


AAGAAGTGTT 


GCTAAAATAG 


TCACCAACAC 


TGTCTTTTTT 


3480 


CTGTCCAGCA 


TAGCAAGAAA 


ATCGCGCACG 


TATTTTTTCA 


AGGGTAAAAA 


AATCAGCAAA 


3540 


TCTAGCCCAA 


ATAGGAAAAA 


GAAGGATGGC 


AATAAAAAGT 


CAACTAATTC 


TTGCTGCAGC 


3600 


GTATTTTTGA 


TGAACAAGTT 


ATCTGACAAA 


ACAAGAACAG 


CTCCTAACAA 


ATTAATTAAG 


3660 


AGTAACATAC 


TGTAAAAAAG 


CTTCACCGAC 


TTCTTACTGG 


CTAGGACACT 


ATGGACTTCT 


3720 


TGCTTACGGG 


TATAAAGATA 


ATTTACTCCA 


GCACAGATTC 


CTGAAACGAA 


AACCATGCTT 


3780 


CCGATGAAAA 


AAGCTGTACT 


TTGTTTAAAG 


GACAAGATGC 


ATTCCTTCCA 


TAGGAAACAG 


3840 


CTACTCAAAC 


TGATTTGAAT 


TAAAGCTAAC 


AAAAATAAGA 


TTCTCATTGA 


TTTCATCTTC 


3900 


TCTCTCCCTT 


CCTACCAATC 


ATT AT ACT AG 


GAGAAAAGAG 


AGAACTGTTT 


CTAATCTTCT 


3960 
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CAAATCTCTC TTTAAGACGC TAAACAAACA CT AG AG ACT A ATACTCAATG AAAATCAAAG 4020 

ATCAAACTAC GTAGCTAGCC ACAGGTTGCT CAAAACAGTG TTTTGAGATT GCAGATAGAG 4080 

CTGACGTGAT TTGAAGAGAT TTTCGAAGAA TATAAATTTG AAATCATGAA AATCCGTCAA 4140 

ACGGGTGGTT GTTTTGTCTC GCACCTCACG GAGCGAGACG GACTCAGAGT CACATAATTA 4200 

TAAGGCTGAT AGTATTAATC TAACTATCAG CtTmCAGGTT ATTTAACGTT TCAGAAAAAC 4260 

-TATAATGTCA AGATTAACTA AACAGTATCT AGTTCCTTCA AATAATTTTC TATCTTCATC 4 320 

AACATTAAAG GATTGTTATA AATCTTACAT AACTCTCTTG CTTCTATATA ATAATTTTTG 4 380 

ACTTGTTCTC TGTCTAGAAA TTTGGCTCCA GCATTTCCTA CAAGAATAAG TAGAGGAGCC 4440 

AATTGGTAGC TTGTCTGTCT TTGTTTACAG AGTTCAATCC TTTCAAGAGC TTCTTCCATG 4 500 

GCTTCATTAT ATTTTTCCTT TGATACTAGG TAGTGAGCGT ACTTGTAACG AACTCTGATG 4 560 

TAGCCAAATA AAAACTCTTG ATGGTCCAAA TTTTTTGTCT GATACAACTC TATTAAATGA 4620 

GAGTAGTTTG CCTCATATTC TTGTTCACGA CCCACTAAGG AATAGAAATT AGATAGAGTA 4 680 

TTCAACGCCT TTAAATAAAT CAGACTATTT GAAGAGACTT TTAATAATAT ATTTTCCAAT 4740 

GACGAAATTG CCTCACACTT ACTGTCATAT TGATAGAACT CAATTATAGA TTTAATCCAT 4800 

TCAAGGTAAG TTCGGTCTTC TAATGTTAGA AAAGTGCTTC GTTCTACTTC TATTTTATAA 4860 

AGATATTCTA AATCGTCATA ATTTCTGTCA TCTAATAGGC GAGCAGATAG ATGTTTGAAA 4 920 

TTAGAGAGGT TAGACTTAAC TTCGATTTCT TCATTGAAAA AGTAATCCAA AGGGACTTCA 4980 

AGTCGTTGAG AGAGTTTGAA TAACAAGTCT GCGGAGGGAA TAAAATGACC TCTTTCAATT 5040 

TTACTAATCT GGCTTTGTTC ACAAATTCCT TCTGCAAGAG TTTGTTGGGA GAGTCT 5096 
(2) INFORMATION FOR SEQ 10 NO: 205: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 
ACAAGATAAA AATAAAGGAT TACAATGGGG AATATAAAGT AAACCGGTAA ACCTAAAAAC 60 

AAAGGAGAAA AGATGAAAAT TGTACTTGTA GGGCATGGAC ATTTTGCTAC AGGGATTTAT 120 

AGTTCTTTAC AATTGATTGC AGGTAATCAA GAAAATGTGG AGGCGATTGA CTTTGTGGAA 180 

GG AATGTCAG CAGATGAACT CAAGCAAAAA ATCTTACTTG CAATTTCAAA TGAAGAAGAA 240 
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GTTTTAATCC TAACTGATCT CTTGGGAGGA TCGCCATTCA AGGTTTCTTC TACCATAATG 3 00 

GG AGAAAATC CAGCCAAGAC AATGAATGTT CTCTCGGGTT TGAACTTAGC CATGTTAATG 3 60 

GAAGCAGTCT TTGCTAGAAT GGCTCATAGC TTTGATGAGG TTGTTAATAA ATCAGTAGTG 420 

GCGGCCCAGG GCGGAGTCGT AAATCGTAAA GAATTGTTTT CAACGGATGC AGAGGAAGAG 480 

GAAGAAGATT TCGAATCGGG TATTTAAAGG GTAAAAGAAT GATAAAAAAG GTTACGATTG 540 

AAAAAATAAA ATCGCCTGAG CGCTTCTTAG AAGTACCACT TCTGACGAAA GAAGAAGTCG 600 

GCCAGGCAAT CGATAAGGTT ATTCGGCAGT TAGAACTCAA CCTTGACTAT TTCAAGGAAG 660 

ATTTCCCGAC GCCAGCTACC TTTGATAATG TCTATCCAAT CATGGATAAC ACGGAATGGA 720 

CCAATGGTTT CTGGACAGGA GAACTGTGGT TGGCTTATGA ATACAGTCAA CAGGATGCAT 780 

TTAAAAACAT CGCTCATAAA AATGTTCTTT CTTTCCTGGA TCGTGTCAAT AAGAGAGTAG 840 

AATTGGATCA CCATGATCTC GGCTTCTTGT ACACACCGTC TTGTATGGCT GAATATAAGA 900 

TAAATGGAGA TGGAGAGGCT AGAGAAGCAA CCTTGAAAGC TGCAGATAAG TTGATTGAAC 960 

GCTATCAAGA AAAAGGTGGT TTTATTCAAG CTTGGGGAGA CTTGGGCAAG AAAGAGCATT 1020 

ACCGTTTGAT TATCGACTGC TTGCTCAATA TCCAACTCTT ATTCTTTGCT TATCAAGAAA 1080 

CAGGCGATCA AAAATACTAC GATATTGCAG AAAGCCATTT C7ATGCTTCA GCTAATAATG 1140 

TAATCCGTGA TGACGCTTCG TCCTTCCACA CCTTCTATTT TGATCCTGAG ACAGGTCAAC 1200 

CCTTTAAAGG TGTAACGAGA CAAGGGTATA GTGATGATTC ATGCTGGGCA CCTGGTCAAT 1260 

CATGGGGAGT CTATGGTATT CCTTTGACTT ATCGTCACTT AAAAGACGAG tCCTGCTTTG 1320 

ACTTGTTTAA CGGTGTCACC AATTATTTCT TGAATCGTCT GCCAAAAGAT CATGTGTCCT 13 80 

ATTGGGATTT GATTTTTAAT GATGGTAGTG ATCAATCACG AGATTCTTCA GCAACAGCTA 1440 

TCGCCCTCTG TGGGATTCAT GAAATGCTAA AACATCTCCC AGAGGTGGAT GCTGACAAAG 1500 

ATATTTATAA ACATGCTATG CATGCCATGC TTCGTTCCTT GATCGAACAT TATGCAAATG 1560 

ATCAATTTAC CCCTGGTGGG ACAAGTCTCC TCCACGGTGT GTACTCATGG CATTCAGGTA 1620 

AAGGAGTGGA TGAAGGCAAT ATCTGGGGTG ACTACTATTA CCTAGAAGCC CTTATCCGTT 1680 

TCTACAAAGA CTGGAACCTA TATTGGTAGG AGGAGAAATA TGACAATGCC AAATATTATT 1740 

ATGACCCGTA TCGATGAACG GTTGATTCAT GGACAAGGAC AACTTTGGGT AAAATACCTA 1800 

GGTTGTAATA CGGTCATTGT TGCCAATGAC GAAGTAAGCA CCGACAAGAT GCAACAAACT 1860 

CTGATGAAAA CAGTTGTGCC AGACTCAGTT GCCATGCGTT TCTTCCCTTT GCAAAAGGTG 1920 

ATTGATATCA TTCACAAGGC TAATCCTGCT CAAACGATCT TTATCGTTGT AAAGGATCTG 1980 

AAGGACGCTT TAACCTTGGT AGAAGGTGGT GTCACTATCA AAGAAATCAA TATTGGGAAC 2040 
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ATTCACAATC CCCCTGGTAA AGAGCAAGTG ACACGCTCCA TCTTCCTGCG TGAAGAGGAC 2100 

AAGGCGGCCC TCAAGGAATT GAGCCAAACT CATCAAGTAA CATTTAATAC GAAAACAACT 2160 

CCAACAGGAA ATGATGGAGC TGTTCAAGTC AACATTATGG ACTATATTTA ACAGAGGAGA 2220 

TCGTTATGTC GATTAATGTA TTTCAAGCGA TTTTAATTGG ATTATGGACA GCTTTCTGTT 2280 

TTAGTGGAAT GCTGTTAGGA ATTTACACCA ATAGATGTAT TGTTCTGTCA TTTGGTGTCG 2 340 



GAATTATTCT AGGTGATCTG TCATGCTCTT GCAATGGGAG CCAATGGTGA ATTGG 2395 

(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3342 base pairs 
IB) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

CCTTCTTTAG AGGTTAATTT TGCAAAATCG TCCATTGTTA TATAAGGATT ATTATAGAGA 60 

CTGTTCGCAA AGAATCTCTG ATATGTTTTT GAATCTTTTG AATACAAAAC TATCTCTCTA 120 

ATAGCATTGC CATCTGTTCC ATCAATTGGT AAACATACCG TAACTAGAAA AAGAATTATA 130 

TTCAAAATAA AAAATTCTGA TGCGTACGGC ACAAATCCCA AAAGTGCTAA TATTGCGACA 240 

ATTAGGTTAG CTCCACCTCC CCCAAAGAAG TAGAACACCA AATTCCTATC ACTATTTTTT 300 

TCATTAGTAA TGTTTCTATT ACTCATTTGA CAATAACCGA ATGCTAATAA CACTGGAAAT 360 

TTGAAATATA TTTTTTTTCT GAAATAGAAG AAAAAGGGAG TAGCAAGCAT CTCTAGTTTA 420 

TAAGATAAAC ATCTTCCCAC TAAAAAATGA CCTAGTTCAT GTAATGTAAT TGATATTAAC 480 

GAAATTAAAA TCAATCGAAA ATAATAGATT AATGAATCAT TTGGAAAAAT TATCAATAAT 540 

AGGAACAATA ACGGAATCAA ACATAAATAT ATGACAGAGT TATTTAATAT TTTCAACATA 600 

ATACCATTCC TCTAAACTAT TAGCTTCAAA AAGGCGTTTT TTCTCCCAAT ACATCTTCTC 660 

AAAATGTTCG GAATCATAAT TTTCTAAAAT TAATTTTAtG TCTGGTAAGC TCTTTCTTGA 720 

TAATCCGTTG TTTTGTACTT AATTTTCCCT TCAAGTACAT CTTCAATTTT ATAAGTTGCC 780 

TCCATCAACT GAGCCTCTGC AATATCTTTG AGTGAATTGG TAATTGAAAC TTGGTGTAAT 840 

ATCTGTCCts CCATATATGA AAATATATCT CTAAGATATT CTGACACATT ATCAGAGCCG 900 

TTACTCTCAG CAACATCTAA TGTTACAACA AACTTTCCAC CTAATCGAAA AAGATGGCTC 960 

CACCCCCCAA TCCTTTCAAT AAAGTTTTTT GTGTCCACAG ATACGTTTTG TAAATATACA 1020 
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GGAGAAGAGA TAATTATAAT ATCAGACTCT AATAACTCTT TTTTTATAAC ACCTCCATCA 1080 

TCACCATTAC TTTCCCTATC AATTCCTTTC TTAAACAACT CTTCTGAATC AGAATTAGAT 1140 

ATTTCTAGCT CTGAATTGAA AGGTGTCCTG AAAGATATAT CAACATTATT TCTACTAGAA 1200 

ATGATACTTG AAAGTCTCTT AGTATACTCT AAAGTCTTAG AGTTATGATT TCGCACTCCT 1260 

GCATATATAA ATATTTTATT CATTTTAATT CATCCTCTCA ATTTGAATTT AGTAGATTTT 1320 

TCAAGATAGT ATGGTACAAA AACAGACTTT TGTTGACTCA CATTATTACA TATGTTTTGT 1380 

ATTAAACCAA AATCAATACT ATTTTTGGAG TAATTTTGAT TTTAGTTTAA AATCATTTCT 1440 

ATAACAGTAG CATATACCTC AAGCCGTTTA GCAATTAGAA TAGAACTTTT CTTTATTATA 1500 

TTATTATCTC AACGAAAAGC TACACTATTA AAAATATTTT ATAGAATTAC ATATTAAACT 1560 

AGTCAATCTT GGTATTTTTA TATTGCTTAA TGAGTGGACA CCTCTATTTT AGAAACAAAA 1620 

CTATAAATTA AGCTAGATTT CAAGTAATGA GGGGATAACT ATCTTTTTGT CATTCTGATT 1680 

CAGTGCCATA TACCTTAAAA AAGTATAAGC AATACCAGTC ACACCTGTAT ACAAAGAAAA 1740 

ATCTGGGAAA TTGCTTGTTT GGACGATACG ATACTCTCCT TCTTTTGATT T ATT C ATT AC 1800 

AACACTACAC AATAAAGACT CCAATTCCAT ACTAGTATCC ATTTCTTTCA TGTAGTCGAT 1860 

GTAAAAATTT ATTATGGCCA TACTTCCATG GCAAAATGTA TCATTATCTA AACTAGCTAC 1920 

AATTCCCTCT GGAACACTTT GGGGATGATT AACTAATGTC CCAAATTCTC CACTACACCA 1980 

CTTCAAAGAA TGAATTTTGA TTTTCTCCCT AGGAACTAGT TGTAAAATTA ATTCTTTATA 2040 

TTTTTTAAGT CTTGTCACTT TATAAATATT TTTTAATGTA AAAATTACAC CTGATAGTCC 2100 

ATGGCCAAAA CTATATCCAA AATTACTATT ATCTCTCTCG CTTACATCAT TATATAGCGT 2160 

ATCACCTAAA CTTAATACTA GCCTTAGAAC ACGTTCCTTC TCTATTCCTC TCCTATAATA 2220 

TCTTACCAGT GTATTAATTA AAGGTAGAAG ACCATTAATA TAGTCAGACT TGTTTGAAAC 2280 

ACTTGCAAAA TCAGTCTTTT CAAGCTCAGT TAAAACACTC TTTATATAAT TTAAGCATGC 2 340 

GAGAGTATTT GTATCGTAAT CCTCTATAAT GGATAGAACA ATGAAATATC CTATATCCCC 2 400 

AGTTAAACCA AATGTGGTCT TAGATAAAGA AACAGATGGC GGAATTGCAG ATAAGATTTT 24 60 

ATTGTACAGT TGAGTATATG ATGATTTATC TTTCAATAAT TTTACATAGT ACATAAACAG 2S20 

TAATATTCCA GCTCTACCCC TATACATATC ATTmCCCGTT TCTTCAAGAC ACCATTTAGA 2580 

ACCTTTAAAA TTAACAGGTA TACTCCAAAT TGGATATTCG TCATAAATAT TATTAATAAC 2640 

CAAAGAGTCT GCAATATTTT CTACTTCATT ATGCAGAATA GTAACTAAAC TTTCATTTGG 2700 

GAGTTTTTTT CTATTAGATA AGTTTAATTT ATATCCTTTT TTTCGCTGAT CAAAGCTTGG 2760 

AAAATAAATT TCAATGATAT CAAGTTGCTT TTCTAAATTT TCCAAATTAT TATTAGGTAA 2 820 
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ATATTTCATA AAATAGTCAT ATCCAGAAAA TTGATGTAGG GAAATAAAAT GATTTCCAAA 2380 

ATCATCGTAG ATTTCATTGA TATTTGTATC TGTATAAAAA ATCGGAATAT CTAATAACCT 2940 

CATTTGTTCA CATTCGCTTG CTACAATACC TTGATTAGAA AACTTATTGC TCCAGAGATT 3000 

TTCCAATGCT TTTTCTCTAT CTAACATTTC TTCATAAAAA TCAGGATGAT ATAAAAAAGA 3060 

TAGTACTGAA GCATAGCTAT TTGTGTCTCT AAAAAGTACC CTTGTCTTTA AACCATACAA 3120 



GTTTGCTTTT AATAGCATTT ^TAAATTCTTC TGTTTTATTT - AACTCTTCAA ATATCAGATA" 3180 

AAAATCCCTA AAACCTTTTT TGAAATCTTT TATATACTTA TCAAATTCTA TATCACCATC 3240 

CCGAACAGGC AGGTTTTTCC CACCTTCAAA ATCAATTTTC CCAATATCAA ACTTTACCTT 3300 

ATCACTATTT AAATTAATTA AAACTTGACC AGGGATCCTC TA 3342 

(2) INFORMATION FOR SEQ ID NO: 207: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3454 base pairs 
{ 8) TYPE: nucleic acid 

(C) STRANOEONESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

CAGAAAAGAA TCTTAAAGAA AAATGATATT GTAGAAGTTG AAATTGTTGA TTTGACCCAT 60 

GAAGGGGCAG GAGTTGCCAA GGTAGATGGT TTGGTCTTTT TTGTAGAGAA TGCTTTACCG 120 

AGTGAAAAAA TTCTCATGCG TGTCCTCAAG GTCAATAAAA AGATTGGCTT TGGAAAAGTT 180 

GAAAAATACC TTGTCCAGTC ACCACACCGT AATCAAGATC TAGATTTGGC TTACCTGCGT 240 

TCAGGAATCG CCGATTTAGG ACACCTTTCT TATCCAGAAC AGCTCAAGTT TAAAACCAAG 300 

CAAGTCAAGG ACAGTCTCTA CAAGATTGCT GGAATTGCAG ATGTAGAAGT TGCTGAAACG 3 60 

CTTGGTATGG AACATCCAGT CAAGTATCGC AATAAGG CGC AGGTCCCCGT TCGTCGAG^G 420 

AATGGTGTCT TGGAAACAGG ATTTTTCCGT AAGAATTCGC ATAACCTCAT GCCCCTTGAA 4 80 

GATTTCTTTA TCCAGGATCC TGTCATTGAC CAAGTCGTAG TAGCTCTTCG AGACCTGCTC 540 

CGTCGTTTTG ATTTAAAACC TTATGACGAA AAGGAACAGT CTGGATTGAT TCGGAATCTT 600 

GTGGTGCGTC GTGGTCACTA TTCAGGACAA ATCATGGTCG TTTTGGTGAC AACTCGTCCA 660 

AAAGTTTTTC G TGTTGACCA ATTGATTGAA CAAGTTATCA AGCAGTTCCC AGAGATTGTG 720 

TCTGTCATGC AAAATATCAA CGACCAGAAT ACCAATGCGA TTTTTGGTAA GGAGTGGCGC 780 

ACTCTTTATG GTCAAGACTA TATTACGGAC CAGATGTTGG GAAATGACTT CCAAATCGCT 840 
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GGCCCACCCT TTTACCAAGT CAATACTGAA ATGGCCGACA AACTCTATCA AACAGCCATT 900 

GACTTTGCAG AGTTAAAAAA AGATGATGTG ATTATTGATG CCTATTCTGG TATTGGAACC 960 

ATTGGTTTAT CAGTCGCCAA CCATGTCAAA GAAGTCTACG GTGTTGAACT GATTCCAGAA 1020 

GCAGTAGAGA ATAGCCAGAA GAATGCTTCT TTGAACAAGA TTACTAATGC CCACTATGTC 1080 

TGTGACACGG CTGAAAATGC CATGAAGAAA TGGCTCAAGG AAGGTATTCA ACCAACCGTT 1140 

ATCTTGGTTG ATCCTCCACG CAAGGGCTTG ACAGAAAGCT TTATCAAAGC AAGCGCCCAA 1200 

ACAGGAGCCG ATCGCATCGC CTATATCTCC TGCAATGTCG CAACCATGGC GCGTGATATT 1260 

AAACTATACC AAGAGTTGGG ATATGAATTG AAGAAAGTCC AGCCGGTGGA TCTATTTCCT 1320 

CAAACGCATC ACGTCGAGAC GGTAGCACTT TTGTCCAAAC TCGATGTCGA TAAGCACATA 1380 

AGTGTTGAAA TTGAGCTGGA TGAGATGGAT TTGACAAGTG CGGAGAGCAA AGCAACATAT 1440 

GCTCAAATCA AAGAATATGT TTGGAATAAA TTTGAATTAA AAGTTTCGAC ATTATATATT 1500 

GCACAGATAA AAAAGAAATG TGGAATAGAA TTACGAGAAC ATTACAACAA GTCTAAAAAG 1560 

GATAAACAAA TTATTCCACA GTGTACACCT GAAAAAGAAG AAGCCATCAT GGATGCTTTG 1620 

AGACACTTCA AAATGATTTA ATAGAAAAGA ATGACAGTAT ATGACTTTCT GCATTTATTA 1680 

CATTCCTACT TGGTATAGGA ACAGCTATTA TTCCTTTCTT GCAAGGTATC AATTAGAAAA 1740 

TAGGCTCAAT ATAAAGATTG ATAGGATCAT TTTTATATTT AAAGGAGCGT TGAAATGATT 1800 

GATAAAGGCA ACAAAAAATT TTAGGATAAA TTTGCTAAGT TGTATGCCTC TTTTATGAAA 1860 

AAAGATAAAG AGGTTTATGA TAAAGTTTGT GAATATCTTA GTCCTCATTT GAATAAAGAT 1920 

ATGGAGGTGC TTGAACTTGC TTGTTGGTTT CGTGTCATAA CAGTTATAGA GGCAAATACT 1980 

TATGTAAATA TAAGGAGTTC AAGACTTCTA CCAAAGTTTA AAACTCAAAA AATAAATAGT 2040 

TGGTGTGCTG CTTACAATAT CCATTTTAAT AATGGATATT GTAAGCAGCA CCCCcAtGAA 2100 

TTTAAAGATT CTTTAAAGAG TCTTATTTTG TGATGAAAAT TTAATATGTA AATCTCAGAC 2160 

GATAGAAATT AAAAACTCTA TCGTCTTTTT TATACTCAAA ATTAGGAGGT AAAAATGGTA 2220 

AGGATAAGAG GTCCCACTTA AAACAATTTA TGGCAAAATA AGGACGGAAT AACACAACAA 2280 

ATTCTCTAAA ACAAATCACT AAATCAATGT AAGATTGAAT GAAATCAATA TTTATGCTAT 2340 

AATTAAATAA ATTTAATGAA GAAAAAAAGA GGGATATTAT GGCACTTAAC TATAAACCAT 2400 

TATGGATACA GTTAGCAAAA AAAGGACTAA AGAAAACAGA TGTAATAGCT ATGGCAGGAC 2460 

TTACAACAAA TGTTATGGCA CAAATGGGAA AGGATAAACC AATTACATTT AAGAATTTAG 2520 

AAAGAATATG TAAGGCTTTA TCTTGCACTC CTAATGATAT TATTAGTTTT GAAGATAATT 2580 

TTAGTGACGA GGAATAGAAA ATGACTTTAA GGACAGAAGA TCAAGTTAGG GATTATGCAA 2 640 
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GAGAAGTATA GCCTTTAATG AACTTGAAGA AAACATCAAT CAAGGTACTG GTCAAATAAC 2700 

TACTTTTAAT CAATTAGGCT TCAAGGGATA TTCAAATAAG CCACATCCTT GGTATTTACC 2760 

TAAAAATATG AATGATGTAG CAATAATCCT TGAAACAAAA TCAGAAGAAA GAGATATTAG 2820 

CAAACAAATT TTTATTGATG AGTTAATGAA AAATATAGAC ATAATTTAAC TAAAAATAAA 2 980 

AACTAGATCC TTTTTTGAAA AAATTATATT ATTAAATTTG TAACTGTATC TATTGACAAT 2940 

---GATAATTAW-ATCGATACAA-TAGACT^ 3000 ~ " 

TTTTTTCTAA TmGCTATTTT AGCTATGTCT ATAGTTTTTA GCGCTTGTTC TTCTAATTCT 3060 

GTTAAAAATG AAGAAAATAC TTCTAAAGAG CATGCGCCTG ATAAAATAGT TTTAGATCAT 3120 

GCrrTCGGTC AAACTATATT AGATAAAAAA CCTGAAAGAG TTGCAACTAT TGCTTGGGGA 3180 

AATCATGATG TAGCATTAGC TTTAGGAATA GTTCCTGTTG GATTTTCAAA AGCAAATTAC 3 240 

GGTGTAAGTG CTGATAAAGG AGTTTTACCA TGGACAGAAG AAAAAATCAA AGAACTAAAT 3 300 

GGTAAAGCTA ACCTATTTGA CGATTTGGAT GGACTTAACT TTGAAGCAAT ATCAAATTCT 3360 

AAACCAGATG TTATCTTACC AGGTTATTCT GGTATAACTA AAGAAGATTA TGACACTCTA 3420 ' 

TCAAAAATTG CTCCTGTAGC AGCATACAAA TCTG 3454 ~ 
(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3752 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 

(D) TOPOLOGY: linear : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 

CGGGAGTATA CTTAATATAA TTATAGTCTA AAAATGACTA TCAGAAAAGA GGTAAATTTA 60 

GATGAATAAC AAAAAAATGA TTTTAACAAG TCTACCCAGC GTCGCTATCT TAGGGGCTGG 120 

TTTTGTTACG TCTCAGCCTA CTTTTGTAAG AGCAGAAGAA TCTCCACAAG TTGTCGAAAA 180 

ATCTTCATTA GAGAAGAAAT ATGAGGAAGC AAAAGCAAAA GCTGATACTC CCAAGAAAGA 240 

TTACGAAACG GCTAAAAAGA AAGCAGAAGA CGCTCAGAAA AAGTATGAAG ATGATCAGAA 300 

GAGAACTGAG GAGAAAGCTC GAAAAGAAGC AGAAGCATCT CAAAAATTGA ATGATGTGGC 360 

GCTTGTTGTT CAAAATGCAT ATAAAGAGTA CCGAGAAGTT CAAAATCAAC GTAGTAAATA 420 

TAAATCTGAC CCTGAATATC AGAAAAAATT AACAGAGGTC GACTCTAAAA TAGAGAAGGC 4 80 

TAGGAAAGAG CAACAGGACT TGCAAAATAA ATTTAATGAA GTAAGAGCAG TTGTAGTTCC 540 
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TGAACCAAAT GCGTTGGCTG AGACTAAGAA AAAAGCACAA CAAGCTAAAG CACAAGAAAA 600 

AGTAGCTAAG AGAAAATATG ATTATGCAAC TCTAAAGGTA GCACTACCGA AGAAAGAAGT 660 

AGAGGCTAAG GAACTTGAAA TTGAAAAACT TCAATATGAA ATTTCTACTT TGGAACAAGA 720 

AGTTGCTACT GCTCAACATC AAGTAGATAA TTTGAAAAAA CTTCTTGCTG GTGCGGATCC 780 

TGATGATGGC ACAGAAGTTA TAGAAGCTAA ATTAAAAAAA G9AGAAGCTG AGCTAAACGC 840 

TAAACAAGCT GAGTTAGCAA AAAAACAAAC AGAACTTGAA AAACTTCTTG ACAGCCTTGA 900 

TCCTGAAGGT AAGACTCAGG ATGAATTAGA TAAAGAAGCA GAAGAAGCTG AGTTGGATAA 960 

AAAAG CTG AT GAACTTCAAA ATAAAGTTGC TGATTTAGAA AAAGAAATTA GTAACCTTGA 1020 

AATATTACTT GGAGGGGCTG ATCCTGAAGA TGATACTGCT CCTCTTCAAA ATAAATTAGC 1080 

TGCTAAAAAA GCTGAGTTAG CAAAAAAACA AACAGAACTT GAAAAACTTC TTGACAGCCT 1140 

TGATCCTGAA GGTAAGACTC AGGATGAATT AGATAAAGAA GCAGAAGAAG CTGAGTTGGA 1200 

TAAAAAAGCT GATGAACTTC AAAATAAAGT TGCTGATTTA GAAAAAGAAA TTAGTAACCT 1260 

TGAAATATTA CTTGGAGGGG CTGATTCTGA AGATGATACT GCTGCTCTTC AAAATAAATT 1320 

AG CT ACT AAA AAAGCTGAAT TGGAAAAAAC TCAAAAAGAA TTAGATCCAG CTCTTAATGA 1380 

GTTAGGCCCT GATGGAGATG AAGAAGAAAC TCCAGCGCCG GCTCCTCAAC CAGAGCAACC 1440 

AGCTCCTGCA CCAAAACCAG AGCAACCAGC TCCACCTCCA AAACCAGAGC AACCAGCTCC 1500 

TGCACCAAAA CCAGAGCAAC CAGCTCCAGC TCCAAAACCA GAGCAACCAG CTCCAGCTCC 1560 

AAAACCAGAG CAACCAGCTA AGCCGGAGAA ACCAGCTGAA GAGCCTACTC AACCAGAAAA 1620 

ACCAGCCACT CCAAAAACAG GCTGGAAACA AGAAAACGGT ATGTGGTATT TCTACAATAC 1680 

TGATGGTTCA ATGGCAATAG GTTGGCTCCA AAACAACGGT TCATGGTACT ACCTAAACGC 1740 

TAACGGCGCT ATGGCAACAG GTTGGGTGAA AGATGGAGAT ACCTGGTACT ATCTTGAAGC 1800 

ATCAGGTGCT ATGAAAGCAA GCCAATGGTT CAAAGTATCA GATAAATGGT ACTATGTCAA 1860 

CAGCAATGGC GCTATGGCGA CAGGCTGGCT CCAATACAAT GGCTCATGGT ACTACCTCAA 1920 

CGCTAATGGT GATATGGCGA CAGGATGGCT CCAATACAAC GGTTCATGGT ATTACCTCAA 1980 

CGCTAATGGT GATATGGCGA CAGGATGGGC TAAAGTCAAC GGTTCATGGT ACTACCTAAA 2040 

CGCTAACGGT GCTATGGCTA CAGGTTGGGC TAAAGTCAAC GGTTCATGGT ACTACCTAAA 2100 

CGCTAACGGT TCAATGGCAA CAGGTTGGGT GAAAGATGGA GATACCTGGT ACTATCTTGA 2160 

AGCATCAGGT GCTATGAAAG CAAGCCAATG GTTCAAAGTA TCAGATAAAT GGTACTATGT 2220 

CAATGGCTTA GGTGCCCTTG CAGTCAACAC AACTGTAGAT GGCTATAAAG TCAATGGCAA 2280 

TGGTGAATGG GTTTAAGCCG ATTAAATTAA ATCATGTTAA GAACATTTGA CATTTTAATT 2340 
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TTGAAACAAA GATAAGGTTC GATTGAATAG ATTTATCTTC GTATTCTTTA GGTACCTATC 2400 

TTATGATTTC AGGAAATGTC ATTAAAAAAA CGACTCATTT TCTCTAACCT GAAAAATAGA 24 60 

TTAGAGAAAA TGGGTTGTTT TATCTATTAT AGTTATTTGA ATGAAGmTAA GAAGAAGGTA 2520 

TACTCACATC ATTCACATAA TCTGTATATT GACTATAAGT TTTAAAAAAC AATTTTTAAG 2580 

CTCTTCCTTG TCTTCTCTAA CCAAGCGTGT TATAATGAAT ACTGCTCAAG CGACCTTCAA 2 640 

TCGTGAACCA- CACACGACCT-TCAATCCTGA- ATAAACGAAT-AGATGGGAGA -CTTACCATGA " 2700 

GTGATAACTC TAAAACACGT GTTGTCGTGG GGATGAGTGG TGGTGTTGAT TCGTCGGTGA 2760 

CGGCTCTTTT GCTCAAGGAG CACCGCTACG ATGTGATCGG TATCTTCATG AAGAACTGGG 2 820 

ATGACACAGA TGAAAACGGC GTCTGTACGG CGACCGAAGA TTACAAGGAT GTGGTTGCGG 2880 

TGGCAGACCA GATTGGCATT CCCTACTACT CTGTCAATTT TGAAAAAGAG TACTGGGACC 2940 

GCGTTTTTGA GTATTTCCTA GCGGAATACC GTGCAGGGCG CACGCCAAAT CCGGACGTTA 3000 

TGTGCAACAA GGAAATCAAG TTCAAGGCCT TTTTGGACTA TGCCATAACC TTGGCGGCAC 3060 

ACTATGTAGC GACTGGGCAT TATGCTCGAG TGCCGCGTGA TGAGGATGGT ACCGTTCACA 3120 

TGCTTCGTGG CGTGGACAAT GGCAAGGATC AGACCTATTT CCTCAGCCAA CTTTCGCAAG 3180 

AACAACTTCA AAAAACCATG TTCCCACTAG GACATTTGGA AAAGCCTGAA GTACGCAGAC 3240 

TAGCAGAAGA AGCAGGCCTT TCGACTGCTA AGAAGAAAGA CTCGACAGGG ATTTGCTTTA 3300 

TCGGAGAAAA GAACTTTAAA AACTTTCTCA GCAACTACCT GCCAGCTCAG CCTGGTCGCA 3360 

TGATGACTGT GGATGGTCGC GATATGGGCG AGCATGCAGG TCTTATGTAC TATACAATCG 3420 

GTCAGCGTGG CGGACTCGGT ATCGGTGGGC AACACGGCGG TCACAATGCC CCTTGGTTCG 3480 

TTGTCGGAAA AGATCTAAGC AAGAATATTC TCTATGTAGG ACAAGGATTC TACCATGATT 3540 

CGCTCATGTC AACTAGCCTA GAAGCCAGTC AAGTCCACTT TACTCGTGAA ATGCCAGAAG 3 600 

AGTTTACGCT AGAATGTACG GCTAAATTCC GTTACCGTCA GCCTGACTCT AAGGTGACCC 3660 

TTCATCTCAA AGGAGAAAAG ACAGAGGTCA TCTTTGCGGA ACCACAACGC GCGATTACAC 3720 

CAGGACAGGC AGTTGTCTTT TACGATGGCG GG 37 52 

(2) INFORMATION FOR SEQ 10 NO: 209: 

(i) , SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3530 base pairs 
CB) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 

TATTTATATT TTTTTATCTC TGGCATACTT TCATACCTTT TTAGACTTAA ACTCTTTAAT 60 

AGTGCCTTTC CACCTCTTTT TATCTATAAA GATTCTCCTA CATCATAATT CATTTTTTTA 120 

TTTAAACCTT TCTGTCTTAG TTTGTCTTTA TCTTCTTCAT ACCATTTTAA GATTGTCACA 180 

TAGTGGTTTT GATAGCTCTT ACCACTGCTT TCCATGTATC TGGATAGTTT ATTTATCATT 240 

ATATCTGTGT GTGAGTTTAA TTTTTCTTTT AGATTTTTAT ATTCTTCTTT GCTTAACCTT 300 

ACATTTTTGA ATTCTCCATA AAAAATGGGG GTGGACTTTT TATCTATCTC TCCCTCTCTC 360 

TCTTTATCTA TCTCTATATC TTTCCATGTA ATTCCAATCT GGAGTACCTC TACTGTCTAT 420 

CGGTAATTTA ATTTTCATAT CTGGCAATAC TGTGCTAGAT ATTTGATCTT TATATTCAGT 480 
ATTTTTTAAA GCTTGCCTAA TAATTGAAGT TAAATAGAAT GCTACTTCTT TATTCAATTC . 540 

TTTATTTTTT AATTTTAAAC AATGAATTTT CATATCTAGG CTTGCTTTAT ATTTATGATA 600 

AAAGACTGCT CCTAAAAATG AAACAGATAT AAAATTTTCA AAAACTCTAT AATTTTTATC 660 

ATCTATATCT TCGTAGTAAC CTAAGATACC ATTGTCAATA TTTGTAGCAC TAATTCTAGG 720 

AGTTTTTCCA TCGAGTAAAT ATCTTTTTGG AATAGATGAG CCTGTTGGTA CTTAACTCGA 780 

TTTCCCCTTT TTTTCGGTAA TAAATATTTC TTTTTATTTT CTTCTCTCAT ATTTTTCCTA 840 

CCTGTCCTTT GTACGATGAG TATTTTCTAG ATTTTCyTGA ATAACTTTTT ACTTGAAGTT 900 

TTAGCTTTTG AACTAGTCGT TGTACTTTCT TTTTGTTTAT TATCAGTCCT GATCTTTTTA 960 

ATATTGCTGT TATTCTCTAT ATCCTATTTT TCATTCATGA TATTCTTTTA CTAATTTTAT 1020 

CTTAAATTCT GTGCTGTATT TGCCATTAAA AAACTGACCT CCTTTAGTTA CTTTTTTGGC 1080 

CTAACTTTTG AGGGTCAGTT CAAAATTTGC GACTTTTAAA TGAATTCCAA TATTCAATTA 1140 

TTAAGAGTTA ACATGGTGCT TGCCAATAGG AATCATTAGA GGCGAATTGG AAATAGGGTC 1200 

ACGTATAATT TTTGCTTCAA GATTAAAGAT ATCTTTAACT AGTTTATCAT TTAGTATATC 1260 

TTCAGGCTTT CCCTCTGCAA CAAGTTTACC TTCTTTAATT GCAAATAGGT AATCAGCGTA 1320 

TCTTGCTGTT AGATTTATAT CGTGCAAAAT CATGCAAATG GTTGTCTTAT ATTTTTGGTT 1380 

TAGATCAGTC AAGAGGTCTA ATAGTTCTAT TTGATATGAG ATATCCAAGT AAGTAGTTGG 1440 

CTCATCTAAA AGTAGGATAC TTGTATCTTG GGCTAGGGCT AGAGCTATCC ATACTCTTTG 1500 

CCTTTGACCC CCAGAAAGTT CTTCAACTAG GTTATTTGCT AGATCTTCAA CATTGGCCTT 1560 

AACCATTGAT CTGTTTATTA TTTCAAGGTC ATCTTTTCCA AGACTCTTAA AAGGCTTTCT 1620 

GTAGGGGAAA CGACCACGGC TTACAAGATC AGCTACTGTT ATTGATTCAG GGATTATTGG 1680 

AGATTGAGGT AATATAGCTA TGTGTTTTGC TAAATCTTTT TCTTTATAAG AATTAATTGA 1740 
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1TTATTATCA AGCAATACTT CTCCCTCTAA TCGCTTTATA AGTCGAGACA AG G TTTTAAT 1800 

GAGTGTTGAT TTCCCACAAC CATTTGACCC AATAATAACT GATATTTTTT CTTCAGGTAT 1860 

TTTTATATTT ATATTTTCCA AGATTATTTT TTCATCATAA CCGCAGGTAA GATTATTTGA 1920 

CCACAGACCT TTCATTATAT ATTCCTCCTG TTCATTTTTA TTAGTAAGTA TATTAAGTAT 1980 

GGTGAACCTA ACAAGCCAGT TACAACACCT ACTGGATATC TAGCTGGTAA AATATTTTGA 2040 

GAGAATATGT CTGATAACAA 'AACTAGTAAA" ATTCCAACCA ATCCAGCTAA TATTGGGCTT 2100 

CTTTTCTTGC CAATATTTAA GGCTATGGGA CCACCTAAAA AAGATATACA AGCTATTGGT 2160 

CCTGTAATTG AAGTAGAAAA AGCAGTTAAA GATACAGCGC AAAAAATTAA AACAAGCCTT 2220 

GAAAGCTCGG GATTTGCTCC AAGTCCGATT GCTATTTCTT CACCAAGTTC AATAATTTCT 2280 

AGTCTTTTAT TAAAAAATAA AACTAATATA GTAGCAATAA TACTTACTAT TAGAACAAGA 2 340 

GGTATGTCAT CTAACTTTGT AAAAGATAAA GAGCCACTGA GCCATCTCAT AACTTCTTCT 2400 

AATTCATATC TTGCTACTTT CAACAATAAA AATGAGGTGC CTGCTCTTGT GACAGCTTGA 2460 

AAACCAATAC CTAATATTAT CAGTCTTGCT GCTGAAAAAC CATCTTTTTT AGCTAGTAAA 2S20 

AATAATATTA AAGATGATGT TAGTCCACAA GTTATTGAAA TAATTCCAGT AGTTAAACTA 2580 

TTTGTTTTTA ATACCAATAT GCAAAAGACC GCTGCAATAG ATGAAGAACT TGTGACACCG 2640 

ATTATATCAG GACTTGCAAG AGGATTTCTT AACATAGTTT GAAAGATAAA TCCTGCCAAT 2700 

CCAAAAGACC AGCCAGCTAT AATTCCTGCT AATAATTTTG GTAATCTAAT TTCCATAATC 2760 

GAAAAACTAG CTCCAGGAAC AGTTTCACTA TTTAAGACTT TAATCAAAGT TGAAAAAGAA 2820 

TAACTTTCAT CTCCGATAAG TAAAATGAAA AATGATAGAC TGATTATTAT TAATAAAAAT 2880 

AGTGAGGAAA ATAGTGTTAT TCTATTTTTT CTTTTTTGAA TACCTATAAT TAAATTTTGC 2940 

ATTAGTTATT AACCCCTCTA TTTTTCATAG TTACATAAAT AAGTACTGGA CCCCCGATTA 3000 

TTGCAGTAAT TATCCCTACT TCAATTTCAC CTGGTTTACC TAACATACGG CCGATTATAT 3060 

CACATATAAG CAAGAGCTCT GCACCTATAA AAGATGAAGA AATGGTCATT GTGCGTATAT 312 0 

CTTTGCTTAT AAATAAGCCA CAAAAGTGAG GAACTATAAG ACCTACGAAG CCAATAGGTC 3180 

CACCAATTGC AGTAATACTT GAACATAAAA GCACACTTGC AATTATTGCA AGTGATCTTA 3240 

TCCTATTAAC ATTAACTCCA AGACCAACAG CCATTTCATC ACCCATAGcT AAAGCGTTTA 3 300 

AATCTGATGA AATAAATATA GCTATCAAGT GACCTAAAAT TATAAAAGGT AGTAGTGTAG 33 60 

ATATAGAAGA TAATGTAGCT GCTCCAAGGC TACCTATTTG CCAAAATCTA AATTTGTCTA 3420 

AGACGTTATT ATTCGGTAAA ATTAAAAAAC TTACAAAACT GCTTAAAGCC ATACTAACAC 3480 
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AACTTCCTGA TAAGGCAAGT TTTATAGGGG TAAGGCCTGC TTTTCCGTTA CAGCAATCGC 3540 

GTATACAAAA ATTGCACTTA CTAAGCCACC AATGATTGCG 3580 

(2) INFORMATION FOR SEQ ID NO: 210: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11378 base pairs 
(8) TYPE: nucleic acid 

(C) STRANOEDNESS : double 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 



CCAAATTGCT 


CCACAATTAT 


TATGGAGTCG 


TCGTTTGGCA 


GATGGGCGTG 


ATATGTGTGC 


60 


TCAAGAATGG 


TTGACAGGCA 


AGATATTGAC 


CCCCTATGAT 


ATG AATCGT A 


AGCAAATCGT 


120 


CAATATTTTA 


ACCCGTCTTC 


ATCGCTCACG 


TCCGTTGATG 


ACACAATTGA 


GTCGTTTGGG 


180 


CTATGCCATG 


GAAACACCTG 


TAGATTTACT 


ACAGTCTTGG 


CAGGAAACGG 


CTCCAGATGC 


240 


TTTGCGTAAA 


AATCATTTTA 


TCAGTGAAGT 


GATGGCTGAT 


TTACGTCAGA 


CTATTCCAGG 


300 


ATTTAGAGAG 


GACCATGCGA 


CCATTGTCCA 


TGGAGATGTA 


CGACATAGTA 


ATTGGATTGA 


360 


GACAGATAGT 


GGCTTGATTT 


ATTTAGTAGA 


TTGGGATTCG 


GTTCGCTTGA 


CCGATCGCAT 


420 


GTTTGATGTG 


GCCCATATGC 


TCTGCCATTA 


TATTTCAGAA 


CATCAGTGGA 


AGGAATGGTT 


480 


GACCTACTAC 


GGTTACAAGT 


ACAATCAAAC 


GGTATTAAGT 


AAATTGTATT 


GGTATGGTCA 


540 


ATTGTCTTAT 


TTGAGTCAGA 


TTTCCAAGTA 


TTATATGAAC 


CAAGATTTAG 


AAAATGTCAA 


600 


TCGGGAGATT 


CATGGTTTGC 


GTCATTTCCG 


AGACAAGTAT 


GGAAAGAGAA 


GATGAGAGTT 


660 


AGAAATCGTA 


AAGGGGCAAC 


AGAATTACTA 


GAGGCAAATC 


CCCAGTATGT 


GGTCCTCAAT 


720 


CCCTTGGAAG 


CCAAGGCAAA 


ATGGCGCGAC 


TTGTTTGGCA 


ATGATAATCC 


CATTCATGTG 


780 


GAAGTTGGAA 


GTGGAAAGGG 


TGCCTTTGTT 


TCAGGTATGG 


CCAAGCAAAA 


CCCTGACATC 


840 


AACTATATCG 


GGATTGATAT 


TCAAAAGTCT 


GTTTTGAGCT 


ACGCTTTGGA 


CAAGGTGCTT 


900 


GAAGTTGGAG 


TGCCTAACAT 


CAAGCTCTTG 


TGGGTAGATG 


GTTCTGACTT 


AACTGACTAC 


960 


TTTGAAGACG 


GTGAGATTGA 


TCGCTTGTAT 


CTGAACTTTT 


CAGATCCATG 


GCCGAAAAAA 


1020 


CGCCATGAAA 


AGCGTCGTTT 


GACCTACAAG 


ACCTTCTTGG 


ATACCTTCAA 


ACGTATCTTG 


1080 


CCTGAAAATG 


GAGAAATTCA 


TTTCAAGACG 


GATAACCGTG 


GCTTGTTTGA 


GTACAGTTTA 


1140 


GTGAGCTTTT 


CTCAATATGG 


CATGAAACTC 


AATGGTGTCT 


GGTTAGATTT 


GCATGCCAGT 


1200 


GATTTTGAAG 


GCAATGTCAT 


GACAGAATAC 


GAGCAAAAAT 


TCTCAAACAA 


GGGCCAACTT 


1260 


ATCTACCGAG 


TTGAGGCAGA 


ATTTTAAGAG 


ATAACCTAAA 


ATTAGGCTGT 


ACAAGTGCTT 


1320 
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TTGCTTTACA 


TAAGTTGGCA 


AACGTGCTAT 


ACTGATAGTA 


AGAATATGAA 


AAGTGAGGCG 


X380 


GGGAAATATC 


TTCGCCTCTT 


GCTTATGAGG 


ACGTGGACGC 


AATCGCAACA 


ATCGTAGAAT 


1440 


TAGTCAGAGA 


AGTTGTAGAA 


CCTGTCATAG 


AAGCTCCTTT 


TGAACTCGTG 


GATATCGAGT 


1500 


ATCCAAAGAT 


TGCCAGTCAC 


ATCATTCTCA 


GTATTTTTGT 


ACATAAACCC 


GAAGAATTAC 


1560 


CTTGAACGAC 


ACGGCAGACT 


TGACAGAAAT 


TATCAGTCCT 


GTCCTAGACA 


CCATCAACCC- 


1620 


--AGATCCCTTC- CCAGAACAAT ATTTCCTAGA AATTACCAGT CCAGGTTTGG 


AACGTCCTTT 


1680 


GAAAACCAAG 


GATGCCGTCG 


CTGGAGCGGT 


TGGAAAATAC 


ATCCATGTCG 


GGCTCTACCA 


1740 


AGCCATCGAT 


AAGCAAAAGG 


TCTTTGAAGG 


AACCTTGTTG 


GCCTTCGAAG 


AGGACGAGTT 


1800 


GACTATGGAA 


TATATGGACA 


AGACGCGTAA 


GAAAACCGTC 


CAAATTC C AT 


ACAGTTTAGT 


1860 


ATCAAAAGCA 


CGTTTAGCAG 


TTAAATTATA 


GAAAAAGAAA 


GGATAGCTTT 


TGAGGATTCA 


1920 


AAAGTGAAGA 


AAACATGAGT 


AAAGAAATGC 


TAGAGGCCTT 


CCGCATTTTG 


GAAGAAGACA 


1980 


AGGGAATCAA 


AAAAGAAGAT 


ATCATCGACG 


CAGTAGTAGA 


CTCCCTTCCT 


TCCGCTTATC 


2040 


GCAGACGCTA 


TGGTCAGTCA 


GACAGCGTAG 


CTATTGACTT 


CAACGAAAAA 


ACAGGTGACT 


2100 


TTACAGTTTA 


TACTGTCCGT 


GAAGTTGTTG 


ATGAAGTATT 


TGATAGCCGT 


TTGGAAATCA 


2160 


GCTTGAAAGA 


TGCTCTTCCC 


ATTAATTCAG 


CTTATGAACT 


TGGAGACAAA 


ATCAAGTTTG 


2220 


AAGAAGCACC 


AGCTGAGTTT 


GGTCGTGTAG 


CAGCCCAATC 


TGCCAAACAA 


ACCATCATGG 


2280 


AAAAAATGCG CAAgCAAACA 


CGTGCCATCA 


CTTACAATAC 


TTACAAAGAA 


CATGAGCAAG 


2340 


AAATCATGTC 


TGGTACAGTA 


GAACGCTTTG 


ACAACCGCTT 


TATCTATCTC 


AACCTTGGTA 


2400 


GCATCGAAGC 


CCAATTGTCA 


AAACAAGACC 


AAATTCCTGG 


AGAAGTTTTT 


GCTTCTCATC 


2460 


ATCGTATCGA 


AGTTTATGTT 


TACAAGGTTG 


AAGACAACCC 


TCGTGGTGTG 


AACGTCTTTG 


2520 


TTAGCCGTAG 


TCATCCAGAA 


ATGATCAAAC 


CTTTAATGGA 


GCAAGAAATT 


CCAGAAGTTT 


2580 


ATGATGGAAC 


TGTTGAAATC 


ATGAGCGTGG 


CTCGTGAAGC 


AGGTCACCCT 




2 640 


CTGTTCGTAG 


CCACAATCCA 


AACGTGGATG 


CTATCGGTAC 


AATCGTTGGA 


CGTGGTGGTG 


2700 


CTAATATCAA 


GAAGATTACT 


AGCAAATTCC 


ACCCAGCTCG 


TTACGATGCT 


AAAAATGACC 


2760 


GCATGGTACC 


AATCGAAGAA 


AATATCGATG 


TTATCGACTG 


GGTAGCAGAT 


CCAGCTGAAT 


2820 


TTATCTACAA 


TGCCATCGCT 


CCTGCTGAGG 


TTGACCAAGT 


TATCTTTGAT 


GAAAACGACA 


2880 


GCAAACGTGC 


CTTGGTGGTT 


GTTCCAGATA 


ACAAGCTTTC 


TCTTCCCATT 




2940 


GACAAAACGT 


GCGCTTGGCG 


GCTCACTTGA 


CTGGTTACCG 


TATCGATATC 


AAGTCTGCTA 


3000 


GCGAATTTGA 


ACCCATGGAA 


GACGCTGCTT 


CAGTAGAGTT 


GGAAGTACAA 


AACGATACTG 


3060 
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TAGAAGAATA AAACCTGCTA GAGGAGGGAA AGATGAAAAC AAGAAAAATC CCTTTGCGCA 3120 

AGTCTGTTGT GTCTAACGAA CTGATTGATA AGCGTGATTT GCTCCGCATT GTCAAGAACA 3180 

AGGAAGGACA AGTCTTTATT GATcCTACGG GCAAGGCCAA TGGCCGCGGC GCTTATATCA 3240 

AACTAGACAA TGCAGAAGCC CTAGAGGCGA AAAAGAAGAA GGTCTTTAAC CGCAGCTTTA 3300 

GCATGGAAGT GGAAGAAAGC TTTTATGACG AGTTGATCGC TTATGTGGAT CACAAAGTGA 3360 

AAAGAAGAGA GTTGGGACTT GAATAAGCAA AAGATAAGTA ATCTCTTGGG GCTTGCTCAG 3420 

CGAGCAGGGC GCATCATATC GGGTGAAGAA TTGGTGGTCA AGGCCATTCA AGACGGCAAG 3480 

GCCAAGTTGG TCTTTCTAGC TCATGATGCT GGACCCAATC TGACCAAGAA GATTCAAGAT 3540 

AAAAGTCATT ATTATCAAGT AGAAATTGTA ACCGTGTTTT CAACACTGGA ATTAAGCATA 3600 

GCAGTCGGGA AATCGAGAAA GGTTTTGGCT CTAACAGATG CTGGATTTAC AAAGAAAATG 3660 

AGGTCTCTTA TGGAATAGAA GAGGAGGACA TGATTTGTCT AAGAAAAGAT TGTACGAAAT 3720 

CGCAAAAGAA CTTGGAAAAG AAAGTAAAGA AGTTGTAGCG CGTGCAAAAG AGTTGGGCTT 3780 

GGATGTGAAA AGCCACTCAT CAAGTGTCGA AGAAGCTGTC GCTGCAAAAA TTGCTGCCAG 3840 

CTTTAAGCCT GCAGCTGCTC CGAAAGTAGA AGCAAAACCT GCAGCCCCAA AAGTAAGTGC 3900 

AGAAAAGAAA GCCGAAAAAT CTGAGCCAGC TAAACCAGCT GTAGCTAAGG AAGAGGCAAA 3960 

ACCTGCAGCC CCAAAAGCAA GTGCAGAAAA GAAAGCCGAA AAGTCTGAAC CAGTAAAACC 4020 

AGCTGTAGCC AAGGAAGAGG CAAAACCAGC TGAGCCAGTC ACTCCGAAAA CAGAAAAAGT 4080 

AGCGGCTAAA CCGCAAAGTC GTAATTTCAA GGCTGAGCGT GAAGCACGTG CTAAAGAGCA 4140 

GGCAGAGCGA CGCAAGCAAA ATAAGGGCAA TAACCGTGAC CAACAACAAA ACGGAAACCG 4200 

TCAGAAAAAC GACGGCCGTA ATGGTGGAAA ACAAGGTCAA AGCAACCGCG ACAATCCTCC 4260 

CTTTAATGAC CAAGCTAAGA AGCAGCAAGG TCAGCAAAAA CGTAGAAATG AGCGCCGTCA 4320 

GCAAGAGGAT AAACGTTCAA ATCAAGCGGC TCCACGTATT GACTTTAAAG CCCGTGCAGC 4380 

AGCCCTAAAA GCAGAGCAAA ATGCAGAGTA CGCTCGTTCA AGTGAGGAAC GCTTCAAGCA 4440 

GTATCAGGCT GCTAAAGAAG CCTTGGCTCA ACCTAACAAA CGCAAGGAAC CAGAGGAAAT 4500 

CTTTGAAGAA GCGGCTAAGT TAGCTGAACA AGCACAGCAA CTTCAAGCAG TGGTTGAAGT 4560 

CGTCCCTGAG AAAAAAGAAC CTGCAGTGGA TACACGTCGT AAAAAACAAG CTCGACCAGA 4620 

CAAAAATCGT GACGATTATG ATCATGAAGA AGATGCTCCT AGAAAACAAC AAAAGAATCG 4680 

AAGTAGTCAA AATCAAGTGA GAAATCAAAA GAATAGTAAC TGGAATAACA ACAAAAAGAA 4740 

CAAAAAAGGC AATAACAAGA ACAACCGTAA TCAGACTCCA AAACCTGTTA CGGAGCGTAA 4800 

ATTCCATGAA TTGCCAACAG AATTTGAATA TACAGATGCT ATGACCCTTG CGGAAATCGC 4860 
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AAAACGTATC AAACGTGAAC CAGCTGAAAT TGTTAAGAAA CTTTTCATGA TGGGTGTCAT 4 920 

GGCCACACAA AACCAATCCT TGGATGGGGA AACAATTGAA CTCCTCATGG TGGATTACGG 4980 

TATCGAAGCC AAACAAAAGG TTGAAGTGGA TAATGCTGAC ATCGAACGTT TCTTTGTCGA 5040 

ACATGGTTAT CTCAATGAAG ATGAATTGGT TGAGCGTCCA CCAGTTGTTA CTATCATGGG 5100 

ACACGTTGAC CACGGTAAAA CAACCCTTTT GGATACTCTT CGTAACTCAC GTGTTGCGAC 5160 

AGCTGAA^A~^OTTGGTATTA CTCAGCATAT CGCTGCCTAC^AAXTCGTGG AAAATGGTAA 5220 

GAAGATTACC TTCCTTCATA CACCAGGACA CGCGGCCTTT ACATCAATGC GTGCGCGTGG 5280 

TGCTTCTGTT ACCGATATTA CGATCTTGGT CGTAGCGGCA GATGACGGGG TTATGCCTCA 5340 

GACTATTGAA GCCATCAACC ACTCAAAAGC AGCTAACGTT CCAATCATCG TAGCTATTAA 5400 

CAAGATTGAT AAACCAGGTG CTAACCCAGA ACGCGTTATC GGTGAATTGG CAGAGCATGG 5460 

TGTGATGTCA ACTGCTTGGG GTGGAGATTC TGAATTTGTT GAAATTTCGG CTAAATTCAA 5520 

CCAAAATATC GAAGAATTGT TGGAAACAGT CCTTCTTGTG GCTGAAATCC AAGAACTCAA 5580 

AGCAGACCCA ACAGTTCGTG CGATCGGTAC GGTTATCGAA GCGCGCTTGG ATAAAGGAAA 564 0 

AGGTGCGGTC GCAACCCTTC TTGTACAACA AGGTACCTTG AATGTTCAAG ACCCAATCGT 5700 

TGTCGGAAAT ACcTTCGGTC GTGTCCGTGC TATGACCAAC GACCTTGGTC GTCGTGTTAA 5760 

AGTTGCTGGA CCATCAACAC CAGTCTCTAT CACAGGTTTG AACGAAGCAC CGATCGCGGG 5820 

TGACCACTTT GCCGTTTACG AGGATGAAAA ATCTGCGCGT GCAGCAGGTG AAGAGCGTGC 5880 

CAAACGTGCC CTCATGAAAC AACGTCAAGC TACCCAACGT GTTAGCCTTG AAAACCTCTT 5940 

TGATACCCTT AAAGCTGGGG AACTCAAATC TGTTAATGTT ATCATCAAGG CTGATGTACA 6000 

AGGTTCTGTT GAAGCCCTTT CTGCCTCACT TCAAAAGATT GACGTGGAAG GTCTCAAAGT 6060 

GACTATCGTC CACTCAGCGG TCGGTGCTAT CAACGAATCA GACGTGACCC TTGCCGAAGC 6120 

TTCAAATGCC TTTATCGTTG GTTTCAACGT ACGCCCTACA CCACAAGCTC GTCAACAAGC 6180 

AGAAGCTGAC GATGTGGAAA TCCGTCTTCA CAGCATTATC TACAAGGTTA TCGAAGAGAT 6240 

GGAAGAAGCT ATGAAAGGGA TGCTTGATCC AGAATTTGAA GAAAAAGTTA TTGGTGAAGC 6300 

GGTTATCCGT GAAACCTTCA AGGTGTCTAA AGTGGGAACT ATCGGTGGAT TTATGGTTAT 6360 

CAACGGTAAG GTTGCCCGTG ACTCTAAAGT CCGTGTTATC CGTGATGGTG TCGTTATCTA 6420 

TGATGGTGAA CTCGCAAGCT TGAAACACTA TAAAGACGAC GTGAAAGAAG TCACAAACGG 6480 

TCGTGAAGGT GGATTGATGA TCGACGGCTA CAATGATATT AAGATGGATG ATGTGATTGA 6540 

GGCGTATGTC ATGGAAGAAA TCAAGAGATA AGATTTTTTG CTCCTTTCTT AGGTGGTGAG 6600 
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GGACGCAAGC AAACCGATGG TTTCATTGCT TATTTTTGAG CCTAGGGTCT CAAAAATCCC 6660 

CTGTGATGGG ACTGATAAAT CAGTTCCATC ACTTTCACCA CGGCGAAAGA AGCAGATGAC 6720 

TTCAAATTGA ACTTCGTTTC AATTTAAACT GAAAATCAAG AAGTTTAAAA TAGCTAGGTC 6780 
TGCTGGCCTA GCTTTTGGTT CAAAGTAGAG AAAGGAATAT CATGGCAAAT CATTTCCCTA ~ 6840 

CAGATCGTGT GGGCATGGAA ATCAAGCGTG AAGTCAATGA GATTTTGCAA AAGAAAGTCC 6900 

GTGATCCACG TGTCCAAGGT GTGACCATCA TAGATGTTCA GATGCTGGGT GACTTGTCTG 6960 

TTGCCAAGGT TTATTACACC ATTTTGAGTA ACCTTGCTTC GGATAACCAA AAAGCCCAAA 7020 

TCGGGCTTGA AAAAGCAACT GGTACCATCA AACGTGAACT TGGTCGCAAT TTGAAATTGT 7080 

ACAAAATCCC AGATTTGACC TTCGTCAAAG ACGAGTCCAT CGAGTATGGA AA.CAAGATTC 7140 

ACGAGATGCT ACGCAATCTG GATAAGAACT AAAGAAGAGG GGTTGCCCCT CTTTTTTGGT 7200 

GGAGGAAAAT AGGTTCAATT TGAAATGGAA AAATATTCTT TTATAATAGA TTGAAACTAG 72 60 

AATAGTACGC CTCTACTTCT AAAATATTGT TAGAAATCGA TTTGACTGTC CTGATCGATT 7320 

TGTCCTGTTC TTGTTTCATT TTAATATAAA AAAGGGATTC TGTATTTTTT AATGTTATCT 7380 

AATTAGAAAA TGCTTTTTTT GTAGGAAATA TAATATGATA AGGTGCAAAA AAGAAATAAG 7440 

GAGTTTGTAT ATGGCTGAAC AAGACTTAGC TATGCAAGTA TTGCAACAAG TGGTGAAACT 7500 

ACCTGTTGTT AAGGTTGATC GTTCGAAATT TTTAGTGGAT AAGTTTTCCA AAGAATTGGA 7560 

TCCAAAAGAT ATTCCTACCT TATTGGAACA AGGTCCAACG ACTCTTCTAT CTCAAGAAAT 7620 

ATTAGATCGT GTAGCTAATG CTTGTATTCG GGACAATGTA TTATTAGCGA GTGGGACTTC 7680 

TGTTTTGGCA GGATTACCTG GAGGGCTTGC TATGGCAATT ACCATTCCAG CTGATGTGGC 7740 

TCAATTTTAT GCTTTCTCTC TGAAATTGGC TCAAGAATTA GGTTATATTT ATGGTTATGA 7800 

GGATCTTTGG GCTTCACGAG AGGAGTTGAG TGAAGATGCT CAAAATACCC TCTTGCTTTA 7860 

TCTAGGCGTA ATGTTAGGGG TGAATGGAAC CGCTGCTTTG CTACGTGTTG GTAGTATAAC 7920 

AATTGCCAAA CAGGTAATGA AAATAGTGCC TAATAAAGCT TTAACAAAGA CGCTTTGGTA 7980 

CCCTATTTTG AAAAAAGTCT TAAAAATATT TGGTGTGAAT CTTACCAAGG GAGGGTTGGC 8040 

CAAAGGAATG GGGAAATTTA TTCCTATCTT GGGTGGTATC ATTTCAGGTG GTTTAACCTT 8100 

TGCAACTATG AAACCAATGG GGGAAAGCTT GCAGAAAGAA TTATCCAAGC TAGTCAACTA 8160 

TAGTGAAGTT CAATATCAAG AAGATGTTGA AACAATCCGA AAAGAGGCTG AAATCATCAA 8220 

AGGAGAGTAA TATGAATCCT ATCAAAGCTT TTGCTAAAAT TTATGGTAAT TACTTTTTGA 8280 

CCGTGCAAGG TGTAAAAGTG ATGAAAACGA TAAAGAAAGC TGACCATGTC GTTGTTGGTC 8340 

TGGGGAAACT TTTTATTGCC GACAAGTTAA TGGATACGGC TCGGTGGCTC ATTAAGCCAG 8400 
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AGGACAGAGA ATGAAATTTT TTTGGTCTTC TTGCTATTCT TTTTATCAAA CCGATTATTG 84 60 

GGATTGTCAA ATTCTTTTGG ATGATCATCT CTTTTGCAGT CCAATTGCTG TTTTACAACA 8520 

TAGTGTTTAA GATATTGGAT TGGCTCTTTA AACTTATCTA GATGGTAATC CAAGTTGCAG 8580 

AGAACTAGCA GGAACTCCAC TGCTA G T TT T TTATTCTCTT TCCATATGGT ATAATATAAG 8640 

CAGTAAAATC ATTTTATACT CTTCGAAAAT CTCTTCAAAC CACGTCAGCT TCACCTTGCA 8700 

-GTATATATGT TACTGACTTC GTCAGTTCTA TCGACAACCT ^AAAACGGTG TTTTGAGCTG 8760 

ACTTCGTCAG TTCTATCTAC AACCTCAAAA CACTGTTTTG AGCAACCTGC GGCTAGCTTC 8820 

CTAGTTTGCT CTTTGATTTT CATTGAGTAT TACAACATAC AATGGAGGTC GTCATGGACA 8880 

ATATCATCGA TGTGTCAATT CCTGTTGCAG AAGTGGTGGA CAAGCATCCA GAAGTCTTGG 8940 

AAATTCTAGT GCAGTTCGC? TTTAAACCCC TTCCCAATCC CTTAATGCCC AATACAGTTG 9000 

GTCGTAAAGT ATCACTTAAA CAGGGTTCTA AGCTAGCAGG AACTCCTATG GACAAGATTG 9060 

TACGCACACT GGAAGCGAAT GGCTACGAAG TGATTGGATT AGACTAATGA CAGATGAACG 9120 

GATTCATATC CTACGGGATA TTTTGTTAGA ATTGCACAAT GGCGCCTCTC CTGAGTCGGT 9180 

TCAAGATCGC TTTGATGCGA CCTTTACGGG CCTCTCAGCC ATCGACATTT CCCTTATGGA 9240 

GCACGAGCTG ATGAACTCGG ATTCGGGCGT CACTTTTGAA GATGTTATGG AACTCTGTGA 9300 

TGTCCATGCC AATCTTTTTA AAAATGCTAT CAAACGTGTC GAAGTTTCAC ATACTGAGCA 9360 

TCCAGGTCAC CCAGTTCGTG TCTTCAAAGA AGAAAATCTG GCTCTCCGTG CGGCCTTGAT 9420 

TCGCATTCGT AGATTGTTAG ATACCTATGA GTCTATGGAA GACGAGGAAA TGCTGGCGGA 9480 

GATGCCTAAG GGTTTGGTGC GTCAGATGGG ACTTGTGGGT CAATTTGACA TCCATTACCA 9540 

ACGTAAGGAA GAACTCTTCT TTCCTATCAT GGAGCGCTAT GGACACGATT CACCTCCCAA 9600 

AGTTATGTGG GGAGTGGATG ATCAGATTAG GGAACTCTTT CAAACAGCTC TAACGACAGC 9660 

CAAGTCACTA CCAGAAGTCT CAATTAGCAG TGTAAAGGAA GCTTTTGAAG CTTTTGCGPC 9720 

AGAGTTTGAA AGTATGATTT TCAAGGAAGA GTCCATCCTC CTCATGATTC TCCTTGAGTC 9780 

TTTTACTCAG GATGACTGGC TTCAGATTGC GGAGGAGAGC GATGCCTATG GCTATGCCAT 9840 

CATCCGTCCG TCAGAGAAAT GCGTGCCAGA ACGACAGAGC TTTATTGAGG AAAAGATTGC 9900 

AGAGGAGCCT GTACAGCTAG ATACGGCAGA AGGTCAAGTT CAACAAGTCA TAGATACGCC 9960 

AGAAGGCCAT TTTACCATTA CCTTTACCCC TAAGGAAAAG GAAGCTGTGC TGGACCGCCA 10020 

TAGTCAACAG GCTTTTGGTA ATGGCTATCT TTCAGTCGAG CAGGCCAATC TCATCCTCAA 10080 

TCATCTCCCT ATGGAGATTA CCTTTGTCAA TAAAGAAGAT ATTTTCCAGT ATTACAATCA 10140 
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CAATACGCCA GCTGATGAGA TGATTTTCAA ACGGACCCCC TCCCAAGTCG GGCGCAATGT 10200 

CCAACTCTGC CATCCGCCTA AGTACTTGGA CAAGGTCAAA ACTATCATGA AGGGGCTTCG 10260 

TGAGGGAAGC AAAGACAAGT ATGAAATGTG GTTCAAGTCT GAGTCGCGAG GTAAGTTTGT 10320 

CCACATCACC TATGCTGCAG TACACGATGA AGACGGAGAA TTCCAAGGAG TGTTGGACTA 10380 

TGTTCAGGAT ATCCAGCCCT ACCGTGAGAT TGATACGGAC TATTTPCGTG GATTAGAATA 10440 

AGGAGAAAAA ATCAGTTACG AACAAGAATT TATGAAGGAA TTTGAAGCTT GGGTCAATAC 10500 

CCAAATCATG ATTAACGACA TGGCGCACAA GGAAAGCCAA AAAGTTTACG AAGAAGACCA 10560 

GGACGAGCGT GCCAAAGATG CCATGATTCG CTACGAGAGT CGCTTGGATG CTTATCAGTT 10620 

CTTGCTTCCT AAGTTTGAAA ACTTCAAAGT AGGCAAGGGA TTCCATGATT TGCCAGAACG 10680 

CTTGTTTGGT GAGCGAAATT ATTAAACGAG AAAGATTCTT GATTT TT CAC TAAAATCTTG 10740 

ATAGAATGTT TATGTTAAAT CCTTGTCAGA GCAGGGATTT TTTATTGAAA GGATTTTATC 10800 

ATGTCAAAGA AACTCAATCG TAAAAAACAA TTACGAAATG GCCTCCGTCG CGCAGGTGCC 10860 

TTTTCAAGTA CGGTGACTAA CGTTGTAGAT GAGACAAAAA AAGTCGTGAA GCGTGCAGAA 10920 

CACTCAGCAA GCGCAGCTGG TAAGGCTGTT TCTAAAAAAG TTGAACAAGC AGTAGAAGCT 10980 

ACCAAAGAGC AAGCTCAAAA AGTAGCTAAT TCTCTAGAAC ATTTTGCAGC AAATTTGGGT 11040 

GGACTTCCAC TTGATCGTGC CAAGACTTTC TATGATGAAG GAATCAACTC TGCTTCAGAT 11100 

TTCAAAAACT GGACTGAAAA AGAACTCCTT GCCTTGAAAG GAATCGGCCC AGCTACCATC 11160 

AAGAAATTGA AAGAAAATGG CATCAACTTC AAGTAATTTT TCTTGAGCCT TGCATTTCCG 11220 

AAAAAATCTT GCTACAATAG AGCCATTAGA GGTGTTTTGA ATCCCACATT TTACAGAAAG 11280 

TGGCGGCGCT GAGAAGTCCA CAAATGTGTC AAAACTGGTT GCTAATGGAT GAAAAATTGA 11340 

AATAAAAGTG TCTTTTTGCT TTAAAGACGA GAGTTGCG 11378 
(2| INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 211: 

CCGCGAGCCA CGGCGAATTT GCTGCGGGTA TTCATCAGTC AGGATCTATG ATCTTTGGTG 60 

AACAAGAAAA GGTTCAAGTT GTGACCTTTA TGCCAAATGA AGGTCCTGAT GATCTATACG 120 

CTAAGTTTAA TAACGCTGTT GCTGCATTTG ACGCAGAAGA TGAGGTTCTA GTTTTGGCTG 180 
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ACCTTTGGAC TGGTTCTCCA TTTAACCAAG CTACTCGCGT GATGGGACAA AATCCTCACC 240 

GTAAGTTTGC CATCATCACA GGACTTAACT TACCGATGTT GATTCAAGCC TACACAGAGC 300 

GCCTCATGGA CGCTGCTGCA GGTGTAGAAA AAGTCGCTGC TAATATCATT AAAGAAGCCA 360 

AAGATGGCAT CAAAGCTCTT CCAGAAGAGC TAAATCCAGT CGAAGAAGTT GCAAGCGCTG 420 

CAGCTGCTCC AGTTGCCCAA ACTGCTATCC CAGAAGGAAC TGTTATCGGA GACGGTAAAT 480 

TGAAAATCAA TCTTGCCCGT CTTGAGAGAG GTCTACTTCA CGGTCAGGTT GCAACTCCTT 540 

GGACTCCAGA TTCAAAAGCA AATCGTATCA TCGTTGCTTC ACATAACGTG GCTAAAGACG 600 

ACCTTCGTAA AGAATTGATT AAACAAGCAG CTCCAGGTAA TGTCAAGGCT AACGTGGTTC 660 

CAATTCAAAA ACTGATTGAG ATTTCAAAAG ACCCACGTTT TGGAGAAACA CATGCCCTTA 720 

TCTTGTTTGA AACACCTCAA GATGCCCTTC GTGCCATCGA AGGCGGCGTG CCAATCAAGA 780 

CTCTTAATGT TGGTTCTATC GCTCACTCAA CAGGTAAAAC ATTGGTCAAT ACCGTTTTGT 840 

CTATGGACAA AGAAGACGTT GCTACATTTG AAAAAATGCG TGACTTGGGT GTTGAATTTG 900 

ATGTCCGTAA AGTACCAAAT GATTCTAAAA AAGATTTGTT TGACTTGATT AACAAAGCCA 960 

ATGTCAAATA AGCCATTATT TATGAAAGGA TTTTAAACAT GTCTATTATT TCTATGGTTT 1020 

TAGTAGTCGT TGTAGCCTTC TTTGCAGGTC TTGAAGGCAT CCTCGACCAG TTCCAATTTC 1080 

ACCAACCACT TCTAGCCTGT ACCCTTATTG GGCTTCTAAC AGGTCACTTG GAAGCAGGGA 1140 

TTATCCTCGG TGGATCGCTT CAAATGATTG CCCTTGGTTG GTCAAATATC GGTGCTGCTA 1200 

TCGCTCCTGA TGCTGCACTT GCTTCTGTCG CTGCTGCCAT TATCATGGTT CTTGGTGCTG 1260 

ACTTTACCAA GACTCGTATC GCTGTTGCCC AAGCCGTTGC TATCCCTCTT GCTGTAGCTG 1320 

GACTTTTCTT GACAATGATT GTTCGTACAA TTTCAGTTGG TTTGGTTCAT ACTGCAGATG 1380 

CTGCCGCTAA AAAAGGTGAC TTCGGCGCTG TGGAGCGTGC GCATTTCATC GCGCTACTTT 1440 

TCCAAGGACT TCGTATCGCG CTTCCTGCAG CTCTTCTCCT TATGGTACCA ACTGAAACTG 1500 

TACAAAGTAT CCTTAGTGCC ATGCCAGACT GGCTCAAAGA TGGTATGGCT ATCGGTGGTG 1560 

GTATGGTCGT TGCCGTTGGT TACGCCATGG TTATCAACAT GATCGCAACT CGTGAAGTAT 1620 

GGCCATTCTT CGCTCTTGGT TTCGTPCTCG CTGCTGTGTC AGATATTACT CTAATCGGAT 1680 

TCGGTGCTAT CGGCGTTGCT ATCGCTCTTA TCTACCTTCA CCTTTCTAAA ACTCGTGGAA 1740 

ATGGTGGCGG AGGAGCCGCA ACTTCTAACG ACCCAATCGG CGATATCOTA GAAGACTACT 1800 

AAGATAAGAA AGGACTGAAA ACATCATGAC TGAAAAACTT CAATTAACTA AATCAGATCG 1860 

TAAAAAAGTT TGGTGGCGTT CAACCTTCTT ACAAGGGTCT TGGAACTTTG AACGGATGCA 1920 
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AAACTTGCOC 


TGGGCTTATA 


CACTCATTCC 


ACCTATCAAA 


AAACTCTATA 


CTAAAAAAGA 


1980 


AGATCAAATC 


GCTGCTCTTG 


AGCGTCACCT 


TGAGTTCTTC 


AACACTCATC 


CATACGTAGC 


2040 


TGCTCCACTC 


ATGGGGCTTA 


CTCTTGCGCT 


TGAAGAAGAA 


CGTGCTAACG 


GTGTGGAAAT 


2100 


CGATGACGCT 


GCTATCCAAG 


GGGTTAAAAT 


CGGTATGATG 


GGACCTCTTG 


CTGGTATCGG 


2160 


TGACCCAGTA 


TTCTGGTTTA 


CAGTACGCCC 


AATCCTTGGA 


TCTCTCGCTG 


CTTCACTTGC 


2220 


CCTTACTGGC 


AATATCTTGG 


GGCCACTCCT 


cttctttgtt 


GCATGGAACT 


TGATTCGTAT 


2280 


GTCATTCTTG 


TGGTATGTTC 


AAGAGATTGG 


ATACAAGGCT 


GGATCAGAAA 


TCACTAAAGA 


2340 


TATGTCTGGT 


GGTATCCTTC 


AAGATATCAC 


TAAAGGAGCT 


TCTATCCTTG 


GGATGTTCAT 


2400 


TCTTGCTGTC 


CTTGTTCAAC 


GCTGGGTAAA 


TATTAAATTT 


GCTTTCGATG 


TTTCTAAAGT 


2460 


TCAACTAGAT 


GAAAACGCTT 


ATATCCATTG 


GGATAAATTG 


CCAGAAGGGT 


CTAAAGGTAT 


2520 


CCAAGAAGCA 


TTCGCACAAG 


TAGGACAAGG 


ATTGTCTCAA 


ACTCCTGAAA 


AAGTTACTAC 


2580 


TTTCCAACAA 


AACTTGGATA 


TGTTGATTCC 


TGGATTATCA 


GGACTACTCC 


TTACTTTACT 


2640 


TTGCATGTAC 


TTACTTAAGA 


AAAAAGTATC 


TCCAATCACT 


ATTATCCTTG 


CCCTCTTCGC 


2700 


AGTGGGTATT 


GTGGCACATG 


TTCTTCACAT 


CATGTAATCA 


AGCAACTAAA 


AAGGAACCAG 


2760 


GTTCTAAAAT 


CTGATTCCTT 


TTTTCTATGC 


TTTTATTCAG 


CCAAGGCTCC 


CATTGGATCC 


2820 


CATGGTGCAA 


GTACGATTGG 


TTCTGCTCCA TAGGCAGCTT GTTCTTCTGC 


TGTCAGCAAT 


2880 


TCCTTACGAA 


CAACGATTTG 


GTATGTGTAT 


TCGTCCATCC 


AAGCGTCTGA 


GGCAACAAAG 


2940 


TAACCATCTG 


TACCGACCTT 


GTCTCCCCAT 


GAGTTTTCAA 


CCTTCCACTT 


GGTTGATTTA 


3000 


CCATTTTCGT 


CCAAGTCAAC 


ACCTGTCAAG 


ACCATGGCCT 


GGGTCATCAA 


GCTTTCACTA 


3060 


TAGTCCAAAC 


GTCCAGCCTT 


GTCTTGAGTA 


AGTTTAATGT 


CCATGCTTGA 


TTCAAAGTCA 


3120 


TAAACATCTG 


TCGCAAGGAT 


GCCAGCTTAC 


GGTTGCTGAG 


CTGGCCGACA 


TCAGAACCAA 


3180 


ACCAAACAGT 


CTCACCTGCT 


TGCATTTGGG 


CAATCGCCAA 


TTCTTTCAAG 


CGCTCCATTG 


3240 


GAACGTTGAT 


GTAGCGAACT 


GCACGGCTAC 


CAACCACATT 


CCCCAACATC 


TCAACTGTGT 


3300 


AAGATTTTCC 


GTAAGGTTTA 


TCAGCAGTTG 


GAGCATTGAT 


AACAGAAACG 


TAGTCTTCTA 


3360 


AAGGAAGATT 


GACATATTTC 


TTGTAAAACT 


CTTGTGGTGT 


GATTCCTTTT 


TCACTTTTGT 


3420 


AGTTGTTATC 


TTTATCGCGA 


TAAGCAAAGT 


CAAACTTGCG 


TGGTGGAAGT 


CCTAATGACA 


3480 


TAGCAAGAAA 


GTTAAAGATT 


TCTTGCAAGA 


GGTCTTCTTT 


CTTAGCTTGA 


ACAGTCGCTT 


3540 


GATCTGCACC 


AGAAACAAGC 


AAGTCACGCA 


AGATTTGAGC 


ATCTTGACGA 


AGCAATTTAT 


3600 


TAAGGATCCC 


ATTTAGCTCA 


CGACTGCTGC 


TAGATGAAAC 


AGACTCAGGA 


TAAACTGACT 


3660 


TAGGCACGAC 


ACCGTATTTT 


TCAAAGAGGG 


AAACGACCAT 


ATCCCATTGA 


CCGCCATCTT 


3720 
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GTTGAGGTGT TTGGAGTAAG AAGCTAACtT GCGGCTAGTC AATTCTTGGT CTGAAGTCGC 3780 

AATGACTTGC TCCAAGAACC AGTTTGATTT CTCATACTTA TCCCAGAAGA AAGTGTGGGC 3840 

TTGTGACAAC TCAAAGTTCT CCAATTTGTA TTGCGAGATG ACTTTGTGGC GGAACGTGTT 3900 

GAGAGCCGCA AACATCCAGC AACGACCAGA CGCTTTCTGG TTAGTGACCT TGTCCTTGGT 3960 

TAAATCCAAT GAGAAAACAG GTGTGTTGTC TACATGCCTT TGGCGACGTT CCAGAGCTGC 4 020 

AAAAATTCCG TTGTGGCTCG CAGCATTTTC AATCGCTTGG TATTTTACAT TTGCTTCATA 4080 

GTTGGCAAAT AGTTTATCAG TAAATGATTC TTGAATCGCG TTCATAGATT CCTCCTTTTA 4140 

GTCTACAGTG TATTGG 41S6 



{2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3902 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 212: 



AAAAACAACA 


AAATAAAACA 


AAAACAAAAA 


TATCGAGGTT 


TATTTTCAAA 


ACTTTCGATA 


60 


TTTTTATTAA 


GTTATTATTT 


TGTTGTTTCT 


AGTTTACTTT 


TTGATGGTTA 


AGAGTGGTGG 


120 


AGAATTATAC 


TCAATGAAAA 


TCAAAGAGCA 


AACTAGGAAG 


CTAGCCGCAG 


GCTGTACTTG 


180 


ACTACGGCAA 


GGCGAAG CTG 


ACGTGGTTTG 


AATTTGATTT 


TCGAAGAGTA 


TTAGTGCAAA 


240 


CCGTAGTTGT 


AGTCATCATC 


TTGCATGGCT 


TCAACTTCGC 


CAAGAAGGTA 


ACCATTTCCG 


300 


ACTTGAGAGA 


AGAAGTCATG 


GTTGGAAGTT 


CCTGTTGAAA 


TACCGTTCAT 


AACGATTGGG 


360 


TTGACATCTT 


CAGCTGAATC 


TGGGAAAAGT 


GGATCTTGTC 


CCATGTTCAT 


GAGAGCTTTA 


420 


TTGGCATTGT 


AGCGAAGGAA 


GGTTTTAACC 


TCTTCAGTCC 


AACCAACACC 


CTCATAAAGA 


480 


CTCTCTGTGT 


AGCCTTCTTC 


ATTTTCATAA 


AGAGTATAGA 


GTACCTCGTA 


CATCCATTCT 


540 


TTGAGTTTTT 


CTTGCTCTTC 


TTCAGGTAAT 


TCATTGAAAC 


CAAGTTGGAA 


TTTGTAACCA 


600 


ATGTAGGTTC 


CGTGAACAGA 


CTCGTCACGA 


ATAATCAATT 


TAATGATTTC 


TGCAACGTTG 


660 


GCAAGTTTGT 


TGTTACCGAG 


ATAGTAGAGG 


GGAGTGAAGA 


AACCAGAGTA 


GAAGAGGAAG 


720 


GTTTCGAGGA 


AGACGCTGGC 


AACTTTCTTT 


TCAAGTGGGC 


TGCCGTTTAG 


GTAGATTTCG 


780 


TTGACAATCT 


CAGCCTTCTT 


TTGTAGGTAA 


GGATTGGTAT 


TGGTCCATTC 


GAAAATTTCT 


840 


TCAATCTCAG 


CCTTAGTATT 


CAAGGTAGAA 


AAGATTGATG 


AGTAAGATTT 


AGCGTGGACA 


900 
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GATTCCATAA ATTGGATGTT ATTGAAGACA GCTTCCTCAT GTGGTGTACG GATGTCTGCG 960 

CGAAGGGCTT GAACCCCAGT TTCAGATTGC ATAGTGTCAA GAAGGGTTAA ACCACCAAAA 1020 

ACTTTTCCGA CCAAGTCTTT CTCTTTGTTA GATAGCTTTC TCCAGTCATC CAAGTCGTTT 1080 

GATAAGGGAA TACCTGTATC GAGCCAAAAT TGCTCCGTCA GTTTTTCCCA AGTTGATTTG 1140 

TCGATGACAT CTTCGATGGC ATTCCAGTTA ATGGCTTTGT AGTAAGTTTC CATTTAAAAT 1200 

CTCTTTCTGT GTTTAGTATT GCGAACTCAC AATTATTTCT ACTTTACCAT AATTCTATAG 1260 

GAGTATCGCA CAAAAAGTCG GAAGCCCGAC TTTTAAAATG TTACATAAAT TATGTTATGA 1320 

CATAGTAGAT TTGATTTTAT CAGTGCTGCT TAGGGAAAAA TAGTGTTTCT ATGCTAGAAA 1380 

CTAAATCACA CAGCTTTCAC ATTGGTTGGC GCCGACTTCT CCACCGTCAT CTGTAAAGGT 1440 

ACGGACGTAG TAGATAGACT TGATTCCCTT GTTAAACCCA TAGTTACGAA GGATGGACAA 1500 

GTCACGTGTC GTTTGTTTAT TTTCCCTCTT CCATTCGTAA AGGCCTTTTG GAATGTCACT 1560 

GCGCATGAAG AGGGTGAGTG AAAGTCCTTG ATCCACGTGT TCAGTCGCAG CAGCGTAAAC 1620 

ATCGATGACT TTACGCATAT CCATATCCTA GGCAGAAGTG TAGTAAGGAA TGGTTTCTGT 1680 

AGACAAGCCA GCAGCAGGGT AATAGATTTT ACCAATTTTC TTCTCTTGGC GTTCTTCGAT 1740 

ACGTTGCGTA ATCGGGTGGA TAGAAGCAGA AACGTCGTTG ATATAGCTGA TAGAACCATT 1800 

TGGCGCTACA GCAAGGCGAT TTTGGTGGTA AAGACCATCT TCTTGAACCT TGTCGCGAAG 1860 

TTCAGCCCAA TCAGCAACAC CAGGGATAAA GACATTTTTG AAGAGTTCTT TAACACGGTC 1920 

TGATGTTGGA ACAAATTCAC CAGTTACATA CTTGTCAAAG TAACTTC CG T TAGCATAGTC 1980 

TGATTTTTCA AAGTTGTGGA AGGTAATACC ACGTTCACGT GCAATATTGT TTGACTCTAC 2040 

CAAGGTCCAC TAGTTCATAA GCATAAAGTA GATGCTTGTA AATTCAACAG ACTCAGGTGA 2100 

AC CAT ATT CA ATGAGTT G TT GGGCAAGCTA GCTGTGCAGT CCCATGGCAC CGAGACCAAA 2160 

GGTGTGGGCT TGGCTATTTC CATGGTCAAT CGTTGGTACA GCTACGATAT GTGAACTATC 2220 

TGTAACGAAA GTAAGGGCAC GAACCATAGC ACGGATAGAA CGACCAAAAT CAGGTGAAGT 2280 

CATCATGTTA ACCACGTTGG TTGAACCCAG GTTACATGAA ACATCTGTTC CCATTTGAAG 2340 

GAATTCTTGA GCATCGTTGA TCAAGCTTGG TTCTTGAACT TGAAGAATCT CAGAACACAA 2400 

GTTACTCATG ATAATCTTTC CATCAACAGG ATTTGCACGG TTAGCCGTAT CGATGTTGAC 2460 

TACATAAGGA TAGCCAGACT CTTGTTGCAA TTTAGAGATT TCAGTTTCCA AATCCCGCGC 252 0 

CTTGATTTTT GTCTTGCGAA TATTTGGATT TGCGACCAAT TCATCGTATT TTTCAGTAAT 2580 

GTCGATGTAA TTGAATGGCA CACCGTATTC TTTTTCTACA GAGTAAGGGC TGAAGAGGTA 2640 

CATTTCTTCA TTTTTACGAG CCAATTCGTA GAATTTATCA GGTACTACAA CACCAAGTGA 2700 
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TACAGTCTTG ACACGTACTT TTTCATCAGC GTTTTCTTTC TTAGTTCAAA GGAAAGCGAT 2760 

GATATCTGGG TGAAAGACGT TGAGGTAGAC AACACCAGCA CCTTGACCTT GCCCCAATTG 2820 

GTTGGAGTAA GAGAAGCTGT CTTCAAAAAG CTTCATAACA GGAACGACAC CTGAAGCAGC 2880 

TCCTTCATAG CCTTTGATAG GTGCACCAGC TTCACGAAGG TTGCTGAGGG TAATTCCCAC 2940 

ACCACCACCA ATACGTGAAA GTTGAAGAGC TGAGTTGATA GAACGCCCGA TAGAGTTCAT 3000 

ATCATCCGTC ACTTGGATTA GGAAACAAGA TACCAACTCC CCACGACGAG CACGTCCAGC 3060 

ATTCAAGAAG GAAGGAGTAG CAGGTTGGTA GCGTTGGTGG ATGATTTCAT TGGCAATATC 3120 

GATTGCAACA GCTTCATTCC CATCAGCGAA ATAAAGGGCA TTGAAGAAGA CACGGTCTTC 3180 

CATATTTTCA AGATAGTATT CACCGTCATT AGTCTTTAAG GCATATTGAT TGTAAAATTT 3240 

ATAAGCTGCC ATGAATGACT TGAATTGGAA GTTTTGGTCT TTGATAAATT GAGCTAATTC 3300 

TTCCAAGAAC TCTGGACGGT ATTTCTTGAT AAAGGCTGTT TCGATGTAGT TGTGTTCAAT 3360 

GAGGTAATTG ATTTTGTCTT TGATTGAATC AAAAACCATA GTGTTTGGAA CTACATTTTC 3420 

TTTAAAGAAA GCATCCAAGG CTTCCTTGTC TTTATGAAGC ATGATTTGTC CATTAACAGG 3480 

ACGCTTAATT TCGTTATTAA GACGGAAGTA AGTCACGTCT TCAAGATGTT TTAATCCCAT 354 0 

AAAATTTCCC TTATCTAATT ACAAAAGAAA GGCTTCTAAG TTAGCCCTAA AAGCAGTTTC 3600 

TTCTGGATGA TGTACTAAGA TTATGCTAAT TGTTTCAGTT TTCCTGGTTG GAAACCTGAA 3660 

AAGACTTCAG TTGGTGTTTG GATAACAGGA GCTGCGCTAA AACCGAGCTC TTTAACTTGA 3720 

TCGACGTACT CAGGTTGCTC ATCAAGATTG ATTTCACGAT AAGAGACATT ATTACTGTCC 3780 

AAGAAACGCT TGGTCATTTT ACATTGGACA CAATTGTTTT TAGAATAAAC GGTTACCATT 384 0 

GTGTAACTCC TCTTCAAAAT TTAATACTAT CTTAGTATAT CAGAAAATAA AATTTTGTCG 3900 

GG 3902 

(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24S6 base pairs 
(S) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

TATTGAAGCT ATTGTAGACT ACAAAGATAA GGATTTGCAG TTAGTAGGCG GTGAGACTCA 60 

CTGATAACCT AAAAAGGATA GTCAATTATG CT T CTTTACT AACTATTAAC TATGCTAAAT 120 
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CAATTGACGT TGTTTACATA AAACTCTATA TCAGAGAAGC CTGATATAGA GTTTTTTCTT 180 

GCTACTTTTA GGATTTTTTT GTAAAATACA AAAAGTGAAG ACAGGTATGA AATGAGCAAG 240 

AAAGATAAAA AAATCGAAAT TCAAGTAGCG GATGCCAAAG TTAATGTTGC TAAAGACAGT 300 

TTTGAAGGTT ATACATTGAC TATCGGTAAA AAAGTTATCG GAGAAATTGC CGAATTAGAC 3 60 

GGACAATTTG CCATTATAAA GAATGGGAAT GTCGATAGTT TTTATAAAAA ATTGGAAAAA 420 

GCTGTGGAAA TTTTGATTGA AAATTATAAT TTAGCAAAAT AAGTCTTGTT TTTTTGAAAT 480 

TTTCATGATA TAATAGTCCA TGTTGATTGT AGGAGAGATA GCGAAGAGGC TAAACGCGGC 54 0 

GGACTGTAAA TCCGCCCCTT CGGCTTCGGG GGTTCGAATC CCTCTCTCTC CATTTCATTA 600 

ATGGGGTATA GCCAAGCGGT AAGGCAAGGG ACTTTGACTC CCTCATGCGT TGGTTCGAAT 660 

CCAGCTACCC CAGTTCTTAG GTAATAATCA ACATACAAAG CAAAATATCT TAGGCTATTT 720 

TATTTTTATA ATTGAAAGAC GTGAATGATA TGAACATGTC CTTGCGGGTG CTTAGGAAAA 7 80 

AAATTATAAG TATGTCAAGT TTAAGAAAAA CTTCATTGTT GGAGGATTTT TTAGATGAAC 840 

GAATTTGAAG ATTTGCTAAA TAGCGTTAGT CAAGTTGAGA CTGGTGATGT TGTTAGTGCT 900 

CAAGTATTGA CAGTTGATGC GACTCAAGCT AACGTTGCAA TCTCTGGAAC TGGTGTTGAA 960 

GGTGTCTTGA CTCTTCGCGA ATTGACAAAC GATCGTGATG CAGATATCAA TGACTTTGTT 1020 

AAAGTAGGAG AAGTATTGGA TGTTCTTGTA CTTCGTCAAG TAGTTGGTAA AGATACTGAT 1080 

ACAGTTACAT AC C TT G TATC TAAAAAACGC CTTGAAGCTC GCAAAGCATG GGACAAACTT 1140 

GTTGGTCGCG AAGAAGAAGT TCTTACTGTT AAAGGAACGC GTGCCGTTAA AGGTGGACTT 1200 

TCAGTAGAAT TTGAAGGTGT TCGTGGATTT ATCCCAGCTT CAATGTTGGA TACTCCTTTC 1260 

GTACGTAACG CTGAGCGTTT TGTAGGTCAA GAATTTGATA CTAAAATCAA AGAAGTTAAC 1320 

GCTAAAGAAA ACCGCTTCAT CCTTTCACGT CGTCAAGTTG TTGAAGCAGC TACTGCAGCA 1380 

GCTCGCGCTG AAGTATTCGG TAAATTGGCT GTTGGTGATG TTGTAACTCG TAAAGTTGCT 1440 

CGTATCACAA GCTTCGGCCC TTTCGTCCAC CTTGGTGGTG TTGACGGATT GGTTCACTTG 1500 

ACTCAATTGT CACATGAACG TAATGTATCA CCAAAATCAG TTGTAACTGT TGGTGAAGAA 1560 

ATTGAAGTGA AAATCCTTGA TCTTAACGAA GAAGAAGGAC GTGTATCACT TTCACTTAAA 1620 

GCAACAGTAC CAGGACCATG GGATGGCGTT GAGCAAAAAT TGGCTAAAGG TGATGTAGTA 1680 

GAAGGAACAG TTAAACGTTT GACTGACTTC GGTGCATTTG TTGAAGTATT GCCAGGTATC 1740 

GATGGACTTG TTCACGTATC ACAAATTTCA CACAAACGGA TTGAAAATCC AAAAGAAGCT 1800 

CTTAAAGTTG GTCAAGAAGT TCAAGTTAAA GTTCTTGAAG TTAACGCAGA TGCAGAACGC 1860 

GTGTCACTTT CTATTAAAGC TCTTGAAGAA CGTCCACCCC AAGAAGAAGG ACAAAAAGAA 1920 



WO 98/18931 



PCT/US97/19588 



1213 

GAAAAACGTG CTCCTCGTCC ACGTCGTCCA AGACGTCAAG AAAAGCGTGA TTTCGAACTT 1980 

CCAGAAACAC AAACAGGATT TTCAATGGCT GATTTGTTTG GTGATATCGA ACTTTAATCA 2040 

AATTGAAAAT TCACAAAATC CTTTGTTTAC TAAACAAGGG ATTTTTCTGG CTCTTTGTCA 2100 

ACTGTAGTGG GTTGAAGAAA AGCTAACCTC GAGAAAGGAC AAATTTTGTC CTTTCT T T T T 2160 

TGATATTCAG AGCGATAAAA ATCCCTTTTT TGAAGTTTTC AAAGTTCCGA AAACCAAACC 2 220 

CATTGCGCTT GAT AAGTTTG "aTGAGATTAT TGGTCGCTTC CACTTTCGCG TTAGAATAGT 2280 

GTAGTTGAAG GCTGTTGACA AGCTTTTCTT TATCTTTGAG CAACGTTTTA AAGACAGTCT 2 340 

GAAAAATAGG ATGAACCTGC TTAAGATTCT CCTCAATAAG TCCGAAAAAT TTCTCCGGTT 2400 

CCTTATTCTG AAAGTGAAAC AGCAAGAGTT GATAGAGCTC ATAGTGGTCT TTCACG 2 456 
(2) INFORMATION FOR SEQ ID NO: 214: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10974 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

AAATAGGATA TAGAGACATC CTTCTGATCT GCTTTTwACA AAGTCCAATT ATATGCGCAT 60 

CTATACCTCC ACAATGTCCA TTATTATmCC TAACTATAAT ATGAGCCGAA AACACTATAT 120 

CCTTAATGTC TCCATATCCA TCAGGGATAT TAATATTTAT TTTTCCACAA CTATATTGCA 180 

TTGTAACCAT CTCCTTAAAC GACGCATTAT GATATTTCAT AGAGAAATTT TTATGAATAA 240 

CTCAATAATT TTATAGTAAA TCATGCTTAT ATCTCAAAGA TACCTATTTT ATCTTGTCTC 300 

GACCTTCTCC AAAGAATTGC TATAATACTA TTACAAATCC ATCTGCACTA CACTTCAAAT 360 

TTTAGCACTG TATAAAAACG TTTCAATACA CTAACTTCAA GAAAACTTCC ACTATTAATT 420 

GAAAAAATTG ATAGAGATAA ATTAAAAATC TATATTGAAA CTCATCCCCA TGCTTATTTG 480 

ACTGAAATAG CTGCTGAATT CAACTGTCCT CCAACAACTA TTCATTACGC TCTAAAGGCT 540 

ATGGGATATA GTCTAAAAAA GAGCCGTACC TACTGCGAAC AAGACCCAGA AAAACTAAAT 600 

CGGTTCCTTA AAGAATTGAA TCACTTAAGC TACCTCACTC CTATTTATAT TTATGAGACA 660 

GGGGTTGAGA CCTATTTTTA TCTCGAATAT GATCGAGCCT TGAGCAGGCA GTTAGTCTCT 720 

CTGGAAGAAG ATATAATTAT TTGAATTAAG ATCGAGACAA CGCACACCAG AGATTGCGAT 7 80 

ACTGTTATAG AAGTACTAAT GCCCTTTTTT GTTTCAATAT ACTATGGCTC CGATGACCTA 840 
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TAAAGATACG 


ATGACGAGTG 


ACTTTTTCGA 


AGCTTGCTTC 


CAAAAATTCT 


TACTACCTAC 


900 


TTTAGATACA 


CCATCCCTTA 


TCATTATGGA 


CAATGCAAGG 


TTTCACAGAA 


TGAACATGTG 


960 


TAAGGAGCAG 


GGCATAGACT 


GTTACCACTT 


CCTACCTATT 


CACCCCACTA 


TAATCCCATT 


1020 


GAGAAAATAT 


GGGCTTACAT 


CAAAAACATC 


TCAGAATAAT 


ATTGTCAAAT 


TACGATGCTT 


1080 


TTCTTGAGGC 


ACTTTTGTCC 


TATTCTTGTT 


TCAGCCGACT 


ATACTCCGTT 


ATTGGGCAGC 


1140 


TACGGAACAG 


TCGATGGGAC 


GATGGGGGGA CATAAAAAAA TCCTCCAGTT TTGTTTTTTA 


1200 


TAACAGTATA 


CTGGAGAATT 


GACAATCTCG 


GTAGATACCT 


CGTTATAGCG 


CGGTTACTTA 


1260 


TTAGGCAGTT 


ACAAAACAAC 


TGTGAACAGA 


AAACATTCCA 


GAGTCAGACA 


AGACTTTGGA 


1320 


ATGTTTTGGC 


TCTATAATTT 


CTGTAGTGGG 


TAATCCCACC 


CCAGGAATTA 


TAGGGTCGTT 


1380 


TCTTGTAGAA 


AAAAAGCCCC 


ATATGACCTA 


TAATGAAAAG 


CGTCTAACCA 


ACTCATTAGA 


1440 


AAGGCTTCAT 


ATGGAACAAC 


TTAAGAATAC 


CACAGATTTG 


CTCGGATTGG 


AAGACAAAAA 


1500 


TATCAAAATC 


TTGTCTGTTC 


TGAAATACCA 


AACCCATCTA 


GTCGTTCAGG 


CAAAGTTGGA 


1560 


TTCCCCCGCT 


CCTCCTTGTC 


CTCATTGTCA 


AGGGAAGATG 


ATCAAATACG 


ACTTCCAGAA 


1620 


AGCCTCTAAA 


ATTCCGCTTC 


TCGACTGTCA 


GGGTTTACCC 


ACGGTACTGC 


ATCTCAAAAA 


1680 


GCGCCGCTTT 


CAGTGCAAGA 


ATTGCCTTAA 


GGTGGTCGTT 


TCTCAAACAT 


CCATTGTCAA 


1740 


GAAAAATTGC 


CAGATTTCCA 


ACATGGTGAG 


ACAAAAAATC 


GCTCAGCTCC 


TCCTTGAAAA 


1800 


GCAGTCTATG 


ACTGAGATTG 


CCCACAGATT 


GGCGGTCTCA 


ACTTCCACCG 


TCATCCGAAA 


1960 


ACTGAGGGAA 


TTTAAGTTTG 


AAACCGATTG 


GACCAAGTTC 


CCAAAAGTTA 


TGAGTTGGGA 


1920 


TGAGTATAGC 


TTCAAAAAGA 


GCAAAATGAG 


CTTCATTGCC 


CAAGATTTTG 


AGTCCAAATC 


1980 


CATCCTCGCA 


ATTTTAGACG 


GGCGAACTCA 


TGCGGTGATT 


CGAAACCATT 


TCCAACGCTA 


2040 


TCAGAGAGAG 


GTTCGGGAGC 


TGGTCGAGGT 


CATCACCATG 


GACATGTACA 


GCCCTTATTA 


2100 


7CGGCTCCCT 


AAGCAACTCT 


TTCCAAAGGC 


GAAGATTCTT 


CTTGACCGCT 


TCCACATTGT 


2160 


CCAACATCTG 


AGCCGAGCTA 


TGAACCGAGT 


ACGAATCCAA 


ATCATGAACC 


AATTTGACCG 


2220 


AAAATCCTTG 


GAGTATCGGG 


CGCTCAAGCG 


CTTTTGGAAC 


CCTCGCTTTT 


TCGTTTCTAG 


2280 


GCTCGGGCTA 


AATCAGTCCA 


CTGGACTGAT 


TTACTACACC 


AGTATAGCTT 


CAAGCTCTGT 


2340 


CAGAAACGAT 


TCTATCAGCC 


CACGTTTCGA 


ATGCACTTAA 


CCCATCGGGA 


AGTACGAGAT 


2400 


AAGCTGCTTT 


CTTACTCTGA 


GGGATTACAG 


GTTCACTACG 


AACTCTATCA 


ACTCCTGCTC 


2460 


TTTCATTTTC 


AAGAGAAGAA 


TGCCGACCAT 


TTCTTTGGAT 


TGATTGAGCA 


AGAACTGCCA 


2520 


ACGGTTCATC 


CGCTTTTTCA 


AACGGTCTTT 


TGGACTTTTT 


TAAGGGATAG 


AGATAAGATT 


2580 


ATCAACGCAC 


TTAAGCTGCC 


TTATTCCAAC 


GCTAAACTTG 


AAGCGACCAA 


TAATTTGATT 


2640 
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AAGATTATCA AGCGCAAAGC CTTTGGTTTC CGGAACTTTA ACAATTTTAA AAAACGGATT 2700 

TTGATGACTT TGAACATCAA AAAAGAGAGT ACGAATTTCG TACTCTCCAG ATTGCAGCTT 2760 

TTCGCCTACC CACTACACTT GACAAAGAGC CACTCTTTAT TCCATGGTAT CAAAGGCAAG 2820 

ACTTGCTTTG GCATTGAGGT CCCAGCCTGC GAAGTTTTCT TTGTTCCACT CGCTGACGCT 2880 

GGCATAGGCA ATCATACCTG CATTGTCTCC GCAGACTCGC AGAGGGGGGA TGATAACCTT 2 940 

GACATCTGTG ATTTCGGCTG CTAGGCGTTC TCTGAGACCT TTATTGGCTG CCACACCACC 3000 

TGCCACAACT AGGATTTTAA CAGGATATTT CTCCAAAGCC TTCTTGGTTT TTGCCATGAG 3060 

AATGTCCATA ACTGCTGCTT GGAAGGAAGC ACACAAATCT TCTGTAGACA GGCTTTCTCC 3120 

CTTTTGCTCG GCATTGTGAT GAAGATTGAT AAAGGCAGAT TTCAAACCTG AGAAGGAGAA 3180 

CTCCAGATTA TCTTCCTTAA TCATGGCACG GGGGAAATCA TAAATATCCT GCCCCTGATG 3240 

AGCCAGCTCG TCAATCTCAC GACCTGCAGG ATAGGTCAAG CCCATGACAC GGCCGACCTT 3300 

ATCATAAGCC TCACCAACCG CATCATCACG GGTTTCCCCA ACAATCTTAT AATCTCCTGC 3 3 60 

CTCCGAAACA TAAACCAACT CTGTGTGTCC GCCGCTGACC AAGAGGGCTA GCAAGGGAAA 34 20 

CTCCAAAGGC TCCACACTCT GAGCTGCCAT GAGCTGCCCA GCCATGTGAT TAACAGGAAT 34 80 

CAGTGGAAGT CCGTGAGCCC AAGCAAAGGC CTTGGCAGCT GACAAACCAA CTAGCAAGGC 3 540 

TCCGACCAAG CCTGGTCCGT AGGTAACCGC AACAGCTGTC ACGTCCTCTT CGGTAATCCC 3 600 

TGCTTCTGCC AATGCCTCCT CGATACAGGC TGTAATGACC TCGACATGGT GACGACTGGC 3660 

TACTTCGGGC ACTACGCCAC CAAAACGTTT GTGACTCTCA ATTTGACTAG CAATGACATT 3720 

GGACAAGAGC TCATCGTCGT TTTTCAAGAC GGCGACACTG GTCTCATCAC ACGATGTCTC 3780 

AAATGCTAAA ATATATCTAT CCTTCATCTA TTTCTCTCTT CATGATAATG GCGTCCTCGA 3 840 

CTGGGTCATG GTAGTAGGCC TTTCGCTCAG CGATAACTGT CATCTTTTCT TTCTTGTAAA 3 900 

ATGCTTGCGC TCGTTGATTT GACTGTCTGA CTTCGAGGAA AATTTCCTTG TCTGTCGGCA 3960 

ATTGAGCAAA CAAGGcTGAC GCAATCCCCT GACCCTGATA AGCTCCTTTG ACAGCGATTT 4020 

GCAGGACTTC TGCTTCAAAA AGATTCTCCT GCACAGCTAG AAATCCAATC ACTTCTCCCC 4080 

CATCATAAGC CAATGCATAC CAAGTCTGGT CTTGGGACAG ATCTGCTTGG ATTTGCTCCA 4140 

GAGTCCAAGG ACTGACTAGG TAAACAGCTG CCATAACAGC GTAGATGGCT TGAGCTAGGT 4200 

CAGGCTGTTG TTGAATTCGC TTGATTTCTA TCATAGGCGT TTAATCTAAG ACTCGCCAGA 4260 

CTCGGTATGG TTCTTGAGCC AGTTTTCCTC AGCCTCGACT CGTTTGAGGT AATTCGGCAC 4 320 

AAAATCATGC AAGGAGTCTG CTTCCTTGTC CCAGGCCAAA AGAGCTAGAT TAGCTGCATT 4 3 80 



WO 98/18931 



PCI7US97/19588 



1216 

GGGCAATCTT TCTTTGTAAT CAGTCCTTGG CAAGTGTTTT TGAATCTGCT CAACAAAGGG 4440 

GCCAACTTCT CCGACAAAGG TTACCTGACT AGTACCCTTG ACTTTTTCTA GCACCTCTTC 4 500 

AAAAGATAGG TGCGCTTCTG CCATGACAGG TTTGGCATTT TCATAAAATC CTGCATAAAC 4560 

ATTATTGCGA CGCGCATCCA TCAAGGGGAC AAACAAACCT TCTTGTTGAT GGGGCACCAG 4620 

AGCCAAGAGA CTCGACATAC CAACCAACTC CATGTTCAGG GTGTGAGCTA AGGTCTTAGC 4680 

AGTTGCTACC GCAATTCGCA AGCCTGTATA GCTACCCGGC CCTTCAGCTA CCACGATTCG 4740 

GTCCAAATCC TTGGGTGTCC AATCCAAACT TGCCATCAAA AAATCGATGG CAGGCATAAG 4800 

AGTAATACTG TGATTTTTCT TAATATTAAT CGTCGTCTCG GCAAGAACCT GCTTATCCTC 4860 

TAAAATAGCC AGAGAAAGAG CCTTGCTGGA CGTATCAAAA GCTAATACTT TCATAACACA 4 92 0 

TTCCTATCTT TTTGTCTGCT TACT ATT ATA CTACAAAAGC TCGCACATGG GAATTTTCTT 4980 

TGCCCCCAGA CAAGAGTGCC CTCACTTAAC TAAAAATAAT TTAAAAAAAT GCTCACTTTT 5040 

CCTTTTCTTT TCCGAATATA AAAGTGAACA AGAAAAAAGG AGGAAAGTTC AATGACAAAT S100 

TTTGACATTC TTGACAATCA ATTTTTATCC TTATCTGAAA ATGAATTATC AGATATTGAT 5160 

GGCGGTCTCG CTCCCTTGGT TATCTTTGGA GTAGCAGTAT CTTGGAAGGC TATTGCAGGT 5220 

GGAACAGCAC TTATAGGTTC TGGTTTGGCA GCTCGTTATT TTTTAGGAGG AGATTAATAT 5280 

GATGAAAGAT TTGAACAATT ATCGTGAAAT TTCTAATAAG GAATTGCAAG AAATCAAGGG 5340 

TGGCTTTGGT GTCGGTGTTG GTATCGCTTT ATTTATGGCA GGTTATACCA TTGGAAAAGA 5400 

CCTTCGTAAA AAGTTTGGTA AGTCATGCTA GATAAGAAAC ACATTTTTAG AAGGATAAAT 54 60 

TTTATTGTCT TCATCTCTTA CAGTTTGCTC AGCATTCTCA ATGATTTGAA CATTACTACC 5520 

ATCCCTTTAC CATTCGATTT ATCTGTTTGT ATTGTTTTAT TTTTATGCTT CAACTCTATT 5580 

TTTGATCAGA ACAATGACTC CCATAAAAAT AATAAGCTTT GAAAATTCCA TTGTCATGTC 5640 

ATGTTAGAAA AATGCAAAGA CCACCTCATC TTGATAGATG GGGTGGAATT TTCGTGTCCT 5700 

AAATCTACTA TCTCTACATT CCCAAACAAA AAACCCCAGC ATAAGCAGGC CATCTAAGCA 5760 

TTTAATTCAA AGTAAAATAC AAACCAAACG ACATAGGTCA CGAGGAGGAG AAAAAGCGAG 5820 

TAGAGAGTCA CAAAGGTCAT TTTCCACAAG AACTTGGTTT GTCGTCGTTC CAGTTTGGCA 5880 

AATAGAAGAT TCCCCGCATA AACGCAAGCA ACAAAAACAA TAAAAGCTAC CAAGCGAGCT 5940 

CCGATAGCAA AAGCAAATAA GTTATACATA GGGCAACCTC CTTGACTTAA AATCTATATG 6000 

GAATTATGAC AAGCAATAAA TTTCACTTCC GTTATCAACA TAATACATTT TCTTTATTTT 6060 

TGAAAACGCT TACCAAAGAA ATCGTCCCCT AACTTTGTCG TTTCCGTCTT TTACTAATTT 6120 

TTCATTTTGT GGTATAATTG AAATAATTGT AACGAATCAA GGTCAATCTA GACACAAAAT 6180 
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GG AATGAAAT CAAGCAAATA TCTGCTAAAA GTTTGGAATA AGCTGACCTG TAAATAGAAA 6240 

GGAACTATAT GATTTACAAA GTTTTTTATC AAGAAACAAA AGAACGTAGC CCACCCCGTC 63 00 

AAACAACACG CACGCTTTAC CTAGACATCG ATGCCAGCTC AGAACTTGAG GGCCGTATCA 6360 

CTCCTCGCCA ACTTGTCGAA GAAAATCGCC CAGAGTACAA TATCGAGTAT ATCGAACTCT 64 20 

TGTCTGACAA ATTCCTCGAT TACGAAAAAG AAACTGGCGC CTTCGAAATT ACGGAGTTCT 6480 

AATATGCCCT-ACACTCTTAA ACCTGAAGAA GTGGGCGTTT TTGCCATCCG TGGTCTAGCA ""6540 " 

GAAATCGGGA AAAACACTTA CGGAATTGAA TACCAAGACG AGATTATCAT CGTCGATGCT 6600 

GGGATTAAAT TCCCAGAAGA TGACTTGCTT GGTATCGACT ATGTCATTCC TGACTACTCT 6660 

TACATCGTGG ACAATATCGA CCGCGTCAAG GCTGTTTTAA TCACACACCG ACACGAGGAC 6720 

CACATTGGTG GGATTCCGTT CCTACTCAAG CAAGCAAATG TCCCTATTTA TGCTGGACCG 6780 

CTTGCCTTGG CTTTGATCCG TGGGAAACTC GAAGAACACG GCCTCTTGCG CAACGCCAAA 6840 

CTTTACGAAA TCAACCACAA CACCGAGTTG ACCTTTAAAA ATCTCAAGGC AACTTTCTTT 6900 

AGAACGACTC ACTCTATTCC AGAGCCTTTG GGGATTGTCA TTCATACTCC TCAAGGGAAA 6960 

ATCGTCTGTA CGGGTGACTT TAAGTTCGAC TTTACTCCAG TTGGAGAACC TGCGGACTTG 7020 

CATCGTATGG CTGCGCTTGG TGAAGAAGGC GTGCTCTGTC TCCTGTCTGA CTCGACAAAT 7080 

GCGGAAGTAC CAACCTTTAC CAACTCTGAA AAAGTCGTTG GTCAGTCCAT TATGAAGATT 7140 

ATCCAAGGTA TTGAAGGACG TATCATCTTT GCATCCTTTG CCTCAAATAT CTTCCGTCTC 7 200 

CAGCAGGCAA CAGAAGCTGC TGTTAAGACT GCACGCAAGA TTGCGGTCTT TGGTCGTTCT 7 260 

ATGGAAAAGG CCATTGTCAA CGGAATCGAT CTTGGCTACA TCAAAGCTCC TAAGGGAACC 7320 

TTTATCGAGC CAAATGAAAT CAAAGATTAT CCTGCAGGAG AAGTTCTTAT CCTCTGTACA 7 380 

GGTAGTCAGG GTGAGCCTAT GGCAGCCCTC TCTCGTATCG CCAACGGAAC CCACCGTCAA 7 440 

GTACAATTAC AACCAGGTGA TACCGTTATC TTCTCTTCTA GTCCCATCCC TCCAAACACT 7500 

ACTAGTGTCA ACAAGCTGAT TAACATCATT TCTGAAGCTG GTGTCGAAGT TATCCACGGT 7 560 

AAAGTGAACA ATATCCATAC ATCTGGACAC GGTGGTCAGC AAGAGCAAAA ACTCATGCTC 7620 

TGCTTGATTA AGCCAAAATA CTTCATGCCT GTCCACGGTG AATACCGCAT GCAAAAAGTC 7 680 

CACGCTGGAC TAGCAGTGGA TACTGGTGTT GAGAAGGACA ATATCTTTAT CATGAGCAAT 7740 

GGCGATGTCC TTGCCCTTAC TGCTGACTCA GCTCGTATCG CAGGTCATTT CAACGCCCAA 7800 

GATATCTATG T CG AT GG AAA TCGTATCGGT GAAATTGGCG CAGCTGTCCT CAAAGATCGT 7 860 

CGCGATCTAT CTGAAGACGG TGTCGTTCTG GCAGTTGCAA CTGTTGACTT CAAATCGCAG 7920 
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ATGATTCTAT 


CTGCTCCAGA 


CATCCTCAGC 


CGAGGCTTTG 


TCTACATGAG AGAGTCTGGC 


7980 


GACTTGATTC 


GCCAAAGCCA 


GCGTATCCTC 


TTCAATGCCA 


TTCGTATCGC ACTGAAAAAT 


8040 


AAGGATGCTA 


GCGTGCAATC 


TGTCAATGGT 


GCCATTGTCA 


ACGCTATTCG CCCCTTCCTC 


8100 


TATGAAAATA CCGAACGTGA ACCGATCATC ATCCCGATCA TCCTCACACC AGATGAAGAA 


8160 


TAAAGCAAGA 


AAACAGCCCC 


GTCCTCGGAG 


CTGTTTTTCT 


CTATGCTTTC TTTTGAGATT 


8220 


AAAACTCATA 


CTCAATGAAA 


ATCAAAGAGC 


AAACTAGGAA GCTACCCCTA GGTTGCTCAA 


8280 


AGCACTGCTT 


TGAGGTTGTA 


GATAGAACTG 


ACGAAGTCAG 


TAGCCATACC TACGGCAAGG 


8340 


CGACGTTGAC 


GCGCTTTGAA 


GAGATTTTCG 


AAGAGTATCA 


ATAAAAATCG AAATCAGACT 


8400 


AGAAGGCTAA 


GCGAAAGCAT 


AACTTGAGT7 


AGCTCCCATA 


GTTCGGGAAA CTATGGGAGG 


3460 


CTGGAGATGA 


ATCAAAGCCA 


AGCTTTGAAC 


TCATTCGTAA 


GAAGCCGACG ACGTATCATT 


8520 


TTGATTTTTG 


AAGAGTTTTA 


GAAATACTAC 


GATTTTTACC 


TTCCAGATAC ACCATCAAAA 


8580 


TAGAAATATC 


TGCTGGGTTT 


ACTCCCGAAA 


TACGGCTGGC 


TTGGCCGATG GTTTCTGGAT 


8640 


TGATGAGTTT 


GAACTTCTGA 


CGGGCTTCGG 


TTGCGATAGA 


ATCAATGTCA TCCCAGTCGA 


8700 


TATTGGCCCG 


AATGCGTTTT 


TCTTCCATGC 


GTTTCATCTT 


GGCAACCTGG TCCATGGCTT 


8760 


TGGAAATATA 


GCCTTCATAC 


TTGATTTCTG 


TTTCAATCAA 


TTCGATAATC TTGTCATCCA 


8820 


AGTCTTCTGC 


AGCTGGTCCG 


ATGAAGGCCA 


CCACATCTTG 


GTAAGAAACT TCTGGACGGC 


8880 


GAAGGAATTC 


CTTGGCTGTC 


ACTGCATCGG 


TCAAGGGTTT 


GAAGCCCATC TCCTCAACCT 


8940 


TGGCATTGGT 


TTCCTTGACT 


GGCTTGAGTT 


TGATACTGTC 


TAGGCGCTTC ATCTCATTAT 


9000 


CAAATTGATT 


TTTCTTGATT 


TCAAAACGAG 


CCCAGCGTTC 


ATCGTCCACA AGGCCAATCT 


9060 


CGCGTCCCAT 


CTCAGTCAAG 


CGCATATCAG 


CATTGTCATG 


ACGAAGAATG AGACGGTATT 


9120 


CAGCACGACT 


GGTCAAGAGA 


CGGTAGGGTT 


CAATGGTTCC 


CTTGGTCACC AACTCGTCGA 


9180 


TCATCACCCC 


GATATAACCA 


TCACTGCGCT 


TCAAAATCAA 


TTCAGGCTTG CCTTGGATTT 


9240 


TCAGAGCCGC 


ATTGATACCC 


GCCATAATCC 


CTTGGCCTGC 


TGCCTCTTCG TAACCTGATG 


9300 


TTCCATTTGT 


CTGACCAGCA 


GTGAAGAGAC 


CTGAGATTTT 


CTTGGTTTCC AAAGTCGCAC 


9360 


GCAACTGATG 


AGGCAAGACC 


ATATCATACT 


CAATAGCATA 


ACCTGTCCGC ATCATCTCTG 


9420 


CATTTTCCAA 


ACCTTTGATG 


GAATGCACCA 


AGTCACGCTG 


GACATCCTCA GGCAGACTGC 


9480 


TTGAAACTCC 


TTGCACATAG 


ACTTCCTCAG 


TATTGCGCCC 


TTCTGGCTCA AGGAAGAGTT 


9540 


GGTGACGTTC 


CTTGTCCGCA 


AAGCGCACAA 


TCTTGTCTTC 


AATCGACGGA CAGTAACGAG 


9600 


GCCCCACTCC 


CTTGACCACA 


. CCTGTAAACA 


. TAGGCGCACG 


GTGGAGGTTG TTTTGGATAA 


9660 


TCTCATGACT 


1 GGTACCATTG 


i GTATAGGTCA 


> ACCAGCATGG 


1 TACTTGGTCC TTGACATAAT 


9720 
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CCTCATCACG TGAAGTGTAT GAGAAATGAT TAGGCACTTC GTCTCCTGGC TGAATTTCTG 9780 

TCACATCGTA ATTGATAGAA GAAGCCTTGA CACGTGGAGG GGTTCCTGTC TTGAAACGAC 9840 

CGATTTCGAG ACCCAGTTCC TTGAGATTGT CAGCTAGGTT AATAGAAGCC AAGCTGTGGT 9900 

TAGGACCTGA TGACTACTTG AGGTCTCCGA TGATAATTTC CCCACGGAGA CCAGTCCCTG 9960 

TCGTCACAAT AACAGCCTTA GCAGCATATT CTTGATGGGT GGCTGTACGC ACACCGACAA 10020 

GCTTGGCATC-TTCCACCAAA ATCTCATCAA TCATGGTTTG ACGAAGGGTC AGATTTTCTT 10080 

GGTTTTCAAC CGTCTTGCGC ATCTCCTTAG AGTAAAGTTC CTTGTCAGCC TGCCCACGAA 10140 

GGGCACGGAC AGCTGGCCCC TTCCCTGTGT TTAGCATCTT CATCTGGATG TAAGTCTTGT 10200 

CAATGGTrrr GGCCATCTCG CCACCGAGGG CATCGACTTC ACGCACGACA ATCCCCTTGG 102 60 

CAGAACCACC GATAGAGGGA TTACAAGGCA TGAAAGCCAG CATTTCAATA TTGATGGTCG 10320 

CAAGCAGGAC CTTACAGCCC ATACGGCTAG CGGCCAAGGA AGCCTCAACC CCAGCGTGTC 10380 

CCGCACCAAT TACAATAATA TCGTATTCTT CAGTAAAATG ATAAGTCATG TTTCTCTCCT 10440 

ATTCCTCAAG ATGAATGTGT CTTAGTTGGC CTTCCCAATC TGGTAGGGCT GTTTTTAAAA 10500 

AGACTGGAAC TAGCTGGATA TTCTGGAGCT TATCCAAGTC AATCCACTCA CAGGGCTGCC 10560 

TTTTCTCATC TTCCTGCATG GTCAACGGGG CATCTTCAAG CAAATCCACC AGATAATGAA 10620 

ACTCGATATT GTGATAGGAA ACGCCGTCCA CTTCAAAACG ATTTTCAACC ACAAAAGCTA 10680 

GCTGCCCAGC TTGAGCTTTC ACACCCAGTT CTTCCTTCAC TTCACGGACT ACCGCGTCTT 10740 

CCGTGCTTTC ATTGACTTGA ATCGCACCTC CAATAGTGTA ATACTTGCCC TTGTCTTTGG 10800 

TAACTAGAAG CTTGTGATTT TGGACAATCA AGGCTGTAGC CCGAACACCA AAAACCGTAT 10860 

TGTCTACTTT TGTCCGAAAG TCTTGTTGAG TCATTCTTGT CCTTTCCCTT AAACGACACA 10920 

AAAACAGTCA AAACTACAAA GAAGTGCAGG ACAAAAAAGC CTGCAACATC CAGG 10974 
(2) INFORKATION FOR SEQ ID NO: 215: 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

CCCGTTATGA TTATGGATAG CGCTTTCAAA TTTTTAAACT CCTATCCCAT CCTTTTATCT 60 

ATATAATAAG TGAAAATATA ATAACTGTCA AGTAACTGAA GTGAATTTTA TAAAAAAATT 120 
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ACAAGCCAAA TTTGTAAACT TTACACTAAG CCGCTAGgCA ATCCTCTATC AGAATATCCG 180 

TTTATTTGTC AATAATCCGA GAAAATCTTG CAACGCTTAG AACTCTATAA AAACTATCAA 240 

CATTTATATG ACTTGCGAAT AGCAATCCTG CTAAACCTTT CCACACTCTA TCTATACAAT 300 

CAAGATAAAA ACATGTGTAA GCAAATCTGC TACACTTTAC TGGAGGACGC CAAGAATAAG 360 

AAAAGCTACG ATAGGCTTGC TATCTGCTAT GTCCGTATTG GGATTTGTAC AGACGATTCT 420 

AAACTTATCC AAAAAGGCTT CTCCCTTCTG GAGCTGACCG AGGAAACTTC TATGCTGTCT 480 

CATCTCAAAA AAGAAGTAGA GACCCATTAT CAACCAAAGA AATTATAAAA AAAGTCGAGG 540 

GAGCTCCTCG ACCTTTTCAT AGAATCGCCG AACGATTTAA CGAGAAAGTA TGACTTTTAC 600 

GTTTATCCCA ACTCAATTAT GACATTTTTT TCAAAAGTCA ATATATCTCA CTTTTTCAAC 660 

GACAAGAAAG AGGCTGATAA TCTACCAACC TCTTATTCTG AACCCATCAC TCCATCACTT 720 

TTTAGCTTCA TTCGCTTTCT TAGCGACTGC AATCTCGTAT TCGACTTGGT CATTCCCCTT 780 

ACCGGTACAA CCATGAGCAA TTGTAGTCGC TCCTATCTGA TGCGCTATTT CAACCAATTT 840 

TTTAGAAATC AGAGGGCGGC TCAAGGCAGA TACCAAGAGA TACTTTTGTT CATAATAGGC 900 

ATGTGACTGA TGAGCCACTA GCACATAATC TGTAGCAAAT TCGTCCTTAA CATCAATGAC 960 

ATAAGATTCT ACTGCCCAAA CCTTAAG 987 



(2) INFORMATION FOR SEQ ID NO: 216: 

{ T) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2651 base pairs 
<B) TYPE: nucleic acid 
(C) STRANOEONESS: double 
(Di TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

CTGGGTCTTG TTCATAGTAG GTGTGGTtCT TTTTTTCGAG TGTAGCCCAT AGCTTTGAGC 60 

GCATAGTGGA TGGTAGTTGG ATGACAGCCA AAGTCAGAAG CTATTTCACT CAAATAACCA 120 

TCTGGATTGT CAGTAAGATA GTTTTTAAGT CTATCTCTAT CAACTTTTCT TGGTTTTGTT 180 

CCTTTTACTT GGTGGTTTAG CTCTCCTGTT TTCTCTTTTA GCTTTAACCA GCCATAAATG 2 40 

GTATTACGTG AGATTTGGAA AACGTGTGAT GCTTCTGTTA TACTACCTAT TCGCTCACAA 300 

TAAGAGAGAA CTTTTTTACG AAAATCTATT GAATATGCCA TAAGAAGATT ATACCACATT 3 60 

GTGTACTATT TTTGGTTCAT TTTACTATAT TTTATAAGTT ATAGTGTAGC ATTCCAACTT 420 

CAAAGCACTA TAAAGTAAAT TGAAACAAGA ACAATACAAA CAATTCTCGT AAACGGATTG 4 80 

CAACCACAAA AAAGCAAGCA TTCACAAGAA TACTTACCTA TCATGGGAGG AACAACCGTT 540 
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CCTCTTTTTT ATTACTAAAA TTCAAAGAAT TCCAATGCTT TTTTCAAGAG CAAATCCGTA 600 

TATTCTGGAT CTTCTTGGGC TACTTCTATT TCCCGCTGAA CTTTTTCCAA ATCATCTGTA 660 

AT C ACT C CAT CTACTCCTAA GTGAAGAGAT TTGCTGATAG CTTCTGAATC ATTGACAGTC 720 

CAGACATAAA GTTTCTGATC CGTTGTCCAT AGTTTGCTTA CAAAATATTC ATCCAAGGTT 790 

GAGTACTCCA TAGTATATCC TGTCGCTCTT GTTTTAGGAA AGACAGAATT GTAGGGCATG 840 

ATGAAATAAA CTGGTAGTTC GG CAT CAT AC TGTCTTACTT TTTCGACAAC ATGGTAGTCT 900 

AAAGACTGGA TTTGATGTCC ATAAATCTTG AGCTTTGCAG CATAACGGGC TAAAAAGCGG 960 

TTCATCATGT CTGGACTATC TTTTTTACTG GTTTTAATTT CAATTAGTAA TTTTTGACCA 1020 

AGTTCGTTGG CTCGACTGAG ATAATCTTCA AAGCTTGAAA TTTTAGTCTG GTAGCCATTT 1080 

TCAAAAATAT CAATCCCTTT AAGCTCCTCC AAGTTTAAGT CTTGAGGACT TTTATTGATA 1140 

CCTGCTAGAT TTTTCAAGTT AGCATCATGC ATCATGACAA ACTGCCCATC TTTTGTTTCC 1200 

TGCACCTCCG TCTCCACCAA GTCTGGTTTG AGTTCTGCTG TAGTTTCCAA GGACTCTACT 1260 

GTATTTTGAA TCCCATTTGC ATTGGAAACC CCTCGGTGAG AAATAAGTTG AGGTAGATGA 1320 

ACGATGGGAG CCTCCAGATA AATATAACCT TCTAAGGCAA AGAAAAGACT GGCACAAGTC 1380 

ATGACACCCC ATCCCACGAT GTGATCTTTT TCTCTCCTAG GAAGCATATC CAGCTCCTTT 1440 

CCTGTCAAAA ATGAAACAAA TTTAACCAAA AAATAAGTCA GAGCCATATA ATAGAGATTT 1500 

TTAATCACGA CAAAATTCAA AATACCAAGA ATCAGAGACT CTCTCTGAGT GATATCATCT 1560 

ACCAAACTTT GAGCCAATAA TAAAGGAATC AAAGGAAGAT ArAATAATAA ATGTGCTTTG 1620 

AGCAAGATGT AAAATAAATT CCAAGCATAA AAAGTAACTC TCTTCTTGGT TTTCTCCAAG 1680 

CTAAACATCA CTGCTTCTCG AACAGTCAGC TGATCATATA CAATCTTCGG AAGGGCAAAC 1740 

ATCAATCTGA CAGAGACATA GAGAAAGATA AGACATAGAA GTAGGATGCT CAGCCACCAC 1800 

ATCCAATATC TATCTTCTAA ATAAGCTTGG ATAAACTCTG GAATGACGAT TTTATTAACA 1860 

TAATAAATCT TCAGCATTTT CCGTATAAAA GGAAACAGCA TAGCTATATA GAAAAAGATA 1920 

AACAAGGCTT TAGCGCAAGT TACCTTTTTC ATAAATCCAA AACTTTCATG GAAAACCTTG 1980 

CGGATATACT CAATTAGCCT TCGCTTTTCA TTATAGAGGA GATGACGAGC ACCAATAAAG 2040 

AGGAGTCCTA TTTGAAAATA AGCAACCAGA AGGTTAATTA CAATCAAGGC TAAAAAAGCT 2100 

AGACTAATCA ATGGAGAATG AGTAAGGATG GCTAAGACAT TGTTATAGGA AATAAAAAGA 2160 

TAACCTGTCT GATCTAATAA GAAGCTAGCC AACCATGAAT TGAATGGTAC CCACAAATAC 2220 

TCCACTATCA TAAAAATCAA GAAAAATAGA AAGAGGATTT TATCAAGATC GAGGTAAATC 2280 
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TGTTTAAGAC CCAATTTTTT AGGTTTTTCA GGTTTCATAG GCACTCCTAG TCAAATAATT 2 340 

GAGACAAGTC CAAGCCACCA AAAGGATTGT TTGATAAGCT ACTTTCTCTC TCTAACAATT 2400 

CCCTAGCTTG ATCCGACTCT AAGAAGCATT CGTAAACACG CGCCGTCATC CGAGCATCCT 24 60 

CTAAACTATT ATGAGACTGA CCTTGAAATC CAAGAAATGA CGCAACACTT TCCAATTTGA 2520 

GATTGGCAAT ACCATGTAAA TCTGAACTCC GACGTTCAAA AGCTTCATCA TACAAATCCA 2580 

CCTTGTACTG TTGGCTATAG TCTAAACCAT GCTCTGCTAA AATAGGTAAA TCACTTTTAG 2640 



CAGCATTGTA G 

(2) INFORMATION FOR SEQ ID NO: 217: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



2651 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

CGTTATAATA AACTTGTGAA AAAATTAACA AAGGATATCG TTCCTTGAAA CCTATGGACG 60 

AAAATATGGC TGATAAAAAA ACTGTGACAC CAGAGGAAAA GAAACTCGTT GCTGAAAAAC 120 

ACGTAGATGA GTTGGTTCAA AAAGCTCTAG TTGCCCTTGA AGAAATGCGT AAATTGGATC 180 

AAGAACAAGT TGACTACATC GTTGCCAAAG CATCAGTAGC AGCTTTGGAT GCCCACGGAG 24 0 

AATTGGCTTT ACATGCCTTT GAAGAAACAG GACGTCCTCT ATTTGAAGAC AAAGCAACTA 300 

AGAACTTGTT TGCCTGTGAA CACGTAGTAA ACAACATGCG CCACACTAAG ACACTTGGCC 3 60 

TTATCGAAGA AGACGATGTA ACAGGATTGA CTCTTATTGC TGAACCAGTT GGTGTTGTTT 420 

GTGGTATTAC TCCAACAACA AACCCAACAT CAACAGCAAT CTTCAAATCA TTGATTTCAT 480 

TGAAGACACG TAACCCAATC GTCTTTGCCT TC CATC CATC AGCACAAGAA TCATCTGCTC 540 

ATGCAGCTCG TATCGTCCGC GATGCAGCTA TCGCAGCTGG TGCTCCTGAA AACTCTCTCC 600 

AATGGATTAC TCAACCATCT ATGGAAGCAA CAAGTGCCCT TATGAACCAC GAAGGTGTTG 660 

CGACAATCCT TGCAACAGGT GGTAATGCCA TGGTTAAGGC GGCTTATTCA TGTGGTAAAC 720 

CAGCTCTTGG GGTAGGTGCC GGAAACGTTC CAGCTTATGT TGAAAAATCA GCAAACATTC 780 

GTCAAGCAGC ACACGATATC GTCATGTCTA AATCATTTGA TAACGCTATG GTCTGTGCAT 840 

CTGAACAAGC AGTTATCATT GATAAAGAAA TTTACGATGA ATTTGTAGCA GAGTTCAAAT 900 

CTTACCACAC TTACTTTGTA AACAAAAAAG AAAAAGCTCT TCTTGAAGAG TTCTGCTTCG 960 

GCGTCAAAGC AAACAGCAAA AACTGTGCTG GTGCAAAATT GAACGCTGAC ATCGTTGGTA 1020 
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AACCAGCAAC 


TTGGATTGCA 


GAACAAGCAG 


GATTTACAGT 


TCCAGAAGGA ACAAACATTC 


1080 


TTGCTGCAGA 


ATGTAAAGAA 


GTTGGCGAAA 


ATCAGCCATT 


GACTCGTGAA 


AAATTGTCAC 


1140 


CAGTTATTGC 


AGTTTTGAAA 


TCTGAAAGCC 


GTGAAGATGG 


TATTACTAAG 


GCTCGTCAAA 


1200 


TGGTTGAATT 


TAACGGTCTT 


GGACACTCAG 


CAGCTATCCA 


CACAGCTGAC 


GAAGAATTGA 


1260 


CTAAAGAATT 


TGGTAAAGCT 


GTTAAAGCTA 


TTCGTGTTAT 


CTGTAACTCA 


CCTTCTACTT 


1320 


TTGGTGGTAT 


CGGGGACGTT 


TACAATGCCT 


TCTTGCCATC 


ATTGACACTT 


GGATGTGGTT 


1380 


CTTACCGACC 


CAACTCAGTT 


GGGGATAACG 


TTAGTGCCAT 


TAACCTCTTG 


AATATCAAAA 


1440 


AAGTCGGAAG 


ACGGAGAAAT 


AACATGCAAT 


GGATGAAACT 


TCCTTCAAAA 


ACATACTTTG 


1500 


AACGTGATTC 


AATTCAATAC 


CTTCAAAAAT 


GTCGTGACGT 


TGAACGTGTC 


ATGATCGTTA 


1560 


CTGACCATGC 


CATGGTAGAG 


CTTGGTTTCC 


TTGATCGTAT 


CATCGAACAA 


CTGGACCTTC 


1620 


GTCGCAATAA 


GGTTGTTTAC 


CAAATCTTTG 


CGGATGTAGA 


ACCGGATCCA GATATCACAA 


1680 


CTGTAAACCG 


TGGTACTGAG 


ATTATGCCTG 


CCTTCAAACC 


ACATACCATC 


ATCGCACTCG 


1740 


GTGGTGGGTC 


TCCAATGGAT 


GCTGCCAAAG 


TAATGTGGCT 


CTTCTACGAG 


CAACCAGAAG 


1800 


TGGACTTCCG 


TGACCTTGTC 


CAAAAATTCA 


TGGATATCCG 


TAAACGTGCC 


TTCAAGTTCC 


1860 


CATTGCTTGG 


TAAGAAGACT 


AAATTCATCG 


CGATTCCAAC 


TACATCTGGT 


ACAGGATCTG 


1920 


AAGTAACACC 


ATTTGCCGTT 


ATCTCTGATA 


AAGCAAACAA 


CCGTAAATAC 


CCAATCGCTG 


1980 


ACTACTCATT 


GACACCAACT 


GTGGCAATCG 


TAGATCCTGC 


TTTGGTATTG 


ACAGTTCCAG 


2040 


GATTTGTTGC 


TGCTGATACT 


GGTATGGACG 


TATTGACTCA 


CGCGACAGAA 


GCATACGTAT 


2100 


CACAAATGGC 


TAGTGACTAC 


ACTGATGGTT 


TAGCACTTCA 


AGCCATTAAA 


TTGGTCTTTG 


2160 


AAAATCTCGA 


AAGCTCAGTT 


AAGAATGCAG 


ACTTCCACTC 


ACG7GAGAAA 


ATGCATAACG 


2220 


CTTCAACAAT 


CGCTGGTATG 


GCCTTTGCCA 


ATGCCTTCCT 


AGGTATTTCT 


CACTCAATGG 


2280 


LULnlnAUn 1 




it*' i*/^ ft ft r+ ft ft 

TTC C AC ACAA 


TCCACGGTCG 


TACAAATGCT 


ATCTTGCTTC 


2340 


CATACGTTAT 


CCGTTACAAC 


GGTACACGTC 


CAGCTAAGAC 


AGCAACATGG 


CCTAAGTACA 


2400 


ACTACTACCG 


TGCAGATGAA 


AAATACCAAG 


ATATCGCACG 


CATGCTTGGA 


CTTCCAGCTT 


2460 


CTACTCCAGA 


AGAAGGGGTT 


GAATCTTACG 


CAAAAGCTGT 


CTACGAACTC 


GGTGAACGTA 


2520 


TTGGGATCCA 


AATGAATTTT 


AGAGACCAAG 


GAATTGACGA 


AAAAGAATGG 


AAAGAACATT 


2580 


CTCGTAAATT 


AGCCTTCCTG 


GCTTATGAAG 


ACCAATCTTC 


ACCAGCTAAC 


CCACGTCTTC 


2640 


CAATGGTAGA 


CCATATGCAA 


GAAATCATCG 


AAGATGCATA 


CTATGGCTAC 


AAAGAAAGAC 


2700 


CAGGACGCCG 


TAAATAATTG 


TTTATCAGTC 


TAGAAGCAAG 


ACAAAAACTC 


AATTTGAGGG 


2760 
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AAAGATCCAG 


TAATTTTTCT 


ATGATAAAAG 


GCATCCTATC 


AAGGTTTTTG 


AACACCTGAT 


2820 


AGGATGCCTT 


TTTATGATAT 


TGAGGCCTTT TTGCCCTTTT TGAAAAACTA GAATAGAAAC 


2880 


AAAATATATA 


ATAGATTGAA 


ACTAGAATAG 


TACATATCTG 


CTTCTAAAAC 


ATTGTTAGAA 


2940 


TTCGATTTGA 


CTGTCCTGAT 


CGATTTGTCC 


TGTTCTTATT 


TCATTTTGAT 


ATATAAAAAA 


3000 


TATAGTATAG 


TAGACTGAAT 


CTAAAATACT ACGAAACAAT TCCTAAAACA TTTATAGAAA 


3060 


TTAATTTTAC 


TTTTCTGATA 


GAGTTGTTCA 


CATCTTATTT 


CAATTCACTA 


TAGTTTAATT 


3120 


TAAGAGTAGT 


ATTTACTAAG 


GCCCAATTAA 


AATCAAAGAG 


CAAACTAGAA 


AACGAGTGCC 


3180 


ATTCAGCTCA 


AAACACTGAT 


TTGAGATTGC 


AGATAAGACT 


AGCCCCCTCA 


TTAACAGATT 


3240 


TACGATAAAA 


CGATGACAAG 


GTGTGTTGCT 


TTTTGATTTC 


TAAAGAGTAT 


AATGATAGAT 


3300 


CTCTATAAAA 


TAAGTGCGAA 


GGAAATGAGC 


TTTTATAGTC 


CTTTCGTTTT 


AAAATACTAT 


3360 


CTCAGATATT 


CTTATATCGA 


CAAGAAGTTT 


TTGAGTCATT 


CCCTCATCAT 


ACATATTAAA 


3420 


TAAATAGTGG 


CTCATTCAAT 


TTTTCACTAG 


AATAATAAGC 


TAGTATAGTA 


AACTGAAATA 


3480 


AGATATAAAC 


AAATAAATTG 


GAGCTTAACA 


TCCATTTCCA 


GCAATTTTTT 


AGAAACTACA 


3540 


GTGGACTATT 


CTAGATTCAA 


CAT ATT AT AA 


AAACTAGAGT 


AAAACAAAAG 


GATTGGATCT 


3600 


TGTGTAATGC 


AGGATCCAAT 


CCTTTCAATC 


ATTTTGTCCA 


ACTTTTGGAG 


GTTCCTACAA 


3660 


TGTAGTCGTC 


ATTAATAAAG 


ACAGATGGGA 


ATGACAGTGT 


TCCTATTTAT 


TTTGATAGAG 


3720 


ATCGATGAAT 


TCTTTAGATA 


GCAACTGAAT 


AATCTCTGTT 


GAAGCCATTT 


GGTCTTCTGC 


3780 


ATGCATAAAT 


AGCAAGGAGA 


ATCCTATTTT 


TTCTCCAGTA 


GCTTCTTTTT 


GTATGAGATT 


3840 


AGAGTGAATC 


TTGTGCGCTT 


CTACTAAGGA 


GTCTTCCGCT 


TCTTCAACTT 


TAATTTTCGC 


3900 


TTCTTTTAAA 


TTTCCTGCCT 


TAGCTAGTTG 


GATGGCTTCA 


ATAAAGGATG 


ATTTGGCTGC 


3960 


TCCACTATTG 


GCAATGAGCT 


GAAAACAGAT 


ATATTCCATT 


TCTTCTGTCA 


TCTTATTTCT 


4020 


CCTATCCATG 


CAAGTGCTTG 


TTCCAGAACT 


TTTGCTCCAT 


TCATCATTCC 


GTAATCCCGC 


4080 


ATATCAATGG 


TATCTACAGG 


GATATTTCCT 


GCAATTTCTT 


TCACAGCAAG 


TAACTCATAA 


4140 


CGAATTTGTG 


GCCCAATTAG 


AATGACATCT 


GCTTCATGGA 


TATTCTTTTT 


AGCTTCTGTC 


4200 


ATTGATTTTG 


CTTGGATAGA 


GATTTCAATC 


CCACGTTCAG 


TCCCACTTTG 


TTGCATTTTT 


4260 


TTAACAAGCA 


TACTTGTCGA 


CATTCCCGCA 


TTACATACTA 


ATAAAATTTG 


TTTCATAATC 


4320 


TTAACCTTCC 


ATTTCTTGTT 


CAACAACTTT 


GTCATTAACT 


TTGATAAATG 


GAATCTATAG 


4380 


AAGAACTCCA 


AGTGCAAAGA 


TGATGAATTG 


AACTAGAACT 


GCTCTCACGT 


CCCCTGCTGT 


4440 


TGCTAACCAT 


GCATTTAAGA 


ATACTGGTGT 


AGTCCAAGGA 


ACTTGTATAA 


ATGCAGGACT 


4500 


CATGAATTCT 


GTAACTGTTG 


CTAAGTAGCT 


GATTAAAATA 


CCAAGGACTG 


GAACTGTGAT 


4560 



WO 98/18931 



PCT/US97/19588 



1225 

AAATGGAATA GCTAATGAAA TGTTATAAAC GATTGGGTAA CCGAATAATA CTGGTTCATT 4620 

GATATTGAAG ATACCAGGTC CAAAAGATAA TTTAGCCACG TTTTTAGAGA CAGCATTGCG 4 680 

ACTCACTAAG AATGTTGCTA TTAATAAACA TAATGTAGAT CCACTACCAC CCATTAAAGC 4740 

GAATGTTTGT ATTTCTGATA GGTTGATGAT GTGTGGAATG GCTTGTCCAT TATTTGCTGC 4800 

AGTGATGTTT TCAGTAATGT TAATTAATAG TAATGGTTCT AGGATGGCAC TGTAAATAAC 4860 

TGCTTGGTGA ATACCAAATA GCCATAACAT ATTTCCTAAA GAGTAAATAA TAATGACCCC 4920 

GATTAAGCTT GTACCAATAT GACGAATTGG TTCTTGAATA AAGATTGTAA TGATTGAGAT 4980 

TAAGTTCATT CCAGTTATAT TGAATAATAA TGCTGAAACA ACCCCAAATA AGGAGATGAC 5040 

GGTCATGACT GGAAGTAATA CGCTAAATGA TCTACTAACA GCTGGTGGAA TATTTTCACC 5100 

AAGGTTCATT TGTAAAGCTT TAACCTTTGA TAATTCAATG AATAATTCTG TTGCAATAAT 5160 

CGtACGATAA CCCCGGCGAA CATTGCGCCT GTACCTGTGT TGTTGAATGA AAGAACACCT 5220 

GAAATGTTTA CCGCATCTTT TGCTCCGTCA GGAACTACAG AAACTGTATT TGGCATCATC 5280 

ACAATTAAAG AAACTAATGA TAGCATTGAT GCTGCTAACG GCTTTTCGAA ATCTCTGTTT 5340 

TTAGCTAAGA AATAACCAAC CATTACAGCA ATAATCATAC CTGAAATACT TAAAGTACCG 5400 

TTTGCAATTG TTATTCCCCA ATATTGGAAT CTTGTTAATG TATCCCCTTG GAAAATCCAC 5460 

TTAAATACCG TGTTGTTCAA AAGAACGATT AAACCTGCCA AAATATATAA TGGCATTACT 5520 

GTTACGAATG CATCTCTTAG GGTTTTTAAA TGAATTTGGT TCCCTAGTTT ACCAGCAAAG 5580 

GATGGCAAAA AAATTTTTTT GGGGGGGGGG GTTATTAAAC CCCCCTTTTT AAAAAAAA 5638 
(2) INFORMATION FOR SEQ 10 NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4745 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

CCGGAAGCTG TTGCCCTTGG AACTCCAAAT GAAGAAACAG CCTTTGTCTT GAACTATTTT 60 

GGTGTGGAAG CACCACGTGT TATCACTTCT GCCAAAGCAG AGGGGGCAGA GCAAGTTATC 120 

TTGACTGACC ACAATGAATT CCAACAATCT GTATCAGATA TCGCTGAAGT AGAAGTTTAC 180 

GGTGTTCTAG ACCACCACCG TGTGGCTAAC TTTGAAACTG CAAGCCCACT TTACATGCGT 240 

TTGGAGCCAG TTGGATCAGC CTCTTCAATC GTTTACCGTA TGTTCAAAGA ACATGGTGTA 300 
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GCTGTCCCTA AAGAGATTGC AGGTTTGATG CTTTCAGGTT TGATTTCAGA TACCCTTCTT 3 60 

TTGAAATCAC CAACAACACA CC CAACAGAT AAAATCATTG CTCCTCAATT GGCTGAATTG 420 

GCTGGTGTGA ACTTGGAAGA ATATGGTTTG GCAATGTTGA AAGCTGGTAC CAACTTGGCT 480 

AGCAAATCTG CTGAAGAATT GATTGATATC GATGCTAAGA CTTTTGAACT CAACGGAAAT 540 

AATGTCCGTG TTGCCCAAGT GAACACAGTT GACATCGCTG AAGTTTTGGA ACGCCAAGCA 600 

GAAATTGAAG CTGCAATGCA AGCTGCCAAC GAATCAAACG GCTACTCTGA CTTTGTCTTG 660 

ATGATTACAG ATATCGTCAA CTCAAACTCA GAAATCTTGG CTCTTGGTGC CAATATGGAC 720 

AAGGTCGAAG CGCCTTTCAA CTTCAAACTT GAAAACAATC ATGCCTTCCT TGCTGGTGCC 780 

GTTTCACGTA AGAAACAACT GGTACCTCAA TTGACTGAAA GCTTTAATGC GTAAGATTTT 840 

GGGTGTCAGC TCAAAATCGG AAAGTCTAGT TTGCCTTATA TCGCAAGGAG TTTCGGCTCC 900 

TTTTTTCTAG GAGTGAAGTA TGTTAGAAAA TGGCGATTTG ATTTTTGTGA GAGATGGGTC 960 

AGACATGGGA CAGGCCATCC AGACTTCCAC AGGTAACTAT AGCCATGTTG CCATTTATTT 1020 

GGATGGGATG ATTTATCATG CTAGTGGACA GGCTGGTGTT GTCTGTCAAG AACCGGCAGA 1080 

CTTCTTTGAG TCCAATCATT TATACGACCT CTATGTTTAC CCAGAAATGG ATATCCAGTC 1140 

GGTGAAGGAA AGAGCTTGCA AACATCTTGG AGCACCCTAC AATGCTTCTT TCTATCCAGA 1200 

TGCAGCTGGT TTTTACTGCT CCCAGTATAT AGCAGAAATC CTACCTATTT TTGAAACTAT 1260 

TCCTATGAAA TTTGGAGwTG GGGAGCAGGA GATTAGTGAT TTTTGGAGGG AGTATTACAT 1320 

AGAACTAGGT CTGCCTGTTC CTCTGAACCA AGCTGGTACC AATCCTAGTC AGTTGGCAGC 13 80 

ATCGCCTCTG TTACAATGTA AAGAAAGGAA TCTTCATGAT TCAGATTTTT AATCCATCTC 1440 

GTTTGACGAG ACAGCCATTT TTGGAGAATT GATCCGCTAT CTGGATCAGT ATGAGGATGT 1500 

GATTCTACGG GAAATTAAGG CTCAATTTCC AGATGTTGCA GTTGATAAAC TCATGGAAGA 1560 

GTATATAAAG GCAGGCTTGA TTCTACGTGA AAATAAGCGC TATTACCTCA ATTTTCCTAC 1620 

GCTTGAATCA CTTGATAGTC TTGAACTGGA TCAAGAGATT TTTGTCAGAG AAGCTAGTCC 1680 

GGTCTATCAA GCCTTGTTGG AGCAGAGTTT TGAGACGGAA TTGCGCAATC AAATCAATGC 1740 

AGCTATTTTA GTTGAAAAGA CGGACTTTGC GCGCATTAAA ATGACCCTGT CCAATTATTT 1800 

TTACAAGGTC AAACAGCAGT ATCCTTTGAC AGAAAAACAG CAGGAGCTCT ATGACATTTT 1860 

AGGAGATGTT AATCCTGAGT ATGCCCTCAA GTATATGACG GCTTTTTTGT TGAAATTTCT 1920 

CAAAAAAGAC CAGCTTATGC AGAAATGCCG TCATATCTTT GTGGACAGTT AGGTTGTCTT 1980 

ACGCTATATT GTGCAAAATG AAGATGGAAA GTATGAGTTG GCTATCGATT TTGATAAGGA 2040 

GAGGTTAACT TTCTACTTAG CGTGATTTCT TGTTTCTGAG TACATTGTTT GACTTTCCTT 2100 
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AGTATTCGCT 


ATAAACTATA 


TGTAACCGGT 


AACACATATC 


GG AATAAACT 


AAAGGAGACA 


2160 


ATCATATGTC 


ACTTGAAAAC 


AAATTGGAAC 


AAGCAACAGG 


CGCTGTCAAA 


GAAGGTTTTG 


2220 


GTAAAGTTAC 


TGGAGACAGC 


AAGACAGAAC 


TTCAAGGAGC 


TGTTGAAAAA 


ACAGTTGCTA 


2280 


AGGCAAAAGA 


CGTTGTAGAA 


GACGCAAAAG 


GTGCTGTAGA 


AGGTGCCGTT 


GAAGGTTTGA 


2340 


AAAACGTTTT 


TACTAAAGAA 


TAGGAAAAAA 


TCAAGGGTTT 


CATTTTCCCT 


TGATTTTTTC 


2400 


TAT-TCTTATA 


AATAATTTTC 


"TGCGACGGCT 


GTATCTCCTG 


~ GGT AGGATTC 


TTTCTTGCCC 


"~2460 


TGCATGATTT 


GGTAACAATC 


GGCTCCCTTA 


CCCGCAATAA 


TAACTGCATC 


TAATTCGTGA 


2520 


TTTGTGATAG 


CCATTGCCGC 


CTTGATGGCT 


TCTTGGCGAT 


CCGCAATCTT 


TTCAACAGGA 


2580 


TGATTGATGT 


AGCTACTAAT 


TTCATCTGCA 


ATGGCCATTG 


GCTCTTCATA 


GTTAGGGTCA 


2640 


TCAGCAGTCA 


GAAAGACTTG 


AATCTCAGGG 


TGTTGATTGA 


GGAGGAGGCC 


AAAGTCCTTA 


2700 


CGACGACTTT 


CTCCCTTGTT 


TCCTGTTGAT 


CCCAGAACCA 


GAGCAATCTT 


TCCGGTTTGA 


2760 


TGAGTTTCAA 


CCACATTGAT 


GAGTTTTTTC 


AGACTATCCC 


CATTGTCGGC 


ATAGTCGATG 


2820 


AAGACCTTGG 


CTCCATTTTT 


CTGAGTGAGG 


ACTTCCATAC 


GACCAGGAAC 


GCGGGTTGCA 


2880 


GCGATGCCTT 


TTTTGATGTC 


CTCAAGACTT 


GCTCCGAGAC 


GGAGACAAGC 


AAGTCCAGCA 


2940 " 


GCAACTGCAT 


TTTCTTGGTT 


GAAGTTGCCA 


ATGAGTTGAA 


TATCATAATC 


TCCAGCGAGT 


3000 


TTACCCGTAG 


CTGAAAAGCT 


AAAGGCTTTG 


GAATTCTCGA 


TTTGGTTATC 


AAATTGGCTA 


3060 


CCATAGAAAT 


CATGCTCTTG 


ATCTTCAACC 


TGTTCTTTCA 


AGACTGAGAA 


GTGGTCCATG 


3120 


TCACTGTTAA 


TGATGACTCC 


TCGGCTCTTT 


TCCATCAAGA 


GACGCTTGTG 


GTAGAAATAG 


3180 


TCTTCAAAGC 


TAGGGTGTTC 


AATCGGGCCG 


ATATGGTCTG 


GGCTGATATT 


TAGGAAAACT 


3240 


CCCACATCAA 


AGGTTAGACC 


ATAGACACGT 


TTGACCAGAT 


AGGCTTGACT 


GGAGACTTCC 


3300 ' 


ATGATGAGGT 


GGGTACGGTC 


ATTTTGCACA 


GCCTGATTCA 


TCATCTCAAA 


GAGGTCAATA 


3360 


CTCTCAGGGG 


TTGTCAACGC 


TGACTTAAAG 


AAAGTCTCGC 


CATCAAGAGT 




J *i £ U 


GTCGACAACA 


TAGCAGGTCT 


ATGCCCTTGA 


GATAAGATGT 


TATAGGCGAA 


ATAGGCTGCT 


3480 


GTTGTCTTAC 


CCTTAGTACC 


AGTAAAGGCA 


AGGAGTTTGA 


GTTTTTCCTG 


TGGATTACCA 


3540 


TAGAACTCCA 


TGGCAATCAA 


ACTCATGGCT 


TTCTTTATAT 


CGTTCACAAT 


GATGACAGGG 


3600 


ATACCGACTT 


CGTAGTCCTT 


TTCAGCTACA 


TACCAAGCTA 


ATCCTTGTGT 


TATAGCAGAA 


3660 


AGAAGGTATT 


CTTTTTTAAA 


GGCAGCGCCT 


TTTGCGAAAA 


AAAGAGTGTC 


TTCTGTTACT 


3720 


TTTCGGCTGT 


CGTAGCTGAT 


GCTATCAAAA 


ATAACTTTGC 


TGTAGTTGTA 


GTGGTAATGA 


3780 


CCTTGGTCAA 


TAATTTCGCG 


AAAAAGGCCA 


TCTTTCTTTA 


AAATATCTAA 


TACGGTTTCA 


3840 
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ATCTTAATCA 


TACTTTCTAT 


TGTAAACCGA 


AAGTCGTAAA 


TTTACAAGTA 


ACAAGGAAAA 


3900 


GTTTATAATG 


GAAGATAAGG 


AGTTTTTCCT 


AGTTATCAAA 


ATTGAATGAG 


GAATCTATCT 


3960 


CGCACGAAAA 


CAATCACCAG 


CAGGCCCAGA 


TGTTACGGGG 


GACTGCTTGG 


CTAACGGCTA 


4020 


GTAACTTTAT 


CAGTCGCCTA 


CTCGGGGCTG 


TTTACATTAT 


CCCTTGGTAC 


ATCTGGATGG 


4080 


GGGCTTATGC 


AGCTAAGGCA 


AATGGTCTCT 


TTACCATGGG 


TTACAATATC 


TATGCTTGGT 


4140 


TCTTGTTGGT 


TTCAACAGCG 


GGGATTCCAG 


TTGCGGTGGC 


CAAGCAAGTT 


GCCAAGTATA 


4200 


ATACCATGCG 


AGAAGAAGAG 


CATAGCTTTC 


CCCTGATTCG 


GAGCTTCTTA 


CCCTTTATGA 


4260 


CAGGACTAGG 


CCTGGTTTTT 


GCTTTAGTCT 


TGTATGTCTT 


TGCTCCTTGG 


CTAGCAGACT 


4320 


TGTCTGGCGT 


GGGCAAAGAC 


TTGATCCCAA 


TCATGCAAAG 


CTTGGCTTGG 


GGAGTCTTGA 


4380 


TTTTCCCGTC 


TATGAGTGTT 


ATCCGAGGAT 


TTTTCCAAGG 


GATGAATAAC 


CTCAAACCCT 


4440 


ATGCCATGAG 


CCAAATTGCT 


GAGCAGGTCA 


TTCGTGTTAT 


CTGGATGCTC 


CTAGCAACCT 


4500 


TTATCATTAT 


CAAGCTCGGT 


TCAGGAGATT 


ATCTAGCAGC 


CGTTACCCAA 


TCAACCTTTG 


4560 


CTGCCTTTGT 


CGGTATGGTA 


GCCAGTTTTG 


CAGTCTTGAT 


TTATTTCCTT 


GCCCAAGAAG 


4620 


GTTCACTCAA 


AAGAATCTTT 


GAAACAGGAG 


ATAAGATTAA 


CAGTAAGCGT 


CTCTTGGTTG 


4680 


ATACCATTAA 


GGAAGCCATT 


CCTTTTATCC 


TGACAGGGTC 


TGCCATCCAG 


CTCTTCCAGA 


4740 


TTTTG 












4745 



(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1900 base pairs . 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 219: 
CCTGATTGAC CTTATAATAA GGAACAAAAC ACAATGCACT ACCTTTTCAA CAAAAGAGTT 60 

GCTGCTTGAT TAAAACCATC ACACCAGTTA TACCATTTTG CTTCATACCC ATCTTGAGCT 120 

AGGATACGAT CTTCTAAATC AAAAACAGAG TAAATCTTTC TTTCCTCGCA AGCTTGCCCA 180 

TAGAGATGAT ATAGTTCATC ACCACCATCT CTATCCCACT CAGCAGAAAT CGTATCCCGA 240 

CCTGCCAATA AAGCCTGATA AGCCCTGTGA TGCCCATCTG TAATCAGCAA ACAATCTCCA 300 

AAGGCAAGAA TACTGATTGG ATCGACTTGG ATTGTTTCTG CCGACTGGTA AAGCATCTGA 3 60 

ATATCTTGCA ACTTCTTTTC TGATAAATAT AGTTGAGTCA GATGAAGATC TGCTATATTG 420 

ACTTTCATTT CTTTCTCCTC AAGGGAATTC GATACTCACT TCTGTTTGCC TTTAAATCGC 480 
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CATTGGAAGC GGAgCTTGTC 


ATAAAAGGGA 


AACTCGATAA 


ACAGGACTCC 


CAAGCCCACA 


540 


CAGAGACTGG 


CAACGACCTC 


TGATGGGTAA 


TGAACTCCCA 


GATAGACTCT 


TGATACCAGC 


600 


ACACTGACTA 


GGTAGAGGCC 


AAGGACGATT 


TGTACGATTT 


TTCTCCAGAC 


CTGATCTTTA 


660 


ATCCGCTGAC 


TAAGAATAAC 


AATCAAAGTC CCTACCATCA GCGTTACACC 


TAGAGAATGC 


720 


CCACTTGGGA 


AGGAAAATCC 


CTTCTCCTCC 


ACCAGATGTA 


AAATACCTCG 


TCGTGGGCGC 


780 


TGGTAGATAT 


TTTTAAAGGT 


CACGATTAAA" 


AGACCTGCCA 


AAGCCAG ATT 


TCCCAGCATC 


840 


AAGAAACTTT 


CTATCTTCCA 


TCGCTTACGA 


TAAAAGACAA 


AAGCTGTAAT 


GACAACCCAA 


900 


GTGATAATCA 


CTGGGATATC 


AATCAGACGT 


GTGAGGGCTC 


GAAAAAGAAT 


AGTCAAATAA 


960 


TCTGGTAAGT 


CTCCTCGAAT 


GGCAGTCTGA 


ATCGATTGGT 


CAAAATTGAC 


CAACATTTCA 


1020 


GGGTAAAATT 


TGACCATGTA 


GCCAAGAATA 


ACGAAAAGTA 


AAAGCGCAAA 


ACTGCCCTTC 


1080 


ATTAAAAATG 


TTTGTTTATC 


TCTCATAATG 


TTTTAAGGTT 


GGTTTCAAGA 


GAACATACAA 


1140 


CAACCAGAAT 


GAAACGGAAA 


AGATAACACC 


TTCAATCAAG 


TTAAAAGGTA 


ATACCATGGT 


1200 


CATTAGGTAG 


TTGGAAAGTC 


CCAAAATTTT 


TCCAATATCA 


AAGTTAGCAA 


ACTTAGCGTA 


1260 


CAAAGGAACA 


GCATAAACAT 


AGTTGAGAAC 


CAACATGGCC 


AAGGTTAAAC 


CAATAGTTCC 


1320 


AGCTAGAGAG 


CCTAGTAGGA 


AACGAAGGGT 


TGTCCGTTCC 


TTTTTCCAAA 


TCAAAGCAAA 


1380 


TACGATGACA 


AAAACTCCCA 


AAGCTACGAT 


ATTCATCGGC 


AAACCAATGT 


AAGTATTCAC 


1440 


TCCTTGGCTG 


TTAAGAAGCA 


ATTTCAAGAG 


TGAGCGAAGC 


AAGAGCACTC 


CTAGAGmCsC 


1S00 


AGGCAAATCC 


ATGACCACCA 


GACCCACAAG 


GACTGGCAAG 


ATACTAAATT 


CGATCTTGAG 


1560 


GAAAGATGCC 


GCTGGTAAAA 


GCGGAAAGTC 


AAAGTACATC 


AGCACAAATG 


AGATGGCTGA 


1620 


TAGAATTGCA ATGGTCGAAA 


GTCGACGTGT 


GTTTGTCATA 


ACAGGTTCCT 


CCAATTTTCT 


1680 


ATAAAATCAG 


AAGAAGTTGG 


AAAGGATTCC 


TCTATCTATT 


CTCACTTTTT 


ATATCCCAAA 


1740 


AGTTCCCTCT 


TACT CT ATT A 


AAGAAAAACA 


AAGCAAGTGG 


TTACAATCCG 


GCTATAAATC 


1800 


TATCAAAACA 


GACAAGGCTA 


TTCTTTCGTC 


TTCTCCCATC 


CAGACTATAC 


TGTCGGTTGT 


1860 


GGAATCTCAC 


CACATCACGT 


TGCGCTCACG 


GACTTCTTTA 






1900 


(2) INFORMATION FOR SEQ ID NO: 220: 









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4692 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEONESS : double 
{D> TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 



GCTTTTCCAC 


CAGGAGCTTC 


TCCTTTATCA 


GAATGACCAT 


CCCATCTGCT 


CACGATAGAT 


60 


GAATAATGAT 


ATTTTTTACC 


ATGATAGTAA 


TTTGAAAAAG 


CCTAACCACC 


TCCTGAACCT 


120 


TCTCCATATG 


TCCATACTCC 


TCCATCTGGA 


TATTATACAC 


CAGCTGATGC 


AGCTCCCAAT 


180 


AATGTAAAAC 


TTGAAATAAG 


AGCTACAGCA 


AGTAATCTAT 


GTTTTTTCGT 


TTTCATTTTA 


240 


TTTTTTCTTT 


CAAAAAAAGC 


ACACCTTGAG 


CAACAATGCA 


ACAAAATAAA 


TCCTCCTCTC 


300 


TCTTTTATTG 


AAACCGCTTT 


CTTATGTGAT 


AAGAATAACT 


TTTTTATtAT 


TTGTTGTCAA 


360 


GGAAAAAATC 


GAATTTTTTA 


GATATTTTAC 


TATATTACCT 


CTGTGAATAA 


TATTATATAG 


420 


TAGTTTTATT 


TCAAAATAAT 


ATGCAACCAG 


TACTAACCAA 


ATATAAAATA 


GATGCCATTA 


480 


ACGAATTTTA 


TTCAAGTTTT 


TCCCATTCAT 


ACTATACAAG 


T AAAAG AG AT 


GGTGTTAACT 


S40 


AAAAAGCAAT 


TCAAACTATT 


GTAAAATTCC 


TAGCAAAAAG 


AGAGCCGAAA 


CTCTCTTTTT 


600 


TATCTTCTTT 


TACTTTTTTT 


GACTGGCATG 


AGTGTGATGT 


CTCTAACACT 


AAAGTAAGCT 


660 


AGGATCAACA 


TGGCTATTGC 


TAGGAATATT 


TCTGTTGGTA 


ATTGAAAAAT 


TTTCAGAAAA 


720 


GATAGAACCA 


ATAAAATCAA GAGTGCCACT 


AAAATACATA 


CCATAGCGAC 


GATATTGACA 


780 


GTCCCTTTAA 


TGCTTTCTGG 


TGTCGCAAAT 


ACATAGAGTA 


GGAGCAGTAA 


AATTCCTAGG 


840 


ACTAAATAGA 


CCATCTTTCT 


CTCTTTCPAG 


CTCTTATTCA 


GCTGATTTTT 


TCTTCTTGTT 


900 


AGCTTTCTCA 


CGCTCTGCTT 


TGTTAAGGAT 


TTGTTTACGC 


AAACGGATAG 


ACTCAGGCGT 


960 


TACTTCCATG 


TACTCATCGT 


CGTTCAAGAA 


CTCAAGAGAC 


TCTTCAAGTG 


TCAAGATACG 


1020 


AGGCGTCTTG 


ATAACAGCTG 


TTTGGTCCTT 


AGTAGCTGAA 


CGAACGTTGG 


TCATTTCTTT 


1080 


TGCCTTCGTG 


ATGTTAACTG 


TCAAGTCATT 


TTCACGAGAG 


TTTTCACCGA 


TGATCATTCC 


1140 


TTCATAAACC 


TCAGTACCTG 


GGTTGACAAA 


GATCGTACCA 


CGTTCTTCGA 


TAGACATGAT 


1200 


TGAGTAAGTT 


GTAGCCTTAC 


CAGCATCGAT 


AGAAACAAGG 


GCACCACGGT 


GACGTCCACC 


1260 


AATTTCCCCT 


GGAATCAATG 


GCAAGTATTG 


GTCGAAGGTA TGGTTCATGA 


TACCGTAACC 


1320 


ACGAGTCATT 


GATAAGAACT 


CAGTTGAGTA 


TCCAATCAAA 


CCACGCGCTG 


GAACAAGGAA 


1380 


GACCAAACGA 


GTTTGACCAT 


TACCAGTTGA 


AATCATATCC 


AACATTTCAC 


CTTTACGTTC 


1440 


AGAAAGGCTT 


TGGATAACAG 


ACCCTTGGTA 


TTCTTCTGGA 


GTGTCGATTT 


GTACACGTTC 


1500 


AAATGGTTCA 


CATTTAATAC 


CGTCGATTTC 


TTTTACGATA 


ACTTCTGGAC 


GAGATACTTG 


1560 


AAGTTCATAG 


CCCTCACGAC 


GCATTGTTTC 


GATAAGGATT 


GACAAGTGCA 


ATTCTCCACG 


1620 


TCCTGAAACA 


GTCCATTTAT 


CTGGTGAATC 


AGTTGGGTCA 


ACACGAAGGG 


AAACGTCTCT 


1680 


TTGCAATTCT 


GCCTGCAAGC 


GTTCTTCCAC 


CTTACGAGAA 


GTTACCCATT 


TACCTTCTTT 


1740 
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ACCAGCAAAT GGTGAGTTCT TGACCAAGAA AGTCATTTGA AGAGTTGGCT CATCGATGTG 1800 

TAGGATTGGA AGAGCTTCTA CTGCATCTGT CGGAGTGATG CTTTCACCGA CAAAGATCTC .1860 

TTCCATACCT GAAACGGCAA TCAAGTCACC CGCTTTGCCT TCTTGGATTT CACGACGTTC 1920 

CAAACCAAAG AAACCGAAGA GTTTTGTAAC ACGGAAGTTT TTAGTTGTAC CGTCAAGTTT 1980 

AGAAAGGGTA ACTTGGTCCC CAACCTTAAC TGTACCACGG AAGACACGAC CGATACCGAT 2040 

ACGTCCAACC~ AAGTCATTGT AGTCCAAAAG TGACACTTGG AACTGCAAAG GCTCATCTGA 2100 

GTTATCTACT GGAGCTGGCA TATGGTCGAT AATCGTGTCA AAGATTGGTG CCATAGTCGC 2160 

TTCTTGGTCA GCTGGATCAT CTGACAATGA AGAACTTCCG TTGATCGCTG AAGCATAAAC 2220 

CACTGGGAAA TCAAGCTGGT CGTCATCTGC ACCAAGCTCG ATGAAAAGTT CCAAGACTTC 2 280 

ATCCACTACT TCTGCTGGAC GAGCTGATGG CTTATCGATT TTGTTAACAA CCACGATTGG 23 4 0 

GACAAGGTCT TGTTCCAAGG CTTTTTTCAA TACGAAACGA GTTTGTGGCA TGGTTCCTTC 2400 

ATAGGCATCT ACGACCAAGA CAACACCGTC AACCATTTTC ATGATACGCT CAACTTCTCC 24 60 

ACCAAAGTCC GCGTGTCCTG GTGTGTCCAT AATGTTGATA CGAGTTCCGT TGTAAGCAAC 2520 

GGCAGTATTT TTAGCAAGGA TGGTAATTCC ACGCTCTTTT TCGATATCGT TTGAGTCCAT 2580 

AGCACGCTCT GCCAATTCAG TCCGTGCATC AAGCGTTTCT GATTGTTTCA ATAATTCCTC 2640 

AACCAGGGTT GTTTTACCGT GGTCAACGTG GGCGATAATC GCAATGTTAC GGATATCTTC 2700 

TCTTAATTTT GTCATGATTT CCTCTATAAT ATTCAAAATT TATTTTCTAA CTGAACGATT 2760 

ATACCATAAT TTCAAATAAA TAACATAACT CAAGCAAGTG TAAATGTTTT CACTCTCCTT 2820 

TTCTTTTCAC GTCAAGCCTT TTCAAAGCGA GCGACTTATG ATAAGATAGG CACAGTATGC 2880 

GTTTAGATAA TTTATTAGCT CAAGAAAAAA TCAGCCGAAA GGCCATGAAG CAAGCACTCC 2940 

TCAGAGGGGA AATTCTAGTC GATGGTTGCC CAGCCCGCTC CCTAGCTCAA AATATCGATA 3000 

CAGGACTACA AGAACTCCTT TTTCAGGATC GAATCATTCA AGGCTATGAA CACACCTATC 3060 

TTATGCTTCA TAAACCTGCT GGTGCCGTTA CAGCCAACAA AGACAAGGAA CTTCCGACCG 3120 

TCATGGACCT GCTTCCATCT AACATCCAGT CTGACAAGCT CTATGCCGTT GGCCGACTGG 3180 

ACCGAGATAC AACGGGACTC CTCCTCTTGA CCGATAACGG TCCCTTCGCC TTTCAGCTCC 3240 

TCCATCCCCA ATATCATGTC GATAAGACTT ACCAAGTTGA GGTTAATGGA CTTCTAACAC 3 300 

CTGACCATAT CCAAACCTTT CAAAAAGGAA TTGTCTTTTT AGATGACACT GTCTGTAAAC 3360 

CCGCAAAACT AGAGATTCTA TCTGCAAGTC sCTCCCTCAG TCAAGCCTCT ATCACCATTT 3420 

CAGAAGGAAA ATTTCATCAA ATCAAGAAAA TGTTCCTCTC GGTTGGTGTT AAGGTGACTA 3480 
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CCCTCAAAAG AATCCAATTT GGGGACTTCA CATTGAACCC AGATTTAGCA GAACGTAACT 3540 

ACCGCCCTTT GAACCAAAAA GAGTTACAAA TCATTAAAAA CTATTTAGAC ATGAGTCGAT 3600 

AAAACAAAAA AAGCTTTAAA ACTAAACCTT TTTTCTTTTA TTTACCGAAA AATTAAGGCG 3660 

ATTGCTACAA TCCAGTTAAC TACAGAAATC ACAATTCCTA AGATATTAAG AATCTTTTCT 3720 

ATTTTATAGT CTAATTGTGA CTCTTTTTGG TATGAAATAG GCAAGACCAA TCCTATGATA 3780 

CCCAAAATCA GGCCTACAAT TGGAAATAAC AAACCAAGAA TAATCGACAA GATACCCACA 3840 

AAAAGTGGAT TTTTCTTCTT TTCTTTTATG TTCTAAGAAC TCCTTAAATT TTATACAAAT 3900 

TAATTATACT ATAAAACAAT AGCTTCATCC TATCATTCGA CTAATTTGGA AATAAGGTTA 3960 

GCTAGTCTTC ACTTTCCCTT TCCAAGAATC CAAGCCATAA GAAAGGATAT AAATCTCAGA 4020 

AAAACCTTCT TTTTTCAAGT AAAGAGCTGC ATTTCTAACT CGTTGCGCAC GTTGGTTTTC 4080 

GTAGAGAAGG ACAGCTTTAT CTTTACGAAG GGCTGCAAGA CTAGTTTTCA ACTGACTTGA 4140 

AGGAATATTG CGTGCACCAA GGATATGTTT TCTGTGGAAT TCTGCTGGGT CGCGCAAATC 4200 

AATCAATTGA CCCGTACGAA TCAAGGCTTC AAACTCCTCA TTGTCCACAA TTTTAGCCGC 4260 

ACGGCGAATA CGAAGATAGT TAAAGCCCAT CCACGCCAAC ATTGCTAGTA TAAGTGCCCA 4 3 20 

CAAAATCCAA GTAACCATTA GTTCTTTTCT CCATTTTTCT CAATATAATC CAATTCTACC 4380 

TTGTGCTCTC TGCGAAGAAC TGCTTCTGCC TCTAGATAGT CTAATTTATC CATCAACCCT 4440 

GCATCGTAAA TCCGAGATAG TTCCAACTTC ATCAGTTCAA TATCATATAA GCGTTTTCCC 4500 

ATGTAAACAA TAATACCAAA TCGTTTGAGG AATTGCTGCA CATCATAGAA TGTTTTCATA 4560 

AGACTCATTC TAGCAAAATT TTGTGTTTTT TTCAAGAAGA GACTCACACA ATGCTCCTTA 4 620 

TTTTCCTATC TTCTTTAGCG ATTCTAAGGC AAGTATGGTA CAATAAAAAC ATGGGGATTC 4680 

AACAATTACA TT 4692 
(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 

(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

GCTAAAAAGC TGATAATCTT CGACTC CTGT ATATGATGTG TCTTTTCATG TAAGACACGC 60 

GCCGCCAGAA TCATGGCAAG AGCTGCAAGA CTGGCAAGTA AGAAGCCGAT AAGATAGGCA 120 

AAAAGATAAG TGAATTTGAC AAAGAAAGTC AAAAGAACTA GGAAACCAAA GCCTCCTCCA 180 
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AAAACTACCA AACTCTTTCG TAAATCCCAG ATTTTATCCA ACTGCTTGAC GAGGGAAGTC 240 

GTCTGACGAA CGCCTACAAT ACTTGCTAAC ATACTTCCTA AAAAGAATGG ATAGACATGA 300 

GTTAAACTGG AGAAATAAAC AGAGGAATAA GAGCTCACTA GAAAACTACC AATAAACATG 360 

GAGAAGAAAC TGATCAAGAA GGCAACAGCA GATAAGAGAA AGACCATCCC CTTCAACTGA 420 

CCATTTGATT TAGCTTGTTT GGATAAGAAC CAAACTGCCA ATCCCCAAAG AATATAGTAC 480 

TGAACCTCAA CTGCCAAACT CCAATTATGA ACAAACAAAT CAGGAATGAA CTG AGATTCA 540 

TAACTCCCAC CTGTTAGGAG TTCATAGAAG TTGGTCATAA AGCCTAAGAC GCCCGCAATC 600 

TGGCCACCAA TTCCAGCAAC ATAGTCTTGG CGAACCAAGA AAGTAAAAGG CATGGTCACC 660 

AAGACCATCA AAACCACAGG TGGCACAATC TCGATAAAAG CGTCTT 706 
(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3236 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 



CAGCTCATGG 


GCAATATCAG 


TCATAGAAAT 


TTTTTCAATT 


AACTTTTGAG 


CAATTTTTTG 


60 


GTTGATGATA 


CGAGGGATTT 


GGTGATTTTT 


CTTTACCAGG 


GGAGTCTCAG 


CAACCATCAT 


120 


TTTTGAACAG 


TGATAGCACT 


TGAAACGGCG 


TTTTCTAAGG 


AGAATTCTAG 


AAGGCATACC 


180 


AGTTGTTTCG 


AGGTAAGGGA 


TCTTAGACGG 


TTTTTGAAAG 


TCATATTTCT 


TCATTAGACT 


240 


TCCACAATCA 


GGGCAAGATG 


GAGCCTCATA 


ATCCAGCTTA 


GCGATAATTT 


CTTTGTGGGT 


300 


ATCCATATTG 


ATGATATCTA 


GAATCTTGAT 


GTTTGGGTCT 


TTAATATCGA 


GCAGTTTTGT 


360 


GATAAAATGT 


AATTGTTCCA 


TATGATTCTT 


TCTAATGAGT 


TGTTTTGTCG 


CTTTTCATTA 


420 


TAGGTCATAT 


GGGACTTTTT 


TTCTACACAA 


AAATAAGCTC 


CATAATATCC 


ATAGGGGATT 


480 


TACCCACTAC 


AAATATTATA 


GAGCCCGAAA 


ATATGGGAAA 


ACTGATCCTT 


GTTTCTGCTT 


540 


TTGTCTATAG 


AAGAATAATA 


AAGATTATCT 


TCTTCAAATT 


CTCCGATATT 


CTCTAAAGTT 


600 


TTGTGCAAGT 


TGCACAGAAC 


TTGTTTATTT 


TTTTGGTCAT 


CTTGCCATAG 


AAATATAAAG 


660 


CGTTTTCATA 


TATAATATAA 


TTATCAAAAG 


ACAAAAGGAG 


TTCACCTCAT 


GGTAGAATTG 


720 


AATCTTAAAA 


ATATTTACAA 


AAAATATCCA AACAGCGAAC ACTATTCAGT TGAAGATTTC 


780 


AACTTGAACA 


TCAAAGATAA 


AGAATTTATC 


GTTTTCGTAG 


GACCTTCAGG 


ATGTGGTAAA 


840 
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TCAACTACAC TCCCTATGAT TCCTCGTCTT GAAGACATTA CAGAAGGTAC TGCATCTATC 900 

GATGGCGTAG TTGTCAACGA CGTAGCTCCA AAAGACCGTG ATATCGCCAT GGTATTCCAA 960 

AACTACGCTC TTTACCCACA CATCACTCTT TATGACAACA TGGCTTTCGG TTTGAAATTG 1020 

CGTAAATACA GCAAAGAAGA CATTAACAAA CGTGTTCAAG AAGCAGCTGA AATACTTGGA 1080 

TTGAAAGAAT TCTTGGAACG TAAACCAGCT GACCTTTCAG GTGGTCAACG TCAACGTGTT 1140 

GCCATGGGGC GTGCGATTGT CCGTGATGCG AAAGTATTCT TGATGGACGA ACCTTTGTCA 1200 

AACTTGCATG CCAAACTTCG TGTATCAATG CGTGCTGAAA TCGCTAAAAT TCACCGTCGT 1260 

ATCGGAGCTA CAACTATCTA TCTAACTCAC GACCAAACAG AAGCGATGAC ACTTGCAGAC 1320 

CGTATCGTTA TTATGTCAGC TACTAAGAAC CCTGCTGGTA CAGGTACTAT CGGACGTGTA 13 80 

GAACAAATCG GTACTCCTCA AGAAGTTTAC AAAAATCCAG TTAACAAATT CGTTGCAGGA 1440 

TTCATCGGAA GCCCAGCTAT GAACTTCATC ACCGTGAAAT TGGTTGGTAG CGAAATTGTT 1500 

TCTGACGGTT TCCGTTTGAA AGTGCCAGAA GGAGCATTGA AAGTTCTTCG TGAAAAAGGC 1560 

TACGAAGGAA AAGAATTGAT CTTTCGTATC CGTCCAGAAG ACGTGAATGC AGAACCTGCT 1620 

TTCCTTGAAA CATTCCCAGA CTGTGTTGTA AAAGCGACTA TCTCTGTATC AGAACTGCTT 1680 

GGTTCAGAAT CTCACCTTTA CTGTCAAGTT GGTAAAGACG AGTTTGTTGC AAAAGTTGAT 1740 

GCTCGTGACT ACTTGCAAAC AGGTGCAACA GTTGAGCTTG GATTTGACTT GAACAAAGCA 1800 

CACTTCTTCC ATGTAGAAAC TGAAAAAACA ATCTACTAAA ATAAATAAAA TTCAAAGCAC 1860 

TACAAGAAAA GATATCTCTT TATCAATTGT AGTGGAGAGA TATCAGTTAA TCTAGGGAGA 1920 

GAAACAAAAT GCTTCTCTCC TTTTTGCTAG AGAAGTCATA TTATGCATCT ATATTGTGAT 1980 

GCTCTTTAAT ACTCTTCGAA AATCTCTTCA AACCACGTCA ACGTCGCCTT GCCGTACCTA 2040 

TGATTACTGA TTTCGTCACT TTTATCTGCA ACCTCAAAGA TGTACTTTGA GCAGCTTACG 2100 

GCTAGTTTCC TAGTTTGCTC TTTGATTTCC ATTGAGTATT ATTTCTGGCT ACCATCTACA 2160 

AGTGAAGCTA TATGCGTAAA CTACCTGAGC AATTGAATTC GAACTAGAGA GGTAATAATA 2220 

AATTTATGCT ATAGTTATGG TGACTTGTAT GCTTTTGATT CTAGTTTATC AAATAATAGA 22 80 

TTAGAATTGT CAGATAATAT CATTTTGTGT TATAATGAAG AAAAAACAGA GGTGTTCAAA 2 340 

TCTCAGAACC AGGTCATAAG TTTTTAGCAA AATTGGGGAA AAAACGCTTA CGTCCAGGTG 2400 

GAAAGCGTGC CACAGATTGG TTAATTGCAG AAGGAGGATT TTCAAAAGAA AAGAGAATAC 24 60 

TAGAGGTTGC GTGTAATAGG GGAACTACAG CAATTGAGTT GGCACAGCGT TTTGGTTGCA 2520 

AGATAACTGC TGTTGATATG GATGCTCAAG CTTTAGAAGT GGCTAAAAAA TCTGCTGGAA 2580 

CGGCAGGTGT TGCTCATTTA ATCAGTTTTG AAAGAGCAAA TGCAATGAAA CTTCCTTATC 2640 
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AAGATGCTAG TTTTCATATT GTTATAAATG AAGCTATGCT GACTATGCAA CCCGATCAAG 2700 

CTAAGAAAAA ATGTGTAATG GAATATCTAA GGGTATTAAA ACCTGGAGCT CTTCTCTTGA 27 60 

CACATGATGT GCTTCTTAAG GAAGCTAAAG AGTCTATCAG ACAGGAATTA TCACAAGCAA 2820 

TTCATGTAAA TGTAGGTCCT TTAACTCAAG ATGGTTGGGA ACAGGTCATG ATAGAATCAG 2880 

GTTATTGTGA TGTGAAAGCA TTGACTGGTG AAATGACATT AATGAAATTA TCGGGTATGA 2940 

- TTTATGACGA-AGGTTTGCTA GGAACTTTGA AAATTTCTGT AAATGCTTGT AAAAAGGAGA 3000 

ATAGAAAGCA GTTTTTAACT ATGTATAAAA TGTTTGCTAA GAATAAACAG AAATTGGGCT 3060 

TTATTGCGAT GGCTAGTTAT AAATCGTCAA AACGTTACAT AATTATTGAA GTTAACTTTT 3120 

CCTTTTTTCT TTCTTAAAAA ATATGCTATA ATAGAGAGTA AAAAACTTTG AAAGAAAGAA 3180 

AAAGATGAAT TTAAAAGATT ACATTGCAAC AATTGAAAAT TATCCAAAGG GTACCG 3 236 
(2) INFORMATION FOR SEQ ID NO: 223: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

CCTGACTTTT CAAATTGGTT AGTTTGCCAC ACTTGGTTTA TATGGTCGTG GAAAGCATGG 60 

CTATTACTTC TCAAAGGCCG ATTTCTCACC CCATGAAAAG TGTCTATTTT TGTTTAGGTT 120 

TGTAAGTTAA TTCATTGTCA CATATTACTC TTTAACTGAT TGAGTGAGTA CCGCTTATAT 180 

TTGATGCCAA ACGCCTTAAA AGTGTTACCC TCAAGTCCTT TTAGAATACG GCTATAATTC 240 

CGCTCATTGT AAACTATCTT AAGCTCATCA CTATCTAGGT TGGTATTAAA AATGGTATTT 300 

TCACGATTGT TTAGCACCTC AAAGAGTAAA TCCTGCTCCC AGTCACTCTT AGGCTTAATA 3 60 

ACAGCATTTT TTGCTCCTAA ATCATCAATA ATTAAGTAAT CAACAGACTT CATGAGTTCA 420 

GTAGCTTCAA ACTCTGTAAG TGTTGCACCT TTACCATAAT TCCACCCCTC TTTAATTTGT 480 

TTGATCATTT CGGTTAGGCT TACAAAAACC ACACTCTTAG GTTCTCCTTT TGTCTTATAC 540 

CCCTCATTTA TACCTTTGGC AATAGCAACT GATAAAAGTG TTTTTCCAAT CCCTGTACCT 600 

CCTGTGATAA GCGTATTTCC CCTCATGCCA TCAAGATATT TTTGTACCTG ACCTTTTGCA 660 

AATTCTAAAA ATCGCTTTTC TTCTGATGTT ACAGCATTAA AATCATCAAA AGTTTTAGTT 720 

TTAAACTCAT CTGCTACATA GCTCTTATTG CTCATCAACA CATTATAAGT TTGCATATAT 780 
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AGTTTACCAT TCAAATTATC AGCAATCGCA TCTT CT TCAT CTTCCTTTTT CTGTTCTTCT 840 

TGGCATTGTT CACAATACCG TGGGATACAC CGAACTTCTT TTATTGCCTC TCCGTTCTCA 900 

TTCCACCCCA CTACTACATG TCTTTCTCCT TTGATTTCTG TTAGCTGTAT TTCATGCTTA 960 

GGACACAATT CGTCTAGTTT AAATGTCTCA ATATTTCCTA AACTAGATTG TAATGATTTC 1020 

ATTTTCTGAC CTCCTAAAAT GGTTTTTCTT GTGTTGGTAT CCAATCTTCA TAGCTGGTAG 1080 

GCTCTAGTTG ATTGGTTTGC TGTTTTTTAG CCTCACGCGC TGCCCTGCTA TTTCTAACAA 1140 

GTTCCACCGT CAATAAATTG TCCTGTTTCC AACGGTTAAG CATTACCTTG ATGTATGCAA 1200 

AGTTTGCTTT ACCCTGACTG ACAGCCTCTT TTAACGCCTC ATGGATAAGC TCTGGGCTAA 12 60 

AATCTTCTAG CATATACTGC AATTCTTGAA TCTGTAACGG TGACAATGCT TTACCTGTCT 1320 

CAGCTCGCTT CATATTCAAC AAGTCGTCTA TTTCCACACT GGTTACTTTT TTATTTACAA 13 80 

AATCAGAAAT CAGTTGAAAA ATGTTTGGAC TTTGTAGCTG GATTTCAGCC ATTACCTCAT 1440 

CAAATTCTGC TTGTGTCATG TTGTCTAAAT CTAGTGTCAT TGCATTGCCT CCTCAAACTT 1500 

CTCTATAAGA CAACTTTTAT TTGCTTTCTG AGTTCCATTT TTAGAGTTAA AAAGAATATC 1560 

TTTTAAGGTT ACAGTAGCCT CTAAATACTC CTTTTCAGCA TGCTCTATAT ACGCCTGTTC 1620 

CTCTGCTTCG TTCTCAAAAA AGTGCTTAGC TTGGCGTTTA AAGAATGCTT TTCGCATAGC 1680 

GTCCATTTCA AAAATACCAG GCGCGAAAAA CATTCCCGTA GTGCTTTTAG AGACCGCTTC 1740 

GATTTTATGG CTTTCATTCA ATTCAGGAAG TTCAATCCAA AGTAAACGGG ACAACTC AT C 1800 

TTTGATGGAT TTTGTCTGAC TTTCCAATAA AGAAAGGATT CTTAGGCCAT TTTCTTCGCT 1860 

AATTTCTCGC ATTTCTGCGC TAATTCTGTC TATACGTCTA GTTAAATTCT CATATGTTGT 1920 

TTCTGTCATG TTTTTACCTC TGTTTCTTTG TTGGTGTGAT TTTTTAGCTT ATTTTTTTAC 1980 

TTCTAAACAT CATTGTCTTA ATTTCCTGAT AACTCATTTT CAATTCAATC ATAGCTATTG 2040 

CCATATCCTC AAATGCCTGG TACTGCTCCA ACTCCTCACT AGTCAAGCTA TCGATACCGT 2100 

TATAGCCCCC ACGCTCTTCT CTTAACTGCT TAGCGTTCAT GTCTGTTACT GCCTTTAGTA 216C 

GCAAGTTCTT CATGGTGCTA TGCGCGTGCT TTGGTGCATT AGGCCATCTT TCTATACTGT 2220 

CATGCAAGGT TTTTCTTTTC GGTTTTTCTA GCGCCCTCTG CAGACGAATT TCAGAAAGTT 2280 

CCTCACGCAT TTCAAAGAAT GCTTTGACTA GGTTTAGTTT GAATTGCCGT ACTGTTTCGG 2340 

TATTCTTTAA ATAAGTGATC AGAAAAGTAG CCTGTTGCTC GTTCAGAATA TAGGATTTTT 2400 

TAGGTTGTCC TCTAGTATCT AATTTATGGA TTTTAAATCC AAGTATTCCC AACTCTTCAA 2460 

AGTCACCCTT ATTTTCTCTT ATTAAGCGCG TGATAGTGTG GTGTTGTACT TCAGCACATT 2520 

CAGCGATGAT CTCGCTTGTG GTGTACGGCT CTTTCTTACC GTCCATGTAA ACTAGTTCCA 2580 
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TTACCGTTCT ACCTCCTGTA TAAATCTGGT TACCTTACTT TTTAATTGCC TCCTCTAGCC 2640 

TCTTTTTTAG CCTCTAAAAC GGCTTTGGCT ACTGGTTAAT ATTATTTACC ACTTGTCTCT 2700 

ATAAACGTGT TACAGGCCTT TATAACGACT TGTATCGCTG TATCGATATC CTCCGTGGAA 2760 

TAGTAGATTT ATTTTCTAAT ATCATTCAAG ACTTCTTTAA CCCATTTCTT GAAAGAAATA 2820 

AAATTACATC TTCTTTATCC TTGGCATCTG CTTTGTCTGA GACAAATTAG AATGTCAATA 2880 

CTTGG _ „ _ . _. - -- - 

(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 
TATCAATCCT TTCCCATTAT AGGAGCAACA GAGTGGGAGT AGTCATCTAA GGACTAATTT 
ATGTATTTTT ACGAGTCAGT ATCTTGGGAT ACTGGTTTTT ACTTTTCTAG ACTTTTTGAC 
TACTTGTTAA AACTGGGATA ATTTTCGACT GTTTAACAGT TATTATGCAA AGTCTAAAAG 
ATTAGAATTG TCAAAACAAT CCGTCTAGGC TTGATTTTAT CCTTTATTTA CTATAAAATG 
AGAAGGAAAA ATGTCAAACT TTTATATTGC AAATAGGAGA AATCATGACA AAAACATTAA 
AACGTCCTGA GGTTTTATCA CCTGCAGCGA CTTTAGAGAA GCTAAAGGTA CCTCTTCAGT 
ATGGAGCAGA TGCTGTCTTT ATCGGTGGTC AGGCCTATGG TCTTCGTAGC CGTGCGGGAA 
ACTTTACTTT CGAACAGATG GAAGAAGGCG TGCAGTTTGC GGCCAAGTAT GGTGCCAAGG 
TCTATGTAGC GGCTAATATG GTTATGCACG AAGGAAATGA AGCTGGTGCT GGTGAGTGGT 
TCCGTAAACT CCGTGATATC GGGATTGCAG CAGTTATCGT ATCTGACCCA GCCTTGATTA 
TGATTGCAGT GACTGAAGCA CCAGGCCTTG AAATCCACCT TTCTACCCAA GCCAGTCCCA 
CTAACTATGA AACCCTTGAG TTCTGGAAAG AGCTAGGCTT GACTCGTGTC GTTTTAGCGC 720 
GTGAGGTTTC AATGGAAGAA TTAGCTGACA TCCGCAAACG TACAGATGTT GAAATTGAAG 780 
CCTTTGTCCA TGGAGCTATG TGTATTTCAT ACTCTGGACG TTGTACTCTT TCAAACCACA 
TGAGTATGCG TGATGCCAAC CGTGGTGGAT GTTCTCAGTC ATGCCGTTGG AAATACGACC 
TTTACGATAT GCCATTTGGG AAAGAACGTA AGAGTTTGCA GGGTGAGATT CCAGAAGAAT 
TTTCAATGTC AGCCGTTGAy ATGTCTATGA TTGACCACAT TCcAGATATG ATTGAAAATG 1020 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



340 
900 
960 
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GTGTGGACAG 


TCTAAAAATC 


GAAGGACGTA 


TGrAGTCTAT 


TCACTAyGTA 


TCAACAGTAA 


1080 


CCAACTGCTA 


CAAGGCGGCT 


GTGGATGCCT 


ATCTTGAAAG 


TCCTCAAAAG 


TTTGAAGCTA 


1140 


TCAAACAAGA 


CTTGGTGGAC 


GAGATGTGGA 


AGGTTGCCCA 


ACGTGAACTG 


GCTACAGGAT 


1200 


TTTACTATGG 


TACACCATCT 


GAAAATGAGC 


AGTTGTTTGG 


TGCTCGTCGT 


AAAATCCCTG 


1260 


AGTACAAGTT 


TGTCGCTGAA 


GTGGTTTCTT 


ATGATGATGC 


GGCACAAACA 


GCAACTATTC 


1320 


GTCAACGAAA 


CGTCATTAAC 


GAAGGGGACC 


AAGTTGAGTT 


TTATGGTCCA 


GGTTTCCGTC 


1380 


ATTTTGAAAC 


CTATATTGAA 


GATTTGCATG 


ATGCTAAAGG 


CAATAAAATC 


GACCGCGCTC 


1440 


CAAATCCAAT 


GGAACTATTG 


ACTATTAAAG 


TCCCACAACC 


TGTTCAATCA 


GGAGACATGG 


1500 


TTCGAGCTCT 


TAAAGAGGGG 


CTTATCAATC 


TTTATAAGGA 


AGATGGAACC 


AG CGTC AC AG 


1560 


TTCGTGCTTA 


ATGTAGTTGT 


TTAGTTTTAA 


AAAACTATGC 


AAAGCTCCAT 


ATACAACACT 


1620 


TAAACGAGAT 


TAAAGAATGG 


CGAAATCCCT 


TGATGCGCAA 


GAGATTAGCT 


GTCTTTTTTA 


1680 


TTTTTTAAGT 


GATAAAGTCG 


GAGTTTAGGC 


ATCAAAGCCT 


ATCAAATTAA 


ACAAAGAAGC 


1740 


GATGTCTTAG 


ATATTTTGAA 


AAAAATTAAT 


AAGCAGAAAA 


CTCTCTATTA 


TTTTGTTGTA 


1800 


GAGAGTTTTT 


TGTTAATAAA 


ATTTCACAAA 


ATGACATTTA 


TATATTGCAT 


TAAGTTAGAT 


1860 


ATATGATATA 


ATATTGTTAA 


AAAGAGGCGC 


AACTTTTTAA 


AATTAATGAG 


AATCAAAGAG 


1920 


AAAACCAATA 


ATATTAATGG 


AGGAATAAAA 


AATGTAAGTA 


AGCATTATGG 


TCATTCAATC 


1980 


ATTCTCAAAG 


ATATAAATTT 


TGCACTTAAC 


AAGGGTGAAA 


TTGTTGGTCT 


AGCAGGGAGA 


2040 


AATGGAGTTG 


GTAAGAGTAC 


GTTGATGAAA 


ATTCTTGTTC 


AGAATAATCA 


ACCGACTTCA 


2100 


GGTAATATTA 


TAAGCAGTGA 


TAATGTTGGG 


TATTTAATCG 


AAGAACCAAA 


ATTATTTTTA 


2160 


TCTAAAACAG 


GTTTAGAGAA 


TTTAAAATAT 


TTGTCAAATT 


TATATGGTGT 


TGACTACAAT 


2220 


CAAGAAAGAT 


TTAGATGTTT 


GATCCAAGAG 


TTAGATTTGA 


CTCAGTCTAT 


TAATAAAAAA 


2280 


GTAAAGACCT 


ATTCTTTGGG 


TACAAAACAA 


AAATTAGCTT 


TGCTTCTAAC 


TCTCGTTACG 


2340 


GAACCTGATA 


TATTGATTTT 


AGATGAACCG 


ACTAATGGTT 


TAGATATTQA 


ATCATCACAA 


2400 


ATAGTTTTAG 


CGGTTCTAAA 


AAAATTAGCT 


TTACATGAAA 


ATGTGGGAAT 


TTTAATATCG 


2460 


AGTCATAAAT 


TAGAAGACAT 


TGAAGAAATT 


TGTGAGAGAG 


TTCTTTTCTT 


GGAGAACGGG 


2520 


CTTTTGACAT 


TTCAAAAAGT 


AGGAAAAGAT 


AGTCATAATT 


TCTTGTTTGA 


GATAGCTTTT 


2580 


TCATCAGCTA 


CAGATAGAGA 


CATTTTCATT 


ACCAAACAAG 


AATTTTGGGA 


TATTGTTTAG 


2640 


GAAGAGGGAT 


TGAGAATTAC 


TATGTCTGGG 


AATATTCAAA 


ATAGTGAGCT 


TTTTAAATTT 


2700 


TTTAACGAAA 


ACTCTATTAA 


AGTAGTTGAT 


TTTGAAACTA 


AAAAAGAGAC 


GCTTAAAGAT 


2760 


ATTTACCTAA 


ATCGTTCAAA 


ATAAAGGAAG 


GTTATAATCA 


TGAAATTAAA 


TAAACAGAAG 


2820 



WO 98/18931 



PCT/US97/19588 



1239 

AATCGGATGA TTTACGTCTT GTCTAATTTT CTATATGCTA TCTCAGTTTC CATTATTTAT 2880 

GCTTTGAATG GCATTGTGTT ACTAGTCATA GTAAGTAAAT TGGGTATTCC AGGTGATTTA 2940 

GGATTAAATT TTATAGTAGC TATTGTAGTC AATACAATTT TGTTAGTCCT GTTTTATTTT 3000 

CTATTATCTT ACATTTTCTA TTTATACAAA TTGAAAAGTG GCTTGGTATw TGGTATTTTA 3060 

GTAGCTTTAC TACTCTTTAT CTCTAATATA TTAAATACGA TGATGATGAA TACTAGTAAT- 3120 



GATTTGTTTA TCAAAGCAAT TGAA " - 
(2) INFORMATION FOR SEQ ID NO: 225: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



3144 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 

TACGGTATTA TTTTTAAGGA GAAAGAATCA TGAAAATCAA AAAATGGCTT GGTCTAGCAG 60 

CCCTTGCTAC AGTCGCAGGT TTGGCTCTTG CAGCTTGCGG AAACTCAGAA AAGAAAGCAG 120 

ACAATGCAAC AACTATCAAA ATCGCAACTG TTAACCGTAG CGGTTCTGAA GAAAAACGTT 180 

GGGACAAAAT CCAAGAATTG GTTAAAAAAG ACGGAATTAC CTTGGAATTT ACAGAGTTCA 240 

CAGACTACTC ACAACCAAAC AAAGCAACTG CTGATGGCGA AGTAGATTTG AACGCTTTCC 300 

AACACTATAA CTTCTTGAAC AACTGGAACA AAGAAAACGG AAAAGACCTT GTAGCGATTG 3 60 

CAGATACTTA CATCTCTCCA ATCCGCCTTT ACTCAGGTTT GAATGGAAGT GCCAACAAGT 420 

ACACTAAAGT AGAAGACATC CCAGCAAACG GAGAAATCGC TGTACCGAAT GACCCTACAA 480 

ACGAAAGCCG TGCGCTTTAT TTGCTTCAAT CAGCTGGCTT GATTAAATTG GATGTTTCTG 540 

GAACTGCTCT TGCAACAGTT GCCAACATCA AAGAAAATCC AAAGAACTTG AAAATCACTG 600 

AATTGGACGC TAGCCAAACA CCTCGTTCAT TGTCATCAGT TCACGCTCCC GTTGTAAACA 660 

ATACCTTCGT TACAGAAGCA AAATTGGACT ACAAGAAATC ACTTTTCAAA GAACAAGCTG 720 

ATGAAAACTC AAAACAATGG TACAACATCA TTGTTGCAAA AAAAGATTGG GAAACATCAC 780 

CTAAGGCTGA TGCTATCAAG AAAGTAATCG CAGCTTACCA CACAGATGAC GTGAAAAAAG 840 

TTATCGAAGA ATCATCAGAT GGTTTGGATC AACCAGTTTG GTAATAAGAA ACAGGGAGGT 900 

GGGAGAGAAA ATTCCACCTC TTGCTTTTGT ATAGAGTATA GATTGTAAAG AAGACTATTC 9 60 

GTTCATAGAA AGGTAGAGAG AATATGGTTT TTCCTAGCGA ACAAGAACAG ATTGAAAAAT 1020 
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TTGAAAAGGA TCATGTAGCC CACCATTATT TTGAGGTTTT GCGTACCTTG ATTTCTAAGA 1080 

AATCAGTCTT TGCCCAGCAG GTTGGACTCA AGGAAGTCGC AAATTATCTG GGTGAGATTT 1140 

TCAAGCGTGT TGGAGCTGAA GTGGAGATTG ATGAGAGCTA TACAGCGCCC TTTGTCATGG 1200 

CACATTTCAA GAGTTCGCCT CCAGATGCCA AGACCTTGAT TTTCTATAAC CACTATGACA 1260 

CTGTGCCAGC GGATGGGGAT CAGGTCTGGA CAGAGGATCC JcTTTACGCTT TCGGTCCGCA 1320 

ATGGCTTCAT GTATGGGCGT GGGGTTGATG ACGACAAGGG TCATATCACA GCTCGCTTGA 1380 

GTGCTTTGAC AAAATATATG CAGCACCATG ATGATTTACC TGTCAATATC AGCTTTATCA 1440 

TGGAGGGAGC GGAGGAATCG GCTTCAACAG ACCTAGATAA GTATTTGGAA AACCATGCAG 1500 

ACAAACTCCG TGGGGCGGAT TTGTTGGTCT GGGAACAAGG GACCAAAAAT GCCTTGGAAC 1560 

AGCTGGAAAT TTCTGGTGGC AATAAGGGGA TTGTGACCTT TGATGCCAAG GTAAAAAGCG 1620 

CTGATGTGGA TATCCACTCG AGTTATGGTG GTGTTGTGGA ATCAGCTCCT TGGTATCTCC 1680 

TCCAAGCCTT ACAGTCTCTT CGTGCTGCGG ATGGCCGTAT CTTGCTTCAA GGCTTGTACG 1740 

AAGAAGTACA AGAGCCCAAT GAACGAGAAA TGGCCTTGCT AGAAACTTAT GGTCAACGAA 1800 

ACCCAGAGGA AGTTAGTCGG ATTTATGGAT TGGAGTTGCC TCTCTTACAG GAGGAGCGGA 1860 

TGGCCTTTCT AAAACGTTTC TTTTTCGATC CAGCGCTTAA TATCGAAGGA ATCCAGTCTG 1920 

GTTATCAAGG TCAGGGTGTT AAGACTATTT TACCTGCAGA AGCCAGTGCC AAGCTAGAGG 1980 

TTCGTCTGGT TCCGGGCCTA GAACCGCATG ATGTTCTGGA AAAAATTCGG AAACAGCTAG 204 0 

ACAAAAATGG CTTTGATAAG GTAGAATTAT ACTATACCTT GGGAGAGATG AGCTATCGAA 2100 

GCGATATGAG CGCACCAGCC ATTCTCAATG TGATCGAGTT GGCCAAGAAA TTCTATCCAC 2160 

AGGGCGTTTC AGTCTTGCCG ACGACAGCGG GGACAGGACC TATGCATACG GTCTTTGATG 2220 

CCCTAGAGGT ACCAATGGTT GCATTCGGTC TAGGAAATGC CAATAGCCGA GACCACGGTG 2280 

GAGATGAAAA TCTGCGAATC GCTGATTATT ACACCCATAT CGAATTACTA GAGGAGCTGA 2340 

TTAGAAGCTA TGAGTAGAGA TATTATCAAG TTAGATCAGA TCGATGTGAC TTTTCACCAA 2400 

AAGAAGAGAA CCATCACAGC GGTTAAGGAT GTGACCATTC ACATCCAAGA AGGGGATATC 2460 

TACGGAATCG TTGGATATTC TGGAGCAGGA AAATCAACCC TTGTACGGGT GATTAATCTC 2520 

TTGCAAAAAC CATCTGCAGG GAAAATTACC ATTGACGACG ATGTGATTTT TGACGGCAAG 2580 

GTGACCTTGA CGGCAGAGCA GTTGCGTCGT AAACGTCAAG ATATCGGAAT GATTTTCCAG .2640 

CATTTTAACC TGATGACCCA AAAGACAGCA GAGGAGAATG TAGCCTTTGC CCTTAAACAC 2700 

TCTGAACTCA GCAAGGAAGA AAAGAAGGCT AAAGTAGCTA AGTTGTTGGA CTTGGTTGGT 27 60 

TTGGCAGATC GTGCTGAAAA CTACCCTTCA CAACTATCTG GAGGGCAAAA ACAGCGTGTG 2820 
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GCAATTGCGC GTGCCTTGGC CAATGATCCA AAAATCTTGA TTTCAGACGA CTCAACTTCT 2880 

GCCCTTGATC CGAAGACAAC CAAGCAGATT TTGGCCTTGT TGCAACATTT GAACCAAAAA 2940 

TTAGGCTTGA CTGTTGTCTT GATTACCCAT GAAATGCAGA TTGTCAAAGA CATTGCCAAC 3000 

CGTGTTGCAG TTATGCAGGA TGGGCATTTG ATTGAAGAGG GTAGTGTGCT TGAAATCTTC 3060 

TCAAACCCTA AACAACCTTT GACTCAAGAC TTTATCTCAA CAGCTACAGG TATTGACGAA 3120 

GCCATGGTCA AAATCGAGAA GCAAGAAATC GTGCAACACT TGTCTGAAAA CAGTCTCTTG 3180 

GTGCAACTCA AGTACGCTGG AGCTTCAACA GACGAGCCAC TTTTGAATGA ATTGTACAAC 3240 

CATTACCAAG TAATGGCTAA TATTCTCTAT GGGAATATCG AAATTCTCGA TGGTACTCCT 3 300 

GTTGGAGAAT TGGTGGTGGT TTTGTCAGGT GAAAAAGCAG CGTTGGCAGG TGCCCAAGAA 3 3 60 

GCCATTCGTC AAGCAGGTGT ACAACTAAAA GTATTGAAGG GAGTACAGTA AGATGGAATC 3420 

ATTGATTCAA ACCTATTTAC CAAATCTCTA TAAGATGGGT TGGGCTGGTC AGGCAGGCTG 3480 

GGGAACGCCT ATCTACTTAA CTCTTTATAT GACAGTTCTT TCCTTCATTA TCCGACGCTT 3 540 

CTTGGGGCTA GTGGCAGGTC TCTTTCTCCT CTTGACAGCG CCAGGTGGTG TCTTGGAGAA 3 600 

TAAACTCGTA TTCTGGATTT TAGACAAAAT TACCTCAATT TTTCGTGCGG TTCCCTTTAT 3 660 

CATCCTCTTG GCAATCTTGT CACCACTTTC TCACTTGATT GTTAAAACAA GTATCGGGCC 3720 

AAATGCAGCC CTTGTCCCAC TTT C TTTTGC AGTCTTTGCC TTCTGG 3766 
(2) INFORMATION FOR SEQ ID NO; 226: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2520 base pairs 

(B) TYPE: nucleic acid 
(Ci STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 

TGTTGCTGAG TTAATCGGTA CGTTCATGTT TGTATTCGTC GGGACAGGAG CTGTTGTTTT 60 

TGGAAATGGT CTTGATGGCC TTGGTCACCT TGGAATCGCC TTTGCCTTTG GTTTGGCAAT 120 

CGTGGTGGCA GCCTACTCAA TCGGAACTGT TTCAGGTGCT CACTTGAACC CGGCTGTTTC 180 

GATTGCTATG TTTGTAAACA AACGTTTGTC ATCTTCAGAA CTTGTAAACT ACATCCTTGG 240 

TCAGGTTGTT GGAGCTTTCA TCGCTTCTGG CGCTGTCTTC TTCCTCTTGG CTAACTCAGG 300 

TATGTCAACT GCTAGTCTTG GTGAAAATGC CTTGGCAAAC CGTGTCACTG TCTTTGGTGG 360 

TTTCTTGTTT GAAGTCATCG CAACTTTCTT GTTTGTATTG GTTATCATCA CTGTGACTTC 420 
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AGAAAGCAAG GGCAATGCCG CGATTGCTGG TTTGGTAATC GGTTTGTCAT TGATGGCCAT 480 

GATTCTTGTC GGATTGAAGA TTACTGGACT TTCAGTAAAC CCAGCTCGTA GCTTGGCACC 540 

AGCTGTCTTG GTAGGCCGCG CAsCCTTCAA CAAGTTTGGA TTTTCATCCT TGCACCAATC 600 

GCTGGTGGAG TTCTTGCAGC CCTTGTTGCA AAAAATTTCC TTGGAACAGA AGAATAATTG 660 

AAACTCAAAA AGCCTTGCTC CTCATCTTGA GGAACAGGGC TTTTTCGTAT GATACTCTTC 720 

GAAAATCTCT TCAAACCACG TCAGCTTCAT CTTCCCGTAG TATGGTTACT GACTTCGTCA 780 

GTTCTATCCA CAACCTCAAA ACAGTGTTTT GATCTGACTT CGTCAGTTCT ATCTGCAACC 840 

TCAAAACAGT GTTTTAAGCT GACTTCGTCA GTTCTATCTG CAACCTCAAA ACAGTGTTTT . 900 

AAGCTGACTT CGTCAGTTCT ATCTGCAACC TCAAAACAGT GTTTTAAGCT GACTTCGTCA 960 

GTTCTATCTG CAACCTCAAA ACAGTGTTTT AAGCTGACTT CGTCAGTTCT ATCCACAACC 1020 

TCAAAACAGT GTTTTGATCT GACTTCGTCA GTTCTATCCA CAACCTCAAA ACAGTGTTTT 1080 

GATCTGACTT CGTCAGTTCT ATCCACAACC TCAAAACAGT GCTTTGAGCA ACcTGCGCCT 1140 

AACTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATGACT TTAGCGGTTG TCAATTTTCT 1200 

CTGGATAAAG GTCGTGTTGG AAGAGGCGTT GTTCTGCCAA GCCCTCATAC TTAGTTCCTT 1260 

GCTTACCGTA GTTGTAGTAG GGCTCCATTG AAATGCCACC GCGCGGAGTG AATTTTCCCC 13 20 

AGACTTCTAA ATAGCGAGGG TCTAGCAAGT TGACCAAGTC TTTCCCGATG GTGTTGATAC 13 80 

AGTTTTCGTG GAAATCTCCG TGGTTTCGGT AGCTAAATAG ATATAGTTTG AGGGATTTTG 1440 

ACTCGACACA GAGCTTGTCA GGAATGTAGG AAATATGAAT CGTCGCAAAG TCTGGCTGAG 1500 

CAGTGATTTG TCCCAGCAGA GACATATCGA GGATATGGTG ACGAATGCCC TGTTCCTTAG 1560 

CGATTTCTCT AGTAATTTGA ATTTCGAGGT GATGACGTTG GCCGTAGGCA AAGGTGACAG 1620 

CTTCGACTGT TTCATAGTGT TGCATGACCC AGAAAAGGCA GGTTGTTGAA TCTTGACCAC 1680 

CACTAAAGAC GACCAAGGCT AATTGACGTT TCATAGTACT CCTTCCAAAA TGGGAAATGT 1740 

TCAGAGCACG CAAAAAGCTC CCATTAGGGA GCTAAAAAAT ACCAAATCGA GGTTTTTTTA 1800 

GCGATGGCAT ATCCCAAACA TCGTAATATT CTACTTATAT AGTAAAATGA AATAAGAACA 1860 

GGACAAATCG ATCAGGACAG TCAAATCCAT TTCTAACAAT GTTTTAGAAG TAGAGCTGTA 1920 

CTATTCTAGT TTCAATCTAC TATAGTCTAG CATATTTTTT GAAAAATGGC AAAGGGCAAG 1980 

AAAAAAGAGA CCAAAGAAAG TACTTGGTCT CTCGTTTGAT TAGCTCAATT CAGCAATCAT 2040 

GG CCTTGATT TGTTCTGCTG TGTGAACACC TGCAACTTGT TTGACAACTT GGCCGTCTTT 2100 

TTTGAAGAGA AGAGTTGGAA TAGACATGAT TCCAAAAGCA CGAGCTGTGT TTGGATTTTC 2160 

ATCAACGTCC ATTTTAACGA TTTTCAAGAC ATCTTCTGAA AGTTCTTCAG ACAATTTGTC 2220 
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CAACATTCGA CC7TCCATAC CACATGCACC ACACCAAGTT GCCCAGAAGT CTACTAAGAC 2 280 

CAAACCGTCT TTTGTTTCTT GTTCGAATGT TCCATCTGTA ATTGCTTTTG CCATTGTATT 2 340 

TCTCCTTTTT TTAGTTATAT TGGCTTAAAT CTTGTTTCAT GACATAGAAG AAGATATCTC 2400 

CATAAGTCCC ATCCTACTCC AAATTATGAC CCTTGTAAGT TAATTTTTGG ACACGGTACT 2460 

AkJcCTGCGAC GCCGATAAGG CAAGCTTGTT GCGAACGTTC AAAGTCTTCA TAAGACTCGG 2520 

' f2')- INFORMATION- FOR "SEQ _ ID~NOT"2 27~; ----- . _ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


227: 






ACTCAGTTAG 


ATTTTGTTTT 


CAAAAACAAC 


GAAGAAAAAG 


ACCATGTTGC 


k V- 1 A I. 1 1 


fin 


AGAATTGGCT 


CCCAACGTGT 


TTATCGATAT 


ATTAATAAAA 


AATATTTAGA 


TTTACCGGAA 


120 


ACATTCGAAA 


ATTATAATGT 


TTTTGTACCA 


GAAGCTAATG 


GAAGTGGTGC 


CTTAGCTGAA 


180 


GTCTTATCAA 


CACCCCTAAT 


CGGGGAACCC 


CTAATCGGGC 


ATACAGATAC 


TTTTTTATCT 


240 


ATTGGTAATT 


TTAAAACAAA 


ATTTGAAGCC 


GATCCTTGTA 


TTAAATTTAT 


TAAAACTAAA 


300 


TTCGCTAGAG 


TATTATTAGG 


TGTTTTGAAA 


GTTACTCAGC 


ATAATTCACG 


CAAAACTTGG 


360 


TATTACGTCC 


CCCTCCAAGA 


CTTTACGGTC 


AATTCGGACA 


TTGATTGGAC 


ACAATCAGTG 


420 


ACTGATATTG 


ACCGCCAGCT 


TGATCAAAAA 


TATGACTTTT 


CCCCTGAAGA 


AATTCCCTTT 


480 


ATTGAGAATC 


ATGTAAGGGA 


GATGGATTAG 


AAAACTATTT 


TTATTTGACA 


AATAGTGCTC 


540 


AATGATCTAA 


AATGACTATA 


TAGGATTAGG 


TCAGGAAGCA 


TACGATGCCC 


TGACCCTTTT 


600 


TGTACTTATG 


AGATGAGAAA 


GTCATTTGTT 


AGATAAATTG 


ACTCGTTAGC 


AAACGTTCAA 


660 


AAAAGGAAAA 


CTTATGCCAG 


TAGAAATTAA 


AACCACTAAA 


GAAATTCATC 


CTAAAATCTA 


720 


TCCCTACACC 


ACACCGACAG 


TAACCAGTAA 


TGAAGGCTGG 


ATTAAGATTG 


GGTATACAGA 


780 


ACGTGATGTC 


ACACAACGTA 


TCAAGGAGCA 


AACGCATACA 


GCTCATATAG 


CTACAGATGT 


840 


CTTATGGACT 


GGTGATGCAG 


CTTATACAGA 


AGAGCCTGAT 


AAGGGGAAAA 


CTTTCAAGGA 


900 


CCATGATTTC 


CACCATTTCC 


TTTCTTTCCA 


TGATGTAGAA 


CGTCGTCCCA 


AGACGGAATG 


960 


GTTCTATTTT 


AATGGAACTC 


CTGAAAAATC 


AAAAAATCTT 


TTTGATAAGT 


TTGTTCAGCA 


1020 


TGATTTGTCT 


GGTTATCAGC 


CTGGAAAAGG 


ACAGGACTAT 


ACTCTGCGAC 


AAGAGCAAGA 


1080 
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ACAAGCAGTT GCTAAGACAT TAGCTTATTT CCAAGAACAT GCTGGAGGCA AGTTTCTCTG 1140 

GAATGCCAAG CCACGCTTTC GTAAAACCTT GTCTACCTAT GACCTAGCTC GACGCATGGA 1200 

AGCTGTCAAT GTCCTAATTG TAACAAACCG CCCTGCCATT GCTAACTCAT GGTATGATGA 12 60 

TTTTGAAACA TTCATAGCAG GTCAAACGAC TTACAAGTTT GTTTCTGAAT CAGATAGCCT 1320 

TAAGAGTCGT CCAATCTTGT CACGACAAGA ATTTCTTGGT ATTTTAGCTG ACCATCTAAG 1380 

ACAACTTGCT TTTATCAGTC TCCAAGACTT GAAAGGATCT GTTTATTTAG GTGGAGAGCA 1440 

CG AT AAACTC AAATGGGTAA CTGATCTGCA TTGGGACTTG TTGCTTATTG ACGAGGCTCA 1500 

TGAAGGAGTT GATACCTTCA AGACTGACCA AGCCTTTAAT AAGATTCGAC GAAATTTTAC 1560 

TCTGCATTTG TCAGGTACAT CATTTAAAGC ATTGGCTAAA GGAGATTTTA CAGAGGAACA 1620 

AATCTACAAC TGGTCTTATG CTGATGAGCA GCCTGCTAAG TATTCGTGGT CTCTTGAGCA 1680 

AGAAGAGGAA AATCCTTATG AAAGCTTGCC TCAGTTGAAT CTCTTTACCT ATCAAATGTC 1740 

TCAGATGATT GGCGAAAAGT TAGAAAAAGG CGCTCAGATC GATGGTGAAA ATATTGACTA 1800 

TGTTTTTGAC TTAAGTGAAT TTTTCGCTAC AGATGATAAA GGGAAATTTA TTCATGACCA 1860 

TGATGTCAGA AATTGGTTAG ATACTCTATC AAGCAATGAA AAATATCCAT TTTCAACCAA 1920 

AGAACTCCGT AATGAACTCA AGCATACTTT TTGGCTTTTA GAACGTGTCG CTTCGGCCAA 1980 

AGCATTAAAA GCCCTACTAG AAGAACACCC AATCTATGAA AACTATGAGA TCGTTCTAGC 2040 

TGCTGGTGAC GGACGTATGT CCGAAGAAGA CGATAAAGTC AAACTC AAAT CCTTGGACTT 2100 

GGTTAGAAAA GCGATACCAG AGAATGACAA AACCATTACC CTATCCGTTG GTCAGCTGAC 2160 

GACAGGTGTC ACTATCCCTG AATGGACAGG TGTATTGATG TTATCAAATT TGAAATCACC 2220 

AGCTCTTTAT ATGCAGGCCG CCTTCCGTGC TCAAAATCCT TACTCATGGA GCGATAACAA 2280 

AGGAAATCAC TTTCGCAAAG AAAGAGCCTA TGTATTTGAC TTTGCGCCGG AAAGAACCTT 2340 

GATTCTCTTT GATGAGTTTG CCAACAACTT ATTGCTTGTA ACTGCAGCTG GTAGAGGAAC 2400 

TTCAGCTACA CGCGAAGAAA ATATTAGAGA ATTATTAAAC TTCTTTCCAA TTATTGCCCA 2460 

AGACCGTGCT GGTAAGATGG TTGAAATTGA TGCAAAGGCA GTTCTAACCA CTCCTCGCCA 2520 

GATAAAAGCT AGAGAAGTTC TTAAACGAGG TTTTATGTCC AATCTCTTAT TTGATAATAT 2580 

TAGTGGTATT TTCCAAGCAA GTCAAACAGT TTTAGATATT TTAAATGAGC TGCCAGTTGA 2640 

AAAGGAAGGG AAGGTACAAG ATAGTTCTGA TTTATTAGAT TTTTCAGATG TTACAGTCGA 2700 

TGATGAGGGA AATGCAGTAG TAGACCATGA AATTGTAGTT AATCAGCAAA TGCGACTTTT 2760 

TGGTGAAAAA GTTTATGGAC TTGGTGAATC TGTTGCTGAG TTAGTCACAA AAGATGAGGA 2820 

ACGAACTCAA AAACAGCTGG TCAATGACTT GAGTAAGACC GTTTCTTCAG TGATTGTAGA 2 880 
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GGAATTGAAA GCAGATTATT CTCTAAAAAC AAGCCAAACT GAGCAAATTA AGAAACAAAT 2940 

TACAGCAACA CTTGAGAATG AAATTCGAAA AAATGATATC GAAAGAAAAA TTTCTGAAGC 3000 

TCATATCAAG CAAGAGTTGC AACAGCAGCT CAAAGAAGCA AATGATAAAG CGCAAAAAGA 3060 

TAAGATTCAA CAAGATTTGG AAAAACGTTT AGAAGAAAAT AAACTCATTC ATAAAGAAAA 3120 

ACTAGAACAA ACACTCAAAA AAGAAGTGGA AAAAATGCCT GAGAAATTTA TCGAACAGGT 3180 

--- - TGAGATAAAA CGTGTGGAAC AGTTG AAACA ATCAGCTCAA GATGAAATTC CTGACCATTT 3240 

ACGAGGGTTT GCAAGAACAA TTCCAAGTTT TATTATGGCT TACGGTGATC AAACTCTAAC 3 3 00 

ACTTGATAAT TTTGATGCCT TTGTTCCTGA ACATGTTTTT TATGAAGTAA CAGGGATTAC 3 360 

GATTGATCAG TTTAGATATT TGCGAGATGG TGGGCAGGAT TTTGCAGGGC ATCTCTTTGA 3420 

TAAAGCAACA TTTGACGAAG CTATTCAAGA ATTTCTTCGC AAGAAAAAGG AGTTGGCGGA 34 80 

TTATTTTAAA GATCAAAAAG AAGACATTTT TGACTATATT CCACCGCAGA AGACCAACCA 3 540 

AATTTTCACT CCTAAACGAG TGGTGAAAAG GATGGTAGAT GATTTGGAAA AGGAAAATCC 3 600 

AGGGATTTTT GATGATCCAT CTAAGACTTT TATTGATTTA TATATGAAGT CAGGCCTCTA 3660 

TATTGCAGAA CTTGTGAAGC GGTTATATAA TAGCAATGGC TTGAAAGAGG CCTTTCCAAA 3720 

TCCTGAAGAA CGCTTAAAAC ATATTTTGGA AAAGCAAGTT TATGGATTTG CTCCGTCTGA 3780 

GATTATCTAT AACATTTCCA CTAATTTTAT ATTTGGCAAT CTTTCTAAAG ATATCAGTAG 3840 

GAAGAATTTT GTTTTAGCAG ATACCATTCC AGCGGCTAAA GAAGGGAGCA TTCAAAAGTT 3 900 

GGTTGATTCC TATTTTGAAA ATAATTAAAA AGAAGGCCGA GTCAAAATTC TTTGAAATCA 3960 

GAAAAAACGC ATAATATTGA GTGCTTTTGT ACTGCCCCCC AAAAGTTAGA CAGAAAAAAT 4020 

CTAACTTTTG GGGGGCAGTT CAGACAATCC TTGGTATTAT GCGTTTTATT GTGGGAAGAT 4080 

GTATAATGGA TTGAAATAAG ATATGAACAA ATCAATTAGG AATTTAAAGC ATTTTATAAC 4140 

AACGTTTTAG AGTAATGGGG GGCTATTTCA ACTTCAACCT ACTATAATAC AGAAAAAAAC 4 200 

AACTCCCTGA TAATTCAAGG AGTTGTCTAT AGTTAAATTA GTTTTTAGAA GCTTCTTGGA 42 60 

ATTCTGGGTT TTTCCATGCT TCGTCAATGA TAGCTTGTAA TTCTTTAGCA GATGCTTGCA 4 320 

TTTTTTGAGT TTCTGCGTCG TTCAATGGGA TATTTACTGG ACGAACGATA CCATGTGCAC 43 80 

CAACAACAGC TGGTTGACCG ATAAAGACAT TCTCAACTCC GTATTGACCT TCTTGGAATA 4440 

CTGAAAGTGG AAGTACTGCG TTTTCATCGT CAAGGATTGC TTTAGTGATA CGAGCAAGGG 4500 

CTACTGCGAT ACCGTAGTAT GTTGCACCTT TTTTGTTGAT GATTGTGTAG GCTGCATCAC 4 560 

CAACACCTTC GAACAATTCA ATCAATTCAG CTTCTTGAAC ATTTTGAGTG TCTTTAAGGA 4 620 
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ATTCTTCAAG GTTTACACCA CCGATGTTAG CGTGTGACCA AACAGCGAAC TCAGAGTCAC 4680 

CGTCTTCACC CATGATGTAG GCGTGCACTG AACGAGCATC CACATCCAAT TTTTCAGCAA 4740 

GTGCTTGACG GAAACGAGCT GAGTCAAGTG AAGTACCTGA ACCGATAACG CGTTCTTTAG 4800 

GGAAACCAGA GAATTTCCAA GTTGAGTAAG TCAAAACGTC AACTGGGTTA GCAGCAACAA 4860 

GGAAGATACC TTTGAAACCA GATTCAACAA CTTGAGTTAC GftTTCATTTG TTGATAGCAA 4920 

GGTTTTTACC TACAAGGTCA AGACGAGTTT CACCTGGTTT TTGAGGTGCA CCTGCAGTGA 4980 

TCACAACAAC GTCAGCGTCT GCACAGTCAG AGTATTGAGC TGCATAGATT TTTTTAGGTG 5040 

AAGTGAAGGC AAGGGCGTGA CTAAGGTCAA GCGCATCACC AACAGCTTTT TCATGCAATT S100 

GTGGAATTTC GATAATTCCA AGCTCTTGTG CAATTCCTTG GTTAACAAGT GCAAAAGCGT 5160 

AAGATGAACC TACAGCACCA TCACCGACAA GGATAACTTT TTTGTGTTGT TTACTTGAAC 5220 

TCATTGTTTT AAACATCTCC TTAATTTTAT TAGGGGATTT TCCCTAGACA ACTTCATT 527 8 
(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1941 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 



ATAAGGAATC 


TCTAAAAAAT 


TTTAAGGAGA 


ATCTAGCAAA 


TGGATTTCAC 


ATGGGCACTG 


60 


AAGTATGCCA 


CTGAATTTTT 


GGGAACTGCC 


ATTTTGATCA 


TTCTTGGGAA 


TGGTGCAGTT 


120 


GCCAACGTTG 


AACTTAAAGG 


TACGAAAGGT 


CACCAAAGTG 


GCTGGATCGT 


CATCGCTCTT 


180 


GGTTATGGTA 


TGGGGGTTAT 


GATCCCAGCC 


TTGATGTTTG 


GTAACGTATC 


TGGGAATCAC 


240 


ATCAACCCTG 


CTTTCACTCT 


AGGGCTTGCA 


GTTAGCGGTC 


TTTTCCCTTG 


GGCACAAGTG 


300 


GTACCTTACA 


TTATCGCGCA 


AGTCTTGGGG 


GCTATCTTTG 


GCCAAGCCTT 


AGTTGTGGCA 


360 


ACATACCGTC 


CATTCTACTT 


GAAAACTGAA 


AACCCAAATA 


ACATCTTGGG 


AACTTTCTCA 


420 


ACTATTTCAA 


GTATTGACCA 


TGGTACAAAA 


GAAAGTCGCT 


ATGCAGCAAC 


TGTCAATGGT 


480 


TTGATTAATG 


AGTTTGTTGG 


TTCATTTGTT 




CAGCTCTTGG 


TTTGACTAAA 


540 


AACTTCTTTG 


GTGCTGAAGT 


GCTTCAATTC 


ATGAAACAAA 


AGGCAACAGA 


AGCAGGACAA 


600 


ACAGTTGATT 


TTTCTGACTT 


GGCTATTAAA 


GCACAGGTGC 


CTCCACACAC 


TGCTTCAGGA 


660 


CTTTCTGTGG 


CTCACTTGGC 


ACTPGGATTC 


CTCGTTATGG 


CTTTGGTAAC 


ATCACTTGGA 


720 


GGACCTACAG 


GACCTGCCTT 


GAACCCAGCC 


CGTGACTTGG 


GACCACGTCT 


CCTTCATGCT 


780 
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TTCCTTCCCA AATCAGTTCT TGGTGAGCAT AAAGGCGATT CAAAATGGTG GTATTCTTCG 840 

GTACCAGTAG TAGCACCTAT CGCAGCAGCA ATTGCGGCAG TAGCTGTATT CAAATTCCTT 900 

TATCTCTAAG AAATAGCTCC TTTAACATTT GAGTGAGCAC CATCTATAAG TAAGAGAGGA 960 

TCACACTGGJc TCTCTCTTTT JcGATTTTTaG GGAAATGAAA GAAcTCTAAA CAAACTCCTC 1020 

TCCAGCAGTG GTTTAGAAGT CTCAGTGGGC TATTCCAGCT TCAATGGACT ATAGTAGGTT 1080 

GCAGTTGAAA TAATAGACCC TTGTTTCTAA- AACATTGTGA G AAATTGGTT TGAATTCTCC " 1140 

AATCAAATTG TGCAGTTTTC ATTCTACTAT ATATTATCGG AATATTATCG GAGATGGGTT 1200 

CCCTATCTTG TAAGTCTGCT TTATAGTGGC TTGAAGTTGG AATAGTCCTC CCTTCTTTCT 12 60 

CAAACATTGT GAGGAATTGA TTTACCTTCC TCAACAAAAT GTTCAGTTTC TATTTCATTT 1320 

TACTATAAAA TAAGCGATTA GGGGGGCTAT TCTTCGACCT ACATTGACTC TGCTGAGTCC 13 80 

TATGATTGTT ATCGTTTTAT CTGCAATTTT ATACTCAATG AAAATCAAAG GGCAAACTAA 1440 

GAAGCTAGCC GCAGGTTGTT CAAAACACAG TTTTGAGGTT GTATAGTAGA TTGAAACTAG 1500 

AATAGTACAC ATCTACTTCT AAAACATTGT TAGAAATCGA TTTGACTGTC CTGAACGATT 1560 

TGCCCTATTC TTGTTTCATT TTACTATATA AACCAGAGAC TGTTTACATT TTCAGCAAGT 1620 

GAGTGGATGG ATAATGCTGA AAACTCCTTG AAGGATAAGT CTATTTAGTA CTTTCTATTA 1680 

ATTAGTTAAA TTTTTACCAA GAATAATTCA CAAAAACCTT GTAAAACACT TGCAATTTAG 174 0 

CTGAAATTTG ATAAAATAGT AAGGAAAGTT AGACTGTATT GCCTACTGTC TATCTATAAA 1800 

ATATATTTTA TTGGAGGCTT TTACTCAAAT GGCAAAAGAA AAATACGATC GTAGTAAACC 1860 

ACACGTTAAC ATTGGTACTA TCGGACACGT TGACCACGCT AAAACTACCC TAACTGCAGC 1920 

TATCACAACT GTTTTGGCAC G 1941 

(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 755 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 

ATTTGAAGAA ATTGAAGAAA TCGTAGCCCC TACAGATGGT GAATTTTTGG GGGAAGTTTT 60 

ACTTGGAACT GGGGTAGTTC TCTTAATTGG AGTACCCTGT TGTTAAAAAG ATAGGGAGTC 120 

ATAATCATGC AAGATAACTT TTTATTTGAG GAAATTGAAG AAATTTCAGT ACCAGTTAAT 180 
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GATTTTTCAG CTGGACTTGC AACAGGTATC GGATTTGGTT TAGCAATCCT TCCTCTTCCT 240 

GGTTGTTGAA GTTTGTTCAT TTACTAACAT CAAGCTTTTT CAATTTCATT TTAGACAGTC 300 

ATTTAAATTT TCCGTATTAG TCTTGCAGCA AGAGATTAAT AGAATTAGTC ATTATTTTAT 360 

TGATTGCGGA CTGAGGGACT AGAGTATGTT TTACTTAACC CCTCTTTTAT TTATTAAAGG 420 

TTAGGTTTGT TATGAGAATT CTTGATAACA TTAAGATATT ACCTACTCCT TATGAGGGAC 480 

ACTATCATTT ATATATACCA TCCAGTAAGA AACATGTATT AGTTGGGAAA CAGGAAAAAA 540 

ATGGTTAGAG CAACTAATAG GTCAAGAATT TACCATATCG GACTTATTAG TGTTAGTAGG 600 

GAAGAAATAT TTTTAAAATA TCTTCGGACT TTAATATAAC ATTATCTGAA AAATTAAACT 660 

ATAAAAGATT TAATAAGAAT TTTGAAAAAA TCCTATCTTG TTCTCATTAT ATTTGCAACG 720 

ATACATGAAA TTAGTCATGC AATAATTGCT AATAA 755 



(2) INFORMATION FOR SEQ 10 NO: 230: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 1483 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 

CCAGAAAAAC CGTAGTGGAG CTCGTGGAAC AGTGGAATTG ATTTTCCAAA AAGAATACAA 60 

TAAATTTTCA AGTATCTCAA AGAGGGAGGC ATAAGATGTC AGATGCATTT ACAGATGTAG 120 

CCAAGATGAA AAAAATCAAA GAAGAAATCA AGGCACATGA GGGACAAGTC GTAGAAATGA 180 

CTTTGGAGAA TGGTCGTAAG CGCCAAAAAA ATAGATTGGG TAAGCTAATT GAAGTTTATC 240 

CATCTCTATT TATTGTGGAG TTTGGGGATG TGGAAGGAGA TAAACAAGTT AATGTTTACG 300 

TTGAATCCTT TACTTACTCA GATATTCTTA CAGAAAAGAA TTTGATTCAT TATCTTGACT 3 60 

AAACTCAGAA ATTTTCTCAC ' rmTCTm TTCTCCGAAT AATTTAGGTG AAGGCAATCA 42 0 

TCCCTTTATA TTATTTTTCA AGGAGGAAGA ATGAAAATTT TACCGTTTAT AGCAAGAGGA 4 90 

ACAAGTTATT ACTTGAAGAT GTCAGTTAAA AAGCTTGTTC CTTTTTTAGT AGTAGGATTG 540 

ATGCTAGCAG CTGGTGATAG TGTCTATGCC TATTCCAGAG GAAATGGATC GATTGCGCGT 600 

GGGGATGATT ATCCTGCTTA TTATAAAAAT GGGAGCCAGG AGATTGATCA CTGGCGCATG 660 

TATTCTCGTC AGTGTACTTC T TTTGTAGCC TTTCGTTTGA GTAATGTCAA TGGTTTTGAA 720 

ATTCCGGCAG CTTATGGAAA TGCGAATCAA TGGGGACATC GTGCTCCTCG GGAAGGTTAT 780 

CGTGTAGATA AT AC AC CG AC GATTGGTTCC ATTACTTGGT CTACTGCAGG AACTTATGGT 840 
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CATGTTCCCT GGGTCTCAAA TGTAATGGGA GATCACATTG AGATTGAGGA ATATAACTAT 900 

GGTTATACAG AATCCTATAA TAAACGAGTT ATAAAAGCAA ACACGATGAC AGGATTTATT 960 

CATTTTAAAG ATTTGGATGG TGGCAGTGTT GGGAATAGTC AATCCTCAAC TTCAACAGGC 1020 

GGAACTCATT ATTTTAAGAC CAAGTCTGCT ATTAAAACTG AACCTCTAGC TAGCCGAACT 1080 

GTGATTGATT ACTATTATCC TGGGGAGAAG GTTCATTATG ATCAGATACT TGAAAAAGAC 1140 

GGCTATAAGT GCTTGAGTTA TACTGCCTAT AATGGAAGCT ATCCTTATGT TCAATTGGAG " 1200 

GCTGTGAATA AAAATCCTCT AGGTAAtTCT GTTCTTTCTT CAACAGGTGG AACTCATTAT 1260 

TTTAAGACCA ACTCTCCTAT CAAAACTGAA CCCCTAGTTA GTGCAACTGT GATTGATTAC 1320 

TATTATCCTC GAGAGAAGGT TCATTATGAT CAAATTCTCG AAAAAGACGG CTACAAGTGG 1380 

TTGAGTTATA CGGCTTATAA CGGAAGTCGT CGCTATATAC AGCTAGAGGG AGTGACTTCT 1440 

TCACAAAATT ATCAGAATCA ATCAGGAAAC ATCTCTAGCT ATG 1483 

(2) INFORMATION FOR SEQ ID NO: 231: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 
<B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 
CCCGGAAAAC AAGTTAAAGT TGAACTTGGT CAAGCAGTTT ACGTTGAAAA ATTGAACGTT 60 

GAAGCTGGTC AAGAAGTTAC TTTTAACGAA TTGTTCTTGT TGGTGGTGAA AACACTGTTG 120 

TCGGAACTCC ACTTGTTGCT GGAGCTACTG TAGTTGGAAC TGTTGAAAAA CAAGGAAAAC 180 

AAAAGAAAGT GGTTACTTAC AAGTACAAAC CTAAAAAAGG TAGCCACCGT AAACAAGGTC 240 

ACCGTCAACC ATATACAAAA GTTGTCATCA ACGCAATCAA CCCTTAATTT TAAGGAGAAC 3 00 

ACATGATACA GGCAGTCTTT GAGAGAGCCG AAGATGGCGA GCTGAGGAGT GCGGAAATTA 3 60 

CTGGACACCC CGAGAGTGGC GAATACCGCT TAGATGTCGT GTGTGCATCG GTTTCTACGC 420 

TTGCCATTAA CTTTATCAAT TCTATTGAGA AATTTGCAGG CTATGAACCA ATCCTAGAAT 480 

TAAACCAAGA TGAAGGTGGC TATCTGATGG TTGAAATACC AAAAGATCTT CCTTCACACC 540 

AGAGAGAAAT GACCCAGTTA TTCTTTGAAT CATTTTTCTT AGGTATGGCA AACTTATCGG 600 

AGAACTATTC TGAGTTCGTC CAAACCAGAG TTATCACAGA AAACTAACAC GGAGGAAAAC 660 

ATTATGTTAA AAATGACTCT TAACAACTTG CAACTTTTCG CCCACAAAAA AGGTGGAGGT 720 
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TCTACATCAA ACGGACGTGA TTCACAAGCA AAACGTCTTG GAGCTAAAGC AGCTGACGGA 7 80 

CAAACTGTAA CAGGTGGATC AATCCTTTAC CGTCAACGTG GTACACACAT CTATCCAGGT 840 

GTAAACGTTG GTCGTGGTGG AGATGATACT TTGTTCGCTA AAGTTGAAGG CGTAGTACGC 900 

TTTCAACCTA AAGGACGCGA TAAAAAACAA GTGTCTGTTT ACCCAATCGC TAAATAAAAA 960 

GGTCCATTGA ACCTTTTATC CCGAACCTTG AAATGTAGAG GTGAGGAAGC TAGAAACAGC 1020 

TTAAAAT 1027 
(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1990 base pairs 

(B) TYPE: nucleic acid 
CO STRAND EDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 



CGGTTCAAAT 


GGTGCAGGTA 


AATCTACGTT 


AATTAATTCT 


ATTGTAGGTT 


TTCAAGAGAT 


60 


TTATTTAGGA 


GAAATAGACT 


ATTGTGATAA 


AGATTTGATA 


GTTAGTTCTC 


AACCTTTTGC 


120 


TCATTTAGGC 


TTTACTCCTC 


AAACCACAGT 


AATTGATTTT 


TATACTACTG 


TGAAGGACAA 


180 


TGTAATATTG 


GGGCTGAACC 


TTGCTGGAAA 


CTTTGGGAAA 


AATGCTGAGA 


AGTTGTGTCA 


240 


AATAGCCTTA 


GAAATTGTTG 


GGTTAGCTGA 


TAAAAAAAAT 


AATTTGGTAG 


AAACATTGTC 


300 


AGGTGGACAA 


CTGCAACGCG 


TCCAGATTGC 


TAGAGCAATA 


CCTCATAATC 


CAGATTTTTA 


360 


T A r ITT AG AT 


GAACCTACCG 


TTGCTTTAGA 


TACTGAATCT 


GCCGAAAAAT 


TTTTAATGTA 


420 


TTTAAAAGAT 


AAGAGTTTGG 


AAGGAAAAAC 


TATTATCATA 


TCTTCACATG 


ACATAAATCT 


480 


ACTCGAAAAG 


TTTTGTAAAA 


AAATACTTTT 


TTTACAAAAT 


GGCTCCATAT 


CATTTTTTGG 


540 


TGATATGCGT 


GACTTTGTAG 


ATAATTCAAC 


TATCAAATTA 


AATTTTTCAA 


TGCAGAATAG 


600 


AATTTCTAGA 


TATCAAATTG 


AATTTTTAGA 


AAATTTTAGA 


TTTAAAGTTC 


ACATCGAAGA 


660 


TAATGATAGT 


TTTACAATAG 


AAGTCCCTAT 


AGAAGAAAAG 


ATCTTAGATG 


TTATCAATGA 


720 


GGTAGGAAAA 


GCATGTGAAA 


TTAAAAACTT 


TTCAACAAGT 


AAATTAACCT 


TACAAGAAAG 


780 


TTATTTGCAA 


AGAATAGGAG 


GAGAAAAATG 


AACCCTCATC 


AATTAAGGCA 


CAAATCGGAC 


840 


TTAGGTTTAA 


GAGGTCTAGC 


GATTATTGCT 


AAAAATGAGA 


TTATTGCTTT 


TTTTAGAAGT 


900 


AAAGGTTTAA 


TTATTTCTCA 


GTTTCTACAA 


CCAATCTTAT 


ATGTTGTTTT 


TATAATAATA 


960 


GGATTAAATT 


CTTCGATAAA 


GAACATTCAG 


TTTAATGATA 


TAAAAACCTC 


TTATGCAGAA 


1020 


TATACAATCA 


TTGGTGTTAT 


AGCTTTATTG 


ATAATCGGGC 


AGATGACTCA 


AGTTATTTAT 


1080 
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AGGGTGACAA TAGATAAAAA ATATGGGCTA CTTGCTCTTA AGTTATGCAG TGGAGTTCGT 1140 

CCTTTATATT ATATTTTAGG GATGAGTATC TATTCTATAT TACGGTTCAT AGTTCAAGAA 1200 

ATTATTATAT ATATAATTAC GTTAGCGTTT GAGATAAATA TCGCAATGGA TAGATTTTTT 1260 

TATACAGTTT TGTTATCTAT TGTTGTTTTA TTATTTT G GG ACTCCCTTGC AATTTTACTT 1320 

ACAATGTTTA TCAATGATTA CAGAAGACGT GATATTGTAA TACGTTTTGT ACTAACACCG 1380 

CTTGGTTTTA CACCTCCTGT TTTCTACTTA ATAGATTCTG CTCCTACTAT TGTGAGATGG 1440 

ATTGGTCAGT TAAATCCCTT AACTTATCAA TTAACTATTT TGAGAAACTT TTATTTTAAA 1500 

AATTCAACAA CTTTGGAATT ACTTTTCTTA TTGTTAACAT CATTACTTGT CCTTATATCT 1560 

GTATCTTTTA TTATACCAAA GATAAAATTG ATACTGATAG AAAGATAAAA GTTGGGTCAT 1620 

CCAACTTTTT TGTTGTCTCC CGAAAACCAC TAGCTATGCT AGTGGTTCCA TAGAGCTTTT 1680 

AGCGTGGTAA CAAAAAGAAC CTCCTAAAAT GATAAGATAG AAGTGGTTTC TCCGCCACTA 1740 

CAACATATCA TACAGGAGGT ACCTCATGAG AGAGGATAAT CAAAGTTTAT CACATACCAC 1800_ 

ATGGAATTGT AAATATCATA TTGTTTTTGC ACCCAAATAT CGTCGTCAAA TCATTTATGG 1860 

CAGATACAAA CCTAGTATCG GAAGAATCAT ACGTGACTTA TGTGAGCGTA AGGGTGTAAT 1920 

AATCCATGAA GCGAATCCTT GTTCAGACCA TATTCACATG CTTATCAGTA TTCCTCCGAA 1980 

ACTT AGTGTT 1990 
(2) INFORMATION FOR SEQ ID NO: 23 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233 ; 

GAACTATATT GCATATATTT CTAGCAATGA TCATGGCGAA TCTTGGTCTG CACCAACTTT 60 

ATTACCTCCT ATAATGGGAC TTAATCGGAA TGCCCCATAT TTAGGTCCTG GACGTGGAAT 12 0 

CATTGAAAGC TCAACTGGAC GTATTCTTAT TCCGTCTTAC ACTGGTAAAG AGTCTGCGTT 180 

CATTTATAGT GACGATAATG GAGCATCTTG GAAAGTTAAA GTAGTGCCAC TTCCTTCTAG 240 

TTGGTCAGCA GAAGCACAAT TTGTAGAATT GAGTCCAGGA GTAATTCAAG CATATATGCG 300 

TACAAATAAT GGTAAAATTG CATATTTAAC AAGTAAAGAC GCAGGTACTA CTTGGAGTGC 3 60, 

ACCGGAATAT TTGAAATTTG TTTCAAATCC AAGTTATGGA ACACAATTAT CAATCATCAA 420 
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TTATAGCCAA TTGATTGATG GTAAAAAGGC TCTCATTTTA AGTACTCCAA ACTCCACAAA 480 

TGGTCGTAAA CACGGACAAA TTTGGATTGG TCTAATTAAT GATGATAATA CAATTGATTG 540 

GCGTTATCAT CACGACGTTG ATTATACTAA CTATGGATAC TCATATTCAA CATTGACAGA 600 

GTTACCAAAT CATGAAATTG GATTGATGTT TGAAAAATTT CATTCATGGT CTCGTAATGA 660 

ACTTCATATG AAAAATGTTG TACCATATAT AACATTTAAG ATTGAAGATC TGAAAAAGAA 720 

TTAAAGCTGA AATTTGAAAA TATATAAAAA GAGGATAAAA ATTATGGTAA ATTACGGTAT 780 

TGTTGGAGCT GGATATTTTG GAGCTGATTT AGCTCGCTCA ATGAACAAAA TTGAAGATGC 840 

AAAAGTGGTT GCGGTATTTG ACCCAAATCA TGGAGAAGAA GTTGCTCAAG AGTTGGGATC 900 

AGATGTTTGT GCAAGTTTAG ATCAACTTGT AGCACGTGAA GATATTGATT GTCTGATCGT 960 

AGCTTCACCT AGCTACCTTC ACCGTGAACC AGTTGTGAAA GCTGCTCAAC ATGGCAAACA 1020 

CGTATTTTGT GAAAAGCCAA TTGCATTGTC TTATGAAGAT TGTAAAGCCA TGGTTGACGC 1080 

ATGTAAAGAA AATAATGTCA TCTTTATGGC TGGTCACATC ATGAACTTCT TTAACGGTGT 1140 

ACACCATGCT AAAGAATTGA TTACTCAAGC TAAAATCGGT AAAGTTCTTT ATTGCCATGC 1200 

TGCTCGTACA GGTTGGGAAG AACAACAACC AACTGTATCA TCGAAGAAAC TTCGTTCTCA 12 60 

ATCTGGAGGA CATTTGTACC ACCATATTCA TGAATTAGAT TGCATTCAGT TTATCATGGG 1320 

AGGACTTCCT GAAAAAGCGA CAATGGTAGG AGGCAATGTA TATCATAAAG GTGAAAACTT 1380 

TGCTGATGAA GATGATATGC TCATTGTAAA CTTAGAATAC TCTGATGATC GTTATGCTGT 1440 

TTTGGAATAT GGTAATGCTT TCCGTTGGGG TGAACACTAC GTCTTGATTC AAGGAACTGA 1500 

AGGAGCTATC AAACTTGACT TGTTCAATAC TGGCGGTACT CTTCGTGTTA AAGGTGAAGG 1560 

AGAATCACAC TTCTTAGTTC ATGAAACTCA AGAGGAAGAT GATGATCGTA CAGCTATCTA 1620 
TACCGGTCGT GGTATGGATG GAGCAATTGC GTACGGTAAA CCAGGAGTAC GTTGCCCATT . 1680 

ATGGTTGCAA ACATGTATTG ATAAAGAAAT GGAATATCTA CATGACATCA TTAAAGGTGG 1740 

AGAAATTACA GAAGAATTTG AAAAACTTCT CAATGGTGTA GCTGCTTTAG AATCAATCiX 1800 

TACCGCTGAT GCATGTACTT TATCAGTTAA AGAAGATCGA AAAGTAAGTC TTTCAGAAAT 1860 

CACAAATGCT TAACTTPTGT AAAACAGAAT AGTAAATTCT TGTCATTATA TAATTTCTAA 1920 

AGTTCTGTGA TACAACTCAT TGAATAAAGA AATAGAGATG GGACTGGGAT AATGCCCAGT 1980 

CCCATTTTTT ATCAAAAAGT AATGAGATCA AAAATGTGGG AGTGTTGAAA TGAAGATTAT 2040 

AGGTATCGAT ATTGGCGGAA CAACAATTAA GGCAGATTTA TACGATCAGT TTGGAACGAG 2100 

TTTGAATCAT TTCAAAGAGA TAGAAACAAT TATTGACTAT GATTTGGGAA CGAATCAGAT 2160 

ATTAAATCAG GTCTGTGATT TAATTGGTGA GTATACTTTA AATCATTCAA TTGATGGTGT 2220 
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TGCGATTTCC 


ACTGCTGGAG 


TTGTTAATGC 


TAATACTGGA 


GAAATCATCT 


ATGCAGGCTA 


2280 


TACAATACCA GGCTATATCG 


GAGTAAACTT 


TACTGCCGAA 


ATAGAAAAAC 


GTTTTGGGTT 


2340 


GTATACTTTT 


GTTGAAAATG 


ATGTTAATTG 


TGCTGCATTA 


GGTCAATTGT GGAAGGGACA 


2400 


AGCCAAAGAT 


AAGAAAAATG 


TAGTAATCCT 


TACTATTGGA 


ACAGCTATAG 


GAGGCAGTAT 


2460 


TATTGTCAAC 


GGACAAATTG 


TTAACGGATT 


TAACTATACT 


GCTGGTGAAG 


TAGGTTATAT 


2520 


TCCf GTAGGT 


AATTCGGATT 


GGCAAAGTAA 


AGCCTCAACA 


ACCGCATTGA 


TTCATTTATA 


2580 


TCAAAAAAAG 


AGCTTGAAAA 


CTAATCAAAC 


TGGACCTACT 


TTCTTCACTG 


ATTTAAGATC 


2640 


TGGAGATAAA 


GTTGCTGAAG 


AAACTTTTGA 


AATTTTTGTA 


GAAAATCTAA 


CAAAAGGTTT 


2700 


ATTAACGATT 


TCTTATCTAC 


TTAATCCAGA 


AATTCTCATA 


TTAGGAGGTG 


GGATTCTGGA 


2760 


TAGTAAGGAT 


ATTTTGTTAC 


CTGAAATTCA 


AAGTTCTTTA 


GCTAAAAATG 


CAATGGATAA 


2820 


TAGGTTTTTA 


CCTAAAAATC 


TTGTGGCAGC 


TACATTAGGA 


AATGAACCTG 


GTCGTATAGG 


2880 


AGCTGTAAAA 


AATTTCTTAG 


ATAGAATTTC 


TAATAAATAG 


TATGTAAGAT 


AAGGAGGTGT 


2940 


CACAATGACT 


AACTCTGTAT 


TTTCGACAAT 


GCAAGATATT 


GAGAATGTTG 


CAACCGATAT 


3000 


TATAAAATCA 


TATGATAATG 


AGATTTATAC 


TTATAAAGCT 


CTTTCCCAAG 


AAGAATTGGA 


3060 


AAAACTAGAA 


AAAAGTTATG 


ATGAAAAAAG 


TCACGAAGAA 


TTAGTTTCAA 


TAGAAAGCAA 


3120 


TTTAGAAATG 


AAACAACAGA 


ACCTTATTGA 


TGAGGTTAAT 


AAAACAATCA 


AGGAAAATGA 


3180 


TGCAAATATT CAGTATATTT 


CATCAAGTAG 


GAGAGGAGAA 


TTTGTAGAAA 


AAATTATTGG 


3240 


TAGGGTGGTA 


GAAAAATATG 


GCCATTAGTC 


AGATGAAAAG 


AATCTCTCTA 


CTATTTTCTA 


3300 


AAAGTAGTCT 


TGATGATGTT 


TTAAAAACTA 


TTCAAGAACT 


A(j AVj i A- Atj 1 U 


CAGTTrCCGTG 


3360 


ATTTAAAGGT 


TCAGGATAAC 


TGGTCAGAAG 


CTCTAGAAAA 


AGATGAAGTT 


GTATTTCCAA 


3420 


CTATTCAAAT 


TTTTCATACT 


TCTAATTCCA 


ATCATGGGGT 


TATTGAGGGA 


AATGATGCCT 


3480 


TGACTTATTT 


GATGAATCAA 


CAACAACATT 


TAGAAGCAAC 


TGTAGAGAAA 


TTACAAGAAT 


3540 


ACCTACCGAA 


AGAAAACACG 


TTTAAATTAT 


TGCAGCAACC 


TCCGATAACT 


ACCTCTTATG 


3600 


AAGAATTAGA 


GAAATTTGGT 


AAAGCTAATG 


TTGCTGAGGG 


TGTTCTTAAA 


AAAGTGAATC 


3660 


ATCAAATTAA 


CAGAGTTCAT 


GAATTAGAAA 


GACACATTCA 


AAGTAATAAT 


GAGGAAATAG 


3720 


AGCGATTAAT 


AAAGTGGGAA 


AAATTAGAAA 


TTGTTCCTGC 


GAATTTAGAA 


CAATTTTCTT 


3780 


TCTGTAAAGG 


AAAAGTCGGA 


ACAATTCCAA 


GGACTGAAGA 


TAATCGCTTA 


TACAATAGTC 


3840 


TTTTAGAAAA 


CAATATTGAA 


GTTCAAGAAA 


TATTTTCTAA 


TGATAGAGAG 


TACGGTGTTG 


3900 


TTGTTTTCTA 


TCAGTCTAGT 


TACTCTATAG 


ATTTTGATGA 


ATACTTATTT 


GAACCATTTG 


3960 
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ATTATTCTAG AAAGGAATTA CCGAAGCAGC GAGTAGTAGA TTTAGATCAA GAAAACATGC 4020 

AGTTAATAAC TGAAAAAGAG AATATTATCG CATCGTTGCA AGATTCAAAG AAATATTTGA 4080 

TAGATTTACA ATGGCAAATA GACTATATTT TATCTATCTA TGCTCGTCAA ATCTCTAAGA 4140 

ATAACTTTTT GTGCACTCCG CATCTAGTTC CATTAGAAGG ATGGATAGAA GAAACTCGTA 4200 

TTTTATATTT TATAAAAGTT ATGGATGAGC ATTTTGGACA TTCTATTTAT ATTTATGAAT 4260 

CGGAAACATT GACGGATAAT CAAGATGAAA TACCTATCAA ATTAACGAAT CATTCTTTAA 4320 

TTGAACCATT TGAATTATTG ACAGAAATGT ATGCTCTGCC CAAATATTAT GAGAAAGATC 43 80 

CTACACCTGT ATTAGCACCA TTTTACTTTA CATTTTTTGG AATGATGGTT GCTGATTTAG 4440 

GCTATGGTTT ACTATTGTTT TTAGGAACAA TGTTAGCATT AAAAATTTTT CATCTACCTT 4500 

CAGCAACTAA GAGATTTTTA AAATTCTTTA ATATATTAGG GGTAGCCGTT GCAATTTGGG 4 560 

GTGGAATCTA TGCCTCATTT TTTGGATATG AGTTGCCATT TCATCTGATA TCTACAACCT 4620 

CTGATGTCAT GACTATATTA GTAGTGTCAG TTGTGTTTGG GTTTATTACA GTATTTGCAG 4680 

GTTTGTTAGC TTCAGGACTA CAAAAAGTAA GAATGAATAA ATATGCAGAA GCATATAATT 4740 

CACGATTTGC GTGGTGTGTT ATTCTG 4766 
(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(DJ TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 

CCTTTTAGAA AAAATTAAAG AATACGACAC CATTATCATT CATCGTCATA TGAAACCAGA 60 

CCCTGATGCC TTGGGAAGTC AGGTGGGATT GAAAGCCTTG CTGGAACATC ATTTCCCAGA 120 

AAAAACCATC AAAGCCGTCG GTTTTGATGA ACCAACTCTT ACTTGGATGC CTGAGATGGA 180 

TCTTGTTGAA GATAGAGCCT ACCAAGGCGC ACTTGTCATC GTCTGTGATA CAGCTAATAC 240 

TGCTCGTATC GATGATAAGC GCTATAGTCA AGGTGATTTT CTCATTAAGA TTGACCACCA 300 

TCCAAATGAT GATGTATACG GTGACCTGTC TTGGGTCGAT ACTAGTTCAA GTAGCGCTAg 360 

aGaTGATTAC CCTATTTGCC CAAACAACCC AACTAGCCTT GGCAGATCGC GATGCTGAGT 420 

TGCTCTTTGC AGGAATTGTC GGTGATACAG GTCGCTTCCT CTACCCTTCT ACCACTGCAC 480 

GGACTCTTCG CCTGGCTGCT TATTTGAGAG AACATAACTT TGACTTTGCG GCTCTCACTC 540 

GCAAAATGGA CACTATGAGC TACAAAATTG CTAAACTGCA AGGCTACATC TACGACCATC 600 
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TGGAAGTGGA 


TGAAAATGGT 


GCTGCTCGCG 


TTATCCTGAG 


TCAGAAAATC 


TTGAAACAAT 


660 


ACAATATAAC 


CGATGCTGAA 


ACTGCGGCCA 


TTGTAGGTGC 


ACCTGGACGC 


ATTGACAGAG 


720 


TGAGTCTCTG 


GGGAATTTTT 


GTCGAACAGG 


CTGATGGCCA 


CTACCGAGTT 


CCCTTACGCA 


780 


GTAAAGTCCA 


TCCTATCAAT GAAATTGCCA AGGAGCATGA TGGTGGAGGC 


CACCCTCTAG 


840 


CAACTGGTGC 


TAATTCCTAT 


AGCCTAGAAG 


AAAACGAAAT 


CATCTACCAA 


AAGTTAGAAG 


900 


ACTTGCTTAA 


AAACTGATAA 


AAT ACTTGCC 


AAACTTTTCA 


GAATCTGATA 


GACTAGTATA 


960 


GTAACAATCT 


ATGGCTCGCA 


AAGAGACCAT 


GGCAGAAAGG 


AAATATTGCA 


AAATGAAAAr 


1020 


AGATATCCAT 


CCAGAATATC 


GCCCAGTTGT 


CTTCATGGAC 


ACAACTACTG 


GTTACCArTT 


1080 


CCTTAGCGGT 


TCAACAAAAC 


GCTCTAACGA 


AACAGTTGAG 


TTCGAAGGCG 


AAACTTACCC 


1140 


ATTGATCCGT 


GTGGAAATTT 


CATCAGACTC 


ACACCCATTC 


TACACTGGAC 


GTCAAAAGTT 


1200 


CACTCAAGCA 


GATGGACGCG 


TGGATCGTTT 


CAACAAAAAA 


TACGGTCTCA 


AATAATGATA 


1260 


AGAGAACAGT 


TTTGGCTGTT 


CTTTTTTGTT 


TCTTGAAATC 


AACTGCTGTT 


TTCATGTTCC 


1320 


AGACTCATCT 


GTAGGTTCGA 


TTTCCATGCT 


ACTACGCACG 


AAGGAAATAG 


CTGTTTCAAC 


1380 


ACGTCCATAA 


TGAGCTATAC 


TATTGTCACG 


AACCACACTT 


TCATTGATGG 


TCCAAGTGGA 


1440 


ATTCATTTTC 


TTAAAAGCTT 


CTCGGACTTT 


TTCCAAATCT 


TTGGAGGCAA 


TGGCCTGCTC 


1500 


TAAGGTTTCA 


AAACGAGGAC 


TTATACTCAT 


CTGCTTTCAA 


AAAGCATTCT 


AGTCCATCTC 


1560 


CGATTACCCA 


TGGACTTTAT 


CACCTCCTTC 


TCCAGTCCTT 


GTATGACATC 


TTGAAGTTGA 


1620 


TTCATGACAT 


CTTCCAAAGT TCgAAAGGCT TTATTCTTAA ATCCACGTTT 


ACGAATCTCT 


1680 


TTCCACACTT 


GTTCAATGGG 


TTCATCTCTG 


GTGTGTATGG 


AGGAATAAAG 


GTAAAATCAA 


1740 


TATTAGTCGG 


AATATTTAAG 


GTACTTGATT 


TATGCCATAT 


AGCATTGTCC 


ATAACGAGTA 


1800 


AAAGGATAAG 


CTTGTGAAAG 


CTCTTCTAAA 


AAGGCGTTCA 


TCCACACTCC 


TTTTTATAAA 


1860 


CCTCAAATAA 


GGCATCAATT 


GTAACAAATT 


CTCCTGCCTC 


TGTAGCCTTC 


AAATGACGGG 


1920 


CAAGAAAGGC 


TTTCTCTTCC 


TCAACTGTCA 


TATATGCATG 


GTTACGACCA 


CCACCTGTTT 


1980 


CTTGAAGGAG 


AGAGTCGAGT 


CCGAACTCCT 


CATATTTTTT 


TACGTTTCGC 


CAAATCGTTG 


2040 


TTTGATTACA 


GTCTAAAAGC 


TCTATAATCT 


CTTTATAAGA 


TTTGCCCATC 


AGACGAAATA 


2100 


TACTAGATTG 


AAACTAGAAT 


AGTACACCTC 


TACTTCTAAA 


ACATTGTTAG 


AAATCGATTT 


2160 


GTCCTGTTCT 


TGTTTCATTT 


TACTATAGAA 


CGATTTGAAG 


GCGTTTATAA 


TATTTAGCTG 


2220 


TACGAGACTC 


TTTTAAAAGT 


GTTTTGATGG 


TTTGGATTTC 


TTCTTTAGTT 


GATTTCATAT 


2280 


TACTATTATA 


TAATGCTTTT 


TGATTTTAGT 


CTGGTATAAA 


TATTGCTTTC 


CTCCAAAATG 


2340 
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GTCATACTTT TACTGGCAAA TCTAACATAT CACGGATAAA TTAACAAGTG ATTTCTGAAT 2400 

TGCTAAACAT TTTCTTTTCT TATAGCATAC TTTAAGATTT TGTCTTTGAG AAAGATATTT 2460 

CCAAGAAAAA CGTTCGTTTT TTGG 2484 

(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1766 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 



CTAGATATAG 


CTATAATTTT 


ATTTATAACA 


AGAGGATAGA 


AATGACCGAA 


TTAGAAAGAA 


60 


AAAATCGAAA 


AATTAGCTAA 


GAAATATTCT 


GATAACTTAA 


ACATCAAAGT 


TCAAGAGAGA 


120 


GTTCGTGAAA 


TGGCAAATGA 


TAATAAGAGC 


CATTATTTGA 


TATACAGAGT 


TTTAGGTATT 


180 


TCATTTGAAG 


AAGGAGAAAA 


TATCGATTTG 


TATCAAAATA 


AAGGTCGTTT 


TTTATACAAA 


240 


TATGCTGGTT 


CATTTTTAGA 


AGAAGCTGCA 


GTACTATGCT 


TTAACGAAAA 


ATTTGGTACA 


300 


GAAAATACTT 


AAAAAGTTAA 


CATTCCTAAT 


TCTGAAAGTA 


CAAAACCTAA 


GACTTTTGAA 


360 


ATTGATTGTT 


TACTCGGAGA 


AAAACACGCA 


TACGAAATAA 


AATGGTGGGA 


TGCAACTACA 


420 


GATGGAGACC 


ATATAACTAA 


AGAACACACT 


AGAATAAAAG 


TTATTCATAA 


CAAAGGATAT 


480 


ATACCAATTC 


GGTTAATGTT 


CTACTATCCA 


AATAGAACTC 


AAGCTATAAA 


AATTCAGCAA 


540 


ACTTTAGAAA 


CATTGTATAA 


CGGTATTGGA 


GGGAAATATT 


ATTATGGACA 


TTCTGCCTGG 


600 


GAACATTTAA 


GAGCAGTGAC 


CGGTATTGAT 


TTACTTAGTA 


TTCTAACAGA 


TATTGCAAAT 


660 


AAAAAAACAG 


GGGTAAAATC 


AAAATGACAG 


TATTAAAAGG 


AGATAACTTA 


GAAATATTAA 


720 


AAACTATTGA 


ATCCTCAAGT 


ATTGATTTAA 


TCTATATGGA 


CCCTCCTTTC 


TTTACACAGA 


780 


AAACCCAAAA 


ATTATCTAAT 


AACAAAAATA 


TTATGTATTC 


ATTCGAAGAT 


ACGTGGACt T 


840 


CGATTGAGGA 


TTACAAAGAA 


TTTTTGTCTG 


TAAGATTAGA 


AGAATGCAAA 


AGAGTGCTAA 


900 


AAAATAGTGG 


CAGTATTTTC 


GTTCATTGTG 


ATAAAATTGC 


AAATCATCAT 


ATTAGATTAA 


960 


TTTTAGATAA 


TATCTTTGGA 


GTAGATATGT 


TTCAAAGCGA 


AATTATATGG 


AACTATAAAC 


1020 


GGTGGTCTAA 


TTCAAAAAAG 


GGATTATTGA 


ACAATCATCA 


AAACATTTAC 


TTTTATTCAA 


1080 


AGTCAAAAGA 


TTTTAAATTT 


AATACAATTT 


TTACAGAGTA 


TTCTTCTACT 


ACAAATATCG 


1140 


ACCAAATACT 


AGTGGAACGA 


AAACGAGATG 


GAAACTCTAA 


AACTATATAT 


AAGGTTGATA 


1200 


ATAATGGTAA 


CTATATTCTA 


GCAAAAGAGA 


AAAATGGACT 


TCCCCTTTCA 


GATGTTTGGA 


1260 



/ 



WO 98/18931 



PCT/US97/19588 



1257 

ATATACCATT TCTTAATCCA AAAGCTAAAG AAAGACTAGG TTATCCTACA CAAAAACCTA 1320 

TTCTGTTATT AGAACAAATT ATAAAGATTG CTACTGATAA AAATGATATA GTTTTAGACC 1380 

CCTTCTGTGG AAGTGGAACT ACTTTAGTAG CCTCCAAGAT TTTGAATAGA AATTATATGG 144 0 

GGATTGATTT ATCTGAGGAA GCTATCAATA TAACTCAGCA ACGTCTGCAA AATGTTATAA IS 00 

AAACAAGTTC AAATTTATTG AATAAAGGAA TCGAACCATA TAGAACCAAA ACTGAGGAAG 1560 

AGGAAAACAT TCTTAAATTA TT ACAGGCAA_ AAATTGTTCA AAGAAATAAA GGAATTGATG 1620" 

GTTTTTTACC TAAACATTTT CAAAAAAAAC CGATACCTAT AAAAATTCAA AAAAATAATG 1680 

AATGTCTGAA TGAGAGTATC TCTTTATTAC AGAATGCTAT AAACTCCAAA AAACTTGATT 1740 

TTGGAGTAGT TATAAAAACT CATTCG 17 6g 
(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 6: 

CCGAAAATCA AATTCAAACC ACGTCAACGT CGCCTTGCCG TACTCAAGTA CAGCCTGCGG 60 

CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA AACTAAATTA AATAATATTA 120 
GCGCGGAGAA TTTCTAATTC TTCCTTGGTC AAGCGACGCC ATTCCCCTCG TTCTAGGTTC * 180 

TCATCTAATA CTAAAGTTCC CATAGTCAAT CGTTGCAAGT CCACCACTTC CTTGCCACAG 240 

TAGCCCACCA TACGCTTGAT CTGATGAAAC TTCCCTTCTG CAATGGTCAC ACGGATTTGG 300 

CTTTGATTCT TTTCTGTATC TATGGATACA AGCTCCAGTA TAGCGGGTTG ACAGGTAAAG 3 60 

TCTTTGAGAG GAATACCCTC AGCAAATGTC TCCACATCTT CTTGGGTCAT GATTCCCTTG 420 

ACTTGTGCCA GATAAGTCTT GTCCACATGA CGCTTGGGCG AAAGAAGAAC ATGAGCCAGC 480 

TGACCATCAT TGGTCAAGAG CAAAAGACCA TGCGTCTCAA TATCCAAGCG TCCTACTGGG 540 

AAAACTTCCT TACTCCGCGC CAAGTCATCC AACAAGTCCA GAACGGTTCT GTGCTTGGGA 600 

TCCTCAGTCG CTGAGATAAC TCCTTTGGGC TTGTTCATCA TGTAGTAGAC AAACTCTTCA 660 

TACTCCAACA CTTGCCCATC AAAGCGAATC TCATCTATTT TTTCATCAAT CTGCAATTTA 720 

GCTGATTTTT CTTTTTGACC ATTTACAG 748 
(2) INFORMATION FOR SEQ ID NO: 237: 
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ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1449 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) 


SEQUENCE DESCRIPTION: : 


SEQ ID NO: : 


23TT— 






AAAAGATTAC 


ATTGCAACAA 


TTGAAAATTA 


TCCAAAGGAA 


GGCATTACCT 


TCCGTGATAT 


60 


TAGTCCTTTG 


ATGGCTGATG 


GAAATGCTTA 


TAGCTACGCT 


GTTCGTGAAA 


TCCTTCAGTA 


120 


TGCTACTGAC 


AAGAAAGTCG 


ACATGATCCT 


GGGACCTGAA 


GCTCGTGGAT 


TTATCGTGGG 


180 


TTGTCCAGTT 


GCCTTTGAGT 


TGGGAATTGG 


TTTTGCGCCT 


GTTCGTAAGC 


CAGGTAAATT 


240 


GCCACGCGAA 


GTTATTTCTG 


CTGACTATGA 


AAAAGACTAC 


GGTGTCGATA 


CCTTGACTAT 


300 


GCACGCGGAT 


GCCATTAAGC 


CAGCTCAACG 


TGTTCTTATT 


GTAGATGACC 


TTTTGGCGAC 


360 


AGGTGGAACT 


GTTAAGGCAA 


CTATCGAGAT 


GATTGAAAAA 


CTTGGTGGTG 


TTATGGCAGG 


420 


TTGTGCCTTC 


CTTGTTGAAT 


TGGATGAATT 


GAACGGCCGT 


GAAAAAATTG 


GTGACTACGA 


480 


CTACAAAGTT 


CTTATGCATT 


ATTAATGAAA 


ACAGTCCCTA 


GGGCTGTTTT 


CTCTACACTA 


540 


GGATATAAAA 


ATAGACTATA 


ACTAGTTAGA 


GAAAAACTAT 


AATTGAAAAC 


TATATCTTCT 


600 


TGCAGTATAA 


TAAAAGGACT 


AAGTGTTTGA 


GATTTGTCTT 


CAAACATATG 


CAATTATTCC 


660 


TGAAAGAGTA 


CAGTTAGGAG 


AGGGTTATCC 


CGATTCGAAT 


TGATAAAAAA 


TTGCCAGCTG 


720 


TTGAGATTTT 


ACGGACAGAG 


AATATCTTTG 


TCATGGATGA 


TCAACGTGCT 


GCCCACCAAG 


780 


ATATCCGTCC 


TTTGAAGATT 


TTAATTTTAA 


ATCTCATGCC 


ACAGAAAATG 


GTCACAGAGA 


840 


CCCAGTTGTT 


GCGCCACTTG 


GCTAATACAC 


CCCTACAACT 


GGATATTGAT 


TTTCTCTATA 


900 


TGGAGAGCCA 


CCGTTCTAAA 


ACAACTCGTT 


CAGAGCACAT 


GGAGACCTTC 


TATAAAACTT 


960 


TTCCTGAAGT 


CAAGGATGAG 


TATTTTGATG 


GGATGATCAT 


CACGGGTGCT 


CCAGTTGAGC 


1020 


ATTTACCATT 


TGAGGAAGTG 


GACTATTGGG 


AGGAATTTAG 


ACAGATGCTT 


GAGTGGTCTA 


1080 


AGACTCATGT 


CTATTCGACC 


CTTCATATCT 


GTTGGGGGGC 


TCAGGCTGGG 


CTTTATCTGC 


1140 


GCTATGGTGT 


AGAAAAATAC 


CAGATGGACA 


GTAAGCTATC 


AGGTATTTAT 


CCTCAGGACA 


1200 


CCCTAAAAGA 


GGCTCACCTT 


CTATTTAGAG 


GCTTTGATGA 


TAGCTATGTA 


TCCCCTCATT 


1260 


CACGGCACAC 


GGAGATTTCT 


AAGGAAGAGG 


TCTTAAACAA 


GACCAATCTC 


GAGATTTTAT 


1320 


CAGAAGGACC 


TCAGGTTGGG 


GTTTCTATTw 


TGGCCAGTCG 


TGATTTACGA 


GAAATTTATA 


1380 


GTTTTGGTCA 


TTTGGAGTAT 


GACCGTGATA 


CTTTGCCAAA 


AGAGTATTTT 


CGAGATCGTG 


1440 


ATGCAGGTT 












1449 
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(2) INFORMATION FOR SEQ ID NO: 238: 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 904 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

TACCCGCTTC TTTCAAGAGT TGCAGCAGGC CTTGTTTGCG ATCTTTTGTC ATAGTTCTTC 60 

CTTTTAACGG CGTTTTCGAA GCACTTTATA GACAGCTAGT CCTAATGTAT ACTCTACCAT 120 

ACTATGGATA ATTCTACCAA ATCCAACTAG TACAAATAGA ACATAAAACA TATTTTCTAC 180 

ATTGGTACCA GAAGTTGCGT AAAAAACGAC ACAGGCCAAT ACTTCAGCAA GGCCATGAAC 240 

AACAGCCAAA ACAAAGTTGA AAATCCAGGA AGATTTTGGT TTATCTAGGC TATCGGGGAA 300 

TTTTTGTAGG TAAAGAGCTC CTAAAGCACC AAAAGATATA TGGGAAAAAG CCCGAAAAAC 360 

GATAACCATG GGATAGCCAG CCATCAAAAA TCCAAAACTA GAGGCTAGGA TGACAAAAAC 420 

TGCCATCAAG GGCGACAAGA ACATGGCTAT AAAAATAGCG ATGTGCCTCC CCAAAGTATA 4 80 

GGAAGCAGGT GGAATGACAA TCTTGAAAGG CATAACAATT GGAATCAAAA TCGCAATAGC 54 0 

CCTTAAAACC GCTGTCATTG TCATAAATTG TGTCTTTTTC CCTGTATTCA CAAGAATCTC 600 

CTTTTTAACT GCATATACAC TAGTATGGTA CAATAAACCA GACAATAAAG CAAGAATTTA ,660 

CTTGGGTTTA TAGATCATTT TTTAGTTAAA AGTTATAGTA GATTGAAACT AGAATAGTCC 720 

ACCTCTACTT CTAAAACATT GTTAGAAATC GATTTGGCTG TCCTGATCGA TTTGTCCTCT 780 

TCTTATTTCG TTTTACTATA CTAAAGATTT CATTAAAAAG AAACTGTATA GAGCAAAATC 840 

TCCACCTTCA GGTTTGGAAA GCGGAGATTG TTTnTTATTT TTTCCAGGGT TTGTAGTCGT 900 

GGGA 904 
(2i INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 946 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 
ID) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 
CACTCAAACA TGACTTATAT CAACACGGAT GGACTTCAAC ACGATGCCAA TCGCTTGAAT 



60 
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CGTAACATTC AGTTTGCTGT TCGTGAATTT GCAATGGGAA CAATCTTGAA CGGGATGGCC 
CTTCATGGTG GACtTCCTCT ATACGGTGGA ACTTTCTTCG TCTTCTCTGA CTATGTGAAG 

GCAGCTGTCC GCTTGTCAGC CTTACAAGGA CTTCCTGTGA CTTATGTCTT TACCCATGAT 240 

TCAATCGCAG TTGGGGAAGA TGGTCCGACT CATGAACCAG TTGAGCATTT AGCAGGTCTT 300 

CGTGCTATGC CAAATCTAAA TGTTTTCCGT CCACCAGATG CGCGTGAAAC GCAAGCAGCT 360 

TGGTACCTTG CAGTCACAAG TGAGAAAACA CCAACTGCCC TTGTCTTGAC ACGTCAAAAT 420 

TTGACTGTTG AAGATGGAAC AGACTTCGAC AACGTTGCTA AAGGTGCTTA TGTTGTATAT 4 80 

GAAAATGCAG CCGACTTTGA TACCATCTTG ATTGCGACAG GTTCAGAGGT TAATCTTGCT 540 

GTCTCAGCTG CTAAAGAATT GGCTAGTCAA GGCGAAAAAA TCCGCCTAGT CAGCATGCCA 600 

TCTACAGATG TCTTTGATAA ACAAGATGCA GCTTACAACG AACAAATTCT TCCAAATGCA 660 

GTCCGCCGTC GTGTTGCAGT CGAAATGGGT GCAAGTCAAA ACTGGTACAA ATATGTTGGT 720 

CTCGATGGTG CCGTTCTAGG TATTGATACT TCGGAGCCTC TGCCCCAGCA CCAAAAGTAT ?80 

TGGCAGAATA TCGCTTTACT GTACAAAATC TTGTAAAAGT TGTTCGAAAC TTGAAATAAT 840 

CCTAAAAATC AGGGCGTAAG CTCTGGTTTT TCTTACCAGA AAAGTAACGT ACAATCTTGT 900 

AAAAGTAGCT GAAATTTGAT ATAGTAGTCC TATGTAAAAG ACAAAG 946 
(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2764 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

CGGGGCTCCC TAGTTCTTAG GGAGCTATTT TTGTTTTTTC AAGAAGTTAT CTTCTTCTAT 60 

TTTATACTCA ATGAAAATCA AAGAGCAAGC TAGGAAACTA GCCGTAssTG CTCAAAACAC 120 

TGTTTTGAGG TTGTAGATAA GACTGACAAA GTCAGGAACA CATATCTACG GCAAGCCGAC 180 

GTTGACGCGG TTTGAAGAGA TTTTCGAAGA GTATTAGTTG TGAATCTGGT GCAGTCGTCC 240 

CAGATTATTC TTATTAGTAG GGTCTTGTTT TCTATATCCC CTCGTAGTTA ACAAGACCTT 300 

GAGCATTTTA GAAAGAGGAA TCTATGTCTA CGAAATATAT TTTTGTAACT GGTGGTGTGG 360 

TATCGTCCAT TGGGAAAGGG ATTGTGGCAG CGAGTCTAGG CCGTCTCTTG AAAAATCGTG 420 

CTCTCAAACT AACCATTCAA AAGTTTGACC CTTATATCAA TATTGATCCG GGAACCATCA 480 

GTCCTTACCA GCACGGGGAA GTTTTTGTGA CAGATGACCG AGCTGAGACA GATTTGGACT 540 
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TGGCTCACTA TGAACGTTTC ATCCATATCA ATCTCAACAA ATATTCCAAC GTGACAACTG 
GGAAAATTTA CAGTGAAGTT CTTCGTAAAG AACGCCGTGG AGAATACCTT GGGGCAACTG 
TTCAAGTCAT TCCTCATATC ACAGATGCTT TGAAAGAAAA AATCAACCCT GCCGCTCTAA 
CGACCGACTC TGATGTCATT ATCACACAGG TTGGTGGAAC ACTAGGAGAT ATCGAGTCCT 
TGCCATTCCT AGAGGCTCTT CGTCAGATGA AGGCAGATGT GGGTGCGGAT AATGTCATGT 
ATATCCATAC AACCTTGCTT CCTTACCTCA AGGCTGCTGG TGAAATGAAA ACCAAACCAA 
CCCAACACTC TGTCAAAGAA TTGCGTGGCT TGGGAATCCA ACCAAATATG TTGGTTATTC 
GTACAGAAGA GCCACCTGGT CAAGGAATTA AAAATAAACT GGCCCAGTTC TGTGATGTGG 
CACCAGAAGC CGTTATCGAA TCGTTGGATG TTGAACACCT TTACCAAATT CCACTGAACT 
TGCAGGCACA AGGGATGGAC CAAATTGTTT GTGATCATTT GAAATTAGAC GCACCAGCAG 
CGGATATGAC AGAATGGTCA CCCATGGTGG ACAAGGTCAT GAACCTCAAG AAACAAGTTA 
AGATTTCCCT TGTTGGTAAG TATGTGGAGT TGCAAGATGC CTATATCTCA GTGGTCGAAG 
CCTTGAAACA CTCTGGCTAT GTCAATGATG CAGAAGTTAA AATCAATTCG GTCAATGCCA 
ATGATGTGAC AGCAGAGAAT GTAGCAGAAC TCTTGTCTGA TCCGGACGGG ATCATCGTAC 
CAGGTGGTTT TGGTCAACGT GGTACAGAAG GGAAAATCCA AGCCATCCGC TATGCGCGTG 
AAAATGATGT TCCAATGTTG GGAGTCTGCT TGGGAATCCA GTTGACATGT ATCGAGTTTG 
CTCGTCACGT TTTAGGTCTT GAAGGTGCCA ATTCTGCAGA GCTTGCACCA GAAACAAAAT 
ACCCTATCAT TGATATCATG CGTGATCAGA TTGATATTGA GGATATGGGT GGAACCcTTC 
GTTTGGGACT TTATCCGTCT AAGTTGAAAC GTGGCTCTAA GGCTGCTGCT GCTTATCACA 
ATCAAGAAGT GGTGCAACGC CGTCACCGTC ACCGTTATGA GTTTAATAAT GCCTTCCGTG 
AGCAGTTTGA GGCAGCAGCT TTTGTCTTTT CAGGAGTTTC TCCAGACAAT CGTTTGGTAG 
AAATCCTGGA AATTCCTGAA AATAAATTCT TTGTAGCTTG TCAGTATCAC CCTGAACTGT 
CAAGCCGTCC AAACCGACCA GAAGAACTCT ACACTGCCTT TGTTACTGCA GCAGTTGAGA 
ACAGCAATTA GCAAAATCAG AACCTTTGAG AAAAATCTCA GAGGTTTTTT GCATACGATG 
ATATTGCAGT ATATCTGAGG TAGGGGTCCT CTGTATGTAC CTGCTACCGT TGAAATCAAT 
AGCGACTCCC TCTTGCCCTG TGCTAGTGAA TGGATTTATC AGTATATTGA AATGAAATAA 
AATTTGAACA AATTAATTCG GAAAGCCAAA TCAATTTCTA GCAAAGTTTT AGGAACTGGA 
TTGTATAGTG AATTGAAATA AGATGTGAAC ATCTCTATCA GGAAAGTCAA ATTAATTTAT 
AGAAATATTT TAGCAGTCAA GATGTACTGT TATACATTCA AT AC ATT AT A CTTTTTTAAT 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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TTAATCCACT 


ATAGTAAAAT 


GAAATAATAA CAGGACAAAT CGATCAGGAC AGTCAAATCG 


2340 


ATTTCTAACA 


ATCTTTTACA 


AATAGAGGTG TACTATTCTA GTTTCAATAT ACTATCCCAA 


2400 


ATCATTCATA 


CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC ATGAGCCACT 


2460 


TTCTTCCTCC 


TCATGAGGTC 


AGTTTTACTT TCTGCTGTTC CAGTATCGTT TTTCCTCGCT 


2520 


ACATTTCCTC 


AAAAGGGCAG 


ACTCCTCCCT TGGTGCGTCA CACGATTTTT TCATCTCGAC 


2580 


TGTTCTTTAA 


TGCATCATTA 


ACGACGCTTT TCTTCTACGT GGTTCATAAG GAACAGGAAG 


2640 


ATTCAGGTTG 


ACTTTTCTAA 


TCCTAGAATA AAGTGCTGAA AACAATTCGG AATAGGCATA 


2700 


GAG ACT AG AC 


AATTTGAGGA 


GCTGCTTGCG TCCTGTTCGA ACACATTTTC CCACCACGTG 


2760 


AAGA 






2764 



(2) INFORMATION FOR SEQ ID NO: 241: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



60 
120 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 241: 
CCCTTTTTTT CATTGTTCAG TACTACAACT TACGTTGTAG CGCCCTGCAC ATTGGTTCGT 
CTTGTTCAGT TTTCAAAGGT CTTTGTCACT TGCTTCTCTC AAGCGACAAC TATATTAGTA 

TATCACAACT GCTTTCGCTT GTCAACACTT TTTTGAAGAT TTTTAAGTTT TTTTAAACTT 180 

TTTTTCATCA AGTGGTCCTG ACGCAACATA CCATAGTCCG TACGGGATTC GAACCCGTGT 24 0 

TACCGCCGTG AAAAGGCGGT GTCTTAACCC CTTGACCAAC GGACCTGAGT TGTTATTTTC 300 

AACTCTTACT ATTATACAGT CTTTTCAAAC TTTGTCAACT ACTTTTTTAA ACTTTTTTTA 360 

TTAATTTTAC AACAGCTTCA GTTCGAGCTG TATGTGGGAA CATATCGACC GACTGGATAT 420 

AATGAAGATC ATAGACTTCT ACTAAGCCTA CCAAATCACG AGCCAAGGTC GAAACATTAC 480 

AAGAAATATA AACCATTTTT TCTGGTACAT AAGTAAGAAT AGTATCTAAT AACTTATCAT 540 

CCAGACCTGT ACGTGGTGGG TCAACAATCA AAGCATCTGC TCGCTAGCCT TCCTTGTACC 600 

AACGAGGAAT AATCTCTTCT GCCGTTCCAG CTTCATAATG AGTATTGTCA AATCCCATTC 660 

TTTTACCATT TCGCTTGGCA TCTTCAATAG CTTCTGGAAT AATATCCATA CCTCTGAGTC 720 

TTTTTACTTT CTTTGCAAAG GCAAATCCAA TCGTTCCAAC TCCACAATAA GCGTCAATCA 780 

AATGGTCTTC TTTATCAACA TCCAGCGCTT TTACTGCTTC GCTATAGAGG ACTTCTGTTT 840 
GCTCAGGATT TAGTTGATAA AAAGCTCGAG GGGATAGTGA AAATTCATAA TTGAGTACAC 
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CTTCTTGAAT ACTCTCTTGC CCCCAGATAA TCTCTGTCTT TTCACCATAT ATCTCACTGG 9 60 

TTTTAGCTGT ATTTGTATTA ACAGCTACTG TCACAACTTC TGGGAAATCT TTAACCAACT 1020 

CTTTTACCAA TTGAGTTAAA TTAAGCTGGC GGTTTGTAAC AATAATAATC TGAACCTGTC 1080 

CGGTCTTTCT CGCGCGTCGG ACCATAATAG TACGGACACC TAGAACTTTT CTCTCATCCG 1140 

TGATTGGAAT CTGGTGATAA GTAAGTAATT CTGCTAAGCG ATTAGCAATC ACTTGGGTTT 1200 

CCTTATCTTG TACCAGGCAG TCTTTCAACT CTACTAAATA GTGAGAGTTT TGTGCATATA 12 60 

AGCCCGCCTT GACCTGATTT TTAAATTTTC GAGTCTGAAA TTCTAACTTA GCTCTGTAAT 13 20 

ATTTTGGTTC CTGCATTCCA ATAGTTGGAC GAATTTCATA ATTTTCATAT CCTGCAGGAG 13 80 

CAAATTTTTT CAGCGCTTGA TGAAGTAAGT CCGTCTTGAA CTCCAGCTGC TTATCATAAT 1440 

GCAGGTGCAT GATTTGGCAG CCTCCGCATT CATTATAAAT AGTACAAGAT GGCACAATTC 1500 

GAAATTTAGA CTTCTTGTTG ACCTTCAGTA ATTTTGCTTC AACAAAGTTG CGTCTAATAG 1560 

AAGTAATCTG ACAATAGATA TCTTCGCCTT TGAGAGCTCC TGGTACAAAG ACTAATGTTT 1620 

TTTGGTAAAA GCCGATTCCC TCACCGTTAA TTCCCATGCC CTTGATTTTT AATGGTATTT 1680 

TT 1682 
(2) INFORMATION FOR SEQ ID NO: 242: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

TTAACTTTGG TCAATTCTTT AAAGTCATCC TCTGTAAGCA TGTCTAACCA TTGATGTTTC 60 

CCTTTATTGC TAAAATCACC AATTCCGACT ACAGCTATAT CTAAATCTTT CCAACTATTT 120 

TTCAAATTTT CAAAATATCT TGATTGCAAA ATACCATCTG CTAACAATTT ATTTTCTTGC 180 

ACAATCGTTG CATTCATAAA TGTACACTCT CCATGAAATT TTCTAGACAT TTCATAAATC 240 

AGTGTATTCA CATGGTATTT AGCGTGTATG TGACTAGGAC CACCTGCTAG AGGATAGAAG 300 

TGAACATTTC GGACACTTTT ACTGTGAATT AAATCTACTA AATTACTTAA ACTTTTCCCC 360 

CAAGAAAAGC CAATTTTCAT ATTATCATCA ATTAGATTCC TAAGGACCCC TGCTGCAACT 420 

TGAGAAATTC TTTCAGATAA AATTGTTGGA GTATCATCAA ATTCATTTGG AATAATTTCT 480 

AAACTTTCCA AACTGTATTT TTCTTTTACA TAATTTTCCA ACTTAAACAT ATTGGTATCA 540 
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AAATTCTCTA TTTCAATTTT AACAATTCCT ACATTCCTTG CTTCTGTTAA CATTCTACTA 600 

ATAGAGGTTC TATAAATTCC TAATTTTGCT GCTATTTGTG ACTGATTTAA GTTTTCAATA 660 

TAATACAGAT AAGCAATTTT AGAAAGCAGT TTATTCCTAT CTTGATTCAT ACACTTAACC 720 

TCTTACGAAA CTACCTTAAC CATTATCCCA GCATTTTCTA ATGTAGCTAT ATTTTGTTTA 780 

GAAAGTTTTT CGTCTGTTAT TACTTCATAG ACTTGACTTA AAGCAAATCT TCTTACTGTA 840 

CCTCTTTTAT CAAATTTACT TGAGTCAGTT AGGACAATGA CTTTATCCGA CACTGCTGAA 900 

ATATATTGAA CTACCTCACT GCGCATTAAA TCTTTTCCCG TAAAGCCCAT CTCTTTATCG 960 

TAACCATCTG TCCCAACAAA AGCTTGACAC ACATGAAAAG TCTGTATCAT TTCTTTTAAT 1020 

AAAGGTCCTA CAGTCACCTG TGAATCTTTC TGAAACTCAC CACCAAGAAC AATAACACGA 1080 

CATGAATCAT AAGCTCTCAC AAAATTTGCT ATAAAAAACG AATTTGTTAC AATCGTAACA 1140 

TTTCTTTTTT GCTTGCAAAT TTCCTCAGCA AGTAAAGCAC AGGTCGATCC AGATTCTATC 1200 

ATTATTGTTT CATTATCTGA CACCAATTTT ACTGCTTCCT GAACAATTTT TCTCTTAGTT 1260 

TCATAATTAA TTGACAAACG TACATTTAAG TCATCTCCAC TATTTAATAC AGCATATCCA 13 20 

TGCTCTCTGT GTAATAAACC TTTTGACTCT AATTTATCTA AATCTTTTCT AATCGTTACT 13 80 

TTCGATACAT TTAATTTTTC CGATAATGTA TTAACGTCGA TCTTTTCATA TTCTGATACT 1440 

AATTTAATAA TTTGTTCCAA TCTTTTCATT TTACACCTCC GTTTTATTCT ACCAAAATAA 1500 

AAAGCAAAAA ACAACAAATT AACCTTTCCT TCGTAATTGT TTTTCTTTCG TTTTTGTGAT 1560 

AGGATAGACT TATGAAGAGG AGGAACTCTT ATGGAAATAT CTAAAGGAAT TATTTTTAAT 1620 

ATTCAACACT TTTCAATTCA TGACGGTCCG GGTATTCGTA CAACTGTTTT TTTAAAAGGA 1680 

TGTCCTCTGC GCTGTCCATG GTGTTCTAAT CCTGAATCTC AAAGAATGAA ACCTGAAAAA 1740 

ATGAAAGATG CTCAACGAGA GAAATTCACC TTAGTCGGTG AAGAAAAGAC TGTAGAAGAA 1800 

ATTATTACAG AGGTATTAAA AGACAAAGAA TTTTACGAAG AATCCGGTGG AGGTTTAACT 1860 

TTATCAGGAG GTGAAATATT TGCTCAGTTT GAATTTGCTA AAGCCATCTT AAAATCAGCT 1920 

AAAGAACATC ACATACACAC TGCCATTGAA ACTACTGCCT TTGTTGATCA TGAAAAATTT 1980 

ATTGATTTAA TTCAATATGT GCATTTTATC TACACAGACC TAAAACATTA TAATTCTATA 2040 

AAACATAAAA AAGTGACTGG GGTTTTTAAT CAAATGATTA TTAAAAACAT TCATTATGCT 21O0 

TTTTCACAAA ATAAAACTAT CGTTTTAAGA ATCCCAGTTA TTCCTAATTT TAACAATAGT 2160 

TTAGAGGATG CAGAAAAATT CGCTACTCTA TTTAACTCAT TAAATATCGA CCAAGTTCAA 2 220 

CTACTCCCTT TTCATCAATT TGGTGAAAAC AAATATCGTT TATTAAATCG GAAATATGAA 2280 

ATGGATGGAA TCAACGCACT TCATCCwGAA GATCTTATTG ATTATCAAAA GGTATTTCTG 2340 
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AACCACCATA TTAATTCTTA TTTCTAGTTT ATTTCCTTGA AATGCTCTAG CTATTTGCAG 
ATAACAAGCA TCTATAATAC ATACTTAACT TTTCAAAAGG TTTAGCTAAA AAATTTTAGC 
CAAACCTTTT CTATTTTACC TTGCTCTAGA ATTTTTAAAC TGCTATACTT ATCACAAAAA 



(2) INFORMATION FOR SEQ ID NO: 243: 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2359 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANOEONESS : double 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 3: 

CGTGCTTGGG GGCTTGTGGT CAAAAGGAAA GTCACACAGG AAAGGGGATG AAAATTGTGA 60 

CCAGTTTTTA TCCTATCTAC GCTATGGTTA AGGAAGTATC TGGTGACTTG AATGATGTTC 120 

GGATGATTCA CTCAACTAGT GGTATTCACT CCTTTGAACC TTCGGCAAAT GATATCGCAG 180 

CCATCTATGA TGCAGATGTC TTTGTTTACC ATTCTCATAC ACTCGAATCT TGGGCAGGAA 240 

GTCTGGATCC AAATCTAAAA AAATCCAAAG TGAAGGTCTT AGAGGCTTCT GAGGGAATGA 300 

CCTTGGAACG TGTCCCTGGA CTAGAGGATG TGGAAGCAGG GGATGGAGTT GATGAAAAAA 360 

CGCTCTATGA CCCTCACACA TGGCTAGATC CTGAAAAAGC TGGAGAAGAA GCCCAAATTA 420 

TCGCTGATAA ACTTTCAGAG GTGGATAGTG AGCATAAAGA GACTTATCAA AAAAATGCGC 4 80 

AAGCCTTTAT CAAAAAAGCT CAGGAATTGA CTAAGAAATT CCAACCAAAA TTTGAAAAAG 540 

CGACTCAGAA AACATTTGTA ACACAACATA CAGCCTTTTC TTATCTAGCG AAGAGATTTG 600 

GGCTTAATCA ACTTGGTATT GCACGTATCT CTCCTGAACA AGAACCAAGT CCACCACAAC 660 

TAACAGAAAT TCAGGAATTT GTTAAGACCT ATAAGGTTAA AACGATTTTT ACAGAAAGTA 720 

ACCCTTCTTC AAAAGTAGCT GAAACTCTTG TCAAATCAAC AGGTGTGGGT CTTAAAACTC 780 

TGAATCCTTT AGAGTCAGAC CCACAAAATG ACAAGACCTA TTTAGAAAAT CTTGAAGAAA 840 

ATATGAGTAT TCTAGCAGAA GAATTAAAGT GAGGAAAGAA TGAAAATTAA TAAAAAATAT 900 

CTAGCAGGTT CAGTGGCAGT CCTTGCCCTA AGTGTTTGTT CCTATGAGCT TGGACGTTAC 960 

CAAGCTGGTC AGGATAAGAA AGAGTCTAAT CGAGTTGCTT ATATAGATGG TGATCAGGCT 1020 

GGTCAAAAGG CAGAAAACTT GACACCAGAT GAAGTCAGTA AGAGGGAGGG GATCAACCCC 1080 

GAACAAATTG TTATCAAGAT TACGGATCAA GGTTATGTGA CCTCTCATCG AGACCATTAT 1140 
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CATTACTATA ATGGCAAGGT TCCTTATGAT GCCATCATCA GTCAAGACCT CCTCATGAAA 1200 

GATCCGAATT ATCAGTTGAA GG ATTCAGAC ATTGTCAATG AAATCAAGGG TGCTTATGTC 1260 

ATTAAGGTAA ACGGTAAATA CTATGTTTAC CTTAAGGATG CAGCTCATGC GGATAATATT 1320 

CGGACAAAAG AAGAGATTAA ACGTCAGAAG CAGGAACGCA GTCATAATCA TAACTCAAGA 1380 

GCAGATAATG CTGTTGCTGC AGCCAGAGCC CAAGGACGTT ATACAACGGA TCATGCGTAT 1440 

ATCTTCAATG CATCTGATAT CATTGAGGAC ACGGGTGATG CTTATATCGT TCCTCACGGC 1500 

GACCATTACC ATTACATTCC TAAGAATGAG TTATCAGCTA CCGAGTTAGC TGCTGCAGAA 1560 

GCCTATTGGA ATGGGAAGCA GGGATCTCGT CCTTCTTCAA GTTCTAGTTA TAATGCAAAT 1620 

CCAGCTCAAC CAAGATTGTC AGAGAACCAC AATCTGACTG TCACTCCAAC TTATCATCAA 1680 

AATCAAGGGG AAAACATTTC AAGCCTTTTA CGTGAATTCT ATGCTAAACC CTTATCAGAA 1740 

CGCCATGTGG AATCTGATGG CCTTATTTTC GACCCAGCGC AAATCACAAG TCGAACCGCC 1800 

AGAGGTGTAG CTGTCCCTCA TGGTAACCAT TACCACTTTA TCCCTTATGA ACAAATGTCT 1860 

GAATTGGAAA AACGAATTGC TCGTATTATT CCCCTTCGTT ATCGTTCAAA CCATTGGGTA 1920 

CCAGATTCAA GACCAGAAGA ACCAAGTCCA CAACCGACTC CAGAACCTAG TCCAAGTCCG 1980 

CAACCAGCTC CAAGCAATCC AATTGATGAG AAATTGGTCA AAGAAGCTGT TCGAAAAGTA 2040 

GGCGATGGTT ATGTCTTTGA GGAGAATGGA GTTTCTCGTT ATATCCCAGC CAAGGATCTT 2100 

TCAGCAGAAA CAGCAGCAGG CATTGATAGC AAACTGGCCA AGCAGGAAAG TTTATCTCAT 2160 

AAGCTAGGAA CTAAGAAAAC TGACCTCCCA TCTAGTGATC GAGAATTTTA CAATAAGGCT 2220 

TATGACTTAC TAGCAAGAAT TCACCAAGAT TTACTTGATA ATAAAGGTCG ACAAGTTGAT 2280 

TTTGAGGCTT TGGATAACCT GTTGGAACGA CTCAAGGATG TCTCAAGTGA TAAAGTCAAG 2340 

TTAGTGGAAG ATATTCTTG 2359 
(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

TTCTTTCTGC TATAATCGTA TAAAATACTT ACTTTAGGAG TTCTTATGAA AGTTGTTAAA 60 

TTTGGAGGTA GTTCTCTTGC CTCTGCTAGT CAATTAGAAA AAGTTTTAAA CATCGTCAAA 120 

AG CG ATT CAG AGCGTCGTTT TGTAGTCGTT TCTGCGCCTG GTAAACGCAA TGCTGAAGAT 180 
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ACTAAGGTTA CGCATCCCCT GATTAAATAC TACCGCCACT ATGTTGCGGG TAACGATATT 240 

AGCAAGAACC AAAGCTGGAT TATCGACCGC TATGCTGCTA TGGTTAGTGA ATTGGGACTA 

AAACCAGCTG TGCTAGAAAA AATTTCTAAA AGCATTCACG CCTTCCCCAC TCTTCCTATT 

GAAGAAAATG AATTTCTCTA CGATACTTTC CTAGCAGCCG CTGAAAATAA CAATGCCAAA 

TTCATTGCTC CCTACTTTAA CCAAAATGGT ATCGATGCAC GCTATATGCA CCCTAGAGAA 

GCTGGGATTG TGGTCACAAC TGAACCTGCT CACGCTCGCA TCATTCCATC AACTTATGAC 

AAGATTGAAG AATTGACAAA CACCAATGAa" GTCCTTGTCA TTCCTGGTT^CTTTGGTGTC 

ACTAAGGAAA ATCAAATCTG TACTTTCTCA CGTGGAGGTT CTGATATTAC AGGTTCTATC 660 

ATTGCTGCTG GTGTCAAAGC TGACCTCTAT GAAAACTTTA CGGACGTTGA TGGTATCTTT 7 20 

GCAGCCCACC CTGGTATTAT CCACCAACCA CACTCGATTC CTGAGTTGAC CTACCGTGAA 780 

ATGCGCGAGT TCGCCTATGC AGGCTTCTCA GTCCTTCATG ACGAGGCTCT TCTTCCTGCC 840 

TACCGTGGAA AAATTCCTCT GGTTATCAAG AATACCAACA ACCCTGACCA TCCAGGTACT 

CGTATCGTTC TAAAACACAG TAATGATGAA TTTCCAGTTG TCGGAATTCC TGGTGACTCA 

GGCTTTGTCA GCATTAACAT GTCGAAATAC CTCATGAACC GTGAGGTTGC ATTTGGCCGC !020 

AAGGTTCTGC AAATCCTGGA AGAACTTAAC AT 

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 855 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



900 
960 



1052 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

CCCTCGAAAA CTAAGCCGAT GAAGTCAGAA CACTTCAATC CTGTTCGTGA CTGGTGGGAA 60 

AATCGTGAAG AGATTCTGGA ACCTAAGTTC TACAAATCTA AATCATTTAC ACCTAGTGAA 120 

TTGGCTGAGT TGAATTATAA TTTAGACCAG TGTGACTTTC CAAAAGAGGA AGAGGAAATC 180 

TTAAATCCCT TTGAGTTGAT TCAGAATTAT CAAGCGGAAA GAGCAACTTT AAATCATAAG 240 

ATTGATAATG TATTAGCTGA TATTTTGCAG TTGTTGGAGG ACAAATAATG ACACCAGAAC 300 

AACTTAAAGC AAGTATTCTC CAAAGAGCGA TGGAAGGGAA ATTAGTGCCG CAAAATCCCA 360 

ATGACCAACC TGCAAGTGAA TTATTAAAGA GAATTAAAGC TGAAAAAGAA AAACTTATCA 420 

GTGAAGGAAA AATCAAACGA GATAAAAAGG AAACTGAGAT ATTTCGTGGT GATGATGGGA 480 



WO 98/18931 



PCT/US97/19588 



1268 

AACATTATGC GAAGTTTGCT GATGGAAGCA CTCAAGAAAT TGATGTTCCT TATGATATTC 540 

CTGATACTTC GGAGTGGGTG AGGATAAAAT CAATTTATTG GAATTTTGCG CAAAATAAGC 600 

CAGAGAAATC CTTTAGGTAT ATAGATACGT CTAGTATTGA TAGAAAAAAG AACATAATCA 660 

ACTACAAAAA TCTACAATAT CTTTCACCTG AACAAGCGCC TTCCCGTGCT AGAAAATTAG 720 

TTTCGCAGAA TAGTGTCTTA TTTTCAACAG TTAGACCATA TCTAAAAAAT ATTGCTGTAG 780 

TTAGAGAACT TAAAGAGTAT TTGATAGCTA CTACAGCATT TAATGTTTTG GGATACTTTA 840 



CTTAACGAAA CATAT 

(2} INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 



855 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 
TTTAGGAAGG CTATCCGTAA TTTTACAAAG GATTTAGATA TTACAGAGGA ACATTTAGAT 60 

ATTATCAAAA CACAGATCTT TGGCGAATTT TTCAGTAGCA TGAACTCTCT TGAATTTATT 120 

GCAACGCAAT ATGATGCTTT TGAAAATGGT GAGATAATTT TTGATTTGCC GAAAATTTTA 180 

CAGGAAATTA CTTTAGAGGA TGTCCTTGAT GCTGGACATC ATTTAATAGA TGATGGTGAC 240 

ATAGTTGATT TTACAATATT CCCATCGTAG TAACCTATTA TAATAGACAC TAGAAAGAAG 300 

GGATGACAAG TATGAGAAAA AAAACAATTG GAGAGGTTTT ACGATTAGCT AGAATCAATC 360 

AGGGATTGAG TTTAGATGAA TTGCAGAAAA AGACAGAAAT CCAGTTAGAT ATGTTCGAAG 4 20 

CAATGGAAGC AGACGATTTC GATCAACTTC CAACTCCTTT TTACACGCGT TCTTTCTTGA 480 

AAAAATATGC ATGGGCTGTT GAGTTAGATG ACCAAATTGT TTTGGATCCT TATGATTCTG 54 0 

GCAGTATGAT TACTTATGAG GAACTAGATG TTGATGAAGA TGAGTTGACA GGTCCTAGA: 600 

GTTCAAGTAA GAAAAAGAAG AAAAAAACAT CATTTTTACC TTTATTTTAT TTTATCCTGG 660 
(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1805 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
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CCGGTTGCAC AGGATCGTGC ATAGTCAACT CTTCAAGTAT AGCATATCTC CTATTTTCTT 60 

ACAAGTAATA ACACCTAAAA TGAAGCTTTT TCTTTTACTT TTTTCTGCCA AGAGGCAAAA 120 

AGCATGCTGA GGTAAAAAAC GCTCATCATA ATAGGAACAC CAAGAATGGT CTTTTCATGA 130 

TAGAAAATCG TCAAATAGCC TGAAAAGACA ACGCCAAGGA CAAAACTACT AAGCAGGCTA 240 

ACAAATATGA ATCCTTCACG CAAAAAAGGA GTGTGCTTGG TTCGGAAATA ATCTCCAAAA 300 

GCCAGCATGG TCCGTTTGAT ATTCCCTGTC ATAAAAGCGT TATTATAGGC AATACCCGAC 360 

ACTTCTCCAA AAGCAGTTGT CACCAGTCCC ATACAGAAGG CCAAGGCCGG CACTAGATAG 420 

ATATTATCCA CAGTTTGCGG CACAAAAGCA ATAATGATTG ATAAGATTGC CAAGGGAATC 4 80 

AAGGACAGAA TACGTTTTTT CACAATTCTC AATTTTTCCT TATAAATCGT TAATAAAAAG 540 

ACTCCCATCA TAAACGCTAG CAAGGTGAGA ACCTTGTCCC TAACATCCGA AACATTATTT 600 

TTAATTAATT CTACTGAAAG AAAGACAACA TTTCCAGTTT GTCCAGCTAC AAGGGTATTC 660 

CCGCGAACAA TAAAAGTGTA AGCATCCACA TATCCAGCAC AAAACGTCAA AAAAAGTGCT 720 

AACCTTTTAG ACTGACGTGA TATTTTTCTT ATAGGTAATA ACCTCATTTT ACCTCCCATT 780 

GTATTTTCTC TTAGAAATAT TGTACCATTT TCTTTCTAAA AAATCGTAGG CTACCATTTA 840 

GATTTTACTA TTAGCATAAA AATAATAATA GACAACTATT TATCCAAAAA TAGATAGATG 900 

TAACATGTTT GCAAACAAAG CATACGAACC TTTAGTAAAA TCATTTCCAT GAAACTAGAA 960 

TAGAGCCCTC TTAGCAAAAA TCATTATTTT AATTTATTTC TAATCACTCC TTGACATAAA 1020 

TAACTCTCAC CAATAAAAGA CTATGTCTTA AAAAAATGGT ATAATAAAAT CAATACTTGG 1080 

GCTTGATGGC TATGCTACTA ATAACAATTA GGAGAGAAAA TCAGGCACTT GTTAACAACA 1140 

AGGATTATCC CCTTGAGATG AAAGGAACTT TAGAAATCTT ATGATGAACA TGCAAAACAT 1200 

GATGCGTCAA GCACAAAAAC TTCAAAAACA AATGGAACAA AGCCAAGCTG AACTTGCTGC 1260 

TATGCAATTT GTTGGCAAAT CTGCTCAAGA TCTTGTCCAA GCGACCTTAA CTGGCGATAA 1320 

GAAAGTTGTC AGCATTGATT TCAATCCAGC TGTCGTTGAC CCAGAGCACC TTGAGACTCT 1380 

TTCTGATATG ACCCTTCAAG CCATCAACTC TGCTCTTGAA CAAATCGATG AAACTACCAA 1440 

GAAAAAACTG GGTGCTTTCG CTGGGAAATT ACCTTTCTAA AAACAAGGAG CTAGAACAAT 1500 

GCTTGTCGAT AACAAAGGCT AAGAAAGGTG CAAAAATGAC TCTATAATAT TTGTAGTGGG 1560 

TAAATCCCCT ATGGATATTA TGGAGCCTAT TTTTGTGTAG AAAAAAGTCC CATATGACCT 1620 

ATAATGAAAA GCCACAAAAC AACTCATTAG AAAGAATCAT ATGGAACAAT TACATTTTAT 1680 

CACAAAATTA CTAGACATTA AAGACCCTAA TATCCAGATT TTAGACATCG TCAATAAGGA 1740 
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TACACACAAC CwAATCATCC CCAAACTGGr CTATGAACCT CCATCTTGTC CTGAGTGCGG 1800 



AAGTC 

(2) INFORMATION FOR SEQ ID NO; 2 48: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2516 base pairs 
(B> TYPE: nucleic acid 
(C) STRANOEDNESS: double 
(D> TOPOLOGY: linear 



1805 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

CTG CATC TAG TTTGTTTCTC CCTACAGTTT TAGCTAGACA GATTGGAGAT TATGATTTAA 60 

CGTCGCCGCG TTGGGGTTCG GATACAACTA GTGAGCTTGA GAAAGAAAAC TCCTCTGCTG 120 

GAATTAATAA TAATGACACC ACTGGTGGCG GTAAAAGGTT AAATACCTCT ATTCGTAGCG 180 

CCTATACTGG GTCAGATATT ACCCCGGTAT ATTCATTGGG GTCTGGCTCT AGGATTGTCA 240 

TGTACTATAA TGGAGGTGGT GACAATTATA TTGGTTCTGG TACT AG ATT A GCTATGGCGC 300 

CACAATTTCG AAATCATGTA AGAATTCATA CTTCAGGTTC TTGGAATCCA GATTCTTATT 360 

AACTTACTTG TCAGAGTAAG CCTTAAAGAT GGTTGATTGT GGGTGTAGCA TGAAAAAAGA 420 

ATGCTACACC CTATTTTTAT TATAAGGAGG AGTAAGGATG GAATTTTTCA TTTGTAATCT 480 

TGTACGAGTC GTTCAATCAC CTCGATTTTA TATGTCTTTA TTTTTGACCC TTCTTTGCAT 54 0 

GAGTTTAGGA AATTTCCTTG CTTTCAATGG TATTTATAAA ATTGAAGGTT TATCGATTTT 600 

TTTTGCCCCT TCTTCTATTC GAGGATTTTC ACCGATTAGC CTAGTAGCTG CACTTATCTG 660 

TACACTGCCC TATTCTAGTC AGATAATAGA GGATGCTGAG AGTCATTTTC TAACAGCACA 720 

ATTGTGTCGA ATTTCTAAAA AGAAGTATCT GGCTATTGTG GGTAGTACTG TAATTATTTC 780 

TTCTTTTCTA GTCTTTTTTC TCCCCTATTT ATTATTATTA GGAATTAATC TTTTAGTGAC 840 

TCCTTATCAG GAAATTTATA TTGGAGATTA TAGTGGTGCC TTAAAAGAAT TATTTGATTC 900 

CAATCAGTTT CTCTATAGTC TTGTAACGAC TCTCTGGTAT GGAGTTT G CG GCGCTGTGTT 960 

CTCTATTTTT GGACTAGCTA GTGCTTTGCT AGTGAAGAAA AAAATAGGAG CTATTTTCAT 1020 

CCCAGTTGCC TATATGATGG TTGGTGGTAT TTTTTGGGCT ATTTTAGGGC TATCTTACTT 1080 

AGAACCTGTG ACAACGCTAG CTTTGGGATA TCAGAAAGAT ATCAGTCTTT CCTTAGTTAG 1140 

TGCTCATCTT GCTTTTATTT TATTTGTTAG TTGTTTGGTT GTTTATGGTA CATTTTTTCT 1200 

ACATTCAGAG GACTATGTAT AATGAAACAA TTTGTTCAAT TTTATAAAAA AGATTTCTTA 1260 

GCAGTATTGG TTTATTTTAT ATTACTGCTA TCCTCTGTTT TATCTAGTAC AGTATATTTA 1320 
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TTGCGCtCTC GCCAATATTC AATCCATCCA AATCTATTAG AATGGATCTT AGTTTTACTT 1380 

CAAGATATCA CGACTGGAGT ATATTGCTTT CCCTTCACAT ATATATTGTT CTTTTTTTAT 14 40 

TTGATGAATA ACTATTTTAA TAGGTTGGAG TGTCGCATTC GTCTGAAATC AATTAAGCAC 1500 

TTTACCAGTT TTAGTTTCAA ATTAGCAGCT CTTAGTACGG GGATTTGGAC GGCGACTTTA 1560 

TTTTTATTGA TTTTTCTAAT TGCATTTAGT AATGGTTTTA GCTTCTCTTT GGAGATAAAG 1620 

GAGCTTGATT TTTTAAGAGA ATTTTATGGT ATAAGTATTG CAAACAATGC TACTTTCTTT 1680 

ATAGGATTTT TTTTCTCTTA TATAGCATAC TATTTCTTTT TATCCTTACT TACTATTAGC 1740 

AGTTTTTCTT GCTTTAAAAA ATCAAACATG AGCTTAGTAT TTCTGTTTAC TTTTTTATTT 1800 

GTAGAATCCT TATTC7GGAT TTATCAGTTG GACAATCGGA TAATTGGATT ATTGCCAATT I860 

TTTCAGTATA TGGTAAATTC CAATCCGTAT GCATTGATTT ATTGGCTTAC ATTACTATCT 1920 

ATCATAATTC CATTGACTGT ATTTTCTGTT CATAGAAACT GGAGGAGAGT GTAAAAGTTG 1980 

GAAATGGGAA AGTTAAGTAG TCACATGTGG AGGTTGAATC AGATAATCTA TACCAAGTAC 2 04O 

TTTTGGGGTT ATGTTCTTTT TTGGATATTG ATTTGTTTAG GATTATGGTA TTGGTTAGAA 2100 

GGAAATGATA GACTTGTTAT AGAAATTTTA AAAGGGCCTA ATCTGAGTCA AAACTCTTTT 2160 

TTAGTCTTAT CTATATGGTT GCTTCATTGG TTTATTATTC ATACATTTTT TCTAGCAGTT 2220 

GTATATCGTA GAAGAGCATC CGATTTCTTT ATGGAAGTGA TTCGATTTTC TTCTATTAAG 2280 

CTCTGGATTA GGTATCAGAT TTGGACCTGT TTTCTTTATC GACTCATTTT AATCATGCTA 2340 

AAAGTTCTAG TGATTCAATT TATGTTACAG TTACCAAACT GCGATATAGG AGTTTTGTTT 2400 

ATAGTTGATT CTTTGAATGC TTGTGTGTTA CTCTTGTTTT GCTTTATGTT ATACGCACTA 24 60 

GCAGCGAATG TACAAATGAA CTTTGCTTGC GTTAGTTTCT TTTTACTCAT GATTGG 2516 

(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(Ai LENGTH; 1364 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

CGGTGTTTTT TTGTAAATTT TCTAGCACTT GTATGGTAAA ATAGATACAG GTGTTCATTA 60 

AACTAGACTA AAAACCTATT TAAGCAGGCA AAATGAACAA ATACCAACAA TTATTTAAGC 120 

AAATCCAAGA AACCATTCAA AACGAGACTT ACGCTGTCGG AGATTTCCTT CCTAGCGAGC 180 
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ACGACCTTAT GGACCAATAT CAAGTGAGTC GTGATACCGT CCGAAAGcCC TGTCTCTCCT 240 



300 



CCAAGAGGAA GGATTGATCA AAAAGATAAG AGGGCAACCT TCTCAAGTCG TCAAAGAAGA 

AACCGTCAAT TTCCCTGTAT CCAACCTAAC CAGCTACCAA GAACTAGTTA AAGAACTTGG 3 60 

ACTGCGCTCT AAAACCAACG TGGTCAGTCT GGACAAGATT ATTATTGATA AAAAATCCTC 420 

ACTGATAACC GGTTTCCCAG AGTTTCGGAT GGTTTGGAAG GTGGTCCGCC AGCGTGTGGT 4 80 

GGATGATCTG GTATCCGTTC TGGATACGGA CTATCTGGAT ATGGAACTCA TCCCAAATCT 540 

CACTCCCCAA ATTGCTGAGC AGTCTATCTA TTCTTATATA GAAAATGGCC TCAAACTCCT 600 

TATTGATTAT GCTCAGAAGG AAATCACCAT TGACCACTCA AGCGACCGAG ACAAGATTCT 660 

CATGGACATT GGCAAAGACC CTTATGTCCT TTCGATTAAA TCAAAAGTCT ATCTCCAAGA 720 

CGGACGCCAA TTTCACTTTA CCGAAAGTCG CCATAAGTTA GAGAAATTTA GATTTGTAGA 780 

TTTTGCAAAA CGCAAGAAAT AAAAGACTGA GACACCAGAT CTCAGCCTTT TTCGGC7CTA 840 

TAATATTTGT AGTCGGTAAC CCCCCTATGG ATATTATGGA GCCTATTTTG TGTAGAAAAA 900 

AAGTCCCATA TGACCTATAA TGAAAAGCGA CAAAACAACT CATTAGAAAG ATTCATATGG 960 

AACAATTACA TTTTATCACA AAACTGCTCG ATATTAAAGA CCCAAACATC AAGATTCTAG 1020 

ACATCATCAA TATGGATACC CACAAAGAAA TTATCGCTAA GCTGGATTAT GAGGCTCCAT 1080 

CTTGCCCTGA TTGTGGAAGT CTAATGAAGA AATATGACTT TCAAAAACCG TCTAAGATCC 1140 

CTTACCTCGA AACAACTGGT ATGCCTACTA GAATTCTCCT TAGAAAGCCT CGTTTCAAGT 1200 

GCTATCATTG TTCTAAAATG ATGGTCGCTG AAACTTCTAT CGTCAAGAAG AATCATCAAA 1260 

TTCCTCCTAT TATCAACCAA AAAATTGCGC AAAAGTTGAT TGAGAAGATT TCTATGACCG 13 20 

ATATTGCTCA TCAGCTGGCC ATTTCAACTT CAACTGTCAT TCGG 13 64 
(2) INFORMATION FOR SEQ ID NO: 250: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEONESS: double 
(D) TOPOLOGY: linear 



(xij SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

CCATGAAGAC CGCTTGGAAT TGGAATGGCA CAAGTCTTTG TTGAATGGTC TATTCCCATT 60 

GACAATCGCT GGAGGAATTG GACAATCTCG TATGGCCATG TTCCTACTTC GCAAGAGACA 120 

CATCGGAGAA GTGCAAACAA CTCTTTGGCC TCAAGAAGTC CGCGATACTT ACGAAAATAT 180 

TTTGTAGAGA ATCGAACCGC AAGGTTCGGT TTTCTTTCTC TTTTTGTCTA TAATTTGCTA 240 
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TAATAAACAG TATGAAAATC CTATCAGGAA TCTATGGGGG 

AAGGCAACAC GACAAGACCT ACTTCCGATA ACGTTAGGGG 

GTCCCTACTT TGAAGTGGGA CGACTCTTGG ACCTTTATGC 

TCGAAGCAGT ATCGCGTGGC ATGTCCAGTG CTGTTTTGGT 

GACCATCGTG CCTGAAAATA TCCAGATGAC CAAGGAAGTT 

GATGGATGCA GAAAGGGCAT TGGAACAGGT ATCTGGGGAA 

CCCTCCCTAT GCCAAGGAAC AAATCGTAGC AGATATTGAA 

TTTTTCTGAA GATGTTATGG TTGTGTGCGA GACGGATAAA 

AATTCCCTGT CTGGGTATCT GGAAGGAAAA GATTTATGGA 

TGTCAGATAA GATTGGCTTA TTCACAGGCT CATTTGATCC 

ATATCATTGA ACGGGCGAGC AGACTTTTTG ATAAGCTTTA 

CCCACAAACA AGGATTTCTC CCTCTTGAAA ATCGTAAACG 

AACATTTGGG AAATGTTAAA GTCGTGTCTT CTCATGATAA 

AAAGACTGGG GGCTACTTCC CTAGTGCGAG GTTTGAGAAA 

AAGCCAGTTT TGATTACTAC AATCATCAGC TGTCTTCTGA 

ATAGTCCACC TGAACATCTC TATATCAGTT CATCAGGCGT 

GTCAGGATAT TGCCTGCTAT GTTCCCG 

(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3652 base pairs 
IB) TYPE: nucleic acid 
(C) STRAND EDNESS : double 
{01 TOPOLOGY: linear 



ACGTCCCCTC 

AGCCATTTTT 

AGGTAGTGGT 

GGACCGAGAC 

GGAAAATTTC 

TTTGACCTCG 

AAAATGGCTG 

GCCGTTGAAC 

ATTAGTAAGG 

GATGACAAAT 

TGTGGGTATT 

CGGGTTAGAA 

ATTGGTGGTC 

TGCGTCGGAT 

TATAGAGACT 

TAGAGAGCTT 



AAGACACTAG 

AACATGATTG 

GGTTTATCTA 

CGTAAGcTCA 

AACTCCTCAA 

TTTTCTTAGA 

AGAGAGAGCT 

TTCCAGAAGA 

TGACAGTCTA 

GGGCATCTGG 

TTTTTTAATC 

AAGGCTGTGA 

GATGTCGCAA 

TTGCAATATG 

ATTTATTTAC 

TTGAAGTTTG 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1227 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

CCGGTCAAGT TAAAAACGCT ATTTCTTCCC ATTTTATTTA TTTTTTAGGA GTGGTAACCT 60 

ATCAAAATAG CCCAAGCGTT CTCACCCGTG TGAGTTTGAA TAATGGAACC CGTTTCCAAA 120 

ACAGAAATTG GCTTTTCAAC ATAAGCTTGT AAGCTTTCTT TCATCTCTTT TGCCCAATCA 180 

TCACTACCAG AATATGAAAT TCCAATCTCT GCTACAGCAC GTTCAGAAAG CGATGTTATC 2 40 

AACTCATCTA ACCATTTTTT AAATGTTTTA GTTCCACGAC CTTTAACCAT TGGCTGCAAT 300 

TCATGGTCTT TCATTTGCAT GACAGCACGG ATATTGAGAA GAGAGCTCAA CAAGCCAGTT 360 
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ACACGGCTAA 


TTCGTCCACC 


TTTGACAAGA 


TTTTCCAAAG 


TTGAAACACC 


AATATAAAGC 


420 


TCTGTATGCT 


TTTTAACCTC 


TTCTACATGA 


GATAAAATTG 


CCTCCATATC 


TTTACCTTCT 


480 


TCAGCTAACT 


TCGCAGCCTC 


AACAACTTGG 


AATTTCAGGG 


CTTGGTCAGT 


GAAGGAACTA 


540 


TCAACAACAG 


TCACATCTGC 


AGTAGATAGG 


CTAGCACCTT 


GGCGTGCTCC TTCTACCGTA 


600 


CCCGAAAGAG 


CATGGGACAT 


ATGAATAGCA 


AGAATCTGGC 


CACCATCTTT 


GCATAGGTCT 


660 


TCAAAAATCT 


CAGCAAAGAC 


ACCTACAGGT 


GGCTGACTTG 


TTTTCGGAAG 


ATTCTTACTT 


720 


TCTTGCATCA 


ACTGAAGAAA 


TTTACCTTCT 


TCTTTCAAAT 


CCGCATCAGA 


ATAAACAACA 


780 


TTATCAATCA 


TTACAGATAA 


TGGAACAATT 


GTAATATCTA 


ATTGCTTTAC 


TAGTTCAGGT 


840 


TCAATAGTAA 


CAGATGAATC 


GGTTACAATC 


TTAATTTTTG 


TCATAGTATC 


AATCTTTCTA 


900 


TTTTAGGATT. 


CAGATTGGTT 


TCCTTACTTC 


TAATTATATC 


AAAAAAAAGA 


TTAAAAATCC 


960 


TAATGGAGTC 


AATCAAATTT 


TCCGTAAAAT 


TTGATATAAT 


CAACTTATAA 


GAAAAGAGGT 


1020 


GTCCTATCAT 


TAAAAAAATT 


TACCCCATTT 


TTACCATTTT 


ACTAGGTGCT 


GCTATTTATG 


1080 


CTTTTGGACT 


GACTTATTTT 


GTAGTTCCCC 


ATCATCTCTT 


TGAAGGAGGG 


GCGACAGGCA 


1140 


TTACCCTCAT 


CACCTTTTAT 


CTTTTTAAAA 


TCCCTGTTTC 


CCTCATGAAC 


CTGCTGATTA 


1200 


ATATTCCCCT 


TTTCATCCTA 


GCTTGGAAGA 


TTTTTGGAGC 


CAAATCCCTC 


TATTCTAGTT 


1260 


TACTAGGAAC 


CTTAGCTTTG 


TCCGGCTGGT 


TAGCTTTTTT 


TGAGCATATT 


CCCCTTCATA 


1320 


TTGATCTTCA 


AGGTGATTTA 


CTAATCACAG 


CCCTTATAGC 


GGGAATCCTA 


TTGGGAATTG 


1380 


GCCTTGGAAT 


TATTTTTAAT 


GCTGGAGGTA 


CAACTGGCGG 


AACTGATATT 


CTAGCTCGTA 


1440 


TTCTCAACAA 


ATACACTCAT 


ATATCCATAG 


GAAAACTGCT 


CTTTATCTTA 


GATTTTTGTA 


1500 


TTCTCATGTT 


GATTCTCCTA 


ATCTTCAAGG 


ATTTGAGATT 


GGTTTCCTAC 


ACGCTTTTGT 


1560 


TTGATTTTAT 


TGTTTCTCGT 


GTTATTGATT 


TGATTGGTGA 


AGGAGGATAT 


GCCGGCAAAG 


1620 


GCTTTATGAT 


TATCACAAAA 


CGTCCTGACC 


AACTTGCTAA 


GGCGATTAAT 


GATGACCTCG 


1680 


GAAGAGGTGT 


TACTTTTATT 


TCTGGTCAAG 


GCTACTATAG 


TAAAGAAAAT 


TTGAAAATCA 


1740 


TCTACTGTAT 


TGTCGGAAGA 


AATGAAATTG 


TGAAAACGAA 


GGAAATGATT 


CATCGAATCG 


1800 


ATCCTCAAGC 


CTTTATAACT 


ATTACAGAAG 


CCCATGAAAT 


CCTAGGAGAA 


GGCTTCACCT 


1860 


TTGAAAAAGA 


ATAAAAAGAG 


GTAATGTCGT 


GACCTCAAAA 


GTTAGACTAA 


ATCATCTATC 


1920 


TTTTGGGTTA 


CAGACAACCT 


CTTTTTTATT 


TTATTTACTC 


AACCTCTTAA 


GACCAATTCC 


1980 


GAGTTACTTC 


TTCATCAGCC 


TTTAACTGAT 


CCACTAATTG 


GTCAACTGAG 


TCAAATTTGG 


2040 


TCATATCTCG 


AATGCGATCA 


AGCCAATAAA 


CCATGACGGT 


TTCCCCATAA 


ATATCTTGAT 


2100 


TAAAATCAAA 


AATATTGACT 


TCAAAACGTG 


CTTCTTCTCC 


ATCAAAGGTC 


ACATTTTTCC 


2160 
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CGACACTACC CATAGCACGA TACTTCTGTC TTTGAATCTC AACATCAACA ACATAAACGC 2220 

CATCTGCTGG CATATAAGTA CGCTCTAAAA GCACTAAATT CCCTCTCGCA TAACCAATTG 2280 

TACGACCACG AGCATTACCA TGAACCACCA TACCTCTTCA TGGAAGCGCT GCCCCCAAAA 2 340 

CTTTTCCTCC TTCTTTCACA TTTCCATCTA AAATAGCTTG ACGGATACGA CTTCAACTAA 2400 

TCTTTCCTTT CTCATCTTCT ACAGGTGGAA CAATGATAAC TTCTCCATCA AAGTAATTCT 2 460 

TTAAATCTTC TGCTGTTTTT TTGTCAGAAC CAAATCTATA ATCAAAACCT GCAACAATAA 2520 

TTTTGGCATT CATAGCCTTG ATATAAGTTG CAAAGAATTC TTGTGCAGTG AGACTAGCGA 2S80 

ATTGACTACT AAAATCAAGG AGATATAATT CTTCTACACC TTCGCGCTTT AATTTTCTTT 2640 

CACGTTCAGC AGGCTTCAAA ATATGCAAAA ACAAATCTGC ATGATAACGC TCTAAAGCGA 2700 

TCTTTGGAGA TTCATTAAAG GTCATAACGA CGATAGGCAA CAAATCCTTT CTCGCAGCC7 2760 

TGTTGGCAAC ACGAAATAAT TCTTGATGCC CCTTATGTAT GCCATCAAAA TAGCCGAGAA 2820 

CAACGACTGA ATCAGATGGT GTGCCAATAT CTTTTTGGTT TTTTATAGGA ATAGTAATAA 2 880 

TCATAAAATA ATTATATCAT AGCGATAGCT ATTTCTGCAA CAGAAAATCT GAAATGTTGT 294 0 

TTTTTTCACA TGAAGTGTAC CTGTTTTCAA AAAGCACTTT ATTCTATCGT TGCTTAACTA 3000 

TGAACTTTGC AATATTCTTC TCAAAAACTT GTAGGACATC TTCAAAATTT TGCAAGGAGT 3060 

GATTAGACTT GTTCGGTAAC CATAAAGTGT CATACTATGC TTATGTATGA AAAAGCAATG 3120 

CAACTAACTC CTGAGAACTT TAAATTACTA ATTGGTCCCC AAAAGGTAGA ATTTAGAATC 3180 

GAGGTACACC TATGGCTGTA AAATTTACAA AATGAGACAA CTTGGGCAAG ATGTTTGAAG 3240 

AATTTCCTAA ACTCCCTGAT TTGAAGCAAG TCACTTTCCC TAATGACAAA GAAAAAACCC 3 300 

AAAACAGCAA AGAAAAACTA GATGACTGCT TTCCAACAAC TCCCATCTAC TGTGCTTCAG 3 360 

ACTGGGCTAT TTTTCTCTCC ATCTGTTAGC TTGGATTCTC AGACCGTTTC AGCTAAAGAA 34 20 

TATCTTTTCC CTTATCAGAA GGAACGGCTC AAGCCATTCA GACAAGTGAA GCGACGACAA 3480 

GCCAATATTT GAAACCAGAT AGCAGTTCTT ATACTCAATT GAAATAAAAT CTGAAGAAAT 3 540 

CGAGTAGGAA ACTCATATCA ATGTTTAACA GTGTTCTATT CCAGATTCAT ACTCAATCAw 3 600 

AATTAAAGTC CAAACTAGGA AGTTAGCCGC AGGTGATACT TTGGGTACGG CA 3 652 

(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS : 

{A) LENGTH: 743 base pairs 
(B> TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 
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(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 2S2 : 

GTACCCTGGT GCCAAAGTAC ACCAACGTTG GCTTTTTGAC AAACAATACC AATCTTGGTT 60 

TTACATCAAA GAAAATGGAA ACTATGCTGA TAAAGAATCG ATTTTCGAGA ATGGTCACTA 120 

TTATTATCTA AAATCCGGTG GCTACATGGC AGCCAATGAA TGGATTTGGG ATAAGGAATC 180 

TTGGTTTTAT CTCAAATTTG ATGGGAAAAT GGCTGAAAAA GAATGGGTCT ACGATTCTCA 240 

TAGTCAAGCT TGCTACTACT TCAAATCCGG TGGTTACATG ACAGCCAATG AATCGATTTG 300 

CCATAAGGAA TCTTGGTTTT ATCTCAAATC TGATGGGAAA ATACCTGAAA AAGAATGCGT 360 

CTACGATTCT CATACTCAAG CTTGGTACTA CTTCAAATCC GGTGGTTACA TGACAGCCAA 420 

TGAATGGATT TGGGATAAGG AATCTTGGTT TTACCTCAAA TCTGATGGGA AAATAGCTGA 480 

AAAAGAATGG GTCTACGATT CTCATAGTCA AGCTTCGTAC TACTTCAAAT CTGGTGGCTA 540 

CATGGCGAAA AATGAGACAG TAGATGGTTA TCAGCTTGGA AGCGATGGTA AATGGCTTGG 600 

AGGAAAAACT ACAAATGAAA ATGCTGCTTA CTATCAAGTA GTGCCTGTTA CAGCCAATGT 660 

TTATGATTCA GATGGTGAAA AGCTTTCCTA TATATCGCAA AGTAGTGTCG TATGGCTAGA 720 

TAAGGATAGA AAAAGTGATG ACA 74 3 

(2) INFORMATION FOR SEQ ID NO: 253: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4010 base pairs 
O) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

TTTTGGTTGA TGATACGAGG GATTTGGTGA TTCTTCTTGA CGATAGAAGT TTCAGCGACC 60 

ATCATTTTTG AACAGTGATA GCACTTGAAT CGACGCTTTC TAAGGAGAAT TCTAGTAGGC 120 

ATACCAGTCG TTTCAAGATA AGGAATTTTA GAAGGTTTTT GAAAGTCATA TTTCTTCAAT 180 

TGGTTTCCGC ACTCAGGGCA AGATGGGGCG TCGTAGTCCA GTTTGGCGAT GATTTCCTTG 240 

TGTGTATCCT TATTGATGAT GTCTAAAATC TGGATATTAG GGTCTTTAAT CTCTAGTAAT 300 

TTTGTGATAA AATGTAATTG TTCCATATGA TTCTTTCTAA TGAGT TG TTT TGTCGCTTTT 360 

CATTATAGGT CATATGGGAC TTTTTTTCTA CAATAAAATA GGCTCCATAA TATCTATAGT 420 

GGATTTACCC ACTACAAATA TTATAGAACC GAATTAATTT AATTAGAGAG CCAACTTTCT 480 

AATATAGTAA TCGCGTCATA ACAAGGTATC TATCATTCAT GGAGTTCCTC CTGTATACTA 540 
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TTACTAAACT AAAACTATTG GAGGATATTT TAATGCCACA ACCTATTGTT CCTGTAGAGA 600 

TTCCACAATC TCGTCGTTTT GATTCTAAAA AGAGAAATGA TATTCTGCTT AAAATTCGTA 660 

TTGGCAAGCT TGAAGTAAGT TTTTTTCAAT CTCTCAATCT CCAAATGGTA GAACAGCTTT 720 

TGGATAAAGT GTTGCTCTAT GACAATTCAT CTATCTAGCC TAGGGCAGGT CTATCTCGTA 780 

TGTGGGAAAA CGGATATGAG GCAAGGCATT GATTCATTGG CTTATCTGGT TAAAACCCAG 840 

TTTGAATTAC ATCCTTTCTC CGCTCAAGTT TTTCTCTTTT GTGGTGGACG TAAAGACCGC 900 

TTTAAACCCC TTTACTGGGA TGGTCAAGGA TTTTGGCTAC TATATAAACG CTTTGAGAAC 960 

GGAAAACTGA CTTGGCCCAG TACAGAAAAG GATGTCAAAG CTCTCACACC TGAACAAGTA 1020 

GATTGGCTTA TGAAGGGCTT TTCTATCACT CCAAAAATAA ATTTATGAGA AAGTCGTGAT 1080 

TTCTATTGAA ATGAGGACTT TCTTTTTAGT TATAATAAAG TTAGGAAATA AGGAGAGGAA 1140 

GCCCATGGAA GAAGATTGAA AATCATTCAA CAACAGACTG CTACAATTGA TAGTCTCACC 1200 

AATGAACTTG CCCTTCTTCG TGAACAAGTG GCTTATCTAA CGCAAAAGCT CTATGGAAAA 1260 

TCCTCTGAGA AAAGTCTTTG CCCATCTGGA CAACTCAGTC TTTTTGAACA GGAACAAAAT 13 20 

ATGGAAGAAG ACTCTCACTT ACCCAGTTGA AAGAGAAGAA ATCACCTATA AACGTAAGAA 1380 

AGCTAAAGGG AAACGTCAAG CTCTTCTTGC CCAATTTGAT TCAGAAGAAG TTCATCATCA 1440 

AGTAGAAGAG AGCATTTGCC CTGATTGTCA GGGAGATCTA AAAGAGATTG GAGCAACCCT 1500 

TCAACGACAA GAATTAGTCT TTATTCCTGC GCAATTAAAA CGAATAGATC ATATCCAACA 1560 

CGCTTATAAG TGCCAAGCAT GCAGTGATAA AAATCCGAGT GATAAAATCG TGAAAGCTCC 1620 

TATTCCTAAA GCCCCTTTGG CGCATAGCCT TGCCTCAGCT TCTATTATCG CTCACACCAT 1680 

CCATCAGAAG TTTAATCTGA AGGTACCCAA TTATCGCCAA GAAGAAGATT GCGCTAAGAT 1740 

GGGTTTACCA ATCACACGTA AGGAAATTGC TAATTGGCAT ATCAAGGCGA GTCAATACTA 1800 

TTTGCACCCC CTTTATAATC TTTTACGAGA AAAGTTCTTA GAACAAGCTC TTCTTCATGC 1860 

GGATGAAACC TCTTATCGGG TTCTAGAGAG TGATAGTCAG TTGCCTTACT ATTGGACTTT 1920 

TTTCTCTGGG AAAGCTGAGA ATCAAGCAAT CACGCTGTAC CACCATGATC AGCGTCGGAG 1980 

TGGTTTAGTA GTACAAGAAT TCCTAGGAGA TTATTCTGGC TATGTTCATT GTGACATGTT 2040 

GCGGCAGTAA CTTAGGACTT TACTCCTCTA CTTCTGCCTA TGCGATAGCA GTCCAAGGTT 2100 

TAGGAGTAAG GCGACGCTAA GCTTGGTAAA CTGCCAACAG CTAGAAGCTT ATCGTCAACT 2160 

GGAAGAAGCT GCACTTGTTG GATGTTGGGC GCATGTGAGA AGGAAGTTTT TTGAAGTGCC 2220 

CCCCAAGCAA GCAGATAAAT CATCCTTAGG AGCTAAAGGT TTAGCTTATT GTGATCAGTT 2280 
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ATTTTCCTTG 


CAAAGAGACT 


GGCACGCTTT 


GCCAGCTGAT 


GAACGACTAC 


AGAAACGTCA 


2340 


AGAACATCTC 


CAGCCCCTAA 


TGGAAGACTT 


CTTTGCTTGG 


TGCCGCCGTC 


AGTCAGTTTT 


2400 


AGCAGGTTCA 


AAACTAGGAA 


GGGCAATTGA 


ATACAGCCTC 


AAGTATGAAG 


AAACCTTTAA 


2460 


GACTATTTTG 


AAAGACGGAC 


ATCTGGTCCT 


TTCCAATAAT 


CTAGCTGAAC 


GCGCCATTAA 


2520 


ATCATTGGTT 


ATGGGACGGA 


GTAAAAGAGT 


CCAGTGGACT 


CTTTTAGCCT 


GAGCTCAGTT 


2580 


TAAAAAAGCG 


AGGGTGGTTA 


TTTTCTCAAA 


GTTTTGAAGG 


AGCTAAAGCA 


AGAGCTATTG 


2640 


TTATGAGCTT 


GTTGGAAACA 


GCTAAACGTC 


ATCAATTATA 


GTGCGTTGAA 


TCTATAACAG 


2700 


TACGCATCGA 


CTGCTAAAAC 


ATTTCTATAA 


ATCAATTTTC 


CTTTCCTAAT 


CGATTTGTTC 


2760 


ATATCTTATT 


TCAATCCATT 


ATAAATAGCG 


AGAAATATCT 


ATCCTATCTT 


CTAGAATGTC 


2820 


TTCCAAACGA 


GGAAACTCTC 


GTAAACAAAG 


AGGTTTTAGA 


GGCCTATTTA 


CCGTCGACTA 


2880 


AAGTTGTACA 


AGAAAAGTGC 


AAATAAGAAA 


TCTCCAGATT 


AGGAACTATC 


CGTGAGTTCT 


2940 


CTAGTCTGGA 


GATTTTTCAA 


TAGACTTCCT 


TATTGGACGG 


TTACAATTTA 


TTATATGAAA 


3000 


ATCCCATATT 


ATTCTCCAAT 


TCTATATTTT 


ACCTTTCTAA 


ATGTATAGAT 


TAACTACCTA 


3060 


ATTATAGCAT 


ATAACGCAGA 


TTCCTTTCAA 


TCGTATGATT 


TACTCCATTA 


AATTAAGTAA 


3120 


AAAAATAAAG 


GCAGTCCGAA 


GACTGCCGAT 


ATTTATCTCT 


CATCTCTTTA 


ATTATGGTAA 


3180 


GTAAATAAAT 


AATTTCCCTA 


AAGATATGGA 


AATTATTAAT 


ACTATAAATA 


CAT ATT AT AA 


3240 


AGTTTATAAA 


TACTGTAAAA 


ATCCTGAAGT 


TAATTTTCTA 


ATAAATATCA 


ATATGTGTTA 


3300 


GTATCTTTTA 


AATTTTTAGA 


CAATTTACTA 


GTTCTATAGA 


CATGTTTAAC 


ACACTCTATT 


3360 


TTACAATTCA 


AAAATTTCAT 


CTGCCACTTC 


ATTTAAAAAT 


TCTATATCAT 


GGGAAACAAT 


3420 


AAAAATTATT 


TTATCCATGG 


TTTTATACTT 


ATTAATCAGT 


TCAGATATTT 


TTATCATATT 


3480 


GGAATAATCC 


ATACCACTTG 


AAGGTTCGTC 


AAAAAAGACA 


AATCGAGAAT 


TCTTGCACAT 


3540 


AACAGATGCT 


ATTGCAAGCC 


TTTGCTTTTG 


CCCTCCTGAT 


AAACTCATCG 


GATGCCTTTC 


3600 


AATAAATTCG 


TCCAGGCATA 


AATCTTTTAA 


CCCAAATCAT 


TCATACCTCT 


CTCAACTAGA 


3660 


TGTAACTTAC 


AAAACCCCTG 


ACCTCATGAG 


CCACTTTCTT 


CCTCCTCATG 


AGGTCAGTTT 


3720 


TACTTTCTGC 


TGTTCCAGTA 


TCGTTTTTCC 


TCGCTAGATT 


TCCTCAAAAG 


GCCAGACTCC 


3780 


TCCCTTGGTT 


CGTCACACGA TTTTTTCATC 


TCGACTGTTC 


TTTAATGCAT 


CATTAACGAC 


3840 


GCTTTTCTTC 


TAGGTGGTTC 


ATAAGGAACA 


GGAAGATTCA 


GGTTGACTTT 


TCTAATCCTA 


3900 


GAATAAAGTG 


CTGAAAACAA 


TTCGGAATAG 


GCATAGAGAC 


TAGACAATTT 


GAGGAGCTGC 


3960 


TTGCGTCCTG 


TTCGAACACA 


TTTTCCCACC 


ACGTGAAGAA 


AAAGATGGCG 




4010 


(2) INFORMATION FOR SEQ ID NO: 254: 
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ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2789 base pairs - 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 254: 
ATGCATCCGT TTCTCAAGCC TAAATTGTAA ^TTTTTTTCAA TTTAAAACAG AAAAACCCAC 
GAAAATGACA TAAAAATATC ATTCCTAGGC CTATTTATGC TATTTCTCTC TGAAAAATAT 
GAGTATTCAG TCGGTCAAAT GAAGCTGAAC CAACTCATTT TCCCTCGCCT AATTCAATGA 
TTCGATGACA TTGTTGGGCT ACATAAGCAT CGTGGGTCAC GATAATGACT GTTTTCCCCT 
CTCGATTCAT CTCTAAGAGA AACTTCAAGA CCAAATCTCT ATTTTCAGGA TCCAGAGAAC 
CTGTCGGTTC ATCGGCTAAA ATCAGCTGGC TGGGTTTTAA GATGCCTCTA GCAACTGCAA 
TTCGTTGTTG TTCGCCCCCA GACAACTCGG AGACCCTTTG ATGCAAAGTA CCTGATAAAC 
CTACTCTCTC TAAAATCTCT TCCACCTTTT TGAGCTTGTC TTTCTTAGGC AATTTCACAT 
ATTTCAGCGC CACATGAGAT TGTACTCGAC CGTTTCATCA TCAATCAGGG CAAAATTTTG 
AAACAGATAA GAGATATGTT CACGGATTAT TGTTTGCGAC TTAGCAGAAT TAACCGCTAG 
ATTTGTCTGA CCAAAAATCT CATACCGTCC GCTATAATCA CCATCTATCA AACCCAATAA 
ATTTAACAAG GTCGACTTCC CACTACCACT CTTACCAACA ATAGCTACCA AATCCCCCTC 
ATCAATCCTG AGAGATAAGT TATCCAAAAT CACTTTTCCC CCAATGGTTT TGGTAATATT 
TTTCAACTCA ATCATAAGAT GCCCCCTTTC AATAACTCTA CTAGACTTCT TTTCTCCATC 
CTAGAAGCTA ACCCTAGCAC AAATAGTATA TCCAGACATG TAAAACCTGC AAACAGTAGA 
AGTGGTAAGA ACGCATGGGC AAAGAAAATC AAGACTAGAA GAGGGAAACT ATAGCCCAGC 
AAGAGCAGAA CGAGGAGAGG ACGGTAGCGA TCGACCACTT TCCACCCCAT AAACTTCTTG 
GTAATGATAT CCCTGCGCTT CAATAAGAAA GTTGTTACTA GTAAGAAGTA GGAAATCATC 
ATGCTAAGGA GACCAAACAA AGCAAAGACT AGGTTAAAAT TCCGAACAGC ATCTCGATAA 
GAATCCACTT TCTCTTGTTG AATGGCTTGA ATAGATGAAA ATTTTAAATA ATTTCCATCT 
GACAATTTCT CAACTAACTC TGTAATCTCT TTTTGATGTT GAACCGTATT TTCAATTTTA 
ATCGGATTAT TTAAGCCAGT TGTTGACAGG GAGGCTTTCT CATCCCACAT CATATCAGAA 
TCATTGACCA AGCTAATAAT TGGATTGGAG AGATTTTCCT TTCGCTTATC ACTATATGGG 
AAAAATGACC AATCTCCTTC ATAATAGGCA ATCTCGACAT CCATCTCCTC TATCGTTCGT 



60 
120 
180 
240 
3 00 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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TTTTGCTCCT CTTCATACTT CATCGAATGA AAGGCAATTA ACTTCCCCAA GACCTGATTT 1500 

TTATCTTCTT CACCTTTCGT ACTTGCTGGC ATCAAAATAA CTTTTTTAAT ACCGGTATTT 1560 

GGTAGCTTGA ATCCCTTGCT CTTTAGAAAA TTGCGATTGG CATAGTAAAC ATCCACCGTA 1620 

TCTGTTAACT GATATTGCTG AATCTGTTCT GATTGGACAA AATTTTTTAC AGGAAGACTG 1680 

CTACTCTGCA CATAGCCCGC CTGCGTTTTT TCTACCAAAT CCTGATAAAA TCGATAGAAA 1740 

TAATCTGTAG ATTTCCCTGA CCCTGCTAGC TCTTCTTGCC ACAGATTATC ATTGAGTTTG 1800 

AAGGTTTCTA AGGTCAGGTA ATTACCTTGA CTTACCCACT GTTGCTGATA AGCAAGTTCT 1860 

TTGTTTTCTT CTTCTAAACT TCTGCCCACC CCAATCAGTA AGGCCGTCAG TAAAATACTT 1920 

GTCCCTATTT TCATCACATA ATTGAAGATA AGACCAAATT TGAAAGATCA AAAACCTTTC 1980 

AGCAGAGAGC TGATTGTCAT TTTTTGGATT AAGAGGTAAG TCAACCAACT GATAAAGAGA 2040 

TAAAGCTGCA ACAGCAAAAA ATGAGACAAC CACAGCATAG CAAACAAATC TTTTGGCTTA 2100 

TAATCAAGCA AGAAAAACAC GCCTAGATTG ATCACAAGAG CCCCACCTAG GAGGAGGTAA 2160 

AGGTTGCCTT TTACAACATC AGCTAAAACA GCCCTATCTT GAAAACCAAG TAATTTTTGT 2220 

ACCCCAACTC TTTTCATCTC CATCATCGGT TGATACACTG TCACTAACAC AAGAAGCAAA 2 280 

ATAGCCAAGA CAAAAACAAT GCCAGATAAA AGCAAATCTC GATTTATGAC TTCCACTGCA 2340 

CTTTTGTAGG TCGGCTCTAG CAAGGTAGCC TGGTCTATCT TGAAAAAATC GCTCCATTTC 2400 

TGTACAATCC TATCCTTGTC CATCTCTTGT GTAGAACTTA TCGTATAGCG ACCATTTAAA 24 60 

CTACGAGATG TATCCTTGAT ATAGGTTTGA AAAGTCATAA GCTGAATAGG TTTGGCTTTT 2520 

AGAAAGGTCG GAATCGTACC AAGTTTATTG GAAATTTCTT TATTACTATA GACTCCTTCA 2580 

CCATCTGTGG TAAAATCAAG AGAAGAAATC CCAAACTCTT GGTAGGGGAA GGTATCTTTA 2640 

TCAAAAACAO CAGACTTGAC CACCTCATCA CCACTGTCTG TTTTGATGAT GGAGACTTTA 2700 

TACTCCTTTG ATACATCCTC AAAAAATCGA AGAACAGACG CTGCAGGTTC GTTAATATCT 2760 

TTCAAATACA AATCCAAAGA ATCTACACG 2789 

(2) INFORMATION FOR SEQ ID NO: 255: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2495 base pairs 
O) TYPE: nucleic acid 
CO STRANDEONESS : double 
<D) TOPOLOGY: linear 



(Xii SEQUENCE DESCRIPTION: SEQ ID NO: 255: 
CTGCGAATTT TATTAAAGAT AATGTGTTAA TTACAGCGGC TCACAACTAC TACAGACATG 



60 
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ACTATGGGAA AGAACCGGAT GATATTTATG TTCTTCCCGC TGTTAGTCCA AGTCAAGAAC 120 

CATTTCGAAA GATCAAAGTA AAGGAAGTTC GTTATTTGAA GGAATTTAGA AATTTAAATT 180 

CTAAGGATGC AAGGGAATAT GACTTGGCTT TATTAATTCT AGAAGAGCCC ATTGGTGCAA 240 

AATTAGGGAC TTTGGGTCTT CCTACTACTC AAAAAAATTT GACAGGAATA ACTGTGACTA 300 

TCACAGGCTA TCCATCATAT AATTTTAAAA TTCATCAAAT GTATACAGAT AAGAAACAAG 360 

TTTTAAGTGA TGATGGCATG _TTCTTGGATT .ACCAAGTTGA TACTTTAGAG^GGGTCTAGTG- 420 

GATCTACAGT TTATGATGCT AGTCACCGTG TAGTAGGAGT GCATACTTTA GG AGATGGAG 4 80 

CTAATCAAAT TAACAGTGCA GTTAAATTAA ATGAACGAAA TTTGCCATTT ATTTAwTCGG 540 

TTCTTAAAGG TTACTCTCTT GAAGGATGGA AGAAAATAAA TGGTAGTTCG TACCATTATA 600 

GACAACATGA TAAACAAACG GGTTGGCAGG AGATAAATGA TACCTGGTAT TATTTAGACA 660 

GTTCCGGTAA GATGCTTACA GATTCGCAAA AAGTCCATGG AAAATGGTAT TATCTCAATT 720 

CAAATCGAGC AATGGTTACA CGTAGCCAAA CTATCGATGG TAAAGTTTAT AACTTCGCTT 7 80 

CATCTGGTGA GTGGATTTAA TGTTGGAGGA TATATAAAAT GAAGCTTTTG AAAAAAATGA 84 0 

TGCAAATCGC ACTAGCCACA TTTTTCTTCG GTTTGTTAGC GACAAATACA GTATTTGCAG 900 

ATCATTCTGA AGGATGGCAG TTTGTCCAAG AAAATGGTAG AACCTACTAC AAAAAGGGGG 960 

ATCTAAAAGA AACCTACTGG AGAGTGATAG ATGGGAAGTA CTATTATTTT GATCCTTTAT 1020 

CCGGAGAGAT GGTTGTCGGC TGCCAATATA TACCTGCTCC ACACAAGGGG GTTACGATTG 1080 

GTCCTTCTCC AAGAATAGAG ATTGCTCTTA GACCAGATTG GTTTTATTTT GGTCAAGATG 1140 

GTGTATTACA AGAATTTGTT GGCAAGCAAG TTTTAGAAGC AAAAACTGCT ACCAATACCA 1200 

ACAAACATCA TGGGGAAGAA TATGATAGCC AAGCAGAGAA ACGAGTCTAT TATTTTGAAC 1260 

ATCACCGTAG TTATCATACT TTAAAAACTG GTTGGATTTA TGAAGAGGGT CATTGGTATT 1320 

ATTTACAGAA GGATGGTGGC TTTGATTCGC GCATCAACAG ATTGACGGTT GCAGAGCTAG 1380 

CACGTGGTTG GGTTAAGGAT TACCCTCTTA CGTATGATGA AGAGAAQCTA AAAGCAGCTC 1440 

CATGGTACTA TCTAAATCCA GCAACTGGCA TTATGCAAAC AGGTTGGCAA TATCTAGGTA 1500 

ATAGATGGTA CTACCTCCAT TCGTCAGGAG CTATGGCAAC TGGCTGGTAT AAGGAAGGCT 1560 

CAACTTGGTA CTATCTAGAT GCTGAAAATG GTGATATGAC AACTGGCTGG CAAAACCTTG 1620 

GGAACAAATG GTACTATCTC CGTTCATCAG GAGCTATGGC AACTGGTTGG TATCAGGAAA 1680 

GTTCGACTTG GTACTATCTA AATGCAAGTA ATG GAGA TAT GAAAACAGGC TGGTTCCAAG 1740 

TCAATGGTAA CTGGTACTAT GCCTATGATT CAGGTGCTTT AGCTGTTAAT ACCACAGTAG 1800 
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GTCCTTACTA CTTAAACTAT AATGGTGAAT CCGTTAAGTA ATGAAGGCTA ATTGTAAACT 1860 

GTGATGGATA CTTAACTTTG TATAATAGGT GGATAAAACT CTTCACAATC AAAAAACCCA 1920 

TAGTATCAAG GTTTTTCTGT ACTGCCCTCA AACAGTTAGA CAATTAATTT ATCCCAAGgA 1980 

TTTAGTTCTG TATTGCACAG GGCTAAGTCC TTTTAGTTTT ACCTTAATTC GTTTATTGTT 2040 

GTAGTAATCA ATATAGTCTA TAATGGCTTG TTCCAATTGC ' TTRAGCGACT GAAACGACTT 2100 

CTCATAACCG TAAAACATTT CCGATTTCAG AATCCCAAAG AAGGACTCCA TCATACTATT 2160 

GTCTGGGCTG TTTCCCTTAC GTGACATGGA TGCTTGAATT CCCTTACTCT CTAGGAACCG 2220 

ATGATAAGAA TCGTGTTGGT ATTGCCAGCC TTGGTCACTA TGGAGAATCG TATTCTCGTA 2 280 

GTGCTTCTCT GTGAATGCCT GTTCCAACAT TCTTTGTACT TCTTCTAACT TGGGTGAAGT 2340 

TGAAAGATTA TAGGCGATAA TTTCGCTATT AAAGCCATCT AAAACTGGTG ATAAGTAAAG 2400 

CTTTTCAGTA CTTGCTGGAA TGGCAAATTC TGTCACATCT GTGTAGCACT TTTCCATTGT 2460 

TTTAGAGCCT TCAAATTGGC CTTGAATGAG ATTCG 2495 
(2) INFORMATION FOR SEQ ID NO: 256: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 370 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND E ONE SS : double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 



TACCACCGTA 


TTCATCCAGC 


AAGATTGCCA 


TTTGTCTTTG 


GGTATTTCGC 


AGTTCTTTTA 


60 


GCAAGTCATC 


CACAAAAATA 


GTTTCAGGTA 


CAAAAAGTGG 


ATCTTGTAAA 


ATTCTCTTCC 


120 


AAACAATATT 


GTCAAAACCG 


TCCACAAAGC 


CTGCCTTAAG 


GAGACTCTTG 


GTGTGAATGA 


180 


TTCCAATTAC 


ATTGTCCTTA 


TCCCCATCAT 


AAACCGGGAT 


ACGAGAATAA 


TTTTGTTTTA 


240 


AAATACTTTG 


GATAATGGCT 


TGACTATCAT 


CCTGAATATC 


CACCATAAAG 


GCATCCGTTC 


300 


GAGGAACCAT 


AACCTCTCGT 


GCCATCAGTT 


CATCGAGCGA 


AAAGACACCT 


TGTAGCATCT 


360 


CAATCTCATC 


AGCATCCAAT 


GTTTCTTCAC 


TATTTGTCAG 


CATATAGGCA 


ATTTCATCAC 


420 


GGGTCATCTT 


TTCATCCGCA 


TCATCGAATG 


ACATAGGAGT 


CAAATGGCTC 


AAGAAATTGG 


480 


TCGAAGCAGC 


TAAAAGCCAA 


ACAAAAGGAC 


TGACTAGTTT 


TCCGATCCCA 


ATGATAATCG 


540 


GCGCTGTACG 


AATTGCCAAG 


GCATCCTTTA 


GATTAAGAGC 


GATTCTCTTA 


GGATATAATT 


600 


CCCCAAAAAC 


GATGGAAATA 


TAGGTCAAAA 


ATGCCAAGGA 


TAGAAAAGTT 


GCCACGGCTT 


660 


GTGCTGTTTC 


GCCATTCCCA 


AGCCAAGAGG 


CAATCACACG 


TCCTAGAGTA 


TCACTTAAAC 


720 
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TCGCCCCTGA TAAGATTGTA ATCAGGGTGA TTCCTACCTG GATGGTTGAT AAAAAGTGGT 780 
TAGGATTTTC TAGTACCTTC AGCAGGCGGA TGTAGCGTCT GTCTCCTTCT TCCGCCTTTT 840 
GTTCAACTCG GGCACGATTA AGAGAAACGG g 70 
(2) INFORMATION FOR SEQ ID NO: 257: 

(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 
ID> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

CGTTCCCAGA AGCCCGCATT CTCATCGCCA ATGTCGTGAT TGATTTGGCC CTTTCTCCAA 60 

AATCCAACTC AGCCTATGTA GCTATGGATA AGGCACTTGC TGACCTCAAA ACATCAGGGC 120 

ACTTGCCTAT TCCGCGACAC CTGCGTGATG GGCACTACAG TGGAAGCAAG GAACTGGGGA 130 

ATGCCCAAGA CTATCTCTAT CCACACAACT ATCCTGGAAA TTGGGTCAAG CAAGACTATC 24 0 

TGCCAGAAAA AATTCGTAAT CATCACTATT TCCAACCAGA AGATACTGGT AAAT ATGAAC 300 

GGGCTTTGGC TCAAAGAAAG GAAGCTATCG ACCGTTTGCG AAAAATCTGA AATCCTTTTC 360 

AAAAAATTGC ACTTTCCTCT TGATTTTTTT TGAAAAAGTG GTATCATATA AATATAGAAA 420 

CCCTCTGGTG TACGACTTCA CACTTAAGTG TTGACCGACT ATTTTTTGTA TTATTAGGGA 480 

AACAAAAGTC TTCTAACAGC ATGTAGGCCG TCTCACACGG AAACAGCTTC AGTTAGAGCG 540 

AGTTGCCCAC CTGCTTAATT GCGCGGGTTC AATACAAACC GTGAAGTTTC GGCACCAATA 600 

CAGCTTTTTT CTTTGCCTCC TTACCTCAGC TGGCAGAGCA GCGGACTCTT AATCCGTGGG 660 

TCACAGCTTC GATCCCTGTA GGGGGCATAT AAATACAACA GGAAAAGCCT TATAATATAG 720 

GGCTTTTTTT GCTTTCCTTT TAAAAATTGT CGTGCAATTT GCCGT G TTTT TACAACAAAC 7 80 

TTTTCACAGC CATAAACTCC TCACTAATTT TTTCCTCCAA GGTATGCCCA TAAACGTCAA 84 0 

TCAACATGGA GATATCTTTA TGTCCTAAAA TTTGGCTCTT TGTCAACTGT AGTGGGTTGA 900 

AGTCAGCTAA GCTCGAGAAA GGACAAATTT TGTCCTTTCT TTTTTGATAT TCAGAGCGAT 9 60 

AAAAATCCGT TTTTTGAAGT TTTCAAAGTT CCGAAAACCA AAGGCATTGC GCTTGATAAG 1020 

TTTGATGAGA TTATTGGTCG CTTCCAATTT GGCGTTAGAA TAGTGTAGTT GAAGGGCGTT 1080 

GACGATTTTC TCTTTGTCCT TTAGAAAGGT TTTAAAGACA GTCTGAAAAA GAGGAGGAAC 1140 

CTGCTTTAGA TTGTCCTCAA TGAGTCCGAA AAATTTCTCC GGTGCCTTAT TCTGAAAGTG 1200 
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AAACAGCAAG AGTTGATAGA GCTGATAGTG ATGTTTCAAG TCTTG 
(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1684 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : double 
(D» TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
ATGCCTATCT AACTCCACAT ATGACCCATA GCCACTGGAT TAAAAAAGAT AGTTTGTCTG 
AAGCTGAGAG AGCGGCAcCC AGGCTTATGC TAAAGAGAAA GGTTTGACCC CTCCTTCGAC 
AGACCATCAG GATTCAGGAA ATACTGAGGC AAAAGGAGCA GAAGCTATCT ACAACCGCGT 
GAAAGCAGCT AAGAAGGTGC CACTTGATCG TATGCCTTAC AATCTTCAAT ATACTGTAGA 
AGTCAAAAAC GGTAGTTTAA TCATACCTCA TTATGACCAT TACCATAACA TCAAATTTGA 
GTGGTTTGAC GAAGCCCTTT ATGACGCACC TAAGGGGTAT ACTCTTGAGG ATCTTTTGGC 
GACTGTCAAG TACTATGTCG AACATCCAAA CGAACGTCCG CATTCAGATA ATGCTTTTGG 
TAACGCTAGC GACCATCTTC AAAGAAACAA AAATGGTCAA GCTGATACCA ATCAAACGGA 
AAAACCAAGC GAGGAGAAAC CTCAGACAGA AAAACCTGAG GAAGAAACCC CTCGAGAAGA 
GAAACCGCAA AGCGAGAAAC CAGAGTCTCC AAAACCAACA GACGAACCAG AAGAATCACC 
AGAGGAATCA GAAGAACCTC AGCTCGACAC TGAAAAGGTT GAAGAAAAAC TGAGAGAGCC 
TGAAGATTTA CTTGGAAAAA TCCAGGATCC AATTATCAAG TCCAATGCCA AAGAGACTCT 720 
CACAGGATTA AAAAATAATT TACTATTTGG CACCCAGGAC AACAATACTA TTATGGCAGA 7 80 

AGCTGAAAAA CTATTGGCTT TATTAAAGGA GAGTAAGTAA AGGTAGCAGC ATTTTCTAAC 840 
TCCTAAAAAC AGGATAGGAG AACGGGAAAA CGAAAAATGA GACCAGAATG TGAGTTCTAC 
TTCTCATTTT TTTCATGAAA ATGTGCAAAA TATAGTAGAT TGAAACTAGA AT ACT AT AC C 
TCTACTTCTA AAACATTGTT AGAAATCGAT TTGACTGTCC TGTTCTTATT TCATTTTACT 1020 
ATATCTTAAC AGATACTGTA AATAAAGATA AACTATTTAC TGGCTAATTA ATCAGTTAAA 1080 
CACTAGTTAA GGAGTAATGA TGAAAAAAAG AACAATACTA TTATTGATGG CCAGTCTGTT 1140 
ACCTCTTGTC TTAGGAGCAT GTGGTTTCTT GGACATATTG ATCCTGGATC ATTCTCATCA 1200 
GGATTACTCT TTACTGCTAT TTTAGAAACT GGCGTGGTTt' GATGGAAAGT ATTGGTCTTG 1260 
TTATCGTTTC ACATTCCAAA CACATTGCAG AAGGTGTTCT TGAACTGATT AGTAAAGTAG 1320 
CTAAAGATGT TCCGATTACT TATGTAAGAG GAACCGAGGC CGGAGGAATT GGAACGAGTT 1380 



900 
960 
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TTGAACAAGT AGATAGGGTT GTTTCCGAAA ATCCAGCAGA TACTTTACTT GCCTTTTTTC 1440 

ACCTACGTTC TGCTAAAATG AACTTAAAAA TGGTGACTGA TTTCAGTGAT AAAAGTATCA 1500 

TCATCAACAG GGTTCCAATT GTAGAAGGTG CCTATAATGC AGCTGCTCTT CTTCAGGCTG 1560 

CTGCAGAACT GTCAGTTATT CAAACACAGT TaGCGGAgCC TGAAATCAAT AAATAAGGAA 1620 

TTTTACTATA ACTCTTTTTA TAGATAAGCT ATTCaTTATC TCAACTATAA TAATGTTAAG 1680 

TnAA 1684 
(2) INFORMATION FOR SEQ ID NO: 259: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 970 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

AGGAGTGGAG AnATATGAAG ACACAAATTT TCACATTATT GAAAATCGTT GCTGAGATTA 60 

TTATTATTTT GCCATTTCTA ACTAATCTAT AAGTTCTTTA TATTGCTGAA AACGCAATTC 120 

AAAAAGGGCT ATTAATTGTG GATTTTCTAA TACCTGCAGA GATTGGATAA AGCGTTCAAT 180 

CTCTTTTTGA TTGCTTCCCT TTGTTTGAAG AAAGACACTC ATCTTCTTTA AAAATTGCCA 240 

CGATACTTTT TCAAAAACAT CATACGGTCG TAACATCCTC TCCAACTCGG CTTCGAAGAT 300 

TGGGATGTAG GAGAAAAGTT TTCGCTCCAT GAGTTCTGAT AAGATATTTA AGAGTCCTTG 360 

CTTCATATAC AATCGATTGT GTACTAACTC TTTAAATTCT TTGCATTTTT CGAGTAAGGA 420 

GGTTGATAAA AAAATCAGAT CTTGATTGCT CAAGAAGGGC ATGGTATTCC AAAAGAGATA 480 

GAGTTCAAAC CAGGTCCAAG ACTCGATAGC ATAGAGATAG GTGGTCAAAA ACTCGCTATC 540 

CTCCTCTGCT AGTGGGTAGC TTTTATTTAG TGAATGGATG CCATCTTTAA TCACGATGGC 600 

ATTCAAACGA CGATAGCTCT GCGCCATCTG TTCTTGATCG ACTTCCTCCA ATAGCTGCTC 660 

TAAAGCAGCT ATATCCTGAT GGGCAAAGCG ATTCACAACC TTTCGACCGA TTCGCATATG 720 

TGGAGATTCT TGATAGTTGT TGAGCTTGTG CCCAAACTCA TCAAAGGTCA CATTTATACC 780 

TTGGATAGCT AGAATCAACT TATCCGCAGA CAGCATAGAC TGCCCTAGTT CAAACTTGGA 840 

CAACTGAGAA GCTGTTAGAC CCTCACAAGC CACATCTGAC TGCTTGAGCT TTCTCGCCAA 900 

ACGTAATTCC TTGTAAAATT CCCCCAGTTC CATTCTCTCA ATCATCTGAC CACCTCCTAG 960 

CTTTTGCAGG 970 
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(2) INFORMATION FOR SEQ ID NO: 260: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2996 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 



GTTGACCACG GGTAAAACTA CCCTAACTGC AGCTATCACA ACTGTTTTGG 


CACGTCGCTT 


60 


GCCTTCATCA GTTAACCAAC CTAAAGACTA TGCGTCTATC GATGCTGCTC 


CAGAAGAACG 


120 


CCAACGCGGT ATCACTATCA ACACTGCGCA CGTTGAGTAC GAAACTGAAA 


AACCTCACTA 


180 


CGCTCACATC GACGCTCCAG GACACGCGGA CTACGTTAAA AACATGATCA 


CTGGTGCTGC 


240 


TCAAATGGAC GGAGCTATCC TTGTAGTAGC TTCAACTGAC GGACCAATGC 


CACAAACTCG 


300 


TGAGCACATC CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT 


TCATGAACAA 


360 


AGTTGACTTG GTTGACCACG AAGAATTGCT TGAATTGGTT GAAATGGAAA 


TCCGTGACCT 


420 


ATTGTCAGAA TACGACTTCC CAGGTGACGA TCTTCCAGTT ATCCAAGCTT 


CAGCACTTAA 


480 


AGCTCTTGAA GGTGACTCTA AATACGAAGA CATCGTTATG GAATTGATGA 


ACACAGTTGA 


540 


TGAGTATATC CCAGAACCAG AACGTGACAC TGACAAACCA TTGCTTCTTC 


CAGTCGAGGA 


600 


CGTATTCTCA ATCACTGGAC GTGGTACAGT TGCTTCAGGA CGTATCGACC 


GTGGTATCGT 


660 


TAAAGTCAAC GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTCAAA 


AAGCAGTTGT 


720 


TACTGGTGTT GAAATGTTCC GTAAACAACT TGACCAAGGT CTTGCTGGAG 


ATAACGTAGG 


780 


TGTCCTTCTT CGTGGTGTTC AACGTGATGA AATCGAACGT GGACAACTTA 


TCGCTAAACC 


840 


AGGTTCAATC AACCCACACA CTAAATTCAA AGGTGAAGTC TACATCCTTA 


CTAAAGAAGA 


900 


AGGTGGACGT CACACTCCAT TCTTCAACAA CTACCGTCCA CAATTCTACT 


TCCGTACTAC 


960 


TGACGTTACA GGTTCAATCG AACTTCCAGC AGGTACTGAA ATGGTAATGC 


CTGGTGATAA 


1020 


CGTGACAATC GACGTTGAGT TGATTCACCC AATCGCCGTA GAACAAGGTA 


CTACATTCTC 


1080 


TATCCGTGAG GGTGGACGTA CTGTTGGTTC AGGTATGGTT ACAGAAATCG 


AAGCTTAATT 


1140 


CGATTTAGTT CCCAGAAGAA CAATTATTTA AGTTAGACAC TAAAAGAATC 


TTGCTTGGCA 


1200 


AGGTTCTTTT TTTAGATATT GAACTAATAC TCAATGAAAA TCAAAGAGCA 


AACTATAATA 


1260 


TATTGAAACT AGAATAGTAC ACATCTACTT CTAAAACATT GTTAGAAATC 


GATTTGACTG 


1320 


TCCTGATCGA TTTGTCTTGT TCTTATTTCA TTTTACTATA GAAAGTTAGC 


TACAGACTCC 


1380 


TCAAAACATT GTTTTTAGGT TGTAGATAGA ACTGACGAAG TCAGtAACAT 


CTATACGACA 


1440 
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ACCCCAACCT GACGCGGTTT GAAGAGATTT TCCAAGACTA TAATACTACA CTAAAATCAA 1500 

AAACCATTAT ACAATAGTAA TATGAAATCA ATTAAAGAAG AAATCCAAAC CATCAAAACA 1560 

CTTTTAAAAG ACTCTCGTAC AGCTAAATAT CATAAACGCC TTCAAATCGT TCTATTTCGT 1620 

CTGATGGGCA AATCTTATAA AGAGATTATA GAACTTTTAT AGTGGTTTGA AATAACATGT 1680 

GAACAACTCT ATCAGGAAAG TCAAACTAAT TTATAGAAAT ATTTTAGCAG CCAAGGTGTA 1740 

CTGTTATAGA TTCAATACAC TTTAGACTGT AATCAAACAA CGATTTGGCG AAATGTAAAA 1800 

AATATGAGGA GTTCGGACTC CACTCTCTCC TTCAAGAAAC ACGTCGTGGT CCTAACCATG 1860 

CTTATATGAC CGTTGAGCAA GAGAAAGTCT TTCTTCCCCC CCATTTGAAG GCTACAGAGG 1920 

CAGGAGAATT TGTTACAATT GATGCCTTAT TTCAGGCTTA TAAAAAGGAG TTAGGTCGTT 1980 

CCTACACACG TGATGCCTTC TATCAACTGT TGAAGCGCCA TGGTTGGCGA AATATTACCC 2040 

CACGTCCAGA ACATCCTAAG AAAGCAGATG CTCAAACCAT TGTCGCGTCT AAAAATAAAG 2100 

TCTCAATTCA AGAAGACAAG TGAACTGCAC CCCAAAAGTT AGACAGAAAA AATCTAACTT 2160 

TTGGCGTGTT TTTATTATGA AATTAACTTA TCATGATAAA GTTCAGATCT ATGAACTTAG 2220 

AAAACAAGGA TATACCTTAG AGAAGCTTTC AAATAAATTT GGGATAAACA ATTCTAATCT 2280 

TAGGTACATC ATTAAATTGA TTGATCGTTA CGGAATAGAG TTCGTCAAAA AAGGAAAAAA 2340 

TCGTTACTAT TCTCCTGATT TAAAACAAGA AATGATTCAT AAAGTCTGAC ATGAAGGCTG 2400 

GACTAAAGAT AGAGTTTCTC TTGAATACTG TCTCCCAAGT CGTACGATAC TTCTTAACTC 2460 

GCTAGCACAA TACAGGAAAA ACGGGTATAC TATTGTTGAG AAAACAAGAG GGAGAGTACC 2520 

TGAGAGCGGA GAATGCCATC CTAAAAAAGT TAAGAGAACT CCGATTGAAG GAGGAAAAAG 2580 

AGAAAGAAGA AAGACAGAAA TTATTCAACA ATTAATGACT GAGTTTTCGT TAGATATTCT 2640 

TCTAAAAGCC ATTAAACTAG CTCGTTTGAC CTACTACTAT CACTTGAAAC ACCTAGATAA 2700 

ACCAGATAAG GACCAAGAGC TTAAAGCTGA AATTCAATCC ATTTTTATCG AACACAAGGG 2760 

AAATTATGCT TATCGTCGGA TTTATTTAGA ACTAAGAAAT CCTGGTTATC TGGTAAATCA 2820 

TAAAAGAGTT CAAGGCTTGA TAAAAGTACT CAATTTACAA GCTAAAATGC GACAGAAACG 2880 

AAAATATTCT TCTCATAAAG GAGACGTTGG CAAGAAGGCA GAGAATCTCA TTCAAGGACA 2940 

ATTTGAAGGC TCTAAAACAA TGGAAAAGTG CTACACAGAT GTGACAGAAT TTGCCG 2996 
(2) IN FORHATI ON FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 base pairs 

(B) TYPE: nucieic acid 

(C) STRANDEDNESS: double 
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(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

CTTATCAACT CCCGACATGG CTCTCAGACC AATCCAAATC CCTAAAAAAA TCAGAACAAG 60 

GATGGTGGTC AAGATCAAAC TCTCGAAATA TAAAGAAAAT AGTTCCAGTA GCATGATTTC 120 

TCTCATTTCT ATCTTTTTTA AAGAGTAAAC TCAGCTAGTC CAACTAACTG AGTTTTCCTT 180 

TATCTATTAT ATCAAATATA AGTCCGTTTG TAACTAGCGA AGAATTCTTT TGTCCGCTCT 240 

TCTTTAGGGG TGTGGATAAT CTCATCCGGA GTTCCAGACT CGATGATTTT CCCCTTATCT 300 

AAGAAGAGAA TTTTATCCGC AACTTGGGCT ACAAAGGACA TGTCATGACT GACCAAAATC 360 

ATGGTCTGAC CTGACTTAGC AGCATCTGCA ATAGACTTTT CTACTTCACC GACCAATTCT 420 

GGGTCAAGGG CTGAAGTTGG TTCGTCTAAG AGCAAAACAT CTGGTTTCAT AGCAAGCGCA 480 

CGCGCTAGGG CAACCCGTTG CTTCTGTCCA CCTGATAAAT GGCGAGGATA ATGGTTTTCA 540 

CGGTCCGAAA GCCCAACCTT AGCCAACTCT TCCTTGGCAA TCTTAGTCGC TTCTTGGTCA 600 

GATAATTTCT TGACAACAAC CAAGCCTTCT TTCACATTAT CAAGTGCTGT TCGGCGTTCA 660 

AACAAATTAA ACTGTTGGAA AACCATAGAC AACTTACGAC GTAGGGCAAG GATTTCTTCT 7 20 

TGAGTGATTT TAGAAAAATC AACTGAAAAA CCATCAATCT GAATAGAGCC ACTGTCAGGT 780 

GTTTCTAGAT AATTGAGACT GCGAGAAAGG TTGATTTTCA GCTCTGAAGA CCAATCA 837 
(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

CCGAACAAAA TGGGCTAATT AGATTATAGT AAGAAAGGTA AGTTAAAAAT GAGAATTGCA 60 

ATTGGATGTG ACCACATCGT AACTGATGAA AAAATGGCGG TTTCAGAATT TTTGAAATCA 120 

AAAGGATATG AAGTCATTGA CTTTGGTACC TATGACCATA CACGGACTCA CTACCCAATC 180 

TTTGGTAAAA AAGTAGGGGA AGCTGTAACT AGCGGTCAAG CTGATCTTGG AGTATGTATC 240 

TGTGGTACTG GTGTTGGTAT CAACAACGCT GTAAATAAAG TTCCAGGTGT TCGTTCTGCC 300 

TTGGTTCGTG ATATGACAAC AGCCCTTTAT GCTAAAGAAC AATTGAACGC TAACGTTATT 360 

GGTTTTGGTG GTAAAATTAC TGCTGAATTC CTTATGTGTG ATATCATCGA AGCTTTCATC 420 
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CATGCTGAAT ACAAACCAAC TGAAGAAAAC AAAAAATTGA TTGCGAAAAT TGAACATGTT 



480 



GAAAGTCACA ATGCTCAACA AACAGACGCA AACTTCTTTA CAGAATTCCT TGAGAAATGG 



540 



GATCGTGGAG AATACCACGA CTAAGACGTC ACCTATGATT TTAACAGTCA CAATGAACCC 



600 



ATCCATCGAT ATTTCCTATC CCTTGGATGA GTTGAAGATT GATACTGTCA ATCGTGTGCT 



660 



GGATGTAACC AAAACGGCTG GTGGTAAGGG ACTCAATCTT ACCCCACTAC TTTCAGAATT 



720 



TCGCGATTCT GTTCTTGCTA CTGGTTTAGT GGGTGGCAAA CTTGGTGAGT TTTTGGTTGA 



780 



ACATATCGAT AATCAAGTAA AGAAAGATTT CTTCTCAATT AAGGGAGAAA CTCGTAACTC 



840 



TATCGCTATT CTCCACGGAG ACAACCAA 



368 



(2) INFORMATION FOR SEQ ID NO: 263: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3744 base pairs 

( B) TYPE; nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 263 : 

CCGTTCAAAG TCTTCATAAG ACTCGAAAGT CACAGTTCTT TCGTTCTTGC TGGCATCTAT 60 

ATAGGTAATT TCAATCATGT TTAAAACTCC TTTCTTTAAT GCTAACTTTA TTTTACTCCT 120 

TATAAAAGAG AATGTCAACA AAAATGATTG CGCACGCAAC TTTTTTTAAA ATCATCTTAA 180 

ATCAAGAAAT CCAAACCTGC TTCCAAGCTT TCTTCGACAG TCTTTTGTAG CGAGGCCAGT 240 

GTCTTTTGCC CATCATTTGT CAGGCAGATA AAACTAGAGC GTCTATCTTG ATGGCAACAC 300 

ATGCGACTGA GTAGACCGCA ATTTTTAGCT TCCAAGCGAG CCACCATCCT AGAAACTGCG 360 

CTCGGGCTCA GATGAAGCTT ATCTGGCAGG TCAATCTGGC GTAGAGATTT TTCTTCAGCC 420 

AAGTCCAGAT AGTAGAGCAG GTAGAACTCT TTCAAGGTCA GACTTTGCTC GCTCTGTTGG 480 

GCAATGGTCT CTTCCAAGAG ACTTTCAATT TCTTTCTGAC GCCGATTGAA GTCAAACCAT S40 

TTTTCCAAAT AGGTCATAGT GTCTCCTTTC TTTTTAGAGT CATAAATAGA AGAAAGTCCA 600 

TTAACGGGCA GTCTCTGCGT CACAAGATGA TTGCGCATGC AATAATTATA CTACTTTTCA 660 

AGAATGCTGG CAAGCTCTGT TTTTTAGTGG TTTTATTTTT GTGTGAATAA TGGGGGAATC 720 

CTATTGTTTC AATTTCTAAC TCCTTATCAC ATTCGAATTC AGATTTTATT TCATTTCTCT 7 80 

ATCTATAGTT GCTTAGTTTA AAATAAGCAT GGTCTAATAA AGCTATGCAT ATAGTACTGA 840 

TTTTAAACAA GGAGCATTAG ATTCCATTAA AGGAGGGCAC AGACATGTCG AGGCGGCCAA 900 
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AGTTTTTCAT CTCGGCGTCA GAACTCTCTT CACGTGGGAA AAGAAAGACG TAAACAAGGG 960 

AACTTAGAGC GGAAAAACCG AGTCGTCAAA AAGCGTAAGA TCCCTTTAGA AGAATTGAAA 1020 

GCCTTTGTAG AGGCTCATCC AGACGCTTTT TTACGGGAAA TTGCGGCCCG TTTTGATTGT 1080 

GCTTTGCCCT CCGTATGGGC AGTTTTAAAG CAGATTAAGG TCATTTTAAA AAAGACGACC 1140 

AGTTTTAGGG AACAAAAGCC TGAGAAAGTT TCTGAGTTTC TTGATATTTT CGATAACCTA 1200 

AAAGATTTAC CAGTCCTATA TATTGACGAA ACGGGAATCG ACCGCTACCT CTATCGTCCT 1260 

TATGCAGCGG CTCCTAGAGG GGAGAAACTC TATGGCAAGA TTAGCGGACG GCGTTTTGAG 1320 

CGGACTAATG AGGTGGACCA AAAACTCAAT GGTAGTTTTC TAATCAGATA TATTGATTCA 1380 

CAAATTAGAG AATGAAAGAA TAATTATGCA TAAAAATAGG AATATAAACC AAAAATTAGC 1440 

TGATTTATAC TCATTTGCGT GTCTTTATAA AAAACTTATC TTATAATATA TATATATATA 1500 

TATACAAAAT AGTAAAATGC 1 I Tm - l Vrr TAGCAAAAAT ACCTCAAGTT TCTTGCTATT 1560 

TTGGGTTCCC TATTCTATAA TTATAGTATG GTAATTTATT TATATCCATA CATGAAAATA 1620 

ATACTCGAAA GGAAATTTCA AAATATTTTT TAGACGTCAG AAGGGTGAAT ATAGAGAAAC 1680 

AGACCGAGTA ACTCGGTTCA AATTAATCAA ATCAGGGAAG CATTGGCTAC GGGCCTCGAC 1740 

TTCTCTTTTT GCCTTGTTTA AGGTCTTGCG AGGTGGTGTT GATACTACTC AGGTCATGAC 1800 

CGAAACGGTA GAAGATAAAG TAAGTCATTC AATTACTGGC CTTGATATCC TCAAGGGGAT I860 

AGTTGCTGCG GGAGCTGTCA TAAGTGGAAC CGTTGCAACT CAAACGAAGG TATTTACAAA 1920 

TGAGTCAGCA GTACTTGAAA AAACTGTAGA GAAAACGGAT GCTTTGGCAA CAAATGATAC 1980 

AGTAGTTCTA GGTACGATAT CTACAAGTAA TTCAGCGAGT TCAACTAGTT TGTCAGCTTC 2040 

AGAGTCGGCA AGTACATCTG CATCTGAGTC AGCCTCAACC AGCGCTTCGA CCTCAGCAAG 2100 

TACAAGTGCA TCAGAATCAG CAAGTACATC GGCTTCCACA AGTATTTCTG CATCATCTAC 2160 

TGTGGTAGGT TCACAAACAG CTGCCGCTAC AGAAGCAACT GCTAAGAAGG TCGAAGAAGA 2220 

TCGTAAGAAA CCAGCTAGTG ATTATGTAGC ATCAGTTACA AATGTCAATC TCCAATCTTA 2280 

TGCTAAGCGA CGCAAGCGTT CAGTGGATTC CATCGAGCAA TTGCTGGCTT CTATAAAAAA 2340 

TGCTGCTGTT TTTTCTGGCA ATACGATTGT AAATGGCGCC CCTGCAATTA ATGCAAGTCT 2400 

AAACATTGCT AAAAGTGAGA CAAAAGTTTA TACAGGTGAA GGTGTAGATT CGGTATATCG 2460 

TGTTCCAATT TACTATAAAT TGAAAGTGAC AAATGATGGT TCAAAATTCA CCTTTACCTA 2S20 

TACGGTTACG TATGTGAATC CTAAAACAAA TGATCTTGGT AATATATCAA GTATGCGTCC 2580 

TGGATATTCT ATCTATAATT CAGGTACTTC AACACAAACA ATGTTAACCC TTGGCAGTGA 2640 

TCTTGGTAAA CCTTCAGGTG TAAAGAACTA CATTACTGAC AAAAATGGTA GACAGGTTCT 2700 
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ATCCTATAAT ACATCTACAA TGACGACGCA GGGTAGTGGG TATACTTGCG GAAATGGTGC 
CCAAATGAAT GGTTTCTTTG CTAAGAAAGG ATATGGATTA ACATCATCTT CGACTGTACC 
AATTACTGGA ACGGATACAT CCTTTACATT TACCCCTTAC GCTGCTAGAA CAGATAGAAT 
TGGAATTAAC TACTTCAATG GTGGAGGAAA GGTAGTTGAA TCTAGCACCA CCAGTCAGTC 
ACTTTCACAG TCTAAGTCAC TCTCAGTAAG TGCTAGTCAA AGCGCCTCAG CTTCAGCATC 
AACAAGTGCG TCGGCTTCAG CATCAACCAG TGCCTCGCCT TCAGCCTCAA CCACTGCGTC 
AGCTTCAGCA AGTACCAGTG CTTCAGTCTC AGCATCAACA AGTGCTTCAG CCTCAGCATC 
GACAAGTGCC TCGGCTTCAG CAAGCACATC AGCATCTGAA TCAGCCTCAA CCAGTGCTTC 
GGCTTCAGCA AGTACCAGTG CTTCAGCTTC AGCATCAACC AGCGCCTCCG CCTCAGCAAG 
CACCTCAGCT TCTGAATCGG CCTCAACCAG CGCCTCGGCC TCAGCAAGCA CCTCAGCTTC 
TGAATCGGCC TCAACCAGCG CCTCAGCCTC AGCATCAACC AGTGCTTCGG CTTCAGCAAC 
CACAAGCGCC TCGGCTTCAG CATCAACGAG TACGTCACCT TCAGCCTCAA CCAGTGCTTC 
AGCCTCAGCA TCAACAAGTG CGTCAGCTCA GCAAGTATCT CAGCGTCTGA ATCGGCATCA 
ACGAGTGCGT CTGAGTCAGC ATCAACGAGT ACGTCAGCCT CACCAACCAC CTCACCTTCT 
GAATCGGCCT CAACCAGTGC GTCACCTCAG CATCGACAAG CGCCTCAGCT TCAGCAAGTA 
CCAGTGCTTC AGCCTCAGCG TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG 
CATCAACCAG TGCGTCAGCC TCAGCAAGTA CTACTGCATC GGCTTCAGCA TCAACCAGTC 
CCTCGGCTTC AGCGTCAAAC AGTG 
<2> INFORMATION FOR SEQ ID NO: 264 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 795 base pairs 
(8) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

CGATAAAGAG GCCTTGACTA ATCTCAATTT GCAGATTGAA AATGGAGAGA TTATGGGCTT 60 

GATTGGTCAT AATGGGGCTG GAAAATCGAC CACTATAAAA TCCCTAGTCA GTATCATTTC 120 

ACCCAGCAGT GGTCGTATTT TGGTAGACGG TCAGGAGTTA TCCGAAAATC GCTTGGCTAT 180 

TAAACGAAAG ATTGGCTACG TAGCAGACTC GCCTGACTTA TTTTTACGCT TAACGGCCAA 240 

TGAATTTTGG GAATTGATCG CCTCATCCTA TGATCTGACT AGATCTGACT TGGAGGCTAG 300 



2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3744 , 
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TCTAGCTAGG CTATTGAACG TTTTTGATTT TGCTGAAAAT CGCTATCAGC TTATTGAAAC 360 

TCTTTCTCAC GCAATGCGTC AGAAACTCTT TGTCATCGGA GCACTCTTGT CTGATCCCGA 420 

TATTTGGGTC TTGGATGAAC CCTTGACTGG TTTGGATCCC CAGGCTGCCT TTGATTTGAA 480 

ACAGATGATG AAGGAACATG CACAAAAAGG GAAGACAGTC TTGTTTTCAA CTCATGTCCT 

AGAGGTGGCA GAGCAAGTCT GTGATCGGAT TGCCATTTTG AAAAAGGGCC ATTTGATTTA 

TTGTGGTAGT GTAGAGGACT TGAGAAAAGA TTACCCAGAC CAGTCTTTGG AAACTATCTA 

CCTTAGTCTT GCTGGTAGAA AAGAGGAGGT TGCGGATGCG TCTCAAGGTC ATTAAAAAAT 720 

TAGTTGATAT CAATATCCTT TATTCATCTC AAGAAGCTAA TCTGGCTAAT CTACGAAACA 780 

AGCAGGCTAA GAATC ?9g 

(2) INFORMATION FOR SEQ ID NO: 265: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2231 base pairs 
(B> TYPE: nucleic acid 
<C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 265: 

TGGTAATGTG CTTGGCAGCw TCCTTGACAC TGCTACTACC ATTTCCCATA GCGACCGACA 60 

TACCAACGCC AGCCAGCATT TCAAGATCAT TATCTGAGTC ACCAAAAGCC ATGACTTGGT 120 

TGAGGTCAAA GCCATATTCT TTCCCAACTC GGCGAATGCC TTCTAATTTA GAATTTCCCT 180 

GATTGATGAC ATCCGATGCA AAAGGATTGC TACGTGTCAA TTTCAAGTCT TCAAAATCAG 240 

CTGCCGCCTT CTCAGATTCT TCTGGTGTCA TCAGCATCAA AACTTGGTAG ATAGGCTGAT 300 

TCATCAGGTG AAGCAGGTCC TCTTCCTTTT GGGGAACAAC CTTGCTGACC ATGCGATTAA 360 

AAGACTGACT CACCGTCCGA GTTAAAACAG AGGGAACGAA GCGACTAATT CGTTGGGAAA 4 20 

AAGAACCCAG ACCAAAGGAC ATGATTTTAG AACCCAACAT GGCATCCTTG GTCCCTAG/.G 480 

CAATCTCCGT GCCCTCTTTT TTAGCATAGC TAATTAGATG GCGCAAATGT AACTTGGAAA 540 

TAGGGCTCGT GAACAAGACT CTGTCTTTAC TAAACATATA CTCGCCATTA TAGGTTACCG 600 

CAAAATCCAG ATCCAAATCG TCCATCAATT CCTTAACAAA AAAAGGTCCT CGCCCTGTCG 660 

CTACGCCAAC TAGTACCCCT TGTTCTTTGA CAATCTTAAT CGCATCCTTA GTGGATTTCA 720 

AAACACTCTT CCGATTGTTC ACCAAGGTTC CATCGATATC AAAAAAAACA GCTTTGACTT 780 

CCATCCTATC CCAATCTCCC CTTTTGTGAT ACAATGATTA TACCACATTT CAGAAAGAGT 840 

GAGTAAATCA TGCCTAAGAA AATCCTTGTT TTACATACGG GTGGAACTAT TTCCATCCAG 900 
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GCCGATCCTT CTGGCGCTGT TCTGACGAGT TCAGATAATC 
CCACTTGAAG GAATCCAAGT CCACGCCTTG GACTTTTTTA 
AAACCCAAAC ATATGCTGGT CCTCTACCAG AAAATTAAAG 
GGAGTGGTGA TCACACACGG AACCGATACT TTAGAGGAAA 
ATGGAAGTTC CCCATATGCC TATCGTTCTA ACAGGAGCCA 
CTAGTCATCG TGTTTATAAT TACCTAAGTG CTTTACGAGT 
CTGACAAAGG AGTTTTGGTC GTTATGAACG ATGAAATCCA 
AAACACATAC GACTAATGTC AGCACCTTCC ACACTCCAAC 
TCATGAAACA GGAAATCCTC TACTTCAAAA CAGCTGAACC 
ATCACATACA AGGTTTAGTC CCTATCATCT CGGCTTATGC 
TTGATATGCT GGATTTAGAA CACTTGGACG GTTTGATTAT 
ATATTCCCAA ACAAACGGCT CAAAAATTAG AAAGCCTTCT 
CTCTGGTATC ACGATGCTTT AACGGTATTG CCGAGCCTGT 
GCGTACAGTT GCAAAAAGCA GGCGTTTTCT TTGTTAAAGA 
GCTTGAAACT CCTCATCGCC CTCAATGCCG GACTAACAGG 
TGGAAGGCTA ATACTCTTCG AAAATCTCTG CAAACCACGT 
ATGCtACTGA CTTCGTCAGT TTCATCTACA ACCTCAAAAA 
TCACTTCTAT CTACAACCTC AAAAACATCT TTTGAGCTGA 
ACCTCAAAAA CATGTTTTGA GCTGACTTCG TCAGTTCTAT 
TTTGAGCTGA CTTCGTCAGT TCTATCTACA ACCTCAAAAA 
TCAGTTCTAT CTACAACCTC AAAAACATGT TTTGAGCTGA 
ACCTCAAAAA CATGTTTTGA GCTGACTTCG TTAGTTTCAT 
TTTGAGCTGA C 

(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 1310 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



CCATCAACCA 
ACCTTCCAAG 
ACCAAGCAGA 
CAGCCTATTT 
TGCGTACtCC 
GGCCAGCGAT 
CGCTGCCAAG 
ACATGGCCCC 
TCGTCTTCCC 
TGCTATGACA 
CCAAGCCTTC 
GCAAAAAGGA 
TTATGCATAC 
ACTCAACGCC 
ACAGGCTTTG 
CACGTCGCCT 
CATGTTTTGA 
CTTCGTCAGT 
CTACAACCTC 
CATGTTTTGA 
CTTCGTCAGk 
CTACAACCTC 



TGTGTCCAAC 
TCCCCATATC 
TAACTACGAT 
CCTTGATACC 
AATGAGCTCG 
GACACGGCTG 
TATGTCACCA 
CTTGGTCTCA 
TTTGACCTTG 
GATGAGCTGA 
GGAGCTGGTA 
ATTCCAGTCG 
CAGGGTGGGG 
CAAAAAGCTC 
AAAGACTATA 
TACCCTATGT 
GCTGACTTCG 
TCTATCTACA 
AAAAACATGT 
GCTGACTTCG 
TCTATCTACA 
AAAAACATGT 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1S00 
1S60 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2231 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
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GAGTCAAAGG CTCCGAGGTT GACTTTTTAC AAGGCGACAG GTGAATATTA TCTAGACCTG 60 

TCAGAAATTC TCTTCTTTGA AACAGAAGGG AGCAAGATCT ACGCTCATAA CCAGAAGGAA 120 

GCTTATGAGG TTCGCCTCAA GCTCTATGAG TTGGAGTCTA TCTTCCCTCG CTATTTTAAT 180 

CGAGTTTCCA AGTCAACGAT CGCAAACATC CGTCAGATTT ACTCAGTGGA CAAGTCCTTT 240 

TCAGGAACGG GCACCATTTC CTTTTATCAG ACGCACAAGG AGGTTCATGT CTCACGGCAT 300 

TACCAATCCC TCCTAAAAGA AAATCTAAGA AACATGAGGT AAAAAACATG AAAAAGAAAG 3 60 

CATTTGGTAT TGTTTTATTG GTTTTAGCAG CTTGGATCTT GCTGCAAGGG AATTTTGGAA 4 20 

TTCCTTCTTT GGATGGTAAA ATATGGCCTT TACTAGGTAT TGTTTTTTTT GCTTATAAGT 480 

CCATTGAGTC CATCCTTAGA CGTCATCTCA CTTCGGCAGT TTTTACAGGT TTACTGGCGC 54 0 

TCATCATTGC AAATTACGCT TATGACTTGT TACCAGTTAC CAATCATTCT CTTATTTGGC 600 

CTAGCATCTT GGTGCTACTT GGTGTTGGTT ATCTGACGCA TTCAAGTAAG TTCTGGAATG 660 

AAAAAAAATG GTGGTACAAT GGGAAAAAAA CAGTCGTCAC GGATAAGGAA GTCGCTTTTG 720 

GTAGCGGGAC CTTCTATAAG CAAGATCAAG ATCTCGTAGA TGACCAAGTG GAAGTCCCTT 780 

TTGGGGATGC TAAAATCTAC TATGATAATG CAGAGATGCT AGGTGATTTT GCAACTTTAA 840 

ATATTGAAGT GGCCTTCGGG AATGCAACCG TCTATGTTCC ACAACACTGG CGTGTAGATT 900 

TGAAAGTAGA AACCTCCTTT GGTGCAGCTA AGGCTGACGC TCCTGTAGCC CCAACCAGCA 960 

AAACCTTGAT TATCCGTGGA GATGTGGCTT TTGGGAAGTT GGAAATTGTC TACGTTAAAT 1020 

AAAAAAATCT TCACTTCAAC CATCAAAATA GACGTACTAA GAGTAGGAAA TTGATGCCTT 1080 

GCTCTGATTT CAGTTCTATG GTTGTTAGAC TTTAAAAAAT GAAATGCTGC CTTTAAAAGT 1140 

TGTATATTTT TCGATATTTT GGCTTTTACG TTTGATGTAT CTATGTACTA CAGCGTAGAT 1200 

GATGTAGTGT CAAATGCTTT TAAAAAACGG ATGATATTGG ACAGTTTTTT TGCCTTTAAT 12 60 

TGCTCAGGAA CCATGAAAGT CAGTACCTGG GTTTATGACA AGGGAGAATG 1310 

(2) . INFORMATION FOR SEQ ID NO: 267: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5922 base pairs 
f B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 
ACTCTGATTT GATTGGAACG ACAGTCGGTG CCATTGCAGT TACTTCAAAC GTAACGACTT 
ATGTTGAGTC TGCTGCTGGT ATCGGTGCAG GTGGACGTAC TGGTTTGACA GCCTTGGTTG 



60 
120 
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TAGCTATCTG TTTTGCGATT TCAAGCTTCT TTAGCCCACT TCTAGCGATC GTACCAACAG 180 

CGCCTACAGC TCCAATCTTG ATTATCGTTG GGATTATGAT CCTTGCTACC TTGAAAAATA 240 

TCCATTGGCA TGATATGTCT GAAGCAGTTC CTGCCTTCTT CACATCTATC TTTATGGGAT 300 

TCAGCTACTC TATCACTCAA GGGATTGCAG TTG G TTTCTT GACTTACACT TTGACTAAGC 360 

TTGTTAAAGG TCAAGTTAAA GATGTTCATG TCATGATTTG CATTTTGGAT GCCTTGTTTA 420 

TCCTTAACTA CATCAGCATG GCCTTATAAT AGAATCACCC AGGCGCATTT CCCCCCTTTT 480 

TTAATACAaG GAGATAGGTG ATGAAAGAGA AAAATATGTG GAAAGAATTG TTGAATCGTG 540 

CAGGCTGGAT TTTGGTCTTT TTACTTGCCG TCCTTTTATA TCAGGTTCCC CTAGTCGTTA 600 

CCTCTATTTT GACTTTAAAA GAAGTAGCCC TGCTACACTC AGGGCTGATA CTTGCTGGCC 660 

TTTCAATTGT GGTTCTGGCT CTATTTATTA TGGGAGCTCG TAAAACCAAG TTACCTACTT 7 20 

TTAATTTTTC TTTTTTTAGA GCTAAACATT TGGCACGTTT GGGCTTGAGT TATCTAGTTA 780 

TTGTCGGCTC AAATATACTT GGTTCCATTT TATTGCAACT GTCAAATGAG ACGACAACAC 840 

CTAACCAGTC TCAGATTAAT GATATGGTTC AAAATAGTTC GTTGATTTCC AGTTTCTTCT 900 

TGCTAGCCTT CCTTGCTCCG ATTTGTGAGG AAATCTTGTG TCGTGGGATT GTTCCTAAAA 960 

AGATTTTCCG AGGCAAGGAG AACTTGGGAT TTGTAGTCGG TACGATTGTG TTTGCTTTAT 1020 

TGCATCAACC AAGTAATTTA CCTTCTTTAT TGATTTATGG AGGTATGTCG ACAGTTCTAT 1080 

CTTGGACAGC CTACAAGACC CAACGTTTGG AAATGTCGAT CTTGCTTCAC ATGATTGTTA 1140 

ATGGGATTGC TTTCTGTTTG TTGGCTCTTG TGCTGATTAT GAGTCGGACA TTAGGAATTT 1200 

CTGTTTAAAA CTTTTTATGT AGGAACCGAC CTCTTTCTAC CAGGGAAAGA TGAATGCAAT 1260 

CCTGTCCATC TTTTTCTTTT TATGGTAAAA TAGAAAAATA ATATGATGAA AATCCTTGAG 1320 

GGAGTGACCG ATATGTCAAG TAAAGCCAAT CATGCAAAGA CAGTTATTTG CGGAATTATC 1380 

AATGTAACCC CAGACTCCTT TTCGGACGGT GGTCAATTTT TTGCTCTTGA GCAGCCCCTC 14 40 

CAGCAGGCTC GTAAATTGAT AGCAGAAGGA CCCACTATGC TAGATATCGG CGGAGAATCG 1500 

ACTCGGCCGG GAAGTAGCTA TGTTGAGATA GAAGAGGAAA TCCAGCGTGT TGTTCCAGTG 1560 

ATCAAAGCGA TTCGCAAGGA AAGTGATCTC CTCATCTCTA TTGATACTTG GAAGAGTCAA 1620 

GTAGCAGAGG CTGCTTTGGC TGCTGGTGCC GATCTAGTCA ATGATATCAC TGGTCTTATG 1680 

GGTGATGAGA AAATGGCTTA TGTGGTAGCT GAAGCGAGAg CGAAAGTGGT CATCATGTTT 1740 

AACCCAGTTA TGGCTCGACC TCAGCATCCT AGTTCGCTTA TCTTCCCTCA TTTTGGTTTT 1800 

GGTCAAACCT TTACAGAAAA AGAGTTAGCT GACTTTGAAA CATTGCCAAT CGAAGACTTG 1860 
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ATCGTGGCTT 


TCTTTGAACG 


AGCACTAGCG 


AGAGCGGCAG 


AAGCTGGTAT 


TGCACCAGAA 


1920 


AATATCCTCT 


TGCATCCAGG 


AATTGGCTTT 


GGTCTGACCA 


AGAAAGAAAA 


TCTGCTTCTT 


1980 


TTACGCGACC 


TGGATAAACT 


ACATCAGAAG 


GGCTATCCAA 


TCTTTCTCGG 


AGTGTCGCGC 


2040 


AAGCgATTTC TCATCAATAT 


CCTAGAGGAG 


AATGGTTTTG AAGTCAATCC TGAGACACAG 


2100 


CTTGCTTTCC 


GAAATCGGGA 


CACGGCTTCG 


GCTCATGTAA 


CTACTATCGC 


TGCGAGACAG 


2160 


GGTGTAGAAG 


TGGTGCGCGT 


GCATGACGTA 


GCTAGTCACA 


GGATGGCAGT 


TGAAATTGCC 


2220 


TCTGCCATTC 


GTCTGGCTGA 


TGAAGCGGAA 


AATTTAGATT 


TAAAACAATA 


TAAATAAGAT 


2280 


GAAAGAAATT 


GAAAACAATC 


AGTGGATTGC 


TAACTACCGG 


ACCGATCAAC 


CGCATTTTGG 


2340 


CTTGGAACGA 


ATGGTGGAAC 


TCTTAGCTTT 


GCGTGGCAAT 


CCCCATCTCA 


AACTCAAGGT 


2400 


CCTCCATATC 


GGAGGGACTA 


ACGGCAAGGG 


CTCGACTATT 


GCTTTFrTCA 


AAAAGATGCT 


2460 


AGAAAAGCTA 


GGGTTGAGAG 


TTGGCGTGTT 


TAGCTCGCCC 


TATCTCATTC 


ATTACACAGA 


2520 


CCAGATTAGC 


ATCAATGGGG 


AATCGATCTC 


AGAAGCGAGG 


CTAGAAGCTC 


TCATGGCAGA 


2580 


CTATCAGTCT 


TTGCTGGAGG 


GAGAAGCGGT 


CGCCAATTTA 


CAGGGCACAA 


CCGAGTTTCA 


2640 


GATTATCACA 


GCCCTGGCCT 


ATGACTACTT 


TGCCTCAGAG 


CAAGTAGATG 


TGGCCATCAT 


2700 


GGAAGTTGCC 


ATGGGTGGAC 


TTTTGGATAG 


TACCAATGTC 


TGTCAGCCCA 


TTTTGACAGG 


2760 


AATTACAACT 


ATTGGCTTGG 


ATCATGTGGC 


TCTACTTGGT 


GACACCTTGG 


AGGTCATAGC 


2820 


AGAGCAGAAG 


GCAGGTATTA 


TCAAACAAGG 


GATGCCCTTG 


GTAACAGGGC 


GTATTGCTCC 


2880 


AGAAGCCTTG 


GCTGTGATTG 


ACCGCATTGC 


GGAAGGGAAA 


GATGCGCCGA 


GACTTGCCTA 


2940 


CGGGACAGAT 


TATCAGGTTC 


GTCATCAAGA 


AAGTGTGGTG 


ACAGGGGAAG 


TCTTTGACTA 


3000 


TACAAGTGCT 


GTGAGACAAG 


GTCGCTTCCA 


GACTAGCCTG 


CTTGGTTTGT 


ACCAAATAGA 


3060 


GAATGCTGGC 


ATGGCCATAG 


CTTTACTTGA 


TACTTTTTGT 


CAAGAAGATG 


GTCGAGAGCT 


3120 


AGCAAGCAAT 


GATTTTCTTG 


GTCAAGCCTT 


GGAAGAAACA 


AGTTGGCCAG 


GGCGTTTGGA 


3180 


AATCGTGTCA 


AGAGATCCCT 


'Pfi ATYI A.'PTT'P 

4 Wf\ * \U\ A. X 1 l 


GGATGGAGCC 


CACAATCCCC 


ATGCTATCAA 


1 Jan 


GGCCTTGTTG 


GTAACCTTGC 


AAGAACGTTT 


TGCGGATTAT 


CATAAGGAAA 


TCCTCTTCAC 


uoo 


TTGTATCAAA 


ACCAAGGCCT 


TGGAGGATAT 


GTTGGACTTG 


CTGGGAGCCA 


TGCCAGTTAC 


3360 


CGAGCTTACT 


CTAACACATT 


TTGCGGATAG 


TCGGGCGACG 


GATGAAAACG 


TGCTGAAAGA 


3420 


GGCAGCTAAG 


TCTAGAAATC 


TCAGCTACCA 


AGATTGGCAT 


GATTTTCTAG 


AGCAGAATTT 


3480 


GACAGATAAA 


AAAGAAGAGA 


AACAAACAGT 


TAGGATTGTC 


ACACGTTCCT 


TGTATTTCTT 


3540 


GAGCCAAGTG 


AGGGCCTATC 


TGATGGAGAG 


GAAGAACGAG 


AATGGATACA 


CAAAAGATTC 


3600 


AACCGGCTGT 


AAAAATGATT 


ATCGAGGCTG 


TAGGAGAGGA 


CGCTAATCGC 


GAGGGCTTGC 


3660 
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AGGAAACACC TGCTCGTGTA GCCCCTATCT ATCAAGAGAT TTTTTCAGGT CTTGGTCAAA 
CAGCAGAGGA ACATTTGTCA AAATCCTTTG AAATTATTGA CCATAATATG GTGGTAGAAA 
AGGATATCTT TTTCCATACC ATGTGTGAAC ACCACTTCTT GCCATTTTAT CGTAGAGCGC 
ACATTGCCTA CATTCCAGAT GGTCGTGTGG CAGGCTTGTC TAAGCTAGCC CGTACGGTTC 
AAGTTTATTC GAAAAAACCA CAAATTCAAG AACGTTTGAA TATCGAAGTG GCCGATGCCT 
TGATGGACTA TCTACGTCCT AAAGGAGCCT TTCTTCTCAT TGAGGCCCAA CATATGTGTA 
TGAGTATGCG TCGTGTTAGA AAACCACGCA CTGCAACCTT GACGACACTA GCTCGTGGTC 
TATTTGAAAC AGATAAGGAT CTCCGTGACC AAGCTTATCG TTTAATGGGG CTATAAAAAG 
AATCCGCTTC AAGCGGATTT TTCTACAAAC GAATCATTAT GGATCAACTG CAGATTAAGG 
ATTTGGAAAT GTTTGCCTAT CATGGTCTTT TTCCTAGTGA GAAAGAATTG GGGCAGAAAT 
TTGTCGTTTC AGCCATCCTA TCCTATGATA TGACCAAGGC AGCTACAGAC TTGGATTTAA 
CAGCCTCTGT CCATTACGGA CAATTGTGTC AGCAGTGGAC GACTTGGTTT CAGGAAACGA 
GTCAAGATTT GATTGAAACG GTAGCCTATA AACTGGTGGA ACGTACCTTT GAGTTTTATC 
CTCTTGTCCA AGAAATGAAG TTGGAACTGA AAAAACCTTG GGCACCGGTG CATTTGTCAC 
TAGATACTTG CTCGGTAACC ATTCATCGCC GCAAGCAACG AGCCTTTATC GCCCTAGGAA 
GCAATATGGG AGATAAACAA GCAAACTTGA AGCAAGCCAT TGACAAACTG CGAGCTCGTG 
GCATCCATAT TCTCAAAGAG TCCAGTGTCT TAGCGACGGA GCCTTGCGGT GGAGTGGAGC 
AGGATAGCTT TGCCAATCAA GTGGTTGACG TGGAAACCTG GCTACCAGCA CAAGACTTGT 
TAGAAACCTT GTTAGCCATT GAGTCAGAGC TCCGACGCCT GAGAGAAGTG CATTGGGGAC 
CTCGTTTGAT TGATTTGGAC TTGCTCTTTG TGGAGGACCA GATCCTTTAT ACAGACGACC 
TCATATTGCC TCATCCTTAC ATAGCGGAAC GCCTTTTTGT CCTTGAGTCC TACAGGAAAT 
TGCGCCTCAT TTTATCCATC CGATATTAAA ACAACCGATC CGCAACTTGT ATGATGCTTT 
GAAAAAATAG AAAAACTCTA GTTTTCAGTT ACTTGCAACT GAAGGCTAGA GTTTTTATAC 
TCTTCGAAAA TCTCTTCAAA CCACGTCAGC GTCGCCTTAC CGTACTCAAG TACAGCTTGC 
GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAT TAAAATACGT CATTTTCTTC 
TGGGAGGAGG ATAGTTTCTC TACCGTCCAT GTCTAAAACC AGTACTCTTG GGGGATAACG 
AGGGTCGAAA GGATGGTTAA ACTCAAAATC AATGGCTGTA GGGAGGTGTT GACTTGAAAA 
GTGGAAGGTA ATCTTTCCTT GGTTATTAAG CAATTGAAAC TCGAGTTCTT CTTCCAATTC 
AAAGACATTT TTTAAGAAAT GGTCGATGAT ATACCAAAAA GAGTCAATGA TGTCATCAGG 



3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4S00 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 
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CAAGCTGGTA ACAATACCAA AACTAGCAGA TCGCATGTGG GTATTGGTAA AAGCCATATC 5460 

TCTGTCCCCT TTCTTTTCCC TTATCATACA GCAAATACGA TTAAAAATCA AGAAAAGCTC 5520 

ATTTTTTGAA AAGGATTTTA GTTACAGGGA GAAATAGGGA AAAAATTCCT AAAAATCTAC 5580 

CGAAGTTAAT AGGTAAATTC CCAAATTAAC TTCATTATAT AACTTTCAGT TACTTTGAGA 5640 

AGTTACCGAA AAATATTTTT CATATCTATT GACTTTTAGG GGTAAAATTT GGTATGATAG 5700 

TAGGCGGTAT TGTTTACCCC ATTTGAAAGG CCCCGGAACC TTCCAAATAC TTTTCGATGG 57 60 

GAAGGAACAC CCATCACCGT AAACAAAAAT CGAACTATAT ATAGGAGAAA TCATGAACAA 5820 

AACAACATTT ATGGCTAAAC CAGCCCAACT TGAACGTAAA TGGTACGTAG TTGACGCAAC 5880 

TGATGTACCA CTTGGACGTC TTTCTGCAGT AGTTGCTAGC GT 5922 
(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1988 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
iD) TOPOLOGY: linear 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 
TAACTATCTA CGATGAGCTG TTCTGATTCT CATTAGTTCC CCTTTCCCAA GAGGCATAGG 60 

GCTGCGCATA ATAGATGTGC TCCTCACAAA ATATATCAAA CAAGCGATTG AATTCCGTTC 120 

CATTATCTGC CGTGATGGAA AGAATCTTGT GTTGTTTTAA GATGAGTTTT AGAGCCTGAT 180 

TGACCACCTC AGCACTTTTA TTTGGAATCA ATCGGATGAT CTGATGTCTA CTCTTTCCAT 240 

CCGTCAAGAC AATCAAGCAG TAGTTTTTCG ATCTCGTAAG TAGAACCGTA TCAATCTCAT 300 

AATGCCCATT CTCCAAGCGA AGATTGATAG CTTCAGGCCG CTGTTCGATG GATTGACCAG 360 

CAGGTTTAAA GTTGGTGCTA GCCTGTTTCT TAAGCGCTTT TCCTTTTCTA GGGTAAAGCA 420 

AATCCTGCTT GCTTAACCCC AATTTTCCAT GATGAATCCA ATAGTAAATG GTTGAAATTC 480 

CCACGTTAAC CCCTTTAGCC ATAACCATCA TTTCAGGCGA AAATTTTTGG TTATGATAGT 540 

GGAGAATCTT TTCCTTTAGT TCCTTGGTCA AGCTTGATTT CTTGACCGAG CGCTTGCGAT 600 

TGTTTTCATA AGACTGTTGA GCGTAGTCGG CAGAATAAAC CTCTTTGAAG CGCCCTTTTC 660 

CAAGACATTG TCGGACTCTC CCACGCTTGA TTTCAGTGTG ATAGTTTGAG GAC C TTTTCC 720 

AAGTAGAGAG GCAATTTCTC TATTTGATTT TCCTTCTTTT TTCCATCTTT CGATTAAGCG 7 80 

ACGGCTATCG ATTGTCAAAT GTTTGGCTTT TGTAGTATAA TTGTCTTGCA TCTCTGTGCC 840 

TTTCTTGTGT TTGTGGTTGA ACAACAAGTA TAACACAGAG GTGCTTTCTT ATGCCTACAA 900 
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GAGCTTTCAT TATTTCCATT TTCTTTTGGA 


TTTCACTCTA TTCTGAAAAA 


CTTGTGTATA 


960 


TTTACTGAAG CTAGCAAGTC TTACCTGTAA 


ATTTAATGAA AGCAACACAA 


AATCCGAGAG 


1020 


GGGAATCTCG GATTAATAGA TAGAGAGTTT 


TTAGTTTAAA TAAATTGTTT 


AAAATATCAA 


1080 


CAACATCACT 7CTTTTCTTA ACCTGATAAG 


TCTTGATTCC TAATTTTGGG GCTACGATTA 


1140 


TATTGTCCTC AATATCCTCT AGAAAGACAC 


AATTTCTAGG TTATAACTGG 


TATTTATCGA 


1200 


TAGTTACTCA TATACATCAG TCCACCTCCA TACTTATGTC CGAGCCTCTC 


TTTCTATTAT_ 


1260 


ACCTCCATAC TCACCTTACA GATTCTTTTG 


GTAATAATAT CTTTGCCTAA 


TGTAGAGACA 


1320 


GTCTTGCAAA GAAAAAACTT CCTTGTAGCC 


ATGTTTCTGA TAAAAGTCCG 


GTGCCTGGAA 


1380 


CTGGTAAGTA TTGACAAAGG CAAAACAACA 


ATTTCGATTC TTAGCTTCAC 


TTTCTGCCTG 


1440 


TTGCAATAGT TTTGAACCGA TTCCTTGCCC 


TCGCAGTTCC TCTTTTACAA 


ACAAATACTC 


1500 


GATTTCTAGC CAATTTCCAA AAGTCTCTCC 


TATCAAACCT GCCACGAGAT 


TGCCCTTTTC 


1560 


ATCTTCGACA TAAAGATTAA GTGGCTCACT 


TTCAGCCTCT TCTCTTTTTG 


AACGGTTATA 


1620 


AACACGAATC AGATTCCCTA TTTCTTGCGA 


TTTATGTGAT TCCTTATTTT 


CCAATCTAAA 


1680 


GTATAGTGAA ATGAAATAAA ACATGCGCAA 


ATCGATTAAG GAATTTAATC 


TAATTTCTAA 


1740 


CAATGTCTTA GAAATCAAAG TGTACTATTT 


TAACTTCAAT CCACTATACA 


TCTAATACTC 


1800 


AATAAAAATC AAAGAGCAAA CTAGGAAACT AGCCGCAGGT TGCTCAAAAC 


ACTGTTTTGA 


1860 


GGTTGTAGAT AGAAcTGACG AAGTCAGCTC 


AAAACATAGT TTTGAGGTTG 


TAGATGAAAC 


1920 


TGACGAAGTC GGCTCAAAAC ATGGTTTTGA GGTTGTAGAT GAAACTGACG 


AAGTCAGCTC 


1980 


AAAACAGG 






1988 


(2) INFORMATION FOR SEQ ID NO: 269: 







(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

CCGGATATTT GTTTTATGTA ATTTTCTTGC AAGTTTCTTC TTAGTAGCTT GTCAGTCAGG 60 

TTCTAATGGT TCTCAGTCTG CTGTGGATGC TATCAAACAA AAAGGGAAAT TAGTTGTGCC 120 

AACCAGTCCT GACTATGCAC CCTTTGAATT TCAATCATTG GTTGATGGAA AGAACCAGGT 180 

ACTCGGTGCA GACATCCACA TGGCTCAGGC TATCGCTGAT GAACTTGGGG TTAAGTTGCA 240 
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AATCTCAAGC ATGAGTTTTG 


1300 

AUJVM^iiiii UACCAGTCTT CAAACTCGTA 


AGGCTGACCT 


300 


AGCAGTTGCA GGAATTArrr 


CTACTGACGA GAGAAAAGAA GTCTTTGATT 


TTTCAATCCC 


360 


ATACTATGAA AACAAGATTA 


GTTTCTTGGT TCCTAAGGCT GATGTGGAAA 


AATACAAGGA 


420 


TTTAACTAGC CTAGAAAGTG 


CTAATATTGC AGCCCAAAAA GGGACTCTTC 


CAGAATCAAT 


480 


GGTCAAGGAA CAATTGCCAA 


AAGTTCAATT AACTTCCCTA ACTAATATGG 


GTGAAGCAGT 


540 


CAATGAATTG CAGGCTGGAA 


AAATAGATGC TGTTCATATG GATGAGCCTG 


TTGCACTTAG 


600 


TTATGCTGCT AAAAACGCTG 


GCTTAGCTGT CGCAACTGTC AGCTTGAAGA 


TCAAGGACGC 


660 


CGACGCCAAT GCCGyTCCTC 


TTAGAAaATA GTGATGATTT GAAAGAAGT 




709 


(2) INFORMATION FOR SEQ ID NO: 270: 







Ui SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 base pairs 
<8) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 



60 
120 
180 



<xi) SEQUENCE DESCRIPTION; SEQ ID NO: 270: 
TATAAAATCT TAACTTAAAT GATTTCAAAA TTCAGAAAGG GATTGCTTTA TGCAGTTCCT 
TTTTATTTTA ACAGGAGTGA AACTATAGTG TTTCTAAATT CTGAATCAAT CAAAACTGAT 
TGTGATGGGG CTATTCTAGC TTTAGAAACC TTCAAAAATT AAAATTTAAG CCAATCAATT 
ACTTGGAACA GTATGAAAGC ATTTAGTTTA TAGGAATTCT AGGTCTAGAA TTACATATAT 240 
ATATTTATGA AGACGGGGTG TTCGATAGTT AGTATTGTTC TATTCTGAAA GATTTGAGCT 300 
GTCAGTTGTA TAGAAAGTGT TCGAATTTTT TTAAGTGATT AAATTAGTTA ATTGTATGAG 3 60 

CTGCTTTATG ATATAATGTT CTTAATGAAT TTTCAGAAAG GAAAACCTCA AATTGTTCTA 420 
CAAATTTCTA CTCTTCGACC TCGACCACAC TCTTCTTGAT TTTGATGCTG CTGAGGATGT 



480 



CGCTTTGACC CAACTTCTAA AAGAAGAAGG AGTTGCGGAT ATTCAGGCTT ATAAAGATYA 540 

TTACGTTCCT ATGAACAAGG CTCTCTGGAA AGACTTGGAG CTGAAGAAAA TCAGTAAACA 600 

AGAGCTGGTT AACACGCGCT TTTCTCGTTT ATTTGCTCAT TTTGGACAGG AAAAAGACGG 660 

TAGTTTTCTT GCCCAGCGTT ACCAATTTTA CCTCGCCCAG CAGGGACAAA CACTATCGGG 720 

CGCTCATCAT CTCTTGGACA GCCTCATTGA GCGTGATTAT AACTTGTATG CTGCGACAAA 780 

TGGCATTACT GCCATTCAGA CAGGACGTTT GGCTCAATCT GGTCTACCAC CTTATTTCAA 840 
TCAAGTCTTT ATCTCAGAAC AGTTGCAAAC TCAAAAGCCG GATGCTCTTT TTTATGAAAA 



900 



GATTGGCCAG CAAATTGCTG GATTTAGTAA AGAAAAGACG CTGATGATTG GAGATTCTCT 960 
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AACCGCCGAC ATTCAAGGTG GCAATAATGC GGGGATTGAC ACTATCTGGT ATAATCCTCA 1020 

TCACCTCGAA AATCACACAC AAGCCCAGCC GACTTACGAA GTCTATTCTT ACCAAGACTT 1080 

GCTGGATTGT TTAGATAAAA ATATTCTTGA AAAGATCACA TTTTAAAGGA GACGAGCTAA 1140 

TGACTACAAA AAAGCTAATA TTACTATTGA AGAGTACATT GAAATGTCTG AAGTTGATTT 1200 

TAATGAAGCT GTTAATTATG AATTTACATC TGACACTTGT CAATTAGCAA ATAGTATTTA 1260 
-TCAATCTCTT TTTAAGTTTT TTGATAAGAA AAATTTCTCT GGCGATTTAA TTTTTACTTG .„ 1320 _._ 

GAAATCTCCA TCATTAGTCA AAGAAGGGGA TTATATTGGG AGAAGGGATT CACAAGTAGA 13 80 

TAATCTTAGA GTAATAGGAA ATATATTTCC GAATTATCTT ACTAATCGAA AATATAGCCT 1440 

CAATATGAAT CGTAATGGCT GTATGGGAGA TTTTCCTCAT GACTTTTTTG ATATATACCT 1500 

AGATCATGTA GCAAAATATG CCTACGAACA AAAAGTTAAT AATATTAAAG AGTATTATCC 1560 

TTTAAAAAGA GCGATTTTAC ACCAAGAGAA TGCATTGTAT TTTCGATTTT TTTCTAATTT 1620 

TGACGACTTT TTAGAAAAAA ATTATTTAAA GACTATATCG CAAGTTTCTA AAGAAACTCC 1680 



(2) INFORMATION FOR SEQ ID NO: 271; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 596 base pairs 
(8) TYPE: nucleic acid 
(C) STRAND EDNESS : double 
(O) TOPOLOGY : linear 



(xi» SEQUENCE DESCRIPTION: SEQ ID NO: 271: 

AGCTCGGTAC GTAGTATnTG TGGTGCATAA ATGAGTGAAA AGAGGATAGA GAGGATGAGG 60 

CCGATAAGAA CACCGGTAGC TGCATCGTGA AATACTTGTT TTTTCATAGT TCTAATTTCT 120 

CCTTCATGGT TTTTAGATAA CGGCGTGAAG AGTAGGTGAA GCTTTCGTTT TTCAAGAAAA 180 

TTTCTACCAG ACCGTTTGGC GTGAgCTTGA GGTGAGAGAT GGAATCGATA TTGATGATTT 2 40 

CTGATTGGGA AATTTGGATA AAATTGGTTG GCAAGAGTTT AAGAACCTGA TAGAGTCGCA 300 

AATCAATGCT GTAGGTCTGA CTCGCGGTTT CTGCTAGAAC CTTCCGATTC TCGATATAGA 3 60 

AGCGCTGAAT CTTGCCAATC TCAACTAGAT AGACCTGATC ATCGATTTTT CCTTTGATTT 4 20 

TTTCTCTTTG GTCCAGATTT TCTGCGAACT CGATGACTTT CTGGACTTTT TCGGTTTCTT 4 80 

GAGGTGCTTG GACAATCAGC TTTTCCTCCT CGTAAGTCTC ACTAATCTGT AGTTCTACTT 540 

TCATAGTTTT CTCTCCTTTT CAGTTATACA AGGTTGTGAT CACTTCCTGT ATATCCGG 598 
(2) INFORMATION FOR SEQ ID NO: 272: 
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(U SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1099 base pai;s 
tB) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

CCAGCAAATC AATAACTGCA ATTGCTATAA AATGGATTCT ATAGAGTTTT TTCATGACAA 60 

GACCTCCCTC TTTTATCTAA CTTCATTCTA CTCCAAAAGA ATGGGAGTTA CAACTAAAAT 120 

GATAAAAATA GCAGAAGGGA GATTCTCTTA AGTTGGCTAG TATTCTTTAT TTGAGTTTCC 180 

TTCTATTATC TAACTTCTTC ATCATTCCAG ACAAATAAAG CTCCGATTCC ATTGAGGATA 240 

TAAAAGATGT ATTTACCGAT ATTGGCGAAG TTTCCTTGAA TACCAGCTTT TGTCAGCTGA 300 

ACGAAATTGT AAATCAACCA AAAGCCCCAC TGAGTTGTTA GTTTTAATGC ATTCAAAGCA 360 

TTGGCAATGA GGGACAGTGC AAAGGCAATA GTTGTTACGT AGGCAAGGAG ATTCATCTTG 420 

CCCCCATATC CGATATAGTT GGTCACAAAG GCAAAGAGGA AGGCGATGAT GGAAATGATG 480 

ATGGCCGCCA ATTTTACCTG TTTTTGGCTC ATTTGGTTGG GTCTGCCTTC TTGCGAAGCT 540 

TCCCACTTCT TTATAGCAAA GGTATAAATG AGGAAGGTGA CGGGATAGGT AATGATGGCC 600 

GCCTTATTTC CAAGGATATA ATCAATAGCA CCGGACAAAA TGGTATTAAC AATACCAAAG 660 

TAATTTCCCC ATTTGCTTAA TTTCCCCGTG AAACGAGTGG ACAACATGGA AATCCCAACG 720 

TTGGTTACGG AAATCAATCC AAAGGGTACA AGAGCTGTCC ATGATCCCCA GTCTACAAAT 780 

TTATCGAGGT GTGAGTTGAG GTAACCAGAT GCAATCGCAA TCCCAACGAC CAAAGCAACC 840 

CCGAAGAGGT CAAACTATTT AGATGTAGCA AAAATTTTTA GTGATTTTTT CATAGGTTAA 900 

ACTACCTTTC TTTTTTTCAA ATATTCTCCC ACCAAATGAA AGTAAAATAA AATGATAGAA 960 

ATAAAACCCT GAAAATAAAG GTTCTATAAT ATTTGTAGTG GGTAAATCCA CTATAGATAT 1020 

TATGGAGCCT ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA 1080 

AACAACTCAT TAGAAAGAT 1099 



(2) INFORMATION FOR SEQ ID NO: 273: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xil SEQUENCE DESCRIPTION: SEQ ID NO: 273: 
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CTGGGATTCA 
TGAAATTGAA 
AGCTCGTATC 
GCAATTAGAG 
CCTCTATGTT 
AGTCTATGTT 
ACCAGCAGGA 
AGTGATTTAC 
TCATACTCCG 
TGAGACAGTT 
GGTTCAGGTG 
GATTGTGGGT 
TCCTAATTAC 
CGAAGAGGGG 
GACGACCAAT 
GCTTACTCAA 
GTATAATACG 
AGATATCATT 
CCTCCTCAAA 
TCCCAAGTCC 
CAAGTTTACC 
GACAGATAAT 
GGTCGATAAT 
TCAGAGATAT 
CAATGTACAC 
TGCAGAGCAA 
ATGTATAAAG 
ATTCCCTTTG 
CTAGAATATG 



CGTGAAAAGG 
TATTACCATA 
GAGGCAATCT 
CGCAAAGCTT 
CGAAATGATT 
TCTACTAAAG 
AATAGTTTTG 
ATCTCCTTGG 
ATGGCAGTAA 
GATAAGGAGA 
GCAGTTCCTA 
TTGAGCCTTC 
CAAAAGCAGG 
CGTCGGATTG 
GATATGTTGG 
AAAGATGCCA 
CTGGAGTTCT 
TATGAATTCA 
CAGGAATTAG 
ATTGCCTATG 
TTGCAACCGC 
GTGATTAGCA 
GGTAGAGGAA 
TTTGAACACC 
GAGCGTTTTG 
GCCGGTGTTC 
TATTATTAGT 
ATAAGTGGGA 
TTCAGGAAAA 



AAGCCCAGAG 
GAGAGTCAAC 
ATAAATACTT 
CGCCTTATAT 
TTGTAACTGG 
ACAAACGTAG 
CCATTCCAGT 
ATCCTGCTGT 
CAGTGACCTC 
GTGAAAATTG 
AAAACTTTGT 
TCTTTATTGT 
TAGTGGATTT 
ACATTTCCGA 
ATCGATTGGA 
ATATGCGAGC 
TGCGCATGTA 
GTAGTCTCTT 
AATTTTGCCG 
GTTTCAAGAT 
TGGTAGAAAA 
TCAAGGCTCT 
TGTCGGCTGA 
AAGCCAGCTA 
TGCTCTATTT 
AGTATCGTAT 
AGATGATGAG 
TATGGAGCTC 
TCCTGTCGAT 



1303 
AGTAGCCAGG 
CCAGATAGCT 
TAGCCTTAGC 
ATCAGTCTCT 
GGTGGCCATT 
GWcAGAAAAA 
GTCAGATCCA 
TTTATACCAT 
ACCTTTTGAT 
GCTAGTTCCC 
TTTACAAGGA 
CATTCTTTAT 
AGTAGAATCC 
GAAAGATCAG 
AAAGAATATC 
CTTGCAGGCG 
TGCAGTTATG 
GCGTAACAAT 
TAAATACAGC 
AGATCCAGAG 
CTATTTCGCG 
TAAACAGGAT 
AAAGTTGGCA 
CAGTGATCAA 
TGGACACCCC 
TACAATTCAA 
TACATGGTGA 
GTCGCAACAG 
GTCATCATTT 



TGTACTGCTA 
CAGGCTTTAG 

atgccagact 
ctgtatgaaa 
ccttttcaag 
atcaggg ctg 

GTGTCAGATC 
GCCATTGATA 
ACGGAGATTT 
TTAACTTCTC 
ACAGTGACTA 
CTGACTTTGA 
ATTCAACTCA 
GAATTACTCC 
CATGATATTT 
CAAATCAATC 
CAGAGTCAAG 
ATTTCCGACG 
TATCTCTGCA 
TTAGAGAATA 
CATGGTGTTG 
GGTTTTGTGC 
AATATCCGAG 
AGGCAGTCTA 
TATGCCATTA 
GATGAGTAGA 
CAGAAGGTCT 
CCAGTCATGC 
CCGATGTCAA 



GAACAGTGAG 
TTGAAAATCA 
ATTTTTACTG 
ATGTTGATGA 
ATTACAAGGA 
__AGG ATTTC AA 
AAGACTTAGG 
ATACTAGAGG 
TTCATATGGG 
ATGGATATCA 
GCTCTCCTTT 
GCCAGACTTT 
TTGCTCAAGG 
TAATCGCGGA 
ACCAGTTAGA 
CTCATTTTAT 
ATGAGTTGCC 
AAAGAGAGAC 
TGGTTCGCTA 
TGAAGATTCC 
ACCACAGGCG 
AAATTTTGGT 
AAAAATTAAG 
TCGGGATTGT 
CTATAGAGTC 
AAGGGAGAAA 
GAAGCGTTTG 
CGATGAAGCT 
TATGCCAGAC 



60 
120 
180 
240 
300 
-360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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AAAACAGGGC TTGATATGAT TCCGGACATG AAAGAGATCT TACCAGATGC TGCCTATATC 1800 

CTGCTCTCAG GTTATCAGGA GTTTGATTAT CTAAAAAGAG CAATGAACCT TAGTGTGGTG 1860 

GACTATTTGG TCAAGCCTGT TGATAAGGTA GAGCTGGGAA ATCTGCTGGA GAAGATTGCA 1920 

GGTCAGCTCG GCGAGAGAGG GAAGAAAAGT CAGACTCTTA GTCAAGAATT AGACGACGCT 1980 

GGATTTGTTA GTTATTTAGG GGATAAGGAG AATTGCTGCA TAGGTCTATC CAAGGAAAAA 2040 

CAAGGTTCCT TCACCATTCC CTACTATGTC TTGGGTCAAG ACTGGCAGAT TTTCATTTCT . 2100 

GGCCACCCCC TAGATGGTTT AGTCGTTACA CCTTTTGAAG CTCCTTATCA AGAACACTTT 2160 

GAACGCTGGA AGCTGAATGC TGAGAAAACC CT C TTTTACG CTTCTGTAAA TCTGCAGCAG 2220 

TCTGAGAGTC TCTTTGCCTA TTACGAACCG ATTTATAGGG TTATCATTCA GGGAAATCTC 2280 

AATCAAATCG TAGAAGAGTT AAATCTCTTG GAGAAGGTAG TTCTTGAAAA TACACCTCGT 2340 

GTTTCGATTA CTAAACAGCT TTTTATCCAG TTTGTCATGG ATGTTTTCCA TTTATTTGAA 2400 

CATCTCAAAG CTGATGATAT GACCGACATT GTCAAAACCA TTCATGCTAT TCAATCCTTC 2460 

GATGAATTGG TTTCTTATAT CAAGGAAACT CTGATCAGCT TTTTCGGTCA ATACCGTATG 2520 

AATGAAAATG TGGTCAGTGT GCTGGAAGTC ATTGGTCGTG ATTACCAAAA AGAGCTTTCC 2580 

CTCAAGGATA TCAGTAAGGC CCTCTTTATC AATCCTGTCT ATCTAGGGCA GTTGATTAAG 2640 

CGTGAAACCG ATTCGACCTT TGCAGAGTTA CTAAACAAAC AACGTATTAA GGCTGCCCAG 2700 

CAGCTCTTGC TTTCAACTAG TGA 2723 



(2) INFORMATION FOR SEQ ID NO: 274: 

U) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 836 base pairs 
(B) TYPE: nucleic acid 
(CJ STRANDS DNESS : double 
(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

CCGCAGTTTT TTTAAACCGT ATATAAGTAT AGCATAGTCA AAAAAAGAAT GCAAGATTTT 60 

TGCAAACTTT TTTAAAATTT TTCGTAATTT TTCTTTTAAA GTTCTACTGT CAGGACTTGA 120 

CCTTGCTTAA CAACCTGTTC TCCGGCGATA TAAACATCAT CTACATCACT AGATTTAACT 180 

GCATAAACCA GGTGAGACAG CATATTTTCC TGAGGTTGGA GATGAATTTT CCCTTGTGGT 240 

TGAATGACCA GAAAATCTGC TTGCTTGCCG ACTTCCAGAC TTCCTATCTG ATTTTCCATT 300 

CCAAGGACCT TAGCCCCTTC GATTGTCAGT ACCTTGAGAG CTGTTTCGAT TGGAAACTGG 360 

CTGGCATCCC CACTTTTCAT CTTCTGAAGA AGAGCTGCAG TCCTTCCTTC CTCAAACATA 420 
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TCTAGATTGT TATTGCAACC AACCGAGTCA GTCGCAATTC CCACTGCTAC TCCCGCTTTT 480 
TGGACCTGGA TAATTGGAGC AATTCCTGAT GCCACTTTGA GGTTACTGAT AGGATTGTGG 540 
GCGATAGCnA CTTGAGAAGA TCCCAAGCGT TCAATTTCTC TCTCGTTTAA TTCGACCCCC 600 
TCAGCAAATA CGGACGGATG ATCTAAATAA CCCAGTTCTT CAAGAAAAGC AAGGGGGCGT 660 
TTGCCGTATC GTTTGAGGAT AATTCCTGAC TCCTCCTTGG TCTCCGCCAC ATGGACATGG 720 
AGCGGAATAT TTAGCTCTTT TGCCATTTCC AAACTCGCTT CCAGCAAGTC TCTACTGCAC 780_ 
CTATACGGAG AATCAGGTGC TACCATAACC TTGAAATTTG GATTTTTATA TTTTAA 836 
(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2335 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 
ATTTTATTTC ACTTTTTAGG TGGTCTGGGG CTATTCTTAT ATAGCnTCAA GACCATGGGA 60 

GACGGTTTAC AACAAGCTGC TGGAGATCGC CTGGGTTTTT ACATTGACAA ATATACTAGT 120 

AATCCTTTGT TTGGAGTTCT GGTTGGTATT GGGATGACTG CTCTAATTCA GTCTAGTTCT 180 

GGTCTAACAG TTATCACAGT CGGCCTGGTC AGTGCCGGTC TCTTAACCTT ACGTCAGGCT 240 

ATCGGGATTG TCATGGGTGC TAATATTGGG ACAACTGTCA CATCCTTTCT CATCGGTTTT 300 

AAATTAGGTA ACTATGCCCT ACCTATGCTC TTTATCGGTG CCGTCTGTCT TT T TT TT ACC 360 

AAAAATCGGA CAGTCAATAA TATCGGACGC ATCCTCTTTG GTGTCGGTGG TATCTTTTTT 420 

GCCCTCAATC TCATGAGCGG CGCAATGGCT CCACTCAAGG ATTTACAGGT CTTTAAGGAC 480 

TATATGATTG AGCTAAGTAA GAATCCTCTT TTGGGTCTCT TTGTCGGTAC TGGCTTGACC 540 

TTGCTAATTC AAGCTTCTTC GGCTACCATT GGGATTTTAC AAAACCTCTA CGCCGGCAAT 600 

CTAATTGATC TACAGGGAGC TTTGCCAGTT CTATTTGGTG ACAATATCGG GACAACCATT 660 

ACAGCCATCA TTGCCTCTTT AGGGGCTAAT ATTGCAGCTA AACGGGTAGC AGGAGCTCAT 720 

GTTGCCTTCA ACGTTATCGG AACAGTTGTC TGCGTTATTT TTCTAGTTCC TTTTACTGTC 780 

CTGATTCATT GGTTTGAAGC TACGCTAAAT CTAGCACCGG AAATGACCAT CGCCTTTGCT 840 

CACGGAACCT TTAATATTAC CAACACCATT CTCCAATTTC CATTTATCGG AGCTCTGGCT 900 

TACTTTGTAA CCAAGATTAT TCCTGGAGAG GACGAGGTTG TCAAATACGA ACCCTTATAT 960 
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CTTGATGAAC ATTTCATCAA ACAGGCCCCA TCTATCGCTC TAGGAAATGC TAAGAAAGAG 1020 

CTCTTGCACT TAGGAAACTA CGCTGCTAAA GCCTTTGACC TTTCCTATAA GTACATCATT 1080 

GACTTGGATG AAAAACTTGC TGAAAAAGGG CATAAAACCG AAGAAGCAAT TAACACCATC 1140 

GATGAGCAAT TAACACGTTA TCTCATTGCC CTTTCAAGCC AAGCTCTCAG CCAAAAAGAA 1200 

AGTGAAGTGC TTACCAATAT CCTTGATTCC TCCCGTGATT TGGAACGGAT TGGAGACCAC 1260 

ACGGAGGCTC TACTCAATCT GACTGACTAT CTTCAACGGA AAAATGTTGA ATTTTCTGAT 1320 

GCCGCCTTGA AAGAATTAGA GGAAGTTTAC CGCCAAACTA GTGACTTTAT CAAAGATGCT 1380 

CTGGATAGTG TCGAAAACAA TGATATTGAA AAAGCACGCA GTCTTGTAGA ACGTCATCAA 1440 

GCAATCAATA AGATAGAACG TGTTCTCAGA AAAACCCACA TCAAACGCCT CAACAAAGGC 1500 

GAATGTTCAA CACAAGCTGG GGTCAACTTT ATCGACATCA TCTCACACTA CACTCGTGTA 1560 

TCAGACCACG CTATGAACCT TGCTGAAAAG GTTTTTGCAG AACAAATCTA AGAACCAAGA 1620 

ACCTATCCAT CATAATTGGA TGGCTTTTTA CTTTTTCCTA AGCAAGACTA GGATGAATGA 1680 

AACTGAAAGA GTATTCTGCA GATATATAGT CCCCAATTAT TCACCCCAAA TCTAAAAACC 1740 

ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC CAATGGGTGT 1800 

TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA AAACACAAAA 1860 

AGAAAGGAAA CTCACATGAA CAGTTTACCA AATCATCACT TCCAAAACAA GTCTTTTTAC 1920 

CAACTATCTT TCGATGGAGG TCATTTAACC CAGTATGGTG GTCTTATCTT TTTTCAGGAA 1980 

CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC GAATGACCAA 2040 

CGCCGCTACT CTCGTTATTC GGATTCAGAT ATCCTTGTCC AGTTCCTCTT TCAACTGTTA 2100 

ACAGGTTATG GAACGGACTA TGCTTGTAAA GAATTGTCAG CTGATGCCTA CTTTCCAAAA 2160 

TTGTTGGAAG GAGGGCAGCT TCCTTCACAG CCAACCTTAT CCCGTTTTCT TTCCAGAACT 2220 

GACGAGGAAA CAGTCCATAG TTTGCGATGC CTCAACCTTG AATgCJcCGAA TTCTTTTTAc 2280 

AGTTTCACCA GCTAAACCAA CTCATTGTAG ATATCGATTC TACCCATTTC ACAAC 2335 
(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D> TOPOtOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

CGGATTCACT CTTGTTCACT AATCAATAAC ACAGTAGAAA ATCTCACAGC AGTCTATTAG 60 
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TTGCTTTTCA TACTAGGCAA GTGACTGAGG CTTGTACTTG GGTACAGCAA GGGAGCTTAA 

GGCCGTAGAA GAGAAAAATA GTAGACTGAA AACCCGCAAC ACTTCATCAT TTCGAGAAGT 

GACGTGGGAG ATGAAAATCG ATTGAACCAC TTACAAGGAG AATAGAAAAT GGCTAAAAAA 

AGCAAACAAC TTCGTGCTGC TCTTGAGAAA ATCGACAGCA CAAAAGCATA CAGTGTAGAA 

GAAGCTGTAG CACTTGCAAA AGAAACTAAC TTTGCAAAAT TTGATGCAAC TGTAGAAGTT 

GCTTACAACT TGAACATCGA CGTTAAAAAA GCTGACCAAC AAATCCGTGG AGCAATGGTA 

TTGCCAAACG GTACTGGTAA AACTTCACGT GTTCTTGTTT TCGCACGTGG TGCAAAAGCT 480 

GAAGAAGCAA AAGCTGCTGG TGCAGACTTT GTTGGTGAAG ATGACCTTGT TGCTAAAATC 540 

AACGACGGTT GGTTGGACTT CGACGTAGtT ATCGCTACAC CTGATATGAT GGCTCTTGTT 600 

GGACGTCTTG GACCTGTCCT TGGACCACGT AACTTGATGC CAAACCCTAA AACTGGTACT 660 

GTAACAATGG ATGTTGGCAA AGCGGTTGAA GAGTCTAAAG GTGGTAAAAT CACTTACCGT 720 

GCTGACCGTG CAGGTAACGT TCAAGCAATC AT 

(2) INFORMATION FOR SEQ ID NO: 277: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2643 base pairs 
<B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
ID) TOPOLOGY; linear 



7S2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

GTCAACATTG ATTTCAAGGC TGTTTGCTTT CTATCTCCCC TTTTTCATAA TGTATAATAA 60 

AATGAAATAA TAACAGGACG AATTGATCGG GACAGTCAAA TCGATTTCTA ACAATGTTTT 120 

AGAAGTAGAG GTGTACTATT CTAGTTTCAA TCTACTATAT TTTCGTACAG GTGCTTCAAC 180 

CATTTGAACG ATTTCAAATC CTTCTTTTTG GTAAAGATTC TGAGCTCTTT GATTTGCCTC 240 
GAAGACATTT AGAGAAATAC TGTCTATATC TCTATTTTCA AATGCTAAAC TAACAAATTT 
CCTTAAAGCC TTGCTACCTA AGCCTTGCTC CTGTTTCTGG GGGTTGATAA AAAATCTCCC 

GATATGAAGA TTGCTGTCTT CTAGCCTGAT TTTCTGGATA AATCCCACAA ACTCTTCTTC 420 

ATCAAAGATT GAAAAGACTC CTTCCAAGGC TTGAAGTGTC AGTAGAAAAG GAATCCTTCC 480 

TCCCATCCAT TGTTCTTGAA AGGATTTGCC TAGGGAGTTG GACCACTGGC ATACAAATTG 540 

AGCGTTTTCT GTGCTCACCT TTTCTTCAAA ACGAATTGTC ATCTTTTCCT CACCACCTTA 600 
TCTATGTTTC TCCATTATAC TATTTCTCCC ATTTTTTACG AATAGATAAG TATCATTGAT 



300 
360 



660 
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TTTTATTTTT 


TTCTCGTCGG 


GACCATTCTA 


GCTTCCTTTC 


TTGGTTTGGT 


CATTGACCGT 


720 


TTTCCAGAGC 


AATCCATTAT 


CAGTTCAGCC 


AGTCACTGCG 


ATTCCTGTCA 


GACTCCCTTG 


780 


CCTCCCTTAG 


ATTTGATTCC 


GATTCTCTCA 


CAGGTCTTCA 


ATCGCTTTCG 


CTGTCGCTAC 


840 


TGCAAAGTTC 


GCTATCCTGT 


CTGCTATGCC 


CTCTTTGAAT 


TAAGCTTAGG 


ACTCCTCTTT 


900 


CTGCTTTACT 


CTTGGGGATG 


GCTCTCCTTG 


GGGCAAGTCG 


TCCTAATCAC 


CGCTGGTTTG 


960 


ACCTTGGGTA 


TCTACGACTT 


TCACCATCAG 


GAATATCCCT 


TACTGGTCTG 


GATGACTTTC 


1020 


CAGCTAATCC 


TAATAGCTTC 


CTC7GGCTGG 


AATCTGGTCA 


TGGTCTCCTT 


CCTCATACTT 


1080 


GGAATTTTGG 


CTCATTTTAT 


CGATATCCGC 


ATGGGTGCAC 


GGGATTTCCT 


CTTTTTAGCT 


1140 


TCTTGTGCTC 


TCGTCTTTAG 


CGTAACGGAG 


TTACTGATCT 


TGATTCAGTT 


CGCTTCTGCG 


1200 


ACGGGTATCC 


TGGCCTTTCT 


CCTGCAAAAG 


AAAAAGGAAA 


GACTTCCTTT 


CGTGCCTTTC 


1260 


CTCTTACTTG 


CTACTTGTTT 


GATTATTTTT 


GGTAAGCTAC 


TGCTTGTCTG 


ATAAAATCCA 


1320 


ATTTCTGCCA 


TATATCCTTC 


ATGAAATTAT 


TTCACAGTTA 


AATTATAAAT 


TATTTCTTTT 


1380 


GTACAAAGGG 


ATGATGTTAT 


CAAATCGATC 


TGTTCTTCTA 


TCTTCTTGAT 


ACTGATCAAA 


1440 


AAATTTCATT 


TCGACTGAAA 


ATATTTCGCT 


TATAAACTGT 


AAACGAATAC 


TTTGTTTAGA 


1500 


CATTATAGTC 


GCTAGACTGA 


CTAGATGATT 


ACTCAAAACG 


ACGTCCAGAA 


TACTCTTTAC 


1S60 


TTTGCTTGGT 


TTTTTAACAA 


AAATTTGATC 


ATCCAAGGGT 


TCAATCATTT 


TGTAACCTTT 


1620 


TTGCGCAATT 


TGACGATAAA 


AGTAAGAATG 


TTGCTTTGGA 


GTCAATAATC 


CTAACTTAAA 


1680 


AGCTCGATAC 


TCTAAAGCCT 


GTATCGAAAC 


ATTCAAATCC 


GACTTCAATA 


AAATATAACT 


1740 


ATCAGGATTG 


CTGACACGCT 


TGCCAACCCT 


CTCTTCAAAT 


TTGACTAAAA 


ACTCTTCTTT 


1800 


TGGCAATAAA 


AAACATGATG 


CAAAATAATT 


TGCTTCTTGC 


TCCAAACGAT 


CGCCATCTTC 


1860 


ATTCATATCT 


TTATATTTAT 


GTAAAAGAAT 


ATGTCCTAGC 


TCATGAGCTA 


AGTCAAAATT 


1920 


TCGACGTACA 


GATGATTTAT 


TCGTTCCTAA 


CACAATATAA 


GGTCTTCCCA 


ATTTTGACCA 


1980 


TGCGC7ATAA 


GCATCAGCTT 


GGCCATTAAT 


TAATCGTTCC 


ACGATATAGA 


TGCCTGAACij 


2040 


TTCTAATTTA 


TAAAGCAAAT 


CATGATTATC 


TTTTGAAATA 


CCTAATTTTT 


CCCTGGCATA 


1100 


AAGAGCCAAT 


TCCTCAATGG 


ATTCTCCCTT 


ATGATAAGAT 


TCACTCACTA 


CATTACTTAG 


2160 


GTCATGAATT 


ATAATATTAG 


GTATAATTAC 


AAAACTTTCA 


AAATAATCAA 


TCAAACTATC 


2220 


TACCTTATGT 


AAATACATAG 


TTTGAATATC 


TATTGTTTTC 


CGTGTTGCTA 


GGTCTGCATT 


2280 


TCTAAAGGCA 


ATTACAGAAG 


AATCAAATCG 


AATGCTCTCT 


TCTTCCTGTT 


CAAAATAAGT 


2340 


TAAATCAACA 


TGAAATTGGT 


TGGCCAAATG 


CATTTTGGTT 


GATAATTTAG 


GTTTCGTTTC 


2400 


GTTGGACTCA 


AACTGCCAAA 


TGGCTTGTTC 


CGTTAAATTA 


ATTCTCTGAG 


CTAATTCTGC 


2460 



WO 98/18931 



PCT/US97/19588 



1309 

TCTACTTAAA CCATTTAACA GCCGTAATTC TTTCAATACC CGACCATTAA ACATTTACAT 2520 
ACTCCTTACT ACTTTTGACC TTCTTGTTTT TCTATTCTTG GAATAATTTC AAAATCTTCT 2580 
GTTTCCGATA ATTCTGAAAA ATTACCAATA TCTTGATATT TAGCTTCTTC GAAATGGTAC 2640 
GGG 

(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 582 base pairs" 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY: linear 



2643 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 


278: 






TGACCAGTGG CAAAATGGCT ATCCAAATGC AGATGTTATT 


ATCGATGATA 


TCATCTCAGG 


60 


GCAAGCCTAC GTAGCCTTGG AAGAGGGAGA ACTCCTAGCC 


TATGCTGCTG 


TGACCAAGAG 


120 


TCCAGAGGAG CCCTATGAAG CTATTTATGA GGGAAACTGG 


CAAGCTGGAG 


AGTCAGAGTA 


180 


TCTAGTCTTT CACCGTATTG CTGTGGCACC AGATGTGCAG 


GGAAaAGGAG 


TTGCTCAAAC 


240 


CTTCTTAGAG GGCTTGATTG AAGGTTTTGA TTATCTTGAT 


TTTCGCTCAG 


ATACGCATGC 


300 


TGAAAACAAG GTTATGCAAC ATATTTTTGA AAAACTTGGT 


TTTAAACAAG 


TCGGTAAGAT 


360 


GCCAGTAGAT GGCGAACGCT TGGCCTATCA AAAATTAAAG 


AAATAATGCA 


AAAGAAGTAT 


420 


GTAAAAATCC TCTACTCCTC ACCAATTGGT ATTCTATCAC 


TTGTAGCTGA 


TGACCATTAT 


480 


TTGTATGGAA TTTGGCTTCA GGAGCAGAAG CATTTTGAGA 


GGGGACTAGG 


AGATGAAACG 


540 


ATAGAAGAAG TTGTwAGTCA TCCTATTTTA GACCCAGTTA 


TT 




582 


(2) INFORMATION FOR SEQ ID NO: 279: 









(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 279: 

CCCAAGCTAC TAAGACACTA AAACTTGCTA GAGAAGCAAC AGAAAGTGTG AATCTTTTTA 60 

ATTTCATGAT GAATTTCCTT TCTGCTACCA ATTTAGAGAA ATTTTCTCTA ACCAGCAATT 120 

CCCCTAGTAT AACAAGTTCA AAAAATGGAG TCAATTTATC TGCTCACGGT CCAGCAGGTA 180 
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GCCCCGTACT TCTGAGATAA AATAGAGAGA CCCTGTAACG AACAGCAAGT CTTGAGCGTC 240 

TGCCCTTTCT TCAAAATCGC TGATAAATTC TCGGTAAGAA GAAACTATAT CGTAACCTGT 300 

CACATCCCTT TCGTCCAAAG CCCCCTGATA GTCAAAGCCG GTCACCTTGA GTTCCACCTG 360 

AGGCAATTTT TCAGTCAGAT AACCCAACAT CCCTTGATAA TCCTTACGTT TCAAGGATCC 420 

AAAGAGGATT TGAGGTCGAT ACCCTTCCTG CTCTTTTTCT TTGATAAACT CAGCCAAGCG 480 

AGTCAACGCA GGGAGGTTAT GAGCACCATC CAAATAAATC TCTGGGCGAA TACGCTCCAA 540 

GCGAsCAGCC CAAT 554 

(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 766 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xii SEQUENCE DESCRIPTION: SEQ ID NO: 280: 

CCCCTTTTTC AAATGAATTT CTTGGTTGTG GCTAAAAAAT ATGCTACACT ATCAATATGA 60 

AAATTTTAAT CCCAACAGCA AAAGAAATGA ACACAGACTT CCCAAGTATC GACCCAATTC 120 

CTTTAAAACC AGAAAGTCAG GCCGTGCTTG ATGCCTTGGC TCTCTATTCT GCCAGTCAAT 180 

TGGAGAGTTT CTACAAGGTA TCAGCTGAGA AAGCGGCGGA AGAATTTCAA AATATCCAAG 240 

CTTTGAAAAG GCAAACTGCT CAACACTATC CAGCCTTGAA ACTTTTTGAT GGGCTTATGT 300 

ACCGCAACAT TAAGAGAGAT AAGCTGACCG AGGCGGAACA AGATTATCTT GAAAATCATG 360 

TTTTCATTAC CTCGGCTTTG TACGGTGTTG TTCCAGTCTT GTCACCCATG GCTCCTCACC 420 

GTTTGGATTT TTTGATGAAA TTAAAAGTCG CTGGTAAGAC TTTGAAGAGC CATTGGAAGG 480 

CAGCCTATGA TGAAACTCTG AAGAAGGAAG AAGTGATTTT CTCTCTCTTG TCATCAGAGT 540 

TTGAGACTGT ATTTTCTAAG GAAATCAGAG CAAAGATGGT GACCTTCAAA TTCATGGACG 600 

ATAGAGGCGG TCAGCTGAAG ATTCACTCAA CTATCTCCAA GAAAGCGCGC GGGGCCTTTC 660 

TAACAGCTTT AATAGAAAAT CAAGTACAAA CTGTGGGGGa AGCACGTCGC TTGAACTTTG 720 

CTGGATTTGT TTACCGAGAA GATTTGTCAC AACCACAGGG GGATGG 766 
(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 901 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



WO 98/18931 



PCT/US97/I9588 



1311 



(xi) SEQUENCE DESCRIPTION: 


SEQ ID NO: 


281: 






CCCCCCACGG TTCCATCCAA 


CTTCACAGGT 


GTCCACTTGA 


TTGTGTATGT 


AATTGTCACT 


60 


AACGCTAGAA TTTCACCTAT 


CCCTCCTATC 


TGCTCGCAGT 


ACCCGCAGAC 


TTTCTGAAAG 


120 


AAGAAGATAA CCTACTTATC 


CGTTGCTATG 


ATTATACTAA 


AGTTTCTACT TTTTTGCAAA 


180 


TAGATTTTTA AATTTTTGGC_ TAATTGTCTG 


AATCAGGGTC 


GGAAGTTTGA 




240 — 


ATTGCCTAGT TTTTCGCGTG 


CAATTTTGAG 


AATGGCACCT 


GAGTCTTTTG 


AAGCAAAGAG 


300 


GAATTTTCCT TTGTCTGTAA 


AGACTTCGAA 


GTGGCGGCTG 


ATTTTGCGTC 


CAGTGACATT 


360 


CCCTCCAATC TGATTGATAT 


GGCTCCAAGG 


AATCTGGATA 


AATTGTTCGA 


CATTGACATC 


420 


TGGGTAAAAT TCCAAAGCCT 


GATCTCCGAC 


AAGGAATTTC 


CCAACTTTCC 


CAGCGATAGA 


480 


GAGGTAGGAA GTGCCTGTCG 


TACTGAGGAG 


TACTGTTTTG 


TTAAGTGATT 


GGGCCATGCT 


540 


TAGTCTTCCT TACTTTCTCC 


AAAAAAGGCA 


TTGTAGAGGG 


CTTTAATTGC 


TGCTTTCTCT 


600 


TGGTCTTTAT TGACAACAAA 


CATAATAGAA 


ACTTCACTAG 


AACCTTGAGA 


CATCATCTGG 


660 


ATGTTGATTT TGTTTTCAGA 


TAGAGCGCGT 


GTCGCAGTAG 


CAGTCACTCC 


GATATGGCTC 


720 


TTCATTTTTT CACCAACAAT 


CATAATGATA 


GAAAGGTCGT 


GTTCGATTTC 


TGCATGATCT 


780 


ACTTTAGCCT TTTGAACCAA 


CTGACGCAGG 


ATTTCTTCTT 


CCTTGATGGG 


AGTTAGTTGG 


840 


CGAGAACGGA GAATGATAGA 


AAGAwCGTCG 


ATACCTGTTG 


GCATATGTTC 


CCAACCGATG 


900 


T 










901 



(2) INFORMATION FOR SEQ ID NO: 282: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1765 base pairs 
<B) TYPE: nucleic acid 

(C) STRAND EON ESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 
CCCTGTTACG TGGATAATAG GGTAAGACTG CTCAGCATTT CCTAACAAAT CCACCGCTTG 
CTGCATTCGA CCCAAACCTG ATCGAAAATT CAAACCAATC CGACTATGGA GCCATTCTTC 
TACTTCAAAC ATACACATCT CCTTGACAAA AGTCCAATCA ATTATCGCAT TAAAGTATGG 
TTACTAATAA AAACAAGGCC AGGATTTTCG TCCCGACCTC TTACCTGGTT AGCTAATAAC 
TAG CT ACT AT GAATGTGAAT ATGGGCTAAA AACATCCACT GGACGTTCCA ACTCTTCCCC 
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ATTTCTGGGA 


GTTGGGGTAA 


AAATCTTCAC 


TGGACGTTCC 


AACTCTTCCC 


CATTTCTGGC 


360 


AGTTCGGCTG 


ATACAGTCTC 


CCAGACTGTA 


TCACTCCTCC 


ATAAAGCTGT 


TGAAGACTTC 


420 


TTCAATCATG 


TTCCATTCGT 


CTTCTGAGTC 


TTCTGGGATT 


GGTTGCAATT 


CCCCTTCTGT 


480 


TCCATCTTCG 


TTTTCGATGA 


ATGAGTAAGC TTGGATTTCA ACTTGTCCCT CTTCGTCTTC 


540 


TTCTGCGTTA 


ACTGGTACTA 


GAAGAACATA 


GTTTTTACCA 


AATTCTTCTT 


TTCCATCAAT 


600 


TCTCAAAAGG 


ATTTCAAACA 


AGGTTTCATT 


TCCTTGCTCA 


TCTACTAGTG 


TGATTAGTTC 


660 


ACGTTCTTCG 


TGGTCGTGGT 


TATGATCGTG 


TGACATAGCC 


TCGCCTTTAT 


ATTAAAATTT 


720 


TCTATCTAAA 


TAATTTTGTA 


AAATCAGCTG 


AGCTGCTAAC 


TTATCAATGA 


CTTTCTTGCC 


780 


CTTATTGCGA 


CTGATATCTG 


CTTGTTCAAT 


CAACATGCGC 


TCAGCAGCCA 


CTGTTGTCAA 


840 


GCGTTCATCC 


TGATAGTCTA 


CTGGTAAACC 


AAAAAACTCT 


TCTAGCTTTG 


CTCCGTAGCT 


900 


TGACTAGCTT 


CTACGCGCGG 


TCCACTTCTA 


TTGTTCATGT 


TTTTAGGCAA 


GCCCACTACA 


960 


AATCGTTCCA 


CCTTGTAAGT 


ATCAACCAAT 


TCCTTAACGC 


GGTCAAAACC 


AAATTGGCCT 


1020 


TGTTCTTCAT 


TTATCTGGAT 


CATTTCAAGC 


CCTTGAGCTG 


TAAAACCAAG 


CGGATCGCTA 


1080 


ATCGCCACCC 


CTACCGTTTT 


TGAACCGACG 


TCCAATCCCA 


TAATTCTCAT 


AGGTTATAGA 


1140 


TCGACTCCTT 


GTCCTTTGAG 


GTAGTAGCGA 


ACCAATTCCT 


CAACGATTTC 


ATCACGCTCA 


1200 


TACTTACGGA 


TTTGATTTCG 


TGCATTATTA 


TAACCAGGAA 


CGTAGGCAGG 


GTCTCCACTC 


1260 


AATACGTAAC 


CTACGATTTG 


GTTAATTGGG 


TTGTAaCCCT 


TATCGTTCAA 


CGAAGCATAA 


1320 


ACATCTGTCA 


AAGTTTCGCT 


AATTTCTTTT 


TTATTGGAAT 


CGTCCAATTT 


AAAACGTACT 


1380 


GTTTCTTCAG 


TAAATCCCAT 


TCTAACACCC 


TCTTTCCTTA 


GAATAGTACC 


ATTATAGCAT 


1440 


AATTCCTTAC 


CTTCTACAAT 


TCAGGCACTC 


TATTTATTTG 


GATTTTCTAT 


TGTTCTGTCG 


1500 


CGCCATTTGC 


CAATCTATCT 


GAAATATATT 


TGCTTGGTTC 


ATTTTTCAAA 


AGATTTTCCA 


1560 


AACCAATATT 


CTTCAGATGT 


TCCAACTGGG 


AAGCCTTCTT 


GACATCCAGA 


ACTTGAAAAT 


1620 


CAAAACTAGT 


CGTTGTTTGA 


AGTTCCGTTG 


CGCTCAATAG 


TTTTGTTTCA 


AGTTTGAAAC 


1680 


CTGCCAATTT 


ACGAGCTTCA 


ATGATAGACT 


TATCCTTCTC 


CTCCGCTTCA 


AGAAGAGCTT 


1740 


TTTGAGTTTC 


CTCCACTCCA 


TGTTG 








1765 


(2) INFORMATION FOR SEQ ID NO: 283: 









(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1346 base pairs 
(B> TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 
CTTATCCATT CACTTTCTTG TCTGTTATTC TATAAATCTT ACTCCTAAGT AT AC C AC ATT 
TGCCCCTAGA TGTGAACGAG AGAAACGCTC TAGACATTGC CAAGAAGGAA AAAAAAGGGT 
ACAATGTAAC AAAATCAAGG GAGGTCTGGA ATGAAGAAAC AAAGCAAGTA CAAAGAGGTC 
GTTTCCTATC TGAAAAATGG TATCGACTCT GGACCATTTC CGACGGCTAG TCGCCTGCCT 
TCTATCCCTC AACTGAGCCT TGACTTTCAC TGCAGCAAGG ACACCATTCA ACCAGCCCTG 
CTGGAATTAC GGCACGAACA ATACCTCTAT GCCAAGCCTC AGAGTGGCTA CTATGTATTA 
GAACAAGGGC AACATCAAGA CCTAGAAATC GAGGTTACCG ACGAACATGC CAGTGCCTAT 
GACGATTTCC GACTCTGTCT CAATGAAACC TTGATTGGCC GAGAAAACTA CCTCTTCAAC 
TACTATGACA ATCAAGAAGG ATTAGAAGAC CTAAGACAGT CCATTCACAA ACTCCTCTTT 
GAGCAAGCTC TCTACTGCAA GGCTAACCAA CTAGTACTGA CTTCTGCAAC CCAACAAGCC 
TTGTTTATCC TCTCTCAAAT ATCCTTTCCT AGACAAGCCA AGGAAATCTT CCTCCAACAC 
CCAACCTACC ATCGGATGAA TCGCCTCTTG ATTGCACAGG GGCTGGACTA TCAAACGATT 
GAACCAGCCA TTGATGGGAT TGACTTGGAG GAGCTGGAAG GCCACTTCAA AACAGGAAAA 
ATTAAGTTTT TCTACACCAT TCCCCGATTT CACTATCCCC TGGGACATTC CTATTCTGAG 840 
CAAGACAAAC GATCTATTCT TAACTTAGCT GCCAAGTATG ATGTCTATAT CGTAGAGGAC 
GATTATCTGG GTGATTTGGA CTCCAAGAAG GGCCAAACCT TCCACTATCT TGATACAGAG 
GAGCGTGTCA TTTATATCAA GTCCTTCTCG ACCAGCCTTT TTCCTGCCCT TCGTATTACA 
GCACTCATTC TTCCAAATGC TATCAAAGAA GCATTTGTCG CCTACAAAAA TATCCTAGAC 
TACGACAGCA ACCTCATTAT GCAAAAGGCC CTGTCACTCT ATATTGACAG TCAATTGTTT 
GAAAAAAATC GTTTGGCTCG CTTGACCAAT CATGAATCTT ACCAAAAACA AATCGAGGAA 
AGGATAACTA AAACACCTTG TCCCCTTCCT CATTATTCCC TACACGATCG yTTATTGCTA L260 
GACGTGAGAC AGTATCCTAA AATCGCCAGT CTCAAACACA GTCAACTGGG cTTGGACTTC 1320 
TTTGAAGAGG CCTATTTAAG CACCTG 
(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUQfCE CHARACTERISTICS: 

(A| LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
-3 00 
360 
420 
480 
540 
600 
660 
720 
780 



900 
960 
1020 
1080 
1140 
1200 



1346 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 

CTATATTCAG AATATGCCAA AAATTCGGAA TGGTATAAAT TTCCGGACGC TTCATTTGAC 60 

ATATTTACAA AACTCCCCCA AAGAATTAAT TTTAACAAAG ATTTTTCTAG AATTTTGGCC 120 

CCCTTTATTA TTAATTTCCT TAAATTAATC AATAATTATC TAGAGAATAA AGAATACGAG 180 

TGGATTGACA AGAATGGAAA TATTTTTTCC TCTCTACTAT TTTATTTAGA AGATTTAATC 240 

TATCCTTGGA TTGTTAAACC TTTGGTTTTA GAGATAAATT CATTGCGTGA AAAAGGTTTA 300 

CTTGAAGGGG AATCGGAGCA GCAACGGTAC AAATATTTTA TAACATTGTT TGACAAGGAA 3 60 

GAGAATATAT TAAATTTTTA TAACAAATAT CCCGTTTTAC TGAGGCAAAT ATCGGACTCT 420 

TGTCTTCGGT TCTATACTTA TTTTATAGAA ATTTTATCAA ATTTAGAAAA TGATTTTAGT 480 

GTGCTAGAAG AAGAATTAGG GCTAAGGGGG AAATTAAATG ATATAAAATT TGGAAAGGGT 540 

GATACACACA GCCAAGGAAA AACTCTTTTG ATACTCTTCT TTGATGACGC GAAAATTGTT 600 

TACAAGCCTA AAAATTTAAT AATCAATAAC TCACTAAATA CTATTGCTGA GTATATCCGA 660 

AAGGTTGATG AAAAAATTAG GATAAGAATA CCTCGAACTA TTGCTTATTC GGATCACACC 720 

TATGAAGAAT TTATTGATTA TCTACCTCTA GAGCAAAAGA AAAATTTACC TGAATATTAT 780 

TATAATTTTG GTGTGCTTTT AGCATTTATA TATTTATTTA ATGGGAGTGA TATACATTTT 840 

GAAAATTTAA TTTCCTATGG AGATATGCCT GTAATAATAG ACTTTGAAAC AATGTTACGC 900 
(2) INFORMATION FOR SEQ ID NO: 285: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 
<C> STRAND EDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

TTATTTACCA GAGGCACTTT TAAATGTGAA GGATTTGGTC AGTCAAACAG TTTTTTATC. A 60 

GCAGATTATT GGTTTAGAAA TCCTATCTCA AACGGATACA GAGGTCGTTC TGGGACTTGG 120 

AGGAAAAGCC TTGGTACACT TGATTCAAGC ACAAGAGGGT GGAGAAGTAA GGGAACATTA 180 

TGGTCTTTAC CATCTGGCTA TTCTTTTGCC GACACGAAAG GCTTTGGCGG ATGTCTTGAA 240 

GCACCTGACG GATTTACAGA TTCCTCTTGT TGGCGGTGCA GATCACGGTT ACAGTGAGGC 300 

CCTTTACTTA GAGGACTTGG AGGGAAATGG CATTGAACTC TATCGAGATA AGCCAGTTTC 360 

CACATGGGAT ATTCGAGAAG ATGGACGTAT TATCGGGGTG ACTGAAGTCC TTGCGGCTCA 420 

GGATATCTAT GAGTTGGGGG AAAGAGTAGA GCCTTTTATC CTAGCAGAGG GTACGAGAAT 480 
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CGCCCATATT CATCTTTCTC TCAAGGATAC TCGAAACTCC AGACACTTT? ATCAAACGCT 540 
GTTAGGGCTC GAGGATAAAT TCAGTGTGCC TACTGCTAGT TGGATCGCAG CTGGGGACTA 600 
CCATCATCAT TTAGCAGTCA ACGAATGGGG AGGAAAAGGT CTGGATCCGC GTAAACAAGT 
CCTACCAGGT TTAGCCTACT ATGTCATCGA AGTCGCACAT AAAGAAGAAC TGTTAACGAT 
TGCCCAACGA GCACAAGAAG TTGACGCACC AATCAAATGG ATGACATCGA TCCAATTGGA 790 
AATCACAGAC_TCAGATGGCA_TCGTGACCCG_TATTCGTTTA- GCTAGATAGA TGGTATGTCA - 840 

TGAAGGTAGA GCATCAATTG TA 
(2 J INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 650 base pairs 

(B) TYPE: nucleic acid 
<C) STRANOEDNESS : double 
<D> TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 286: 
TCGTTTACAA GATCGCTAAA ATGCATCTCA TGATCGCGAC CACGAATTCC AAGATAGCAC 
GCGCTACCTC AATCATAGAT AGTTCACTTT TTTCTTGCCC AGCAAATACT TCTAATTCCA 
AAGCGTTTCT CCTCATTTAT ACTACTATCG CCAGAGCGAA CAGACTCTGA CCTCATTTTA 
TCATTTACTC TTTATTTTAC GATAATTTTG CGGAATAGTC AAAGGTTAAG GGGGAGAAAG 
TGGCAGGATT AGACTAATTC CAATATAAAA CTCATTCCTT TTTCTGTTGC TCCATTTTCC 
ACAAATCCAA GCCACTTGAA ACACCTCCTA GAAGCATGAT TGTAGGTGTA GATTTTCTTG 
ACTCTCAATT CTTTCCATCC TTTTACTCGA GCCAATTCAA TCAAAGCACT TAGAATCTTT 
TTTCCAAGTC CTCGATGTTG GTAAGCGGAA TTCCCAATCA CAATGGGGAG ATTATCCTGA 4 80 

GATAGTGTAA TATCCCCAAT TGGAAACCAT TCTCCCTTCT CCTTGACTTC AATCCAAAAA 54 0 

AGCTCACCAT GCCGATyCAr ATAGGAATAC ATGGCTTCCA AGGTCGcTtG ACTGTAAGGA 600 
AGCTTCACCC CATCTACGAG GtAAcCAAGT TCACATCCGT GATACCAAGC 
(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS : double 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

GATAGCAATC CGCTTCAGAA ACTTCTCCCT TACCTCTAAC TCCCATCGCT ACTTTGCGAC 60 

AACATACTTC CATTCTCATA CTATCTGTTG GCTTTGCAGG CTGTAAAAAC AACTTTTCTC 120 

TTGCTACTTC CTGAAAATCT GAATCTTGCA GTTCTTTGCT TTCAAAATAG TCCTGTACTC 180 

GCTCCACATC AAAATTCCCA GCTAAAGACA GAGACATGTT TACAGGTTTG TAAAACTTTG 240 

TAAAATTTTC TTGCAAATTA GTTAGATTGA TTTGGGAAAT GGACTCCTCA CTTCCAACTA 300 

TATCAGTTGC TAAAGGTGTA CCAGGATACA AATTCGCTAA AGTTGAAAAG AATAAACACG 3 60 

AATCTCCATC ATCTTGGTAC ATTTCTCGTT CTTGCTGAAT AATATCCTGC TCTGTCAGAA 420 

TGGAAGCTTC AGTAAACTGT GCTGATGTTA CCAATTCATC AAGTAAATCT AAATTTTCTA 480 

AAAAATAATC CGTTGCTGAA AAAAGATAGT TTGTTTTTGT AAAGCTTGTA AACCCATTAC S40 

TATCTGCACC TAGACTCGTA AAAGCCGACA TCAAATCACT ACAATCTTCT CTCTCAAATA 600 

ATTTATGTTC AAGAAAATGA GCAATTCCTC CAGGATATTG TTTTACATCT CCGTCAACTT 660 

CTGTGACAAA CGTATCTACC GAACCAAACT GTACAGTGAC ACTCCCGTAA ACCTCTTTAA 720 

ATTCCTTTTT AGGCAAAAGA GCAACTGTCA ATCCGTTGGC CAAACGAGTT CGATAAACCA 780 

TTTCTTTTAC AGCTGGATAG TATTTTTCTT CAAAAACAAC CTTTGTCATT CTATTCCTTC 840 

CATAAAGTAA ATCGCTTGTA GTTTCACATT ATTAGCTACT CTACAAATAG CATCTTTGTC 900 

AATTTGTTCA AGCTTTGCAA TCCAACTTTT AAAGTCTGCT GAAGATTTTC CAAATAAGGC 960 

ATTTTGATAA GCACGTTCAA TCAATGAAGA ATGATTATCT TGAGAAAGTA ACAACGACCA 1020 

ACGAATCATT TCCTTGGTCT GATTTAACTC AAACTCTGTA AAAAAACCTT TTTTTAAATC 1080 

AAGCCGTTGA TTATTCATCA ATTTACGAGC CTGGTTACG 1119 

(2) INFORMATION FOR SEQ ID NO: 288: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 
(8) TYPE: nucleic acid 
(C> STRAND EDNESS : double 
tO) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

ACGCCCTCGC GGGGACATGA CGAATTCCCC GTTCATCACG AAGGCCGCCG AGGAGTGGGG 60 

GGTGCCGTCC AAGTCAAAAG CGGCCCCACA TCGATTCAGT TCCCCGACGA ACAGCCCTTT 120 

CCCCCAGCGT TCCTGGCTTT GCAACCGTTT CACAACAGCC TCGTAAAGTA GGCOGGACAA 180 

GGCAGACGGA CTCCAAAGGA GTTCTTCCAT CTGCAAGTCC GCCTGCGTTA TGTGATCCCG 240 
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GTCTTTTGCA 


TGTCTCTCGC ATGAATGCTG TTCCCAATCC 


CACTCCAGAA CATTCTCCTC 


300 


AAAAGTGCCC 


AACCTCGCCC TGAATGAATC CTGCCTTCTA GTCGTGACCA TTCCTATGAA 


360 


CGCTCGCAGA 


GGATTTTCCC CGAGTGCAAG CGCATCCTCC 


CGCTCAAATC GGGTGCATTT 


420 


CACACTCCCG 


CTCAACGCTA GCCCGATCCC TTTTTGGCAT 


GGTGACTCAA GCGTCCTTTC 


480 


AAACAAAAGC 


TCCTCATCCG CTCCAACCGG CCCGACGTAG 


ACGCGTAGAC CGAAGTCGTC 


540 


(2) INFORMATION FOR SEQ ID NO: 289: 







(i> SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 1949 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 
AAACAATTCC ACCAATTCAA GGTTGAGGCA TCGCAAACTA TGGACTGTTC CCCCGTCAGT 
TCTGGACAGA AAACGGGATA AGGTTGGCTG TGAAGCAAGC TGCCCTCCTA CCAACAATTT 
TGGAAAGTAG GCATCAGCTG ACAATTCTTT ACAAGCATAG TCCGTTCCAT AACCTGTTAA 
CAGTTGAAAG AGGAACTGGA CAAGGATATC TGAATCCGAA TAACGACAGT AGCGGCGTTG 
GTCATTCGTT ACTAAATACT TAGAAATCCG CTCTTTTAGT TTCAACTGGG AAAAAAGTTC 
CTGAAAAAAG ATAAGACCAC CATACTGGCT TAAATGACCT CCATCGAAAG ATACTTGGTA 
AAAAGACTTG TTTTGGAAGT GATGATTTGG TAAACTGTTC ATGTGAGTTT CCTTTCTTTT 
TGTGTTTTTT TCTACACTTA TACCATAAAG GGGAAACTCT TTTTTCTCTA GTAAAAAACA 
CCCATTGGGT GAAAAAAGAA ACCATCCAGG ATCTAAGCTA AGGCAAGGAT TCTCGATGGT 
TTTTAGATTT GGGGTCAATA ATTGGGGTTT TACAATATCA ACTCCCATGA TAGTCATGAG 
ATGACTCTTC ACGAATTGAC GTGATGACTG TCCTTCCTTT TGCATAATTA CCTCCGAAAC 
ACAAAAAAAG GGGTAGACAA TCTAGTGTCT ACCCCCGAAA GTTTATTAAA ACAAAAATCC 
TGCCAAAGAA TTTTTGGCAG GAAACCAAAT CAATTTATCA GTTTCTATCA ATCGCTTATC 
GCTCTCAAAG ACTGGTAAAT AGGGATTCCG CAATCAAATT GCGATACTCT ATTATTTAAG 
ACTAACTGAA GCTCCAGCTT CTTCCAATTT AGCTTTGATT TCTTCAGCTT CTGCAGTTGC 
AACGCCTTCT TTAACAAGTG CTGGTGCACC GTCAACAAGT TCTTTAGCTT CTTTAAGACC 
AAGACCAGTG ATTTCACGTA CAACTTTGAT AACGCCAACT TTTTTGTCGC CTGCAGATGT 
CAATTCAACG TCGAATGAAT CTTTAGCAGC ACCAGCATCA GCTGCATCAG CTGCAGCAAC 



60 
120 
180 
240 
300 
360 
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480 
540 
600 
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720 
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840 
900 
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AGCTACACGA GCACCTGCAG TTACACCAAA TTCTTC TT CG ATAGCTTTTA CAAGGTCGTT 1140 

CAATTCAAGG ATTGAAGCTT CTTTAATTTC AGCAATAATG TTTTCAATGT TCAATGCCAT 1200 

TGTTATTTCC TCCAAATAAG TTTTAAATTT TATAATACTT TTTTTCGTAG CTAGksTACG 1260 

CTCTGTAGCT TAAGATTAAG CCGCGTCTTC TTTGCTTTCT GCAACCGCTT TCACTGCAAC 1320 

AGCAACGTTG CGCACTGGCG CTTGAACTAC AGAAAGGAGC ATAGAAAGAA GTCCTTCGCG 1380 

GTTTGCAAGA GTTGCAAGTG CAAGAATCTC TTCTTTAGAT GCGACACCGC CTTCGATTGC 1440 

ACCACCTTTA ATTTCAAGTC CTTCAGCGTT TTTAGAAAAG TCGTTCAAGA TT TTCGCTCG 1500 

TGCGATAACA TCTTCATTAG AAAATGCTAC TGCAGATGCT CCAACAAATA CAGATGCAAG 1560 

ATCTTCAAGA CCAG CT TT TT CAGCTGCACG ACGCAACATT GAGTTTTTAA TAACTTTATA 1620 

CTCAACTTCG CTTCCACGAA GCTCACGACG AAGAACTGTA TCTTGCTCAA CTGTCAAACC 1680 

ACGAGCGTCT ACAACGACCA TAGATGCAGC AGCTTTCATT TTTTCAGCTA CACGTCAACT 1740 

AGTTCCGCTT TTTTAGCAAT AATTGCTTCA CTCATTAGTG TGTTCACCTC CGTAATTATT 1800 

TTGCTTGGGG AATTTTTCAA AAAGAAAAAC GCGCCCAATC CTAGACACGA AAGTACAATA 1860 

CGCTTCTTTT TACATGATAC GTTTTGTCCT CGGTAGGATA TTTATGAGTC GAGCTCCCCT 1920 

ACTGTCTTAG GCA G T T TTTT TAGATACGG 1949 
(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1023 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNE SS : double 

( D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

GGACTGTTTG ATCTTATACA GTAGCTGCTT GATCCAAGCT TTCACCGATA GCGGCTAGGC 60 

GCTCGATAAC TTCAGCTTGT GTCAATTCAT TTTTTGAAAC ATAGCGGTTA CGTGGCTG.\A 120 

CACGGCACTC GTGTGAGCAT CCACGAAGCT ACTTGTCTTC ATTTTCTTCT GATGTCAAGA 180 

TACGACGGTT ACAGAATGGA TTTCCACAGT TGACATAACG TTCACATGGT GTTCCATCAA 240 

ACCAGTCTTT CCCTACGATA CTTGGGTTGA CATGGTTGAC ATCAACGGCA ATACGCTCGT 300 

CAAAGACGTA CATTTTCCCA TCCCAAAGCT CACCTTGAAC TTCTGGGTCT TTACCGTAAG 360 

TTGCGATTCC TCCGTGCAAT TGGCCGACAT CTTTGTAGCC TTCACGGACC ATCCAGCCTG 420 

AGAATTTCTC ACAGCGAACG CCACCTGTAC AGTAAACCAC GACACGCTTG TCCATGAATT 480 

TTTCCTTGTT ATCACGGACC CATTGTGGTA ACTCACGGAA GTTGCGAATA TCTGGGCGAA 540 
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TAGCTCCACG GAAATGTCCT 


AGGTCGTACT 


CATAATCGTT ACCTCTGTCA AGGACAACGG 


600 


TATCTTTATC AAGAAGCGCT 


TCTTTGAACT 


CTTTTGGACA CAAGTAAGCA CCTGTTGTTT 


660 


CAAGTCGGTT GATGTCATTG TCAAAGTCGT 


TGTCTTCCAA ACCAAGGTGG ACAATTTCTT 


720 


TCTTGTACCG AACAAACATC 


TTCTTGAAGG 


CTTGTTCATT TTCTTCGTCA ATCTTGAACC 


780 


AGAGTTCTTC CATTCCTGGA 


AGGCTGTCAA 


CGTAgTCCAT GTATTTTTGA GTTGTTTCAT 


840 


AGTCACCTGA AACTGTTCCG 


TTAATTCCCT 


CCTCAGCGAC TAGGATACCG CCTTTAAGGn. 


900. 


CGATTGATTT ACAGAAAGCC 


AAGTGGTCTG 


CAGCAAATTG CTCTGCATTT TCAATTGGAG 


960 


TATAAAGGTA GTAAAGTAAG 


ACACGAATAT 


CTTTTGJcCaw AAGATTTGTA TCTCTTTATC 


1020 


TAT 






1023 



(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 



ACTATGAACA AGACCCAGAA AAAGTACCCT 


TATTTCTTAA 


GAATTTTAAT 


1 AGTTTAAAGC 


60 


ACCTAGCACC TGTTTAGATT GACGAAACAG 


GATTCGATAC 


TTATTTTTAT 


CGAGAATATG 


120 


GTCGCTCATT AAAAGGTCAA TTAATAAGAG 


GCAAAGTATC 


TGGAAGAAGA 


TATCAGAGGA 


180 


TTTCTTTGGT TGCAGGTCTA ACAAATGGTG 


AATTAATCCC 


TCCAATGACT 


TACGAAGAGA 


240 


CGATGACGAC CGACTTTTTT GAAGCTTGGT 


TTCAGAATTT 


TCTCTTACCA 


ACATTAAACA 


300 


CACCATCGGT TATTATTATG GATAATGTAA 


GATTCCATAG 


AATGGGGAAG 


CTAGAACTTT 


360 


TATGCGAAGA GTTTGGGCAT AAACTTTTAC 


CTCTTCCTCC 


CTACTCGCCT 


GAGTACAATC 


420 


CTATTGAGAA AACATGGGCT CATATCAAAA 


AGCACCTCAA 


AAAGGTATTA 


CCAAGTTGCA 


480 


ATACCTTTTA CGAGGCTTTT TTATCCTGCT 


CTTGTTTCAA 


TTGACTATAT 


TAGAGGCGAC 


540 


ACATTTTTCG GTTCTTTGTC AACTGTAGTG 


GGTTGAAGAA 


AGCGAAGATC 


TAGAAAGGAC 


600 


AAATTTCGTC CTTTC'HTTT TGAAGTTTTC 


AAAGTTCCTA 


AAACCAAAGG 


CATTGTGCTT 


660 


GATAAGTTTG ATGAGATTAT TGGTGGCTTC 


CAGTTTGGCG 


TTGGAATAAC 


GTAATTGAAG 


720 


GGCGTTGACG ATTTTCTCTT TATCTTTGAG 


GAAGGTTTTA 


AACAAAGTCT 


GAAACAGAGG 


780 


TGGAAAAGCA AGAGCTGATA GAGATTATAG 


TGGTGTTTAA AGTCTTCGGA 


ATAGCTCAAA 


840 
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ACTTTATCTA GAATTTCTTT ATTAGTCAAG TGCATACGAA AAGTAGGGCG ATAAAATCGT 900 

TTATCACTCA GTTTCTGACT ATCTTGTTGA ATGAGCTTCC AGTAGCGCTT GATAGCCTTG 960 

TATTCATCGG ATTTCGGATG ATGGCTTGTG TTCTGCTCTC AAGAACAGTT ATGATATTGA ' 1020 

GTTTATCAAA GTCCTGAGCA ATAAAGCTCA TCTCCATCTC CCGATTGAAA CAGTCACTCC 1080 

CCGGACTGTT TCAACsTCCT AGGACATAAT CTCAGGAAGA eQeGAAAAAT CATGCTCAAA 1140 

GTGAAAATCA TTGTTCTTGC GAATGACAGT TGAAGTTGAA ATAGACAACT GATGATCAAT 1200 

GTCCGTCATA GAAGTCTTTT TAATTAGCTT CTGAGCAATC TTTTGGTTGA TGATACAAGG 1260 

AATTTGATGA TTCTTCTTGA CGATAGAAGT CTCAGCGAGC TCCATTTTTG AGCAATGATA 1320 

GCACTTAAAA CGGCCTTTTC TAAGAAGAAT TCTAGTTTGA ATTTTTTTAT ACTAGAAAAT 1380 

CAGAACCATA ATACCTATAT AAAAATATTA TAGTTCTAAT AGGATTTACC CAAAAGTTTT 1440 

AAGGCGGTCT TTTTAGAACT TTAATTGTTT GAAATTTAGG TAGCAAATTT GTTTCTATTT 1500 

TGTCAACTTT TCCTATTTTT ATCTTGTTGA GGCTGGTATT TTAACAATTC AGGAATTGAT 1560 

ACTCAATCTG TAAAATTTTT TGTTAGAATA AGTTTATAAA AAAGAAAAGG AGTATTTGAT 1620 

TATGTTACAA AAAATTTATG AGCAGATGGC TAATTTCTAT GATAGTATTG AAGAAGAGTA 1680 

TGGTCCTACA TTTGGTGATA ATTTTGACTG GGAACATGTT CATTTTAAAT TTTTAATTTA 1740 

TTATTTAGTC AGATATGGCA TTGGTT G TCG TAAGGATTTT ATTGTTTACC ATTATCCTCT 1800 

TGCTTATCGT TTGTATCTTG AAAAATTGGT AATGAATCGG GGTTTTATTT CTTCTTGAGG 1860 

TAATTTTAGT AAATTTCCGA ACTAATTTAC TCTTTT AT GG AAAGATGATA CTAAATACCT 1920 

AGTAATTTTT CTAAATCATT TTTTAATAGT TCGAAATACC AAATCTTTCT ATTGTTTCTT 1980 

CTTGATAAAA AGGCGATTTT TTATTATAAT AAATTGTAAG ATATAATTGC AGGTGAGAGT 2040 

CCTCCCATGT ATGTGAGAAA GGAAGAGCCT CATGGCTCAG ACAAGATTAT GACTTCAGTT 2100 

GTTGTTGTAG GTACCCAATG GGGTGATGAA GGTAAAGGGA AGATTACAGA CTTCCTTTCA 2160 

GCGAATGCAG AAGTGATTGC ACGTTACCAA GGTGGTGATA ATGCTGGTCA CACGATTGTG 2220 

ATTGACGGTA AGAAATTTAA GTTGCACTTG ATTCCATCTG GGATTTTCTT CCCTGAAAAA 2280 

ATATCTGTCA TTGGGAATGG TATGCTTGTA AATCCTAAAT CTCTTGTAAA AGAGTTGAGC 2340 

TATCTTCATG AGGAAGGTGT AACAACTGAT AACTTGCGTA TTTCTGATCG TGCGCATGTT 2400 

ATTTTGCCTT ATCATATCGA GTTGGATCGC TTGCAAGAAG AAGCTAAGGG CGACAATAAG 2460 

ATTGGTACGA CAATTAAGGG AATTGGTCCA GCTTATATGG ACAAGGCTGC TCGTGTTGGA 2520 

ATTCGTATTG CAGATCTTTT AGATAAAGAT ATTTTCCGTG AGCGTTTAGA ACGTAACCTT 2580 

GCTGAAAAGA ATCGTCTTTT TGAAAAATTG TATGACAGTA AAGCGATTGT TTTCGATGAT 2640 
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ATTTTTGAAG AATATTACCA ATATGGTCAA CAAATCAACA AATACGTGAT AGATACATCT 2700 

GTTATCTTGA ATGATGCGCT TGATAATGGC AAACGTGTGC TTTTTGAAGG TGCACAAGGT 2760 

GTTATGCTAG ATATCGACCA AGGTACTTAT CCATTTGTTA CCTCATCAAA CCCTGTAGCT 2820 

GCTGGTGTGA CAATTGGTTC TGGTGTCGGT CCAAGCAAGA TTGACAAGGT TGTAGGTGTA 2880 

TCTAAAGCTT ATACGAGTCG TGTAGGAGAT GGTCCTTTCC CAACTGAGTT GTTTGATGAA 2 940 

GTGGGAGAAC GTATCCGTCA AGTGGGTCAT GAATATGGTA CAACAACTGG TCGTCCACGT 3000 

CGTGTAGGTT GGTTTGACTC AGTTGTGATG CGTCATAGCC GTCGTGTTTC TGGTATTACT 3060 

AACCTTTCTT TGAACTCTAT TGATGTTTTG AGCGGTTTGG ATACTGTGAA AATCTGTGTG 3120 

GCCTATGATC TTGACGGTCA ACGTATTGAC TACTATCCAG CTAGTCTTGA ACAATTGAAA 3180 

CGTTGCAAGC CTATCTATGA AGAGTTGCCA GGTTGGTCAG AAGATATTAC CGGAGTTCGC 3240 

AATTTGGAAG ATCTTCCTGA GAATGCGCGT AACTATGTTC GTCGTGTGAG TGAATTGGTT 3 300 

GGCGTTCGTA TTTCTACTTT CTCAGTAGGT CCTGGTCGTG AACAAACAAA TATTTTAGAA 3 3 60 

AGTGTTTGGT CCTAAGAGAT TTTTAAGATT TGTTTAAGAT AGGTCGGGTA TACTATAGAC 3420 

GGTTACAAGA AGACCTCCTA ACTTGTTGTA ACAAATATCC TAAACTTTTC TTTTTCATAA 3480 

TAATCTCCCT ATAGAGTCAC CGCATTCGGT GGCTTTTTTT GTGTTGGGAT TCATGATATA 3 540 

ATAATAAAAT CGATAAGTAC GAAAAGAGAA AAGAGATGTA TTATACCCTT GAAGAAAAAG 3 600 

AAGTCTTTAT GAGGGAGGCT TTGAGAGAGG CTGAGATTGC TCTTGAACAC GATGAAATTC 3 660 

CAATTGGTTG TGTGATTGTC AAAGATGCGG AAATCATTGG TCGTGGGCAT AATCCGCGTG 3720 

AGGAATTACA GCGAGCGGTT ATGCATGCGG AAATTATGGC TATAGAGGAT GCGAACTTGA 3780 

GTGACGAGAG TGCGCTTGCT GGATTGCACA CTTTTTGTGA CCATTGAACC G 3 831 

(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 1441 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

CCGCTGTTCC AACCGCAACA TACCATAGTC CGTACGGGAT TCGAACCCGT GTTACCGCCG 60 

TGAAAAGGCG GATGACTTAA CCCCTTGACC AACGGACCTG A G TTGTTATT TTCAACTCTT 120 

ACT ATT AT AC AGTCTTTTCA AACTTTGTCA ACTACTTTTT CTAATTTTTG TTTATTTTTT 180 
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CAACTTATAG 


TAAAAAAAGC 


CAGAATTATA CTGACTCTTC 


TATCCCTCAT 


TAAACTTAGA 


240 


ACCACCTTCT TTTCCCCACC AATAAGGGAT TAGTTCTGCG 


ACTTTAACTG 


TTTTTCTTAT 


300 


ATTATAGTCC 


ATCATGAATT 


CTGCATCTTT ATTTTCAGCA 


TTAACCTCTA 


AAAGGAATTC 


360 


TCTACAAGCA 


CCGCAAGGCA 


TGGCTGAACT TCCACCATAA GGTGGTTTGT 


CTCGAAAGGC 


420 


TAATACTTTC 


TTAACCTTAG 


TTTGTCCTCA AAATTGGTAC 


ATATTGAAGA 


GGGCCGCCCG 


480 


TTCTGCGCAG 


AGATGGAAAA 


CACCACAGGT TCCCTCCATA 


CAGAATCCTG 


TAAATATTTG 


540 


TCCATCTCCT 


GCTTCTACTC 


CAGCTACAAC ATGATTGGCA 


TAAACAAAGT 


CTGATACTTC 


600 


ATGTGGATTG 


TATAGTTTCT 


GTGCTTCTTC GTACATCTTT 


TCCCAGATGT 


CCATTATTGT 


660 


ATCCTCTTTA 


TTTAGAGATT 


TCTTTTAGCA TCTTTTCGAT 


ATGCTGAATT 


GATTTTTCAC 


720 


GTCCAAGCAA 


GAAAATTGTA 


TCTGGTAATT CTGGCCCATG 


CATTTCGCCT 


GAAACTCCGA 


780 


TACGAATAGG 


CATGAAAAGA 


TTTTTCCCTT TAATACCTGT 


TTCTTTTTGG 


ACTGCTTTAA 


840 


TTTCTGGGAA 


GATATTTTCT 


GTCACAAATT CATCATCTGT 


CATCGCTTCA 


AGTTTTGCTT 


900 


TGAATGCTTC 


AAGAACTGTT 


GGAACTGTTT CACCCGTCAT 


GACTTCGCGC 


TCTGCTTCTG 


960 


TCAATTCTGG 


GAAATCTGAG 


AAGAAAAGAT CTGTCAATGG 


GATAATCTCA 


TCTACTGATT 


1020 


TCATTTGTGG 


TTTATAGAGC 


TCAACTAATT TTTCAGCCTT 


GTCACTCAAA 


CGGCCTGCTT 


1080 


CCTCTAAGAA 


TGGTTTTGCC 


ATTTCAAAGA TGGTTTCAAG 


GTCTGCATTC 


TTGATATAAT 


1140 


CATTGCTCAT 


CCAGTCTAGT 


TTTTTCTGAT CAAAGCCTGC 


TCGTGACTTG 


CTGAGGCGGT 


1200 


TTTCATCAAA 


AAGTTTAATG 


AATTCTTCAC GAGAGAAAAT 


CTCATCCCCA 


CCACCTGGGT 


1260 


TCCAACCAAG 


AAGAGCAATA 


AAGTTAAAGA CTGCTTCTGG 


AAGGTAACCT 


TTCTTTCCGT 


1320 


AATCTTCGAT 


AAATTGAAGT 


GTATTAGTAT CACGTTTAGA 


TAACTTCTTA 


CCAGTTTCAG 


1380 


AGTTGATAAT 


CAAGTGTCAT 


GTGACCGAAC TCTGGAGCTT 


CCTCAACCTA 


AGAGCGGGTA 


1440 


T 










1441 



(2) INFORMATION FOR SEQ ID NO: 293: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4398 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND E ONE SS : double 
(D> TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 293: 
CGGCTTATGT ACTGGCAATC TTTCTACGTA AGCGAAACCA GGCCAGATTA GAGGCGCTAG 60 
AAGAAAAAAA AGAAGAACTA TACAATCTTC CAGTAAATGA TGAAGTAGAA GCTGTAAAAA 120 
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420 
480 



660 
720 
780 
840 
900 
960 
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ATATGCACTT CATTGGACAA AGTCAAGTGG CTTTCCGTGA ATGGAATCAA AAATGGGTCG 180 
ATTTATCTCT CAACTCTTTT GCCGATATTG AAAATAATCT CTTTGAAGCA GAAGGCTATA 
ACCATTCATT TCGTTTTCTC AAGGCCAGTC ATCAAATTGA CCAAATTGAG AGTCAAATTA 
CTTTGATTGA AGAAGATATT GCGGCAATTC GCAATGCTTT GGCAGACTTA GAGAAGCAAG 
AATCTAAAAA TAGTCGTCGT GTTCTTCATG CTTTGGATTT ATTTGAGGAA CTTCAGCATA 
CACTTGCTGA AAATTCAGAA CAGTATGGTC AAGCCTTGGA TGAAATTGAA AAACAATTAG 
AAAATATCCA ATCTGAATTT TCACAATTTG TAACCTTGAA TTCATCGGGT GACCCTGTGG 540 
AAGCCGCAGT GATTTTGGAT AATACAGAAA ATCACATTTT GGCCTTAAGT CATATTGTCC 600 
ATCGTCTTCC AGCCTTGGTT ACGACGCTTT CTACAGAATT CCCAGATCAA TTACAGGATT 
TGGAAGCCGG TTATCGTAAA CTAATTGATG CTAATTATCA TTTTGTTGAA ACGGATATTG 
AAGCGCGTTT CCACTTCCTT TATGAAGCAT TCAAGAAAAA CCAAGAGAAT ATTCGTCAGT 
TGGAATTGGA TAATGCCGAA TATGAGAATG GACAGGCACA AGAGGAAATC AATGCCTTCT 
ATGATATTTT TACTCGAGAA ATTGCTGCTC AGAAAGTAGT GGAAAATCTA CTTGCAACTC 
TTCCAACTTA TCTTCAACAT ATGAAAGAGA ATAATACTTT ATTGGGAGAA GATATTGCAC 
GTTTGAACAA GACCTATTTA CTTCCTGAGA CAGCTGCAAG CCATGTTCGT CGTATTCAGA 1020 
CAGAATTAGA GAGTTTTGAG GCAGCTATTG TTGAGGTAAC TTCAAATCAA GAAGAACCAA 1080 
CCCAAGCTTA TTCAGTTCTT GAAGAAAATC TTGAGCATTT ACAAACTCAA CTAAAACATA U40 
TTGAAGATGA GCAAATTTCA GTTAGTGAGC GCCTGACACA AATTGAGAAA GATGATATTA 1200 
ATGCACGTCA AAAGGCCAAT GTTTATGTCA ATCGTCTCCA TACTATCAAG CGATACATGG 1260 
AAAAACGCAA TCTGCCAGGT ATTCCACAAA CTTTCTTGAA GTTATTCTTT ACGGCAAGCA 1320 
ATAATACCGA GGATTTAATG GTTGAGTTAG AACAAAAAAT GATTAACATT GAATCTGTTA 1380 
CCCGAGTTCT TGAAATTGCA ACGAATGATA TGGAAGCTTT AGAAACGGAA ACTTATAATA 1440 
TTGTACAATA TGCAACTTTG ACAGAGCAAC TCTTGCAATA TTCTAACCGC TATCGCTCAT 1500 
TTGATGAACG CATTCAAGAA GC APTTAACG AaGCTTTAGA TATTTTTGAA AAAGAATTTG 1560 
ATTATCACGC TTCATTTGAC AAGATTTCTC AACCATTGGA AGTGGCAGAG CCTGGTGTAA 1620 
CCAATCGCTT TGTTACCTCA TATGAGAAAA CACGTGAAAC GATTCGTTTT TAATAAAAGA 1680 
AAAAGATTTT ATTGTGTGAG GAGCAGAATC AAATCTTTTT CTATAGTTGT GGGGAGATTT 1740 
ACTTCATTTT CTCCTGAGAT TGAGTTTTTG CCCAGCCGAT TTATCCACTA CCTCAAAACA 1800 
GTGTTTTATA CTCTTCGAAA ATCTTTTCAA ATCACGTCAG CGTCGCCTTA CCGTACTCAA i860 
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CTACAGCCTG AGGCTAGCTT CTTACTTTGC TTTTTGATTT TCATTTACTA TTAAACTGAT 1920 

TTCGCCAGTC TTATCTGCAG CTTCAAATCT CTACTTTGAG TAACTTCGTA ACCCTCCAAT 1980 

AACGAAGTCT ATTGAAAAAT CTCCAGACTA GACAACTCAC GGATAGTTCC TAATCTGGAG 2040 

ATTTCTTATT TGCACTTTTC TTGTACAACT TTAGTCCACG GTAAATAGAC CTCTAAAACC 2100 

TCTTTGTTTA CGAGAGTTTC CTCGTTTGGA AGACATTCTA GAAGATAGGA TAGATATTTC 2160 

TCGCTATTTA TACTAGACTA AAATCAAAAA GCATTATATA ATAGTGATAT GAAATCAACT 2220 

AAAGAAGAAA TCCAAACCAT CAAAACACTT TTAAAAGACT CTCGTACAGC TAAATATCAT 2280 

AAACGCCTTC AAATCGTTCT ATAGTAAAAT GAAATAAGAA CAGTACAAAT CGATCAGGAC 2340 

AGTCAAATTG ATTTCTAACA ATGTTTTAGA AGTAGACGTG TACTATTCTA CTTTCAATC? 2400 

ATTATATTTC GTCTGATGGG CAAATCTTAT AAAGAGATTA TAGAACTTTT ATAGTAGATT 24 60 

GAAATAAGAT GTGAACAACT CTATCAGGAA AGTCAAATTA ATTTATAGAA ATATTTTAGC 2520 

AGCCAAGGTG TACTGTTATA GATTCAATAC ACTATAGACT GTAATCAAAC AACGATTTGG 2 5 80 

CGAAATGTAA AAAAATATGA GGAGTTCGGA CTCGACTCTC TCCTTCAAGA AACACGTGGT 2640 

GGTCGTAACC ATGCATATAT GACAGTTGAG GAAAAGAAAG TCTTTCTTGC CCGCCATTTG 2700 

AAGGCTGCAG AGGCAGGAGA ATTTGTTACA ATTGATGCCT TATTTCAGGC TTATAAAAAG 2760 

GAGTTAGGTC GTTCCTACAC ACGTGATGCC TTCTATCAAC TGTTGAAGTG CCATGGTTGG 2820 

CGAAATATTA TGCCACGTCC AGAACATCCT AAGAAAGCAC ACGCTCAAAC CATTGTCGCG 2880 

TCTAAAAATA AAATCTCAAT TCAAGAAGAA AAGAAAGCGC TTTAAAACCA GTAGACGTTT 2940 

TCGTAAGCTT CCCTTGATCT ACCAAGATGA GGCTGGTTTC GCTAGAATCA GTAAACTGGG 3000 

ATCTTGTTGG GCTCCAATAG GAGTAGGTCC ACATATCCAT AGTCACTATA TACGAGAATT 3060 

TCGCTATTGT TATGGAGCTG TTGATGCCCA TACAGGCGAA TCATTTTTCT TAATAGCTGG 3120 

TAGATCTAAT ACTGAGTGGA TGAACGCCTT TTTAGAAGAG CTTTCACAAG CTTATCCAGA 3180 

TGATTATCTT TTACTCGTTA TGGACAATGC TATATGGCAT AAATCAAGTA CCTTAAAGAT 3240 

TCCGACTAAT ATTGGTTTTA CCTTTATTCC TC CAT AC AC A CCAGAGATGA ACCCCATTGA 3300 

ACAAGTGTGG AAAGAGATTC GTAAACGTGG ATTTAAGAAT AAAGCCTTTC AAACTTTGGA 3360 

AGATGTCATG AATCAACTCC AAGATGTTAT ACAAGGATTG GAGAAGGAGG TGATAAAGTC 3420 

CATCGTTAAT CGGAGATGGA CTAGAATGCT TTTTGAAAAC AGATGACTAT AAAAAGAAAG 3480 

TCCTCATTTC AATAGAAATC ACGACTTTCT GATGGATTTA TAGTAAAATG AAATAAGAAC 3540 

AGGACAAATC GATCAGCACA GTCAAATCGA TTTCTAACAA TGTTTTAGAA GCAGAGGTGT 3600 

ACTATTCTAG TTTCAATCTA CTATATTTTT GGAGTGATAG AAAAGCCCTT CATAAGCTAG 3 660 
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TCTACTTGTT CAGCTCCGAG AGCTTTGACA TCTTTTTCTG TACTTAGCCA AGTCAGTTTT 3720 

CCGTTCTCAA AGCGTTTATA TAGTAGCCAA AATCCTTGAC CATCCCAGTA AAGGGCTTTA 3780 

AAGCGGTCTT TACGTCCACC ACAAAAGAGA AAGACTTGAC CGGAGAAAGA ATCCAATTCA 3840 

AAGTGGCTTT TAACTACATA GGCTAATGAG TCTATTCCCT GCCTCATATC TGTCTTGCCA 3900 

CAAACAAGGT GAACTTGACC TAAATCACTT AGTTGAATTA TCATAGTACA ATACCTTTCC 3960 

TCCGATAATT ATTTTTTATC TAGTATACTG GAAGTTGGGG -AATTAGGATA GATACCTTGT - 4020 

TATGACGCGC TTACGTAACT TGTAACTAGC TGCCTAGTTT GATCTTTGCT TCTTCATTGA 4080 

TTAGCAGTAG ATTTCAAAAT GATAAAAACG CATAGTATCA GGTATTGAAA TGTACTGCCC 4140 

CAAAAGTTAG ACAGAAAAAA TCTAACTTTT GGGGTGTTTT TGTTATGAAA TTAAGTTATG 4 200 

ATGATAAAGT TCAGATCTAT GAACTTAGAA AACAAGGATA TAGCTTAGAG AAGCTTTCAA 4260 

ATAAATTTGG GATAAATAAT TCTAATCTTA GGTATATCAT TAAATTGATT GATCGTTACG 4 320 

GAATAGAGTT CGTCAAAAAA GGAAAAAATC GTTACTATTT TCCTGATTTA AAACAAGAAA 4 380 

TGATTAATAA AGTCTTAC 4393 

(2) INFORMATION FOR SEQ 10 NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS : double 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 
AGATTTTTAG ACTTTGTCTT TAATCGTTTC TTTTTAGGGA TGATTGCGAC ACCTTCTTTT 60 

GGCTATTAAC TTTAGCAGGA GGGATTATCC TTGGTCTAGC GCCGGCTAGT GCCACCTTGA 120 

TGAGCTTATA TGCAGAACAT GGTTATAGCT TTCGGGAATA CAGTTTGAAG GAGGCTTGGT 180 

CTCTTTACAA GCAAAATTTT GTCTCAAGCA ACCTGATTTT CTATAGCTTT TTAGGTGTGG 24 0 

GTCTAGTTTT GACCTATGGT TTGTATCTCT TGGTGCAATT GCCTCATCAG ACCATTGTTC 3 00 

ATTTGATTGC GACCCTTTTG AATGTCCTAG TAGTTGCCCT GATCTTTTTG GCTTATACAC 360 

TATCTTTAAA ATTACAAGTT TATTTTGCCT TGTCCTATCG AAATAGTCTC AAATTATCCT 420 

TGATTGGCAT CTTTATGAGT CTAGCAGCTG TGGCTAAGGT TCTCCTTGCG ACTGTGCTAC 480 

TTGTAGCAAT TGGTTATTAT ATGCCTGCCC TGCTATTTTT TGTAGGAATT GGGATGTGGC 54 0 

ATTTCTTTAT CAGTGATATG TTGGAACCTG TCTATGAAAT CATCCATGAA AAATTGGCGT 600 
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CAAAATACAA TCAACCAGTT TTGGCTACAT ACGCTTCTAA GAACCTATAG TTCACTGATG 660 

ATCATTATCA TTGCGAGTTT TGCAATCTTA CTCTCTTACG CTGTCTGGGA TTCACGTG 718 

(2> INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 718 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEONESS: double 
(0) TOPOLOGY : linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 

TCGGTACCAA AATTCTGGAT TTATACTAGC AAAGATCCAA GAGCAAATTA TTTAACAGAT 60 

TTAGGTCTAG TTTTCCCTGA ATCATTAAAA GAATTTGACA GTGAAGATAG TTTTGCAAAG 120 

GAAATTTCTG CAGAAGAAGC AAATAAGATA AATGATGCTG ATGTAATCAT AACTTATGGT 180 

GATGATAAAA CTCTTGAAGC TTTACAAAAA GATCCTCTTT TAGGTAAAAT AAATGCAATT 240 

AAAAATGGTG CCGTTGCTGT AATTCCAGAT AATACACCGT TAGCAGCCTC ATGCACTCCA 300 

ACACCACTTT CAATAAACTA TACTATTGAA GAATACCTAA ATCTTTTAGG AAATGCATGC 3 60 

AAAAATGCGA AATAAAAAAC AAATAAACCT AGGCATAATT TTTATAATCT GCCTAGGTCT 420 

TCTTATTACA ATATTTTTGT CATTAAAGCT TGGAACAAAA GAAATTAATA TCAGAGATTT 480 

TTTAGCAGCT TTTGGAATGG GTAATACAAA TGATGATTTT ATTAAATCAA TTATATATAA 540 

rAGAATACCT AGAACTATTT TTGCAATTTT AGCAGGTTCT AGTCTTGCCA TAAGCGGTGT 600 

ATTGATGCAA TCAGTTACTA GAAACCCAAT AGCTGATCCA GGTATACTCG GTATAAACAC 660 

AGGAGCAAGT CTTAGTGTAG TAATTGGTCC TTCtTTTTAC GGAATTCATC AAGCATAA 718 

(2) INFORMATION FOR SEQ ID NO: 296: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 
(B> TYPE: nucleic acid 
<C> STRANDEONESS: double 
(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 

GAACTAATCA TTTTTACAGG ATGAGATTTA CAGCAGAGAG TTTGAAGGCT TTATCAAAGG 60 

TTTTTCTTGG CATAATGACT TTTC C TCGTT TCCACTTAAT TTTGTGTCTA CTTTATTATA 120 

CCAAGTCCAC sCTTAAGTTA GATAATAAAT CTAACTTAAG GAAGCTAGAA GGATGAGAAT 180 

CCAGGTGGTC AAGAGTCCCA AACTTAAGCT GATGGGGACA CCCAGAATAA TTT GCTTrTT 240 
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GAAGGCAAGG CCACGTTCCT CTATATTGGG AAGTGAGAGT TGAATGAGAG AACCAGCTGA 3 00 

TGAAAAGGGT GAGATATTAG TAGATAGAGC GCCAATAACG GTGGCTGTTG TGAGTAACTG 3 60 

AATATCAATC TGAGGATTTT GAGCACTGAT GATAGCAATG ATGCGAAAGA GGGCTGGAGC 420 

TACAACCGAT AGGGTGGAAC TAAAGAGTGA CATCACTCCG GCTATCACAC AAAAGAACAG 480 

. AGGTAACCAG AAATGAGGAA TGGTTGTTGT CATGAGGTGC CCTATCAGTG TGACTAAACC 540 

TGACTTGACC GCTAGAGA CA TTAG TAAGCT CATGCCGCAG AGCATGATAA_TTGTAGCCCA .600- 

GGGAACCTTA GCTAAAATGG CTTCTTGCTT CCCTAATTTG AGCCTTAAGG CGAGGCAGAC 660 

CATGAGTATT GAGACAAACC CAATATCAAA TGTTTTTTGA TAAGTAGCTA TCCAGGCGAT 720 

GTTTGGGAAA ATGACATGCA ACAAGGCAAA AAGCCAAACC AAAACCATGC TGCTGATCAT 780 

GAGCAAGGTG GTTTGTCTTT GAACCTTGCT GAGGAGTGGT GGTTGGTCAA TAGTCAAGGA 840 

TGAGTTTCTT CTTCCCTTAC TATAGTGACT GTAACAGGAT AATAAAAGCA AGACGATCAG 900 

TGGGTAGATA ATGCTGACGA TAAAGATATG ATTGCCAAGT GAAAAAGCTT GCTCTTCCCA 960 

TCCCATTTGC TTAAACAGGC CTTCAAAGAC AATGCCTGAC CTACTGGTTA TCAAATTAGC 1020 

CCCTCCTGAA GCTCCCCAAT TGACGGCTTG AGCTCCAATC AAAGGGTGTT TGTCCGCTTT 1080 ' 

TTCACAGAGG GTAATCGCTA GAGGACAGCA AACGGCCATA GTAGTCAAAA ATCCAGCACC 1140 

TAAAGCAGAC AAAAGCGTTG CCATCAGGTA TAAAATCATG TAGAGGGCGT TAGGGTGGGT 1200 

GCGTGTGCGG TAGAGAATGT GTTGAGCCAA AACATCAAGA GTACCGTTAG TTGTTGCAAC 1260 

GTTATAAAAG AGAGAGACOC TAAAAATCGT AAAAAAGACT GAGGTTGGCC AAAAATGAAG 1320 

AAGTTCTTTG GGGCTTAATC CCATGAGAGT GGTTGCGATG AGGTAAGAAA AAGCAATACC 1380 

CAGCAGGCCA ATATTGATTT TGGTGCGGTA ACCAATTCCA ATGGCTAGAG CAATGG 1436 
(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1696 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 

CCATTTGGGA AAGAACGTAA GAGTTTGCAG GGTGAGATTC CAGAAGAATT TTCAATGTCA 60 

GCCGTTGACA TGTCTATGAT TGACCACATT CCAGATATGA TTGAAAATGG TGTGGACAGT 120 

CTAAAAATCG AAGCACGTAT GAAGTCTATT CACTACGTAT CAACAGTAAC CAACTGCTAC 180 
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AAGGCGCCTG TGGATGCCTA TCTTGAAAGT CCTGAAAAGT 
TTGGTGGACG AGATGTGGAA GGTTGCCCAA CGTGAACTGC 
ACACCATCTG AAAATGAGCA CTTCTTTGGT GCTCCCCCTA 
GTCGCTGAAG TGGTTTCTTA TGATGATGCG GCACAAACAG 
GTCATTAACG AAGGGGACCA AGTTGAGTTT TATGGTCCAG 
TATATTGAAG ATTTGCATGA TGCCAAAGGC AATAAAATCG 
GAACTATTGA CTATTAAGGT GCCTCAACCC GTTCAATCAG 
AAAGAAGCAC TCATCAATCT TTATAAGGAA GATGGAACCA 
GAAAGGAAAA GGAAATGATA GAGGCACAGG GTTTCTTAGT 
TTCATTACCA TAGCAAGCTG GATATTATTG CTTTACAATG 
ATGCTTGTTA TCGGTGTCAT GATTCATTAG AACATCACCC 
CTTTGATACA GGATAAGCCT ATTTTATGTG GTGTTTGTCT 
AATATAAAGA AAGCTTAAGT TGCCCCTTTT GTTTTTCTCG 
ATCATAAGGA ACGCTATTTT AAATAGCAAA TCATCTAGTT 
ATTTCAAGAG AAAATGAAGT AAATCTTCCC ACAATAAAAC 
AATACCTGAT ACTATGCGTT TTTAAGATTT TAAAGACTTT 
GACTACTTGT TAAAACTGGG TTAATTTTCG ACTGTTTAAT 
AAGGTTAGAA TTGTCAAAAC AATCCGTCTA GAGTATGCGT 
TTCTCAGTCA TGCCGTTCGA AGTACGACCT TTACGATATG 
GAGTTTGCAG GGTGAGATTC CAGAAGAATT TTCAATGTCA 
TGACCATATC TCAGATATGA TTGAAAATGG TGTGGACAGT 
GGAGTCTATT CACTATGTAT CAACAGTAAC CAACTGCTAC 
TCTTGAAAGT CCTGAAAAGT TTGAAGCTAT CAAACAAGAC 
GGTTGCCCAA CGTGAACTGG CTACAGGATT TTACTATGGT 
GTTGTTTGGT GCTCGTCGTA AAATCCCTGA GTACAAGTTT 
TGATGATGCG GCGGTA 

(2 J INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



TTGAAGCTAT 
CTACAGGATT 
AAATTCCTGA 
CAACAATTCG 
GTTTCCGTCA 
ACCGCCCTCC 
GAGATATGGT 
GCGTCACAGT 
GGATAAGCAA 
CTATGATTGT 
TTTTGAGCCG 
AAAACTACTA 
CTTTAATCCA 
TTGAAGTAGG 
GCATAATATC 
TTTCCTTTAT 
AGTTATTATG 
GATGCCAACC 
CCATTTCGGA 
GCCGTTGATA 
CTAAAAATCG 
AAGGCGCCTG 
TTGGTGGACG 
ACACCATCTG 
GTCGCTGAAG 



CAAACAAGAC 
TTACTATGGT 
GTACAAGTTT 
TCAACGAAAT 
TTTTGAAACC 
AAATCCAATG 
TCGTCCATTA 
TCGAGCTTAA 
ACAAGATCCA 
AAAAAGTATT 
TATCCCTTAT 
ACATATAAGC 
GGTTGCCAAA 
AGAAAACTCA 
AAGATTGTTC 
CTGGTATTTT 
CAAACTCTAA 
GTGGTGGATG 
AAGAACGTAA 
TGTCTATGAT 
AAGGACGTAT 
TGGATGCCTA 
AGATGTGGAA 
AAAATGAGCA 
TGGTTTCTTA 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1696 
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<xi> SEQUENCE DESCRIPTION : SEQ ID NO: 298: 

CCGAGTTTAT TATGCTTTCT TCGCAATTTA TCTCAAAGAT TGAATTTGCT TCCAATAACA 60 

AAGAAAGTCT TTATACTCAA AGCAAATTTA AGTATGCGAT TCCTTCGATG TTCGCAGGTG 120 

CATTTTTAAC CTTCAGTACT GCTGCAGGTG CAGTTGGGGC TGACTTGATT AATAAAATTG 180 

CACCAGGTAG TGGACGCTTC_ CTCTTTCCAT . TCGTTTTTGC TTGGGGCTTG _ GCCTACATTG 240 

TTTTTTTGAA TGCCGAGTTG GTCACTTCAA ACATGATGTT CTTGACTGCT GGTAGTTTCT 300 

TAAAAAAAAT CTCTTGGAGA AAAACAGCTG AGATTTTACT ATACTGTACC TTGTTCAACC 3 60 

TTATCGGAGC CTTGATAGCA GGGTGGGGCT TTGCTCATTC GGCAGCCTAT GCCAATCTGA 420 

CACACGATAG TTTCATCTCA GGTGTTGTTG AGATCAACTT AGGCCGCTCA AATGAATTGG 480 

TCTTGCTTGA GGCGATTTTG GCAAATATTT TTGTAAATAT TGCGATTCTG TCATTTATTT 540 

TGGTCAAAGA TGGTGGTGCC AAACTTTGGC TTGTGTTGTC AGCTATTTAC ATGTTTGTAT 600 

TCTTAACAAA CGAGCACATT GCGGCGAACT TTGCTTCTTT CGCCATTGTG AAATTCAGTG 660 

TTGCTGCGGA TTCAATTGCC AACTTCGGTG TTGGAAATAT GCTTCGCCAC TGGGGTGTGA 720 

CTTTCATCGG AAACTTTATC GGAGGAGGCC TCTTGATGGG TCTTCCATAT GCCTTCCTCA 780 

ATAAAAACGA AGATACTTAT GTAGATTAAG AAAATGAGCA CGATTGAGTC GTGCTTTTTT 840 

CATTTTCAAA ATAAGGTAAT AGCTATTTCT TATATCAAAA TATAGAAAAC TGATATTTGT 900 

ArACTATAAC TCAAGGTGCT ACAATATCCT TAATAAAATA ATATGGAGGT GACCTTATGA 960 

CTTGTGATTT TAAATnTGAA ACTCTACAAC TACATGCTGG TCAACTTGTG GCTCCACCTA 1020 

CT 1022 

{2i INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 663 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

CCTTAAGTAA TCTCTGATAA TATTTTCTTT ATTAGCATAG GGGAATATCG ATATAATGGC 60 

TTCATTATGA GTGGCACGAA TATCCAATAT GGCAACTTTT CCAATAGATA ATTTAAAACT 120 

CATTAATAAA GTTCCTTTAG GTGAAATGTC TATTTTCTTT GATTTTAATG CTAATTTAGA 180 
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AATAGATTCT CTCGCATTAG TTACATAACC AGATATAGCC ATATCTGATA TAGATACCCA 2 40 

AGGTATTTCA GTTCCCCAAA AAGTAGCTTC ACTGCGTGGA GCAGTTTTTC CTATTCTGAA 300 

GTTAACTAGG CTAGCAAATT TAATATATCT CCATGCTTCT GGGATTTCAT ATATAGGATA 3 60 

AGAGGTTGTT TCGTCTTTGT TCCCATAATA AGAGTTATCA TCTCCTTGGG AAACAATAGA 420 

AATGTCCAAA TCTTTCTTTT TAATCTTGCC TTCTTCAAAG AGTTTTTGTT TTTCTGCTCG 4 80 

TATTTTTTCA AGTAAAACTT CCACTGATTC ATCATTTGGG TCTTGTTCAA CTAATTTTCC 540 

TTGCATAGCA TATTGAAGAA TAGATTTTTT TAGTTTATCT GGAAATTCTT TATCTAGCTG 600 

TTCTAGTCTA TTATAACTTT CACCATATTC ATCTACTTTT TCTAAAGCTG ATTCGATTGC 660 

TTC 663 
(2) INFORMATION FOR SEQ ID NO: 300: 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 



CGTCGCTGAA 


CATGTCAACA 


GCAAATTAAA 


CTAAACAAAC 


TAAAATTATG 


TGATACTTCA 


60 


CATAATTTTC 


TTTAGAAAAT 


ATTATCAGAA 


GAAAGTTGAG 


AAAAATGGCA 


GAAAAAACAT 


120 


ATCCTATGAC 


CCTTGAGGAA 


AAGGAAAAAC 


TTGAAAAAGA 


ATTAGAAGAA 


TTGAAATTGG 


180 


TTCGTCGACC 


AGAAGTGGTA 


GAACGCATTA 


AGATTGCCCG 


TTCATACGGT 


GACCTTTCAG 


240 


AAAACAGTGA 


GTACGAAGCA 


GCTAAGCATG 


AACAAGCCTT 


TGTCGAAGGA 


CAAATCTCTA 


300 


GCTTAGAAAC 


AAAAATCCGC 


TATGCTGAAA 


TCGTCAATAG 


CGACGCAGTT 


GCCCAGGACG 


360 


AAGTAGCGAT 


TGGTAAAACA 


GTCACCATCC 


AAGAAATTGG 


TGAGGACGAA 


GAAGAAGTTT 


420 


ATATTATCGT 


AGGTTCAGCT 


GGTGCAGATG 


CCTTTGTAGG 


TAAGGTTTCA 


AATGAAAGCC 


480 


CAATTGGGCA 


GGCCTTGATT 


GGCAAGAAAA 


CAGGTGATAC 


AGCAACCATT 


GAAACGCCTG 


540 


TTGGTAGCTA 


TGATCTAAAA 


ATCTTGAAGG 


TTGAAAAAAC 


AGCCTAAAAA 


CAGAAAAAGG 


600 


AGTGGGGAGG 


CGATGTGCTT 


CACTCACTCC 


TTTTTCCATT 


TTGCTACTCT 


TCGAAAATCT 


660 


CTTCAAACCA 


CGTCAGCGTC 


GCCTTGCCGT 


ATGTATGGTT 


ACTGACTTTG 


TCAGTTTCAT 


720 


CTACAACCTC 


AAAACAGTGT 


TTTGAGCTAA 


CTTCGTCAGT 


TTCATCTACA 


ACCTCAAAAC 


780 


TATCTTTTGA 


GCTGACTTCG 


TCAGTTTCAT 


CTACAACCTC 


AAAACCATGT 


TTTGAGCCGA 


840 


CTTCGTCAGT 


TTCATCTACA 


ACCTCAAAAC 


TATGTTTTGA 


G 




881 
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(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 949 base pairs 
f B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: _ . 

CCTTTTTTAA TACAAGTTAT TTTGATTTAA CCGGCTTGTC TTCAGCTCTC TGCAAAGCTG 60 ' 

TGGCAATCGT ATCTGCATAC AATTTTGCTC CTGCTTCGAT AGTGCTACTC TCACTCCCGA 120 

AATGAACCTG GTCTGTTCCA GCCCAAATTT CTGGATGCTC TTTCGCAACT TGATTCCAAT 180 

CTGCTATCGT AATGTAAGGT GTCTTCTCTG CCAATTCTCT CATATACGCA GCAGCCTTCT 240 

CAACGATCGC ATAGGTCTCT TTTGTCTTAT CTCCCTCATA AGGAGTCACC AAAATCATAT 300 

GGTGTCCCTT AGGAAGATTT TTCACGATAC TGTCCCAGTC ATCCTTGTAA TTCTCAGGAT 360 

TATTTACCCC AGTCGCAATG ACCACCCTCT TAGGTAAAAA TTTATTCTGG CTATTATTTA 420 

GCATGATTTC ATTTGCGGTC TTGGTTGTTA CGCTGACCTG CGCGTTAATC TGTGCTCCAG 480 

GAAGAGCTGT CTGTACTGCT GTATTTGCCC TTAAAGCCAC TGAGTCACCA ATTAACATAG 540 

TGCCATCAGC AATTCCCAAA CTGTTTGCAT CTGCCCGTTC TGCCATCACC TTGGTCTGGC 600 

CAATATTTGT TGCAGCTTGC TTCAACCCAT TGACAGTCAA GTCTGTCTCA AACGCTCCCA 660 

CTTGTGCTGC CAACAAGGTC ACCGTGCAGA CAATGATGGT CAAGATTCCT GTACCTGCTG 720 

CAAGAATTGC GTGAATATAA GGCAGGGGAC GAAsGGTTTG GACAATAGGT GTGTTCTTGC 7 80 

CTGCAATCCA AGGTTCCAAT ACATAAAATG ACAGACTGGC AAAGCCATAA GAACAAATCA 840 

GAGTCAGTAA TACAGCAAGA AGATTTGATG TCAACTGTGA GAAAATGATA TAGAAAGGCC 900 

AATGGAAAAG ATAAACCGCA TAGCTAGTAT CCGCTAAAAA GCTGATAAT 94 9 
(2) INFORMATION FOR SEQ ID NO: 302: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 302: 
AAGATATATT TTTTACACAG AAGTATGCAA AAGTAAAGAG TGCAAAAAAT GGAATTAAAG 



60 
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CGAAAATAAA AGCCGTGTAC AGGCGACCAA ACCAACGTAC ACGGCTAACG AAAAATAACA 

AAACTCAAGC AAAGGCAAGG CGCGTGGTTT TCTTAGCTAT TTAGCAAGGG GACAAACCCC 

TTTGTAAATA ATCTCCTCTT ATTTTATCAA AATTAGAGCA AAATGACAAC TTAATTTATA 

AAAAGGAAAA ATGGAGCATA TAAATGGAAA TTCTGTCTAA AGAAATACAG TTACAGGGCT 

TACAACTTCT TAAACAGACT CTTGAAACTT TAGTTGAGCT AOAAAAACAA CGATCTAGTA 

AGTTAGATTT AATTTCTCGT AAAGAATTAA TGGATCTGCT AGGTATAAGT GCTACAACCC 

TTGATAACTG GGAGGATCTT GGTCTTAAAC GATATCAGAC TCCGATGGAT GGAGCTAAGA 

AAGTATTCTA TCGTCCGTCA GATGTGTATT TATTTTTAGC AATAAAATAG GAGTTATGAA 

ATGAAAATTG TTACTTTCAA ACCAACTAAA CAAATAGACG ATGGGTTTTA ACTGCCAGGT 

ATTGACATTC TATTTGTCTC AG 

(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 1929 base pairs 
(31 TYPE: nucleic acid 

(C) STRANOEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 
CGCTAACTTG CAAACAAAAC AAGAACCCAA ACTCCACAAA TCCTTTACGC AGAAACTCAA 
TCTCATCTAC TTACCTTGCT GACTTGGTAG AGTATGTTGC AGACAAAGAC TTCTCAGTAA 
ACGTAATTTC TAAATCAGGT ACAACAACTG AACCAGCGAT TGCTTTCCGT GTCTTTAAAG 
AACTCTTGGT TAAGAAATAC GGTCAAGAAG AAGCTAACAA ACGTATCTAT GCAACAACTG 
ACCGCCAAAA GGGTGCTGTT AAGGTTGAAC CAGACGCTAA CGGTTGGGGA ACATTTGTTG 
TTCCAGATGA TATCGGTGGA CGCTTCTCAG TATTGACAGC CGTTGGTTTG CTTTCAATCG 
CAGCATCAGG AGCTGACATA AAAGCTCTTA TGGAAGGTGC GAATGCAGCT CGCAAACACT 
ACACTTCAGA CAAAATCTCT GAAAACGAAG CTTACCAATA CGCAGCTGTT CGTAACATCC 
TTTATCGTAA AGGCTATGCA ACTGAGATCT TGGTAAACTA TGAGCCATCA CTTCAATACT 
TCTCAGAATG GTGGAAACAA TTGGCTGCTG AATCAGAAGG AAAAGACCAA AAAGGTATCT 
ACCCAACTTC AGCCAACTTC TCAACTGACT TGCACTCACT TGGTCAATTT ATCCAAGAAG 
GAACTCGTAT CATGTTTGAA ACAGTTGTCC GTGTTGACAA ACCTCGTAAA AACGTGCTTA 
TTCCTACTTT GGAAGAAGAC CTTGACGGAC TTGGTTACCT TCAAGGAAAA GACGTTGACT 
TTGTAAACAA AAAAGCAACT GACGGTGTTC TTCTTGCCCA CACAGATGGT GATGTACCAA 
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ACATGTATGT CACTCTTCCA GAGCAAGACG CTTTCACTCT TGCTTACAC7 ATCTACTTCT 900 

TCGAATTGGC AATTGCCCTT TCAGGTTACT TGAATGCTAT CAACCCATTT GACCAACCAG 960 

GTGTTGAAGC TTATAAACGT AACATGTTTG CCCTTCTTGG AAAACCAGGA TTTGAAGAAT 1020 

TCAGCAAAGA ACTTAACGCA CGTCTATAAT AGAAGAAAAG AGTGGTTTGC CCACTCTTTT 1080 

TACTCTCTTT ATCCATAGAA ATTGGACTCA GCCAAGACTT GTGATATAAT ATAGAAAGCA 1140 

.AAAAGGCAGA_CGCCTAGATA-ATAGGAGAAA -CTATGTCAAA -AGATATCCGC-GTACGTTAGG 1200 

CACCAAGTCC AACAGGACTA CTACACATCG GAAATGCTCG TACAGCATTG TTTAATTACT 12 60 

TGTATGCGCG CCATCATGGT GGAACATTTC TCATCCGTAT CGAAGATACT GACCGTAAAC 1320 

CCCATGTCGA GGATGGTGAA CCTTCACAAC TTGAAAACCT TCGCTGGTTA GGCATGGATT 1380 

GGGATGAAAG TCCAGAATCA CATGAGAATT ATCGCCAGTC TGAGCGTTTG GACTTGTATC 1440 

AAAAATATAT TGACCAACTA TTAGCTGAAG GAAAAGCCTA TAAATCTTAC GTTACAGAAG 1500 

AAGAGTTGGC AGCTGAACGC GAACGCCAAG AAGTAGCTGG CGAAACACCA CGCTACATCA 1560 

ATGAATACCT TGGTATGAGT GAAGAAGAAA AAGCAGCTTA CATCGCAGAA CGTGAAGCAG 1620 

CAGGGATCAT CCCAACTGTT CGTTTGGCTG TCAATGAGTC AGCTATCTAC AAGTGGCATG 1680 

ATATGGTCAA AGGCGATATC GAATTTGAAG GTGGCAATAT CGGTGGTGAC TGGGTTATCC 1740 

AAAAGAAAGA CGGTTACCCA ACTTACAACT TTGCCGTTGT TATCGATGAC CACGATATGC 1800 

AAATCTCTCA TGTTATCCGT GGAGATGACC ATATTGCTAA TACACCAAAA CAGCTTATGG 1860 

TCTATGAAGC TCTTGGTTGG GAAGCTCCAG AGTTCGGTCA CATGACCTTG ATTATCCACT 1920 

CTGAAACTG 192 9 
(2) INFORMATION FOR SEQ ID NO: 304: 

(ii SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 708 base pairs 
(B> TYPE: nucleic acid 
{C) STRANDEDNESS : double 
(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

AAATTTAAGA AAAAGGAGAC ACATCATGTC TAAAAAAGTA TTATTTATCG TCGGATCACT 60 

ACGTCAAGGT TCTTTCAACC ACCAAATGGC GCTCGAACCT GAGAAAGCAC TTGCTGGTAA 120 

AGCGGAAGTT AGCTACCTTG ATTATTCAGC CCTTCCTCTC TTCAGCCAAG ATTTGGAAGT 180 

TCCAACACAT CCAGCTGTAG CTGCTGCTCG TGAAGCAGTT CTCGTTGCGG ATGCTATCTG 240 
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GATTTTCTCT CCAGTCTACA ACTTCTCTAT CCCTGGTACA CTGAAAAACT TGCTTGACTG 



300 



GCTATCTCGT GCCCTTGACT TGTCTGATAC ACGTGGCGTT TCTCCCCTTC AAGACAAGTT 



360 



TGTCACACTA TCATCTCTAG CCAATGCAGG GCACGATCAA CTTTTCCCTA TCTACAAAGA 



420 



CCTCTTGCCA TTTATCCGTA CACAAGGCGT TGGTGATTTC ACTGCTGCAC GTGTTAATGA 



480 



CTCTGCCTGG GCAsACGGAA AATTGGTTCT TGAAGAAACA GTCCTAAACT CACTTGAAAA 



540 



ACAAGCTCAA GACTTGGTCG AAGCTATCAA GTAACTAACA CTCAATAAAA ATCAAAAAGC 



600 



AAACTAXGAA GCTArCCGCA ACCTACTCaA gCACTGCTTT GAGGTTGTAG ATAGAACTGA 



660 



CGAGTGTnnA ACATATATAC GGTAAGGCGA CACTGACGTG GCTTGAAn 



708 



(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 781 base pairs 

(B) TYPE: nucleic acid 
<C> STRAND EDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

CTTCTTTTCT TGGAAATAGG TGTATAATAC GTTTATTAAA TTTTTGAGGA GTTGTCTATG 60 

AAGAAAAGTT TTATCCATCA ACAAGAAGAA ATTTCCTTTG TCAAAAACAC TTTTACCCAG 120 

TATTTGAAAG ATAAGCTAGA AGTTGTCGAA GTTCAAGGTC CTATCTTGAG TAAGGTCGGT 180 

GACGGAATGC AGGACAACCT GTCTGGTGTG GAAAATCCAG TATCGGTCAA GGTTCTCCAA 240 
ATCCCTGATG CTACTTATGA AGTGGTGCAC TCACTTGCTA AATGGAAACG CCACACCTTG " 300 

GCTCCTTTTC GCTTTGGTGA AGGAGAGGGT CTCTTTGTCC ACATGAAAGC CCTTCGTCCA 3 60 

GATGAGGATT CCTTGGATGC AACCCACTCT GTTTATGTTG ACCAGTGGGA CTGGGAGAAG 420 

GTTATCCCAA ATGGTAAGCG TAACATCGTT TATCTAAAAG AAACAGTTGA GAAGATTTAT 4 80 

AAGGCTATTC GCCTGACTGA GCTAGCTGTT GAAGCCCGCT ATGACATCGA GTCTATCTiG 540 

CCAAAACAAA TTACCTTTAT CCATACAGAA GAATTGGTAG AACGCTACCC AGACTTGACA 600 

CCGAAAGAAC GTGAAAATGC GATTTGTAAA GAATTTGGAG CCGTCTTTTT GATTGGTATC 660 

GGTGGCGAGT TGCCAGATGG TAAACCGCAC GATGGACGTG CACCAGACTA TGATGACTGG 720 

ACAAGCGAGT CTGAGAATGG CTACAAGGGT CTAAATGGTG ATATTCTTGT CTGGAATGAG 780 

T " 781 
(2) INFORMATION FOR SEQ ID NO : 306: 

(i) SEQUENCE CHARACTERISTICS : 
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(Ai LENGTH: 846 base pairs 
(BJ TYPE: nucleic acid 
tC) STRAND EDN ESS : double 
(D) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


306: 




CCCGCATCTT 


GTAGGGTTTT AACGGGCACG 


ATTTTCATAT 


CCCTCTTGAT TCTTTTAGCC 


60 


GCTTCTAGGG 


CTGTTTGGTA GTTGTTTTTC 


„QCGTCCGGAT 


. _GCGCCTTTTG_TTCTTCTTCG 

CCCTAGCCGA AGCTACAACC 


-120- 

180 


CTAACAGGGT 


TATCAGGAGC AAAGAAAATA 


CCAGCACCTC 


TTCTTATCAA 


TACCTCCAAT GTCTCCCACA 


TTACCATCGC 


GGTCAATGGT ACCTGTACCG 


240 


GCAACAATAC 


GACCATTACG AAGATCTGGG 


TGAGCTATTT 


GAGTATAGAT AGCTAGACTA 


300 


AACATGAGAC 


CAGCACTTGG ACCGCCAATA 


CCAGCTGTTG 


AAAAGCTAAT TGGGACATTG 


360 


CTGATTACCT 


CTGTACGGTC AATCAAGCCG 


ATTCCAATTC 


CATTTTTGCC ATTTTCCAAG 


420 


GTGATGATTT 


TTCCTTCTGC AGACTTGGTT 


TGCCCATCCT 


CTTCATAGGT GACCTTGACG 


480 


GAATCCCCTA 


ATTTTTGAGA ACTGACGTAA 


TCAATCAAGT 


CTTTGGAACT ATCAAAGGTC 


540 


TGATCATTGA 


CTGCTGTGAC TGTATCAGAG 


ATATTGAGAA 


TCCCTTTAAA GGTTGAATTA 


600 


TCCGTCACAT 


TCAAAACATA AACTCCAAAG 


TACTTGAGTT 


CGATATCCTT ACCAGCTGTT 


660 


TTTAGTCCTT 


GATACTTGGC CATATTTTGC 


GATGTTTGCA 


TGTAGAATTG ATTGATTCGC 


720 


ATAAATTCAA 


CATCGGAAGA ACCACCTGTA 


GTCTCCTGAG 


CACTACGAAT ATCTGTAAAA 


780 


GGTGTCAACC 


AAGCATAAAT CATATGAGCT 


AAAGTGGCAT 


GTTGAACACC AACCGTAACG 


840 


AATTGT 








846 



(2) INFORMATION FOR SEQ ID NO: 307: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 
GCGATCTGCT TGGGCTTTTC CTATTACCTT ATCTAATAAA TAGGTACGCA GACTCATAAC 
CATATAAAGT CCACCCCCCA TGGCACCGAC AAGAGCTACA TAAAAGAAGC TCCACAAACG 
TCCACTTGGT TGGAAGAAAA ATCCTAACAG CCACTGGATG GTTCCTATTA ACAGAAACAT 
GACTAGGGTC AGCAAACTGA TTAAAATGGT TCGCTTCAAA ATCACCTTGC GCTTGACACC 
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AGTTACTTTA CAAATATCCC GATACATCAA GACCTTAGGA ATGATGAGAG CAATGGTTGT 300 

TGAAATCAAA GGACCATAAC TGTGGAAGAG GGCGATGGTA GGTAGTTGCA AGACTAGCTT 360 

GGCAATAGAA CCATAGATAA AATAGAGAAC GGCCTTGCGG TTGCCGAACA TGGCCTGAAG 420 

CATTGGAGAC AAGACCATCT ACAAGCCTAA AATAATAGAC TGCAAAACTG CAAAGACAAA 480 

TAAGCCCAGA GCCAAACTAT CTGGCTTACC ATAGAAGACC GTATAAAGAG GTTCTCCTAC 540 

CATAACCACT CCAACCCTTG CTGGTAGCAA GAACATAAAG AGTAGGGTGA GACTGTCCTG 600 

AACGAGACGA GAAGCTGCTT TCAAGTCCCC CTTGACATAG TTTTCCGTCA AAAGTGGCAA 660 

ACCAACACTC CCAATCGAAA CCCCTACAGA AATCAAAATC ATCGTGATTT TATTAGGATT 720 

GGCTGAGAAA T AAGAAAAC A TGAGAACCAA CTCCTCATTG CTGTAGTTGC TAAACCACCT 7 80 

CATACTATTG ATAAAGGTCA GCTGAGTCCA AATCTGGAAG AGCTGGATG 829 
(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 
tC) ST RAND E ONES S : double 
(D) TOPOLOGY: linear 



(Xij SEQUENCE DESCRIPTION : SEQ ID NO: 308: 



CGAACATCTT 


GCTGGCTGAT 




CCATCGCAGC 


CCCGAACACA 


TTGCGACCCA 


60 


TGGCAAGCGG 


GCTCAATCCG 


CACATGGGAT 


CCGTGCCAAA 


GCCCCGCGTG 


TGCATCATTT 


120 


GCTCATCTAG 


TAACGTATGA 


GGTTTGCCTT 


CGCTGTCGAT 


AAACCGATAT 


TCAATCGCAC 


180 


CACTGCTCGT 


TCTCCGCGGA 


GGGGAAACCG 


ACTGCGGTAG 


GATGAACTCC 


AGAGAAGAGA 


240 


GATCACGACC 


TACCAGGTGC 


GGCTCGTTGA 


AGCTGTTGCC 


GCTTAGCAGC 


AGGCTCGCCA 


300 


CCACGCATTC 


CCAGAACTCA 


ACGGGGGTTT 


GATCGGCGTT 


CGGTTGCTGA 


CTAATAACTC 


360 


GGTGCACGGG 


ATGCGAAGTG 


GCCACTTCTG 


GCACACCGTT 


CTTGTCTTCG 


TAGAGAGCAA 


420 


TTGGGAGGGT 


GGCCAGCGTT 


TCGGCGATGA 


GGCGCACGCA 


GGCC 




464 



(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 
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CCCTCTATAA TGGTAATACA TTTTATTTGG ACGTTTTTAT GTCATTTCTA TCAAAAAATG 50 

GAGCAGGTAT CTTGGCCTGC CTTCTCATTT CCATCCTATC TTGGTACTTA GGAGGATTCT 120 

TCCCTCTCGT TGGCGCGCCC CTTTTTCCCA TTTTCATAGG CATGCTCCTA CATCCCTTTC L80 

TCTCCTCCTA TAAACAACTG GATGCTGGTT TGACCTTTAG TTCCAAGAAG TTGCTCCAAT 24 0 

ATGCCGTTCT CTTGCTTGGT TTTGGTCTCA ATATCTCGCA GGTCTTCGCA GTTGGCCAAT 300 

CTTCACTCCC TGTCATCCTG ^TCCACTATCT CAATAGCTCT GATTATTGCC TACCTCTTCC 3 60- 

AGCGTTTCTT TGCCCTGGAT ACAAAACTGG CTACCTTCGT TCCAGTACGT TCTTCTATCT 420 

GTGGGGGTTC TGCCATTGCA GCGACAGgCC CGTTATTGAT GCTAAGGAAA AGGAAGTACC 480 

CCAAGCCATT TCCGTTATCT TTTTCTTCAA TGTCTTGCCT GCGCTCATCT TTCCAACCCT 540 

CGGCACCTGG CTTCATCTAT CCAATGAAGG CTTCGCCCTC TTTGCAGGGA CTGCGGTCAA 600 

CGACACTTCC TCTGTAACGG CTGCCGCCAG CGCTTGGGAC AGTCTTTACC AAAGCAATAC 660 

CCTCGAGTCT GCAACCATTG TTAAACTCAC ACGTACTTTG GCCATTATCC CTATCACGC? 720 

CTTTCTATCC TACTGGCAAA GTCGCCAACA ACAAAACAAG CAAAGCCTGC AACTGAAAAA 780 

AGTCTTCCCA CTTTTTATCC TTTACTTTAT CCTTGCCTCT CTCCTCACTA CACTACTCAC 840 

CTCTCTAGGT GTGTCCAGTA GTTTCTTTAC TCCTCTCAAA GAACTCTCTA AATTCCTTAT 900 

TGTCATGGAC ATGAGTGCTA TCGGTCTCAA AACCAATCTG GTCGCTATGG TCAAATCCAC 960 

TGGAAAATCC ATTCATCATG GA 992 
(2) INFORMATION FOR SEQ ID NO : 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1939 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EONESS : double 

(D) TOPOLOGY : linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 310: 

CTACCTGCCA ATATGATTGG GGTGCAGAAG CGCGTGATTA TCTTTAATCT TGGCTTGGTT 60 

CCTGTGGTCA TGTTTAACCC AGTGCTTCTG TCCTTTGAAG GATCCTATGA GGCAGAAGAA 120 

GGCTGTTTGT CCTTGGTAGG TGTGAGATCA ACTAAGCGTT ATGAAACCAT AAGGCTTGCC 180 

TATCGTGACA GCAAGTGGCA GGAACAGACC ATTACCTTGA CAGGCTTCCC AGCTCAGATT 240 

TGCCAGCATG AGCTGGATCA CTTGGAAGGA CGAATCATTT AGGAGGAAAG CAAATGAAAC 300 

GAATAGTCTT TGAACTTATT TTTATCGCAA CGACCTGGTA TATCTTTTTA CCGCCCCTTA 360 
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ACCTGACCAG CTGGCAATTT CTCTTCTTCC TCTGTGGGCA TTTCTTAGTT GTGGCAATAT 420 

TATTTGGCTT TGGCAAGGGG ATAAACCTTG TCAAAACGCT TCATGTCCGC CACGGTAAGC 480 

CGGAAGCTGC CTTAAATCTT GAGGGTTTCA AAATCAATCG CTTAGGGAAA ATTCTGTTAG S40 

CTTCGATTGG AGGAATTCTT CTCTTGCCAG CTTTGGTTTc CTTGGTAACT TCCAGCATGT 600 
TTCAGGCTAA AAATTATGCC AATGTAGTCA CGGTTACGGA AAAAGACTTT ACTGAATTTC 
CTAAGAGTGA CACCACTAAG CTTCCTATCC TAGATAGAAC TACTGCTGAA AAAATTCGAG 
ACCGCTACTT GGGTTCCCTA ACCGATAAGG TGTCGCAATA CGTAGCGGCA GATACCTATA 
CCCAATTGAC AATTGATGCG AAACCTTATC GGGTCACACC ACTAGAATAT CCAGACCCTA 
TCAAATGGTT TAACAATCAA GCCAAGGGAA TCGGTGAGTA TATTAAGGTG GACATGGTAA 
CTGGAAATGC GGATTTGGTG GACTTGAAGA CACCAATCAA GTATTCAGAC TCGGAGTATT 

TTAACCGTGA TGTCAAACGT CACCTGCGCT TGAAGTACCC GACCAAAATC TTTAAAACTC 1020 

CATCTTTTGA GGTCCACGAT GAGGGCAATC CTTTCTATGT AGCAACGGTT TACCAAAAGC 1080 

AATTTGGACT TGCTGTTCCT CGTCCTGCTT CACTCATTAT CTTGGATGCT ACAAATGGAG 1140 

AAACCAAGGA ATACAGCTTA TCAGATGTTC CAGAATGCGT GGACAGGATC TATCCAGCAG 1200 

AGGAAACCAT TGACCAAATC AACTACAACG GCAAGTACAA GGACGGTTTC TTGAATGCCA 1260 

TCATTTCCAA GAAAAACGTG ACCCAGACTA CCAATGGCTA TAATTACTTG TCTATCGGTA 1320 

ATGACATCTA TCTCTACACA GGTGTGACGT CGGCTAATGC GGATGAGAGT AATCTTGGTT 1380 

TCATCCTTGA AAATATGCGA ACAGGAGAAA TCACTAAGTA TAGCTTGCCT TCTGCGACAG 1440 

AAGAATCAGC CCGTCAATCA GCAGAAGGTG CTGTTCAGGA GAAATCCTAC AAAGCAACCT 1500 

TCCCAATCCT CATCAACCTC AATGACAAGC CTCTCTACAT CATGGGCTTG AAGGACAATG 1560 

CTGGCTTGGT CAAAGAGTAC GCCCTGGTAG ACGCAGTCGA GTACCAAAAT GTTATCGTTG 1620 

CTACTACAGT GCAACAGATG CTCAGCAAGT ATGCCAATAA AAACGACCTT GAAATTGACA 1690 

ATGCAACGAC AGAAAGCATC AATGCAGTAG TAGCAGACCT CAAATCAGCT CTTATCAAoG 1740 

GAGACACTGT CTACTTCTTT AAAGTTGATG GCAACATCTA CAAGGTCAAG GCTTCAGTAT 1800 

CCGATGACCT TCCTTACCTT GAAAATGCTA AAACCTTCCA ACGTCAAGTA GGAAAAGACA 1860 

ATTATCTCAA GACCTTTAAG CTACGGTAAA AATAGGTTTT TTTCAGAAAG TATATGTTAT 1920 
AATAAGGTAA ATTAAGCCG 

(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90? base pairs 
(8) TYPE: nucleic acid 



660 
720 
780 
840 
900 
960 



1939 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 311: 
CCTGCTAATA GAGAGAAAGA CTAGGAGTAG AAGTAAGCCA ATTAAATAAT GAGAAAGTTT 60 

CATACCCCGT CCTTTCATGT AGATTTGGTA TCGAAAGATA TCTCCGGATA TAAATGTAAC 120 

ATTA TTTTTC TAATCTCTCA_ATAA^TTTC TGACAATTTA _ ATAAATACAA CAAGGACAGA 180- 

GCAACAAGAC TTTCTCCTTT GTTATCCTAT TCTAAAATGT TTTTACCTTA ATCTGATAAA 240 

ATAATATCTT CGAGGGAGTA GCTAGCCGTC CAATCAAGAT ATTGTTTAGC TTTTGAAGCA 300 

TCTGCTAGGA CACTGGCTGG GTCACTAGCA CGTCGAGCAA CAATCTCGTG TGGGATTTTT 360 

TAATTTAGTA ATTCTTCAGC AGTTTTAAAG ATTTCTTTGA TAGTATAGCC TTTTTTAGTT 4 20 

CCTAAGTTAA AGATTTGAGA AGAACTGTCT TCTTCAAATA GGTAGTTCAT TCCTTTAACA 480 

TGAGCCTATG CAAGGTCCAA CACATAAATG TAATCTCGAA TACATGAACC GTCACGTGTA 540 

TCGTAGTCAT CTCCAAATAT TTTTAAGCTA TCATTTTGTC CCAATGCGGT CTTGTTGATA 600 

TTTGGAATGA TGTGAGTTGG ATTTTTCACA CGCAGACCGT TTGAAGCATC CATTTCAGCC 660 

CCAGCAACAT TAAAGTAACG GAAAATAACA TATTTCCAGT CGTAGCGATT GCCCATCCAG 720 

TAAATCATTC GTTCGCCCAT CAGTTTTGTC TCTGCATAAG GGTTGACAGG GTCGAGCACG 780 

GTATCTTCAG TCACCGCCTT CTCAATACAG TTATTTCCAT AGAGAGAAGC AGTCGAAGAG 840 

AACATGATTT TTTGAATGCC AACTTCAGAT AAGACTTTGA GAACTTCCTT CATACCACCA 900 

ACGTTGG g 07 
(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 

CCACATAAAG GTAAATATCT TTTCTACTAT CTTGGGCATC CAAGAAAAGC AATTCGGCAA 60 

TAACAGAGTT AGCCATATTG TCTTCAACCG GACCTGTCAG CATAATCATG CGGTCTTTGA 120 

GAAGACGTGA GTAAATATCG TAAGAACGTT CTCCACGGCT TGTTTGTTCA ATAACTACAG 180 

GAATCATTCA TTTCTCCTTT TGAGTTTTAA TTTTGTTGGT CAAATCACTG AAGATAAGAC 240 
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TATTATAATA TCTTGGTCAA AAAAGGTCAA ATTTTTGCTC TGCTTTCATT AGACAGAAAC 300 

AAAAACCCAA CCTCCTTTCG TGACTGGAAA TACTTTTCCA ACTCATTCTT CTTTTCCATC 360 

TTATTTTGTA CCGAACAAGC GGTCTCCAGC ATCTCCAAGA CCTGGAACGA TATAACCGTG 420 

TTCGTTCAAA CGTTCATCCA AGGCTGCTGT AAAGATTTCT ACATCTGGAT GAGCTTCTTG 480 

AAGGGCTTTT ACACCCTCTG GAGCAGATAC AAGGCAGACA AATTTGATAT TTGATGCGCC 540 

ACGTTTTTTA AGAGAATCAA CAGCCAAGAT TGCTGAGCCA CCTGTTGCCA ACATTGGCTC 600 

TACTACAAAA ATTTGACCTT GGTCAATGTC CTCAGGCAAT TTCACCAAGT ATTCAACTGG 660 

TTGAAGTGTT TCTTCATCAC GCTACATACC GATGTGGCCA ACTTTAGCAG CTGGAACCAA 720 

GTTCAACAGA CCATCAACCA TCCCGATACC TGCACGCAAG ATTGGGACGA TCCCCAATTT 780 

CTTACCTCCC AATTGTTTTT GAACTGTTTT TCTAATTGGT GTTTCGATTT CCACATCTTC 840 

TAGTGGAAGA TCACGAAGTA CTTCATACCC CATCAACATT GCAATCTCAT CTACTAGCTC 900 

ACGAAAAGCT TTTGTAGAAG TATCTGTACG ACGCAAGATT GACAATTTGT GTTGAATCAG 960 

TGGGTGATTA ATAACTTCAA TTTTTCCCAT TTTTGGAATT CCTTCTTTCA ATTTATTCTT 1020 

CTTATTATAC CAAAAAACGG TTTAAAAATC TTTCTAAACC ATTTATTTTT GATAATTTTT 1080 

ACATTAGATC AGCCTCTTTA AGAGCTGTCT CTACTCTCTC AAGTGGTAAA TGGGTCAATT tl40 

CTGTCCCTTT TTCTTGATAA AGGTATTGGG CGTAGTCGTC CATTCGGTAC TGGTTCATAT 1200 

AAACCACGCG CTTGCAGCCG ACCTGAAGCA ATTGTTTTGT ACAGTTGAGA CAAGGAAAAT 12 60 

GGGTTACATA GGCTGTAAAG CCTTTGGGAA CACCACGCTC AGCACCTTGA ACGATACCAT 1320 

TGACCTCAGC GTGAAGGCTG CGAACGCAGT GGCCTTCAAT GACCAAACAT TCGTGATCAA 1380 

TACAATGCTC AGTCCCTGAC ACCGAACCAT TGTAACCAGT GGAAATAACC TTATTATCTT 1440 

TTACCAGAAT CGCGCCCACT TTAGCACGTT TACAAGTGGA ACGATTCGCA ATTAGTAGAG 1500 

CTTGGGCTGC AAAATACTCA TCCCAGGCCA GTCTTTTTTC AGTCATCTCT TTTCTCCTTT 1560 

TTCTCTATTT TTTAAAAAAT GGTAAACCTA AATCTGCAAT CTTTTCAGCT GGTACCTTCA 1620 

TGCCATCCTT GATCCATTTT AGAAGGACAG AGACGATGGC TGAGCTCCAG AAGGAATGAA 1680 

GATAAGAGCT CACACCTTTT GATTTCCCAT GGTATTTTTC TAGAAATTCC TGCATGGCTT 1740 

GGACAAAGAT TTTTTCCAGA TGGTAATCCA AGGCCAATTG AATTACTCTA GCTTCCTTTC 1800 

TGGCCTCCCG GAAAAGGTGA ACCCAAACCA AATAAAGGTC TGTCTTTAAA TCGTAATGAT 1860 

GCAGCTGTTC CATAATATTG TGGACAGTTC GTTTAAAGAC GCTCTCTAAA ATTTCCTCTT 1920 

TGGAGTCATA ATTGCGATAA AAGGCCGCAC GCGAAACACC TGCACGTTTG ACCAATTCAG 1980 

AAATACTAAT CTTGGTCAGT TCCTTTTTTT CCAAGACTTC CAAGAGGGCT GTTTCAATGG 2040 
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CTTCTCTGGT TAATAAATTG GATTCTTGCT TTGATTTTCT GACATTTTCA AGAGACTTTT 
CAGAGATTCT ACGTTCAGAC ATAACATTTT CTTTCTACTT GTCACAACAG ACGGATGATC 
CTTTTGTTTC 

(2) INFORMATION FOR SEQ 10 NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 539 base pairs 

. (B) .TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

ATCTGCACGA ATCAGGGCTT TCTAAGTGAC TATTTCCACC GAAATATTAT TTATATCAGG 

AGGACATTCA TATGTCACGT TATACAGGAC CATCTTGGAA ACAAGCTCGT CGTCTTGGCC 

TTTCACTTAC AGGTACAGGT AAAGAATTGG CACGTCGTAA CTACGTACCA GGACAACACG 

GACCAAACAA CCGTTCTAAA TTGTCAGAAT ACGGTTTGCA ATTGGCTGAA AAACAAAAAC 

TTCGTTTCAC TTACGCTGTA GGTGAAAAAC AATTCCGTAA CTTGTTCGTA CAAGCTACAA 

AAATCAAAGG CGGAATCCTA GCTTTCAACT TTATGCTTCT TTTGGAACCT CGTTTGGATA 

ACGTTGTTTA CCGTCTTGGT CTCGCGACTA CTCGTCGTCA AGCTCGTCAA TTCGTAAACC 

ACCGTCACAT CCTTGTTGAC GGGAAACGCG TTGATATCCC ATCATrCCGC GTAACTCCAG 

GTCAAGTGAT CTCAGTTCGT GAAArATCAT TGAAAGTTCC AGCAATCCTT GAAGCACTA 

(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 base pairs 
<B> TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(DJ TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 314: 
CCGGTTTTGC TCCTTCTCTA CGGCTACGAC GTGATGTATC TCTGATGATA TCCACTGTTT 
CTGTAGCAGG CGTAGGTGTT TCTGGACCTG CTTGTTCTGC TTTTTTCTCT GCCGTCGTAT 
AGGAAACAGC TACCCTTGTT GGGGTTTCAT TGTATTCTCT TTCAAGTTTC TTAGGTCTAA 
CAGGACCTGG ACCTGGTCTT GATCCACTTT CTTCCGCTGG AGAAGAAGGT ACATCTTCAC 
TTGGATGACT TGGAACACCA GGACTTTCTC TTTGAATCTC ATCTGCTCGA GAAGCTGGTA 
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CACCTTGACT TGGGTGAGTA GGCACGCTAG GACCTTTTCT CATAATCTCC TCTACCGTTG 360 

ACAAGGAATC AGCCATGAGT TCTTCAGTTG AACGTTCATT TGCAGGAGTG CGAACTACTG 420 

CCTCATCTTC TTTCAGAACT TCATCATAGC CTTTTACTTT TTCTAAATCT CTCAGAATCT 480 

GCTCTTTAAA GCGTAATTTC TCTTCTGCTC TTGACTTTTC ACTCAAAAGT TTTTCCTCCT 540 

TGTTGAGAAT CCATAATATT AGAGCTGAGA AGTCCAAAAA AAGCAATCTA TGATACTTTT 600 

CCTAACGGAT TTTGTCATTT CCCAGACCAT ATCATACCAT GTTTCCCCTG CAAAGGTTGA 660 
CTGGGAA 

(2) INFORMATION FOR SEQ ID NO: 315: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



667 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

GGGAAGCCAA GGTATTTTAT CGGATGAACT TGTTACTAGT TCTTCACCGA TGGCTACAAA 60 

AGAGTCTTCT AATGCAATTA CTAATGATTT AGATAATTCA CCAACTCTTA ATCAGAATCG 12C 

TTCTGCTGAA ATGATTGCCT CTAATTCAAC CACTAATGGT TTAGATAATT CGTTAAGTGT 180 

TAATACTATC AGCTCTAATG GTACTATTCG TTCCAATTCA CAATTAGACA ACAGAACAGT 240 

TGAATCTACA GTAACATCTA CTAATGAAAA TAAGAGTTAT AAGGAAGATG TTATAAGTGA 300 

CAGAATTATC AAAAAAGAAT TTGAAGATAC TGCTTTAAGT GTAAAAGATT ATGGTGCGGT 360 

AGGTGATGGG ATTCATGATG ATCGACAAGC AATTCAAGAT GCAATAGATG CTGCAGCTCA 420 

AGGGCTAGGT GGAGGAAATG TATATTTTCC TGAAGGAACT TATTTAGTAA AAGAAATTGT 4 80 

TTTTTTAAAA AGTCATACAC ACTTAGAATT GAATGAGAAA GCTACAATTC TAAATGGTAT 540 

AAATATTAAG AATCACCCTT CCATTGTTTT TATGACAGGT TTATTTACGG ATGATGGTUC 600 

GCAAGTAGAA TGGGGCCCAA CAGAAGATAT TAGTTATTCT GGTGGTACGA TTGATATGAA 660 

CGGTGCTTTG AATGAAGAAG GAACTAAAGC AAAAAATCTA CCACTTATAA ATTCTTCACG 720 

TGCATTTGCT ATTGGGAATT CAAATAACGT AACTATAAAA AATGTAACAT TCAAGGATAG 780 

TTATCAAGGG CATGCTATTC AAATTGCAGG TTCGAAAAAT GTATTAGTTG ATAATTCTCG 840 

TTTTCTTGGG CAAGCCTTAC CCAAAACGAT GAAGGATGGG CAAATCATAA GTAAGGAGAG 900 

CATTCAGATT GAACCATTAA CTAGAAAAGG TTTTCCTTAT GCCTTGAATG ATGATGGGAA 9 60 

AAAATCTGAA AATGTGACTA TTCAAAATTC CTATTTTGGC AAAAGTGATA AATCTGGGGA 1020 
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ATTAGTAACA GCAATTGGCA CACACTATCA AACATTGTCG ACACAGAACC CCTCTAATAT 1080 

TAAAATTCAA AATAATCATT TTGATAACAT GATGTATGCA GGTGTACCTT TTACAGGATT 1X40 

CACTGATGTA TTAATCAAAG GAAATCGCTT TGATAAGAAA GTTAAAGGAG AGAGTGTACA 1200 

TTATCGAGAA AGCGGAGCAG CTTTAGTAAA TGCTTATAGC TATAAAAACA CTAAAGACCT 12 60 

ATTAGATTTA AATAAACAGG TGGTTATCGC CGAAAATATA TTTAATATTG CCGATCCTAA 1320 

AAC AAA AGCG ATACGA CTTG CAAAAGATAG TGCAGAaTVT TTAGGAAAAG--TATCAGATAT 1380 

TACTGTAACA AAAAATGTAA TTAATAATAA TTCTAAGGAA ACAGAACAAC CAAATATTGA 1440 

ATTATTACGA GTTAGTGATA ATTTAGTAGT CTCAGAGAAT AGT 1483 

(2) INFORMATION FOR SEQ ID NO: 316: 

U) SEQUENCE CHARACTERISTICS: 

fA) LENGTH: 2453 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND EON ESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 

CCTGAACGCT TTTTTATAAA TATCATAAAG CCAATCTGAT TTATCAAGTG TGTCTAAGCG 60 

ACGCGAATTA AAATTCATTG CATACTCCAT CGCTTCTAAA AAACTCATTT TTGAAAAGAC 120 

GTTAAAATCA TCTAAATTCT GACTCCAATA TAATAACAAA ACCAATCCCA TAATATCCTC 180 

TCCTTCATTA TTCAATAAAT TTAAGTTGGT TTCATAAAAC CCTGGAGTTC CAAATAGAGG 240 

CAACTTTTTT TCTTCAATTT GAGTTTCTTT CCTTAGGGCA TGCTCAAAGT CTATAATATA 300 

AATATTATTT CTATTATCAA TAAGTATATT ATTAAATGAT AAATCTCTAT AGGAAAGATT 360 

ATATTTGGAG TTTATTATCT CCATATAATC AATTAATGTT AAAAACCAAT CATACGAGCC 4 20 

ACTAACCATA TTATACTCGC TTAATTTATC TGCAATAATA AACTCAAATT CCACAAAATA 480 

CGAATTCTTT ATGTAAAAAT CGTTAAAAAC TTTTGGAGTA AATTCCTCCT TTTCCAATTC 540 

TACTAATATT TCTCTTTCAT TTATTAAACG ATTCACAGAA TCTCTATTTG TAAAATCAAC 600 

CAACGATAAA TCACTAGCTT CTTTTAATAA AGAATAAACT CGCTTTTGAG TATTAAATAC 660 

TTTATAAACT CCACCTTTGG CATTTTTAGA AATCACTTCC AAAATAATAT ATTGATCAGG 720 

AATAGTGTTA TATCTTGGAA TATAGTAATC CCTTATTGGA ACATTCACAT TTGAAGGGAT 780 

TTTCTTATCT CTTTTATCCT TGAAAGTGCT ATCTTTTACG AACTCCCCAT ATCTCTAATA 840 

TACAACCTCG CTAAGTTGAA ATCTGAAATC TGATGGTATG TTTACACCCT TTACACCTTT 900 
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ATACAATATT TCTAATTTGT GTAACAAACG TTGAAACTCT TTATTATCTT TTGGATAAAT 960 

TGTAATCAAT TTCCCGACTT GTGAATAACC ATTAAGCCCT GTATTTTGCA AAGAAACTTC 1020 

TTTAATGCTA ACCAAAATTT TGAAATTTAT CTTCTTCTCT CTAGAAAATA TAAAATCAAA 1080 

GAATTTTTTA GCAACCAAAT TAGCATTTAA TATTGAAGCG CTCAGCTGTA TTTTAAATCC 1140 

CTTAGATTGC GTGATATTAG ACGGCAAATT ATATAACCAA" TGTTCATCAC TAAAATTATC 1200 

ACTAATTTTA TATTCTAATA ATAAATTATG GTATGCGTCT TCTATTTCAG TTTCATAGTC 1260 

CAAATACTTT AAATACTTTT CGTAATTCAT ATTAAGAAAT CTTCTCCATA AATTTTTAGA 1320 

C CATC ATT T A AAGCCAAACA ATTTAAAGCG TGATAATAAA TGTTGATAAT CAATGTAACT 1380 

TTCAGTCCTC TATTTTGTAA TTCCTTCACC AATAATTTTA TGCTATATCT ATTTTCTCGA 1440 

GGCAATTTAT AGGACTTCAA GATAAAACCA TAAAAGAGAT AAGTATTATA ATCTGACAAT 1500 

CCAGTTTCAG AATAATTTTT TAGAAAAATA TCTAGTGATT CTGATAATTC ATCCGGAATA 1560 

ATTCTTTTAA CATCGTATTT ATTTTTCATA TCGCCCACTC TTCCTTAAAA AGCTCACAAT 1620 

AAAATTTTAA ATTTCTATAC AACAATCCGA GAGTAGTCTC ACAATTTGAA CATTTCACAT 1680 

CACTCTTAAT ATATAAAAAA TGAATTAATC AGAAACCTCT GACTAAGATT TCCTAATTAA 1740 

TTCACTTTCT ATATCATAGT AAGGAATTCT ATTATCCCTA ATTGAAAATT GAAATTTTAT 1800 

GTTTTATATA TTAACAATTA TGCGGATTGT AAATCTTGTC TAACAAAATG GCAAGTGCTA 1860 

CTATCTGCCC CAGAAGGCGA TGCAACGCTA TTTTGAATTG AAAGAGCATA ATCATCCATA 1920 

TCATTTAAGT CACGGATTAG CAATGCTTCC TTCTCTCTTC CGACAATTCC AAATTTTCTA 1980 

ATTACCTTTT CAGGATTATC AAAAAATTCT CCAACAACTT CCATATTTCC TTGAAGTTCA 2040 

TTCAAGAAAG CTTTCATTTG ACTACTCATT ATATAGCTCC TTTTCTATTA CTTTATTTGG 2100 

AATCAAAACT TACTTGTACA TTGGAAACAC CTCTATTCTA CGCTTTCATA TTGCTGCATG 2160 

ACACTTTCAA AATCAAATTG CTAAAAATAA TTTTTTAAAG CTTAATTTAG ATTTAATTAC 2220 

ATATATCTCA AAAAATTGTT TTGAAATTAG TAAATTAAAA TAGGTTTCTG TACTTATAGG 2280 

AACTAGTTAT AAAAACTTCC CCCATCATAA AATATCTATT TAAGTAAAAC AAAAATTTTA 2340 

TAATTTTTTG ATTTTTAAGT GACTATAATC TCCTATCTAT AAATACCATT CGCACGACCT 2400 

GGATCAATCC CTCTACCCAT CTTATGAACT TGAGTTCCTC CAGACAGTCC CGG 2453 
(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY : linear 



1 
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<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 


317: 






CCAATTTGAA GCCTCTAAAA CAATGGAAAA CTGCTACACA 


GATGTGACAG 


AATTTGCCAT 


60 


TCCAGCAgTA CTCAAAAACT TTACTTATCA CCACTTTTAG 


ATGCCTTTAA 


CAGCGAAATT 


120 


ATTCCTTTTA ATCTTTCTTC TTCGCCTAAT TTACAATAAC 


TACAAACAAT 


CTTCGAACAG 


180 


GCATTCAAAC AGAAGCACTA . TGAGAATACG ATTCTCCATA ^GTGACCAAGG CTGGCAATAC 


-- -240 


CAACACGATT CTTATCATCG GTTCCTAGAG AGTAAGGGAA 


TTCAAGCATC 


CATGTCACGC 


300 


AAGGGCAACA GCCCAGACAA CGGCATGATG GAATCTTTCT 


TTGGCATTTT 


GAAATCGGAG 


360 


ATGTTTTATG GTTATGAGAA GAACTTTAGA TCTTTAGAAA 


ACCTTGAACA 


AGCTATTGTG 


420 


GACTACATTG ATTATTACAA CAACAAGAGA ATTAAGGTAA 


AGCTAAAAGG 


ACTTAGCCCT 


480 


GTGCAATACA GAACTAAATC CTTCGGATAA ATTAATTGTC 


TAACTTTTGG 


GGTGCAGTAC 


540 


ATTTTTGGTA TATATAAAAT TTGTAGGAGC TATATCTACA 


ATTTTATATT 


CCCAGTTTAT 


600 


GGATGTAACT TACTATATTC ACAATGTTAT CCAGTGTTTT 


TTCTCTAATA 


TTTAAGGAGT 




GTTCTGTTTC TCCAATAAAT TCTTCAAAGT TTAACCCGTC AACTTGTTCC TGAACAAGAA 


720 


AATAATCATC CACGATATAA AATTCATCAG TTAAATTAGT 


AGTATAACTT 


TTATCGGCTA 


780 


ATTTTTTTAG CATGTGAGCT TCATTTTTTA TATCATCAAG 


AGCTGTCCAT 


TCTCCTTCAG 


840 


CATCATAATT CACAAAAGGT CTTGACTGCT TGATGATTAC 


TTTTTGCCCG 


TCCGATTTTC 


900 


TAATTGCCCG ATAAACATTT CCTTTATTTG ATCTCTTAAT 


AATTTTTTCC 


ATTTTGTATT 


960 


TATTTATTGC AGAGTCCTTA CTTGAAACTT CACATGTGGT 


TTGAAAATAA 


ATCCTTTTTT 


1020 


CTTCTTCTGA AAATAAATCC ATTTTCCGG 






1049 


(2) INFORMATION FOR SEQ ID NO: 318: 









(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 base pairs 
IB) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

TTAGTTGGTT AGAATCAGAA AATCGCCGAA GTGGTTATTT ATTTTTGAAT AAATTTAACG 60 

AACCAATTAC AGCAAGAGGA GTTGCTCAAC AGTTAAAAAA TTATGCTGAT AAATACAAAA 120 

TGAATCCTAA AGTAATTTAC CCTCATTCTT TTAGGCATTT ATTTGCTAAG AATTTTTTAG 180 



/ 
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CGAAGTATAA TGATATTGCC TTCCTTGCAG ATTTGATGGG ACACGAAACT ATAGAAACTA 240 
CTCGAATTTA TCTAAGGAAA ACAGCTACTG AACAACAAAA TATTGTAGAT AAAATTCTTA 300 
ATTGGTAAAA AATAACAGGT GGTCAAACTG ACTACCTCCT ATTTTTGTGA TTATGGCTCT 360 
TATTATGGCA ATATACCTAT GAATTGGGTT CTTATAAAAA TAAAAGATAT TTTTTCAATA 420 

AATACAGGTC TTTCTTACAA GAAGGGCGAT TTAAGCATTA ATAATAAAGG TGTTAGAATT 4 80 

ATACGTGGTG CTAATATTAA GCCTTTAGAA TTTTCTCTGT TGGATAATGA TTACTACATT 540 

GATACACAAT TCATCTCCTC TGAGCAAGTT TATTTAAAAC ATAATCAGCT AATAACACCT 600 

GTATCAACCT CTTTAGAACA TATTCGAAAC TTTGCAAGAA TCGAGAAAGA CTATGATGGT 660 

GTTGTGGCTG GTGGATGTAT TTTCCAATTA ACACCATTCG AAAGTGCAGA GATGATGTCA 720 

AAATGTCTAT TATGTAACTT GTCCTCTCCG TTATTTTATA AACAATTGAA AGCAAT 77 6 
< 2 > INFORMATION FOR SEQ ID NO: 319: 

(ii SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 
TGCAATGCGG CGCCTGCATA CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA 60 

ACATCGCATC GAGCGAGCAC GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT 120 

GGACGAAGAG CATCAGGGGC TCGCGCCACC GAACTGTTCG CCAGGCTCAA GGCGCGCATG 180 

CCCGACGGCG AGGATCTCGT CGTGACCCAT GGCGATGCCT GCTTGCCGAA TATCATGGTG 24 0 

GAAAATGGCC GCTTTTCTGG ATT CATC G AC TGTGGCCGGC TGGGTGTGGC GGACCGCTAT 3 00 

CAGGACATAG CGTTGGCTAC CCGTGATATT GCTGAAGAGC TTGGCGGCGA ATCCGCTGAC 3 60 

CGCTTCCTCG TGCTTTACGG TATCGCCGCT CCCCATTCGC AGCGCATCGC CTTCTATCGC 420 

CTTCTTGACG AGTTCTTCTG AGCGGGACTC TGGGGTTCGA TGTCGACAGC CCGCCTAATG 480 

AGCGGGCTTT TTTTTCCTGA GGCTGGACGA CCTCGCGGAG TTCTACCGGC AGTGCAAATC 540 

CGTCGGCATC CAGGAAACCA GCAGCGGCTA TCCGCGCATC CATGCCCCCG AACTGCAGGA 600 

GTGGGGAGGC ACGATGGCCG CTTTGGTCCC GGATCAATTC GCGCGACCGG ATCCATCC 658 
(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1475 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 
CCGGCTTAAT TTTTAGAAAA CGTCGGCAGG GAACCTTTGT TCTCTCTCCT GGCAGCTCAA 60 
AAAGAAAATT AATCGTTCCA GAAAGAGATA TCCGGGGACT GACAAAAATA TCTGAAGATG 120 

C TCATTC TAC AATTGACTCG AGGATTATTC ACTTCAAATT AGAATTTGCA AATGAATTTT 180 

TAGCAGAAAA ACTACAGGTC GCTTTGCAGA GTCCAGTTTA TAATATTTAC CGCCTGCGTA 240 
TTATTGACGG TAAACCTTAT GTTCTGGAAC AAACTTATAT GAGTACCGAT GTTATTCCAG 300 
GTATTACTGA AGATATTTTA CAAAAATCGA TTTACAATTA CATTGAAGGA AAGTTAGGAT 360 
TGCATATTGC CAGTGCTACA AAAATCTTAC GAGCTTCTTC TAGTTCAGAA AATGAGCAAC 4 20 

ATTACTTGCA GCTCCTTCCA ACGGAACCGG TATTTGAAGT AGAACAAGTG GCTTATTTGG 4 80 

ATAACGGAAC TCCCTTTGAG TACTCGATTA GTCGTCATCC CTATGATTTA TTTGAATTTA 540 
ATTCTTTTGC ATTACGACAT TCCTCCTAGG AGAAAATGTG AAAATGAAGC CAATCTTTTA 600 
CAGACTCTAG TTTAAGAAAA ATTTAAAACA GGGCAAGAAC GTCCCATCTA TGCTTAAATG 660 
GTTTCTCTTT TCTAAATAAG ATGGCTTTAA AAGAGTGATC GTTGTATCCA TCATGTTGAA 720 
AAATATCTTC GTATAGCTTA TAGAGTAGGT ACTGAAATTG TTCACCTGAT CTACTTCTTA 780 
TAGTTATTTA GTTTTAAATA GTGTTTCAAA CATTCTTACA CTGACGAGAA GTTTTTGAGT 840 
CTTTTCTTGT AACACATATA GTATACTGTG GTTAGAATAG TAGACTGTGA CTTCTAACAA 900 
ATTGCTAGAA ATGAATTTCA ATCTCCCAAT TTATTTGTTC ATATCTTCTT TTAATATATT 960 

AAATAAATTC TAAATCATAA TCATTTAAAA AAATTTTATT TTTTATTTTT CATTACGAAT 1020 

AATATAGATG AAGGGGAAAG AGTATGAAAA CAGAACTGTT TCTTTTGCTA TTAGTTCAAA 1080 

AGGAGAAAAA ATGAAAGTAG AAAATATTTC GTATAGGGTG GATCATCCTA AATTGTTTGA 11-40 

TAATATTTCT TTTGATACTT CGAGTTCAGA CGTGACATTA ATTACTGGTA AAAATGGTAC 1200 

AGGAAAGTCA ACTTTACTAT AGTAGATTGA AACTAGAATA GTACACATCT ACTTCTAAAA 1260 

TATTGTTAGA AATCGATTTG ACTATCCTGA TCTATTTGTC CTGTTCTTAT TTCATTTCAC 1320 

TATATCTCAA ATTGAGTATG ACGAAGTGCG CTCCCATCTC CTGGGAACGC ACTTTCTTCA 1380 

TATTTTTCAT ATTCTTGAAT CCATCGATAA AGACTATTGG GATGAATTTT TAAAGTTGAA 1440 

CTAATCATTT TTACAGGATG AGATTTACAG CAGAG 1475 
(2) INFORMATION FOR SEQ ID NO: 321: 
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ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 base pairs 
IB) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

GAAATATATA TACTTCATCT TAATACTGAG CAACCTAAAC TTAGCATTTC ATGCCCTCAT 60 

ATGGCATGTT CTTTGACTAA ATAATATGAT TATCGAGATA TATCTGGATA AATGAACTAA 120 

TAAGTCTCAC GCGTAGACTT ATCAAAGTCA TTGGCATACA CCACTATGAA CTCGTTGGTC 180 

TGTTCAAATC CCAACACATT ACCTGAGAAG AAAGTTGCAA TGTTGTTTTT GGTGCGGGTT 240 

TGAATTTAAA AAATTTGTTA TGTAGTACCT AATCTAAGGA ATTAGAACAA TGCCTCTAAT 300 

TTTTCTTTAA TACACTGAAA CATTGATGAT TCTGGCTGTA TTTTTGAAAC ACCTCTTCTT 3 60 

TCCTCCTGGA AAATATCTTC AGAAGTTATA TTCTCTATTC CTAACGCTAC TTGAGTTTTT 420 

TTTCTAAAAT ATTCTTTTCC GTTGCCATCT TTAGAAAAAT CATAACCTTC CCTATCTACG 480 

CTGTTACACA AATTAGCTAA AAAArACTCT GGGGTTGGGA AAGGAAGATA AGAAaCGTAT 540 

TTAGCCCATA ATCTATAAAG 560 

(2) INFORMATION FOR SEQ ID NO: 322: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

CCGCCCGGCC ACCGCTGCCT ATCCTCGGGA GAGGGTCACC TGGAGTGAAC CTAGAACGAT 60 

AGACACGGTG CGGTACGACC TCGTACTACT TTCGCCGACG GCCTCGTCCG TTGTCATCCA 120 

CGAACTGATC GGACATGGGT GCGAACACTT CAGAGAAAAA ATCGTTGGAC TGCGTGTCGG 180 

GCCTGAGGAA CTACGGGTGG TGGCTTTTCC GAAGAACGGC TCCGGGTTTG ATGACGAGGG 240 

TACACCCTCC GAAGAGATTG TACTTGTGGA GAACGGCATT GTGAGGCACG CTGTCAGGGA 300 

TCGGGCGACT GGAGGAATGG CCC C T T T T TC CGGTTTGACC AAAGTGGCAT CACATGGTGT 360 

CAAACCTGGC TCAAGATGTA CGCATCTCAA GGCGGAAGGG GAATCGTCAC AGGAAGGAGT 420 

TACCGGAGTA CCCGCCGAAC GCACCGTTTG GATAGAGCAT TTTTCTGCAG CGAACTACCA 480 

TTCAGGTCGA GCCTTTTTCA GGTCTGGCCT TGCCTGGGTA GGCAGCCGAG AAGAACTCTT 540 



ATATCCCTTA ATGCCTTTCA CCATGTCAAT TGATATCTAC GAACTCCCCA GCTTATTGTG 
GCATTTAGAC GGTCAAACGG AACCAGCACG TAGGGTACTG TGC 
(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: nucleic acid 
{C> STRANDEDNESS : double 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 

GGTACCCACT CATTCTTGAT GAATTGTGAA CAGTTGCCCT TGGGTCGTTT TGCGAGTTGA 

AGTCAAGAAG AGGAAAAAAA CAAAAAGGAG AAATACTCAT GGCAGTAATT TCAATGAAAC 

AACTTCTTGA GGCTGGTGTA CACTTTGGTC ACCAAACTCG TCGCTGGAAT CCTAAGATGG 

CTAAGTACAT CTTTACTGAA CGTAACGGAA TCCACGTTAT CGACTTGCAA CAAACTGTAA 

AATACGCTGA CCAAGCATAC GACTTCATGC GTCATCCAGC AGCTAACGAT GCAGTTGTAT 

TGTTCGTTGG TACTAAGAAA CAAGCAGCTG ATGCAGTTGC TGAAGAAGCA GTACGTTCAG 

GTCAATACTT CATCAACCAC CGTTGGTTGG GTGGAACTCT TACAAACTGC GGAACAATCC 

AAAAACGTAT CGCTCGTTTG AAAGAAATTA AACCTATGGA AGAAGATGGA ACTTTCGAAC 

TTCTTCCTAA GAAAGAAGTT GCACTTCTTA ACAAACAACG TGCGCGTCTT GAAAAATTCT 

TGGGCGGTAT CGAAGATATG CCTCGTATCC CAGATGTGAT CTACGTAtTC ACCCACATAA 

AGAGCAAATC GCTGTTAAAG AACCTAAAAA ATTGGGAATC CCAGTTGTAG CGATGGTTGA 

CACCAATACT GATCCAGATG ATATCGATGT AATCATCCCA GCTAACGATG ACGCTATCCG 

TGCTGTTAAA TTGATCACAG CTAAATTGGC TGACGCTATT ATCGAAGGAC GTCAAGGTGT 

(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 base pairs 
<B) TYPE: nucleic acid 
(CI STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 
CGGGAAAAAT CAGATTGTGG GTTCAGATAT CGAATTAGCC AAGGCTATCG CAACAAAACT 
AGGTGTCGAA TTGGAACTAT CTCCCATGAG TTTTGATAAT GTACTGGCTA GTGTTCAATC 
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AGGAAAACCC GACCTTGCCA TATCAGGTGT TTCTAAGACA GATCAACCGA GCAACGTGTT 180 

TGACTTTTCC ATTCCCTACT ATACTGCAAA AAATAAACTC ATTGTCAAAA AATCTGACTT 240 

GACTACTTAT CACTCTGTAA ACGACTTGGC GCAGAAAAAG GTTGGAGCGC AGAAACGTTC 300 

GATTCAAGAG ACGATGGCGA AAGATTTGCT ACAAAATTCT TCCCTCCTAT CTCTGCCTAA 360 

AAATGGGAAT TTAATCACAG ATTTAAAATC AGCACAAGTG GATGCCGTTA TCTTTGAAGA 420 

ACCTGTTTCC AAGGGATTTG TGGAAAATAA TCCTGATTTA GCAATCGCAG ACCTCAATTT 480 

TGAAAAAGAG CAAGATGATT CCTACGCGGT AGCCATgAAA AAAGATAGCA AGAAATTGAA 540 

AGAGGCAGTT CGATAAAACC ATTCAAAAGT TGAAGGAGTC TGGGGAATTA GACAAACTCA 600 

TTGAGGAAGC CTTATAAGCA TCCA 624 



(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1237 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

TCTTATGAAG CCGAAGCGTG ATTTATGGCG CATAGGTTTC GTCTCCAGAA AGTGACAAAT 60 

CTAGTGCCAT CAGCGTATAT GGAATCTnTC GCTGAGAAAC AGTCCCGGGG TGAACTGACT 120 

TATGACCAGG TTTATGAGGA TGCAACGGCT TATCATCATA CCATTGATGC GAGTACAGAG 180 

GAGGCAGACT TGGTTTCTCT ACGTATTGTA GAACTATTGT CTCGAAGAGG CTTTAGCTTC 240 

AGTCCTGCGA TCTTACTTGC TATTCATAAG GAGTTGTTTC AAGATATATT TGAACCCTCG 300 

ATTCCGGTAG GTCAATTTCG TCAGACTAAT ATCACAAAGA ATGAACCTGT TTTGAATGGT 360 

GAAAGTGTTG TGTACTCTGA TTACTCCATG ATTCAAATGA CCTTCGATTA TGATTTTAAT 420 

CAGGAAAAAC AAGTTGCATA TGCGACACTA ACCCAGGCGG ATATGGTTAA AAAAATCCAG 480 

CATTTTATTT CAGGAATCTG GCAGATTCAT CCATTTCGCG AAGGAAACAC TCGGACGGTA 540 

ACGGTATTTT TGATTCAGTA TCTTCGTGAG TTTGGTTTTG ATATTGATAA TACACCATTT 600 

CAGCAACATT CCAAGTATTT TCGTGATGCC TTAGTGTTAG ATAATGCAAA CATTTTACAG 660 

CGACGTCCTG AGTTTTTAAC AGCTTTTTTT GAAAATCTCT TGCTCGGTGG TCAAAATGAT 720 

TTGTCTTCAG AAAAAATGTA TCTAGATTTA GACCTCGATC TTTCATAATC CTAATACTGA 780 

GTAAACATTG AATTTTAGGA AAAAATGAAG TAAATATTCT CACAAGAAAA CGTATATCAT 340 

CAAA C TTTCG CTCTTTGTCA ATTGTACTGG GTTGAAGAAA AGCTAAGTTC GAGAAAGGGC 900 
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AAATTTCCGC CTTTCCTTTT TGATGTTCAG AGCCATAAAA ATCCGGTTTT TTGAAGTTTT 960 

CAAACTTTCG AAAACCAAAG CCATTGCGCT TGATAAGTTT GATGAGATTA TTGGGCGCTT 1020 

CCAGTTTGGC ATTAGAATAG TGTAGTTGAA GCCCGTTGAT AACCTTTTCT TTATCTTTGA 1080 

GGAAGGGTTT AAAGACAGTC TGAAAAATAG GATGAACCTC CTTAAGATTG TCCTCGATAA 1140 

GTTCGAAAAA TTTCTCCGGG TCCTTATTCT GAAAGTGAAA CAGCAAGAGT TTGAAGAGCC 1200 

GATAGTGATG TATCAACTCT TGTGAATAGC TCAAAAG 1237 
(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

TTTGATTTTT CTGAATTAGA AGAGATTGAA TTGCCTCCAT CTCTAGAATA TATTGGAACA 60 

AGTGCATTTT CTTTTAGTCA AAAATTGAAA AAGCTAACCT TTTCCTCAAG TTCAAAATTA 120 

GAATTAATAT CACATGAGGC TTTTGCTAAT TTATCAAATT TAGAGAAACT AACATTACCA 180 

AAATCGCTTA AAACATTAGG AAGTAATCTA TTTACACTCA CTACTAGCTT AAAACATGTT 240 

GATGTTGAAG AAGGAAATGA ATCGTTTGCC TCAGTTGATG GTGTTTTGTT TTCAAAAGAT 300 

AAAACCCAAT TAATTTATTA TCCAAGTCAA AAAAATGACG AAAGTTATAA AACGCCTAAG 360 

GAGACAAAAG AACTTGCATC ATATTCGTTT AATAAAAATT CTTACTTGAA AAAACTCGAA 420 

TTGAATGAAG GTTTAGAAAA AATCGGTACT TTTGCATTTG C 461 

(2i INFORMATION FOR SEQ ID NO: 327: 

< i > SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1436 base pairs 
(8> TYPE: nucleic acid 
(C) STRANDEDNESS: double 
<D) TOPOLOGY : linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

TAACATTTAG GTACCTCTTC TTAACAAAGT TCAATAGTAA CAATTAATAT TTTAAACAAT 60 

ATATCAAACA TCAATGACTA GAATACTTGC ATCATCCTTC TTTCCATAGA TTGGATCAAT 120 

AGCAGAAGAA TTAAATCTCA TCTTAATTAA CTCTTCAAAA GTTTTATTTT GATTATTTTG 180 
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ATAGAATTCA TAAAAGCCAT CGCTCATTAA AACAATTTGT TCACTAGTAA CATC7ATTTG 240 

ATTAATAATA GCATGGTCTA AAAATCTCTC ATCCAACGAA CCTATCCAGT ACCCACTCGG 300 

TTGATTACAT AATTTTCTGA TTTTTTGTAA AATAATTTTT TTATTTAAAA CACTATTTGT 360 

ACCAATTGAA TCTTTTATCT CATTTTTCCC TTTTTCAAAT AAGTTATCTA CTCTATGATC 420 

AGTTATTTCC ATTTCGTTTA CTAACATGAC GCAGTCACCT AGCATCATAT ACTCCAACTT 480 

TTTTPCTGAA AGTTTAGCAA ATATTGGTAA CCGATAATAT ACT AT ATT GA AACTAGAATA 540 

CTACACCTCT ACTPCTAAAA CATTGTTAGA AATCGATTTG ACTGTCCTGA TTGATTTGTC 600 

CTATTATTAT TTCATTTTAC TATACTCTGT TAATTTATAT GAGTTTAAAC CGATTTCATC 660 

TTTAACCTCG AGTAAAGCAG TTTCAAATAT TTGTTTAAGA GTTTTTCATT CTTTACAATT 720 

AACCGACAAA CTTTCTGATA AAATATGTAC AACTTCTGAG ACTGAATAAC CTATCTCCTC 780 

TTTAGAATTA TATAAATCTG TAGCTCCACC AATAATCCAA AAATACTCAT TTTGTGAACC 840 

TACAATATCC TCATTTTCTA CGGAACTTCC TTGTATCGAA CAAATTTTAT TTATCTTTAC 900 

CATAATACTT CAACCCTTTT AGTGTCAAAA GTAAACCAAT TCCTGTCACT GTTAAGAATA 960 

GTTCCATAAT CTTATTCGAA CCAGTCTTTG GTAATTTTTG TTTkACATCT ACTATyTCTT 1020 

TAGATTTATT AATATGATTT TCAGTTTCTC TCCCATCTCC AACTATTTTA TACTTTACTT 1080 

CTTCTGTCTT ATTATCTTGT TTATTGTCGA TCTTGTCATT CATTTGTCTA TTATCTTTAC 1140 

TTGAGTTAAA CTCTCCGTTC TTCTGGTTAC TATCAATTAC ATTATTTGAA TTAGATTGTT 1200 

TTTCCTCTTT GTTTTTTTCT T TT TC G TT I T TATCACTTAA ATTATTTGTT ACAATTTTGT 1260 

AAAGCCCATT CTCCGTTACA ATATTGAAAT TACCATCGCT ATCACGTATA ACACCTTCTT 1320 

TCCCATTTGC ATTAGATTTG ATGAATGATA TATACTTACC GGATAAATTA TAAAATTGGT 1380 

TATTTAAAAC GGTTATTTTA CCCTTTGAAT CCTCAATAAC AATTCCTTCT TTACCC 1436 

(2) INFORMATION FOR SEQ ID NO: 328: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 646 base pairs 
(B> TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 

CCGGCAGACA GGACAAGGTG TTAAATATCA ATCTCAAATG GTTCGTCAAT GGTTTCTGAT 60 

ACGTATTTTC CGTCTTTCTT CCGTTGCTTG ACACACTCTG TGAGCAGATA TTCGATTTGC 120 

CCATTGACTG AACGAAAGTC GTCTTCTCCC CATGATGCGA GTGCAGCGTA TAACTTTGTT 180 
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GAGAGTCCAA GGGGGATCTG CTTTTTTTTA GCTTCAGCCA TCTTTAGTAA AGGCTTCCTG 
TGTTGACAAT TGGTTGTGCA TCATGATTGC CACAAAGAAC GACAAGGAGA TTTGAAACCA 
TGGCAGCTTT TCGTTCTTCG TCAACTTCTA CCAATTCCCC TTCATTGAGC CCTTCTAGTG 
CCATTTCAAC CATTCCTACA GCACCATCTA CAATCATCTT CCGTGCATCA ATAATGGCAG 
ATCCTTGTTG GCGTTGAAGC ATAACGGCAG CAATTTCTGG AGCATAAGCT AGGTAAGTGA 
TACGTGCTTC AAGG ATTTCC AAG C CAGCAT CCTCAACA CG ACTTTGGAT T TCT TCACGAA 
TACGGGTAGC AACAATTTCG CTAGAGCCAC GGAGACTACC TTCATCTGCG TCCCCATCAC 
CCGGAGTATC CACATTAGGA GACACATCGT AACGATAGAT GCGGAC 
<2> INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 329: 



CTTGCAGGTG 


CAGTAGGTGT 


TACTTCAGAT 


ACATTTGAAC 


GTGCAGAGGC 


TCTTTTTGAG 


60 


GCAGGAGCGG 


ATGCGATTGT 


TATTGATACT 


GCACATGGTC 


ATTCTGCAGG 


TGTCTTGCGT 


120 


AAAATTGCCG 


AGATTCGTGC 


TCATTTCCCA 


GATCGGACTT 


TGATTGCTGG 


AAATATTGCT 


180 


ACTGCTGAAG 


GTGCACGTGC 


CCTTTATGAA 


GCGGCTGTAC 


ACGTTGTTAA 


GGTTGGTATT 


240 


GGACCAGGTT 


CTATCTGTAC 


TACTCGTGTG 


ATTGCTGGTG 


TTGGTGTTCC 


GCAAGTAACA 


300 


GCTATCTACG 


ATGCTGCAGC 


TGTTGCGCGC 


GAATATGGTA 


AAACGATTAT 


TGCTGACGGT 


360 


GGGATCAAGT 


ATTCTGGAGA 


TATTGTAAAA 


GCACTTGCTG 


CAGGTGGAAA 


TGCTGTTATG 


420 


CTTGGATCTA TGTTTGCTGG 


AACTGATGAA 


GCTCCAGGCG 


AAACTGAAAT 


CTTCCAAGGA 


480 


CGTAAATTCA 


AGACTTACCG 


TGGTATGGGA 


TCAATTGCTG 


CTATGAAGAA 


AGGTTCAAGC 


540 


GACCGTTATT 


TCCAAGGTTC 


TGTCAATGAA 


GCAAACAAGC 


TTGTTCCAGA 


AGGAATTGAA 


600 


GGTCGTGTTG 


CTTATAAAGG 


AGCGGCAGCT 


GATATTGTTT 


TCCAAATGAT 


TGGTGGTATT 


660 


CGCTCTGGTA 


TGGGTTACTG 


TGGTGCAGCT 


AACCTTAAAG 


AACTACACGA 


TAATGCTCAA 


720 


TTTATTGAAA 


TGTCTGGTGC 


TGGTTTGAAA 


GAAAGCCATC 


CTCATGATGT 


GCAAATTACT 


780 


AATGAGGCAC 


CAAATTATTC 


TATCTAAAAA 


ACAATGAAAA 


GAACTCCAGT 


GAAAACAGGA 


840 


GTTCTTTTAC 


AATGTTGTCA 


ATTTCCATTT 


ACAGCAGCTT 


TACCATCCTG 


AATAGTGAAG 


900 



240 
300 
360 
420 
480 
540 
600 
646 
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ATACTTAGAT TTTCTGGCAG ATTTTGAAGA TGGTCTAAGC TT G T T GTTCT GATAAAGGTT 9 60 

TGGATTGATT GAGAAATCGT TTCTAATAAT TTTAACTGTC TAGTGTTGTC AAGTTCACTC 1020 

ATCACATCGT CAAGCAGTAA TATAGGAGAT TCTGTGGTAA TGCTTTCCAT TAATTCGATT 1080 

TCTGCTAATT TTATCGAGAG GACGAGACTA CGATGTTGAC CTTGGCTTCC GAAACTAGCA 1140 

TCCATCCCAT TTATATAAAA AGAAATGTCA TCTCGATGAG GACCGACACC AGTATTCTTT 1200 

TTAAATAAAT CTCTGGATCT ACTTTTTTCT AAAGCAATTT TGAAAGATTC GGATAAGTTT 1260 

TGTTTGTCAG TTATATTGAC AGAAGATTGA TAGCATATTG ACAACTCTTC GATCTGATTA 1320 

GAGAGTTCAA AATCTTTCTT ACGCCCAAAT GATTCTACTT TTTTTATGAA ATCTAAGCGG 1380 

TGATTCATTA CACGACATCC ATAATCAACT AGCTGATCAT CTAACACAGA AAGGAATCTT 1440 

TCATCTATTT TTTGAGCTGA TTTTAGGTAA GTCTTTCTTT GCTTTAGGAT GTGGTTATAA 1500 

TTGGTTAACT CAGATAAATA GATTGGCTTA ATTTGCCCAA GTTCCATATC AATGAATTTT 1560 

CGTCGAATCG AAGGTGCTCC TTTAATTAGT TGTAAATCTT CAGGAGCAAA TAAGACAACA 1620 

TTCATGTGTC CTACATAATC TGAAAGGCGT GCC 1653 
(2) INFORMATION FOR SEQ ID NO: 330: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

GAAACACTGT ATTTCAAAGC ATTTTTTGTT ACTTTAAAAT TACTCCCATT CTTCTTTTCC 60 

AAACGTACAA TATATCCAAA ACCATTCAAA ATACTAGATT CTATTTTTTA TAATATCACT 120 

AAATCCACCT AATTATAGGA C GTT TTCAGA TTTTTAGTCC CAGTCCCAGT ACCGGAGAAA 180 

TATTGTTTTA ATATAATATC TCTTTTTGTC TTCTAAGCTC TTAAAAGCAA AAGAACAAGT 240 

AAAGAGTCAA GACAAGGATA AAAAGTCCAT ATTAGGGCAA ATAAAAAGCT TTAAGACAGA 300 

TGACAAATCT AAGTCAAATA AGAAAGACCA TAGCAAAGGT GCAGAGAGAT AAATATTGGC 360 

GGTCTTCGGA CTGCCTTTAT TTTTTTATCC ATTTTTCAAA TCAAATTTAT TC AG ACT AT A 420 

TATGCACATA TACACTTAAA TT CAT AT AAA AACATGGCTT GTAAAAAATT ACTTTAATCA 480 

CAATAATCGC ATTTAAAATT GTGATGTTTG CAAGCTAAAT TACGGACTTC ACTTCGAAGT 540 

TTTCCCTTGT ATCTTTTATA ATAGATAGAA AATTTGCTGG CAGATGAATA TCCAACAGAT 600 

TCTGCTATCT CTTTTATAGG TAGTTCAGTG TTTAAAACAA GAGTTTCAGC TACATTCATT 660 
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CTTTTTCTTT GAGTGTACTC TGTAATGCTT TGACAATATT TTTCCTTAAA TAAATTTTTT 720 

AATTTAGTAC CACTCATTTT AGATATTTTT TCAAGCGTGC CTTGATTTAC ATTCGTTGCA 780 

AAATGATCAT CTAAGAATCT TGCTACATCT TCAAGTGCTT TATCATCATC AATTTCAATT 340 

TTATATTTTT TTCTATTTAA GTATGTGTCA ATTACTATAC TTATCCATTC ATTTGCCTTT 900 

GCTTTAAAGA AAAAATCAGC GGCAGGAGCG TCCATCTTAC AATTTAATAT TTCCATTGCC 960 

ACTC TTTCTA A GGCC TTTGT AAGTATTATT TGATTCGG TT GAAGCAAGGT TGAATAAAAJ . 1020 

GATTCTGGAT TAATGTTAAT AGATGCTAAA TGTTTTTCTA TTAGCTCTTT TTTAAAACCm 1080 

ATGGAAACAG CAAGATAACA ACAATTCTCG TGTAATAAAA AAACAAAATT ATCTTTTATA 1140 

TTATCAAAAT CAAAAGTACA TAGAGAGTTT GCGGTAATAG TTTGATACGG ATTAAACTTT 1200 

TCTCCGTTTG CACTGACAAT GTAACTTGAA TAAATTGAAA CATAGTCTGA CATACTATAA 1260 

GTGCTATTTT GAACTACTTC CTCTTTGATA TAAAAATCAT GTATATCGAT AATGAAGATG 1320 

CCTCCTTCAT AAAACCGGTA 1340 - 

(2) INFORMATION FOR SEQ ID NO: 331: 

U) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH : 607 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

TATGTTCGTG ATGAGTTTTT AAGTAGGAAA AACGTGCTAA CCTCTCAGAT TTTGGAACTT 60 

GTAAAAGAAA CTCTTTTTTC ACCCGTAGTA GTTGATAATG GGTTTGATCC GGCCTTATTT 120 

GAAATTGAGA AAAAACAATT GCTAGCAAGT TTAGCAGCTG ATATGGATGA TTCTTTTTAT 180 

TTTGCACATA AAGAATTGGA TAAATTGTTT TTTCATGATG AACGTCTTCA ATTGGAATAT 240 

AGTGATTTAC GAAATCGTAT TTTAGCTGAA ACTCCACAAA GTTCTTATTC TTGTTTCCAA 300 

GAATTTTTAG CCAATGATCG AATAGATTTC TTTTTCCTAG GTGATTTTAA TGAGGTTGAA 360 

ATTCAAAATG TATTAGAATC ATTTGGCTTT AAAGGTCGAA AAGCAGATGT GAAGGTTCAG 420 

TATTGTCAAC CTTATTCTAA TATCCTTCAG GAAGGTATGG TTCGGAAAAA TGTGGGACAA 430 

TCCATTTTGG AATTAGGTTA TCATTACTGT TCTAAATATG GTGATGAGCA ACATTTACCC 540 



ATGGATTGAA TGAATGGTTT ACTTGGTGGA TTTGCTCACT CTAAGCTCTT TACAAATGTC 
CGGGAAA 



600 
607 
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(2) INFORMATION FOR SEQ ID NO: 332: 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 



TTAAAATACC 


GAATTTTGTT 


TTGTCCTCTA 


TTTCAACATT 


GTGAATCGCC 


TCAGGCAGAG 


60 


AACCGATACT 


AAAGATATAA 


CCAAAATAGT 


TGTCATTTGC 


TTTACCGATA 


TCAATCTTAT 


120 


TGGTTAAATC 


AAAATCCAGT 


TCGTCAATTG 


CGCCATCGAT 


GTCTTGATTG 


ATTTCCAAAA 


180 


GTTTTGTAAT 


GAGGTTACCC 


GTACCGCCTG 


GGATAATCCC 


TAACTTAGGA 


ATGTAGTCTC 


240 


TCTCATCAAT 


ACCTGAAATG 


ACTTCATTGA 


CAGTTCCATC 


TCCACCAAAC 


ACAACCACTG 


300 


CATCATACTG 


CTCACGAGAA 


GCTTCTTCAG 


CAAAATGTGT 


TGCATCCAGC 


GCTTTTTCGG 


360 


TAATTTTGGT 


TTCAACATAT 


TCAAAGTATT 


CTTTTGCTTT 


ATTCTCCAGC 


TTTTCTTTGT 


420 


AATCCAAAGC 


CTTCTCGCCA 


CCAGAAGTAG 


GGTTGATAAT 


TACCATTGCT 


TTTTTCATTG 


480 


ATTTTATCCT 


TAATTTTAAA 


CAGAAATGTT 


TACATTTCGT 


CGTATGCAAG 


TAAATGTAAT 


540 


CCTATTATAC 


AATGAAAATA 


CAGAAAAGAG 


AAATCTGACG 


TACTGGAGAT 


TAATACGCTT 


600 


TTATTCTATT 


TTCCCATCGC 


CTAACTACAT 


CCTTTAAGGG 


TTCATCCAAG 


TAAGAATAGG 


660 


CCTTATCCTT 


GATCCAATCA 


GGAATACCGT 


AAGCTGCCTC 


TGCTAwGCTA 


CAAGTGATTG 


720 


CTGCGAGAGT 


ATCACTGTCG 


CCACCAAGTG 


AGATGGCATT 


TCTTATCGCA 


TCTTCGAAGT 


780 


CTCTACTTTC 


AAGAAAGGCG 


ATAATGGCTT 


GAGGGACAGT 


TTCCTGACAT 


GTTTCGTTAA 


840 


AACGATAGTT 


AGGACGGATT 


TCATCTAAAG 


TTTGAGATAG 


ATTGTAATCG 


TATTCTTTTT 


900 



(2) INFORMATION FOR SEQ ID NO: 333: 

< i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
{ D) TOPOLOGY: linear 

(xi) SEQUOICE DESCRIPTION: SEQ ID NO: 333: 

CCTTTCTGGC ACACTGGTCT TGGAATACGG CAAAACCTCT GAAAATATCT ATGCTGGAAT 60 

GGACGAGGAA TACCGTCGTT ATCAGCCTGC CATCATCACT TGGTACGAAA CAGCCAAACA 120 

TGCTTTTGAT CGCGGACAGA TTGGCAAAAT ATGGGTGGAA TCGAAAACGA CCTCAAGGGC 180 
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GGTCTCTACA GCTTTAAATC CAAGTTCAAT CCGACCATTG AGGAATTCCC TGGTGAGTTC 240 

AACCTGCCAA CTAATCCTCT TTACCACCTC TCCAATCTCG CCTACACTCT CAGAAAGAAA 300 

CTGCGCAGaA GcATTAACAG AAAGGAAGCC TATGACCTTT AAACTTCTCA GCCAAGAAGA 360 

ATTCATCCAG CATACCTCAG CTAGATCCCA ACGCTCTTTT ATGCAGACCG TAGAAATGGC 420 

AGAGCTGCTG AGCAAGCGTG GCTTCAGTAC CCAGTATGTC GGCTACACTG ACCCACAAGG 480 

GAAG GTAG TG GTGTCAGCTG TCCTCTACAG _CAT|GCCT^TG_^CTGCT _53 3 
(2) INFORMATION FOR SEQ ID NO: 334; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

CCAGCAAACT AGGAAGCTAG CCGTAGTTCC TCAAAGCACA GCTTTGAGGT TGTAGATAAG 60 

ACTGACGAAG TCATGTACAA AACACTGTTT TGAGGTTGCA GATAGAACTG ACGAAGTCAC 120 

TCAAAACACT GTTTTGAGGT TGCAGATAGA ACTGACGAAG TCACTCAAAA CACTGTTTTG 180 

AGGTTGCAGA TAGAACTGAC GAAGTCAnnA ACCACACCTA CGGCAAAGTG AATCTGAAGT 240 

GGTTTGAAGA GAGTACAACT TGTCTTTTAG AAAAGGAGCC TATAATGAAA GTCTTTCAGC 300 

ATGTAAATAT CGTGACTTGT GATCAAGATT TCCATGTTTA TCTTGATGGA ATCTTAGCAG 3 60 

TCAAGGATTC TCAAATCGTC TATGTCGGTC AAGATAAGCC AGCGTTTTTA GAGCAAGCTG 420 

AGCAGATTAT AGACTATCAG GGAGCTTGGA TTATGCCTGG TTTGGTCAAT TGTCACACCC 480 

ATTCTGCAAT GACAGGTCTG AGAGGGATCC GAGATGACAG CAATCTCCAT GAATGGCTCA 540 

ATGA 544 
(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 5: 
CCAGGAACTC AAATGTAAGT ACGGGTTCCT TTTTTGTATA TTTTTCAAAT AACGCCTCTA 



60 
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CACTATTTGT ACCAAATTCA CCAACTACAG TTGTATCTTA GTTAAAATAA CTTAGAATAT 120 

GTAAGTGAGT ACCAGATATA CCAAGACATC GTCACCATCT AAGGTATATT CAAAATACAA 180 

AAGTTGACCA ACTAGATTTC TGAATATCCT TATATATCCA TTCTTAAAAT TGGTTTAAAT 240 

AGCGTAGTCT TTTAAACTAG TTTTGAGAAT CCAAAAAATC TTCCTACATA TGTAAGAAGA 300 

TTTTTTAGTT CAGAATGATT AGaTTTAGCT AATGGATACC TATCCTACC 349 

(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 base pairs 
<B> TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 



CTCCGATAAC 


CACACCAGCA 


ATGGAAATAA 


TTCCATCGTT 


AGCATCAAGA 


ACACCCGCAC 


60 


GCAGGATATT 


TAAACGACCT 


GCAAAATTTG 


AATCAATTTC 


GTGATTTGTT 


TCTGACGCTA 


120 


AATTTCAAGT 


TCAAGTTAGC 


CATCAAGAAG 


TCTTCTCTGG 


GTGACTTGTA 


GTCCAAGCAT 


180 


TTTTTAGGAT 


AGTTGTTAAT 


CCACTTTTCG 


ATGAATGCGA 


CTTCTTTGGG 


AGTCATTTTC 


240 


TTGGTTCCCT 


TAGGTAACCA 


TCTACGAATG 


AGCCTGTTGT 


GATTCTCATT 


AGTTCCCGGG 


300 


ATCCTCTAGA GTCGACCTGC 


AGGCATGCAA 


GCTTGGCACT 


GGCCGTCGTT 


TTACAACGTC 


360 


GATGACTGGG 


GAAAACCCTG 


GCGTTACCCA 


ACTTAATCGC 


CTTGCAGCAC 


ATCCCCCTTT 


420 


CGCCAGCTGG 


CGTAATAGCG 


AAGAGGCCCG 


CACCGATCGC 


CCTTCCCAAC 


AGTTGCGCAG 


480 


CCTGAATGGC 


GAATGGGGCC 


TGATGCGGTA 


TTTTCTCCTT 


ACGCATCTGT 


GCGGTATTTC 


540 


ACACCGCATA 


TGGTGCACTC 


TCAGTACAAT 


CTGCTCTGAT 


GCCGCATAGT 


TAAGCCAGCC 


600 


CCGACACCCG 


CCAACACCCG 


CTGACGCGCC 


CTGACGGGCT 


TGTCTGCTCC 


CGGCATCCGC 


660 


TTACAGACAA 


GCTGTCACCG 


TCTCCGGGAG 


CTGCATGTGT 


CAGAAGTTTT 


CACCGTCATC 


720 


ACCGAAACGC 


GCGAAACGAA 


AGGGCCTCGT 


GATACGCCTA 


TTTTTATAGG 


TTAATGTCAT 


780 


GATAAGGATG 


orrrcTTAGA 


CGTCAAGTGG 


CACTTATCGG 


GGAAATGTGC 


GCCGAGACCC 


840 


TATTTGTTTA 


TTTGTCTAAA 


TACATTCAAA 


TATGTATCCC 


CTCGTGAGAA 


AATAAACCTG 


900 


ATAAATGCGT 


CAATAATATT 


GAAAAATGAA 


GAGTATGAGT 


ATTCTACATT 


TCCGTGTCGC 


960 


CCTTATACCC 


TTTTTTGCGG 


CATGTTGCCT 


TCCTGTTTTT 


GCTCACCCAG 


AAAACGCTGG 


1020 


TGAAAGTTTA AGATGCTGAA 


AAATCATTTG 


GGTGCACAAC 


TGGGGTTACA 


TCCAACTGGA 


1080 


ATCTCCAnCA 


GCAGTTAAGA 


TCCTCTGACA 


GTTGTACACG 


CCGCAAGAAC 


TATTCCCGAT 


1140 
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CAATGAGCAA CTTTTAAAAG TCCTGCGAAT GTTGGGGCGG TAATAATCCC CGTGTTGTAG 1200 
GCCCGG 120 6 
(2) INFORMATION FOR SEQ ID NO: 337: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

CTGCTCAACT CAGACAGTCA AATTTCTCAC TTTACCAAAA GAACCATCAA AAAAGTTGCT 60 

GAAAAAGGCC ATCAGGTTAT TATTACGACA GGTCGCCCTT ACCGTATGTC AAAAGATTTT 120 

TACCGTGAAC TCGGCTTAGA CACTCCTATG ATTAACTTCA ACGGATCCCT TACTCATTTA 180 

CCAGACCAAG TTTGGGATTT TGAAAAGTGT TTCACTGTAC ACAAAAAATA TCTGCTAGAT 240 

ATGGTTCAAC GTTCAGAGGA CATTCAAGCC GATTTTATCG CTGGAGAATA TCGTAAAAAA 300 

TTCTACATTA CAAATCCCAA TGAAGAAATT GCCAATCCCA AACTATTTGG TGTAGAAGCT 360 

TTCCAGCCTG AAGATCAATT CCAGCCTGAA TTGGTGACCA AGGACCCTAA CTGTATCCTC 420 

TTGCAGACTA GAGCCAGTGA CAAATATTCC TTGGCAAAAG AAATGAACGC CTTCTACCAG 480 

CATCAACTTT CTATCAATAC CTGGGGAGGT CCGCTCAATA TCCTTGAATG TACCCCAAAA 540 

GGTGTCAACA AGGCCTTTGC TTTGGACTAC TTGCTCAAGA TAATGAATCG TGACAAAAAA 600 

GATTTGATTG CCTTTGGAGA TGAACACAAT GATACCGAAA TGCTCGCTTT TGCTGGGAAG 660 

GGTTATGCCA TGAAAAATGC CAATCCAGAG CTACTCCCTT ATGCAGATGA GCAAATTTCC 720 

CTTACCAACG ACCAAGATGG GGTTGCCAAA ACCCTACAAG ACTTATTCTT ATAACCTATA 7 80 

CTGATACTCA ATGAGGGGCA AAGAGCGAAC TTA 813 
(2) INFORMATION FOR SEQ ID NO: 338: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
{D> TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 
CCTAGATAAA TGATATAATT CTATTATTGT TCGTAAAAAT TAAAAGGAGA TTGATGATGG 60 
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ACAAATTATT TAAACTAAAA GAGAACGGTA CAGACCTTCG TACAGAGCTT CTCGCTGGTT 120 

TAACAACTTT CTTTGCAATG AGCTATATTC TCTTTGTAAA CCCACAAATA CTTTCACAAA 180 

CAGGAATGCC TGCTCAGGGC GTCTTCCTAC CGACGATTAT TGGTGCAGTA GCGGGTACCT 240 

TGATGATGGC TTTTTATGCT AACTTACCTT ATGCCCAAGC GCCAGGTATG GGACTCAATG 300 

CCTTCTTTAC CTTTACAGTT GTATTCGGGC TTGGTTATTC TTGGCAAGAA GCCCTAGCTA 360 

TGGTCTTCAT CTGTGGGATT ATTTCATTGA TTATTACCTT GACAAATGTT CGTAAAATGA 420 

TCATTGAATC GATTCCCAAT GCTCTTCGCT CACCTATTTC AGCTGGTATC GGTGTCTTCC 480 

TTGCCTATGT AGGGATTAAG AATGCTGGAC TTTTGAAATT CACCATTGAT CCAGGCAACT 540 

ATACTGTTGT AGGAGAAGGG GCTGACAAAG CTCAAGCAAC GATTGCAGCA AACTCTTCAG 600 

CAGTTCCAGG ATTGGTCAGC TTTAATAATC CAGCTGTTTT AGTGGCTCTT GCAGGACTTG 660 

CCATTACTAT CTTCTTTGTC ATC 683 
(2) INFORMATION FOR SEQ ID NO: 339: 

<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 



CTACTTTACA TGCAAGTAGT 


CACTGAATTC 


CAGTTAGAAA 


TTACTTTGTA 


ACTACGTTTT 


60 


GAGGAGGAGT AAAATGCTTT 


CCTACGTTCG 


ATATTACCCA 


CTAGCGATAG 


CTAAATTAAT 


120 


GTGTCTGTGC TCTCCTAAAA 


TCTGCTGATT 


TATTACTGAC 


TAATACAGGA 


GGTTTTTTTT 


180 


ATGgACAGAC AATCATATCT 


GCTATTGGTG 


TTTATATTTC 


CACCAGTATC 


GATTATTTAA 


240 


TTATTTTAAT TATTTTATTT 


GCACAGCTAT 


CACAGAATAA 


ACAGAAATGG 


CATATTTATG 


300 


CGGGGCAATA TCTAGGCACA 


GGCTTACTTG 


TAGGGGCGAG 


TTTAGTTGCT 


GCTTATGTCG 


360 


TTAATTTCGT GCCTGAAGAA 


TGGATGGTTG 


GATTGCTTGG 


TTTAATCCCT 


ATCTATTTAG 


420 


GGATTCGCTT TGCAATTCTT GGAGAAGATG CGGAAGAAGA AGAGGAAGAA 


ATTATTGAAA 


480 


GATTAGAACA AAGCAAGGCA 


AATCAACTGT 


TTTGGACAGT 


TACATTGCTG 


ACAATTGCGT 


540 


CTGGCGGAGA TAATTTAGGT 


ATCTATATAC 


CTTATTTTGC 


TTCGTTAGAT 


TGGTCACAGA 


600 


CCCTCGTGGC CTTGCTTGTG 


TTTGTAATCG 


GCATAATTAT 


CTTTTGCGAG 


ATTAGTCGGG 


660 


TGTTATCCTC TATTCCGTTA 


ATATTCGAGA 


CAATTGAAAA 


ATACGAGCGA 


ATCATTGTGC 


720 


CCTTAGTATT CATTCTACTT GGACTATACA TCATGTATGA AAATGGCACG 


ATAGAGACTT 


780 
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TTCTGATCCT GTAGATTTTT TTGTTTCACT AGGGATTTAG CCCGAGCTCA AATCAGCTCT 
CTGATTTTCA GA 

(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 754 base pairs 
(B) TYPE: nucleic acid 
CO STRANDEDNESS: double 
) JPOPOLpGY :_ 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 

CCGCACAAAA GCGCATACTA TCAAGATTCT ATAAAGCCTT GATACTATGC CTTTTTAATG " 60 

GATAAATAGT TACTCTTTTT TAAAGACCGG ATCTTTCAAA CTCTGCATAC TGGCATTGAT 120 

CACCGCGCCT AGGATAACAA TTTTAGCAAT CAAGATAAAC CAAAACATCA TAACAACAAC 180 

AAGAACGGAA CCTAAAATTC GGACATCCAC CAAATGATGG ACATAGTAAT TGAGATAACT 240 

AGAGAACAGA GTTAGTAAAC CTAAAATCAC TAAGAGAACA AAGGCACTGC CTGGTACCGT 300 

ATAGCTAATT TTCCTGTTAG ATAGATTGGG AAGAAAATAA TAAAGCATGA CCAAGATAGC 3 60 

AAAGACGAGG GCGTAAATCA GAGGACCTGC CAACCCTTGT AAAGCCTGAT AGATAATGCC 420 

ATCTTTTCTC CAATAATGAG CAAGTAAAGC CAAAATCATC TGACCAAATA AGATCAAAAA 4 80 

CAAGGCAAAC GCAAAGAGGA GCTGCAACCA AAACTCACTA GGAGACTTAG CATCTGATGG 540 

GAAATAAGTC CACGACTCTT TTCGACGCCA TAAGCCTTGT TAAAAGCTTT TTGCAAGAAA 600 

TT CAT AG ATT TTGAAAAACT CCATAACGCC GATAAAACAG AAAAACTCAA TAAACCTGTT 660 

GAAGGTTGCG TCAAGACTTC TCTGGCTATT TTTTCCACAC CTTCATAGAG GCTTCGGGGG 720 

CAGACCTCTT TCATAAAGCC CAAAAATTCT CCCA 754 

(2) INFORMATION FOR SEQ ID NO: 341: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 
GGGCATAACT CTAGGAGTAC CGCTATTACT CGACTTAATG AGTGCACAAG AAGTCAGGAT 
TTTTATGCAG GTTGGGCGCT TCATCAGACA GGGAAGATTT ACAGCGACTA TTATGGAAGT 



60 



120 
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CAAGGTTTGC TTTATTATTT GCTGACTTAC GTGAGTCAGG GCGGATTTTT CTTTGCCATC 180 

TTTGAGTGGT TAGCCTTGGT AGCAGGAGGA TTTTTCCTTT TTAGATCAGC GGACACCTTG 240 

ACAGAGCAAG GAGACCAAGC TGGACATCTG CTGACTATTT TTTACATGCT AGTTACAGGT 300 

CTTGCTTTTG GTGGAGGCTA TGCGACTCTT TTAGCGCTTC CTTTCTTATT CGCAGCCTTT 3 60 

AGTTTAGTTG CGGCTTACCT AAGCAATCCA AGCCATGATA AGGGATTTGT ACGGATTGGG 420 

CTAGCTTTGG CAGGCGGATT TTTCTTTGCT CCCTTATCAT CGCTCCTGTT TATTGCTGTA 4 80 

GTGAGTTTAG GCTT G TTGGT CTTTAACCTT GGGCATAGAC GCTTTGCGCA TGGGTTTTAT 540 

CAGTTTCTTG CAGTGCCTTT AGGTTTTTCA CTTGTCTTTT ATCCAACTGC CTACTATAGT 600 

GCTGCAACAG GAA G TTTTGG GGATGCGwTT AGTGGTATTC GTTATCCTAT TGACAGTATT 660 

CGCTTTGATT TTACTTCTAA AA TT TTAGAG AATATGTTTT TTTAAGG 707 

t2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 
IB) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

GGATTTTGAA AAACCATACC GATTTGACGA CGTATATTCC AAACATTTTC CTCAGTCAAA 60 

CGTTGGCCAT CAATTACAAT CTCTCCGGAT TCTGCTTCCA GTAAGCCATC AATTAATCGA 120 

ACCGTCGTTG ATTTACCACT ACCATTATGC CCTACAATCG AAAGCCATTC TCCACCTTTC 180 

ACGTGAAAgT AATATCCTTC ACATCGTAGT AGTTCTGATT TTCTTTATAG CGAAAAGAAA 240 

GATTTTTTAC ATCAATTATT GATTTCATTT CGAACCAAAT GTCCCTTTAA ATACATAGGC 300 

ACTACCCTTG AAATAGTCAT AGCCAGAGTA GATAGTGAAA AATAAGGCTA CATAAAGTAG 360 

AACTTGACCA AGCAAAGTCC AATGTAATAG CAAGAAAATA ATGGCAAACA TCTGACTAAA 4 20 

AGTTTTAATT TTTCCAGGCA TTGCTGCTGC TAAAATTGTT CCACCAGTTT CAACCAATAA 480 

AAGCCTTAAA CCTGTCACAG CTAACTCACG ACAGATAATC ACTGCAACAA TCCAAGCCGG 540 

AGCCATACCT AACTCAATCA ACATAATAAA AGCCGACATA ACTAGTAACT TATCCGCCAT 600 

AGGATCTGCA AATTTACCAA AATTACTGAC CACATTCCAT TTACGAGCTA AATATCCATC 660 

TAAATAGTCG GTAATACTGG CAACAGCAAA GATAATAGCT GCAACTATAT GACTCTCTAT 720 

CGAATTTCCT ATCGTTAAAA TAAAGATAAA AATAGGTATA AA 762 
(2) INFORMATION FOR SEQ ID NO: 343: 
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(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 482 base pairs- 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 
CD) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 3: 

ClTM'GAT AC ACTTAAACTA TGAATACAAA TCTCAAGCCC AAACTTC AGC GTTTTGCTTC .6 0_ 

TGCGACTGCC TTTGCCTGTC CTATCTGTCA AGAAAATCTG ACTCTGTTAG AGACTAATTT 120 

CAAGTGCTGC AACCGTCATT CTTTTGACTT GGCGAAATTT GGCTATGTCA ATCTAGTCCC 180 

TCAAATCAAG CAATCTGCTA ACTACGACAA GGAAAATTTT CAAAACCGTC AACAAATCCT 240 

AGAAGCCGGC TTTTACCAAG CTATCTTAGA TGCTGTATCT GACTTGCTTG CAAGCTCAAA 300 

AACTACCACA ACAATTTTGG ATATCGGTTG TGGTGAAGGA TTCTATTCTC GCAAACTACA 360 

AGAAAGTCAC TCTGAAAAAA CTTTCTATGC CTTTGACATC TCCAAAGATT CAGTCCAAAT 420 

CGCGGCTAAA AGTGAACCCA ACTGGCCAGT CAATTGGTTC GTTGGCGACT TGGCACGACT 480 
TC 

(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 



482 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

TTTATTTTTA TAAAGTCAAT ACCTGTCTTT ACTTTTTCTT AAAAAAAGTT TATTATGTTC 60 

TTTAAGGAGG TGTAAAACAT GAAAATAAAT AATAAACTCG TTGGAGAACG TATTCAAAAT 120 

ATCCGTTTAA GCCATGGCGA CTCTATGGAA AAATTTGGAG AAAAATTTAA TACTAGCAAA 180 

GGTACAGTTA ACAACTGGGA AAAAGGTCGC AATTTACCAA ATAAAGAAAA CCTACTAAAA 240 

ATTGCATCTA TTGGAAAAAT GAGTGTTGAA GAGTTACTCT ACGGCGATTA CAATACTTAT 300 

CTACACTTAA AGATTATGGA TTTAGCTCCT GAATGTATAA AAAATTATGA TGAGTATAAC 360 

TCTTTACACG ATGATATAAC AAATAAAGCG TTACAGATCG CTCAAAATAC CATTTCTAAG 420 

ATTGATTATC AAATTTCAGA CGAAACGATC AAAAAATTTA TTGATTTAGC TATCGAACAA 480 

TCGAGAGATT TGCAAGGAAA TTTGTTGAAA AATAACGGGT 520 
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(2} INFORMATION FOR SEQ ID NO: 345: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EONESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 



GCATCAAATC 


CGCCATCAAA 


GAAGTTCTCT 


GGATTTACCA 


AGACCAGTCA 


AATACCTTAG 


60 


AAGTGCTTAA 


TGACAAGTAC 


AATGTTCACT 


ACTGGAATGA 


CTGGGAAGTT 


GGAGACACGG 


120 


GAACCATTGG 


TGAGCGCTAT 


GGTGCCGTTG 


TTAAGAAACA 


CGACATTATC 


AATAAGCTTC 


iao 


TCAAACAGTT 


GGAAACCAAT 


CCTTGGAACC 


GCCGCAATAT 


TATTTCGCTC 


TGGGATTACC 


240 


AAGCTTTCGA 


AGAAACAGAT 


GGGCTGCTCC 


CGTGCGCCTT 


TCAGACCATG 


TTTGATGTTC 


300 


GGCGTGTTGA 


TGGGGAAATC 


TATCTGGATG 


CGACCTTGAC 


CCAGCGCTCC 


AATGATATGC 


360 


TGGTGGCCCA 


CCACATCAAC 


GCTATGCAGT 


ATGTGGCTTT 


GCAGATGATG 


ATTGCCAAAC 


420 


ATTTTGGCTG 


GAAGGTTGGG 


AAGTTCTTCT 


ACTTCATCAA 


CAACCTCCAT 


ATCTATGATA 


480 


ATCAATTTGA 


ACAAGCTCAG 


GAATTGCTCC 


GTCGGGAgCC GTCAAACTGC 


CAACCACGCT 


540 


TGGTTTTAAA 


TGTTCCTGAT 


GGGACTAATT 


TCTTTGATAT 


CAAAGCAGAA 


GATTTTGAGT 


600 


TGGTGGATTA 


TGACCCTGTT 


AAGCCACAGT 


TGAAGTTTGA 


CCTAGCTATT 


TAAAAGAATA 


660 


GAAAAAAGAA 


GTTGAGAATA 


ATCCCAACTT 


CTTTTGTTTC 


TTAACGTGAT 


ACGCGGCGAC 


720 


GAGCTGCTTT 


TTTACGGTTT 


TCTTCGATGA 


AAGCTGCTTT 


TTGCTCTTCT 


GGTTCGATTA 


780 


CTTTCTTTTT 


AAATGCGTAT 


ACTGCACCTG 


CAACGGCAGC 


GACAGTTCCT 


GCGACACCTG 


840 


TTACAAGACC 


TTTAGCGAAT 


CCTTTAGCCA 


TGAGTCTTCC 


TCCTTTATAT 


TCTCAATCAG 


900 


CCAGCCTCCT 


CAAGAGGTCA 


CATTTTTCTG 


ACTGACCTTT 


TTGTGTTATA 


ATAATAGTAA 


960 


CGAAAAAATG 


GGAATTTTTC 


AAGGAAAAAA 


GATGAGAACA 


AAA 




1003 



(2> INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 
CCGCACGTAC TATTCCAGAT GCCGAGGAAG TGGACCTCAT CCTCGTTGCC GCAACTGCTC 60 
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TCAACCCCTT TGAACGCCTC TTGGTCGGCT CTTCATCTGA ATACATACTC CGCCATGCTA 120 
AGGTCGATTT GCTGGTTGTG AGAGAACAAG AAAAAACCTT ATAATCACAA AGAAAAGGAG 130 
CCCCTAGCTC CTTTTTGTTT ACGATTTATT TCTCTCTTTA TGGCGTTCGT AAGCCTTGAG 240 
CTGGCGCTGC AGTTCCTTTT TAATAGCAGG TTCTGGACCA TATTTTTCTT CCCAATTATC 
TCCTTTTAAC ATTTTATGGG TCACTGGATC AAAATGAGCC TTGCCATCTG GAAAAATTTT 
CCCCATATTC GCCTGATGGA CAATATCAAA AATACGTTCT GGGTCCACCC CCATCAACAC 
AAAACTGCCG TAGGTGAACT AAAGCGTGTC AATCAAGGCA TCCACTTCCC CTATCAAATC 
TTGCTGAGCA GGTCTCTTCT TGGCTACTTT ATCTGCTGCC TTATCAAGGG CCTGATGAAG 540 
TTGCGATACA GCTTGACCAA AATCTTCTTC AGAAGGACTG GCTGCTCGAA CAAACTCCAC 600 
CAATTCTTCT ATTTTAAAAC CAGCCCTATG GGTTGCACCC TCTAAATCCC AAGCTCGAGG 



300 
360 
420 
480 



660 



750 



TTCTTCTTGG GTTCGTTCAT CCATCATGTG GTGGAAAGTC TTGACCTTAT TGAAATGATA 720 
GTCACGGCTG ACAAAGACTT TTTCTGAAGA 
(2) INFORMATION FOR SEQ ID NO: 347: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 596 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EONESS : double 
(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 

CGCAACATAC GGATAACCTC CAAAGAATAT TTTTATATTA TAGCAAAGCT TTAAATTGAA 60 

TGTTAGAGTC TTGTTCAAAA CAATCATCAA AACCACGTGG ATGATGGTAT TCTACTAAGT 120 

GTTGATCTTG AGGATAAGTG TACTTACCGC CAACTTCCCA GATAAATGGA TGGAAATCGT 180 

ATTGCAAGCG ATCTTTTCGC ATTTTCCAAA GTTCTAGAAT CTCATTAGTA GAAGCCATGA 240. 

AGTTAGACCA GATATCATAG TGAACTGGGA TAATGACTTT GGTACGCAGA TTTTCTGCCA 300 

TACGAAGAAG GTCGATAGAT GTCAJcTTTGT CTTGGATACC TACCGGATTT TCACCATAGT 3 60 

TATTCAAAGC AACATCAATT TTAAAGTCTT TACCATGTTT TGCAAAATAG TTTGAGAAGT 420 

GAGAATCTGC ACCATGATAG ATGGTTCCAC CTGGTGTTTC AAAGATATAG TTAACAGCCT 480 

TTTGAGCCAT TTCTTCATCT GTAACAGCCA AGCCAGCAgT TCACCGCCTG TCTCATCAGC 540 

ACCGTTCACT GGGAGAGTTA CCAAGCAAGT ACGGTCAAAT GATTCTACTG CATGAA 596 
(2) INFORMATION FOR SEQ ID NO: 348: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 



CAGAGTCAAC 


AGCCTGAGTT 


GAAGGCAACT 


TTAGACACAG 


CAGTTACGAC 


AGCTGAATGA 


60 


GCTCCTCCAT 


CAGTTTTTTC 


TTTAATGAGT 


CCAGCTACAT 


CTTCAACTTC 


GAGGCCGTTA 


120 


ATCACAATGT 


CAGCGCCTAC 


TTCTTTTGCA 


AGGGCAAGTT 


TGTCATTGTT 


GATATCGACT 


180 


GCGATAACAT 


GAGCATTGAA 


TACTTTTTTA 


GCGTATTGAA 


CAGCGAGGTT 


ACCAAGTCCA 


240 


CCAGCACCGT 


AAAGAACAAC 


CCATTGGCCT 


GCTTCAACTT 


TTGCTTCTTT 


GATAGCTTTA 


300 


TAGGTTGTTA 


CTCCAGCACA 


TGTGATAGAA 


GAAGCTTGGG 


CTGGATCAAG 


TCCGTCAGGA 


360 


ACTTTGACAG 


CATAGTCAGC 


AGTTACGATA 


CATTGTTCAG 


CCATACCACC 


GTCTACTGAG 


420 


TAGCCAGCAT 


TTTTCACTGT 


ACGGCAAAGG 


GTTTCGCGAC 


CAGTTGTACA 


GTATTCGCAA 


480 


GTGCCACATC 


CTTCAAAGAA 


CCAAGCAACG 


CTGACGCGGT 


CACCGACTTT 


AAGGCTTTTC 


540 


ACATCTGGAG 


CAATCTCTTT 


AACGATACCG 


ATACCTTCGT 


GCCCAAGAAC 


ACCTCCTGGG 


600 


ACTTGACCAA 


AGTCACCATG 


AGCAACGTGG 


AGGTCGGTGT 


GGCAAACGCC 


CACAGTATTC 


660 


ACTTCTACAA 


GTG 










673 



(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 
GTACCCTACA AATGCTTTAC AGTATGGGTT GAGGCTGGTC AATGGAACTA TGGAGTAGGT 
TGGACAGGAA CTTTTGGATA TTCTGATTAC TTACATTCTA CTCGATATCA TACAGCAACT 
GTTAGACATG GGGGTAGAAC CTCTAAGGAT TATGCAAAAC CTGAGGCATG GGCTAGAGCT 
TCCCTCACCA AGATTCCG 
(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 



60 
120 
180 
198 
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(C) STRANDEDNESS : double 
(Dl TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

GCTTCTTCTA TAGACAAAAA TATCATGGGT AAAATAATCA AGGCTATAGC TAGAAGGAGG 60 

GACCAATCCA CTACTAATCC TAAGAACAAA ACACTCAAGA GAGCAGAAGA GAGAGGTTCA 120 

^TGGCACTCA„TAACGGCAAC_CACCAAAGGA J#A „ 180 

AAAAAAGCAA AAGCCGTTCC AAAGAAAGCG ATAATGAGGC AAATCAAGAT ACTCCAAATA 240 

TCAAGAGTAA AGGAAAGCTG ATAAACCGGC GAGAGGACAT TGCTAAACAA ACCTGCCAAA 300 

ATCATCCCCC ACCCAACCGT AGGAACAAAA CCATAACGCT TAGCAAAAGG TTGGGGCAAG 360 

ATAACATTAA AC ATAAC A CC CATGGCACTC AGCAAACCTG TTATAAGAGC TAGCCGCGTC 420 

ATGGATAACT GACAGAGGTC TCCCTTTGTC GCCATCAAGC AAACACCCAG CATGGCAACC 480 

AAAACATAGA AAACAGCGCT TTTTGACGCT CGTTTTTCAT AAACCAAGCG ATTGTAAAAG 540 

AGGATAAAGA CAGGGCTAAT AAACTGTAAA ATAGTTGCTG TCGTAGCATT TGAGTATTCT 600 

ACACAGAGAT AGAAAAAATA CTGAACTGAA AAAATCCCCA AAATAGCATA GGCTAAAAAG 660 

GGCAGGTAAT TTTTCTTGTC TCGCCAAATA TCTAGCACTT GCGATTTTAA TTGTATTGCA 720 

GACCAAATGA GTACAAGACT CCCTGCCAGT GTCAAACGCA TAGAGGTAAT CCAGCCCGAA 7 80 

GACACCTGAT AATGAGTAAA GAAGTACTCT CCTAAAATTC CACACATTCC CCATATTAAG 840 

CCGGATAGGA GCGAATAAAT TTTTCCGTTA ACAATCTTTT TCTGATACTG A 891 



(2) INFORMATION FOR SEQ ID NO: 351: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

GAAAGCGTTC AATAGAACAT TGCTTTTTTA TTTTTAGAGT AAGCTAAGCG CTTCAGCATC 60 

TGCGATGATG GTTACATCAG GGTGATTTTG GAGGCTACTT GCAGGTAGGT TCTCAGTCAC 120 

TGGGCCAGAT ACTGTTCCGG CAATGGCTTC TGCTTTCGAC TCACCGTAAG CAAAAAGAAT 180 

AATAGACTTG GCATCCAAAA TGTTTTTAAT CCCCATTGAA ATAGCTTGGG TTGGGACGTC 240 

TTCAATCTTG GCAAAGAAGC GTGCATTGGC TTCGATAGTA GACTGGTCAA GTTCTACTAG . 300 
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ATGCGTTTCA CTGTCAAATG GAGTG 

(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 344 base pairs 
(BJ TYPE: nucleic acid 
tC) STRAND EDNBSS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 

CAAGAGCAGT TTGATGATTT TTGATAAGCA TGCGAATTTA AAATACAAAT ATGGCAATCG 60 

CAAGTTTTGG TGTAGAGGCT ATTATGTAGA TACGGTAGGC CGTAATCAGA AAGTGATAGC 120 

TGAATATATT CAGAATCAAT TACAAGAAGA CAGAGTAGCA GACCTAGCTC ACGTTATTCG 180 

AGTCAGTAGA TCCGTTTACT GGCGAAATAA ATAAGAGGAA GTAACGTnAA CTGCTTTAGC 240 

ACCTGCTCGG GAAAGTGGTG CGCGAGGAAG CTATTTCAGG ATGCTTTGGC CCTGGCCGGT 300 

AGAAGCGTTA TAGCCGCAGA CTACGACACT TCACACTGGT GGTT 344 
(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

CCCTATCCCT GCTATTGGGG CTGCTCTCAT TGCTGCTTTG GCACAAATCA CTCTTCCAAT 60 

TGGACCTGTT CCCTTCACTC TGCAAAACTT TGCAATCGGC TTGATTCTAC TGTCTTTAGA 120 

CCGAGAGAGG CTGTACTTTC TGCTGGACTC TATCTTCTTC TAGGTGCTAT CGGTCTTCCT 180 

CTCTTTGCAG GAGGTGGAGC TGGTTTTCAG GCTTTAGTTG GCCCTACTGC AGGCTATCTT 240 

TGGTTTTATC TCGTTTACTC TGGACTTACT TCCTCTCTAA CCAACAGCAA GAGTGGTGTT 300 

GTTAAGATTT TTCTTGCAAA CCTCTTGGGT GATGCCCTTG TCTTTGTCGG CGGGATTCTC 360 

AGCTTGCATT TCCTAGCTGG AATGGCATTT GAAAAAGCTC TTGCTGTGGG G G TTCTTCCC 420 

TTTATCATTC CAGACCTTGG CAAACTTCTA GCTATTAGTT TTATTAGCCG TCCCCTACTT 480 

CAACGCCTTA AAAATCAGGC TTACTTTACT AACTAAAAAA GGATATCGAG TTATCATGAC 540 

TCAATATCCT TTTCTTTTAT TTTGAAAACT TATACTCAAT GAAAATCAAA GAGCAAACTA 600 

GGAAGCTAGC CGCAGGCTnG CAAAACACTG TTTTGAGCTT GTGGATGAAA CTGACGAGTA 660 
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AnATCTCATA CATACGCCAA GCCAAAGCTG AC 692 

(2) INFORMATION FOR SEO 10 MO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1005 base pairs 
O) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 

GTGATGGACT ACTGGTTCAA AACGCATCCA GAAGATTTTT TCGATAATGT CGGACCTCTT 60 

GTAGCCAGTA ACTTTTTTCA TACTTACACC GAAGATTTCC ACTTGATGAA GGAAATTGGA 120 

GTTAATTCTT TCCGCACTTC CATCCAATGG AGTCGACTCA TCAAGAATTT AGAGACAGGT 180 

GAGCCTGATC CAAAAGGTAT TGCTTTCTAC AATGCCATCA TTGAAGAAGC TAAAAAGAAC 240 

CAGATGGATC TTGTGATGAA TTTACATCAT TTTGATTTAC CAGTGGAACT TCTTCAAAAA 3 00 

TACGGTGGTT GGGAAAGCAA ACATGTAGTG GAGTTATTCG TGAAGTTTGC CAAGACTGCT 3 60 

TTCACATGCT TTGGAGATAA GGTTCATTAC TGGACAACTT TCAATGAGCC AATGGTCATT 420 

CCAGAAGCAG GGTACTTATA TGCTTTCCAT TATCCAAATC TAAAAGGAAA GGGAAAAGAG 480 

GCCGTACAAG TCATCTATAA TCTAAACCTT GCTAGTGCAA AAGTGATTCA ACTATATCGC 540 

TCATTAGAAC TTGATGGAAA GATTGGGATT ATTTTAAACT TGACACCTGC TTATCCAAGA 600 

AGTAATTCTC CAGAAGACTT AGAAGCAAGT CGATTTACAG ATGACTTCTT TAACAAAGTC 660 

TTCTTGAATC CAGCTGTTAA AGGAACTTTC CCAGAAAGAT TGGTAAAACA GCTAGAGAGA 720 

GATGGCGTGT TATGGAGTCA TACCGAAAAA GAGCTTCAAC TGATGAAATC AAATACGGTT 7 BO 

GATTTTCTTG GAGTAAACTA CTACCATCCA AAACGTGTTC AAGCACAAGC AAATCCTGAG 840 

GAATATCAGA CGCCCTGGAT GCCAGACCAA TACTTCAAAG AGTATGAATG GCTGGAGCGT 900 

CGCATGAATC CATATCGTGG TTGGGAAATT TTTCCGAAAC CCATTTATCA TATTGCTATG 960 

ATTGTGAAGG AAGAATATGG TAATATCCCA TGGTTTATCA GTGAA 1005 
(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 973 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(Xi) SEQUOJCE DESCRIPTION: SEQ 10 NO: 355: 

CCGACAAGCA ATATTAAAAA GAGTAAACTA TTAACTAGTT AATTAACCGG TTTATTACTT 60 

TATAGTGAAT CAAATATACT TAAGAAAAGA GGAAACAATG AAAATTAATA AAAAATATCT 120 

AGCAGGTTCA GTGGCAGTCC TTGCCCTAAG TGTTTGTTCC TATGACCTTG GACGTTACCA 180 

AGCTGGTCAG GATAAGAAAG AGTCTAATCG AGTTGCTTAT ATAGATGGTG ATCAGGCTGG 240 

TCAAAAGGCA GAAAACTTGA CACCAGATGA AGTCAGTAAG AGCGAGGGGA TCAACGCCGA 300 

ACAAATTGTT ATCAAGATTA CGGATCAAGG TTATGTGACC TCTCATGGAG ACCATTATCA 3 60 

TTACTATAAT GGCAAGGTTC CTTATGATGC CATCATCAGT GAAGAGCTCC TCATGAAAGA 4 20 

TCCGAATTAT CAGTTGAAGG ATTCAGACAT TGTCAATGAA ATCAAGGGTG GTTATGTCAT 480 

TAAGGTAAAC GGTAAATACT ATGTTTACCT TAAGGATGCA GCTCATGCGG ATAATATTCG 540 

GACAAAAGAA GAGATTAAAC GTCAGAAGCA GGAACGCAGT CATAATCATA ACTCAAGAGC 600 

AGATAATGCT GTTGCTGCAG CCAGAGCCCA AGGACGTTAT ACAACGCATG ATGGGTATAT 660 

CTTCAATGCA TCTGATATCA TTGAGGACAC GGGTGATGCT TATATCGTTC CTCACGGCGA 720 

CCATTACCAT TACATTCCTA AGAATGAGTT ATCAGCTAGC GAGTTAGCTG CTGCAGAAGC 780 

CTATTGGAAT GGGAAGCAGG GATCTCGTCC TTCTTCAAGT TCTAGTTATA ATGCAAATCC 840 

AGCTCAACCA AGATTGTCAG AGAACCACAA TCTGACTGTC ACTCCAACTT ATCATCAAAA 900 

TCAAGGGGGA AACATTTCAA CCCTTTTACG TGAATTGTAT GCTAACCCTT ATCAGAACGC 960 

CATGTGGGAT CTG 973 
(2) INFORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

GGTCGCATCT GCAATATCTG TCGCCTCCAC ATAAGCGACA CCAGCCTTGT CTGCTGCCCG 60 

TTTGACACGT TCTGCAGATT GACCCAGGAT GACCATCTTC TTGAGTCCAG TAATGTCTGG 120 

CACCAATTCG TCAAACTCAT TGCCACGGTC CAAACCACCT GCAATCAAGA CGACCTTGCT 180 

GTTGTCAAAT CCTGACAAGC TTTTTGAGTA GCCAAGATAT TAGTTGATTT ACTGTCGTTA 240 

TAGAATTTAA CACsCTTGAT GTCATCCACA AACTGGAGAC GGTCTTTGAC ACCACCGAAG 300 

GCTGAAAGAG TTTCCTTGAT CCTTTGATTG TCCACATCAC GAAGCTTGGC TACAGCAATA 360 
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CTCGCAAGGG CATTTTCCAC ATTCTGGCTA CCTGGAACAC CGATTTCATT CCCTGCCATG 420 

ACTACTTCAC CACGGAAGTA GAGTTGACCA TCTTCCAGAT AACCTCCATC AACCTTTTCA 480 

AGTGTTGAAA ATGGTACAAC ACTGCCTTCT GTCTTGGAAG TCAAGTCTTT TGCCAAGTCT 540 

TGATTAAAGT TCAAGACAAG GAAATCAGCT GCTGTCATCT TGTTCTGGAT ATTCCACTTG 600 

GCTGCTACAT ATTCCGAAAA TGACCCATGG TAGTCGATAT GAGTTGGCAT GAGGTTGGTA 660 
ATAACCCCAA TCTCTGGATC GAATTCTTGA ACACCCATGA GTTGGAAAGA AGAAAGTTCC . . 720 

ATAACAAGCG TGTCCTTATC TGATGCTATT TGAGCAACCT GACTAGCTGG ATAGCCGATA 780 

TTCCCTGATA AAAGACCATG TTGGCCAGCA GCAGTCAAAA CTTCCCGGGn TCCTCTAGAG 840 

TCG 843 



(2) INFORMATION FOR SEQ ID MO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 
TTTTTTTTAT ATTTTTTTTA TTTATTATTT TTTGGCAAAA AAGACCAATT TGCTTTGGAG 60 

CATTGCTTCT GCATTAAATT GTCTATTTTT GCTCGTGCTG TTACCCTCTT TGTATCATGT 120 

ATTAACTAGC AAGTGCAACT TGCAAACTAC TAGTAAGAGG AGAAAAACAA AATGGTTATG 180 

ACTGACCCAA TCGCAGACTT CCTAACTCGT ATTCGTAATG CTAACCAAGC TAAACACGAA 240 

GTACTTGAAG TACCTGCATC AAACATCAAA AAAGGGATTG CTGAAATCCT TAAACGCGAA 300 

GGTTTTGTAA AAAACGTTGA AATCATTGAA GATGACAAAC AAGGCGTCAT CCGTGTATTT 3 60 

CTTAAATACG GACCAAATGG TGAGAAAGTT ATCACTAACT TGAAACCTGT TTCTAAACCA 4 20 

GGACTTCGTG TCTACAAAAA ACGTGAAGAC CTTCCAAAAG TTCTTAACGG ACTTGGAATT 4 80 

GCCATCCTTT CAACTTCTGA AGGTTTGCTT ACTGATAAAG AAGCACGCCA AAAGAATGTT 540 

GGTGGTGAGG TTATCGCTTA CGTTTGGTAA AATCAAGATA CAAAGCTCGT AAAGAACAAA 600 

GCAAAATTAG GAAGTTGGAG AAGTTTGTTT ACAAACAGGC CAACTTATCT ATTTTGCACA 660 

GTTCTTAGAG CGTGTTCAGT TCAGCTCTTG AGCTAAGTAA GTATCTGAAC CCCGTGAAAA 720 

CTGGCCCTGC TGGCATGTTC GGGTAACAGG AGAnAATAAA CATGTCACGT ATTGGTAATA 780 

AGTTCACCTA AGGCCTTCGT AAAACTT 807 
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(2). INFORMATION FOR SEQ ID NO: 358: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 
(DJ TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 



CCCAGTATTT 


TTGTCCAAGC 


ACGACCAGAA AAGGATGATA 


CAGATCTGGA 


ATTGGCTCTC 


60 


TTAACCATCT 


tTGAACAAAA 


TCCTCAGGCT 


CAGGTCACTA 


TTTTCCGTGC 


CTTGGGTGGC 


120 


CGTATTGACC 


ATATGTTGGC 


CAATGTCTTT 


CTGCCTAGCA 


ATCCTAACTT 


GGCACCCTA? 


180 


ATGCATCAAA 


TAGAAATTGA 


GGATGGGCAA 


AACTTGATTA 


CTTATTGTCC 


AGAAGGAATC 


240 


AGTCAGCTAG 


AACCTCGTTC 


AGACTACGAC 


TATCTAGCCT 


TTATGCCAGT 


TCGGCATAGC 


300 


CAAGTATGAG 


TTGACAGAGG 


AAAATTTTTT 


CTTTAAAAAA 


GTCTACGCTT 


CTAACGAATA 


360 


TATAGATAGG 


GAAGTGTCGG 


TAACTTGCCC 


AGATGGTTAT 


GTGGTCGTAC 


TGCATAGCAA 


420 


GGACAGGAGG 


TAGGATGGAA 


AGTTTACTTA 


TTCTATTATT 


AATTGCCAAT 


CTAGCTGGTC 


480 


TCTTTCTGAT 


TTGGCAAAGG 


CAGGATAGGC 


AGGAGAAACA 


CTTAAGTAAG 


AGCTTGGAGG 


540 


ATCAGGCAGA 


TCATTTGTCA 


GACCAGCTGG 


ATTACCGCTT 


TGACCAAGCC 


AGACAAGCCA 


600 


GCCACTTAGA 


CCAAAAAGAT 


TTGGAAGTGG 


TTGTCAGCGA 


CCGTTTGCAA 


GAA 


653 


(2) INFORMATION FOR SEQ ID NO: 359: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 






CACCATGTGA 


TGTGACGCTG 


GCCACAGCTG 


TCAGAAATCT 


GGCGAGCCAT 


CGTGTGCAAT 


60 


GACTCTTCCC 


GATGTAATCT 


TGTTCATAGT CCTTTGATGA 


ATATGTTCAA 


GCTGTAGAAG 


120 


GTGCGCTTCC 


TGAACACTTA 


TCAACTGTTA 


CAGGCGAGTT 


GACCAGTCAG 


GAAACAGATG 


180 


GCTGGTACAC 


ACTTGCCAAC 


ACTTCTTCAT 


CCCGCATTTA 


CCTAAAACAA 


GCCTTCCAAG 


240 


AAAATAGCAA 


CCTCCTAGAG 


CAAGTGGTAG 


AACCCTTGAC 


TATTATCACT 


GCTGGACACA 


300 


ACCACAAGGA 


CCAGTTGACC 


TATGCTTGGA 


AAACACTTTT 


GCAGAATGCG 


CCACATGATA 


360 


GTATCTGTGG 


CTGTAGCGTG 


GACGAAGTTC 


ACCGCGAGAT 


GGAAACGCGT 


TTTGCCAAGG 


420 



WO 98/18931 



PCT/US97/19588 



TCAACCAAGT AGGAAACTTT CTTAAAACTA ACTTGCTCAA CCACTGCAAG GGTAAAATTG 480 

CTACGCATAA GGCTCAAAGT GACTATCTCT TTACTGTCAT TAACACAGGC TTGCATGATA 540 

AGGTCGATAC TGTCAGCACA GTGATTGATG TGGCGACTTG TGATTTCAAG GAATTGCACC 600 

CAACAGAAGG CTACAAAAAG ATGGCTGCTC TTATCTTGCC G 641 

(2) INFORMATION FOR SEQ ID NO: 360: 

(X) SEQUENCE_CHARACT ERISTICS : - 

(A) LENGTH: 1958 base pairs 
<B> TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 

CCTCAAGGCC AATTTGAAGG CTCTAAAACA ATGGAAAAGT GCTACACAGA TGTGACAGAA 60 

TTTGCCATTC CAGCAAGTAC TCAAAAGCTT TACTTATCAC CACTTTTACA TGGCTTTAAT 120 

AGCGAAATTA TTGCTTTTAA TCTTTCCACT TCACCCAACT TACAACAAGT ACAAACAATG 180 

TTAGAACAGG CATTCAAAGA GAAGCACTAC GAGAATACGA TTCTCCATAG TGACCAAGGC 240 

TGGCAATATC AACACGATTC TTATCATCGG TTCCTAGAGA GTAAGGGAAT TCAAGCATCT 300 

ATGTCACGCA AGGGCAACAG CCAAGACAAC GGTATGATGG AATCTTTCTT TGGCATTTTA 360 

AAATCCGAAA TGTTTTATGG CTATGAGAAA ACATTTAAAT CACTTAACCA ATTGGAACAA 4 20 

GCCATTATAG ACTATATTGA TTACTACAAC AACAAACGAA TTAAGGTAAA ACTAAAAGGA 480 

CTTAGTCCTG TGCAGTACAG AACTAAATCC TTTGGATAAA TTAATTGTCT AACTTTTTGG 540 

GGTCAGTACA AAACTCTTGC TACTATGCGT TTTATTATTG AAAGACTTAT TGGACTTTCT 600 

CTCAAATCGA GTTTTTACTC AATTTTCTTA CTTGATTGGG ATTGAAATTC CAATTAATTT 660 

CTCTGAGTAG AGTGTCTTGA TATTGGCTTC ATCAACAGAC GCCTTATCAA TTTTACGTTT 7 20 

CAAGAAAAAT TCTTGAATGG TTTCGATTTC AGGCTCACGA ATAGCACGGT GTTT G TTTGA 780 

GATGAGGATT TCATAGTGAA GCGGAGCTTG GGTAAAAATA ACATCTGTAT TCCCTGCAGA 840 

ATAAACCTCA ACAAGGGTTG CATCGGTACT TTCTAGCTGA CTTTTTACAA GTTGCGAGTG 900 

TGAGTTTGTC GTATTGATAA GCTTCATAAT ATTTCCTCCG ATTTTCTAAT TCTATTATAG 960 

CACTTTTTGA ATAAAGTCGC TTGATTTATA CTCAATGAAA ATCAAAGAGC AAACTAGGAA 1020 

GCTAGCCGCA GGCTATACTT GAGTACGGTA AGGCGACGCT GACGTGGTTT GAATTTTATT 1080 

TTCGAAGAGT ATTAGCCAAT CTTATGCTGT TTTTTCCAAG ATTCAATGGC CCATTTATGG 1140 
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CTACCACGTT TAAGCTTTTT GATAGCCTCG TCAATAGGGA ACCAGGCAAT ATGATTAAAG 1200 

TTTTCTAGTG GCTTTTGTAC TTCTTTGAAA GGAGTTGCTT CATAGAGGTA GGCAGGATTG 12 60 

TAGTAGTAGG TATCACGATG ACGAGAATAG AAATATTCCT CAGCTTGTCC GTAATAGGTA 1320 

CCAATTTCTG CTGTGAAACC AAGCTCTTCA ATCAACTCAT GCTTTAGGGC TTCCTGATGA 1380 

TTTTCACCTG CTTCAATTTC TCCACATGGT AGGAACCAAG CACCATTTGG TTCTTGAACA 1440 

AGAACAATTT CTTTTTGTTC AGGATTAGGG ATAACTGCAT ATACGCCATA GCGAGCAATA 1500 

TAGTCTGTAT TCACTTTTTT TCTCCGAAAG TTGGGTTTGC CATTGCATTT TCCTCATTAT 1560 

CTAGTATCGT TATTATTATA GTGAAATGAA CCAAAAATAG TACACAATGT GGTATAATCT 1620 

TCTTATGGCA TATTCAATAG ATTTTCGTAA AAAAGTTCTC TCTTATTGTG AGCGAACAGG 1580 

TAGTATAACA GAAGCATCAC ACGTTTTCCA AATCTCACCT AATACCATTT ATGGCTGGTT 1740 

AAAGCTAAAA GAGAAAACAG GAGAGCTAAA CCACCAAGTA AAAGGAATAA AACCAAGAAA 1800 

GGTTGATAGA GATAGACTTA AAAACTATCT TACTGACAAT CCAGACGCTT ATTTGACTGA 1860 

AATAGCTTCT GAATTTGGCT GTCATCCAAC TACCATCCAC TATGCGCTCA AAGCTATGGG 1920 

tACACTCGAA AAAAAAAAGA ACTACACCTA CTATGAAC 1958 
(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(O) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

TATGAAATTA ACTTATGATG ATAAAGTTCA GATCTATGAA CTTAGAAAAC AAGGATATAG 60 

CTTAGAGAAG CTTTCAAATA AATTTGGGAT AmACAATTCT AATCTTAGGT ATATGATTAA 120 

ATTGATTGAT CGTTACGGAA TAGAGTTCGT CAAAAAAGGA AAAAATCGTT ACTATTCTCC 180 

TGATTTAAAA CAAGAAATGA TTAATAAAGT CTGACATGAA GGCTGGACTA AAGATAGAGT 240 

TTCTCTTGAA TACGGTCTCC CAAGTCGTAC GATACTTCTT AACTGGCTAG CACAATACAG 300 

GAAAAACGGG TATACTATTG TTGAGAAACC AAGAGGGAGA GTACCTGAGA GCGGAGAATG 3 60 

C CATC CT AAA AAAGTTAAGA GAACTCCGAT TGAAGGAGGA AAAAGAGAAA GAAGAAAGAC '420 

AGAAATTGTT TAAGAATTAA TGACTGAGTT TTCGTTAGAT CTTCTTTTAA AAGTCATTAA 480 

ACTAGCTCGT TCGACCTACT ACTATCACTT GAAACAGCTA GATAAACCAG ATAAGGACCA 540 

AGAGCTTAAA GCTGAAATTC AATCCATTTT TATCGAACAC AAAGGAAATT ATGCTTATCG 600 
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TCGGATTTAT TTAGAACTAA GAAATCCTGC TTATCTGGTA AATCATAAAA CAGTTCAAGG 660 

CTTGATGAAA GTACTCAATC TACAAGCTAA AACGCGACAG AAACGAAAAT ATTCTTCTCA 720 

TAAAGGAGAC GTTGGCAAGA AGGCAGAGAA TCTCATTCAA CGCCAATTTG AAGGCTCTAA 780 

AACAATGGAA CAGTGCTACA CAGATGTGAC AGAATTTGCC ATTCCAGTAA GTACTTAAAA 840 

GCTTTACTTA T 851 

(2) INFORMATION FOR SEQ ID N O: 36 2: 

ti) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1168 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: double 

(D) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


362: 






GGCTACAATC 


GATATCTCCA 


ATGAGTTGGT 


CTAGCTGGTG 


AAACTGTAAA 


i i l Luw 


ou 


CCAATTCAAG 


GTTGAGGCAT 


CGCAAACTAT 


GGACTGTTTC 


CTCGTCAGTT 


CTGGAAAGAA 


120 


AACGCGATAA 


GGTTGGCTGT 


GAAGCAAGCT 


GCCCTCCTTC 


CAACAATTTT 


GGAAAGTAGG 


180 


CATCAGCTGA 


CAATTCTTTA 


CAAGCATAGT 


CCGTTCCATA 


ACCTGTTAAC 


AGTTGAAAGA 


240 


GGAACTGGAC 


AAGGATATCT 


GAATCCGAAT 


AACGACAGTA 


GCGGCGTTGG 


TCATTCGTTA 


300 


CTAAATACTT 


AGAAATCCGC 


TCTTTTAGTT 


TCAACTGGGA 


AAAAAGTTCC 


TGAAAAAAGA 


360 


TAAGACCACC 


ATACTGCCTT 


AAATGACCTC 


CATCGAAAGA 


TAGTTGGTAA 


AAAGACTTGT 


420 


TTTGGAAGTG 


ATGATTTGGT 


AAACTGTTCA 


TGTGAGTTTC 


CTTTCTTTTT 


GTGTTTTTTT 


480 


CTACACTTAT 


ACCATAAAGG 


GGAAACTCTT 


TTTTGTCTAG 


TAAAAAACAC 


CCATTGGGTG 


540 


AAAAAAGAAA 


CCATCCAGGA 


TCTAAGCTAA 


GGCAAGGATT 


CTGGATGGTT 


TTTAGATTTG 


600 


GGGTGAATAA 


TTGGGGTTTT 


AGCTGCTTGC 


GGCCAATCAG 


GTTCAGATAC 


AAAAACTTAC 


660 


TCATCAACCT 


TTAGTGGAAA 


TCCAACTACA 


TTTAACTATC 


TATTAGACTA 


TTACGCTGAT 


720 


AATATAGTCA 


ATTGAAACAA 


GAACAAGACA 


AAAGAGCCTC 


ATAAAAGGTA 


TTGCAACTTG 


780 


GTAATACCTT 


TTTGAGGTGC 


TTTTTGATAT 


GAGCCCATGT 


TTTCTCAATA 


GGATTGTACT 


840 


CAGGTGAGTA 


GGGAGGAAGA 


GGTAAAAGTT 


TATACCCAAA 


CTCTTCACAC 


AAGAGTTCTA 


900 


ACTTACCCAT 


TCTATGGAAT 


CTTGCATTAT 


CCATAATAAT 


AACCGATGGT 


GTGTTTAATG 


960 


TTGGTAAGAG 


AAATTTCTGA 


AACCAAGCTT 


CAAAAAAGTC 


GCTCGTCATC 


GTCTCTTCGT 


1020 


AAGTTATTGG 


AGCGATTAAC 


TCACCATTTG 


TTAGACCTGC 


AACCAAAGAA 


ATCCTCTGAT 


1080 
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ATCTTCTTCC ACATACTTTG CCTCTTCTTA ACTGACCTTT TAATGAGCGA CCATATTCTC 1140 

GATAAAAATA ACTATCGAAT CCTGTTTC 1168 

(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4483 base pairs 
(B> TYPE: nucleic acid 

(C) STRAND E ON ESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 

GTCAGCTTCA GCAAGCCCAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CCGCTTCAGC 60 

GTCAACCAGT GCGTCGGCTT CAGCGTCGAC AAGTGCTTCG GCTTCAGCAT CAACGAGTGC 120 

GTCGGCCTCA GCAAGCGCAA GTACCTCAGC GTCAGCTTCC GCCTCAACCA GTGCGTCGGC 180 

TTCAGCAAGC ACAAGTGCGT CAGCCTCAGC AAGTATCTCA GCGTCTGAAT CGGCATCAAC 24 0 

GAGTGCGTCT GAGTCAGCAT CAACGAGTAC GTCAGCCTCA GCAAGCACAT CAGCTTCTGA 300 

ATCTGCATCA ACCAGTGCGT CAGCCTCAGC ATCGACAACC GCCTCAGCTT CAGCAAGTAC 360 

CAGTGCTTCA GCCTCAGCGT CGACAAGTGC GTCGGCCTCA ACCAGTGCAT CTGAATCGGC 420 

ATCAACCAGT GCGTCAGCCT CAGCAAGTAC TAGTGCATCA GCTTCAGCAT CAACGAGTGC 480 

ATCGCCTTCA GCATCAACCA GTGCCTCGGC TTCAGCGTCA ACCAGTGCGT CAGCTTCAGC 540 

AAGTACCAGT GCTTCAGTCT CAGCATCAAC AAGTGCTTCA GCCTCAGCAT CGACAAGTGC 600 

CTCGGCTTCA GCAAGCACAT CAGCATCTGA ATCAGCCTCG ACAAGCGCCT CAGCTTCAGC 660 

AAGTACCAGT GCGTCAGCCT CAGCGTCGAC AAGTGCGTCA GCCTCAGCAA GTACTAGTGC 720 

ATCAGCTTCA GCATCAACGA GTGCATCGGC TTCGGCGTCA ACCAGTGCAT CACAGTCAGC 780 

AAGTACCAGT GCGTCAGCTT CCGCATCAAC AAGTGCCTCG GCTTCAGCAA GCACCAGTGC 840 

CTCGGCTTCA GCAACTACTA GCGCCTCAGC CTCAGCCTCA ACCAGTGCGT CAGCCTCAGC 900 

AAGTATCTCA GCGTCTGAAT CGGCATCAAC GAGTGCGTCC GCTTCAGCAA GTACTAGCGC 960 

CTCAGCCTCA GCGTCAACAA GTGCATCGGC TTCAGCGTCA ACCAGTGCGT CTGAATCGGC 1020 

ATCAACGACT GCCTCCGCTT CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC 1080 

ATCGCCTTCA GCATCAACGA GTGCGTCCGC TTCAGCAAGT ACTAGCGCCT CAGCCTCAGC 1140 

CTCAACAAGT GCATCGGCTT CAGCGTCAAC GAGTGCGTCT GAGTCAGCAT CAACGAGTGC 1200 

GTCAGCCTCA GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CAgCCTCAGC 1260 

ATCGACAAGC GCCTCAGCTT CAGCAAGTAC CAGTGCGTCA GcTCAGCGTC GACAAGTGCs 1320 
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TCrGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCAAGTA CCAGTGCkTC AGCCTCAGCG 1380 

TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC 1440 

TCAGCAAGTA CTAGCCCCTC AGCCTCAGCA TCAACGAGTG CGTCCGCTTC AGCAAGTACT 1500 

AGTCCATCAG CTTCAGCAAG TACTAGCGCC TCAGCCTCAG CGTCGACAAG CGCCTCACCT 1560 

TCAGCAAGTA CCAGTGCGTC AGCCTCAGCG TCGACAAGTG CGTCCGCTTC AGCAACTACC 1620 
TCAGCGTCTC AATCAGCATC AACAAGTGCG TCCGCTTCAG CATCAACGAG TGCATCAGCT . 1680 

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTACCAGTG CGTCCGCTTC AGCATCAACG 1740 

AGTGCTTCAG TCTCAGCGTC AACCAGTGCC TCTGAATCCG CATCAACAAG TGCCTCGGCT 1800 

TCAGCAAGCA CCAGTGCTTC GGCTTCAGCG TCAACGAGTG CGTCTGAGTC AGCATCAACG 1860 

AGTGCCTCAC CTCAGCAAGC ACATCAGCTT CTGAATCTGC ATCAACCAGT GCCTCACTTC 1920 

CGCATCAACA AGCGCCTCGG CCTCAGCAAG TACAAGTGCT TCAGCCTCAG CATCAACCAG 1980 

TGCATCAGCT TCACCCTCAA CAAGTGCTTC AGCCTCAGCG TCAACCAGTG CCTCGGCTTC 204 0 

AGCAAGTACC AGTGCGTCAG CTTCAGCAAG CACAAGTGCG TCAGCTTCAG CATCAACCAG 2100 

TGCTTCGGCT TCGGCATCAA CAAGTGCCTC AGCATCACCA TCAACGAGTG CGTCAsCTCA 2160 

GCAAGTACTA GTGCATCAGC ATCAGCATCA ACCAGTGCAT CAGCCTCAGC AAGTATCTCA 2220 

GCGTCTGAAT CGGCATCAAC GAGTGCATCA GCATCAGCAT CAACGAGTGC ATCCGCTTCA 2280 

GCGTCAACCA GTGCATCAGT CTCAGCAAGC ACCAGTGCGT CGCCTTCAGC ATCAACCAGT 2 340 

GCCTCAGCCT CAGCAAGTAT CTCAGCGTCT GAATCGGCAT CAACGAGTGC GTCAGcCTCA 2400 

GCAAGTACTA GTGCATCAGC ATCAGCATCA ACGAGTGCAT CGCCTTCAGC AAGTACCAGC 24 60 

GCCTCAGCTT CAGCAAGCAC CAGTGCGTCA GCCTCAGCAA GTACCAGCGC CTCAGCCTCA 2520 

GCAAGCACCA GTGCCTCAGC TTCAGCAAGT ACCAGTGCGT CAGCCTCAGC GTCGACAAGT 2580 

GCGTCGGCTT CAGCAAGTAC CTCAGCGTCT GAATCAGCAT CAACGAGTGC ATCACCTTCA 2 640 

GCATCAACAA GTGCTTCAGC TTCAGCAAGT ACCAGTGCGT CGGCTTCAGC ATCAACCAGT 2700 

GCTTCAGTCT CAGCGTCAAC CAGTGCCTCT GAATCAGCAT CAACAAGTGC CTCGGCTTCA 27 60 

GCAAGCACCA GTGCGTCGGC TTCAGCAAGT ACTAGTGCAT CGGCTTCAGC ATCGACAAGT 2820 

GCGTCTGAAT CGGCATCAAC GAGTGCTTCG GCTTCAGCAT CAACGAGTGC GTCAGCCTCA 2880 

GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CCGCTTCAGC GTCAACCAGT 2940 

GCGTCGGCTT CAGCGTCGAC AAGTGCTTCG GCTTCAGCAT CAACGAGTGC GTCGGCCTCA 3000 

GCAAGCGCAA GTACCTCAGC GTCAGCTTCC GCCTCAACCA GTCCGTCCGC TTCAGCAAGC 3060 
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ACAAGTGCGT CAGCCTCAGC 


AAGTATCTCA 


GCGTCTGAAT 


CGGCATCAAC 


GAGTGCGTCG 


3120 


GCCTCAGCAA GCGCAAGTAC 


CTCAGCGTCA 


GCTTCCGCCT 


CAACCAGTGC 


CTCGGCTTCA 


3180 


GCAAGCACAA GTGCGTCAGC 


CTCAGCAACT 


ATCTCAGCGT 


CTGAATCGGC 


ATCAACGAGT 


3240 


GCGTCTGAGT CACCATCAAC 


GAGTACGTCA 


GCCTCAGCAA 


GCACATCAGC 


TTCTGAATCG 


3300 


GCATCAACCA GTGCGTCAGC 


CTCAGCATCG 


ACAAGCGCCT 


CAGCTTCAGC 


AAGTACCACT 


3360 


GCTTCAGCCT CAGCGTCGAC 


AAGTGCGTCG 


GCCTCAACCA 


GTGCATCTGA 


ATCGGCATCA 


3420 


ACCAGTGCGT CAGCCTCAGC 


AAGTACTAGT 


GCATCAGCTT 


CAGCATCAAC 


GACTGCATCG 


3480 


GCTTCAGCAT CAACCAGTGC 


CTCGGCTTCA 


GCCTCAACCA 


GTGCGTCAGC 


TTCAGCAAGT 


3540 


ACCAGTGCTT CAGTCTCAGC 


ATCAACAAGT 


GCTTCAGCCT 


CAGCATCGAC AAGTGCCTCC 


3600 


GCTTCAGCAA GCACATCAGC 


ATCTGAATCA 


GCCTCGACAA 


GCGCCTCAGC 


TTCAGCAAGT 


3660 


ACCAGTGCGT CAGCCTCAGC 


GTCGACAAGT 


GCGTCAGCCT 


CAGCAAGTAC 


TAGTGCATCA 


3720 


GCTTCAGCAT CAACGAGTGC 


ATCGGCTTCG 


GCCTCAACCA 


GTGCATCAGA 


GTCAGCAAGT 


3780 


ACCAGTGCGT CAGCTTCCGC 


ATCAACAAGT 


GCCTCGGCTT 


CAGCAAGCAC 


CAGTGCGTCG 


3840 


GCTTCAGCAA GTACTAGCGC 


CTCAGCCTCA 


GCCTCAACCA 


GTGCGTCAGC 


CTCAGCAACT 


3900 


ATCTCAGCGT CTGAATCGGC 


ATCAACGAGT 


GCGTCCGCTT 


CAGCAAGTAC 


TAGCGCCTCA 


3960 


GCCTCAGCGT CAACAAGTGC 


ATCGGCTTCA 


GCGTCAACGA 


GTGCGTCTGA 


ATCGGCATCA 


4020 


ACGAGTGCGT CCGCTTCAGC 


AAGTACTAGC 


GCCTCAGCCT 


CAGCGTCAAC 


AACTGCATCG 


4080 


GCTTCAGCAT CAACGAGTGC 


GTCCGCTTCA 


GCAAGTACTA 


GCGCCTCAGC 


CTCAGCGTCA 


4140 


ACAAGTGCAT CGGGTTCAGC 


GTCAACGAGT 


GCGTCTGAGT 


CAGCATCAAC 


GAGTGCGTCA 


4200 


CCTCAJcCAAG CACATCAGCT 


TCTGAATCTG 


CATCAACCAG 


TGCGTCACTT 


CCGCATCAAC 


4260 


AAGCGCCTCG GCCTCAGCAA 


GTACAAGTGC 


TTCAGCCTCA 


GCATCAACCA 


GTGCATCAGC 


4320 


TTCAGCCTCA ACAAGTGCTT 


CAGCCTCAGC 


GTCAGACCAG 


TGCCTCGGCT 


TCAGCAAGTA 


4380 


CCAGTGCGTC ACTTCAGCAA 


GCACAACTGC 


GTCAGCTTCA 


GCATCAACCA 


GTGCTTCGt;C 


4440 


TTCGGCATCA ACAAGTGCCT 


CAGCATCAGC 


ATCAACGAGT 


GCG 




4483 


(2) INFORMATION FOR SEQ ID NO: 364: 









ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25S0 base pairs 

(B) TYPE: nucleic acid 
(C> STRAND EDNESS : double 
(D) TOPOLOGY : linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 364: 
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GTACCTCAGC GTCCTTCCGC CTCAACCAGT GCGTCCGCTT CAGCAAGCAC AAGTGCGTCA 60 

CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCCGCC TCAGCAAGCG 120 

CAAGTACCTC AGCGTCACTT CCGCCTCAAC CAGTGCGTCG GCTTCAGCAA GCACAAGTGC 180 

CTCAsCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA 240 

TCAACGAGTA CGTCAGCCTC AGCAAGCACA TCAGCTTCTG AATCGGCATC AACCAGTGCG 300 

_ -TCAGCCTCAG- CATCGACAAG - CGCCTCAGCT--TCAGCAAGTA-CCAGTGCTTC -AGCCTCAGCG —360 

TCGACAAGTG CGTCGGCCTC AACCACTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC 420 

TCAGCAAGTA CTAGTGCATC AGCTTCAGCA TCAACGAGTG CATCGGCTTC AGCATCAACC 480 

AGTGCCTCGG CTTCAGCGTC AACCAGTGCG TCAGCTTCAG CAAGTACCAG TGCTTCAGTC 54 0 

TCAGCATCAA CAAGTGCTTC AGCCTCAGCA TCGACAAGTG CCTCGGCTTC AGCAAGCACA 600 

TCAGCATCTG AATCAGCGTC GACAAGTGCG TCGGCCTCAA CCAGTGCATC TGAATCGGCA 660 

TCAACCAGTG CGTCAGCCTC AGCAAGTACT ACTCCATCAC CTTCAGCATC AACGAGTGCA 720 

TCGGCTTCGG CGTCAACCAG TGCATCAGAG TCAGCAAGTA CCAGTGCGTC AGCTTCCGCA 780 

TCAACAAGTG CCTCGGCTTC AGCAAGCACA TCAGCATCTG AATCAGCGTC AACCAGTGCT 840 

TCGGCTTCAG CAAGTACCAG TGCTTCAGCT TCAGCATCAA CCAGCGCCTC GGCCTCAGCA 900 

AGCACCTCAG CTTCTGAATC GGCCTCAACC AGCGCCTCGG CCTCAGCAAG CACCTCAGCT 960 

TCTGAATCGG CCTCAACCAG CGCCTCAGCC TCAGCATCAA CGAGTGCTTC GGCTTCAGCA 1020 

AGCACAAGCG CCTCGGGTTC AGCATCAACG AGTACGTCAG CTTCAGCGTC AACCAGTGCT 1080 

TCAGCCTCAG CATCAACAAG TGCGTCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 1140 

TCAACGAGTG CGTCTGAGTC AGCATCAACG AGTACGTCAG CCTCAGCAAG CACAAGTGCT 1200 

TCAGCCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC CGCTTCAGCA 1260 

AGTACTAGCG CCTCAGCATC AGCGTCAACA ACTGCTTCGG CTTCAGCGTC AACGAGTGCG 1320 

TCTGAGTCAG CATCAACGAG TACGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA 1380 

TCAACCAGTG CGTCAGCCTC AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG 1440 

TCAGCCTCAG CAAGTACCAG TGCTTCAGCC TCAGCGTCGA CAAGTGCGTC GGCCTCAACC 1500 

AGTGCATCTG AATCGGCATC AACCAGTGCG TCAGCTCAGC AAGTACTAGT GCATCAGCTT 1560 

CAGCATCAAC GAGTGCATCG GCTTCGGCGT CAACCAGTGC ATCAGAGTCA GCAAGTACCA 1620 

GTGCGTCACt TCCGCATCAA CAAGTGCCTC GGCTTCAGCA AGCACATCAG CATCTGAATC 1680 

AGCGTCAACC ACTGCTTCGG CTTCAGCAAG TACCAGTGCT TCAGCTTCAG CATCAACCAG 1740 
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CGCCTCGCCC TCAGCAAGCA CCTCAGCTTC TGAATCCGCC TCAACCAGCG CCTCGGCCTC 1800 

AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGCGCC TCAGCCTCAG CATCAACGAG 1860 

TGCTTCGGCT TCAGCAAGCA CAAGCGCCTC GGGTTCAGCA TCAACGAGTA CGTCAGCTTC 1920 

AGCGTCAACC AGTGCTTCAG CCTCAGCATC AACAAGTGCG TCAGCCTCAG CAAGTATCTC 1980 

AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGACTCAGCA TCAACGAGTA CGTCAGCCTC 2040 

AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCACTGCG TCAGCCTCAG CATCGACAAG 2100 

CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG TCGACAAGTG CCTCGGCCTC 2160 

AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC TCAGCAAGTA CTAGTGCATC 2220 

GGCTTCAGCA TCAACCAGTG CCTCCGCTTC AGCGTCAACC AGTGCGTCAG CTTCAGCAAG 2280 

TACCAGTGCT TCAGTCTCAG CATCAACAAG TGCTTCAGCC TCAGCATCGA CAAGTGCCTC 2340 

GGCTTCAGCA AGCACATCAG CATCTGAATC AGCGTCGACA AGCGCCTCAG CTTCAGCAAG 2400 

TACCACTGCG TCAGCCTCAG CGTCGACAAG TGCGTCACCT ACAGCAAGTA CTAGTGCATC 2460 

AGCTTCAGCA TCAACGAGTG CATCGGCTTC GGCGTCAACC AGTGCATCAG AGTCAGCAAG 2520 

TACCAGTGCG TCAGTTCACG CATCAACAAG 2S50 
(2) INFORMATION FOR SEQ ID NO: 365: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 365: 

ACCCAGCAAG TACTAGTGCA TCGGCTTCAG CAAGCACCAG TCCCTCGCCT TCAGCATCAA 60 

CCAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAC 120 

CTCAGCAAGT ACTAGTGCAT CAGCATCAGC ATCAACGAGT GCATCGGCTT CAGCAAGTAC 180 

CAGCGCCTCA GCTTCAGCAA GCACCAGTGC GTCAsCTCAG CAAGTACCAG CGCCTCAGCC 240 

TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA 300 

AGTGCGTCGG CTTCAGCAAG TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT 360 

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG 420 

AGTGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCATCAG CGTCAACAAG TGCTTCGGCT 480 

TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA CGTCAGCCTC AGCAAGCACA 540 

TCAGCTTCTG AATCTGCATC AACCACTGCG TCAGCCTCAG CATCGACAAG CGCCTCAGCT 600 
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TCAGCAAGTA 


CCAGTGCGTC AgCCTCAGCA AGTACCAGTG CTTCAGCCTC 


AGCGTCGACA 


660 


ACTCCCTCGG 


CCTCAACCAG 


TGCATCTGAA 


TCGGCATCAA 


CCAGTGCGTC 


AGCCTCAGCA 


720 


AGTACTAGCG 


CCTCAGCCTC 


AGCATCAACG 


AGTGCGTCCG 


CTTCACCAAG 


TACTAGTGCA 


780 


TCAGCTTCAG 


CAAGTACTAG 


CGCCTCAGCC 


TCAGCGTCGA 


CAAGCGCCTC 


AGCTTCAGCA 


840 


AGTACCAGTG 


CCTCAGCCTC 


AGCGTCGACA 


AGTGCGTCGG 


CTTCACCAAG 


TACCTCAGCG 


900 


TCTGAATCAG 


CATCAACAAG 


TGCGTCGGCT 


TCAGCATCAA 


CGACTGCATC 


AGCTTCAGCA 


960 


TCAACAAGTG 


CTTCAGCTTC 


AGCAAGTACC 


AGTGCGTCGG 


CTTCAGCATC 


AACGAGTGCT 


1020 


TCAGTCTCAG 


CCTCAACCAG 


TGCCTCTGAA 


TCCGCATCAA 


CAAGTGCCTC 


GGCTTCAGCA 


1080 


AGCACCAGTG 


CTTCGGCTTC 


AGCGTCAACG 


AGTGCGTCTG 


AGTCAGCATC 


AACGAGTGCG 


1140 


TCAGCCTCAG 


CAAGCACATC 


AGCTTCTGAA 


TCTGCATCAA 


CCAGTGCGTC 


AGCTTCCGCA 


1200 


TCAACAAGCG 


CCTCGGCCTC 


AGCAAGTACA 


AGTGCTTCAG 


CCTCAGCATC 


AACCAGTGCA 


1260 


TCAGCTTCAG 


CCTCAACAAG 


TGCTTCAGCC 


TCAGCGTCAA 


CCAGTGCCTC 


GGCTTCAGCA 


1320 


AGTACCAGTG 


CGTCAGCTTC 


AGCAAGCACA 


AGTGCGTCAG 


CTTCAGCATC 


AACCACTGCT 


1380 


TCGGCTTCGG 


CATCAACAAG 


TGCCTCAGCA 


TCAGCATCAA 


CCAGTGCGTC 


AGCCGG 


1436 



(2) INFORMATION FOR SEQ 10 NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 



GCAGTTGCCA CACCGTGCTG 


ACCAGCACCC 


GTTCCTGCGA 


TAATTTTCTT 


TTTACCCATG 


60 


CGTwTGGCAA 


GCCAAACTTG 


TCCTAAGGCA 


TTGTTAATCT 


TGTGGGCTCC 


TGTATGGTTA 


120 


AGGTCTTCCC 


GTTTGAGATA 


AATCTTGCTC 


CGCCAATATG 


CTGCCTCAAG 


TTTTTTGCGT 


180 


AATAAAGAGG 


AGTTTCACGT 


CCTACGTACT 


GGCGCAAAAG 


CTGGTTTAAT 


TCCTCTTGGA 


240 


AACTTGGGTC 


TGCCTGACTT 


TCACGGTAGG 


CCTTCTCCAA 


CTCCAAAACT 


GCTGTCATCA 


300 


ATGTTTCTGG 


GACAAAACGT 


CCGCCGAATT 


TTCCGTAAAA 


TCCATCTTTA 


TTTGGTTCCT 


360 


GATATGCCAT GCTTTACCCT 


CTCTATAAAT 


CTTCTAATCT 


TTTCATGATC 


TTTTTGTCCA 


420 


TCTGTCTCCA 


CTCCGCTCGA 


TACATCTACT 


GCATAGGGAG 


TAAACTGTTG 


AATTCCTTTT 


480 


ACT AC ATT AT 


CTTCATTAAG 


GCCACCTGCG 


ATAAAGAAGG 


GCTGTCCTAG 


TCCAGTCGTA 


540 
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TCCAGTTGAC CCCAATCAAA GGCCTGGCCA CTTCCTGCCA CAGGGGCATC AAAGAGTAGA 600 

TAATCTGCCT GAGAATTGGG GACATGCCCA TTTCCATCTA CCTGCACAGC CTGAATACTG 660 

GCACAACGCA AATTCTCAAA TAAATCATCT GCCACCTGAC CGTGAACTTG AACCAAGTCC 7 20 

AAGCCGGCGA TCCTC 735 

(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1702 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 



TACTAGCGCC 


TCAGCCTCAG 


CGTCAACAAG 


TGCATCGGCT 


TCAGCATCAA 


CGAGTGCGTC 


60 


CGCTTCAGCA 


AGTACTAGCG 


CCTCAGCCTC 


AGCGTCAACA 


AGTGCATCGG 


CTTCAGCGTC 


120 


AACGAGTGCG 


TCTGAGTCAG 


CATCAACGAG 


TGCGTCAGCC 


TCAGCAAGCA 


CATCAGCTTC 


180 


TCAATCTGCA 


TCAACCAGTG 


CCTCAGCCTC 


AGCATCGACA 


AGCGCCTCAG 


CTTCAGCAAG 


240 


TACCAGTGCG 


TCAGCCTCAG 


CGTCGACAAG 


TGCGTCGGCT 


TCAGCAAGTA 


CCAGTGCGTC 


300 


AGCCTCAGCA 


AGTACCAGTG 


CGTCAGCCTC 


AGCGTCGACA 


AGTGCGTCGG 


CCTCAACCAG 


360 


TGCATCTGAA 


TCGGCATCAA 


CCAGTGCGTC 


AGCCTCAGCA 


AGTACTAGTG 


CATCAGCTTC 


420 


AGCATCAACG 


AGTGCATCGG 


CTTCAGCATC 


AACCAGTGCA TCAGAGTCAG 


CAAGTACCAG 


480 


TGCGTCA«CT 


TCCGCATCAA 


CAACTGCCTC 


GGCTTCAGCA 


AGTACTAGCG 


CCTCAGCCTC 


540 


AGCGTCAACA 


AGTGCTTCAG 


CTTCCGCGTC 


AACCAGCGCC 


TCGGCCTCAG 


CAACTATCTC 


600 


AGCGTCTGAA 


TCGGCATCAA 


CAAGTGCCTC 


GGCTTCAGCA 


TCAACGAGTG 


CATCAGTCTC 


660 


AGCAAGCACC 


AGTGCGTCGG 


CCTCAGCAAG 


CACCAGCCCG 


TCTGAATCCG 


CATCAACCAG 


720 


TGCCTCAGCT 


TCAGCAAGTA 


CCTCAGCATC 


TGAATCAGCA 


TCAACAAGTG 


CATCCCCTX 


780 


AGCAAGCACA 


AGTGCTTCAG 


CCTCAGCAAG 


TATCTCAGCG 


TCTGAATCGG 


CATCAACGAG 


840 


TGCGTCCGCT 


TCAGCAAGTA 


CTAGCGCCTC 


AGCATCAGCG 


TCAACAAGTG 


CTTCGGCTTC 


900 


AGCGTCAACG 


AGTGCGTCTG 


AGTCAGCATC 


AACGAGTACG 


TCAGCCTCAG 


CAAGCACATC 


960 


ACCTTCTGAA 


TCTGCATCAA 


CCAGTGCGTC 


AGCCTCAGCA 


TCGACAAGCG 


CCTCAGCTTC 


1020 


AGCAAGTACC 


AGTGCGTCAG 


CCTCAGCAAG 


TACCAGTGCT 


TCAGCCTCAG 


CGTCGACAAG 


1080 




TCAACCAGTG 


CATCTGAATC 


GGCATCAACC 


AGTGCGTCAG 


CCTCAGCAAG 


1140 


TACTAGCGCC 


TCAGCCTCAG 


CATCAACGAG 


TGCGTCCGCT 


TCAGCAAGTA 


CTACTCCATC 


1200 
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1380 

1440 

1500 

1560- 

1620 

1680 

1702 



1383 

AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGCCTCAG CTTCACCAAG 1260 
CACCAGTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC TCAGCAAGCA CCAGTGCCTC 
AGCTTCAGCA AGTACCAGTG CGTCAGCCTC ACCGTCGACA AGTGCGTCGC CTTCACCAAG 
TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT TCAGCATCAA CAAGTGCTTC 
AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCCTC 
AACCAGTGCC_TCTGAATCA^ CAXCA^ 

GGCTTCAGCA ACTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 
AACGAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 
TGAATCTGCA TCAACCAGTG CG 
(2) INFORMATION FOR SEQ ID NO: 368: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 941 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 
ACCAGTGCAT CAGCTTCAGC CTCAACAAGT GCTTCAGCCT CAGCGTCAAC CAGTGCCTCG 
GCTTCAGCAA GTACCAGTGC GTCACTTCAG CAAGCACAAG TGCGTCACTT CAGCATCAAC 
CAGTGCTTCG GCTTCGGCAT CAACAAGTGC CTCAGCATCA GCATCAACGA GTGCGTCACC 
TCAGCAAGTA CTAGTGCATC AGCATCAGCA TCAACCAGTG CATCAGCCTC AGCAAGTATC 
TCAGCGTCTG AATCGGCATC AACGAGTGCA TCAGCATCAG CATCAACGAG TGCATCGGCT 
TCAGCGTCAA CCAGTGCATC AGTCTCAGCA AGCACCAGTG CGTCGGCTTC AGCATCAACG 3 60 

AGTGCCTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCAGCC 420 
TCAGCAAGTA CTAGTGCATC GGCTTCAGCA AGCACCAGTG CGTCGGCTTC AGCATCAACC 



60 
120 
180 
240 

300 



480 



AGTGCCTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCAGCC 540 

TCAGCAAGTA CTAGTGCATC AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC 600 

AGCGCCTCAG CTTCACCAAG CACCAGTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC 660 

TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA 720 

AGTGCGTCGC CTTCACCAAG TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT 780 

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG 840 
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AGTCCTTCAC TCTCAGCCTC AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCCCCT 900 

TCAGCAAGCA CCACTGCCTC GGCTTCAGCA AGTACTAGTG C 941 

(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 869 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 

CAGCAAGTAC TAGTCCATCA CCTTCAGCAT CAACGAGTGC ATCGGCTTCT GCCTCAACCA 60 

GTGCATCAGA GTCAGCAAGT ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT 120 

CAGCAAGCAC CAGTGCGTCG GCTTCAGCAA GTACTAGCGC CTCAGCCTCA GCCTCAACCA 180 

GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT CTGAATCGGC ATCAACGAGT CCGTCCCCTT 240 

CAGCAAGTAC TAGCGCCTCA GCCTCACCCT CAACAAGTCC ATCGGCTTCA GCGTCAACGA 300 

GTGCGTCTGA ATCGGCATCA ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCACCCT 360 

CAGCGTCAAC AAGTGCATCG GCTTCAGCAT CAACGAGTGC GTCCGCTTCA GCAAGTACTA 420 

GCGCCTCAGC CTCAGCCTCA ACAAGTGCAT CGGCTTCAGC GTCAACGAGT GCGTCTGAGT 480 

CAGCATCAAC GAGTGCGTCA GCCTCAGCAA GCACATCAGC TTCTGAATCT GCATCAACCA 540 

GTGCGTCAGC CTCAGCATCG ACAAGCGCCT CAGCTTCAGC AAGTACCAGT GCGTCAGCCT 600 

CAGCGTCGAC AAGTGCGTCG GCTTCAGCAA GTACCAGTGC GTCACCCTCA GCAAGTACCA 660 

GTGCGTCAGC CTCAGCGTCG ACAAGTGCGT CGGCCTCAAC CAGTGCATCT GAATCGGCAT 720 

CAACCAGTGC CTCAGCCTCA GCAAGTACTA GTGCATCAGC TTCAGCATCA ACGACTGCAT 780 

CGGCTTCAGC ATCAACCAGT G CATC AG ACT CAGCAAGTAC CAGTGCGTCA GnTTCCGCAT 840 

GCAACAAGTG CCTCGGCTTC AGCAAGTAC 869 
(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 
<C> STRAND EDNESS : double 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 
TCAACAACTG CCTCAGCATC AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 60 
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TCAGCATCAG CATCAACCAG TGCATCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 120 

TCAACGAGTG CATCAGCATC AGCATCAACG AGTGCATCGG CTTCAGCGTC AACCAGTGCA 180 

TCAGTCTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA 240 

AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTACTGCA 300 

TCGGCTTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CCAGTGCCTC AGCCTCAGCA 360 

AGTATCTCAG-CGTCTGAATC-GGCATCAACG-AGTGCGTCAG-GGTCAGCAAG-TACTAGTGCA- '420 

TCAGCATCAG CATCAACGAG TGCATCGGCT TCAGCAAGTA CCAGCGCCTC AGCTTCAGCA 4 80 

AGCACCAGTG CGTCAGCCTC AGCAAGTACC AGCGCCTCAG CCTCAGCAAG CACCAGTGCC 540 

TCACCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCGTCGA CAAGTGCGTC GCCTTCAGCA 600 

AGTACCTCAG CGTCTGAATC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT 660 

TCAGCTTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC CGCTTCAGCA 720 

AGTACTAGCG CCTCAGCATC AGCGTCAACG 750 

(2) INFORMATION FOR SEQ ID NO: 371: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 957 base pairs 
(8) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 371: 

CCGGAAAACA GCTCTGGCGC TTCGTCTTGC CCAGCGTATT GCTAGTGGTG ACGTGCCTGC 60 

GGAAATGGCT AAGATGCGCG TGTTAGAACT TGATTTGATG AATGTCGTTG CAGGGACACG 120 

CTTCCGTGGT GACTTTGAAG AACGCATGAA TAATATCATC AAGGATATTG AAGAAGATGG 180 

CCAAGTCATC CTCTTTATCG ATGAACTCCA CACCATCATG GGTTCTGCTA GCGGGATTGA 240 

TTCGACTCTG GATGCGGCCA ATATCTTGAA ACCAGCCTTG GCGCGTGGAA CTTTGAGAAC 300 

GGTTGGTGCC ACTACTCAGG AAGAATATCA AAAACATATC GAAAAAGATG CGGCACTTTC 3 60 

TCGTCGTTTC GCTAAAGTGA CGATTCAAGA ACCAAGTGTG GCAGATAGTA TGACTATTTT 4 20 

ACAAGGTTTG AAGGCGACTT ATGAGAAACA TCACCGTGTA CAAATCACAG ATGAAGCGGT 4 80 

TGAAACAGCG GTTAAGATGG CTCATCGTTA TTTAACCAGT CGTCACTTGC CAGACTCTGC 540 

TATCGATCTC TTGGATGAGG CGGCAGCAAC AGTGCAAAAT AAGGCAAAGC ATGTAAAAGC 600 

AGACGATTCA GATTTGAGTC CAGCTGACAA GGCCCTGATG GATGGCAAGT GGAAACAGGC 660 
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ACCCCAGCTA ATCGCAAAAG AAGAGGAACT ACCTGTCTAC AAAGACTTGG TGACAGAGTC 7 20 

TGATATTTTG ACCACCTTGA GTCGCTTGTC AGGAATCCCA GTTCAAAAAC TGACTCAAAC 780 

GGATGCTAAG AAGTATTTAA ATCTTGAAGC AGAACTCCAT AAACGGGTTA TCGGTCAAGA 840 

TCAAGCTGTT TCAAGCATTA GCCGTGCCAT TCGCCGCAAC CAGTCAGGGA TTCGCAGTCA 900 

TAAGCGTCCG ATTGGTTCCT TTATCTTCCT AGGGCCTACA GGTGTCGGGG TATCCGA 957 



(2) INFORMATION FOR SEQ ID NO: 372: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
{ D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 



CAAAGCGCCT 


CAGCTTCAGC 


ATCAACAAGT 


GCGTCGGCTT 


CAGCATCAAC 


CAGTGCCTCG 


60 


GCTTCAGCGT 


CAACCAGTGC 


GTCACATTCA 


GCAAGTACCA 


GTGCTTCAGT 


CTCACCATCA 


120 


ACAAGTGCTT 


CAGCCTCAGC 


ATCGACAAGT 


GCCTCGGCTT 


CAGCAAGCAC 


ATCAGCATCT 


180 


GAATCAGCGT 


CAACCAGTGC 


TTCGGCTTCA 


GCAAGTACCA 


GTGCTTCAGC 


TTCAGCATCA 


240 


ACCAGCGCCT 


CGGCCTCAGC 


AAGCACCTCA 


GCTTCTGAAT 


CGGCCTCAAC 


CACCGCCTCG 


300 


GCCTCAGCAA 


GCACCTCAGC 


TTCTGAATCG 


GCCTCAACCA 


GCGCCTCAGC 


CTCACCATCA 


360 


ACGAGTGCTT 


CGGCTTCAGC 


AAGCACAAGC 


GCCTCGGGTT 


CAGCATCAAC 


GAGTACGTCA 


420 


GCTTCAGCGT 


CAACCAGTGC 


TTCAGCCTCA 


GCATCAACAA 


GTGCGTCAGC 


CTCAGCAAGT 


480 


ATCTCAGCGT 


CTGAATCGGC 


ATCAACGAGT 


GCGTCTGAGT 


CAGCATCAAC 


GAGTACGTCA 


540 


GCCTCAGCAA 


GCACCTCAGC 


TTCTGAATCG 


GCCTCAACCA 


GTGCGTCAGC 


CTCAGCATCG 


600 


ACAAGCGCCT 


CAGCTTCAGC 


AAGTACCAGT 


GCTTCAGCCT 


CAGCGTCGAC 


AAGTGCGTCG 


660 


GCCTCAACCA 


GTGCATCTGA 


ATCGGCATCA 


ACCAGTGCGT 


CAGCCTCAGC 


AAGTACTAGT 


720 


GCATCGGCTT 


CAGCATCAAC 


CAGTGCCTCG 


GCTTCAGCGT 


CAACCAGTGC 


GTCAGCTTCA 


780 


GCAAGTACCA 


TGTGCTTCAT 


GTCTCAG 








807 



(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 
IB) TYPE: nucleic acid 
iC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 



CATCGGCTTC 


AGCATCAACG 


AGTCCCTCCC 


CTTCAGCAAG 


TACTACCGCC 


TCAGCCTCAG 


60 


CGTCAACAAG 


TGCATCGGCT 


TCAGCGTCAA 


CGAGTGCGTC 


TGAGTCAGCA 


TCAACGAGTG 


120 


CGTCACCTCA 


GCAAGCACAT 


CAGCTTCTGA 


ATCTGCATCA 


ACCAGTGCGT 


CACCTCAGCA 


180 


TCGACAAGCG 


CCTCACCTTC 


AGCAAGTACC 


AGTGCGTCAC 


CTCAGCGTCG 


ACAAGTGCGT 


240 


CGGCTTCAGC 


AAGTACCAGT 


GCGTCASCTC 


AGCAAGTACC 


AGTGCGTCAC 


CTCAGCGTCG 


300 


ACAAGTGCGT 


CGGCCTCAAC 


CAGTGCATCT 


GAATCGGCAT 


CAACCAGTGC 


GTCACCTCAG 


360 


CAAGTACTAG 


TGCATCAGCT 


TCAGCATCAA 


CGAGTGCATC 


GGCTTCAGCA 


TCAACCAGTG 


420 


CATCAGAGTC 


AGCAAGTACC 


AGTGCGTCAG 


CTTCCGCATC 


AACAAGTGCC 


TCGGCTTCAG 


480 


CAAGTACTAG 


CGCCTCAGCC 


TCAGCGTCAA 


CAAGTGCTTC 


AGCTTCCGCG 


TCAACCAGCG 


540 






TCACCGTCTG 


AATCGGCATC 


AACAAGTGCC 


TCGGCTTCAG 


600 


CATCAACGAG 


TGCATCAGTC 


TCAGCAAGCA 


CCAGTGCCTC 


GGCCTCAGCA 


AGCACCAGCG 


660 


CGTCTGAATC 


CGCATCAACC 


AGTGCCTCAG 


CTTCAGCAAG 


TACCTCAGCA 


TCTGAATCAG 


*720 


CATCAACAAG 


TGCATCGGCT 


TCAGCAAGCA 


CAAGTGCTTC 


AGCCTCAGCA 


AGTATCTCAG 


780 


CGTCTGAATC 


GGCATCAACG 


AGTGCGTCCG 


CTTCAGCAAG 


TACTACCGCC 


TCAGCATCAG 


840 : 


CGTCAACAAG 


TGCTTCGGCT 


TCAGCGTCAA 


CGAGTGCGTC 


TGAGTCAGCA 


TCAACGAGTA 


900 * 


CGTCAGCCTC 


AGCAAGCACA 


TCAGCTTCTG 


AATCTGCATC 


AACCAGTGCG 


TCAGCCTCAG 


960 


CATCGACAAG 


CGCCTCAGCT 


TCAGCAAGTA 


CCAGTGCGTC 


AGCCTCAGCA 


AGTACCAGTG 


1020 * 


CTTCAGCCTC 


AGCGTCGACA 


AGTGCGTCGG 


GCTCAACCAG 


TGCATCTC 




1068 - 



(2) INFORMATION FOR SEQ ID NO: 374: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 
(B> TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

CAGCATCAAC GAGTGCTTCA GTTTCAGCGT CAACCAGTGC CTCTGAATCA GCTTCAACAA 60 

GTGCCTCGGC TTCAGCAAGC CCCAGTGCGT CGGCTTCAGC AAGTACTAGT GCATCGGCTT 120 

CAGCATCGAC AAGTGCGTCT GAATCGGCAT CAACGAGTGC TTCGGCTTCA GCATCAACGA 180 

GTGCGTCAGC CTCAGCAAGC ACATCAGCTT CTGAATCTGC ATcAACCAGT GCGTCCGyTT 24 0 
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CAGCCTCAAC CAGTGCGTCG GCTTCAGCCT CCACAAGTGC TTCGGCTTCA GCATCAACGA 300 

GTGCGTCGGC CTCAGCAAGC GCAAGTACCT CAGCGTCAGC TTCCGCCTCA ACCAGTGCGT 360 

CGGCTTCAGC AAGCACAACT GCGTCAGCCT CAGCAACTAT CTCAGCCTCT GAATCGGCAT 420 

CAACGAGTGC GTCTGAGTCA GCATCAACGA GTACGTCAGC CTCAGCAAGC ACATCAGCTT 480 

CTGAATCTCC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA GCTTCAGCAA 540 

GTACCAGTGC TTCAGCCTCA GCGTCGACAA GTGCGTCGGC CTCAACCAGT GCATCTGAAT 600 

CGGCATCAAC CAGTGCGTCA 620 
(2) INFORMATION FOR SEQ ID NO: 375: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 
iO) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 



GTATTGGGGC 


GCCCCAACCT 


CTATGTGACT 


ACGGATTATT 


TCCTAGATTA CATGgGGATA 


60 


AACCATTTAG 


AAGAATTACC 


AGTGATTGAT 


GAGCTTGAGA 


TTCAAGCCCA 


AGAAAGCCAA 


120 


TTATTTGGTG 


AAAGGATAGA 


AGAAGATGAG 


AATCAATAAG 


TATATTGCCC 


ACGCAGGTGT 


180 


GGCCAGTAGG 


AGAAAAGCAG 


AAGAGCTGAT 


TAAGCAAGGC 


TTGGTGACGG 


TTAACGGCCA 


240 


AGTGGTGCGT 


GAACTAGCAA 


CCACTATCAA 


GTCAGGCGAC 


AAGGTCGAAG 


TTGAAGGTCA 


300 


ACCTATCTAC 


AACGAAGAAA 


AGGTCTACTA 


TCTGCTTAAC 


AAACCACGCG 


GTGTGATTTC 


360 


CAGTGTGACA 


GATGATAAGG 


GTCGCAAGAC 


GGTTGTCGAC 


CTCTTGCCCA 


ATGTCAAAGA 


420 


GCGTATTTAC 


CCTGTGGGTC 


GTTTGGACTG 


GGATACATCA 


GGTGTCTTGA 


TTTTGACCAA 


480 


TGATGGGGAC 


TTTACAGACG 


AGATGATTCA 


CCCTCGTAAT 


GAGATTGACA 


AGG'PTTATGT 


540 


CGCGCGTGTT 


AAAGGTGTGG 


CCAATAAGGA 


CAATCTCCGC 


CCCTTGACCC 


GTGGTCTTGA 


600 


GATTGATGGT 


AAGAAAACCA 


AGCCATAATA 


TATAGGTTTT 


GTAGCCTCTA 


CACCATAAAT 


660 


ATTTGCTAAT 


AAAAATACTG 


TATTATTACC 


CTCTTAAGGT 


GCGAAATTAT 


TCAAGTTCTT 


720 



(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 
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txi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

CGCCATTTCC CATCGTACCG CCGAAAATCC CAGCGCCTCA GCCATCAAAT ATCCTATCAA 60 

CGTTCTCAAA AAAAGTGACC GCTCTCTCAT CATGTTTCCA AGTGGTAGCC GCCACTCAAA 120 

CGATGTCAAG GGGGGCGCAC ACTsJcATTGC CAAAATGGCC AAGGTCCGTA TCATGCCCGT 180 

TACCTACACC GGTCCCATGA CTTTGAAGGG CTTGATTAGC CGTGAACGTG TCGATATGAA 240 

CTTTGGAAAT CCAATCGATA TCTCAGATAT CAAGAAAATG AATGATCAAG GCATTGAAAC 300 

AGTCGCCAAT CGTATTCAAA CAGAATTCCA ACGTCTGGAC GAAGAAACGA AACAATGGCA 360 

CAATGATAAA AAACCAAATC CACTCTGGTG GTTTATCCGC ATCCCTGCCC TCATCCTTGC 420 

TATTATCCTC GCTATCCTAA CCATCATCTT TAGCTTTATC GCAAGCTTCA TCTGGAACCC 480 

AGATAAGAAA AGAGAAGAAC TTGCATAGAA GAAATGAACC TTGGCCAAAC AGCTAAGGTT 540 

TTCATTTATA TAGTAGATTG GwACTAGAAT AGTACACCTC TACTTCTAAA ACATTTTTAC 600 

AAATCGATTT GACTGTCCTG ATCGATTTGT CCTAATCTTA TTTCAATT 648 

(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 690 base pairs 

( B ) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 



GTGCATCGCT 


TTCAGCATCG 


ACAAGTGCGT 


CTGAATCGGC 


ATCAACGAGT 


GCTTCGGCTT 


60 


CAGCATCAAC 


GAGTGCGTCA 


GCTTCAGCAA 


GCACATCAGC 


TTCTGAATCT 


GCATCAACCA 


120 


GTGCGTCCGC 


TTCAGCGTCA 


ACCAGTGCGT 


CGGCTTCACC 


GTCGACAAGT 


GCTTCGGCTT 


180 


CAGCATCAAC 


GAGTGCGTCG 


GCCTCAGCAA 


GCGCAAGTAC 


CTCAGCGTCA 


GCTTCCGCCT 


240 


CAACCAGTCC 


GTCCGCTTCA 


GCAAGCACAA 


GTGCGTCAGC 


CTCAGCAACT 


ATCTCAGCGT 


300 


CTGAATCGGC 


ATCAACGAGT 


GCGTCGGCCT 


CAGCAAGCGC 


AAGTACCTCA 


GCGTCAGCTT 


360 


CCGCCTCAAC 


CAGTGCGTCG 


GCTTCAGCAA 


GCACAAGTGC 


GTCAGCCTCA 


GCAAGTATCT 


420 


CAGCGTCTGA 


ATCGGCATCA 


ACGAGTGCGT 


CTGAGTCAGC 


ATCAACGAGT 


ACGTCAGCCT 


480 


CAGCAAGCAC 


ATCAGCTTCT 


GAATCGGCAT 


CAACCAGTCC 


GTCAGCCTCA 


GCATCGACAA 


540 


GCGCCTCAGC 


TTCAGCAAGT 


ACCAGTGCTT 


CACCCTCAGC 


GTCGACAAGT 


GCGTCGGCCT 


600 


CAACCAGTCC 


ATCTGAATCG 


GCATCAACCA 


GTGCGTCAGC 


CTCAGCAAGT 


ACTAGTGCAT 


660 
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CACCTTCAGC ATCAACGACT GCATCGGCTT 690 
(2) INFORMATION FOR SEQ ID NO: 378: 

{ i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
< 0) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 



CGAGATTCTC 


TGGAGTTATG 


GATGTCGTTC 


CAATATGTGC 


ACGTTGGAAT 


GTTAGTGCTT 


60 


ATATCCCCGG 


AACAGAATCC 


TCTCTTGATT 


GAAGACAAGC 


TAGTCATTAG 


GCTGGTTTGT 


120 


k 4 & * I k 


ACTGTAGTGG 


GTTGATATAA 


TAGTATTAGT 


GAGTGGGATA 


AAAGTTTCAT 


180 


TTAGTTTATT 


CAGTACAAAT 


TTAACGGGTC 


AAGATTTATA 


TACTAGTGCT 


GTTTTTGGGG 


240 


CTGAGAGAAG 


TATCTTGATT 


TTATGTCTGG 


TTTTTATACT 


TACAGTTGTT 


CTGCTCCAAA 


300 


GAGCTTGTAG 


AGAAGAATTA 


GCTCATAAAG 


GAGATTGATT 


ATTTTGATAT 


CAAAAAAATG 


360 


CACAGGATAA 


CCTGATGCAT 


TTTTTTAGCG 


ACAATGCTTG 


CTACTTCCTT 


CTGTCGAATT 


420 


TAGACAATTT 


TAAACCCCAA 


TTATTCACCC 


CAAATCTAAA 


AACCATCCAG 


AATCCTTGCC 


480 


TTAGCTTAGA 


TCCTGGATGG 


TTTcrrrTTT 


CACCCAATGG 


GTGTTTTTTA 


CTAGACAAAA 


540 


AAGAGTTTCC 


CCTTTATGCT 


ATAACTGTAG 


AAAAAAACAC 


AAAAAGAAAG 


GAAACTCACA 


600 


TGAACAGTTT 


ACCAAATCAT 


CACTTCCAAA 


ACAAGTCTTT 


TTACCAACTA 


TCTTTCGATG 


660 


GAGGTCATTT 


AACCCAGTAT 


GGTGGTCTTA 


TCTTTTTTCA 


GGAACTTTTT 


TCCCAGTTGA 


720 


AACTAAAAGA 


GCGGATTTCT 


AAGTATTTAG 


TAACCAATGA 


CCAACGCCGC 


TACTGTCGTT 


780 


ATTCGGATTC 


AGATATCCTT 


GTCCAGTTCC 


TCTTTCAACT 


GTTAACAGGT 


TATGGAACGC 


840 


ACTATGCTTG 


TAAAGAATTG 


TCAGCTGATG 


CCTACTTTCC 


AAAATTATTG 


GAAGGAGGGC 


900 


AGCTTGCTTC 


ACAGCCAACC 


TTATCCCGTT 


TTCTTTCCAG 


AACTGACGAG 


GAAACAGTCC 


960 


ATAGTTTCCG 


ATGCCTCAAC 


CTTGAATTGG 


TCGAATTCTT 


TTT 




1003 



(2) INFORMATION FOR SEQ ID NO: 379: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 
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CCGATGATTC TGATTGGTTT GCTCTTTACT TTGCTOGGAA TTTTGAGGTA GATCTATCAT 60 

TG AAA TACT A ATTGTTTTAG CTATTATCCT ATCTCTTCCT TTGATTGTAT TGGTAACTAT 120 

ACAACCCCGT CAAAATCAAC TATTTTCCAT GGATGCCACT AGTAATATTG GTAAACCAAG 180 

CTACTGGCAG AGCAACACCT TGGTCAAGGT GCTCACTTTA TTGGTGAGTT TGGCTTTATT 240 

TATTCTACTA TTAACCTTTA TGGTGATTAC TTATAAATAA AAGAAAACTT CAGATATTCA 300 

CCTTTTGTGG ^TTPGTC^ A _ AGTTTrCTTT_TTT ATACTCA . ATGAAAATCA - AAGAGCAAAC -360- 

TAGGAAGCTA GCCGCAckGC TCAAAACACC GTTTTGAGGT TGTAGATATA ACTCACGACc 420 

GACTCAAAAC ACCGTTTTGA GGTTGTAGAT ATAACTGACG AGcGACTCAA AACACCGTTT 480 

TGAGGTTGTG GATAGAACTG ACGAGcGACT CAAAACACCG TTTTGAGGTT GTGGATAGAA 540 

CTGACGAAGT CGcTCAAAAC ACCGTTTTGA GGTTGTGGAT AGAACTGACG AAtgctCAAA 600 

ACACCGTTTT GAGGTTGTGG ATAGAACTGA CGAAGCgaaC ATATATACAG CAAGGCGACG 660 

CTGACGTGGT TTGAAGAGTA TTACTGTCTA TATTTTTGGT AAAAATCAAC TTTTACTTGG 720 

ATGAAGGTTT T T ' ITTTIT 733 
(2) INFORMATION FOR SEQ ID NO: 380: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 380: 

CCGTCTTATC AAAGAGGTTA ACAAAGGCAC CAAATTTCTC GATACGAACG ACTTTAGCAC 60 

GGTAAACTTC ATCCACTTTG GCTTCACGAA CCAAACCAGC AATAATTTCT TTGGCACGGT 120 

TAATAGCATC TTGGTCACTA GAGTAGATAG ACACATTTCC TTCTTCGTCT ATATCAATCT 180 

TAACACCTGT TTCAGCGATA ATCTTGTCGA TGGTTTCTCC ACCCTTACCG ATGACAATCT 24 0 

TAATCTTGTC CACATCAATC TTGATCGTAT CAATTTTCGG AGCAGTTGGA GCCAATTCTG 300 

GACGAACTTC TGGAATGGTT GCTTCAATGA CATCAACGAT TTCAAAACGC GCTTTCTTGG 360 

CTTGAGCAAG AGCCTCCCTC AAGATTTCTG CAGTAATCCC TTGAATCTTG ATATCCATTT 420 

GAAGGGCTGT AATCCCATCA CGAGTACCTG CAACCTTGAA GTCCATATCT CCAAAGTGAT 480 

CTTCCAAACC TTGGATATCT GTCAATACTG TGTAGTTATT TCCATCTGAG ATAAGCCCCA 540 

TAGCAATACC AGCTACTGGC GCCTTGATTG CCACACCACC AGCCATAAGG GCAAGAGTTC 600 
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CCGCACAGAT AGAAGCTTGA GATGAAGAAC CGTTTGATTC CAAAACTTCT GCTACTAGAC 660 
GGATAGCGTA GGGGAATTCT TCCAAGCTTG GCAGG 695 
(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 691 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

GACATCTTAT CTAAATACAT GCTAATATAT TTAGATACAA ACATTCCAAC TTGATAATTT 60 

TCACTCATCT TTCATCATTC CTTATACAAC TATGCAGTAT AAATAGAATA CTTTTCTCAT 120 

CAGAATGAGA CTATTTTAAT ATTAGATCCC CAATTATTCA CCCCAAATCT AAAAACCATC 180 

CAGAATCCTT GCCTTAGCTT AGATCCTGGA TGGTTTCTTT TTTCACCCAA TGGGT G TTTT 240 

TTACTAGACA AAAAAGAGTT TCCCCTTTAT GGTATAAGTG TAGAAAAAAA CACAAAAAGA 300 

AAGGAAACTC ACATGAACAG TTTACCAAAT CATCACTTCC AAAACAAGTC TTTTTACCAA 360 

CTATCTTTCG ATGGAGGTCA TTTAACCCAG TATGGTGGTC TTATCTTTTT TCAGGAACTT 420 

TTTTCCCAGT TGAAACTAAA AGAGCGGATT TCTAAGTATT TAGTAACGAA TGACCAACGC 480 

CGCTACTGTC GTTATTCGGA TTCAGATATC CTTGTCCAGT TCCTCTTTCA ACTGTTAACA 540 

GGTTATGGAA CGGACTATGC TTGTAAAGAA TTGTCAGCTG ATGCCTACTT TCCAAAATTG 600 

TTGGAAGGAG GGCAGCTTGc TTCACAGCCA ACCTTATCCC GwTTTCTTTC CAGAACTGAC 660 

GAGGAAACAG TCCATAGTTT GCGATGCCTC A 691 



(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

ATCTCTCTGC GTAATGGTCC TCAGATAACT CTGATGATGT CTCGCGATAT AGAACTGAGC 60 

CAAGTTATGC CTAAAGGGCC TTAGGAATAG GAGCTTTCAC AAGCTTATCC AGATGATTAT 120 

CTTTTACTCG TTATGGACAA TGCTATATGG CATAAATCAA GTACCTTAAA GATTCCGACT 180 

AATATTGGCT TTGCATTTAT TCCTCCATAC ACACCAGAGA TGAACCCCAT TGAACAAGTG 240 
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TCGAAAGAGA TTCGTAAACG TCCATTTAAG AATAAAGCCT TTCGAACTTT GCAAGATGTC 300 

ATACAAGGAC TGGAGAAGGA GGTGATAAAG TCCATCGTTA ATCGGAGACC GACTAGAATG 360 

CrmTGAAA ACAGATGAGT ATAAAAAGAA AGTCCTCATT TCAATAGAAA TCACGACTTT 4 20 

CTGATGAATT TATAGTAAAA TGAAATAAGA ACAGGATAGT CAAATCGATT TCTAACAATG 480 

TTTTAGAAGC AGAGGTGTAC TATTCTAGTT TAAATCCACT ATATTTGGGG AGTGATAGAA 540 

AAGCCCTTCA TCAGCCAATC TACTPGTTCA. GGTGCGAGAG CTTTGACATC CTTTTCTGTA^ 600 - 

CTGGACCAAG TCAGTTTTCC GTTCTCAAAG CCTTTATATA ATATCCAAAA TCCTTGACCA 660 

TCCCAGTAAA GAACTTTAAA GCGGTCTTTA CGTCCACCAC AAAAGACAAA GACTTGATCG 720 

GAGAAAGGAT CCAATTCAAA GTGGGTTTGG 750 
(2) INFORMATION FOR SEQ ID NO: 383: 

(iJ SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 383: 
TCAAATTCTT CGTGGTCCGC ATATCTnTCT TCGTACACGG CAGTCACTTG GTCTTTCACT 60 

ACTCGAGTCG CAGCTTCACG GGCCAATTTC TCTTCTACTT GAACTGCCTT TTGGAGGTCA 120 

CTGTTGTAGG CTGCAATGAT TTCAGCTTGC AATTCAGCAT CCACGTGAAG CAATTCCACT 180 

TCTGCTTTTT CTTTACCGAC AGCAGCAACG A T TTCTTCTT GGAAGGCAAT CAATTCTTTG 240 

ACAGCTTCGT GCCCTTTAAG GAGCGCTTCC AACATGATTT CTTCTGACAA TTCTTTGGCA 300 

CCAGACTCTA CCATGTTGAT AGCGTGCTTG GTTCCACCTA CTGTCAATTC AAGAAGAGAT 360 

TGCTCTGCTT GTTCTTGACT TGGGTTGATG ATGATTTGGC CATCTACATA TCCCACTTGT 420 

ACCCCAGCAA TTGGTCCGTC AAATGGAATA TCTGAAATAG ACAGTGCCAA AGATGAACCA 4 80 

AACATAGCAG CCATTGGTGC AGATGCATTT TCATCATAAG AAAGCACTGT ATTGATGACT 540 

TGGACTTCAT TACGGAAACC TTCCGCAAAC ATAGGACGAA TCGGACGGTC AATCAAACGC 600 

GCTGTCAAGG TCGCATCTGT TGAAGGACGT CCTTCACGTT TCATAAAGCC ACCAGGAAAC 660 

TTCCCAGCCG CATACATTTT TTCTTCGTAG TTGACTTGGA GTGGGAAGAA ATCCTCAGTT 720 

GCCATTTTCT GGGGATCC 738 
{2) INFORMATION FOR SEQ ID NO: 384: 
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U> SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 657 base pairs, 

(B) TYPE: nucleic acid 

(C) STRAND EDMESS : double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 384T 

CCCCCTATTT ACCGTGGACT AAA G TTGTAC AAGAAAAGTG CAAATAAGAA ATCTCCAGAT 60 

TAGGAACTAT ATATGAGTTC TCTAGTCTGG AGATTTTTCA ATAGACTTCG TTATTGGGCG 120 

GTTACTTTCG AAACTTTGAA AACTTCAAAA AACGGATTTT TATCGCTTTC AAATTCTTTT 180 

GGGGTCAAAC TCAGTAACTT ATTCGCCTTG TAGACTTCAT GACGCTCAGG GTATACTTTC 240 

AAGGTCCCAA ATAGCCAAGA ATCGTCAGCG ATATTATCTG AATCATCTCC TTCTTGTTCT 300 

CCTTTAGTTC GCCTGAGGAC AGCCTTGACA CGCGCCAGAA TTCTCTAGGG CTAAAAGGCT 360 

TGGTCAGGTA GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT 420 

TCGCAGAAAC CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT 4 80 

CCATGCCATC TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT 540 

CTGCCAAAGC TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC 600 

TTAAATGGTA GTCAAGCAAT TTCAGAATGT GTTCTTCATC ATCCACTAAT AAGACTT 657 

(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 
< B > TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

CCGCATCAGC ATCAACGAGT GCATCGGCTT CACGTCAACC AGTGCATCAC TCTCAGCAAG 60 

CACCAGTGCG TCGGCTTCAG CATCAACGAG TGCCTCAGCC TCAGCAAGTA TCTCAGCGTC 120 

TGAATCGGCA TCAACGAGTG CGTCAGCTCA GCAACTACTA GTGCATCGGC TTCAGCAAGC 180 

ACCAGTGCGT CGGCTTCAGC ATCAACCAGT GCCTCAGCCT CAGCAAGTAT CTCAGCCTCT 240 

GAATCGGCAT CAACGAGTGC GTCACCTCAG CAAGTACTAG TGCATCAGCA TCAGCATCAA 300 

CGAGTGCATC GGCTTCAGCA AGTACCAGCG CCTCACCTTC AGCAAGCACC AGTGCGTCAC 360 

CTCAGCAAGT ACCAGCGCCT CAGCCTCAGC AAGCACCAGT GCCTCAGCTT CAGCAAGTAC 420 

CAGTGCGTCA CCTCAGCATC GACAAGTGCG TCGGCTTCAG CAAGTACCTC AGCGTCTGAA 480 
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TCACCATCAA CGAGTGCGTC AGCTTCAGCA TCAACCACTG CCTCAGCCTC AGCAACTATC 540 
AGTGCGTCAG CTTCAGCATC AACGAGTGCG TCACCTGCAG CAAGTA 586 
(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
(D> -TOPOLOGY:— linear- 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 

CGTCCGCTTC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT TCAGCTTCAG 60 

CAACTACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCTTC AGTCTCAGCG TCAACCACTG 120 

CCTCTGAATC CGCATCAACA AGTGCCTCGG CTTCAGCAAG CACCAGTGCT TCGGCTTCAG 180 

CCTCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTGCGTC ACCTCAGCAA GCACATCAGC 240 

TTCTGAATCT GCATCAACCA GTGCGTCAGC TTCCGCATCA ACAACCGCCT CGGCCTCAGC 300 

AAGTACAAGT GCTTCAGCCT CAGCATCAAC CAGTGCATCA GCTTCAGCCT CAACAAGTGC 360 

TTCAGCCTCA GCGTCAACCA GTGCCTCGGC TTCAGCAAGT ACCAGTGCGT CAGTTCAGCA 420 

AGCACAAGTG CGTCAATTTA GCATCAACCA G 451 
(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 387; 

TCTCAGCAAG CACCATTGCG TCGGCTTCAT CAAGCACCAG CGCGTTTCAA TCCGCATCAA 60 

CCAGTGCT TC AGCTTCAGCC AACTTACCTC AGCATCTGAA TCAGCATCAA CAAGTGCATC 120 

GGCTTCAGCA AGCACAAGTG CTTCAGCtCA GCAAGTATCT CAGCGTCTGA ATCGGCATCA 180 

ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCAT CAGCGTCAAC AAGTGCTTCG 240 

GCTTCAGCGT CAACGAGTGC GTCTGAGTCA GCATCAACGA GTACGTCAGC CTCAGCAAGC 300 

ACATCAGCTT CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA 360 

GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGTACCA GTGCTTCAGC CTCAGCCTCG 420 
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ACAAG 425 

(2) INFORMATION FOR SEQ ID NO: 388: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 
(B> TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

AGAGGATCCC CGGATCCTCA GTCGCTGAGA TAACTCCTTT GGGCTTCTTC ATCATGTAGT 60 

AGACAAACTC TTCATACTCC AACACTTGCC CATTTTATGC GAATCTCATC TATTTTTTCT 120 

TTTTTTTGCA ATTTAGCTGA TTTTl ' CTT TT TTACCATTTA CAGTCACGCG CCCAGCCTTC 180 

AGCAAGTTTT TGACCTCAGT CCGACTTCCC ACCGCACAGG CAACTAAAAA TTTATCTAAT 240 

CTCATAGAAC TATTATATCA TATCAAAAGG AGGCTAGTAC AATGACCAAC CTCCTTTTCG 300 

TTTCATACTC TTCAAAAATC TCTTCAAACC GCGTCAACGT CGCCTTGCCG TATATATGTT 360 

ACTGACTTCG TCAGTTCTAT CTGCAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT 420 

TCTATCTGCA ACCTCAAAGC AGTGCTTTGA GCATCCTGCG GCTAGTTTCC JcAGTWGCTC 480 

TTTGATTTwC ATTGAGTATC AGATTTAGGA AATTAACTTC CTCGJcCTCCA AAAAAkAGCT 540 

AAAACAATCA AGGCTCCTAA AATCGCTGGG AT 572 
(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

CAACAAGTGC CTCGGCTTCA GCATGCACAA GTGCTTCAGC TTCAGCATGT ACCTGAGCGT 60 

CTGAATCAGC ATCAACGTGT GCGTCCGCTT CAGCATGTAC TGCTGCCTCA GCATCAGCGT 120 

CAAcAwGTGC TTCGGCTTCA GCGTCAACGA CTC C GT C TCA GTCAGCATCA ACGAGTACGT 180 

CAGCCTCAGC AAGCACATCA GCTTCTGAAT CTGCATCAAC CAGTGCGTCA GCCTCAGCAT 240 

CGACAAGCGC CTCAGCTTCA GCAAGTACCA GTGCGTCAGC CTCAGCAAGT ACCAGTGCTT 300 

CAGCCTCAGC GTCGACAAGT GCGTCGGCCT CAACCACTGC ATCTGAATCG GCATCAACCA 360 

GTGCGTCAGC CTCAGCAAGT ACTAGCGCCT CAGCCTCAGC ATCAACGAGT GCGTCCGCTT 420 
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CAGCAACTAC TACTCCATCA GCATCAGCAT CAACGAGTGC ATCGCCTTCA GCAAGTACCA 480 
GCGCCTCACC TTCAGCAAGC ACCGG SOS 
(2) INFORMATION FOR SEQ ID NO: 390: 

ti) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 447 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND EDN ESS : double 

<D) TOPOLOGY:- linear - ■ 

{xi> SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

GCTAAGACTA CCTCATTAGG GGCATAGGCT GCTAAAATAA CTGCAGCTGT GGTTAATGAC 60 

AATACTGTAC TTTTTTTCAT TTTAATTCCT TACATATTTA TATAACTTCC AATAGATAAT 120 

AAACTTTAAC TTTGCTAGCC TTTGTTATAA AAAGTTTTAC TAAGTATTAT CTAGGAAATA 180 

GAGTAGTACA TTTATATATA ATTGTTATCT CTCTATAAAA ACAGTATATC ATTTAAAAAA 240 

ATTTAAGTCA AAAAAATTAA CATTAGTTAA TTTATTTTTT AGCACACATT AAAAAATAAG 300 

ATTAGTACTC AATGAAAATC AAAGAGCAAA CTAGGAAACT AGCCGCAGAT TGCTCAAAAC 360 

AGTCTTTTGA GGTTGTAGAT GGAATGACGT AGTCAGCTCA AAACACTGTT TTGAAGTTGT 420 

GGATAGAACT GACGAAGTCG GTACCGA 447 

(2) INFORMATION FOR SEQ ID NO: 391: 

<i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 572 base pairs 
(B) TYPE: nucleic acid 
<C> STRAND EDNESS : double 

(D) TOPOLOGY: linear 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 

AGCA C TTGTC GTTGAATTCT ACAACAAAAT GTTGTAATAT TTTATTGAAT AAGATAGGCC 60 

TTGATATTAA GCACTTTGGG ACGTTCTCCC TTAGTGCTTT TTTGATTTCT CTTAGTATCC 120 

AGCTATAATC GTTGAGACAT AACTAGACCG ATATAGTCCA AAGTGATATA GTAAAATGAA 180 

CCAAAAATAG TACACAATGT GGTATAATCC TTTTATGGCA TATTCAATAG ATTTTCGTAA 240 

AAAAGTTCTC TCTTATTGTG AGCGAACAGG TAGTATAACA GAAGCATCAC ACGTTTTCCA 300 

AATCTCACGT AATACCATTT ATGGCTGGTT AAAGCTAAAA GAGAAAACAG GAGAGCTAAA 360 

CCACCAAGTA TAGTGTATTG AATCTATAAC AGTACACCTT GGCTGCTAAA ATATTTCTAT 420 
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AAATTAATTT CACTTTCCTG ATAGAGATGT TCACATCTTA TTTCAAACTA CTATATAAGT 480 

TCTATAATCT CTTTATAAGA TTPGCCCATC AGACAAAATA GAACGATTTG AAGGCGTTTA 540 

TGATATTTAG CTGTACGAGA GTCTTTTAAA AG 572 
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DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed by 
the applicant, any request made by a third party for the furnishing of a sample shall indicate the 
expert to be used. That expert may be any person entered on a list of recognized experts drawn 
up by the Danish Patent Office or any person approved by the applicant in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the aft. The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the 
Form PUT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such a 
request has been filed by the applicant, any request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for the International 
publication of the application. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapse, the microorganism shall be made 
available as provided in Rule 3 1F(1) of the Patent Rules only by the issue of a sample to an 
expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever two dates occurs earlier. 
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SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegians Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent Office or any person approved by the applicant in the 
individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 



FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. 

ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected in the art. 
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What Is Claimed Is: 

1. Computer readable medium having recorded thereon the nucleotide 
sequence depicted in SEQ ID NOS: 1-391, a representative fragment thereof or a 
nucleotide sequence at least 95% identical to a nucleotide sequence depicted in SEQ 
ID NOS: 1-391. 

2. Computer readable medium having recorded thereon any one of the 
fragments of SEQ ID NOS : 1 -39 1 depicted in Tables 2 and 3 or a degenerate variant 
thereof. 

3. The computer readable medium of claim 1, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

4. The computer readable medium of claim 3, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

5. A computer-based system for identifying fragments of the Streptococcus 
pneumoniae genome of commercial importance comprising the following elements: 

a) a data storage means comprising the nucleotide sequence of SEQ D 
NOS: 1-391, a representative fragment thereof, or a nucleotide sequence at least 
95% identical to a nucleotide sequence of SEQ ID NOS: 1-391; 

b) search means for comparing a target sequence to the nucleotide sequence 
of the data storage means of step (a) to identify homologous sequence(s), and 

c) retrieval means for obtaining said homologous sequence(s) of step (b). 

6. A method for identifying commercially important nucleic acid fragments 
of the Streptococcus pneumoniae genome comprising the step of comparing a 
database comprising the nucleotide sequences depicted in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to a 
nucleotide sequence of SEQ ED NOS:l-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said - 
target sequence, wherein said target sequence is not randomly selected. 
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7. A method for identifying an expression modulating fragment of 
Streptococcus pneumoniae genome comprising the step of comparing a database 
comprising the nucleotide sequences depicted in SEQ ID NOS: I -391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to 
the nucleotide sequence of SEQ ID NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence comprises sequences known to 
regulate gene expression. 

8. An isolated protein-encoding nucleic acid fragment of the Streptococcus 
pneumoniae genome, wherein said fragment consists of the nucleotide sequence of 
any one of the fragments of SEQ ID NOS: I -39 1 depicted in Tables 2 and 3, or a 
degenerate variant thereof. 

9. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome SEQ ID NOS:l-391 depicted in Tables 2 and 3 or a 
degenerate variant thereof. 

10. An isolated fragment of the Streptococcus pneumoniae genome, 
wherein said fragment modulates the expression of an operably linked open reading 
frame, wherein said fragment consists of the nucleotide sequence from about 10 to 
200 bases in length which is 5' to any one of the open reading frames depicted in 
Tables 2 and 3 or a degenerate variant thereof. 

11. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome of claim 8. 

12. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 8. 

13. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 10. 
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14. A method for regulating the expression of a nucleic acid molecule 
comprising the step of covalently attaching to said nucleic acid molecule a nucleic 
acid molecule consisting of the nucleotide sequence from about 10 to 100 bases 5' 
to any one of the fragments of the Streptococcus pneumoniae genome depicted in 
SEQ ID NOS: 1-391 and Tables 2 and 3 or a degenerate variant thereof. 

15. An isolated nucleic acid molecule encoding a hpmolog of any of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and 
Tables 2 and 3, wherein said nucleic acid molecule is produced by a process 
comprising steps of: 

a) screening a genomic DNA library using as a probe a target sequence 
defined by any of SEQ ID NOS: 1-391 and Tables 2 and 3, including fragments 
thereof; 

b) identifying members of said library which contain sequences that 
hybridize to said target sequence; and 

c) isolating the nucleic acid molecules from said members identified in step 

(b). 



16. An isolated DNA molecule encoding a homolog of any one of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and 
Tables 2 and 3, wherein said nucleic acid molecule is produced a process 
comprising steps of: 

a) isolating mRNA, DNA, or cDNA produced from an organism; 

b) amplifying nucleic acid molecules whose nucleotide sequence is 
homologous to amplification primers derived from said fragment of said 
Streptococcus pneumoniae genome to prime said amplification; 

c) isolating said amplified sequences produced in step (b). 

J7. An isolated polypeptide encoded by any of the fragments of the 
Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and depicted in Table 2 
and 3 or by a degenerate variant of said fragments. 

18. An isolated polynucleotide molecule encoding any one of the - 
polypeptides of claim 17. 
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19. An antibody which selectively binds to any one of the polypeptides of 
claim 17. . 

130 

~20~ A~methdd~f6f^ a host cell comprising the 

steps of: 

a) incubating a host containing a heterologous nucleic acid molecule whose 
nucleotide sequence consists of any one of the fragments of the Streptococcus 

135 pneumoniae genome of SEQ ID NOS:l-391 and depicted in Tables 2 and 3, under 

conditions where said heterologous nucleic acid molecule is expressed to produce 
said protein, and 

b) isolating said protein. 
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(57) Abstract 



The present invention provides polynucleotide sequences of the genome of Streptococcus pneumoniae, polypeptide sequences encoded 
by the polynucleotide sequences, corresponding polynucleotides and polypeptides, vectors and hosts comprising the polynucleotides, and 
assays and other uses thereof. The present invention further provides polynucleotide and polypeptide sequence information stored on 
computer readable media, and computer-based systems and methods which facilitate its use. 
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comparing a target sequence to the nucleotide sequence of 
the data storage means of step (a) to identify homologous 
sequence(s), and c) retrieval means for obtaining said 
homologous sequence(s) of step (b); a method for identifying 
cornnercially important nucleic acid fragments of the 
Streptococcus pneumoniae genome comprising the step of 
comparing a database comprising said nucleotide sequence(s) 
with a target sequence to obtain a nucleic acid molecule 
comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence is not 
randomly selected; a method for identifying an expression 
modulating fragments of the Streptococcus pneumoniae genome 
comprising the step of comparing a database comprising said 
nucleotide sequence(s) with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary 
nucleotide sequence to said target sequence, wherein said 
target sequence comprises sequences known to regulate gene 
expression; 



2. Claims: (8-20) partially 

An isolated protein-encoded nucleic acid fragment of the 
Streptococcus pneumoniae genome, wherein said fragment 
consists of the nucleotide sequence of the fragment of 
SEQ ID no.l depicted in Tables 2 and 3, or a degenerate 
variant thereof; a vector comprising the fragment of the 
Streptococcus pneumoniae genome SEQ ID no.l; an isolated 
fragment of the Streptococcus pneumoniae genome, wherein 
said fragment modulates the expression of an operably linked 
open reading frame, wherein said fragment consists of the 
nucleotide sequence from about 10 to 200 bases in length 
which is 5' to any one of the open reading frame of SEQ ID 
no.l depicted in Tables 2 and 3 or a degenerate variant 
thereof; a method for regulating the expression of a nucleic 
acid molecule comprising the step of covalently attaching to 
said nucleic acid molecule a nucleic acid molecule 
consisting of the nucleotide sequence from about 10 to 100 
bases 5 1 to any one of the open reading frame of SEQ ID 
no.l and Tables 2 and 3 or a degenerate variant thereof; an 
isolated nucleic acid molecule encoding a homolog of SEQ 
ID no.l; an isolated oolypeptide encoded by SEQ ID no.l and 
depicted-.in Table 2 and 3; an antibody which selectively 
binds to any one of said polypeptides, a method for 
producing a polypeptide* in a host cell comprising a) 
incubating a host containing a heterologous nucleic acid 
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molecule whose nucleotide sequence consists of SEQ ID no.l 
and depicted in Table 2 and 3, under conditions where said 
heterologous nucleic acid molecule is expressed to produce 
said protein, and b) isolating said protein; 



3-392. Claims: (8-20) partially 



ted to eacn ot the" sequences -of 



Idem as subject 2 but limi 
SEQ ID no. 2 to 391; 

For the sake of conciseness, the second subject matter is 
explicitly defined, the other subject matters are defined by 
analogy hereto. 
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